Yıl: 2024 Cilt: 32 Sayı: 1 Sayfa Aralığı: 144 - 165 Metin Dili: İngilizce DOI: 10.55730/1300-0632.4060 İndeks Tarihi: 14-03-2024

Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model

Öz:
The utilization of remote sensing products for vehicle detection through deep learning has gained immense popularity, especially due to the advancement of unmanned aerial vehicles (UAVs). UAVs offer millimeter-level spa- tial resolution at low flight altitudes, which surpasses traditional airborne platforms. Detecting vehicles from very high-resolution UAV data is crucial in numerous applications, including parking lot and highway management, traffic monitoring, search and rescue missions, and military operations. Obtaining UAV data at desired periods allows the detection and tracking of target objects even several times during a day. Despite challenges such as diverse vehicle char- acteristics, traffic congestion, and hardware limitations, the detection task must be executed swiftly and accurately. This study successfully achieved automated detection and instance segmentation of parked and moving vehicles across a large university campus by employing the robust learning capabilities of the You Only Look Once version 7 (YOLOv7) deep learning algorithm. The generation of an ultrahigh-resolution orthomosaic of the university campus was accomplished through photogrammetric processing, employing 20-megapixel aerial images obtained from RGB UAV flights with polyg- onal nadir-view and bundle-grid oblique-view imaging geometries. The vehicle dataset was created by cropping image patches containing vehicle objects from the orthomosaic and manually labeling the boundaries of the vehicle targets using the LabelMe annotation tool. After expanding the dataset by applying data augmentation, the YOLOv7 algorithm was trained and tested using the transfer learning approach. The accuracy metric of precision, recall, and mAP@0.50 scores for the bounding boxes and masks of vehicles were estimated as 99.79, 97.54, and 99.46%, respectively. In addition, the robustness of the trained algorithm was also tested on a short video and ( > 80%) prediction scores were achieved.
Anahtar Kelime: Vehicle detection instance segmentation UAV orthomosaic deep learning YOLOv7

Belge Türü: Makale Makale Türü: Araştırma Makalesi Erişim Türü: Erişime Açık
  • [1] Kainz O, Dopiriak M, Michalko M, Jakab F, Nováková I. Traffic monitoring from the perspective of an unmanned aerial vehicle. Applied Sciences 2022; 12 (16): 7966. doi: 10.3390/app12167966
  • [2] Ding J, Zhang J, Zhan Z, Tang X, Wang X. A precision efficient method for collapsed building detection in post- earthquake UAV images based on the improved NMS algorithm and Faster R-CNN. Remote Sensing 2022; 14 (3): 663. doi: 10.3390/rs14030663
  • [3] Zhang X, Han L, Dong Y, Shi Y, Huang W et al. A deep learning-based approach for automated yellow rust disease detection from high-resolution hyperspectral UAV images. Remote Sensing 2019; 11 (13): 1554. doi: 10.3390/rs11131554
  • [4] Kose O, Oktay T. Simultaneous design of morphing hexarotor and autopilot system by using deep neural network and SPSA. Aircraft Engineering and Aerospace Technology 2023; 95 (6): 939-949. doi: 10.1108/AEAT-07-2022-0178
  • [5] Şahin H, Kose O, Oktay T. Simultaneous autonomous system and powerplant design for morphing quadrotors. Aircraft Engineering and Aerospace Technology 2022; 94 (8): 1228-1241. doi: 10.1108/AEAT-06-2021-0180
  • [6] Kose O, Oktay T. Simultaneous quadrotor autopilot system and collective morphing system design. Aircraft Engineering and Aerospace Technology 2020; 92 (7): 1093-1100. doi: 10.1108/AEAT-01-2020-0026
  • [7] Ammar A, Koubaa A, Ahmed M, Saad A, Benjdira B. Vehicle detection from aerial images using deep learning: A comparative study. Electronics 2021; 10 (7): 820. doi: 10.3390/electronics10070820
  • [8] Bouguettaya A, Zarzour H, Kechida A, Taberkit AM. Vehicle detection from UAV imagery with deep learn- ing: A review. IEEE Transactions on Neural Networks and Learning Systems 2021; 33 (11): 6047-6067. doi: 10.1109/tnnls.2021.3080276
  • [9] Srivastava S, Narayan S, Mittal S. A survey of deep learning techniques for vehicle detection from UAV images. Journal of Systems Architecture 2021; 117: 102152. doi: 10.1016/j.sysarc.2021.102152
  • [10] Wang X. Vehicle image detection method using deep learning in UAV video. Computational Intelligence and Neuroscience 2022; 8202535. doi: 10.1155/2022/8202535
  • [11] Dalal N, Triggs B. Histograms of oriented gradients for human detection. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05); San Diego, CA, USA; 2005. pp. 886-893. doi: 10.1109/CVPR.2005.177
  • [12] Viola P, Jones M. Rapid object detection using a boosted cascade of simple features. In: 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’01); Kauai, HI, USA; 2001. pp. 511-518. doi: 10.1109/CVPR.2001.990517
  • [13] Fei-Fei L, Perona P. A Bayesian hierarchical model for learning natural scene categories. In: 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05); San Diego, CA, USA; 2005. pp. 524-531. doi: 10.1109/CVPR.2005.16
  • [14] Mountrakis G, Im J, Ogole C. Support vector machines in remote sensing: A review. ISPRS Journal of Photogram- metry and Remote Sensing 2011; 66 (3): 247-259. doi: 10.1016/j.isprsjprs.2010.11.001
  • [15] Freund Y, Schapire RE. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 1997; 55 (1): 119-139. doi: 10.1006/jcss.1997.1504
  • [16] Peterson LE. K-nearest neighbor. Scholarpedia 2009; 4(2): 1883. doi: 10.4249/scholarpedia.1883
  • [17] Kavzoglu T, Erdemir MY, Tonbul H. Classification of semiurban landscapes from very high-resolution satellite images using a regionalized multiscale segmentation approach. Journal of Applied Remote Sensing 2017; 11 (3): 035016-035016. doi: 10.1117/1.JRS.11.035016
  • [18] Kavzoglu T, and Tonbul H. A comparative study of segmentation quality for multi-resolution segmentation and watershed transform. In: 8th International Conference on Recent Advances in Space Technologies (RAST); İstanbul, Türkiye; 2017. pp. 113-117. doi: 10.1109/RAST.2017.8002984
  • [19] Yildirim E, Kavzoglu T. Ship detection in optical remote sensing images using YOLOv4 and Tiny YOLOv4. In: Ben Ahmed M, Boudhir AA, Karaș İR, Jain V, Mellouli S (eds). Innovations in Smart Cities Applications Volume 5. SCA 2021. Lecture Notes in Networks and Systems, vol 393. Springer, Cham, 2022, pp. 913-924. doi: 10.1007/978- 3-030-94191-8_74
  • [20] Coşkun D, Karaboğa D, Baştürk A, Akay B, Nalbantoğlu OU et al. A comparative study of YOLO models and a transformer-based YOLOv5 model for mass detection in mammograms. Turkish Journal of Electrical Engineering and Computer Sciences 2023; 31 (7): 1294-1313. doi: 10.55730/1300-0632.4048
  • [21] Yildirim E, Kavzoglu T. Detection of collapsed buildings from post-earthquake imagery using Mask Region-Based Convolutional Neural Network. In: 7th Intercontinental Geoinformation Days (IGD); Peshawar, Pakistan; 2023. pp. 119-122.
  • [22] Kavzoğlu T, Yılmaz EÖ. Analysis of patch and sample size effects for 2D-3D CNN models using multiplatform dataset: hyperspectral image classification of ROSIS and Jilin-1 GP01 imagery. Turkish Journal of Electrical Engineering and Computer Sciences 2022; 30 (6): 2124-2144. doi: 10.55730/1300-0632.3929
  • [23] Shabaz M, Soni M. Cognitive digital modelling for hyperspectral image classification using transfer learning model. Turkish Journal of Electrical Engineering and Computer Sciences 2023; 31(6): 1039-1060. doi: 10.55730/1300- 0632.4033
  • [24] Chen G, Chen Q, Long S, Zhu W, Yuan Z et al. Quantum Convolutional Neural Network for image classification. Pattern Analysis and Applications 2023; 26 (2): 655-667. doi: 10.1007/s10044-022-01113-z
  • [25] Gordo A, Almazán J, Revaud J, Larlus D. Deep image retrieval: Learning global representations for image search. In: 14th European Conference on Computer Vision (ECCV’16); Amsterdam, The Netherlands; 2016: 241-257.
  • [26] Öztürk Ş, Alhudhaif A, Polat K. Attention-based end-to-end CNN framework for content-based X-ray image re- trieval. Turkish Journal of Electrical Engineering and Computer Sciences 2021; 29 (8): 2680-2693. doi: 10.3906/elk- 2105-242
  • [27] Badrinarayanan V, Kendall A, Cipolla R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 2017; 39 (12): 2481-2495. doi: 10.1109/TPAMI.2016.2644615
  • [28] Bagheri F, Tarokh MJ, Ziaratban M. Skin lesion segmentation by using object detection networks, DeepLab3+, and active contours. Turkish Journal of Electrical Engineering and Computer Sciences 2022; 30 (7): 2489-2507. doi: 10.55730/1300-0632.3951
  • [29] Yildirim E, Nazar M, Sefercik UG, Kavzoglu T. Stone Pine (Pinus Pinea L.) detection from high-resolution UAV Imagery using deep learning model. In: IEEE International Geoscience and Remote Sensing Symposium (IGARSS’22); Kuala Lumpur, Malaysia; 2022. pp. 441-444. doi: 10.1109/IGARSS46834.2022.9883964
  • [30] Girshick R, Donahue J, Darrell T, Malik J. Rich feature hierarchies for accurate object detection and semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR); Columbus, OH, USA; 2014. pp. 580-587. doi: 10.1109/CVPR.2014.81
  • [31] Girshick R. Fast R-CNN. In: IEEE International Conference on Computer Vision (ICCV); Washington, DC, USA; 2015. pp. 1440-1448. doi: 10.1109/ICCV.2015.169
  • [32] Ren S, He K, Girshick R, Sun J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence 2017; 39 (6): 1137-1149. doi: 10.1109/TPAMI.2016.2577031
  • [33] He K, Zhang X, Ren S, Sun J. Spatial pyramid pooling in deep convolutional networks for visual recog- nition. IEEE Transactions on Pattern Analysis and Machine Intelligence 2015; 37 (9): 1904-1916. doi: 10.1109/TPAMI.2015.2389824
  • [34] Dai J, Li Y, He K, Sun J. R-FCN: Object detection via region-based fully convolutional networks. In: 30th Annual Conference on Neural Information Processing Systems (NIPS’16); Barcelona, Spain; 2016. pp. 379-387.
  • [35] He K, Gkioxari G, Dollár P, Girshick R. Mask R-CNN. IEEE Transactions on Pattern Analysis and Machine Intelligence 2020; 42 (2); 386-397. doi: 10.1109/TPAMI.2018.2844175
  • [36] Redmon J, Divvala S, Girshick R, Farhadi A. You Only Look Once: Unified, real-time object detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’16); Las Vegas, NV, USA; 2016. pp. 779-788. doi: 10.1109/CVPR.2016.91
  • [37] Redmon J, Farhadi A. YOLO9000: Better, faster, stronger. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR’17); Honolulu, Hawaii; 2017. pp. 7263-7271. doi: 10.1109/CVPR.2017.690
  • [38] Redmon J, Farhadi A. YOLOv3: An incremental improvement. arXiv: 1804.02767v1, 2018. Available: https://doi.org/10.48550/arXiv.1804.02767
  • [39] Bochkovskiy A, Wang CY, Liao HYM. YOLOv4: Optimal speed and accuracy of object detection. arXiv: 2004.10934, 2020. Available: https://doi.org/10.48550/arXiv.2004.10934
  • [40] Jocher G. YOLOv5 by Ultralytics. https://github.com/ultralytics/yolov5, 2020. Accessed: June 06, 2023.
  • [41] Li C, Li L, Jiang H, Weng K, Geng Y et al. YOLOv6: A single-stage object detection framework for industrial applications. arXiv: 2209.02976, 2022. Available: https://doi.org/10.48550/arXiv.2209.02976
  • [42] Wang CY, Bochkovskiy A, Liao HYM. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arXiv: 2207.02696, 2022. Available: https://doi.org/10.48550/arXiv.2207.02696
  • [43] Liu W, Anguelov D, Erhan D, Szegedy C, Reed S et al. SSD: Single shot multibox detector. In: European Conference on Computer Vision (ECCV’16); Amsterdam, The Netherlands; 2016. pp. 21-37.
  • [44] Lin TY, Goyal P, Girshick R, He K, Dollár P. Focal loss for dense object detection. In: IEEE International Conference on Computer Vision; Venice, Italy; 2017. pp. 2980-2988. doi: 10.1109/ICCV.2017.324
  • [45] Li S, Yang X, Lin X, Zhang Y, Wu J. Real-time vehicle detection from UAV aerial images based on improved YOLOv5. Sensors 2023; 23 (12): 5634. doi: 10.3390/s23125634
  • [46] Xie X, Yang W, Cao G, Yang J, Zhao Z et al. Real-time vehicle detection from UAV imagery. In: 2018 IEEE Fourth International Conference on Multimedia Big Data (BigMM); Xi’an, China; 2018. pp. 1-5. doi: 10.1109/BigMM.2018.8499466
  • [47] Chen Z, Cao L, Wang Q. YOLOv5-based vehicle detection method for high-resolution UAV images. Mobile Infor- mation Systems 2022; 2022: 1828848. doi: 10.1155/2022/1828848
  • [48] Ammar A, Koubaa A, Ahmed M, Saad A, Benjdira B. Vehicle detection from aerial images using deep learning: A comparative study. Electronics 2021; 10 (7): 820. doi: 10.3390/electronics10070820
  • [49] Amato G, Ciampi L, Falchi F, Gennaro C. Counting vehicles with deep learning in onboard UAV imagery. In: 2019 IEEE Symposium on Computers and Communications (ISCC); Barcelona, Spain; 2019. pp. 1-6. doi: 10.1109/ISCC47284.2019.8969620
  • [50] Benjdira B, Khursheed T, Koubaa A, Ammar A, Ouni K. Car detection using Unmanned Aerial Vehicles: Compar- ison between Faster R-CNN and YOLOv3. In: 2019 1st International Conference on Unmanned Vehicle Systems- Oman (UVS); Muscat, Oman; 2019. pp. 1-6. doi: 10.1109/UVS.2019.8658300
  • [51] Chen S, Laefer DF, Mangina E. State of technology review of Civilian UAVs. Recent Patents Engineering 2016; 10 (3): 160-174. doi: 10.2174/1872212110666160712230039
  • [52] Udeanu G, Dobrescu A, Oltean M. Unmanned aerial vehicle in military operations. Scientific Research and Education in the Air Force 2016; 18 (1): 199-206. doi: 10.19062/2247-3173.2016.18.1.26
  • [53] Kretov A, Glukhov V, Tikhonov A. Conceptual assessment of the possibility of using cryogenic fuel on unmanned aerial vehicles. Drones 2022; 6 (8): 217. doi: 10.3390/drones6080217
  • [54] Westoby MJ, Brasington J, Glasser NF, Hambrey MJ, Reynolds JM. Structure-from-Motion’photogrammetry: A low-cost, effective tool for geoscience applications. Geomorphology 2022; 179: 300-314. doi: 10.1016/j.geomorph.2012.08.021
  • [55] Sefercik UG, Kavzoglu T, Nazar M, Atalay C, Madak M. Creation of a virtual tour .exe utilizing very high- resolution RGB UAV data. International Journal of Environment and Geoinformatics 2022; 9 (4): 151-160. doi: 10.30897/ijegeo.1102575
  • [56] Baltsavias E. A comparison between photogrammetry and laser scanning. ISPRS Journal of Photogrammetry and Remote Sensing 1999; 54 (2-3): 83-94. doi: 10.1016/S0924-2716(99)00014-3
  • [57] Jacobsen K. Characteristics of nearly world-wide available digital height models. In: 10th Seminar on Remote Sensing and GIS Applications in Forest Engineering; Curitiba, Brazil; 2012. pp. 15-18.
  • [58] Sefercik UG, Glennie C, Singhania A, Hauser D. Area-based quality control of airborne laser scanning 3D models for different land classes using terrestrial laser scanning: sample survey in Houston, USA. International Journal of Remote Sensing 2015; 36 (23): 5916-5934. doi: 10.1080/01431161.2015.1110260
  • [59] Wada CY, Labelme: Image polygonal annotation with Python. https://github.com/wkentaro/labelme, 2023. Ac- cessed: July 07, 2023.
  • [60] Bodla N, Singh B, Chellappa R, Davis LS. Soft-NMS – Improving object detection with one line of code. In: IEEE In- ternational Conference on Computer Vision (ICCV); Venice, Italy; 2017. pp. 5561-5569. doi: 10.1109/ICCV.2017.593
  • [61] Qiu Z, Bai H, Chen T. Special vehicle detection from UAV Perspective via YOLO-GNS based deep learning network. Drones 2023; 7 (2): 117. doi: 10.3390/drones7020117
APA Yildirim E, Sefercik U, Kavzoglu T (2024). Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. , 144 - 165. 10.55730/1300-0632.4060
Chicago Yildirim Esra,Sefercik Umut Gunes,Kavzoglu Taskin Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. (2024): 144 - 165. 10.55730/1300-0632.4060
MLA Yildirim Esra,Sefercik Umut Gunes,Kavzoglu Taskin Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. , 2024, ss.144 - 165. 10.55730/1300-0632.4060
AMA Yildirim E,Sefercik U,Kavzoglu T Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. . 2024; 144 - 165. 10.55730/1300-0632.4060
Vancouver Yildirim E,Sefercik U,Kavzoglu T Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. . 2024; 144 - 165. 10.55730/1300-0632.4060
IEEE Yildirim E,Sefercik U,Kavzoglu T "Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model." , ss.144 - 165, 2024. 10.55730/1300-0632.4060
ISNAD Yildirim, Esra vd. "Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model". (2024), 144-165. https://doi.org/10.55730/1300-0632.4060
APA Yildirim E, Sefercik U, Kavzoglu T (2024). Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. Turkish Journal of Electrical Engineering and Computer Sciences, 32(1), 144 - 165. 10.55730/1300-0632.4060
Chicago Yildirim Esra,Sefercik Umut Gunes,Kavzoglu Taskin Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. Turkish Journal of Electrical Engineering and Computer Sciences 32, no.1 (2024): 144 - 165. 10.55730/1300-0632.4060
MLA Yildirim Esra,Sefercik Umut Gunes,Kavzoglu Taskin Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. Turkish Journal of Electrical Engineering and Computer Sciences, vol.32, no.1, 2024, ss.144 - 165. 10.55730/1300-0632.4060
AMA Yildirim E,Sefercik U,Kavzoglu T Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. Turkish Journal of Electrical Engineering and Computer Sciences. 2024; 32(1): 144 - 165. 10.55730/1300-0632.4060
Vancouver Yildirim E,Sefercik U,Kavzoglu T Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model. Turkish Journal of Electrical Engineering and Computer Sciences. 2024; 32(1): 144 - 165. 10.55730/1300-0632.4060
IEEE Yildirim E,Sefercik U,Kavzoglu T "Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model." Turkish Journal of Electrical Engineering and Computer Sciences, 32, ss.144 - 165, 2024. 10.55730/1300-0632.4060
ISNAD Yildirim, Esra vd. "Automated identification of vehicles in very high-resolution UAV orthomosaics using YOLOv7 deep learning model". Turkish Journal of Electrical Engineering and Computer Sciences 32/1 (2024), 144-165. https://doi.org/10.55730/1300-0632.4060