UAV-based vehicle detection and tracking in urban environments using multi-task CNN and deep reinforcement learning

Park, Chae-won; Lim, Ji-hye; Lee, Seungjun; Nam, Keum Seong; Yang, Qin; Yoo, Sangjo

doi:10.1016/j.icte.2025.09.016

Citations: Web of Science 2; Scopus 3

Abstract

This paper presents a real-time vehicle detection and tracking system using an unmanned aerial vehicle (UAV) to address challenges in dynamic urban environments. The system combines a convolutional neural network (CNN) for vehicle detection with a deep Q-network (DQN)-based navigation policy for continuous tracking. Input images are enhanced using contrast limited adaptive histogram equalization (CLAHE) and unsharp masking. The CNN jointly predicts vehicle center coordinates and probabilistic heatmaps, while a self-attention module captures long-range spatial dependencies to improve detection under clutter and occlusion. The DQN is trained on multi-step spatiotemporal states to learn optimal UAV movement strategies under diverse weather and structural conditions. Experiments conducted in a three-dimensional (3D) urban simulation environment built with Unity's Machine Learning Agents (ML-Agents) toolkit show that the self-attention design reduced pixel-level localization error by about 7%, and the DQN-based tracking policy achieved stable convergence after approximately 2000–3000 episodes. These results demonstrate high tracking accuracy and system stability, highlighting the potential of the proposed approach for real-world UAV-based traffic monitoring applications. 2018 The Korean Institute of Communications and Information Sciences. Publishing Services by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). © 2025. Published by Elsevier B.V.
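
The abstract names unsharp masking as one of the two image-enhancement steps but gives no parameters. The following is a minimal sketch of the general technique only, not the authors' implementation: it sharpens a grayscale image by adding back the difference between the image and a blurred copy. The 3x3 box blur, the `amount` parameter, the 0–255 clipping range, and the function names `box_blur` and `unsharp_mask` are all assumptions made for illustration; the CLAHE step is not shown.

```python
def box_blur(img):
    """Apply a 3x3 mean filter with edge replication.

    img is a list of rows of floats (a grayscale image).
    The box kernel is an illustrative assumption; a Gaussian
    kernel is the more common choice for unsharp masking.
    """
    h, w = len(img), len(img[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            total = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    # Clamp coordinates so border pixels reuse their nearest neighbor.
                    yy = min(max(y + dy, 0), h - 1)
                    xx = min(max(x + dx, 0), w - 1)
                    total += img[yy][xx]
            out[y][x] = total / 9.0
    return out


def unsharp_mask(img, amount=1.0):
    """Return img + amount * (img - blurred), clipped to [0, 255].

    amount is a hypothetical sharpening strength, not a value
    taken from the paper.
    """
    blurred = box_blur(img)
    return [
        [min(255.0, max(0.0, p + amount * (p - b))) for p, b in zip(row, brow)]
        for row, brow in zip(img, blurred)
    ]


if __name__ == "__main__":
    # A vertical edge between a dark region (50) and a bright region (150).
    edge = [[50.0, 50.0, 150.0, 150.0] for _ in range(3)]
    for row in unsharp_mask(edge):
        print([round(v, 2) for v in row])
```

On the edge image, pixels next to the boundary are pushed further apart (darker on the dark side, brighter on the bright side), while a uniform region is left unchanged because the image and its blur coincide there.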

Keywords

Convolutional neural networks (CNN); Deep reinforcement learning (DRL); Real-time detection; Self-attention; UAV tracking; Unity ML-Agents; Vehicle detection and tracking

Title: UAV-based vehicle detection and tracking in urban environments using multi-task CNN and deep reinforcement learning

Authors: Park, Chae-won; Lim, Ji-hye; Lee, Seungjun; Nam, Keum Seong; Yang, Qin; Yoo, Sangjo

DOI: 10.1016/j.icte.2025.09.016

Publication date: 2025-12

Type: Article

Journal: ICT Express

Volume: 11

Issue: 6

Pages: 1173–1180