UAV-based vehicle detection and tracking in urban environments using multi-task CNN and deep reinforcement learning

  • Park, Chae-won
  • Lim, Ji-hye
  • Lee, Seungjun
  • Nam, Keum Seong
  • Yang, Qin
  • ... Yoo, Sangjo
Citations

WEB OF SCIENCE

2
Citations

SCOPUS

3

초록

This paper presents a real-time vehicle detection and tracking system using an unmanned aerial vehicle (UAV) to address challenges in dynamic urban environments. The system combines a convolutional neural network (CNN) for vehicle detection with a deep Q-network (DQN)-based navigation policy for continuous tracking. Input images are enhanced using contrast limited adaptive histogram equalization (CLAHE) and unsharp masking. The CNN jointly predicts vehicle center coordinates and probabilistic heatmaps, while a self-attention module captures long-range spatial dependencies to improve detection under clutter and occlusion. The DQN is trained on multi-step spatiotemporal states to learn optimal UAV movement strategies under diverse weather and structural conditions. Experiments conducted in a three-dimensional (3D) urban simulation environment using Unity’s machine learning agents (ML-Agents) show that the self-attention design reduced pixel-level localization error by about 7 %, and the DQN-based tracking policy achieved stable convergence after approximately 2000–3000 episodes. These results demonstrate high tracking accuracy and system stability, highlighting the potential of the proposed approach for real-world UAV-based traffic monitoring applications. 2018 The Korean Institute of Communications and Information Sciences. Publishing Services by Elsevier B.V. This is an open access article under the CC BY-NCND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). © © 2025. Published by Elsevier B.V.

키워드

Convolutional neural networks (CNN)Deep reinforcement learning (DRL)Real-time detectionSelf-attentionUAV trackingUnity ML-AgentsVehicle detection and tracking
제목
UAV-based vehicle detection and tracking in urban environments using multi-task CNN and deep reinforcement learning
저자
Park, Chae-wonLim, Ji-hyeLee, SeungjunNam, Keum SeongYang, QinYoo, Sangjo
DOI
10.1016/j.icte.2025.09.016
발행일
2025-12
유형
Article
저널명
ICT Express
11
6
페이지
1173 ~ 1180