Three-dimensional atrous inception module for crowd behavior classification

  • Choi, Jong-Hyeok
  • Kim, Jeong-Hun
  • Nasridinov, Aziz
  • Kim, Yoo-Sung
Citations

WEB OF SCIENCE

2
Citations

SCOPUS

4

초록

Recent advances in deep learning have led to a surge in computer vision research, including the recognition and classification of human behavior in video data. However, most studies have focused on recognizing individual behaviors, whereas recognizing crowd behavior remains a complex problem because of the large number of interactions and similar behaviors among individuals or crowds in video surveillance systems. To solve this problem, we propose a three-dimensional atrous inception module (3D-AIM) network, which is a crowd behavior classification model that uses atrous convolution to explore interactions between individuals or crowds. The 3D-AIM network is a 3D convolutional neural network that can use receptive fields of various sizes to effectively identify specific features that determine crowd behavior. To further improve the accuracy of the 3D-AIM network, we introduced a new loss function called the separation loss function. This loss function focuses the 3D-AIM network more on the features that distinguish one type of crowd behavior from another, thereby enabling a more precise classification. Finally, we demonstrate that the proposed model outperforms existing human behavior classification models in terms of accurately classifying crowd behaviors. These results suggest that the 3D-AIM network with a separation loss function can be valuable for understanding complex crowd behavior in video surveillance systems.

제목
Three-dimensional atrous inception module for crowd behavior classification
저자
Choi, Jong-HyeokKim, Jeong-HunNasridinov, AzizKim, Yoo-Sung
DOI
10.1038/s41598-024-65003-6
발행일
2024-06-22
유형
Article
저널명
Scientific Reports
14
1