Enhanced Pedestrian Detection and Tracking Using Multi-Person Pose Extraction and Deep Convolutional LSTM Network

A. Tashk*, M. A. Alavianmehr*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

The paper addresses a precise pedestrian detection method with high localization accuracy for real-world applications. Due to the inherent flexibility of the human body, it's difficult to create a template-based pedestrian detector that simultaneously attains high detection rates and acceptable localization accuracy. To overcome this, we introduce a two-stage model. In the first stage, we employ a novel detection method to identify pedestrians. Simultaneously, they extract key points from the detected pedestrians' bodies. In the second stage, these extracted body key points are treated as feature vectors for each pedestrian in every frame of a video sequence. These feature vectors are fed into a series of 2D LSTM blocks, allowing for pedestrian tracking based on key points. Additionally, a 3D LSTM block is employed to aggregate temporal data, aiding in trajectory prediction. In the final step of the second stage, trajectory predictions are refined using Kalman filtering. We benchmark our method against similar approaches like Track R-CNN and YOLOv7 on both pixel-wise and region-wise metrics. Results reveal impressive performance, boasting an MOTP score of 0.803 and a MOTA score of 0.603. These outcomes underline the efficacy of our proposed method in achieving robust localization accuracy for pedestrian detection in practical scenarios.

Original languageEnglish
Title of host publicationProceedings - IEEE Congress on Cybermatics : 2024 IEEE International Conferences on Internet of Things, iThings 2024, IEEE Green Computing and Communications, GreenCom 2024, IEEE Cyber, Physical and Social Computing, CPSCom 2024, IEEE Smart Data, SmartData 2024
Number of pages6
PublisherInstitute of Electrical and Electronics Engineers Inc.
Publication date2024
Pages386-391
ISBN (Electronic)979-8-3503-5163-7
DOIs
Publication statusPublished - 2024
EventIEEE Congress on Cybermatics: 17th IEEE International Conference on Internet of Things, iThings 2024, 20th IEEE International Conference on Green Computing and Communications, GreenCom 2024, 17th IEEE International Conference on Cyber, Physical and Social Computing, CPSCom 2024, 10th IEEE International Conference on Smart Data, SmartData 2024 - Copenhagen, Denmark
Duration: 19 Aug 202422 Aug 2024

Conference

ConferenceIEEE Congress on Cybermatics: 17th IEEE International Conference on Internet of Things, iThings 2024, 20th IEEE International Conference on Green Computing and Communications, GreenCom 2024, 17th IEEE International Conference on Cyber, Physical and Social Computing, CPSCom 2024, 10th IEEE International Conference on Smart Data, SmartData 2024
Country/TerritoryDenmark
CityCopenhagen
Period19/08/202422/08/2024
SponsorIEEE, IEEE Computational Intelligence Society Technical Committee on Smart World, IEEE Computer Society, IEEE Hyper Intelligence Technical Committee (HI-TC), IEEE Systems, Man, and Cybernetics (SMC) Society Technical Committee on CyberMatics, IEEE Technical Committee on Scalable Computing (TCSC)
SeriesProceedings - IEEE Congress on Cybermatics: 2024 IEEE International Conferences on Internet of Things, iThings 2024, IEEE Green Computing and Communications, GreenCom 2024, IEEE Cyber, Physical and Social Computing, CPSCom 2024, IEEE Smart Data, SmartData 2024

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • and Pedestrian Tracking
  • Kalman Filtering
  • Long Short-Term Memory (LSTM) Network
  • Pedestrian Detection
  • Pose Extraction

Cite this