TY - GEN
T1 - Implementation of 34.7 fps Pose and Gaze Estimator for Real-Time Driver-Vehicle Interaction System
AU - Kim, Minjoon
AU - So, Jaehyuk
AU - Hwang, Taemin
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - With the recent rapid development of autonomous driving, many researches on intelligent in-vehicle interaction technologies have been studied. Real-time driver behavior analysis is a key function for various in-vehicle interaction ap-plications. An important performance indicator here is real-time quality because it is directly related to safety. Therefore, in this paper, we design the convolutional neural network (CNN) architecture suited to in-vehicle driver behavior analysis using pose and gaze. First, we define the 11 key points for driver pose and gaze, and modeled a CNN architecture that can detect them quickly. The proposed architecture was re-generated and re-trained with layer reduction for high speed based on the residual CNN model. Furthermore, the hardware implementation result based on FPGA platform to verify the real-time performance are presented. In order to implement real-time interaction, an image processing speed of more than 30 fps and a latency time of less than 100 ms are generally required. We selected and implemented the FPGA platform to meet these requirements. The designed hardware architecture was implemented at the RTL level of the VCU118 FPGA, and simulation results show 34.7 fps and 75.3 ms latency. Finally, we implemented the driver pose and gaze estimator on the FPGA based hardware platform to experiment the driver-vehicle interaction system with the demo application. The detected pose and gaze results were transmitted to the GPU board in real time, reliably supporting 30 fps, and verified application to screen control and driver monitoring applications.
AB - With the recent rapid development of autonomous driving, many researches on intelligent in-vehicle interaction technologies have been studied. Real-time driver behavior analysis is a key function for various in-vehicle interaction ap-plications. An important performance indicator here is real-time quality because it is directly related to safety. Therefore, in this paper, we design the convolutional neural network (CNN) architecture suited to in-vehicle driver behavior analysis using pose and gaze. First, we define the 11 key points for driver pose and gaze, and modeled a CNN architecture that can detect them quickly. The proposed architecture was re-generated and re-trained with layer reduction for high speed based on the residual CNN model. Furthermore, the hardware implementation result based on FPGA platform to verify the real-time performance are presented. In order to implement real-time interaction, an image processing speed of more than 30 fps and a latency time of less than 100 ms are generally required. We selected and implemented the FPGA platform to meet these requirements. The designed hardware architecture was implemented at the RTL level of the VCU118 FPGA, and simulation results show 34.7 fps and 75.3 ms latency. Finally, we implemented the driver pose and gaze estimator on the FPGA based hardware platform to experiment the driver-vehicle interaction system with the demo application. The detected pose and gaze results were transmitted to the GPU board in real time, reliably supporting 30 fps, and verified application to screen control and driver monitoring applications.
KW - Autonomous Vehicle
KW - CNN Accelerator
KW - Driver Behavior Analysis
KW - Driver-Vehicle Interaction
KW - Human Pose Estimation
UR - https://www.scopus.com/pages/publications/85169461789
U2 - 10.1007/978-3-031-36004-6_5
DO - 10.1007/978-3-031-36004-6_5
M3 - Conference contribution
AN - SCOPUS:85169461789
SN - 9783031360039
T3 - Communications in Computer and Information Science
SP - 30
EP - 35
BT - HCI International 2023 Posters - 25th International Conference on Human-Computer Interaction, HCII 2023, Proceedings
A2 - Stephanidis, Constantine
A2 - Antona, Margherita
A2 - Ntoa, Stavroula
A2 - Salvendy, Gavriel
PB - Springer Science and Business Media Deutschland GmbH
T2 - 25th International Conference on Human-Computer Interaction, HCII 2023
Y2 - 23 July 2023 through 28 July 2023
ER -