Automotive Engineering
WANG Chang, LI Zhao, ZHAO Xia, SUN Qin-yu, FU Rui, GUO Ying-shi, YUAN Wei
Driver state monitoring technology, as a key means for improving vehicle intelligence and safety, aims to accurately identify and deeply understand the driver's actions, emotions, and attention states. Although significant progress has been made in this field, a systematic summary of the principles of deep learning algorithms is lacking. In view of this, this paper systematically reviews driver state monitoring algorithms based on images and deep learning to meet the needs of the continuous development of intelligent vehicle technology. First, the methodology in the literature is elaborated upon. The existing publicly available datasets are then organized and described. Subsequently, in-depth exploration is conducted from the aspects of data selection and processing, model architecture, model training and evaluation, and optimization. Finally, the shortcomings of the current research are summarized, and the main future development directions are outlined. The results show that: ① the research on driver state monitoring based on image and deep learning has progressed to a certain depth; ② data selection and processing techniques show diversity; ③ model architectures continue to evolve in the direction of multi-modal, multitasking, lightweight, and high robustness, gradually beginning to adopt training strategies for incomplete supervision and multi-objective optimization. However, most research methods lack systematic testing of actual driving scenarios neither fully considering the behavioral characteristics of drivers under natural driving conditions nor the changes in the human-computer interaction patterns of intelligent vehicles, making it difficult to construct an all-around monitoring function for various driving scenarios and driver personalities. The further development of driver state monitoring algorithms is mainly limited by two factors. First, the current deep learning methods still have deficiencies in their domain adaptation, interpretability, and operational efficiency. Second, large-scale high-quality datasets under natural driving environments are lacking. This review is dedicated to providing effective guidance and important references for further development of high cognitive driver state monitoring systems.