Predicting Vehicle Collision Using Transformer Network with Multi-Modal Data

Abstract

Road accidents pose a significant threat to human life, causing numerous injuries, fatalities, and economic damage worldwide. Recently, there has been growing interest in leveraging Artificial Intelligence (AI) to create systems that can predict vehicle crashes. This research focuses on vehicle collision prediction and aims to develop a solution combining pre-trained Convolutional Neural Networks (CNN) and transformer network to mitigate the occurrence of such accidents. By leveraging advanced deep learning techniques, this research addresses the limitations of traditional crash analysis methods. The Car Learning to Act (CARLA) simulator was used for data gathering, with an ego-vehicle attached with RGB and RGB-Depth cameras. Four pre-trained CNNs were used for feature extraction. With those extracted features, a transformer network was employed to train a model. After model training and testing, it was observed that the transformer model trained with VGG16-based feature extraction performs better than other methods.

Country : Sri Lanka / USA

1 Kalindu Sekarage2 Dr. A.L.A.R.R. Thanuja

  1. Department of Computational Mathematics, Faculty of Information Technology, University of Moratuwa, Sri Lanka
  2. University of North Carolina at Greensbro, USA

IRJIET, Volume 9, Issue 1, January 2025 pp. 77-82

doi.org/10.47001/IRJIET/2025.901010

References

  1. World Health Organization, Global status report on road safety 2018. Geneva: World Health Organization, 2018. Accessed: Jun. 03, 2023. [Online]. Available: https://apps.who.int/iris/handle/10665/276462
  2. L. Alzubaidi et al., “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions,” Journal of Big Data, vol. 8, no. 1, p. 53, Mar. 2021, doi: 10.1186/s40537-021-00444-8.
  3. A.Cheok and E. Zhang, From Turing to Transformers: A Comprehensive Review and Tutorial on the Evolution and Applications of Generative Transformer Models. 2023. doi: 10.32388/3NTOLQ.2.
  4. Z. Halim, R. Kalsoom, S. Bashir, and G. Abbas, “Artificial intelligence techniques for driving safety and vehicle crash prediction,” Artif. Intell. Rev., vol. 46, no. 3, pp. 351–387, Oct. 2016, doi: 10.1007/s10462-016-9467-9.
  5. A.P. Staff et al., “An Evaluation of ``Crash Prediction Networks’’ (CPN) for Autonomous Driving Scenarios in CARLA Simulator”.
  6. F.-H. Chan, Y.-T. Chen, Y. Xiang, and M. Sun, “Anticipating Accidents in Dashcam Videos,” in Computer Vision –  ACCV 2016, S.-H. Lai, V. Lepetit, K. Nishino, and Y. Sato, Eds., in Lecture Notes in Computer Science. Cham: Springer International Publishing, 2017, pp. 136–153. doi: 10.1007/978-3-319-54190-7_9.
  7. J. G. Choi, C. W. Kong, G. Kim, and S. Lim, “Car crash detection using ensemble deep learning and multimodal data from dashboard cameras,” Expert Systems with Applications, vol. 183, p. 115400, Nov. 2021, doi: 10.1016/j.eswa.2021.115400.
  8. A.Vaswani et al., “Attention is All you Need”.
  9. Y. Chen, C. Dong, P. Palanisamy, P. Mudalige, K. Muelling, and J. M. Dolan, “Attention-Based Hierarchical Deep Reinforcement Learning for Lane Change Behaviors in Autonomous Driving,” in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA: IEEE, Jun. 2019, pp. 1326–1334. doi: 10.1109/CVPRW.2019.00172.
  10. M. Gao, J. Jiang, G. Zou, V. John, and Z. Liu, “RGB-D-Based Object Recognition Using Multimodal Convolutional Neural Networks: A Survey,” IEEE Access, vol. 7, pp. 43110–43136, 2019, doi: 10.1109/ACCESS.2019.2907071.