YOLOv11-ASM: Enhancing Real-Time Object Detection for Autonomous Driving

Authors

  • Yi Wu
  • Yuhang Jin

DOI:

https://doi.org/10.54097/cx9bbk52

Keywords:

Automatic Driving, Real-Time Monitoring Of Targets, YOLOv11, ATFL Loss Function, MLCA Mechanism.

Abstract

This paper presents YOLOv11-ASM, a lightweight object detection model designed for autonomous driving, aiming to enhance detection accuracy and robustness in complex environments. The model integrates a Spatial-Aware Feature Modulation Network (SAFMN) to dynamically adjust feature representations via spatial context, improving adaptability. A Multi-Level Cross Attention (MLCA) mechanism is introduced to effectively fuse multi-scale features, while an Adaptive Task-Focused Loss (ATFL) function is designed to optimize regression performance, particularly benefiting small object detection and mitigating class imbalance. Experimental evaluations on the COCO128 dataset demonstrate that YOLOv11-ASM achieves superior performance compared to baseline models, with a precision of 0.939, recall of 0.783, and mAP@50 and mAP@50:95 reaching 0.919 and 0.789, respectively. The proposed model improves mAP@50 by 3.5%–5.9% over existing approaches, showcasing its strong generalization and real-time detection capabilities. These results highlight the model's potential for deployment in real-world autonomous systems requiring accurate and efficient object detection.

Downloads

Download data is not yet available.

References

[1] Hemmati M, Biglari-Abhari M, Niar S. Adaptive real-time object detection for autonomous driving systems [J]. Journal of Imaging, 2022, 8(4): 106.

[2] Liu X Y, Chu W F, Hu Y H, et al. enhancing intelligent road target monitoring: a novel BGS-YOLO approach based on the YOLOv8 algorithm [J]. IEEE Open Journal of Intelligent Transportation Systems, 2024, 5: 509-519.

[3] Wang L, Jiang F F, Zhu F Y, et al. Enhanced multi-Target detection in complex traffic using an improved YOLOv8 with SE attention, DCN_C2f, and SIoU[J]. World Electric Vehicle Journal, 2024, 15(12): 586.

[4] Girshick R, Donahue J, Darrell T, et al. Region-based convolutional networks for accurate object detection and segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 38(1): 142-158.

[5] Li J N, Liang X D, Shen S M, et al. Scale-aware fast R-CNN for pedestrian detection [J]. IEEE Transactions on Multimedia, 2018, 20(4): 985-996.

[6] Xiao Y, Wang X Q, Zhang P, et al. Object detection based on Faster R-CNN algorithm with skip pooling and fusion of contextual information[J]. Sensors, 2020, 20(19): 5490.

[7] Meftah L H, Abukhodair F, Cherif A, et al. Real-time object detection in autonomous vehicles with YOLO[J]. Procedia Computer Science, 2024, 246: 2792-2801.

[8] Hoffmann J E, Tosso H G, Justo J F, et al. Real-Time adaptive object detection and tracking for autonomous vehicles[J]. IEEE Transactions on Intelligent Vehicles, 2021, 6(3): 450 - 459.

[9] Mahaur B, Mishra K.K. Small-object detection based on YOLOv5 in autonomous driving systems [J]. Pattern Recognition Letters, 2023, 168: 155-122.

[10] Yang M, Fan X Y. YOLOv8-Lite: a lightweight object detection model for real-time autonomous driving systems [J]. IECE Transactions on Emerging Topics in Artificial Intelligence, 2024, 1(1): 1-16.

[11] Geetha A S, Hussain M, Allen P, et al. Comparative analysis of YOLOv8 and YOLOv10 in vehicle detection: performance metrics and model efficacy[J]. Vehicles, 2024, 6(3): 1364-1382.

[12] Rabinandan K. Performance benchmarking of YOLOv11 variants for real-time delivery vehicle detection: a study on accuracy, speed, and computational trade-offs [J]. Asian Journal Of Researchin Computer Science, 2024, 17(12): 108-122.

[13] Ruan J G, Cui H H, Huang Y H, et al. A review of occluded objects detection in real complex scenarios for autonomous driving [J]. Green Energy and Intelligent Transportation, 2023, 2(3): 100092.

[14] Sun L, Dong J X, Tang J H, et al. Spatially-adaptive feature modulation for efffcient image super-resolution[C].Proceedings Of the IEEE/CVF International Conference on Computer Vision, 2023: 13190-13199.

[15] Wan D H, Lu R S, Shen S Y, et al. Mixed local channel attention for object detection[J]. Engineering Applications of Artificial Intelligence, 2023, 123: 106442.

[16] Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(2): 318-327.

[17] Tahir A, Khalid S K, Fadzil L M. Child detection model using YOLOv5 [J]. Journal of Soft Computing and Data Mining, 2023, 4(1): 72-81.

[18] Kwon H, Choi S, Woo W, et al. evaluating segmentation-based deep learning models for real-time electric vehicle fire detection [J]. Fire, 2025, 8(2): 66.

Downloads

Published

02-07-2025

How to Cite

Wu, Y., & Jin, Y. (2025). YOLOv11-ASM: Enhancing Real-Time Object Detection for Autonomous Driving. Highlights in Science, Engineering and Technology, 146, 42-52. https://doi.org/10.54097/cx9bbk52