YOLOv11-ASM: Enhancing Real-Time Object Detection for Autonomous Driving
DOI:
https://doi.org/10.54097/cx9bbk52Keywords:
Automatic Driving, Real-Time Monitoring Of Targets, YOLOv11, ATFL Loss Function, MLCA Mechanism.Abstract
This paper presents YOLOv11-ASM, a lightweight object detection model designed for autonomous driving, aiming to enhance detection accuracy and robustness in complex environments. The model integrates a Spatial-Aware Feature Modulation Network (SAFMN) to dynamically adjust feature representations via spatial context, improving adaptability. A Multi-Level Cross Attention (MLCA) mechanism is introduced to effectively fuse multi-scale features, while an Adaptive Task-Focused Loss (ATFL) function is designed to optimize regression performance, particularly benefiting small object detection and mitigating class imbalance. Experimental evaluations on the COCO128 dataset demonstrate that YOLOv11-ASM achieves superior performance compared to baseline models, with a precision of 0.939, recall of 0.783, and mAP@50 and mAP@50:95 reaching 0.919 and 0.789, respectively. The proposed model improves mAP@50 by 3.5%–5.9% over existing approaches, showcasing its strong generalization and real-time detection capabilities. These results highlight the model's potential for deployment in real-world autonomous systems requiring accurate and efficient object detection.
Downloads
References
[1] Hemmati M, Biglari-Abhari M, Niar S. Adaptive real-time object detection for autonomous driving systems [J]. Journal of Imaging, 2022, 8(4): 106.
[2] Liu X Y, Chu W F, Hu Y H, et al. enhancing intelligent road target monitoring: a novel BGS-YOLO approach based on the YOLOv8 algorithm [J]. IEEE Open Journal of Intelligent Transportation Systems, 2024, 5: 509-519.
[3] Wang L, Jiang F F, Zhu F Y, et al. Enhanced multi-Target detection in complex traffic using an improved YOLOv8 with SE attention, DCN_C2f, and SIoU[J]. World Electric Vehicle Journal, 2024, 15(12): 586.
[4] Girshick R, Donahue J, Darrell T, et al. Region-based convolutional networks for accurate object detection and segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2015, 38(1): 142-158.
[5] Li J N, Liang X D, Shen S M, et al. Scale-aware fast R-CNN for pedestrian detection [J]. IEEE Transactions on Multimedia, 2018, 20(4): 985-996.
[6] Xiao Y, Wang X Q, Zhang P, et al. Object detection based on Faster R-CNN algorithm with skip pooling and fusion of contextual information[J]. Sensors, 2020, 20(19): 5490.
[7] Meftah L H, Abukhodair F, Cherif A, et al. Real-time object detection in autonomous vehicles with YOLO[J]. Procedia Computer Science, 2024, 246: 2792-2801.
[8] Hoffmann J E, Tosso H G, Justo J F, et al. Real-Time adaptive object detection and tracking for autonomous vehicles[J]. IEEE Transactions on Intelligent Vehicles, 2021, 6(3): 450 - 459.
[9] Mahaur B, Mishra K.K. Small-object detection based on YOLOv5 in autonomous driving systems [J]. Pattern Recognition Letters, 2023, 168: 155-122.
[10] Yang M, Fan X Y. YOLOv8-Lite: a lightweight object detection model for real-time autonomous driving systems [J]. IECE Transactions on Emerging Topics in Artificial Intelligence, 2024, 1(1): 1-16.
[11] Geetha A S, Hussain M, Allen P, et al. Comparative analysis of YOLOv8 and YOLOv10 in vehicle detection: performance metrics and model efficacy[J]. Vehicles, 2024, 6(3): 1364-1382.
[12] Rabinandan K. Performance benchmarking of YOLOv11 variants for real-time delivery vehicle detection: a study on accuracy, speed, and computational trade-offs [J]. Asian Journal Of Researchin Computer Science, 2024, 17(12): 108-122.
[13] Ruan J G, Cui H H, Huang Y H, et al. A review of occluded objects detection in real complex scenarios for autonomous driving [J]. Green Energy and Intelligent Transportation, 2023, 2(3): 100092.
[14] Sun L, Dong J X, Tang J H, et al. Spatially-adaptive feature modulation for efffcient image super-resolution[C].Proceedings Of the IEEE/CVF International Conference on Computer Vision, 2023: 13190-13199.
[15] Wan D H, Lu R S, Shen S Y, et al. Mixed local channel attention for object detection[J]. Engineering Applications of Artificial Intelligence, 2023, 123: 106442.
[16] Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(2): 318-327.
[17] Tahir A, Khalid S K, Fadzil L M. Child detection model using YOLOv5 [J]. Journal of Soft Computing and Data Mining, 2023, 4(1): 72-81.
[18] Kwon H, Choi S, Woo W, et al. evaluating segmentation-based deep learning models for real-time electric vehicle fire detection [J]. Fire, 2025, 8(2): 66.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Highlights in Science, Engineering and Technology

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.







