Multi-directional Vehicle Detection in Aerial Images Based on Anchor-free Oriented Bounding Box

Journal of Transportation Systems Engineering and Information Technology, 2025, Vol. 25, Issue 4: 104-115. DOI: 10.16097/j.cnki.1009-6744.2025.04.011

Section: Intelligent Transportation Systems and Information Technology

WANG Weifeng* (王维锋), HUANG Jianxin (黄建鑫), WANG Xiaoquan (王晓全), WU Xinhan (吴昕韩), BIAN Zixin (卞子馨)

College of Civil and Transportation Engineering, Hohai University, Nanjing 210098, China

Received: 2025-03-07  Revised: 2025-05-05  Accepted: 2025-06-03  Online: 2025-08-25  Published: 2025-08-25
About the author: WANG Weifeng (born 1979), male, from Jingshan, Hubei; professor, Ph.D.
Supported by: Fundamental Research Funds for the Central Universities of Ministry of Education of China (B240201168); Transportation Science and Technology Projects of Jiangsu Province (2024Y19).

Abstract

Abstract: Aerial images of traffic scenes are characterized by complex backgrounds, an uneven distribution of vehicle aspect ratios, and constantly changing vehicle heading angles, which often lead to missed or false detections. To address this, this paper improves the YOLOv8-OBB (You Only Look Once version 8-Oriented Bounding Box) network and proposes a multi-directional vehicle detection method for aerial images. First, a Large Selective Kernel Attention Mechanism (LSKAM) is introduced into the neck of the network to strengthen feature extraction for vehicles with different aspect ratios. Second, to better separate targets from the background, a deep feature extraction module of dimension 10×10 is added to the Path Aggregation Network (PANet) in the head. Finally, a lightweight VoV-GSCSP (VoVNet GSConv Cross Stage Partial) module is embedded in the neck to balance detection accuracy and speed. Experiments on the large-scale Drone-Vehicle dataset show that, compared with typical detection methods such as Oriented-R-CNN (Oriented-Regions with Convolutional Neural Networks), R-YOLOv3-tiny, YOLOv6-OBB, YOLOv8-OBB, and YOLOv12-OBB, the proposed method achieves higher detection accuracy at lower computational complexity. Specifically, the detection accuracy for the "Car" and "Bus" categories exceeds 95%, the mean average precision (mAP) over all vehicle categories is 73.7%, and the computational complexity is 26.9 GFLOPs (giga floating-point operations). In addition, validation on data collected in the field by drones indicates that the proposed method effectively reduces missed and false detections and meets the requirements of multi-directional vehicle detection from an aerial perspective.

Key words: intelligent transportation, vehicle detection, YOLOv8-OBB, aerial image, attention mechanism
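For readers unfamiliar with oriented bounding boxes, the sketch below illustrates how a rotated box can follow a vehicle's heading angle. It assumes a common five-parameter form (center `cx`, `cy`, width `w`, height `h`, rotation `theta` in radians); the function names `obb_to_corners` and `polygon_area` are hypothetical, and this is neither the authors' implementation nor the YOLOv8-OBB code.

```python
import math


def obb_to_corners(cx, cy, w, h, theta):
    """Return the four corners of a rotated box as (x, y) tuples.

    Assumption: theta is the rotation about the box center in radians,
    applied with the standard 2D rotation matrix.
    """
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Corner offsets of the axis-aligned box, relative to its center.
    offsets = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    # Rotate each offset, then translate it to the box center.
    return [
        (cx + dx * cos_t - dy * sin_t, cy + dx * sin_t + dy * cos_t)
        for dx, dy in offsets
    ]


def polygon_area(points):
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    for i, (x1, y1) in enumerate(points):
        x2, y2 = points[(i + 1) % len(points)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2
```

Rotation changes the corner coordinates but not the enclosed area, so `polygon_area(obb_to_corners(cx, cy, w, h, theta))` stays equal to `w * h` for any `theta`. This is why a rotated box can fit an elongated vehicle tightly at any heading, whereas an axis-aligned box around the same vehicle would also enclose background.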

CLC number: U491

Cite this article: WANG Weifeng, HUANG Jianxin, WANG Xiaoquan, WU Xinhan, BIAN Zixin. Multi-directional Vehicle Detection in Aerial Images Based on Anchor-free Oriented Bounding Box[J]. Journal of Transportation Systems Engineering and Information Technology, 2025, 25(4): 104-115.

Link to this article: http://www.tseit.org.cn/CN/10.16097/j.cnki.1009-6744.2025.04.011

http://www.tseit.org.cn/CN/Y2025/V25/I4/104
