Datasets label distribution map.
<div><p>In UAV aerial photography scenarios, targets exhibit characteristics such as multi-scale distribution, a high proportion of small targets, complex occlusions, and strong background interference. These characteristics impose high demands on detection algorithms in terms of fine-gr...
Saved in:
| Main Author: | |
|---|---|
| Other Authors: | , , , |
| Published: |
2025
|
| Subjects: | |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | <div><p>In UAV aerial photography scenarios, targets exhibit characteristics such as multi-scale distribution, a high proportion of small targets, complex occlusions, and strong background interference. These characteristics impose high demands on detection algorithms in terms of fine-grained feature extraction, cross-scale fusion capability, and occlusion resistance.The YOLOv11s model has significant limitations in practical applications: its feature extraction module has a single semantic representation, the traditional feature pyramid network has limited capability to detect multi-scale targets, and it lacks an effective feature compensation mechanism when targets are occluded.To address these issues, we propose a UAV aerial small target detection algorithm named UAS-YOLO (Universal Inverted Bottleneck with Adaptive BiFPN and Separated and Enhancement Attention module YOLO), which incorporates three key optimizations. First, an Adaptive Bidirectional Feature Pyramid Network (ABiFPN) is designed as the Neck structure. Through cross-scale connections and dynamic weighted fusion, ABiFPN adjusts weight allocation based on target scale characteristics, focusing on enhancing feature integration for scales related to small targets and improving multi-scale feature representation capability. Second, a Separated and Enhancement Attention Module (SEAM) is introduced to replace the original SPPF module. This module focuses on key target regions, enhances effective feature responses in unoccluded areas, and specifically compensates for information loss in occluded regions, thereby improving the detection stability of occluded small targets. Third, a Universal Inverted Bottleneck (UIB) structure is proposed, which is fused with the C3K2 module to form the C3K2_UIB module. By leveraging dynamic channel attention and spatial feature recalibration, C3K2_UIB suppresses background noise; although this increases parameters by 34%, it achieves improved detection accuracy through efficient feature selection, striking a balance between accuracy and complexity.Experimental results show that on the VisDrone2019 dataset and the TinyPerson dataset from Kaggle, the mean Average Precision (mAP) of the algorithm is increased by 4.9 and 2.1 percentage points, respectively. Moreover, it demonstrates greater advantages compared to existing advanced algorithms, effectively addressing the challenge of small target detection in complex UAV scenarios.</p></div> |
|---|