Drone-Aided Plants Health Monitoring Using Enhanced Vision Transformer
<p dir="ltr">Precision agriculture demands robust yet efficient models capable of operating on resource-constrained edge devices for real-time plant health monitoring. Existing Vision Transformer (ViT) models often underperform in data-scarce agricultural settings due to their relian...
Saved in:
| Main Author: | |
|---|---|
| Other Authors: | , |
| Published: |
2025
|
| Subjects: | |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | <p dir="ltr">Precision agriculture demands robust yet efficient models capable of operating on resource-constrained edge devices for real-time plant health monitoring. Existing Vision Transformer (ViT) models often underperform in data-scarce agricultural settings due to their reliance on large-scale pretraining and limited local feature extraction capabilities. In this study, we propose a enhanced lightweight ViT architecture (EViT) for edge devices used in plants disease classification. The core of our approach is a domain-optimized convolutional stem (ConvStem) architecture that replaces the standard patch embedding with a convolutional stem to enhance local feature extraction, particularly effective in data-scarce scenarios. Unlike ViT architectures, which rely on large-scale pretraining, the proposed ConvStem-ViT (i.e., EViT) achieves a notable improvement in accuracy and generalization on two benchmark datasets - PlantVillage (a widely used dataset of leaf disease images) and CCMT (a more challenging, real-world crop disease dataset). Comprehensive experiments, including ablation studies using model variant analysis, supported with attention map visualizations, confirm the superiority of the proposed model over standard ViTs and contemporary CNN-based baselines. Our EViT achieves over 94.8% accuracy on PlantVillage and 78.0% on CCMT, outperforming conventional ViTs by up to 13.6%. Computational analysis shows the model achieves these gains with minimal overhead (5.55 ms/image latency, 1.30 GFLOPs); these efficiency metrics enable real-time inference on UAVs or low-power edge devices, making the model suitable for practical drone-based scouting in agricultural fields. This work offers a viable framework for scalable, accurate, and efficient plant disease classification, making it an effective choice for resource-constrained edge devices such as drones for plant disease classification.</p><h2 dir="ltr">Other Information</h2><p dir="ltr">Published in: IEEE Access<br>License: <a href="https://creativecommons.org/licenses/by/4.0/deed.en" target="_blank">https://creativecommons.org/licenses/by/4.0/</a><br>See article on publisher's website: <a href="https://dx.doi.org/10.1109/access.2025.3632545" target="_blank">https://dx.doi.org/10.1109/access.2025.3632545</a></p> |
|---|