Gebalanceerde Multi-Task Aandacht voor Satellietbeeldclassificatie: Een Systematische Aanpak om 97,23% Nauwkeurigheid te Bereiken op EuroSAT Zonder Voorafgaande Training

Samenvatting

Dit werk presenteert een systematisch onderzoek naar aangepaste convolutionele neurale netwerkarchitecturen voor satellietclassificatie van landgebruik, waarbij een testnauwkeurigheid van 97,23% wordt behaald op de EuroSAT-dataset zonder gebruik te maken van vooraf getrainde modellen. Door drie progressieve architectuuriteraties (baseline: 94,30%, CBAM-verbeterd: 95,98%, en gebalanceerde multi-task aandacht: 97,23%) identificeren en adresseren we specifieke faalmodi in satellietbeeldclassificatie. Onze belangrijkste bijdrage is een nieuw gebalanceerd multi-task aandachtmechanisme dat Coordinate Attention voor ruimtelijke feature-extractie combineert met Squeeze-Excitation-blokken voor spectrale feature-extractie, verenigd door een leerbare fusieparameter. Experimentele resultaten tonen aan dat deze leerbare parameter autonoom convergeert naar alpha ongeveer 0,57, wat wijst op een bijna gelijk belang van ruimtelijke en spectrale modaliteiten voor satellietbeelden. We gebruiken progressieve DropBlock-regularisatie (5-20% op basis van netwerkdiepte) en klasse-gebalanceerd verliesgewicht om overfitting en onbalans in verwarringspatronen aan te pakken. De uiteindelijke 12-laagse architectuur behaalt een Cohen's Kappa van 0,9692 waarbij alle klassen een nauwkeurigheid van meer dan 94,46% overschrijden, wat vertrouwenscalibratie aantoont met een kloof van 24,25% tussen correcte en incorrecte voorspellingen. Onze aanpak behaalt een prestatieniveau binnen 1,34% van een fijn afgestelde ResNet-50 (98,57%) zonder gebruik van externe data, wat de effectiviteit van systematisch architectuurontwerp voor domeinspecifieke toepassingen valideert. Volledige code, getrainde modellen en evaluatiescripts zijn publiekelijk beschikbaar.

English

This work presents a systematic investigation of custom convolutional neural network architectures for satellite land use classification, achieving 97.23% test accuracy on the EuroSAT dataset without reliance on pre-trained models. Through three progressive architectural iterations (baseline: 94.30%, CBAM-enhanced: 95.98%, and balanced multi-task attention: 97.23%) we identify and address specific failure modes in satellite imagery classification. Our principal contribution is a novel balanced multi-task attention mechanism that combines Coordinate Attention for spatial feature extraction with Squeeze-Excitation blocks for spectral feature extraction, unified through a learnable fusion parameter. Experimental results demonstrate that this learnable parameter autonomously converges to alpha approximately 0.57, indicating near-equal importance of spatial and spectral modalities for satellite imagery. We employ progressive DropBlock regularization (5-20% by network depth) and class-balanced loss weighting to address overfitting and confusion pattern imbalance. The final 12-layer architecture achieves Cohen's Kappa of 0.9692 with all classes exceeding 94.46% accuracy, demonstrating confidence calibration with a 24.25% gap between correct and incorrect predictions. Our approach achieves performance within 1.34% of fine-tuned ResNet-50 (98.57%) while requiring no external data, validating the efficacy of systematic architectural design for domain-specific applications. Complete code, trained models, and evaluation scripts are publicly available.

Gebalanceerde Multi-Task Aandacht voor Satellietbeeldclassificatie: Een Systematische Aanpak om 97,23% Nauwkeurigheid te Bereiken op EuroSAT Zonder Voorafgaande Training

Balanced Multi-Task Attention for Satellite Image Classification: A Systematic Approach to Achieving 97.23% Accuracy on EuroSAT Without Pre-Training

Samenvatting

Support