Advancing thermal image object detection for autonomous driving in adverse weather: integrating semi-supervision with a spatial edge-aware attention mechanism

Loading...
Thumbnail Image

Date

Authors

Han, Gaeul

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Thermal imaging has emerged as a critical sensing modality for object detection in low-visibility driving conditions, where conventional visible-spectrum (RGB) sensors often fail. However, thermal images also exhibit inherent limitations, including low spatial resolution and weak object boundaries. As a result, small or distant objects may be represented by only a few pixels, with faint and ambiguous edge information. To address these challenges, this thesis proposes two novel methodologies that aim to enhance detection robustness and accuracy without increasing computational complexity. The first methodology introduces a Spatial Edge-Aware Attention (SEA) module, integrated into a YOLOv8 general object detection backbone. In this approach, a conventional Sobel filter, together with multilayer convolutional subnetworks, is employed to extract edge features directly from raw thermal images. These edge features are then injected into the SEA module, enabling the proposed detector to emphasize object boundaries and improve the localization of indistinct targets in challenging scenarios. Building upon the first methodology, the second approach introduces a semi-supervised learning framework to further improve detection performance. Specifically, a self-supervised rotation prediction pretext task is employed during a warm-up training phase, allowing the model to learn orientation-invariant representations from unlabeled thermal data. This learned representation is subsequently leveraged during downstream training with limited labeled data, enabling effective semi-supervision. Together, these two methodologies address key limitations of existing thermal object detection techniques for adverse weather conditions. Experimental evaluations conducted on multiple benchmark datasets demonstrate that the proposed approaches achieve improved detection accuracy and robustness while maintaining computational efficiency.

Description

Thesis embargoed until January 8, 2027.

Keywords

Citation

Endorsement

Review

Supplemented By

Referenced By