Enhancing semantic segmentation: architectural innovations and strategies for label-efficient learning

Suresh, Tharrengini

Please use this identifier to cite or link to this item: https://knowledgecommons.lakeheadu.ca/handle/2453/5503

Title:	Enhancing semantic segmentation: architectural innovations and strategies for label-efficient learning
Authors:	Suresh, Tharrengini
Issue Date:	2025
Abstract:	Semantic segmentation is a fundamental component of modern computer vision applications. Although supervised learning models have achieved state-of-the-art performance in this domain, they rely heavily on large volumes of labeled data, which is an expensive and time-consuming requirement. Thus, this research aims to develop enhanced supervised semantic segmentation models that balance accuracy and data efficiency for visual perception tasks in autonomous driving environments. To achieve this, the thesis is organized into two distinct phases. The first phase investigates a dual-network architecture, in which an auxiliary boundary detection network is incorporated into the primary segmentation framework to mitigate pixelation artifacts at object boundaries in multiclass segmentation of complex scenes. The experimental findings demonstrate the importance of designing unified segmentation models that take advantage of architectural enhancements capable of extracting richer feature representations for improved performance. The second phase leverages insights from the previous stage and focuses on the development of an efficient deep learning model with attention mechanisms and multi-scale feature refinement. The proposed method introduces a novel depth-wise, point-wise feature pyramid module that extracts information-rich spatio-semantic context from early and deep feature representations, improving model efficacy. Exhaustive experimental studies conducted on widely used benchmark datasets validate the effectiveness of the proposed models, which achieve competitive performance while offering improved computational efficiency relative to baseline approaches. The findings highlight that strategically balancing resource utilization with architectural innovation can yield strong performance while minimizing annotation demands and environmental impact. This research sets a valuable precedent for building competitive, resource-aware vision systems suited to constrained application settings.
URI:	https://knowledgecommons.lakeheadu.ca/handle/2453/5503
metadata.etd.degree.discipline:	Electrical and Computer Engineering
metadata.etd.degree.name:	Master of Science in Electrical and Computer Engineering
metadata.etd.degree.level:	Master
metadata.dc.contributor.advisor:	Akilan, Thangarajah
metadata.dc.contributor.committeemember:	Yassine, Abdulsalam Bin Ahmed, Saad Zhou, Yushi
Appears in Collections:	Electronic Theses and Dissertations from 2009

Files in This Item:

File	Description	Size	Format
SureshT2025m-2b.pdf		13.06 MB	Adobe PDF	View/Open

Show full item record Recommend this item

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets