Design and optimization of heterogeneous coded distributed computing

dc.contributor.advisorDeng, Yong
dc.contributor.authorZhang, Siyu
dc.contributor.committeememberRathore, M. Mazhar
dc.contributor.committeememberAkilan, Thangarajah
dc.contributor.committeememberZhou, Yushi
dc.date.accessioned2025-11-21T20:10:05Z
dc.date.available2025-11-21T20:10:05Z
dc.date.created2025
dc.date.issued2025
dc.description.abstractThe massive increase in data volume in recent years has posed significant challenges for traditional data processing systems. Although distributed computing has been considered as an effective solution, its efficient implementation faces the challenge of the high communication overhead incurred by data exchange (shuffling) between workers. Coded Distributed Computing (CDC) has been proposed by utilizing coded multicasting to reduce the shuffling load. To our best knowledge, existing works on the CDC only consider input files with uniform file size, limiting their practicality in real-world applications. To address this limitation, we propose a Heterogeneous Coded Distributed Computing (HetCDC) scheme to handle input files of nonuniform sizes. We then formulate a joint optimization problem to optimize the file placement and coded shuffling strategies to minimize the shuffling load. Through reformulation, we convert the nonconvex optimization problem into an integer linear programming problem and solve it through the branch-and-cut method. Numerical studies show the proposed HetCDC outperforms existing works. Based on the Het- CDC, we further develop a Heterogeneous TeraSort algorithm to improve the sorting time of traditional TeraSort, which is a key building blocks for many big data processing algorithms.en_US
dc.identifier.urihttps://knowledgecommons.lakeheadu.ca/handle/2453/5546
dc.language.isoenen_US
dc.titleDesign and optimization of heterogeneous coded distributed computingen_US
dc.typeThesisen_US
etd.degree.disciplineEngineering : Electrical and Computeren_US
etd.degree.grantorLakehead Universityen_US
etd.degree.levelMasteren_US
etd.degree.nameMaster of Science in Electrical and Computer Engineeringen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ZhangS2025m-2b.pdf
Size:
1.62 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.24 KB
Format:
Item-specific license agreed upon to submission
Description: