Title: Hierarchical energy monitoring for task mapping in many-core systems
Authors: Castilhos, G.; Mandelli, M.; Ost, Luciano; Moraes, F.
First Published: 9-Feb-2016
Publisher: Elsevier
Citation: Journal of Systems Architecture, 2016, 63, pp. 80-92 (12)
Abstract: This work addresses a research subject with a rich literature: task mapping in NoC-based systems. Task mapping is the process of selecting a processing element to execute a given task. The number of cores in many-core systems increases the complexity of the task mapping. The main concerns in task mapping in large systems include (i) scalability; (ii) dynamic workload; and (iii) reliability. It is necessary to distribute the mapping decision across the system to ensure scalability. The workload of emerging many-core systems may be dynamic, i.e., new applications may start at any moment, leading to different mapping scenarios. Therefore, it is necessary to execute the mapping process at runtime to support a dynamic workload assignment. The workload assignment plays an important role in the many-core system reliability. Load imbalance may generate hotspot zones and, consequently, thermal implications. More recently, task mapping techniques aiming at improving system reliability have been proposed in the literature. However, such approaches rely on centralized mapping decisions, which are not scalable. To address these challenges, the main goal of this work is to propose a hierarchical runtime mapping heuristic, which provides scalability and a fair workload distribution. Distributing the workload inside the system increases the system reliability in the long term, due to the reduction of hotspot regions. The proposed mapping heuristic considers the application workload as a function of the consumed energy in the processors and NoC routers. The proposal adopts a hierarchical energy monitoring scheme, able to estimate at runtime the consumption at each processing element. The mapping uses the energy estimated by the monitoring scheme to guide the mapping decision. Results compare the proposal against a mapping heuristic whose main cost function minimizes the communication energy. Results obtained in large systems, up to 256 cores, show improvements in the workload distribution (average value 59.2%) and a reduction in the maximum energy values spent by the processors (average value 32.2%). Such results demonstrate the effectiveness of the proposal.
DOI Link: 10.1016/j.sysarc.2016.01.005
ISSN: 1383-7621
Version: Post-print
Status: Peer-reviewed
Type: Journal Article
Rights: Copyright © 2016, Elsevier. The file associated with this record is distributed after the end of the embargo period under the Creative Commons “Attribution Non-Commercial No Derivatives” licence, further details of which can be found via the following link:
Description: The file associated with this record is under a 24-month embargo from publication in accordance with the publisher's self-archiving policy. The full text may be available through the publisher links provided above.
Appears in Collections: Published Articles, Dept. of Engineering

Files in This Item:
File: 1-s2.0-S1383762116000175-main.pdf
Description: Post-review (final submitted)
Size: 2.02 MB
Format: Adobe PDF

Items in LRA are protected by copyright, with all rights reserved, unless otherwise indicated.