How to Efficiently Parallelize Irregular DOACROSS Loops Using Fine Granularity and OpenMP Tasks: The SPEC mcf Case

Salamanca, Juan [UNESP]; Baldassin, Alexandro [UNESP]

doi:10.1007/978-3-031-40744-4_6

dc.contributor.author	Salamanca, Juan [UNESP]
dc.contributor.author	Baldassin, Alexandro [UNESP]
dc.contributor.institution	Universidade Estadual Paulista (UNESP)
dc.date.accessioned	2025-04-29T18:07:11Z
dc.date.issued	2023-01-01
dc.description.abstract	Certain loops are considered hard to parallelize. Examples are loops that have loop-carried dependencies (DOACROSS loops) and are also irregular, that is, the dependencies between iterations vary depending on the context. Many techniques have been studied for parallelizing this type of loop; however, the OpenMP standard offers no efficient way to parallelize them. From the literature, it is known that many of these loops can be efficiently parallelized using fine-grained techniques (identifying strongly connected components). On the other hand, the most efficient way to parallelize this type of loop using OpenMP tasks has not been explored. Thus, this paper discusses the various forms of parallelizing this type of loop using SPEC 429.mcf as a case study; in particular, how to parallelize mcf using fine granularity in tasks. To that end, this paper proposes new constructs (ste_for and ste) and speculative dependency types (spec_in, spec_out, and spec_inout). An initial evaluation using different implementations to parallelize the hottest loop of mcf shows that Speculative Task Execution can achieve speed-ups of up to 2.44× with respect to the task-depend version.	en
dc.description.affiliation	DEMAC/IGCE – Sao Paulo State University (Unesp), SP
dc.description.affiliationUnesp	DEMAC/IGCE – Sao Paulo State University (Unesp), SP
dc.format.extent	81-96
dc.identifier	http://dx.doi.org/10.1007/978-3-031-40744-4_6
dc.identifier.citation	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v. 14114 LNCS, p. 81-96.
dc.identifier.doi	10.1007/978-3-031-40744-4_6
dc.identifier.issn	1611-3349
dc.identifier.issn	0302-9743
dc.identifier.scopus	2-s2.0-85172120604
dc.identifier.uri	https://hdl.handle.net/11449/297614
dc.language.iso	eng
dc.relation.ispartof	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
dc.source	Scopus
dc.subject	DOACROSS Parallelization
dc.subject	OpenMP
dc.subject	Speculative Tasks
dc.title	How to Efficiently Parallelize Irregular DOACROSS Loops Using Fine Granularity and OpenMP Tasks: The SPEC mcf Case	en
dc.type	Conference paper	en
dspace.entity.type	Publication
unesp.author.orcid	0000-0002-0569-2806[1]
unesp.author.orcid	0000-0001-8824-3055[2]
unesp.campus	Universidade Estadual Paulista (UNESP), Instituto de Geociências e Ciências Exatas, Rio Claro	pt

Collections

Rio Claro - IGCE - Instituto de Geociências e Ciências Exatas
