Detecting coarse-grain parallelism using an interprocedural parallelizing compiler
Abstract
This paper presents an extensive empirical evaluation of an interprocedural parallelizing compiler, developed as part of the Stanford SUIF compiler system. The system incorporates a comprehensive and integrated collection of analyses, including privatization and reduction recognition for both array and scalar variables, and symbolic analysis of array subscripts. The interprocedural analysis framework is designed to provide analysis results nearly as precise as full inlining but without its associated costs. Experimentation with this system shows that it is capable of detecting coarser granularity of parallelism than previously possible. Specifically, it can parallelize loops that span numerous procedures and hundreds of lines of codes, frequently requiring modifications to array data structures such as privatization and reduction transformations. Measurements from several standard benchmark suites demonstrate that an integrated combination of interprocedural analyses can substantially advance the capability of automatic parallelization technology.
Additional Information
© 1995 ACM. This research was supported in part by the Air Force Material Command and ARPA contract F30602-95-C-0098, ARPA contract DABT63-94-C-0054, an NSF CISE postdoctoral fellowship, Jet Propulsion Laboratory, fellowships from Intel Corporation and AT&T Bell Laboratories, and an NSF Young Investigator Award.Additional details
- Eprint ID
- 71679
- DOI
- 10.1145/224170.224337
- Resolver ID
- CaltechAUTHORS:20161102-134346400
- Air Force Material Command
- F30602-95-C-0098
- Advanced Research Projects Agency (ARPA)
- DABT63-94-C-0054
- Advanced Research Projects Agency (ARPA)
- NSF
- JPL
- Intel Corporation
- AT&T Bell Laboratories
- Created
-
2016-11-02Created from EPrint's datestamp field
- Updated
-
2021-11-11Created from EPrint's last_modified field