Parallelism and Scalability in an Image Processing Application

Publication: Research (peer-reviewed), conference article. Annual report year: 2009

Recent trends in processor architecture show parallel processing moving into new areas of computing in the form of many-core desktop processors and multiprocessor systems-on-chip. As a result, parallel processing is now required in application areas that have traditionally not used parallel programs. This paper investigates the parallelism and scalability of an embedded image processing application. The major challenges in parallelizing the application were extracting enough parallelism and reducing load imbalance. The application has limited immediately available parallelism, and further extraction is constrained by small data sets and a relatively high parallelization overhead. Load balance is difficult to obtain because of the limited parallelism, and non-uniform memory latency makes it worse. Three parallel OpenMP implementations of the application are discussed and evaluated. We show that, with some modifications, relative speedups in excess of 9 can be reached on a 16-CPU system.
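To put the reported figure in context, the following minimal sketch computes the standard metrics behind a statement such as "relative speedup of 9 on 16 CPUs". It is not taken from the paper: the function names are hypothetical, the per-thread timings are made-up illustration values, and the only number drawn from the abstract is the speedup of 9 on 16 CPUs, which the abstract gives as a lower bound.

```python
from typing import Sequence


def relative_speedup(t_one_thread: float, t_p_threads: float) -> float:
    """Relative speedup: the parallel program on 1 thread vs. on p threads."""
    return t_one_thread / t_p_threads


def parallel_efficiency(speedup: float, p: int) -> float:
    """Fraction of ideal linear speedup achieved on p processors."""
    return speedup / p


def karp_flatt(speedup: float, p: int) -> float:
    """Karp-Flatt metric: experimentally determined serial fraction.

    It lumps together truly serial work and parallelization overhead,
    so it is an indicator, not a measurement of either one alone.
    """
    return (1.0 / speedup - 1.0 / p) / (1.0 - 1.0 / p)


def load_imbalance(per_thread_times: Sequence[float]) -> float:
    """Ratio of the slowest thread's time to the mean; 1.0 is perfect balance."""
    mean = sum(per_thread_times) / len(per_thread_times)
    return max(per_thread_times) / mean


if __name__ == "__main__":
    p = 16          # CPU count from the abstract
    speedup = 9.0   # lower bound on relative speedup from the abstract

    print("efficiency:", parallel_efficiency(speedup, p))
    print("Karp-Flatt serial fraction:", karp_flatt(speedup, p))

    # Hypothetical per-thread busy times (seconds), NOT measured data.
    print("load imbalance:", load_imbalance([1.0, 1.0, 1.0, 2.0]))
```

A speedup of 9 on 16 CPUs corresponds to a parallel efficiency of 9/16, about 56%, which is consistent with the abstract's description of limited parallelism and load imbalance.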
Original language: English
Journal: International Journal of Parallel Programming
Publication date: 2009
Volume: 37
Issue: 3
Pages: 306-323
ISSN: 0885-7458
State: Published

Conference

Conference: 4th International Workshop on OpenMP (IWOMP)
Number: 4
Country: United States
City: West Lafayette, IN
Period: 12/05/08 to 14/05/08
Citations: Web of Science® Times Cited: 1

Keywords

  • OpenMP, Image processing, Parallelization, Performance evaluation

ID: 3798559