Abstract
The recent trends in processor architecture show that parallel processing is moving into new areas of computing in the form of many-core desktop processors and multi-processor system-on-chip. This means that parallel processing is required in application areas that traditionally have not used parallel programs. This paper investigates parallelism and scalability of an embedded image processing application. The major challenges faced when parallelizing the application were to extract enough parallelism from the application and to reduce load imbalance. The application has limited immediately available parallelism. It is difficult to further extract parallelism since the application has small data sets and parallelization overhead is relatively high. There is also a fair amount of load imbalance which is made worse by a non-uniform memory latency. Even so, we show that with some tuning relative speedups in excess of 9 on a 16 CPU system can be reached.
Original language | English |
---|---|
Title of host publication | OpenMP in New Era of Parallelism |
Volume | 5004 |
Publisher | Springer Berlin / Heidelberg |
Publication date | 2008 |
Pages | 158-169 |
ISBN (Print) | 978-3-540-79560-5 |
DOIs | |
Publication status | Published - 2008 |
Event | 4th International Workshop on OpenMP - West Lafayette, United States Duration: 12 May 2008 → 14 May 2008 Conference number: 4 http://www.iwomp.org/ |
Workshop
Workshop | 4th International Workshop on OpenMP |
---|---|
Number | 4 |
Country/Territory | United States |
City | West Lafayette |
Period | 12/05/2008 → 14/05/2008 |
Internet address |
Series | Lecture Notes in Computer Science |
---|---|
Volume | 5004 |
ISSN | 0302-9743 |
Keywords
- OpenMP
- Image processing
- Parallelization