Parallelism and Scalability in an Image Processing Application

Morten Sleth Rasmussen, Matthias Bo Stuart, Sven Karlsson

    Research output: Contribution to journalConference articleResearchpeer-review


    The recent trends in processor architecture show that parallel processing is moving into new areas of computing in the form of many-core desktop processors and multi-processor system-on-chips. This means that parallel processing is required in application areas that traditionally have not used parallel programs. This paper investigates parallelism and scalability of an embedded image processing application. The major challenges faced when parallelizing the application were to extract enough parallelism from the application and to reduce load imbalance. The application has limited immediately available parallelism and further extraction of parallelism is limited by small data sets and a relatively high parallelization overhead. Load balance is difficult to obtain due to the limited parallelism and made worse by non-uniform memory latency. Three parallel OpenMP implementations of the application are discussed and evaluated. We show that with some modifications relative speedups in excess of 9 on a 16 CPU system can be reached.
    Original languageEnglish
    JournalInternational Journal of Parallel Programming
    Issue number3
    Pages (from-to)306-323
    Publication statusPublished - 2009
    Event4th International Workshop on OpenMP - West Lafayette, United States
    Duration: 12 May 200814 May 2008
    Conference number: 4


    Workshop4th International Workshop on OpenMP
    Country/TerritoryUnited States
    CityWest Lafayette
    Internet address


    • OpenMP
    • Image processing
    • Parallelization
    • Performance evaluation


    Dive into the research topics of 'Parallelism and Scalability in an Image Processing Application'. Together they form a unique fingerprint.

    Cite this