2023

  • Autonomy Loops for Monitoring, Operational Data Analytics, Feedback, and Response in HPC Operations

    Boito, F., Brandt, J., Cardellini, V., Carns, P., Ciorba, F. M., Egan, H., Eleliemy, A., Gentile, A., Gruber, T., Hanson, J., Haus, U. U., Huck, K., Ilsche, T., Jakobsche, T., Jones, T., Karlsson, S., Mueen, A., Ott, M., Patki, T., Raghavan, K., Simms, S., Shoga, K., Showerman, M., Tiwari, D., Wilde, T. & Yamamoto, K., 2023, Proceedings of 2023 IEEE International Conference on Cluster Computing Workshops and Posters. IEEE, p. 37-43

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Challenges in HPCQC Integration

    Elsharkawy, A., To, X.-T. M., Seitz, P., Chen, Y., Stade, Y., Geiger, M., Huang, Q., Guo, X., Ansari, M. A., Ruefenacht, M., Schulz, L., Karlsson, S., Mendl, C. B., Kranzlmüller, D. & Schulz, M., 22 Sept 2023, Proceedings of 2023 IEEE International Conference on Quantum Computing and Engineering. IEEE, p. 405-406 2 p. 10313875

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Improving a Multigrid Poisson Solver with Peer-to-Peer Communication and Task Dependencies

    Rydahl, A. & Karlsson, S., 2023, OpenMP: Advanced Task-Based, Device and Compiler Programming. McIntosh-Smith, S., Deakin, T., Klemm, M., de Supinski, B. R. & Klinkenberg, J. (eds.). Springer, p. 129-143 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 14114 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Modeling of Errors in Quantum Computers with Generated Structural Circuits

    Schneider, J., Gammelmark, M. & Karlsson, S., 2023, Proceedings of 2023 IEEE International Conference on Quantum Computing and Engineering (QCE). IEEE, p. 122-126 5 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • OpenMP Target Offload Utilizing GPU Shared Memory

    Gammelmark, M., Rydahl, A. & Karlsson, S., 2023, 19th International Workshop on OpenMP. Springer, Vol. 14114. p. 114-128

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2022

  • Feasibility Studies in Multi-GPU Target Offloading

    Rydahl, A., Gammelmark, M. & Karlsson, S., 2022, OpenMP in a Modern World: From Multi-device Support to Meta Programming - 18th International Workshop on OpenMP, IWOMP 2022, Proceedings. Klemm, M., de Supinski, B. R., Klinkenberg, J. & Neth, B. (eds.). Springer, p. 81-93 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 13527 LNCS).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2021

  • Energy-Efficient Application-Specific Instruction-Set Processor for Feature Extraction in Smart Vision Systems

    Ferreira, L., Malkowsky, S., Persson, P., Karlsson, S., Astrom, K. & Liu, L., 2021, Proceedings of 55th Asilomar Conference on Signals, Systems, and Computers. IEEE, p. 324-328

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2016

  • A scalable lock-free hash table with open addressing

    Nielsen, J. P. & Karlsson, S., 2016, Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. Association for Computing Machinery, p. 1-2 2 p. 33. (ACM SIGPLAN Notices).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Towards Unifying OpenMP Under the Task-Parallel Paradigm: Implementation and Performance of the taskloop Construct

    Podobas, A. & Karlsson, S., 2016, OpenMP: Memory, Devices, and Tasks. Springer, Vol. 9903. p. 116-129 (Lecture Notes in Computer Science, Vol. 9903).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2015

  • A Scalable Prescriptive Parallel Debugging Model

    Jensen, N. B., Quarfot Nielsen, N., Lee, G. L., Karlsson, S., Legendre, M., Schulz, M. & Ahn, D. H., 2015, Proceedings of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2015). IEEE, p. 473-483

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Experiences with Compiler Support for Processors with Exposed Pipelines

    Jensen, N. B., Schleuniger, P., Hindborg, A. E., Walter, M. & Karlsson, S., 2015, Proceedings of the 29th International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2015). IEEE, p. 137-143

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Hardware Transactional Memory Optimization Guidelines, Applied to Ordered Maps

    Bonnichsen, L. F., Probst, C. W. & Karlsson, S., 2015, Proceedings of the 13th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA 2015). IEEE, Vol. 3. p. 124-131

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2014

  • A Synthesizable Multicore Platform for Microwave Imaging

    Schleuniger, P. & Karlsson, S., 2014, Reconfigurable Computing: Architectures, Tools, and Applications. Proceedings. Springer, p. 197-204 (Lecture Notes in Computer Science, Vol. 8405).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Automatic generation of application specific FPGA multicore accelerators

    Hindborg, A. E., Schleuniger, P., Jensen, N. B., Walter, M., Brock-Nannestad, L., Bonnichsen, L. F., Probst, C. W. & Karlsson, S., 2014, Conference Record of the 48th Asilomar Conference on Signals, Systems & Computers. Matthews, M. B. (ed.). IEEE, p. 1440-1444

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Code Commentary and Automatic Refactorings using Feedback from Multiple Compilers

    Jensen, N. B., Probst, C. W. & Karlsson, S., 2014, Proceedings of the 7th Swedish Workshop on Multicore Computing (MCC'14). 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Compiler Feedback using Continuous Dynamic Compilation during Development

    Jensen, N. B., Karlsson, S. & Probst, C. W., 2014, Proceedings - Workshop on Dynamic Compilation Everywhere. 12 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Hardware Realization of an FPGA Processor – Operating System Call Offload and Experiences

    Hindborg, A. E., Schleuniger, P., Jensen, N. B. & Karlsson, S., 2014, Proceedings of the 2014 Conference on Design and Architectures for Signal and Image Processing (DASIP). Morawiec, A. & Hinderscheit, J. (eds.). IEEE, 8 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Library Support for Resource Constrained Accelerators

    Brock-Nannestad, L. & Karlsson, S., 2014, Using and Improving OpenMP for Devices, Tasks, and More: Proceedings of the 10th International Workshop on OpenMP, IWOMP 2014. DeRose, L., Supinski, B. R. D., Olivier, S. L., Chapman, B. M. & Müller, M. S. (eds.). Springer, p. 187-201 (Lecture Notes in Computer Science; No. 8766).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Testing Infrastructure for Operating System Kernel Development

    Walter, M. & Karlsson, S., 2014, Proceedings of the 7th Swedish Workshop on Multicore Computing (MCC'14). 4 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2013

  • ELB-trees an efficient and lock-free B-tree derivative

    Bonnichsen, L. F., Karlsson, S. & Probst, C. W., 2013, 2013 IEEE 6th International Workshop on Multi-/Many-core Computing Systems (MuCoCoS). IEEE, 10 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Synthetic Aperture Radar Data Processing on an FPGA Multi-Core System

    Schleuniger, P., Kusk, A., Dall, J. & Karlsson, S., 2013, Architecture of Computing Systems – ARCS 2013: 26th International Conference, Prague, Czech Republic, February 19-22, 2013. Proceedings. Springer, p. 74-85 (Lecture Notes in Computer Science, Vol. 7767).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2012

  • Design Principles for Synthesizable Processor Cores

    Schleuniger, P., McKee, S. A. & Karlsson, S., 2012, Architecture of Computing Systems – ARCS 2012: 25th International Conference Munich, Germany, February 28 – March 2, 2012 Proceedings. Springer, p. 111-122 (Lecture Notes in Computer Science, Vol. 7179).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Guiding Programmers to Higher Memory Performance

    Jensen, N. B., Larsen, P., Ladelsky, R., Zaks, A. & Karlsson, S., 2012, Proceedings of 5th Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-12). 12 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Parallelizing More Loops with Compiler Guided Refactoring

    Larsen, P., Ladelsky, R., Lidman, J., McKee, S. A., Karlsson, S. & Zaks, A., 2012, 2012 41st International Conference on Parallel Processing (ICPP). IEEE, p. 410-419 (International Conference on Parallel Processing. Proceedings).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2011

  • Adapt or Become Extinct! The Case for a Unified Framework for Deployment-Time Optimization

    Goumas, G., McKee, S. A., Själander, M., Gross, T. R., Karlsson, S., Probst, C. W. & Zhang, L., 2011, EXADAPT '11 Proceedings of the 1st International Workshop on Adaptive Self-Tuning Computing Systems for the Exaflop Era. University of Strathclyde

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Comparing the Overhead of Lock-based and Lock-free Implementations of Priority Queues

    Passas, S. & Karlsson, S., 2011, Proceedings of Fourth Workshop on Programmability Issues for Heterogeneous Multicores.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Compiler Driven Code Comments and Refactoring

    Larsen, P., Ladelsky, R., Karlsson, S. & Zaks, A., 2011, Proceedings of Fourth Workshop on Programmability Issues for Heterogeneous Multicores.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Efficient Co-Simulation of Multicore Systems

    Brock-Nannestad, L. & Karlsson, S., 2011, Proceedings of the Fourth Swedish Workshop on Multicore Computing.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Hardware Support for Dynamic Languages

    Schleuniger, P., Karlsson, S. & Probst, C. W., 2011, ACACES 2011 Seventh International Summer School on Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Improving Performance of Software Implemented Floating Point Addition

    Hindborg, A. E. & Karlsson, S., 2011, Proceedings of the Fourth Swedish Workshop on Multicore Computing.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • SRC: FenixOS - A Research Operating System Focused on High Scalability and Reliability

    Passas, S. & Karlsson, S., 2011, ICS '11: Proceedings of the international conference on Supercomputing. ACM, 371 p.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Towards a Time-predictable Dual-Issue Microprocessor: The Patmos Approach

    Schoeberl, M., Schleuniger, P., Puffitsch, W., Brandner, F., Probst, C. W., Karlsson, S. & Thorn, T., 2011, Bringing Theory to Practice: Predictability and Performance in Embedded Systems: PPES’11, March 18, 2011, Grenoble, France. OASICS, Vol. 18. p. 11-21

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2010

  • Compiler Driven Code Comments and Refactoring

    Larsen, P., Ladelsky, R., Karlsson, S. & Zaks, A., 2010, Proceedings of Swedish Workshop on Multi-Core Computing. Vol. 3.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Expressing Inter-task Dependencies between Parallel Stencil Operations

    Larsen, P., Karlsson, S. & Madsen, J., 2010, (Accepted/In press) Proceedings of MULTIPROG 2010.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • High Performance Triangle versus Box Intersection Checks

    Christensen, T. & Karlsson, S., 2010, Proceedings of Workshop on Parallel Programming and Applications on Accelerator Clusters (PPAAC).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Tinuso: A processor architecture for a multi-core hardware simulation platform

    Schleuniger, P. & Karlsson, S., 2010, (Accepted/In press) Proceedings of the Third Swedish Workshop on Multi-Core Computing - MCC'10. Vol. 3.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research

2009

  • Identifying Inter-task Communication in Shared Memory Programming Models

    Larsen, P., Karlsson, S. & Madsen, J., 2009, Lecture Notes in Computer Science: Evolving OpenMP in an Age of Extreme Parallelism. Springer Berlin / Heidelberg, p. 168-182 183 p. (Lecture Notes in Computer Science; No. 5568).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Performance Analysis of a Hardware/Software-based Cache Coherence Protocol in Shared Memory MPSoCs

    Rasmussen, M. S., Karlsson, S. & Sparsø, J., 2009, Workshop on Programming Models for Emerging Architectures.

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2008

  • Exploiting spatial parallelism in Ethernet-based cluster interconnects

    Passas, S., Kotsis, G., Karlsson, S. & Bilas, A., 2008, Proceedings of Workshop on Communication Architecture for Clusters, CAC 2008. IEEE Computer Society Press

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

  • Parallelism and Scalability in an Image Processing Application

    Rasmussen, M. S., Stuart, M. B. & Karlsson, S., 2008, OpenMP in New Era of Parallelism. Springer Berlin / Heidelberg, Vol. 5004. p. 158-169 (Lecture Notes in Computer Science, Vol. 5004).

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review

2007

  • MultiEdge: An Edge-based Communication Subsystem for Scalable Commodity Servers

    Karlsson, S., Passas, S., Kotsis, G. & Bilas, A., 2007, IEEE International Parallel and Distributed Processing Symposium, 2007. IEEE

    Research output: Chapter in Book/Report/Conference proceeding > Article in proceedings > Research > peer-review