NeuralCasting: A Front-End Compilation Infrastructure for Neural Networks

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

This study presents the development of NeuralCasting, a front-end compiler framework capable of converting (casting) neural networks encoded in the ONNX format to optimized C code. The primary objective is to enable the compilation of neural networks depending only on standard C libraries, thus eliminating the need for a separate inference engine, such as ONNX runtime. Furthermore, this feature allows the generation of C code suitable both for bare metal embedded devices and other resource-constrained devices, extending model applicability to a plethora of hardware targets. The framework addresses critical applications such as real-time audio processing, especially regarding latency constraints. As an example use case, we compile three different models: a Multi-Layer Perceptron (MLP) composed of fully connected layers, a ResNet cell for image recognition, and an NSNet 2 for speech enhancement and noise suppression. We analyze and compare the performance with other widely used frameworks, such as ONNX Runtime and PyTorch. Our findings indicate that the developed compiler successfully generates optimized C code that meets real-time processing requirements for latency-sensitive applications, outperforming ONNX Runtime and PyTorch and reaching a speedup close to x 10 for small-sized MLP models, which are suitable for deployment on edge devices.
Original languageEnglish
Title of host publicationProceedings of the 2024 11th International Conference on Internet of Things: Systems, Management and Security (IOTSMS)
PublisherIEEE
Publication date2024
Pages161-168
ISBN (Print)979-8-3503-6651-8
ISBN (Electronic)979-8-3503-6650-1
DOIs
Publication statusPublished - 2024
Event11th International Conference on Internet of Things: Systems, Management and Security - Niagara Building, Malmö, Sweden
Duration: 2 Sept 20245 Sept 2024
https://emergingtechnet.org/IOTSMS2024/index.php

Conference

Conference11th International Conference on Internet of Things: Systems, Management and Security
LocationNiagara Building
Country/TerritorySweden
CityMalmö
Period02/09/202405/09/2024
Internet address
Series2024 11th International Conference on Internet of Things: Systems, Management and Security, Iotsms 2024
ISSN2832-3033

Fingerprint

Dive into the research topics of 'NeuralCasting: A Front-End Compilation Infrastructure for Neural Networks'. Together they form a unique fingerprint.

Cite this