Abstract
This work presents a neural architecture search approach based on evolutionary algorithms to tackle the problem of mixed-precision quantization considering 8, 16, and 32-bit depths for NSNet2, a neural network used for audio denoising tasks. The goal is to preserve high network performance while reducing the model size, making it suitable for deployment on embedded systems such as headsets. The neural architecture search (NAS) algorithm considers metrics such as (a) Perceptual Evaluation of Speech Quality (PESQ) to assess the audio quality after de-noising, (b) the inference time of the neural network, and (c) its memory footprint. Our results showcase remarkable improvements, with a 29.93% reduction in inference time and a 57.88% reduction in memory footprint compared to the full-precision float32 baseline, while preserving high performance in audio de-noising with a negligible drop of PESQ metric. This work provides a framework for estimating the optimal mixed-precision quantization configuration, which is beneficial for deploying neural networks on resource-constrained devices.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2025 IEEE International Conference on Pervasive Computing (PerCom) |
| Publisher | IEEE |
| Publication date | 2025 |
| Pages | 273-279 |
| ISBN (Print) | 979-8-3315-3554-4 |
| ISBN (Electronic) | 979-8-3315-3553-7 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 2025 IEEE International Conference on Pervasive Computing - Washington DC, United States Duration: 17 Mar 2025 → 21 Mar 2025 |
Conference
| Conference | 2025 IEEE International Conference on Pervasive Computing |
|---|---|
| Country/Territory | United States |
| City | Washington DC |
| Period | 17/03/2025 → 21/03/2025 |
Keywords
- Neural Architecture Search (NAS)
- Mixed-Precision Quantization
- Audio Denoising
- Embedded Systems
- Evolutionary algorithm