Abstract
This paper introduces two dynamic real-time pruning techniques PeakRNN and StatsRNN for reducing costly multiplications and memory accesses in recurrent neural networks. The methods are demonstrated on a gated recurrent unit in a multi-layer network, solving a single-channel speech enhancement task with a wide variety of real-world acoustic environments and speakers. The performance is compared against the baseline gated recurrent unit and the DeltaRNN method. Compared to the unprocessed speech, the SNR and Perceptual Evaluation of Speech Quality were on average improved by 8.11 dB and 0.43 MOS-LQO, respectively. Additionally, the two proposed methods outperformed DeltaRNN by 0.7 dB and 0.11 MOS-LQO in the two objective measures, while using the same computational budget per timestep and reducing the original operations by 88%. Furthermore, PeakRNN is fully deterministic, i.e. it is always known in advance how many computations will be executed. Such worst-case guarantees are crucial for real-time acoustics applications.
Original language | English |
---|---|
Title of host publication | Proceedings of 29th European Signal Processing Conference |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 2022 |
ISBN (Print) | 978-1-6654-0900-1 |
DOIs | |
Publication status | Published - 2022 |
Event | 29th European Signal Processing Conference - Virtual event, Dublin, Ireland Duration: 23 Aug 2021 → 27 Aug 2021 Conference number: 29 https://eusipco2021.org/ |
Conference
Conference | 29th European Signal Processing Conference |
---|---|
Number | 29 |
Location | Virtual event |
Country/Territory | Ireland |
City | Dublin |
Period | 23/08/2021 → 27/08/2021 |
Internet address |
Keywords
- RNN
- Determinism
- Statistics
- Peaks
- Threshold
- Single-channel speech enhancement
- Hearing instruments