Abstract
Digital archives of radio news broadcasts can possibly be made searchable by combining speech recognition with information retrieval. We explore this possibility for the retrieval of news broadcasts in Danish. An average of 84% of the words in the broadcasts was recognized. Most of the unrecognized words were compounds, names, and other words that appear of value to retrieval. Thus, the set of words describing a broadcast has to be expanded to compensate for the recognition errors. We discuss doing this by exploiting the alternative matches from the speech recognizer and by extracting words from a related corpus
Original language | English |
---|---|
Title of host publication | OzCHI '16. Proceedings of the 28th Australian Conference on Computer-Human Interaction |
Number of pages | 5 |
Publisher | Association for Computing Machinery |
Publication date | 2016 |
Pages | 160-164 |
ISBN (Electronic) | 978-1-4503-4618-4 |
DOIs | |
Publication status | Published - 2016 |
Event | 28th Australian Conference on Computer-Human Interaction - Launceston, Australia Duration: 29 Nov 2016 → 2 Dec 2016 Conference number: 28 |
Conference
Conference | 28th Australian Conference on Computer-Human Interaction |
---|---|
Number | 28 |
Country/Territory | Australia |
City | Launceston |
Period | 29/11/2016 → 02/12/2016 |