Open Source Speech Recognition on Edge Devices

Abstract

Deep learning has revived the field of automatic speech recognition (ASR) in the last ten years and pushed recognition rates into regions on par with humans. Applications like Siri, Amazon Alexa and Google Assistant are very popular, but have inherent privacy problems. In this paper, we evaluate state of the art open source ASR models regarding their usability in a smart speaker without cloud, both in terms of accuracy and runtime performance on cost-effective low power edge devices. We found Kaldi to be the most accurate solution and also among the fastest ones. It runs more than fast enough on an Nvidia Jetson Nano. It is still not on par with commercial cloud services, but getting close to it.

Mehr zum Titel

Titel Open Source Speech Recognition on Edge Devices
Medien 10th International Conference on Advanced Computer Information Technologies (ACIT)
Verlag IEEE
Heft ---
Band ---
ISBN 978-1-7281-6759-6
Verfasser/Herausgeber Prof. Dr. René Peinl, Basem Rizk, Robert Szabad
Seiten ---
Veröffentlichungsdatum 30.09.2020
Projekttitel ---
Zitation Peinl, René; Rizk, Basem; Szabad, Robert (2020): Open Source Speech Recognition on Edge Devices. 10th International Conference on Advanced Computer Information Technologies (ACIT).