Abstract
Edge devices play an ever-increasing role as to reduce latency, improve efficiency and adapt simplicity. However, their lower processing capabilities compared to traditional setup means that their usage are also limited. This research explores the possibility of implementing both wake word and speech recognition on an edge device. The proposed wake word system, which is for voice activation, is developed based on the LSTM Neural Network model. The model is trained, modelled, and evaluated to respond to the wake word of “Hey SellTron”. As for the speech recognition (voice command) system, Google Speech API was selected to recognize standard directional commands (left, right, forwards, backwards), as well as the new conceptual locational commands (“go to kitchen”, “go to bedroom”, etc.). To prove the feasibility of the developed system, these two features were integrated into an edge device on a mobile platform; representing a conceptual mini wheelchair ‘prototype’ due to project constraints. Evaluations showed that the prototype was able to activate and respond to voice commands correctly with
over 80% accuracy in low ambient noise (<50dB).
Metadata
Item Type: | Article |
---|---|
Creators: | Creators Email / ID Num. Low, Jian He UNSPECIFIED Fadhlullah, Solahuddin Yusuf solah.fadhlullah@newinti.edu.my Kamarulazizi, Khadijah khadijah.azizi@ newinti.edu.my Abdullah, Samihah samihah.abdullah@uitm.edu.my Abdul Hamid, Shabinar shabinar@uitm.edu.my |
Subjects: | T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Electronics > Applications of electronics |
Divisions: | Universiti Teknologi MARA, Shah Alam > College of Engineering |
Journal or Publication Title: | Journal of Electrical and Electronic Systems Research (JEESR) |
UiTM Journal Collections: | UiTM Journal > Journal of Electrical and Electronic Systems Research (JEESR) |
ISSN: | 1985-5389 |
Volume: | 24 |
Number: | 1 |
Page Range: | pp. 80-88 |
Keywords: | Voice activated wheelchair, wake word, speech recognition, LSTM, Google Speech API |
Date: | April 2024 |
URI: | https://ir.uitm.edu.my/id/eprint/94745 |