Recent Advance of Thai Open-Vocabulary Automatic Speech Recognition |
---|---|
DOI | |
Creator | Chai Wutiwiwatchai |
Title | Recent Advance of Thai Open-Vocabulary Automatic Speech Recognition |
Contributor | Vataya Chunwijitra, Sila Chunwijitra, Phuttapong Sertsi, Sawit Kasuriya, Patcharika Chootrakool, Kwanchiva Thangthai, Chanchai Junlouchai, Kamthorn Krairaksa |
Publisher | Sirindhorn International Institute of Technology, Bangkadi Campus (SIIT-BKD) |
Publication Year | 2017 (B.E. 2560) |
Journal Title | Journal of Intelligent Informatics and Smart Technology |
Journal Vol. | 1 |
Page no. | 1-7 |
Keyword | open-vocabulary, speech recognition, Thai language |
URL Website | https://ph05.tci-thaijo.org/index.php/JIIST |
Website title | Journal of Intelligent Informatics and Smart Technology |
ISSN | 2586-9167 |
Abstract | We describe recent developments in the NECTEC Thai open-vocabulary automatic speech recognition system. The techniques found beneficial over the baseline system are: hybrid word-subword language modeling to enhance vocabulary coverage under constrained resources; multi-conditioned noisy acoustic modeling to improve system robustness and spoken-style language model interpolation using a newly developed large social media speech database; recent state-of-the-art speech features; and lastly, online decoding, speech compression, and Docker-based distributed computing to reduce processing and data transmission time. These techniques result in a 29.0% word error rate on open-vocabulary noisy speech test sets, which is 42.5% lower in relative terms than that of the baseline system. The overall system operates at nearly 1.2xRT (about 1.2 times real time), which is promising for real applications. |
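
The figures quoted in the abstract can be related with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the baseline word error rate it prints is merely implied by the two numbers in the abstract (29.0% and a 42.5% relative reduction), not a value reported in this record. The real-time factor helper assumes the usual definition of processing time divided by audio duration.

```python
def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction, as a fraction of the baseline WER."""
    return (baseline_wer - new_wer) / baseline_wer


def implied_baseline(new_wer: float, rel_reduction: float) -> float:
    """Baseline WER implied by a new WER and a relative reduction."""
    return new_wer / (1.0 - rel_reduction)


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Real-time factor (xRT): processing time per second of audio."""
    return processing_seconds / audio_seconds


# Numbers taken from the abstract: 29.0% WER, 42.5% relative reduction.
baseline = implied_baseline(29.0, 0.425)
print(f"implied baseline WER: {baseline:.1f}%")

# Hypothetical timing: 12 s of processing for 10 s of audio gives 1.2xRT.
print(f"real-time factor: {real_time_factor(12.0, 10.0):.1f}xRT")
```

Under these assumptions the implied baseline is roughly 50%, so the reported improvement amounts to about 21 absolute percentage points of word error rate.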