Enter your keyword

2-s2.0-80054024480

[vc_empty_space][vc_empty_space]

Out of vocabulary detection in Indonesian speech recognition using word and syllable level decoding

Juari A.a, Purwarianti A.a

a Informatic Department, Institut Teknologi Bandung, Indonesia

[vc_row][vc_column][vc_row_inner][vc_column_inner][vc_separator css=”.vc_custom_1624529070653{padding-top: 30px !important;padding-bottom: 30px !important;}”][/vc_column_inner][/vc_row_inner][vc_row_inner layout=”boxed”][vc_column_inner width=”3/4″ css=”.vc_custom_1624695412187{border-right-width: 1px !important;border-right-color: #dddddd !important;border-right-style: solid !important;border-radius: 1px !important;}”][vc_empty_space][megatron_heading title=”Abstract” size=”size-sm” text_align=”text-left”][vc_column_text]One of the problems in speech recognition is out of vocabulary words (OOV) because they can make some words error. Out of vocabulary words are the words that cannot be recognized by speech recognizer because there is no recognizing database. Alignment, language model, and POS Tag method is proposed in order to recognize word error because of OOV words. Word and syllable level decoding from speech recognizer is the input for this method. Alignment is applied to word and syllable level decoding to get some differences from word and syllable level decoding. After that, language model and tag are also applied to determine if the words are correct. Speech recognition accuracy is about 75% if OOV rate is 15,5%. The OOV detection process reaches about 87% precision and 75% recall. Experiments also show that by using OOV detection, speech recognizer accuracy is increased by 11%. © 2011 IEEE.[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Author keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Acoustic model,False alarms,Language model,out of vocabulary,syllable level decoding,tag,Word level[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Indexed keywords” size=”size-sm” text_align=”text-left”][vc_column_text]acoustic model,alignment,false alarm,language model,out of vocabulary,speech recognition,syllable level decoding,tag,word level decoding[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Funding details” size=”size-sm” text_align=”text-left”][vc_column_text][/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”DOI” size=”size-sm” text_align=”text-left”][vc_column_text]https://doi.org/10.1109/ICEEI.2011.6021790[/vc_column_text][/vc_column_inner][vc_column_inner width=”1/4″][vc_column_text]Widget Plumx[/vc_column_text][/vc_column_inner][/vc_row_inner][/vc_column][/vc_row][vc_row][vc_column][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][/vc_column][/vc_row]