Enter your keyword

2-s2.0-84961696261

[vc_empty_space][vc_empty_space]

Development of Indonesian-Japanese statistical machine translation using lemma translation and additional post-process

Sulaeman M.A.a, Purwarianti A.a

a School of Electrical Engineering and Informatics, Bandung Institute of Technology, Bandung, Indonesia

[vc_row][vc_column][vc_row_inner][vc_column_inner][vc_separator css=”.vc_custom_1624529070653{padding-top: 30px !important;padding-bottom: 30px !important;}”][/vc_column_inner][/vc_row_inner][vc_row_inner layout=”boxed”][vc_column_inner width=”3/4″ css=”.vc_custom_1624695412187{border-right-width: 1px !important;border-right-color: #dddddd !important;border-right-style: solid !important;border-radius: 1px !important;}”][vc_empty_space][megatron_heading title=”Abstract” size=”size-sm” text_align=”text-left”][vc_column_text]© 2015 IEEE.Despite the fact that study of statistical machine translation has been growing rapidly to date, there has not been much research done about Indonesian-Japanese statistical machine translation. The previous research about Indonesian-Japanese statistical machine translation has shown several problems in translation process, such as low coverage corpus data, unknown words, and sentence reordering problem. In this research, we propose two methods to address these problems. The proposed methods are lemma translation with generated surface form and additional post-process. Lemma translation uses lemma and POSTAG of word in its translation process. Rule based katakana translation and unknown word substitution are also used for additional post-process. Experimental data was collected from JLPT (Japanese Language Proficiency Test) Level 3 with total 1132 sentences. Experimental results using these methods showed an improvement over the baseline system with a 116% increased BLEU score on Japanese to Indonesian translation and 26% increased BLEU score on Indonesian to Japanese translation.[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Author keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Baseline systems,Bleu scores,Language proficiency,Post process,Post processing,Statistical machine translation,Surface forms,Translation process[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Indexed keywords” size=”size-sm” text_align=”text-left”][vc_column_text]BLEU score,lemma translation,post-processing,statistical machine translation[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Funding details” size=”size-sm” text_align=”text-left”][vc_column_text][/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”DOI” size=”size-sm” text_align=”text-left”][vc_column_text]https://doi.org/10.1109/ICEEI.2015.7352469[/vc_column_text][/vc_column_inner][vc_column_inner width=”1/4″][vc_column_text]Widget Plumx[/vc_column_text][/vc_column_inner][/vc_row_inner][/vc_column][/vc_row][vc_row][vc_column][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][/vc_column][/vc_row]