[vc_empty_space][vc_empty_space]
Event-Oriented Map Extraction From Web News Portal : Binary Map Case Study on Diphteria Outbreak and Flood in Jakarta
Dewandaru A.a, Supriana S.I.a, Akbar S.a
a School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Indonesia
[vc_row][vc_column][vc_row_inner][vc_column_inner][vc_separator css=”.vc_custom_1624529070653{padding-top: 30px !important;padding-bottom: 30px !important;}”][/vc_column_inner][/vc_row_inner][vc_row_inner layout=”boxed”][vc_column_inner width=”3/4″ css=”.vc_custom_1624695412187{border-right-width: 1px !important;border-right-color: #dddddd !important;border-right-style: solid !important;border-radius: 1px !important;}”][vc_empty_space][megatron_heading title=”Abstract” size=”size-sm” text_align=”text-left”][vc_column_text]© 2018 IEEE.The abundance of online news texts which contain embedded geographical name references from the internet provide motivation to produce higher level analysis in the form of thematic maps. This can be done by a performing automated geospatial information extraction and retrieval from relevant event-oriented corpora which mainly existed in natural language form. However, unified methods and framework available to address this transformation is still lacking. We propose the incorporation of unsupervised topic modeling and word embedding to help accomplishing the task of aggregating georeferenced data. The topic modeling tool would help suggesting the positive keywords and negative keywords for particular topic while the word embedding helped improve the recall score by extending the semanticaly similar keywords. The method was tested on Indonesian news corpus and achieved comparable result on two offical binary thematic maps case studies based on flood event in Jakarta and diphteria disease in Indonesia.[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Author keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Binary Choropleth,Event extraction,Geographical information retrievals,News,Thematic maps,Word2Vec[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Indexed keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Binary Choropleth,Event Extraction,Geographical Information Retrieval,Information Extraction,LDA,News,Thematic Map Retrieval,Word2Vec[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Funding details” size=”size-sm” text_align=”text-left”][vc_column_text][/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”DOI” size=”size-sm” text_align=”text-left”][vc_column_text]https://doi.org/10.1109/ICAICTA.2018.8541345[/vc_column_text][/vc_column_inner][vc_column_inner width=”1/4″][vc_column_text]Widget Plumx[/vc_column_text][/vc_column_inner][/vc_row_inner][/vc_column][/vc_row][vc_row][vc_column][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][/vc_column][/vc_row]