[vc_empty_space][vc_empty_space]
Information extraction of traffic condition from social media using bidirectional LSTM-CNN
Riza Alifi M.a, Supangkat S.H.a
a School of Electrical Engineering and Informatics, Institut Teknologi Bandung, Bandung, Indonesia
[vc_row][vc_column][vc_row_inner][vc_column_inner][vc_separator css=”.vc_custom_1624529070653{padding-top: 30px !important;padding-bottom: 30px !important;}”][/vc_column_inner][/vc_row_inner][vc_row_inner layout=”boxed”][vc_column_inner width=”3/4″ css=”.vc_custom_1624695412187{border-right-width: 1px !important;border-right-color: #dddddd !important;border-right-style: solid !important;border-radius: 1px !important;}”][vc_empty_space][megatron_heading title=”Abstract” size=”size-sm” text_align=”text-left”][vc_column_text]© 2018 IEEE.Twitter is social media that has become a source of information for real-time traffic condition in Indonesia. Generally, information contained on Twitter is used for shortterm needs, such as to find out the point of congestion at that time. Whereas, the information can be collected and processed for long-term needs, such as congestion-prone points mapping at certain times. This processed information is very useful for city stakeholders. Information extraction is needed to process information in the form of text from social media that is originally unstructured to be structured. NER technique can be applied to obtain entities that represent traffic condition. This study classifies entities into 11 classes with the BIO encoding scheme, namely B_TIME, I_TIME, B_LOCT, I_LOCT, B_COND, I_COND, B_CAUS, I_CAUS, B_WEAT, I_WEAT, B_MISC, I_MISC, and O. The defined classes represent entities of time, location, condition, cause, weather, miscellaneous, and others. This study proposes a solution in the form of model architecture design using deep learning approach. Bidirectional LSTM approach is used to handle word level. While the CNN approach is used to handle character level. The combination of these two deep learning methods accompanied by word embedding produces the value of F-measure 0.789.[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Author keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Convolutional neural network,Model architecture,Named entity recognition,Process information,Processed information,Real-time traffic conditions,Social media,Traffic conditions[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Indexed keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Bidirectional long short-term memory,Convolutional neural network,Information extraction,Named entity recognition,Social media,Traffic condition[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Funding details” size=”size-sm” text_align=”text-left”][vc_column_text][/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”DOI” size=”size-sm” text_align=”text-left”][vc_column_text]https://doi.org/10.1109/ISRITI.2018.8864265[/vc_column_text][/vc_column_inner][vc_column_inner width=”1/4″][vc_column_text]Widget Plumx[/vc_column_text][/vc_column_inner][/vc_row_inner][/vc_column][/vc_row][vc_row][vc_column][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][/vc_column][/vc_row]