[vc_empty_space][vc_empty_space]
Internet Browsing History Data Analysis for Automatic Negative Content Website Identification (Case Study: TRUST+™ Positif)
Aristofany A.a, Putri Saptawati G.A.a, Asnar Y.a
a Institut Teknologi Bandung, School of Electrical Engineering and Informatics, Bandung, Indonesia
[vc_row][vc_column][vc_row_inner][vc_column_inner][vc_separator css=”.vc_custom_1624529070653{padding-top: 30px !important;padding-bottom: 30px !important;}”][/vc_column_inner][/vc_row_inner][vc_row_inner layout=”boxed”][vc_column_inner width=”3/4″ css=”.vc_custom_1624695412187{border-right-width: 1px !important;border-right-color: #dddddd !important;border-right-style: solid !important;border-radius: 1px !important;}”][vc_empty_space][megatron_heading title=”Abstract” size=”size-sm” text_align=”text-left”][vc_column_text]© 2018 IEEE.Negative content website is a website that contains one or more of these following elements: pornography, violence and coercion in children, incitement to anarchy, and gambling. Negative content website grows along with the development of the internet. The number of internet users who’s potentially exposed to negative content is increasing because of the cheaper cost of access to the internet and the increasing number of devices that support the use of the internet. Several programs have been taken by the authorities. Among them is the creation of TRUST + ™ Positive system that holds the huge list of negative website address. The ISP (Internet Service Provider) will blocks any negative content website referring to this system. The number of negative content website listed in the TRUST + ™ Positive list increases when there’s reports about new negative content website or after doing the back-crawling process. The problem we faced is that the addition of the number of negative content website listed on the TRUST + ™ Positive list depends heavily on external reports and the ability of the TRUST + ™ Positive back-crawling engine. Therefore, by using ISP’s Internet browsing history data we will performing data mining process to identify new negative content website. Data mining is done by using an association algorithm. Some internet user browsing history data setup techniques are used to find the best results according to internet browsing patterns that may arise. To reduce the number of identification errors we will filter any websites that are believed to be a website that has no negative content. The result obtained is that although the results of the association algorithm can be used for the identification of negative content website, but more than 75% of those results are not a negative content website or still need validation about its content.[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Author keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Association algorithms,Browsing history,Data mining process,Identification errors,Internet browsing,Negative content,Web usage,Website identifications[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Indexed keywords” size=”size-sm” text_align=”text-left”][vc_column_text]Association,Data mining,Negative content,Web usage[/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”Funding details” size=”size-sm” text_align=”text-left”][vc_column_text][/vc_column_text][vc_empty_space][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][vc_empty_space][megatron_heading title=”DOI” size=”size-sm” text_align=”text-left”][vc_column_text]https://doi.org/10.1109/ICODSE.2018.8705919[/vc_column_text][/vc_column_inner][vc_column_inner width=”1/4″][vc_column_text]Widget Plumx[/vc_column_text][/vc_column_inner][/vc_row_inner][/vc_column][/vc_row][vc_row][vc_column][vc_separator css=”.vc_custom_1624528584150{padding-top: 25px !important;padding-bottom: 25px !important;}”][/vc_column][/vc_row]