2-s2.0-85065925337

Domain Specific Language for Web Scraper Development

Arifanto R., Asnar Y.D.W., Liem M.M.I.

Bandung Institute of Technology, School of Electrical Engineering and Informatics, Indonesia

Abstract

© 2018 IEEE. Web scraping is carried out by an application called a scraper. Various tools have been built to facilitate scraper development, but these tools can only be used by programmers, whereas the users who need web scraping include non-programmers as well. In this research, a non-programmer is someone who can operate a computer through the shell but has little programming experience. A suitable approach is needed for non-programmers to construct scrapers. This research therefore builds scrapingDSL, a domain specific language that non-programmers can use to build scrapers. Scrapers built with scrapingDSL must be able to scrape all types of websites with different structures, and should be easy to understand, write, and modify. To build a scrapingDSL that meets those needs, the researcher analyzed the tools used in web scraping to determine the right tool for the scrapingDSL engine, analyzed website structure to determine the requirements for scrapingDSL and the translator application, and analyzed the website content retrieval flow to determine the required scrapingDSL language elements. At the end of the study, a test was conducted to determine whether scrapingDSL could be used by non-programmers to build scrapers. The test asked 12 non-programmer respondents to build a scraper using scrapingDSL.

Author keywords

Different structure, Domain specific languages, Scraper, Web scrapings, Web site contents

Indexed keywords

Domain specific languages, Scraper, Web scraping

DOI

https://doi.org/10.1109/ICODSE.2018.8705842