A Demystified Overview of Data Scraping

Authors

  • Shehu Mustapha Universiti Malaysia Terengganu
  • Mustafa Man Universiti Malasia Terengganu
  • Wan Aezwani Wan Abu Bakar Universiti Sultan Zainal Abidin
  • Mohd Kamir Yusof Universiti Sultan Zainal Abidin
  • Ily Amalina Ahmad Sabri Faculty of Ocean Engineering and Informatics, Universiti Malaysia Terengganu, 21030 Kuala Nerus, Terengganu, Malaysia

DOI:

https://doi.org/10.69511/ijdsaa.v6i6.205

Keywords:

Data Scraping, Web Scraping, Web crawling, Data mining, API Scraping

Abstract

Data scraping is a concept that involves the extraction of relevant data from a pool of information stored in a computer. Data scraping is universally known as web scraping because the web contains massive amount of information that is easily accessible and extracted. Web scraping is valuable to all field of human endeavour. The paper gives a vivid conceptualization of data scraping and some minor misconception between data mining, web crawling and web scraping. Furthermore, the phases and procedure of data scraping were outlined. The merit of web scraping over API scraping were explicated. Moreover, the numerous software and tools that support the scraping of websites were stated.  Even though web scraping has vast prominence, there are also some technical issues and challenges associated with it. Finally, some of the legal and ethical issues related to information extraction were discussed and it is obvious that data scraping is permitted as long as users comply to the terms and conditions of the target site.  

Downloads

Published

2024-06-07

How to Cite

Mustapha, S., Man, M., Wan Abu Bakar , W. A. ., Yusof , M. K. ., & Ahmad Sabri , I. A. . (2024). A Demystified Overview of Data Scraping. International Journal of Data Science and Advanced Analytics, 6(1), 290–296. https://doi.org/10.69511/ijdsaa.v6i6.205

Issue

Section

Articles