Abstract
Data is a piece of information that exists in almost everywhere in the Internet and it is represented in either in the form of unstructured, semi-structured or structured data. In cases like these, when people want to copy certain data they do extraction from the web to the readable file. This system will be using scraping technique that can retrieve information from multiple sources and display it in the table form. The requirement for scraping the data from different sources can be done by using Python language and tools like BeautifulSoup or Pandas. BeautifulSoup can read and parse html and xml files by getting the certain data that needed. Meanwhile, Pandas purpose can clean and is a data analysis tools. The sources target limited to UiTM lecturer‟s publication which is Pure and Google Scholar. This web-based is important as it can reduce in extracting the data manually. There are many tools for collecting the data in the Internet; however, this research can help in the use of data collection method that can make the extraction easier.
Metadata
Item Type: | Thesis (Degree) |
---|---|
Creators: | Creators Email / ID Num. Muhamad Zaki, Siti Rasyiqah 2016726129 |
Contributors: | Contribution Name Email / ID Num. Thesis advisor Ismail, Azlan UNSPECIFIED |
Subjects: | T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunication > Computer networks. General works. Traffic monitoring |
Divisions: | Universiti Teknologi MARA, Shah Alam > Faculty of Computer and Mathematical Sciences |
Programme: | Bachelor of Computer Science (Hons.) Computer Science |
Keywords: | Reporting, web scraping, data |
Date: | 2019 |
URI: | https://ir.uitm.edu.my/id/eprint/110708 |
Download
![[thumbnail of 110708.pdf]](https://ir.uitm.edu.my/style/images/fileicons/text.png)
110708.pdf
Download (189kB)
Digital Copy

Physical Copy
ID Number
110708
Indexing


Statistic
