Web-based RIG performance reporting system using web scraping technique / Siti Rasyiqah Muhamad Zaki

Muhamad Zaki, Siti Rasyiqah (2019) Web-based RIG performance reporting system using web scraping technique / Siti Rasyiqah Muhamad Zaki. Degree thesis, Universiti Teknologi MARA (UiTM).

Abstract

Data is a piece of information that exists in almost everywhere in the Internet and it is represented in either in the form of unstructured, semi-structured or structured data. In cases like these, when people want to copy certain data they do extraction from the web to the readable file. This system will be using scraping technique that can retrieve information from multiple sources and display it in the table form. The requirement for scraping the data from different sources can be done by using Python language and tools like BeautifulSoup or Pandas. BeautifulSoup can read and parse html and xml files by getting the certain data that needed. Meanwhile, Pandas purpose can clean and is a data analysis tools. The sources target limited to UiTM lecturer‟s publication which is Pure and Google Scholar. This web-based is important as it can reduce in extracting the data manually. There are many tools for collecting the data in the Internet; however, this research can help in the use of data collection method that can make the extraction easier.

Metadata

Item Type: Thesis (Degree)
Creators:
Creators
Email / ID Num.
Muhamad Zaki, Siti Rasyiqah
2016726129
Contributors:
Contribution
Name
Email / ID Num.
Thesis advisor
Ismail, Azlan
UNSPECIFIED
Subjects: T Technology > TK Electrical engineering. Electronics. Nuclear engineering > Telecommunication > Computer networks. General works. Traffic monitoring
Divisions: Universiti Teknologi MARA, Shah Alam > Faculty of Computer and Mathematical Sciences
Programme: Bachelor of Computer Science (Hons.) Computer Science
Keywords: Reporting, web scraping, data
Date: 2019
URI: https://ir.uitm.edu.my/id/eprint/110708
Edit Item
Edit Item

Download

[thumbnail of 110708.pdf] Text
110708.pdf

Download (189kB)

Digital Copy

Digital (fulltext) is available at:

Physical Copy

Physical status and holdings:
Item Status:

ID Number

110708

Indexing

Statistic

Loading...

Statistic details