EAI Endorsed Transactions on Pervasive Health and Technology
Research Article
Usage of Web Scraping in the Pharmaceutical Sector
Ruby Dahiya¹, Nidhi¹, Kajal Kumari¹, Shruti Kumari¹, Nidhi Agarwal¹,*
¹School of CSE, Galgotias University, Greater Noida, India
Abstract
INTRODUCTION: Web scraping is a technique that provides organizations with the ability to analyse large amounts of
information and gather new information.
OBJECTIVES: To identify groups of health-related information, such as health checks, full-body tests, and blood tests.
To this end, the pharmaceutical industry should consider how to improve the capture, storage, and retrieval of
information. For example, a healthcare system may decide to standardize the assessment of speech and allow information
to be shared across organizations, improving treatment outcomes in web scraping applications.
METHODS: Web scraping is applied to the pharmaceutical industry. From pharmacy websites, we collect information such as
drug names in different categories and drug sales. Our focus is on diseases and commonly used medicines: using this
information, we can find the most common viruses. Many factors must be considered when building a scraper for the
pharmaceutical industry, such as the drug names and the tablet and syrup categories found in the industry.
RESULTS: As the output shows, the extracted data has columns for drug names, manufacturers, drug types, and prices.
This information comes from Netmeds, an online pharmacy site. With it, we learn which drugs are most in demand, and from
that we can infer the most common diseases today.
CONCLUSION: The results of this web scraping can be very useful and powerful. However, the industry's success in web
scraping and data extraction techniques depends on the availability of clean chemical data.
Keywords: Web scraping, Beautiful Soup, drug, medicine
Received on 25 July 2023, accepted on 30 October 2023, published on 06 November 2023
Copyright © 2023 R. Dahiya et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA
4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the
original work is properly cited.
doi: 10.4108/eetpht.9.4312
*Corresponding author. Email: nidhi.agarwal@galgotiasuniversity.edu.in
1. Introduction
Web scraping is the process of extracting information from
websites. It can be illegal in some cases, so it should be used
only by someone who holds a license to download the
website's content. Scraping presidential websites is illegal,
and government websites in general cannot be scraped.
Several data extraction methods are available, including
HTTP programming, HTML parsing, and DOM parsing. We
use web scraping to extract large amounts of information
from websites into Excel files or databases.
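As a minimal sketch of the HTML parsing step, the example below pulls drug rows out of a small HTML fragment and writes them as CSV text that a spreadsheet could open. It uses Python's standard-library `html.parser` rather than Beautiful Soup so that it runs without extra installs. The markup, the class names (`drug`, `name`, `maker`, `kind`, `price`), and the sample values are all assumptions made for illustration. They are not taken from any real pharmacy site.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical markup: real pharmacy pages use their own tags and class names.
SAMPLE_HTML = """
<ul>
  <li class="drug"><span class="name">Drug A</span><span class="maker">Maker X</span>
      <span class="kind">Tablet</span><span class="price">10.50</span></li>
  <li class="drug"><span class="name">Drug B</span><span class="maker">Maker Y</span>
      <span class="kind">Syrup</span><span class="price">42.00</span></li>
</ul>
"""

# Column names assumed for this sketch; they double as the CSS classes above.
FIELDS = ("name", "maker", "kind", "price")


class DrugListParser(HTMLParser):
    """Collect one dict per <li class="drug"> element."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None    # dict being filled for the current <li>
        self._field = None  # field whose text is currently being read

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "li" and css_class == "drug":
            self._row = {}
        elif tag == "span" and self._row is not None and css_class in FIELDS:
            self._field = css_class

    def handle_data(self, data):
        # Only text inside a recognised <span> is kept.
        if self._field is not None:
            self._row[self._field] = self._row.get(self._field, "") + data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None
        elif tag == "li" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_drugs(html):
    parser = DrugListParser()
    parser.feed(html)
    parser.close()
    return parser.rows


def rows_to_csv(rows):
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = parse_drugs(SAMPLE_HTML)
csv_text = rows_to_csv(rows)
print(csv_text)
```

In practice the HTML would come from an HTTP request to a site that permits scraping, and the CSV text would be written to a file or loaded into a database.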
We all know that health is wealth: good health is essential to
life. The past year also showed that Covid-19 is not the only
disease of concern. India has also seen cases of scarlet fever,
tomato flu, black fungus, and measles in cattle, as well as
nodular skin disease. Web scraping can therefore help us
gauge which viruses are circulating. We extract information
from the websites of online pharmaceutical companies so
that we can understand the types of diseases present in our
country. Pharmaceutical companies store information about
drugs in medical records.
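One simple way to start summarising extracted pharmacy data is to tally how often each category appears. The sketch below is an illustration only. The rows are made-up placeholders, and the field names `name` and `kind` are assumptions. Linking drug categories to specific diseases would need a separate mapping that is not covered here.

```python
from collections import Counter

# Placeholder rows standing in for scraped records; not real data.
scraped_rows = [
    {"name": "Drug A", "kind": "Tablet"},
    {"name": "Drug B", "kind": "Syrup"},
    {"name": "Drug C", "kind": "Tablet"},
]


def count_by(rows, key):
    """Count rows per distinct value of `key`, skipping rows that lack it."""
    return Counter(row[key] for row in rows if row.get(key))


kind_counts = count_by(scraped_rows, "kind")
most_common_kind, occurrences = kind_counts.most_common(1)[0]
print(most_common_kind, occurrences)
```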
2. Related Work
In a 2017 article titled "Web Scraping", Bo Zhao [1] noted
that the Beautiful Soup technique for web scraping has
limits: if a web scraper sends too many requests, the effect
is equivalent to a denial-of-service attack. A 2012 article by
Internet advertising writers Anand V. Saurkar, Kedar G.
Pathare, and Shweta A. Gode, titled "Web Scraping Using
Collaborative-Based Networking", used HTML parsing
methods, but with important details [2].
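The concern that too many requests can amount to a denial of service is commonly addressed by spacing requests out. The sketch below shows one possible approach, a fixed minimum interval between consecutive requests. It is not a method described in the cited works. The `Throttle` class, the `fetch_all` helper, and the two-second interval are hypothetical names and values chosen for illustration, and `fetch` stands in for whatever function actually downloads a page.

```python
import time


class Throttle:
    """Enforce a minimum gap (in seconds) between consecutive requests."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock   # injectable so the class can be tested without waiting
        self._sleep = sleep
        self._last = None     # time of the previous request, if any

    def wait(self):
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def fetch_all(urls, fetch, throttle):
    """Call fetch(url) for each URL, pausing as the throttle requires."""
    results = []
    for url in urls:
        throttle.wait()
        results.append(fetch(url))
    return results


if __name__ == "__main__":
    # Demonstration with a stand-in fetch function; no network access is made.
    pages = fetch_all(["page-1", "page-2"], lambda u: f"<html>{u}</html>", Throttle(2.0))
    print(pages)
```

A fixed delay is the simplest policy. A real scraper would also respect the site's terms of use and any crawl rules it publishes.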
EAI Endorsed Transactions on
Pervasive Health and Technology
2023 | Volume 9