EAI Endorsed Transactions on Pervasive Health and Technology
Research Article

Usage of Web Scraping in the Pharmaceutical Sector

Ruby Dahiya¹, Nidhi¹, Kajal Kumari¹, Shruti Kumari¹, Nidhi Agarwal¹,*

¹ School of CSE, Galgotias University, Greater Noida, India

Abstract

INTRODUCTION: Web scraping is a technique that enables organizations to collect and analyse large amounts of information and to derive new information from it.

OBJECTIVES: To identify groups of health-related offerings, such as health checks, full-body tests, and blood tests. To this end, the pharmaceutical industry should consider how to improve information capture, storage, and retrieval. For example, a healthcare system may decide to standardize the assessment of speech and allow information to be shared across organizations, so that web scraping applications can help improve treatment outcomes.

METHODS: The web scraping in this work targets the pharmaceutical industry. From pharmacy websites we obtain information such as drug names in different categories and drug sales. Our focus, however, is on diseases and commonly used medicines: using this information, we can find the most common viruses. Many factors must be considered when building a web scraper for the pharmaceutical industry, such as the drug names, tablet categories, and syrups found in the industry.

RESULTS: As is clearly visible from the output, there are columns for drug names, manufacturers, drug types, and prices. This information comes from Netmeds, a pharmacy website. With its help, we learn which drugs are most in demand, and from that we can infer the most common diseases today.

CONCLUSION: The results of this web scraping can be very useful and powerful. However, the industry's success with web scraping and data extraction techniques depends on the availability of clean chemical data.
Keywords: Web scraping, Beautiful Soup, drug, medicine

Received on 25 July 2023, accepted on 30 October 2023, published on 06 November 2023

Copyright © 2023 R. Dahiya et al., licensed to EAI. This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.

doi: 10.4108/eetpht.9.4312

* Corresponding author. Email: nidhi.agarwal@galgotiasuniversity.edu.in

1. Introduction

Web scraping is a process for obtaining information from websites on the World Wide Web (WWW). Although the method is widely used to collect information from websites, in some cases it is illegal. Therefore, only someone who holds a license should use the process to download a website's content. Scraping presidential websites is illegal, and government websites cannot be scraped. Several data extraction methods are available, such as format, HTTP programming, HTML parsing, and DOM parsing. We use web scraping to extract large amounts of information from websites into Excel files or databases.

We all know that health is wealth, which means that health is very important to life. Last year also proved that Covid-19 is not the only disease of concern. India has also seen cases of scarlet fever, tomato flu, black fungus, and measles in cattle, as well as nodular skin disease. Web scraping can therefore help us gauge how widespread such viruses are. We extract information from the websites of online pharmaceutical companies so that we can understand the types of diseases present in our country. Pharmaceutical companies store information about drugs in medical records.

2. Related Work

In a 2017 article titled "Web Scraping", Bo Zhao [1] noted that the Beautiful Soup technique for web scraping has limitations: if a web scraper sends too many requests, the effect is equivalent to a denial-of-service attack.
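The concern about request volume raised in [1] can be illustrated with a small throttling sketch. It is a minimal example under stated assumptions, not the method used in this study: the class name `Throttle`, the choice of a fixed minimum interval, and the injectable `clock` and `sleep` functions are all hypothetical and introduced here only for illustration.

```python
import time


class Throttle:
    """Hypothetical helper: enforce a minimum delay between requests."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval  # seconds between consecutive requests
        self._clock = clock               # injectable so the logic can be tested
        self._sleep = sleep
        self._last = None                 # time of the previous request, if any

    def wait(self):
        """Pause if the previous call was too recent; return the seconds slept."""
        delay = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            delay = max(0.0, self.min_interval - elapsed)
            if delay > 0:
                self._sleep(delay)
        self._last = self._clock()
        return delay
```

A scraper would call `wait()` immediately before each HTTP request, so that requests to the same site are spaced at least `min_interval` seconds apart regardless of how quickly pages are processed.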
A 2012 article by Internet advertising writers Anand V. Saurkar, Kedar G. Pathare, and Shweta A. Gode, titled "Web Scraping Using Collaborative-Based Networking", used HTML parsing methods, though with important details [2].

EAI Endorsed Transactions on Pervasive Health and Technology 2023 | Volume 9
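To make the idea of HTML parsing concrete, the sketch below extracts the four columns named in the abstract (drug name, manufacturer, drug type, price) from a fragment of markup. It uses Python's standard-library `html.parser` rather than the Beautiful Soup library listed in the keywords, purely so that the example has no third-party dependencies. The sample markup, the CSS class names, the drug entries, and the names `DrugListParser` and `parse_drugs` are all invented for illustration and do not reflect the actual structure or content of any pharmacy website.

```python
from html.parser import HTMLParser

# Hypothetical class names; a real site would use its own markup.
FIELDS = ("drug-name", "manufacturer", "drug-type", "price")

# Invented sample data, used only to exercise the parser.
SAMPLE_HTML = """
<div class="drug">
  <span class="drug-name">Example Tablet 500mg</span>
  <span class="manufacturer">Example Pharma Ltd</span>
  <span class="drug-type">Tablet</span>
  <span class="price">25.50</span>
</div>
<div class="drug">
  <span class="drug-name">Sample Cough Syrup</span>
  <span class="manufacturer">Sample Labs</span>
  <span class="drug-type">Syrup</span>
  <span class="price">80.00</span>
</div>
"""


class DrugListParser(HTMLParser):
    """Collect one dict per <div class="drug"> block."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None    # dict being filled for the current drug block
        self._field = None  # field whose text is currently being read

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if tag == "div" and cls == "drug":
            self._row = {}
        elif self._row is not None and tag == "span" and cls in FIELDS:
            self._field = cls

    def handle_data(self, data):
        if self._row is not None and self._field is not None:
            self._row[self._field] = self._row.get(self._field, "") + data

    def handle_endtag(self, tag):
        if tag == "span" and self._field is not None:
            # Field finished: trim surrounding whitespace.
            self._row[self._field] = self._row.get(self._field, "").strip()
            self._field = None
        elif tag == "div" and self._row is not None:
            self.rows.append(self._row)
            self._row = None


def parse_drugs(html):
    """Return a list of dicts, one per drug block found in the HTML."""
    parser = DrugListParser()
    parser.feed(html)
    parser.close()
    return parser.rows
```

Calling `parse_drugs(SAMPLE_HTML)` yields one dictionary per drug block, which could then be written to an Excel file or a database as described in the Introduction.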