Pyscrelp
Timeline: 2022
Role: Lead Developer
Advised by: Matthew Bui, Race and Data Justice Lab
TL;DR: Developed a Yelp review scraper and published the code on Test PyPI. Developed as part of Appetite for Data.
Skills: Python, Pandas, SQL, Data Visualizations (Seaborn), Web Scraping (Beautiful Soup)
Check out the code: GitHub
Code Publication
Moy, C. (2022). Pyscrelp. Test PyPI. test.pypi.org/project/pyscrelp/.
About
As public-access APIs become increasingly restrictive, we must turn to web scraping to collect information from digital platforms effectively. To understand the effect of Yelp's Black-owned attribute on Black-owned restaurants in Detroit and Los Angeles, I developed a Yelp review scraper, Pyscrelp. Pyscrelp is accessible on Test PyPI. Web scraping has its limitations as a data-collection method, however: Yelp rate-limits requests and periodically changes its page structure, which can break a scraper.
Using Pyscrelp, I collected over 300,000 Yelp reviews and published findings from these reviews at ASIS&T 2023 [Read More]. This dataset is available upon request.
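To illustrate the rate-limiting problem mentioned above, here is a minimal sketch of one common way a scraper can cope with it: retrying with exponential backoff. This is not Pyscrelp's actual code. The names `RateLimited` and `fetch_with_backoff` and the default delays are all hypothetical, chosen only for illustration, and the example uses the Python standard library only.

```python
import time
from typing import Callable, Optional


class RateLimited(Exception):
    """Hypothetical signal that the server refused a request for being too frequent."""


def fetch_with_backoff(
    fetch: Callable[[str], str],
    url: str,
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call fetch(url), doubling the wait after each rate-limited attempt.

    Returns the page text, or None if every attempt was rate-limited.
    """
    for attempt in range(max_attempts):
        try:
            return fetch(url)
        except RateLimited:
            if attempt == max_attempts - 1:
                break  # out of attempts; do not sleep again
            # Wait 1x, 2x, 4x, ... the base delay before retrying.
            sleep(base_delay * 2 ** attempt)
    return None
```

In practice, `fetch` would be a thin wrapper around an HTTP client that raises `RateLimited` when the server answers with a "too many requests" response. Passing `sleep` as a parameter keeps the helper easy to test without real waiting.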