YelpScraper

Scrape the 100 most popular restaurants in Charlottesville from Yelp and analyze the data

This project uses the Scrapy library to parse the 100 most popular restaturants from Yelp's Charlottesville page. Then, after storing the result as a json file, the data is visualized using the Matplotlib library.

#Installing

Install the requirements using pip:

pip install -r requirements.txt

#Running

To run the spider and generate the data as a json file:

scrapy crawl yelp -o filenamehere.json

The previous run of this spider generated a restaurants.json file (already included).

Lastly, to visualize the data and view the figures that it produces, cd into the visualizeyelp directory and run:

python visualize.py

The figures (already included) that this will generate compare the different categories of food and their pricyness.

DISCLAIMER: The spider works as of this commit but may stop functioning if Yelp's website layout changes drastically

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
scrapeyelp		scrapeyelp
visualizeyelp		visualizeyelp
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YelpScraper

About

Uh oh!

Releases

Packages

Languages

ag4ml/YelpScraper

Folders and files

Latest commit

History

Repository files navigation

YelpScraper

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages