garyhaze/ImageScraper

 
 

ImageScraper

A simple Python script that downloads all images from a given webpage.

Download

### tar file

Grab the latest build from PyPI: https://pypi.python.org/pypi/ImageScraper

### pip install

You can also install it using pip:

$ pip install ImageScraper

Usage

image-scraper [-h] [--max-images MAX_IMAGES] url2scrape

### If you downloaded the tar

Extract the contents of the tar file. Note that ImageScraper depends on lxml and requests. If lxml fails to compile when installed through pip, install the libxml2-dev and libxslt-dev packages on your system.

$ cd ImageScraper/
$ python setup.py install
$ image-scraper [url to scrape]

### If installed using pip

Open a terminal and run:

$ image-scraper [url to scrape]

#### NOTE

A new folder called "images" will be created in the same place, containing all the downloaded images.
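To illustrate the general idea behind the steps above (find the image tags, resolve their URLs, save the files into an "images" folder), here is a minimal sketch. It is not ImageScraper's actual source: it uses only the Python standard library rather than lxml and requests, and the names `ImgSrcCollector`, `find_image_urls`, and `download_images` are hypothetical, chosen for this example only.

```python
import os
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class ImgSrcCollector(HTMLParser):
    """Collects the src attribute of every <img> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:  # skip <img> tags without a usable src
                self.sources.append(src)


def find_image_urls(html, base_url, max_images=None):
    """Return absolute image URLs found in html, capped at max_images."""
    collector = ImgSrcCollector()
    collector.feed(html)
    # Relative paths such as "/a.png" are resolved against the page URL.
    urls = [urljoin(base_url, src) for src in collector.sources]
    return urls if max_images is None else urls[:max_images]


def download_images(page_url, max_images=None, folder="images"):
    """Fetch page_url and save its images into folder (assumed layout)."""
    with urlopen(page_url) as response:
        html = response.read().decode("utf-8", errors="replace")
    os.makedirs(folder, exist_ok=True)
    for url in find_image_urls(html, page_url, max_images):
        # Fall back to a generic name when the URL path has no file name.
        name = os.path.basename(urlparse(url).path) or "image"
        with urlopen(url) as img, open(os.path.join(folder, name), "wb") as out:
            out.write(img.read())
```

The `max_images` parameter here mirrors the role of the `--max-images` option in the usage line; how the real tool names files or handles duplicates may differ.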

Upgrading

Check whether a newer version is available and upgrade using:

$ sudo pip install ImageScraper --upgrade

Issues

Q.) Not all images were downloaded?

The missing images were probably injected into the page via JavaScript. This scraper only reads the HTML returned by the server and does not run JavaScript.
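The limitation can be seen in a small, self-contained sketch. It assumes a made-up page (`PAGE`) and a hypothetical `ImgTagFinder` helper built on the standard library's `html.parser`; ImageScraper itself parses with lxml, but any parser that does not execute scripts behaves the same way here.

```python
from html.parser import HTMLParser


class ImgTagFinder(HTMLParser):
    """Records the src of each <img> tag present in the raw HTML."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            self.found.append(dict(attrs).get("src"))


# A made-up page: one image is in the markup, one is created by a script.
PAGE = """
<html><body>
  <img src="static.png">
  <script>
    var img = document.createElement("img");
    img.src = "dynamic.png";
    document.body.appendChild(img);
  </script>
</body></html>
"""


def static_image_sources(html):
    """Return the image sources visible without running any JavaScript."""
    finder = ImgTagFinder()
    finder.feed(html)
    return finder.found


print(static_image_sources(PAGE))  # ['static.png']
```

Only `static.png` is found, because `dynamic.png` exists only after a browser runs the script.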

Todo

Support scraping sites that inject image tags via JavaScript, using PhantomJS or Selenium.
