WebHarvy can scrape text as well as images from any website. Multiple images can be scraped for each product listed in eCommerce websites. Either images can be downloaded or their URLs can be captured.
Steps to follow to scrape product images from Amazon
- 1. Download and install WebHarvy in your computer
- 2. Open WebHarvy and load the product listings page
- 3. Start Configuration
- 4. Select the required data from the product listings page
- 5. Configure pagination
- 6. Follow link of the first product to load its details page
- 7. Click and select the required data from the details page
- 8. To scrape multiple product images, click on the first thumbnail image displayed besides the main product image
- 9. Capture HTML behind the image
- 10. Apply regular expression to select the high-resolution image URL from the captured HTML
- 11. Click on the Capture Image option in Capture window
- 12. Select Yes when WebHarvy asks whether to scrape multiple product images
- 13. Stop/Save Configuration
- 14. Start Mine
- 15. Export scraped data to a file or database
Video
The following video shows how WebHarvy can be used to easily extract multiple high resolution product images from Amazon product listings. Apart from images, product details like name, price, ASIN, rank, rating etc. can also be extracted.
The video shows the following:
- 1. How the default product image displayed can be scraped
- 2. How the 500x500 larger version of multiple product images can be extracted automatically
- 3. How the highest resolution 1000x1000 images can be scraped
WebHarvy can be configured either to download image files or to capture their URLs.
We recommend that you download and try the free evaluation version of WebHarvy.