WebHarvy can scrape product data of multiple products listed across several pages and follow each product link to get detailed information including images and reviews.
How to scrape Amazon Product Data using WebHarvy?
WebHarvy is a generic & visual web scraper which can be configured to scrape data from any website. The following video shows how WebHarvy can be configured to scrape data from Amazon product listing pages.
The first step is to load the page which displays the data which you need to scrape within WebHarvy's configuration browser. Then, to select the data which you need to scrape, start configuration.
Selecting data to scrape
During configuration, you can click and select each data item which you need to scrape. Data items like product name, price, product URL etc. can be directly clicked and selected for scraping. Once you click any text or image displayed on the page, WebHarvy will display a Capture window with various options.
Product listings span across multiple pages. This is called pagination. Pagination can be configured by clicking on the link to load the next page of listings and by selecting the 'Set as Next Page link' option.
WebHarvy also allows you to follow each product link and scrape additional data. For this, use the Follow this link option in the Capture window after clicking on the link to follow.
In the product details page, the Capture following text option in Capture window is used to correctly select most of the details like price, ASIN, model number, BSR etc.
Once you have selected all required data to scrape from a page, stop configuration and start mining.
Scraped product data can be saved as a file or exported to a database. We recommend that you download and try the free evaluation version of WebHarvy.