August 2013 - WebHarvy Blog

Use 'Capture Following Text' option to scrape data from details pages

August 30, 2013 by admin

While extracting data from details pages (pages reached by following a link from the start page), it is recommended that you use the ‘Capture Following Text’ option whenever possible to scrape data correctly and consistently. This is because the layout and the amount of data displayed in details pages may not be consistent. For example, … Read more

Scrape HTML

August 30, 2013 by admin

WebHarvy allows you to scrape the HTML of page contents in addition to plain text. In the Capture window, click the ‘More Options’ button and select the ‘Capture HTML’ option to scrape the HTML of the selected content. To capture only a portion of the displayed HTML, you may select and highlight the required portion before clicking … Read more

Scraping hidden (click to display) fields using WebHarvy

August 29, 2013 by admin

Certain web pages require you to click on a link or button for the data to be displayed. On many websites, email addresses or phone numbers are only partially displayed and are shown in full only when you click on them. The ‘Click’ option under the ‘More Options’ button in the Capture window lets … Read more

Scrape with Regular Expressions using WebHarvy

August 29, 2013 by admin

WebHarvy is designed as a ‘point and click’ visual web scraper. The design concentrates on ease of use, so that you can start scraping data within a few minutes of downloading the software. But in case you need more control over what is extracted, you can use Regular Expressions (RegEx) with WebHarvy. WebHarvy allows … Read more
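
As a general illustration of what a regular expression with a capturing group does, here is a minimal Python sketch. It is not WebHarvy's own syntax or API (the excerpt above does not say which RegEx flavor WebHarvy uses); the sample text, the labels, and the `capture_after` helper are all hypothetical and exist only for this example.

```python
import re

# Hypothetical details-page text; labels and values are made up for illustration.
page_text = "Name: Acme Tools\nPhone: 555-0142\nPrice: $1,299.00"


def capture_after(label, text):
    """Return the text that follows 'label:' up to the end of that line, or None."""
    # The parentheses form a capturing group: only the part of the match
    # inside them is returned. '.' does not match a newline by default,
    # so the capture stops at the end of the line.
    match = re.search(rf"{re.escape(label)}:\s*(.+)", text)
    return match.group(1).strip() if match else None


phone = capture_after("Phone", page_text)
price = capture_after("Price", page_text)
```

Anchoring the pattern on a label rather than on a position in the page keeps the capture stable when the surrounding layout changes, which is the kind of extra control a regular expression gives over a purely positional selection.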