WebHarvy 5.3 (Parallel mining, Chrome developer tools)

‘How to increase mining speed ?‘ was one of the most commonly asked questions by our users. With previous versions, the main limitation was that when links had to be followed from the starting page to get each listing details, the miner took more time to scrape a page full of listings. This is because …

WebHarvy 5.1 released (Includes direct Excel Export)

The following are the changes in 5.1.0.152 : New Features : Excel export – supports directly saving mined data as an Excel file (details) Handles page numbers in JavaScript code to load next page data (details) Updated Chromium engine from V54 to V62 Minor changes : Default values of ‘Enable Plugins’ and ‘Enable Browser Security’ …

WebHarvy 4.1.5.141 released

The main changes in this release are :- Pagination via JavaScript – see https://www.webharvy.com/tour3.html#JS This powerful feature is the main highlight of this release. When all other methods of pagination fails, this method, where you can directly provide a JavaScript code which when run would load the next page, can be used. Increased size of …

Windows Smartscreen warning while installing WebHarvy

All WebHarvy application files and installation package are digitally signed (Comodo RSA Code Signing CA) and secured. However in case you get the following Smartscreen warning while trying to install the latest version of WebHarvy, please click the ‘More info‘ link and then click the ‘Run anyway‘ button to proceed with the installation. The above …

WebHarvy 4.0.3.128 (Minor Update)

From this release on wards WebHarvy targets (depends on) .NET 4.5 which comes pre-installed on latest Windows editions. This results in smoother installation process, doing away with .NET 3.5 download and install which was previously required. Targeting .NET 4.5 also helps WebHarvy improve performance and resource usage, and to solve issues related to crashes while …

WebHarvy version 3.4 released !

We’ve just released a new WebHarvy update. The following are the changes in this version. Major: Support for pagination where a link/button has to be clicked to load the next set of pages.┬áMore Info URL based pagination – automatically increment a numeral in start page URL to load subsequent pages.┬áMore Info One-click multiple image extraction …

Use 'Capture Following Text' option to scrape data from details pages

While extracting data from details pages (page reached by navigating a link from the start page), it is recommended that the ‘Capture Following Text‘ option be used whenever possible to correctly and consistently scrape data. This is because the layout and the amount of data displayed in details pages may not be consistent. For example, …

WebHarvy Version 3.0 Released !

We are happy to announce the release of WebHarvy 3.0. We have added a lot of new features in this major update. The feature/changes list for this update is the longest among all product updates which we have done till date. Here we go. . Added the following options in the Capture Window (grouped under …

WebHarvy v2.0 Released !

The new features in the 2.0 update are : Built-in scheduler for running scraping tasks – (know more) Command Line Options – (know more) MySQL Support for exporting scraped data – (know more) Option to scrape sub text of selected text – (know more) Updated proxy settings – (know more) Supports proxies which require authentication …

How to scrape text following a heading using WebHarvy ?

In the latest update of WebHarvy, the Visual Web Scraping Software, the newly introduced ‘capture following text’ option allows you to capture text/block/paragraph following a heading within a webpage. Often with many websites the data to be scraped may not be located at the same position within all pages, but is guaranteed to be found …