WebHarvy 6.3 – Custom Data Fields, Page Screenshot, Miner Settings in Configuration

WebHarvy 6.3 comes with support for custom data fields. The following custom data fields can be added to a configuration. Current page URL Screenshot of currently loaded page Date and Time of mining data User provided text Custom data fields can be added by clicking anywhere on the page during configuration and by selecting ‘Add …

WebHarvy 6.2 (Enhanced Proxy Support, Chromium v86, New Browser Setting options)

The following are the changes in this version. Enhanced proxy support In this version we have added support for various types of proxies. Earlier, WebHarvy supported only HTTP proxies. Starting from this version the following proxy types are supported. HTTP HTTPS SOCKS4 SOCKS4a SOCKS5 In the proxy settings window you can select the type of …

WebHarvy Settings Explained

WebHarvy Settings involves various options which you can set for Miner, Browser, Proxies, Category/Keyword and Images. We have created the following video which explains the various settings and how each of them can affect the mining performance/consistency and provide additional functionality. Please contact our support if you have any questions.

How to split address to street, city, state, zip while web scraping ?

During web data extraction, you might sometimes require to split textual data, or extract only a portion of the selected text. Following are 2 scenarios: Split the address string to street, city, state and zip Extract details like price, order number, phone, email etc. from a string In such cases, you can use the ‘Capture …

How to scrape data listed under multiple categories of websites ? | Whole eCommerce website extraction

The Multi Level Category Scraping feature of WebHarvy allows you to scrape product listings from an entire website, listed under various categories and sub-categories, using a simple and single configuration. The following video demonstrates the process. For more category scraping demonstration videos for various websites please refer the following link. WebHarvy Category Scraping Screen-casts for …

WebHarvy’s new user interface

We have significantly updated the user interface of WebHarvy in the latest version available in our website and the following video explains how the features and options are laid out in the new UI. Existing users of older versions will find this video useful so that they know where to look for specific features and …

WebHarvy 5.2 | UI revamp + Oracle db support

Changes in 5.2 are mainly related to user interface and experience.┬áThe most visible change is the introduction of the ribbon menu system for providing easy access to most software features. In addition to the main interface, other windows like Scheduler / Export etc. have also been updated. The export functionality (to file or database) has …

WebHarvy 5.2 | UI revamp + Oracle db support

Changes in 5.2 are mainly related to user interface and experience.┬áThe most visible change is the introduction of the ribbon menu system for providing easy access to most software features. In addition to the main interface, other windows like Scheduler / Export etc. have also been updated. The export functionality (to file or database) has …

WebHarvy 4.1.5.141 released

The main changes in this release are :- Pagination via JavaScript – see https://www.webharvy.com/tour3.html#JS This powerful feature is the main highlight of this release. When all other methods of pagination fails, this method, where you can directly provide a JavaScript code which when run would load the next page, can be used. Increased size of …

WebHarvy version 3.3 released !

3.3 version of WebHarvy was released on June 16, 2014. The major changes are : Fixed issues related to URL encoding in Category Scraping Added option to disable automatic pattern (data field repetition) detection in start page (more details) Option to follow links (URLs) obtained by applying Regular Expression on HTML – handles both absolute …