WebHarvy 6.6 Release Update

The following are the changes in the latest version of WebHarvy

Edit Proxies

You can now edit details of saved proxy servers. In Settings > Proxy Settings, by double clicking on a saved proxy address (in Proxy List), you can edit its details (user name, password, address, port) and save changes. Earlier you needed to delete a proxy and add again with updated details to make changes.

Updated Browser

We have updated the internal browser used by WebHarvy to the latest possible version of Chromium.

User Agent Strings of Popular Browsers

In Settings > Browser window, a drop down option in the Custom User Agent String input box provides UA strings of popular browsers like Chrome, Edge, Firefox etc.

RegEx string for price selection

RegEx string to select prices in various currencies from text or HTML has been added in the ‘Apply Regular Expression‘ window. Other commonly used RegEx strings for images, email addresses etc. are already provided.

Run JavaScript on all pages without a placeholder column

To run JS code on all pages (before data selection), in older versions a placeholder primary column needed to be selected. From this version onwards it is no longer required. If you run script soon after starting configuration, it will be run on all pages (during pagination). If you need to run it only on the starting page, you should disable patterns before running the script (during configuration).

Duplicate removal feature ignores category/keyword tag

In older versions if 2 rows of data had identical values in all columns except category/keyword column (which is added by WebHarvy if Tag with Category / Keyword / URL option is enabled), they are not considered duplicates. Starting from this version, the value in category/keyword column is ignored while checking for and removing duplicates.

Screenshot images can be named based on any data column value

While saving screenshot images, you can either automatically name them or name them based on value in any other data column. In earlier version these Image Settings were not applicable for screenshot images.

Option to configure pagination when pagination links are not available on current page

While configuring Category scraping, sometimes the first category listings page will not have pagination links (due to only a single page of data), while subsequent category pages may have multiple pages of data. In such cases, pagination link can be selected using JavaScript as explained here.

Bug fixes

Bug fix in Category Scraping feature which prevents infinite looping on certain types of category pages.

Download latest version

The latest version may be downloaded and installed by following the link below.

https://www.webharvy.com/download.html

If you have any questions, please contact our technical support team.