~ Email: support@sysnucleus.com Phone: 91.484.4015479 / 91.94950.45285 Skype: sysnucleus ~
Selecting Data to Scrape
Using WebHarvy, you can scrape text, URLs and images from web pages. As you move the mouse pointer over the page, the data items that can be captured are highlighted with a yellow background. Click any data element on the page that you intend to scrape, and WebHarvy will display a 'Capture' window. (Note: Even if an element is not highlighted when you hover the mouse pointer over it, you can still click the element to capture it.)

In the 'Capture' window, click the 'Capture this item's text' button.

You can then specify a name for the data item to be scraped as shown below.

Once you click 'OK', WebHarvy will automatically identify all similar data elements on the page and display a preview of the captured data in the 'Captured Data Preview' pane as shown below.

In a similar way, you can capture more data items from the page. You can also capture URLs and images in addition to text.
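
The idea of selecting one item and having all similar items picked up automatically can be illustrated outside WebHarvy with a short Python sketch. This is not WebHarvy's internal logic, which is not described here. It assumes, purely for illustration, that "similar" means "same tag name and same CSS class". The names SimilarTextCollector, capture_similar and SAMPLE_PAGE are hypothetical.

```python
from html.parser import HTMLParser


class SimilarTextCollector(HTMLParser):
    """Collect the text of every element matching a sample tag and class."""

    def __init__(self, tag, css_class):
        super().__init__()
        self.tag = tag
        self.css_class = css_class
        self.depth = 0      # greater than 0 while inside a matching element
        self.buffer = []    # text pieces of the element being read
        self.items = []     # one entry per matching element

    def handle_starttag(self, tag, attrs):
        if self.depth:
            # Track nested elements of the same tag so the end tags pair up.
            if tag == self.tag:
                self.depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        if tag == self.tag and self.css_class in classes:
            self.depth = 1
            self.buffer = []

    def handle_endtag(self, tag):
        if self.depth and tag == self.tag:
            self.depth -= 1
            if self.depth == 0:
                self.items.append("".join(self.buffer).strip())

    def handle_data(self, data):
        if self.depth:
            self.buffer.append(data)


def capture_similar(html, tag, css_class):
    """Return the text of all elements that look like the selected sample."""
    parser = SimilarTextCollector(tag, css_class)
    parser.feed(html)
    parser.close()
    return parser.items


# Hypothetical listing page: selecting one title should yield all titles.
SAMPLE_PAGE = """
<ul>
  <li><span class="title">Red Kettle</span> <span class="price">$25</span></li>
  <li><span class="title">Blue Toaster</span> <span class="price">$40</span></li>
  <li><span class="title">Green Blender</span> <span class="price">$55</span></li>
</ul>
"""

titles = capture_similar(SAMPLE_PAGE, "span", "title")
prices = capture_similar(SAMPLE_PAGE, "span", "price")
```

Here titles and prices play the role of two named data items, each holding one value per listing, much like two columns in the preview pane.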
Scrape text following a heading
On some websites, the data to be extracted may not be located at the same position on every page. Also, in some cases the text following a heading cannot be selected as a single item. In such situations, the 'Capture following text' option in the Capture window is helpful.
In the following example, to capture the text under the heading 'Technical Details', click the heading 'Technical Details' while in Config mode.

In the capture window, select the option 'Capture following text'.

Provide a suitable name for the field, and the text under the heading will appear in the Preview pane as shown below.
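
For readers who want to see the concept in code, the following Python sketch approximates "text following a heading": it locates a heading by its text rather than by its position, then collects everything up to the next heading. It is an illustration under stated assumptions, not a description of how WebHarvy implements the option. It assumes headings are marked up with h1 to h6 tags, and the names FollowingTextCollector, capture_following_text and SAMPLE_PAGE are hypothetical.

```python
from html.parser import HTMLParser

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class FollowingTextCollector(HTMLParser):
    """Collect the text between a named heading and the next heading."""

    def __init__(self, heading_text):
        super().__init__()
        self.heading_text = heading_text
        self.in_heading = False    # currently reading a heading's own text
        self.heading_buffer = []
        self.capturing = False     # currently inside the wanted section
        self.done = False          # wanted section already captured
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in HEADING_TAGS:
            if self.capturing:
                # The next heading marks the end of the wanted section.
                self.capturing = False
                self.done = True
            self.in_heading = True
            self.heading_buffer = []

    def handle_endtag(self, tag):
        if tag in HEADING_TAGS and self.in_heading:
            self.in_heading = False
            title = "".join(self.heading_buffer).strip()
            if not self.done and title == self.heading_text:
                self.capturing = True

    def handle_data(self, data):
        if self.in_heading:
            self.heading_buffer.append(data)
        elif self.capturing:
            self.chunks.append(data)


def capture_following_text(html, heading_text):
    """Return the whitespace-normalized text that follows the heading."""
    parser = FollowingTextCollector(heading_text)
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


# Hypothetical product page: the section may sit anywhere on the page.
SAMPLE_PAGE = """
<h2>Description</h2>
<p>A compact kettle.</p>
<h2>Technical Details</h2>
<p>Weight: 2 kg</p>
<p>Color: Red</p>
<h2>Reviews</h2>
<p>Great product.</p>
"""

details = capture_following_text(SAMPLE_PAGE, "Technical Details")
```

Because the lookup is keyed on the heading text, the same call still finds the section on a page where 'Technical Details' appears before 'Description' or after 'Reviews'.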

