Scraping data from HTML by applying Regular Expressions

WebHarvy can scrape data from the HTML source code of a selected area of a web page, or of the whole page, by applying Regular Expressions.
During configuration, after clicking on an item, the ‘Capture HTML’ option under ‘More Options’ in the Capture window captures the HTML of the item and displays it in the preview area. Regular Expressions can then be applied (More Options > Apply Regular Expression) to select data from a portion of the displayed HTML code.
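To get a feel for what a Regular Expression selects from captured HTML, the idea can be tried outside WebHarvy. The following is a minimal Python sketch, not WebHarvy's own implementation: the sample HTML, the pattern, and the names captured_html, HREF_PATTERN and extract_urls are all hypothetical, chosen only to illustrate pulling URLs out of href attributes.

```python
import re
from typing import List

# Hypothetical HTML, standing in for what might appear in the preview area
# after using 'Capture HTML' on an item.
captured_html = """
<div class="item">
  <a href="https://example.com/products/1">Product One</a>
  <a href="https://example.com/products/2">Product Two</a>
</div>
"""

# Assumed pattern: the parentheses form a capture group holding the
# value of each double-quoted href attribute.
HREF_PATTERN = re.compile(r'href="([^"]*)"')


def extract_urls(html: str) -> List[str]:
    """Return the text of capture group 1 for every match in the HTML."""
    return HREF_PATTERN.findall(html)


if __name__ == "__main__":
    for url in extract_urls(captured_html):
        print(url)
```

The pattern deliberately matches only double-quoted href values. HTML that uses single quotes or unquoted attributes would need a different expression, so it is worth checking the captured HTML in the preview area before settling on a pattern.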
The following video shows how this feature can be applied to scrape URLs from HTML.
https://www.youtube.com/watch?v=cEuGUpzJzkw
Download and try the 15-day evaluation version.
