WebHarvy is a visual web scraping software which can be easily configured to scrape data from any website. In this article we will see how WebHarvy can be configured to extract data from www.yellowpages.com.au listings.
A special technique is employed to extract data correctly and consistently from yellowpages.com.au listings. This is mainly because the layout of boxes of listings vary from one listing to another – some has header with their logo/image, some does not etc.
The regular expression strings used in the video to extract email, phone, website and address are given below.
We highly recommend that you download and try using the free evaluation version of WebHarvy available in our website. To get started, please follow the link below.