support@webharvy.com | sales@webharvy.com | YouTube Channel | KB Articles

Articles Home

Product Help

YouTube Channel

WebHarvy Blog


Scraping Yellow Pages Australia - Name, Email, Phone, Website, Address


WebHarvy is a visual web scraping software which can be used to extract data from any website, including various flavors of Yellow Pages like yellowpages.com (US), yellowpages.com.au (Australia), yellowpages.ca (Canada), yell.com (UK), paginasamarillas.fr (France) etc.

Scraping YellowPages.com.au without missing any data

You can load any website within WebHarvy and select the data which you need to scrape using mouse clicks. But for scraping the Australian flavor of yellow pages (yellowpages.com.au), a special configuration method needs to be followed.

This is because the layout/design of each individual box of listings on yellowpages.com.au may vary. For example, some listings have a header with logo/image and business name, while some does not. For this reason, if you select data by directly clicking over its text during configuration, WebHarvy may not be able to fetch similar data from all listings which span across multiple pages.

The technique shown in the following video will help you scrape details like name, website, email, phone number and address from yellowpages.com.au listings without missing any data.



RegEx strings used


The Regular Expression strings used in the above video to correctly select phone, email and website are given below.

tel:([^"]*)
data-email="([^"]*)
href="(http[^"]*)

Try WebHarvy


We recommend that you try the free evaluation version of WebHarvy available for download.

Getting started with web scraping using WebHarvy

Need Help?


To get started, we highly recommend that you refer this link. For further assistance you may please contact our technical support by providing details regarding your requirement or problem faced.

Related:


How to build an Yellow Pages Australia Scraper?