Scraping Yellow Pages Australia - Name, Email, Phone, Website, Address

WebHarvy is a visual web scraping software which can be used to extract data from any website, including various flavors of Yellow Pages like yellowpages.com (US), yellowpages.com.au (Australia), yellowpages.ca (Canada), yell.com (UK), paginasamarillas.fr (France) etc.

How to scrape data from YellowPages.com.au using WebHarvy?

WebHarvy Web Scraping Software can be used to extract data from any website. WebHarvy can be downloaded and installed locally in your computer.

WebHarvy contains a built-in browser using which you can navigate to the web page from which you need to scrape data. For scraping Yellow Pages Australia, load the listings page from which you need to extract data and click on the Start configuration button.

Load yellow pages listing from which data needs to be scraped

You can now click and select any required data item from the page for extraction. To scrape the name of the listing, click on the name of the first non-ad (non sponsored) listing and from the resulting Capture window displayed, select the Capture Text option.

select data to scrape from yellow pages listing

You can select other details like phone number, website, address etc. from the listing page in this manner. But before we proceed to select other details, if we scroll down the page, we can see that beyond a limit, the listings are in collapsed mode. We need to expand them. For this, click on the More Info link and select the Click option from the Capture window displayed.

expand all listings

Once this is done, you can select the remaining data from the listings page, namely the phone number, website address, business address etc., by directly clicking over the text and by selecting the Capture Text option. WebHarvy will automatically parse and select all similar items from the page and display them in the Captured Data Preview pane.

preview of data selected from YP Australia website

Since the listings span across multiple pages, in order to teach WebHarvy how to load and scrape data from all subsequent pages, scroll down the page and click on the next page link and select the Set as next page link option from the Capture window.

select pagination link

To follow each listing link to load its details page and scrape further data, click on the title/link of the first listing and select the Follow this link option from the Capture window.

Once you have selected all required data, you can stop configuration by clicking on the Stop button in the top menu. You may now optionally save the configuration so that it can be run later. Saved configurations can also be edited in case you wish to add or remove data fields from the configuration. Click the Start Mine button to start mining data using the configuration. In the resulting Miner window displayed, click the Start button and WebHarvy will start to scrape data from the website for which we created the configuration.

Scraped data can be saved to a file or exported to a database. WebHarvy also allows you to schedule web scraping tasks so that they run without user intervention and fetch the latest data to a file or database.

Video Demonstration

The following video shows how WebHarvy can be used to scrape details like name, website, email, phone number and address from yellowpages.com.au listings.

Try WebHarvy

We recommend that you try the free evaluation version of WebHarvy available for download.

Getting started with web scraping using WebHarvy

Need Help?

If you have any questions or need assistance in setting up WebHarvy for Yellow Pages scraping, please do not hesitate to contact our technical support team by providing details regarding your requirement or problem faced.

Related:

  1. 1. Scraping Yellow Pages Listings Data (www.yellowpages.com)
  2. 2. Scraping business location (latitude/longitude) from Yellow Pages