Generate Real Estate Leads using Web Scraping

Web Scraping is the automated process of extracting data from websites using software or an online service. This technique can be used to easily extract property owner or real estate agent contact details from websites like Zillow, Trulia, Realtor etc.

WebHarvy is a point and click, visual web scraper which can be used to extract data from websites.

Getting agent phone numbers

Most real estate websites allow you to search and view details of agents catering to a specific region. The following video shows how WebHarvy can be used to extract agent contact details like name, address, phone number etc from Zillow.

Getting owner/agent contact details from property listings

Owner or agent contact details can also be extracted from property listings as shown in the following videos.

Scraping agent phone numbers from property listings

Scraping owner phone numbers from property listings

Scraping leads from Realtor

The following video shows how agent contact details can be extracted from Realtor website.

We have an entire playlist of videos related to real estate data extraction which you may watch at this link. WebHarvy can be used to extract data automatically from any website.

Get Started

We recommend that you download and try using the free evaluation version of WebHarvy to know more. To get started, please follow the link below.

Getting started with web scraping using WebHarvy

How to scrape data from Bing Maps ?

WebHarvy is a generic visual web scraping software which can be easily configured to extract data from any website. In this article we will see how WebHarvy can be configured to extract data from Bing maps.

Details like business name, address, phone number, website address, rating etc. can be easily extracted from Bing maps listings using WebHarvy. Just like most map interfaces the details are opened in a popup over the map. The following video shows how WebHarvy can be configured to extract the required details.

As shown in the above video, the Open Popup feature of WebHarvy is used to open each listing details and scrape the data displayed. The Capture following text feature is used to correctly select details like address, website, phone number etc. It is recommended to use this method for data selection whenever the data is guaranteed to appear after a heading text.

Sometimes, Bing maps interface displays a ‘Website’ button, clicking which you can visit the website of the listed business. In such cases the website address as such will not be displayed in the listings popup.

1. To extract website address in such scenarios, during configuration, highlight and click the entire popup area as shown in the following image.

2. From the resulting Capture window displayed, select More Options > Apply Regular Expression. Paste and apply the following RegEx string to get the website address.

role=”button”\s*href=”(http[^”]*)

3. Click the main ‘Capture HTML‘ button to capture it.

Scraping data from Google Maps

WebHarvy also supports extracting data from Google Maps listings. We have several demonstration videos related to this, which you can watch by following the link below.

Google Maps Data Extraction using WebHarvy

Try WebHarvy

We highly recommend that you download and try using the free evaluation version of WebHarvy. To get started please follow the link given below.

Getting started with data extraction using WebHarvy

How to scrape TripAdvisor reviews and ratings ?

WebHarvy can be used to scrape data from TripAdvisor website. In this article we will be see how WebHarvy can be configured to scrape reviews and ratings from multiple listings at TripAdvisor website.

By default, TripAdvisor does not display the complete review text in its listings pages. You will have to click a ‘Read more’ link at the end of each partially displayed review, to view the complete review. This can be automated using WebHarvy as shown in the following video.

 

Regular expression strings are used to correctly select the date of review, and also the rating numerical value. The rating value is selected from the HTML source of the rating stars displayed by the website. The RegEx strings used are copied below.

wrote a review (.*)

rating bubble_([^”]*)

We have several videos in our YouTube channel related to TripAdvisor data extraction. You may watch them at the following link.

TripAdvisor Scraping Videos using WebHarvy

Try WebHarvy

We recommend that you download and try the free evaluation version of WebHarvy. To know more please follow the link below.

Getting started with data scraping using WebHarvy

How to scrape data from eBay product listings ? (price, images, specification, seller description etc.)

WebHarvy can be used to easily scrape product data from listings at eCommerce websites like Amazon, eBay etc. We have an entire playlist of demonstration videos related to eCommerce data extraction in our YouTube channel.

eBay Data Scraping

In this article we will see how WebHarvy can be used to extract product data from eBay listings. Details like product name, price, product URL, item specifications (condition, weight, UPC/MPN etc.), seller description etc. can be extracted. WebHarvy can also extract product images (thumbnail as well as high resolution images) from eBay product listings. The following video shows the steps involved.

The JavaScript code used in the above video to open seller description as a separate page is copied below.

location.href = document.getElementById(‘desc_ifr’).getAttribute(‘src’);

More videos related to eBay data extraction

Try WebHarvy

In case you are interested in exploring more, we highly recommend that you download and try using our free evaluation version. To get started, please follow the link given below.

Getting started with web data scraping using WebHarvy

 

Scraping data from paginasamarillas.es | extracción paginas amarillas

In this article we will see how WebHarvy can be used to extract data from Spanish Yellow Pages website – paginasamarillas.es

Paginas Amarillas Data Extraction

WebHarvy can extract data like business name, address, website, email and phone numbers from paginasamarillas.es listings. The following video shows how this can be done. Most of the details except email address (which is not directly displayed by the website) can be selected by directly by clicking on them during configuration. Email address can be selected from the HTML source of the business details page by applying regular expressions. The Regular Expression string used to extract email address is copied below.

customerMail[^;]*;[^;]*;([^\&]*)

Try WebHarvy

To know more we highly recommend that you download and try the free evaluation version of WebHarvy. To get started, please follow the link below.

Getting started with web scraping using WebHarvy

 

Scraping Yellow Pages Australia (yellowpages.com.au) – phone, email, website

WebHarvy is a visual web scraping software which can be easily configured to scrape data from any website. In this article we will see how WebHarvy can be configured to extract data from www.yellowpages.com.au listings.

Scraping yellowpages.com.au

A special technique is employed to extract data correctly and consistently from yellowpages.com.au listings. This is mainly because the layout of boxes of listings vary from one listing to another – some has header with their logo/image, some does not etc.

The regular expression strings used in the video to extract email, phone, website and address are given below.

https://gist.github.com/sysnucleus/436a2b0be80882f0ae61a391931abf5d

Know More

We highly recommend that you download and try using the free evaluation version of WebHarvy available in our website. To get started, please follow the link below.

Getting started with web scraping using WebHarvy

Scraping Zillow for real estate data and agent phone numbers

WebHarvy can be used to easily scrape data from real estate websites like Zillow, Realtor, Trulia, RedFin etc. In this article we will see how real estate data including agent/owner contact details (phone numbers) can be extracted using WebHarvy.

Scraping Real Estate Data from Zillow

The following video shows the steps involved. You can see that data like property address, price, zestimate, beds/baths, area, property facts and features (like type, year built, parking etc.), pricing history, tax history, neighborhood details etc. can be easily selected for extraction using a point and click interface. WebHarvy will automatically scrape the data which you select from multiple properties listed across multiple pages in Zillow.

Scraping agent phone numbers from Zillow

The following video shows how agent phone numbers can be scraped from Zillow property listings. The ‘contact agent’ button needs to be clicked in each property details page to get the agent contact details.

Try WebHarvy

We recommend that you download and try with the free evaluation version of WebHarvy available in our website and avail our free technical assistance for your first data scraping project. To get started please follow the link below.

Getting started with web scraping using WebHarvy

Scraping Flashscore – Statistics of all matches in a league

The following video shows how match statistics (possession, goal attempts, shots on goal, blocked shots, corners, off-sides  etc.) of all matches in a league from FlashScore website can be extracted using WebHarvy.

In addition to FlashScore, WebHarvy can also be used to extract sports betting odds from many other betting sites like BetExplorer, OddsPortal etc.

The regular expression string used in the video to get match ID is :

id=”([^”]*)

To form the URL, the following string is replaced :

g_1_

with

https://www.flashscore.com/match/

Know More

We recommend that you download and try using the free evaluation version of WebHarvy available in our website. To get started, please visit the link below.

Getting started with web scraping using WebHarvy

Scraping latitude, longitude from Yellow Pages listings – GPS Coordinates Extaction

Yellow Pages business listings often display the location (Map Direction) of the business. The location details are displayed on a map interface. But the latitude, longitude values (GPS coordinates) are not displayed on page. However, this information is present inside the HTML code behind the map interface.

Extracting latitude, longitude values

The Capture HTML feature along with Apply RegEx feature of WebHarvy can be used to extract the map coordinates from the HTML code of the page. The following video shows how this can be done. The Regular Expression strings used in the video are copied below.

data-lat=”([^”]*)

data-lng=”([^”]*)

Try WebHarvy

We recommend that you download and try using the free evaluation version of WebHarvy available in our website. To get started, please follow the link below.

Getting started with web scraping using WebHarvy