Scraping data from paginasamarillas.es | extracción paginas amarillas

In this article we will see how WebHarvy can be used to extract data from Spanish Yellow Pages website – paginasamarillas.es

Paginas Amarillas Data Extraction

WebHarvy can extract data like business name, address, website, email and phone numbers from paginasamarillas.es listings. The following video shows how this can be done. Most of the details except email address (which is not directly displayed by the website) can be selected by directly by clicking on them during configuration. Email address can be selected from the HTML source of the business details page by applying regular expressions. The Regular Expression string used to extract email address is copied below.

customerMail[^;]*;[^;]*;([^\&]*)

Try WebHarvy

To know more we highly recommend that you download and try the free evaluation version of WebHarvy. To get started, please follow the link below.

Getting started with web scraping using WebHarvy

 

Leave a Comment