{"id":932,"date":"2020-03-02T06:26:00","date_gmt":"2020-03-02T06:26:00","guid":{"rendered":"http:\/\/webharvy.com\/whblog\/?p=932"},"modified":"2020-03-02T06:26:00","modified_gmt":"2020-03-02T06:26:00","slug":"scraping-data-from-paginasamarillas-es-extraccion-paginas-amarillas","status":"publish","type":"post","link":"https:\/\/www.webharvy.com\/blog\/scraping-data-from-paginasamarillas-es-extraccion-paginas-amarillas\/","title":{"rendered":"Scraping data from paginasamarillas.es | extracci\u00f3n paginas amarillas"},"content":{"rendered":"<p>In this article we will see how <a href=\"https:\/\/www.webharvy.com\/\">WebHarvy<\/a> can be used to extract data from Spanish Yellow Pages website &#8211;\u00a0<a href=\"https:\/\/www.paginasamarillas.es\/\">paginasamarillas.es<\/a><\/p>\n<h3>Paginas Amarillas Data Extraction<\/h3>\n<p>WebHarvy can extract data like business name, address, website, email and phone numbers from <a href=\"https:\/\/www.paginasamarillas.es\/\">paginasamarillas.es<\/a> listings. The following video shows how this can be done. Most of the details except email address (which is not directly displayed by the website) can be selected by directly by clicking on them during configuration. Email address can be selected from the HTML source of the business details page by <a href=\"https:\/\/www.webharvy.com\/tour1.html#ScrapeByRegEx\">applying regular expressions<\/a>. The Regular Expression string used to extract email address is copied below.<\/p>\n<p>customerMail[^;]*;[^;]*;([^\\&amp;]*)<\/p>\n<p><iframe loading=\"lazy\" title=\"Scraping paginasamarillas.es | extracci\u00f3n paginas amarillas\" width=\"1200\" height=\"675\" src=\"https:\/\/www.youtube.com\/embed\/ADiq6yMDWjo?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture\" allowfullscreen><\/iframe><\/p>\n<h3>Try WebHarvy<\/h3>\n<p>To know more we highly recommend that you download and try the free evaluation version of WebHarvy. To get started, please follow the link below.<\/p>\n<p><a href=\"https:\/\/www.webharvy.com\/articles\/getting-started.html\">Getting started with web scraping using WebHarvy<\/a><\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In this article we will see how WebHarvy can be used to extract data from Spanish Yellow Pages website &#8211;\u00a0paginasamarillas.es Paginas Amarillas Data Extraction WebHarvy can extract data like business name, address, website, email and phone numbers from paginasamarillas.es listings. The following video shows how this can be done. Most of the details except email &#8230; <a title=\"Scraping data from paginasamarillas.es | extracci\u00f3n paginas amarillas\" class=\"read-more\" href=\"https:\/\/www.webharvy.com\/blog\/scraping-data-from-paginasamarillas-es-extraccion-paginas-amarillas\/\" aria-label=\"Read more about Scraping data from paginasamarillas.es | extracci\u00f3n paginas amarillas\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,8],"tags":[83,153],"class_list":["post-932","post","type-post","status-publish","format-standard","hentry","category-howto","category-webharvy","tag-paginasamarillas","tag-yellow-pages"],"_links":{"self":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts\/932","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/comments?post=932"}],"version-history":[{"count":0,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts\/932\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/media?parent=932"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/categories?post=932"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/tags?post=932"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}