| | YouTube Channel | KB Articles

Articles Home

Product Help

YouTube Channel

WebHarvy Blog

Scraping Articles and Press Releases using WebHarvy

In this article we will see how WebHarvy can be easily configured to scrape articles, publications and press releases . Being a generic web scraping software, WebHarvy can be configured to extract data from any website as per your requirement.

WebHarvy can be used to scrape articles from article directories and press releases from PR websites.

How to easily scrape data from websites using WebHarvy ?

WebHarvy lets you scrape the content of the article as a file (text file) - see Scrape text as file for details. The Capture More Content option also comes in handy while scraping articles.

The following demo shows how WebHarvy can be used to scrape articles from Details like article title, author name, date, article body, keywords etc can be easily extracted using WebHarvy.

The following video shows how articles can be extracted (downloaded/saved) from CNN website using WebHarvy

WebHarvy can also extract the entire article content in HTML format, so that text formatting and embedded images are not lost. For this the Capture HTML feature should be used.

We recommend that you download and try the evaluation version and also view the video demonstrations.

Download the FREE evaluation version of WebHarvy

In case you need assistance in configuring WebHarvy, please do not hesitate to contact our support team ( with the details (URL of the webpage + details of the data to be scraped). We are happy to help you get started with your first data extracting project using WebHarvy !

Keywords : Article Scraper, Scrape Articles, PR Scraper, Scrape Press Releases