{"id":1579,"date":"2023-11-17T06:47:24","date_gmt":"2023-11-17T06:47:24","guid":{"rendered":"https:\/\/www.webharvy.com\/blog\/?p=1579"},"modified":"2023-11-17T06:47:25","modified_gmt":"2023-11-17T06:47:25","slug":"scrape-github-release-notes","status":"publish","type":"post","link":"https:\/\/www.webharvy.com\/blog\/scrape-github-release-notes\/","title":{"rendered":"Scrape GitHub Release Notes"},"content":{"rendered":"\n<p>This article demonstrates how <a href=\"https:\/\/www.webharvy.com\/index.html\">WebHarvy<\/a> can be used to scrape <a href=\"https:\/\/github.com\">GitHub<\/a> release notes. With WebHarvy, it is possible to efficiently scrape release details like version numbers and release notes from multiple pages.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"494\" src=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1024x494.png\" alt=\"\" class=\"wp-image-1580\" srcset=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1024x494.png 1024w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-300x145.png 300w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-768x370.png 768w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image.png 1064w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>WebHarvy is general-purpose web scraping software that can be used to scrape data from any website.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Steps to follow<\/h2>\n\n\n\n<p>The first step is to <a href=\"https:\/\/www.webharvy.com\/download.html\">download<\/a> and install WebHarvy on your computer, if you have not done so already. 
Then, in WebHarvy&#8217;s configuration browser, load the page from which you need to scrape data.<\/p>\n\n\n\n<p>Once the page has been loaded, click on the <strong>Start<\/strong> button to <a href=\"https:\/\/www.webharvy.com\/tour.html\">start configuration<\/a>. Once in configuration mode, you can click and select any data item (text or image) that you wish to scrape.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"692\" src=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1-1024x692.png\" alt=\"\" class=\"wp-image-1581\" srcset=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1-1024x692.png 1024w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1-300x203.png 300w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1-768x519.png 768w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-1.png 1423w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>Clicking on any data item on the page will bring up a <a href=\"https:\/\/www.webharvy.com\/tour1.html\">Capture window<\/a> with various options. Select the <strong>Capture Text<\/strong> option to capture the text of the clicked item. 
Details like version number and release note text can be selected for scraping in this manner.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"655\" src=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-2-1024x655.png\" alt=\"\" class=\"wp-image-1582\" srcset=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-2-1024x655.png 1024w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-2-300x192.png 300w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-2-768x491.png 768w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-2.png 1486w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>While selecting release notes, if the entire block of text is not selected, you can apply the <strong><a href=\"https:\/\/www.webharvy.com\/tour1.html#ScrapeMore\">Capture More Content<\/a><\/strong> option repeatedly until the desired portion is selected.<\/p>\n\n\n\n<p>To configure pagination, that is, to teach WebHarvy how to scrape data from multiple pages, scroll to the bottom of the page and click on the link that loads the next page (either the &#8216;next&#8217; link or the direct link to page 2). Then, from the resulting Capture window, select the <strong><a href=\"https:\/\/www.webharvy.com\/tour3.html\">Set as Next Page link<\/a><\/strong> option.<\/p>\n\n\n\n<p>Once all data has been selected, click <strong>Stop Configuration<\/strong> and then <strong><a href=\"https:\/\/www.webharvy.com\/tour5.html\">Start Mine<\/a>. 
<\/strong><\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"690\" src=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-3-1024x690.png\" alt=\"\" class=\"wp-image-1583\" srcset=\"https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-3-1024x690.png 1024w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-3-300x202.png 300w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-3-768x517.png 768w, https:\/\/www.webharvy.com\/blog\/wp-content\/uploads\/2023\/11\/image-3.png 1289w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Try WebHarvy<\/h2>\n\n\n\n<p>You may download and try the free 15-day evaluation version of WebHarvy by visiting the following link. If you have any questions, please feel free to reach out to our support team.<\/p>\n\n\n\n<p><a href=\"https:\/\/www.webharvy.com\/articles\/getting-started.html\">https:\/\/www.webharvy.com\/articles\/getting-started.html<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This article demonstrates how WebHarvy can be used to scrape GitHub release notes. With WebHarvy, it is possible to efficiently scrape release details like version numbers and release notes from multiple pages. WebHarvy is general-purpose web scraping software that can be used to scrape data from any website. 
Steps to follow The first step &#8230; <a title=\"Scrape GitHub Release Notes\" class=\"read-more\" href=\"https:\/\/www.webharvy.com\/blog\/scrape-github-release-notes\/\" aria-label=\"Read more about Scrape GitHub Release Notes\">Read more<\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-1579","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts\/1579","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/comments?post=1579"}],"version-history":[{"count":2,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts\/1579\/revisions"}],"predecessor-version":[{"id":1585,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/posts\/1579\/revisions\/1585"}],"wp:attachment":[{"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/media?parent=1579"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/categories?post=1579"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.webharvy.com\/blog\/wp-json\/wp\/v2\/tags?post=1579"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}