Interesting scrapes from scraperwiki

September 30, 2012 Apps Script & Java Script, ScraperWiki, VBA

In scraping the scraper I showed how to get scraperwiki data into both Excel and Google Apps Script. More interesting, though, is the data itself: others have already done the legwork of scraping unstructured web pages into tables and making the results publicly available. I thought I should publish an occasional directory of the interesting scrapes I come across.

Of course, since these have been created by various unknown contributors, they may be plain wrong, out of date or irrelevant, or may have disappeared. But for those of you who have not yet used scraperwiki, it's an interesting introduction to some of the data that is being extracted out there.

To take a look yourself, you can use the downloadable Excel or Google Apps Script version and download the data by giving its scraperwiki short name in the scraperWikiData tab. Here are a few from today's scrape of the most recent 1000 scrapes.

Scraper Wiki Short Name            Description

interpol_wanted_persons            List of names, descriptions, photos, links etc. of internationally wanted people
318_decc_speeches                  UK government speeches content and details
us-states_1                        List of US states, along with their short codes and numbers
relief_web_disasters_timeline      List of disasters along with dates and details
marriage_equality_scraper          List of UK members of parliament's opinions on marriage equality
paralympic_athletes_london_2012    List of paralympic athletes in London 2012
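
As a rough sketch of what "giving a short name" amounts to, the snippet below turns one of the short names above into a request URL. The endpoint, the `jsondict` format and the `swdata` table name are assumptions about the classic scraperwiki datastore API and are not taken from this post. `buildScraperWikiUrl` is a hypothetical helper, and the service may no longer respond.

```javascript
// Assumed endpoint of the classic scraperwiki datastore API (not stated in this post).
var SCRAPERWIKI_API = 'https://api.scraperwiki.com/api/1.0/datastore/sqlite';

// Hypothetical helper: build a query URL for a scraper short name.
// 'swdata' is assumed to be the scraper's table name; limit defaults to 10 rows.
function buildScraperWikiUrl(shortName, limit) {
  var query = 'select * from swdata limit ' + (limit || 10);
  return SCRAPERWIKI_API +
    '?format=jsondict' +
    '&name=' + encodeURIComponent(shortName) +
    '&query=' + encodeURIComponent(query);
}

// Example: URL for the first five rows of the first scrape in the list.
var exampleUrl = buildScraperWikiUrl('interpol_wanted_persons', 5);
```

In Apps Script the resulting URL would typically be passed to `UrlFetchApp.fetch` and the response parsed with `JSON.parse`. A scraper that stores its rows under a different table name would need a different query.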

Here's an extract from the first one. Thanks to S Woodard for publishing the scraper.

I'll be adding to this from time to time, and you can find the most recent interesting scrapes directory here. If you come across any interesting scrapes worth including, please let me know by commenting or contacting me on the excel liberation forum.

bruce mcpherson is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Based on a work at http://www.mcpher.com. Permissions beyond the scope of this license may be available at code use guidelines
