In scraping the scraper I showed how to get scraperwiki data into both Excel and Google Apps Script. More interestingly though, I was talking about data for which others had already done the legwork to create tables of data they had scraped from unstructured web pages and made publicly available. I thought maybe I should publish an occasional directory of interesting scrapes I came across.
Of course since these have been created by various unknown contributors, they may be plain wrong, out of date, disappeared or irrelevant, but for those of you have not yet used scraperwiki , it’s an interesting introduction to some of the data that is being extracted out there.
To take a look yourself you can of course use the downloadable excel and Google Apps Script version and download the data by giving its scraperwiki shortname in the scraperWikiData tab. Here’s a few from today’s scrape of the most recent 1000 scrapes.
|Scraper Wiki Short Name||Description|
|List of names, descriptions, photos, links etc of internationally wanted people|
|318_decc_speeches||uk government speeches content and details|
|us-states_1||list of US states, along with their short codes and numbers|
|relief_web_disasters_timeline||list of disasters along with dates and details|
|marriage_equality_scraper||list of UK members of parliament opinions on marriage equality|
|paralympic_athletes_london_2012||list of paralympic atheletes in London 2012|
Here’s and extract from the first one. Thanks to S Woodard for publishing the scraper.
I’ll be adding to this from time to time, and you can find the most recent interesting scrapes directory here. If you come across any interesting includeworthy scrapes then please let me know by commenting or contacting me on the excel liberation forum.