4iP Blog

Jamie Arnold's photo

4iP invests in ScraperWiki

image

4iP recently invested in ScraperWiki, a platform to scrape, store, aggregate, and distribute unstructured public data in more useful, structured formats.

OK, so I admit I usually lose most people at this point so I’ll explain why this investment is so exciting for me and for 4iP.

So, what is it?

At its heart ScraperWiki is a data store. The outputs of the scraper code are stored online and accessed via a .csv, GoogleDoc or an API.  The ambition is to fill ScraperWiki with thousands of public data sets that can be used by developers, researchers, journalists, and public bodies.  For an example and to find out what crimes Londoners worry about then read the ScraperWiki blog.

For the developer it is also a beautiful browser-based coding environment.  It allows developers (“swikis”?) to extract information from websites, pdf and Word documents from the Internet and transform them into structured data. 

image

The ScraperWiki team are working hard to make this experience simple to use and familiar to developers.  Key to success is making this right for developers so the platform provides version-control, a console, shortcut keys, scheduling and access to ScraperWiki code libraries that make the job of web scraping much easier. If you’d like to get involved then .(JavaScript must be enabled to view this email address) for an alpha invite.

ScraperWiki geo-location libraries will allow a developer to scrape data and a location (a postcode, long/lat) and overlay boundary information (a constituency, a local police ward).  I am really excited about the possibilities of this.  It should produce some interesting and valuable data-sets.

Why 4iP?

I declare a personal weakness for all things open-data-ery and I once studied politics.  That declared I can tell you ScraperWiki is an important element within the 4iP eco-system of investments.  Everyone needs a bit of data in their lives and 4iP is no different. 

The ScraperWiki team have a reputation for civic hacking, transparency and holding power to account.  They are some of the people behind Public Whip (see how your MP votes), Planning Alerts (email alerts of planning applications near you), UNDemocracy (transcripts of UN meetings), FarmSubsidy.org (who gets what from the Common Agricultural Policy) and Rewired State hack days.  They are a great team and they have all been a source of inspiration.

ScraperWiki have embraced the sustainability requirements of the fund with gusto and I see success ahead.  A platform for scraping public data (i.e. it’s not the Scrapers’ data, but the Crown’s) was not an immediately obvious revenue generator but we worked together and figured it.  We discussed a volunteer sustained platform, but the ScraperWiki team are now out and about talking to interesting people about transactional and service based revenues.

Public launch is set for the end of February but you can follow ScraperWiki on Twitter to keep tabs on progress or .(JavaScript must be enabled to view this email address) for an alpha invite.

Judith on Wed, January 13, 2010 at 9:51 said:

ScraperWiki is a web application that will allow users to scrape data, primarily from council websites, to store in a database for the creation of an API.ScraperWiki is a web platform for collecting and publishing public data.
Grand Palladium Jamaica

search engine on Thu, January 14, 2010 at 12:50 said:

Ive yet to Be fully convinced of some of the Pratical applications for this ScraperWiki.

Aidan McGuire on Thu, January 14, 2010 at 10:14 said:

Which of the “Pratical applications” are you referring to?

Aidan
Scraperwiki

newsletter printing services on Fri, January 15, 2010 at 7:44 said:

There’s lots of useful data locked away on the internet. ScraperWiki helps open it up.

San Diego tattoo on Mon, February 15, 2010 at 9:59 said:

It’s really good to invest in this product… Thanks for posting.

Photography Studio Equipment on Tue, February 16, 2010 at 3:57 said:

I’m still not so sure how this one works, but the idea is really interesting. I’m really hoping for its success.

Commercial Photographer Kent on Thu, February 18, 2010 at 5:57 said:

I am pretty sure it will be a success!

Alexander Strong on Fri, February 19, 2010 at 8:47 said:

The article “4 IP invests in ScraperWiki” concerns ScraperWiki’s ambitions and possibilities on scraping, storing and distributing unstructured public data in useful formats so that the developers and other participants can obtain and perceive them. In general I should admit that the mentioned above processes with data are very important, up-to-date and necessary, as they help people know, estimate and react adequately in appropriate situations. It’s a long sustained work; much help is needed especially now when the ScraperWiki team are out, looking ahead for new people and new ideas. No matter that new volunteers are welcome to follow Scraper Wiki on Twitter, the main thing here is not to lose a good idea.

Above All on Sun, February 21, 2010 at 12:03 said:

Cool….Good luck on your investement.

El Cajon on Sun, February 21, 2010 at 12:07 said:

Hows the launch coming? Im watching this close.

MB on Tue, February 23, 2010 at 1:55 said:

Thanks for explaining it, it totally makes sense now. i think youll do well..

East County on Wed, March 03, 2010 at 1:25 said:

4ip seems like a pretty cool deal.

Body Lotion on Fri, March 05, 2010 at 8:10 said:

Hope it was a good investment.

Add your comment



(will not be published)



Terms and conditions apply to comments submitted to this site

Remember my personal information
Notify me of follow-up comments?