The Spatial Miscellany

Avatar

A weblog. A website. A geospatial miscellany…

Where [2.0]…are the KML files?

Last month saw the annual event that is [Where 2.0], a conference hosted by O’Reilly in San Francisco. Over the past couple of years it’s been somewhat of a landmark event in the geospatial calendar, often playing host to a raft of new and exciting location aware technology…Woodstock for Neogeography?

I’ve never managed to attend the conference, but always made a conscious effort to follow online; this year it felt different, no headline announcements or cool new products, and very little chatter.

One announcement that did grab my attention was made by John Hanke from Google during the ESRI\Google keynote (you can watch the keynote here). Hanke remarks that Google have provided access, via their search API’s, to the Google ‘GeoIndex’ – an index of results returned by Google robots as they crawl the web for geospatial content. On the surface, this is significant news, but it seems to have received only little comment.

The GeoIndex Footprint

Hanke suggests the Google search API’s will provide access to the “content, attribution, linkbacks and the urls” that make the geoweb…but where are the underlying resources, where are the KML files?

Barry Hunter has knocked up a useful php script that queries the GeoIndex and nicely formats the JSON response. Try it out and you’ll notice you get a handful of Google Maps url’s, but no links to the underlying resources? Hopefully, there is more to come; providing access to the GeoIndex, and the underlying content files, really would move the ‘geoweb’ forward.

OpenSocial, location aware?

Last week Google announced an umbrella API for social networks, OpenSocial. On the back of their announcement, Google have released a preview of their API. Thumbing through the API preview I expected to see some elementary support for location, perhaps a “city” or “hometown” tag that developers could geocode, so I was pleasantly surprised to find what appears to be another implementation of GeoRSS

OpenSocial GeoRSS Support

Is the writing on the wall for the OS Grid Reference?

I recently shared a few beers with an old university lecturer. We noted the ever improving access to the excellent Ordnance Survey mapping data but tongue in cheek he pondered…

“Is the writing on the wall for the OS grid reference?”

Here is a guest post from University of Edinburgh GIS lecturer, Bruce Gittings…



I see multimap have become the first to offer OS data on a slippy map
interface to a web service.

The writing is certainly on the wall for anyone contemplating a system which isn’t slippy - - that’s now the benchmark, anything else is dead technology. We’ve known space is continuous for a few years now and the average GIS toolkit gave up on ‘coverages’ and ’tiles’ a while back.

Multimap OS data

At the same time multimap also seem to have moved to using latitude and longitude, instead of eastings / northings on the national grid, probably as a function of the software they are using. The map is actually skewed to the window - most unusual. So, as more and more applications want WGS84 / GPS compatibility will the OS grid reference disappear? For decades this has been the de facto (indeed virtually the only) referencing system use for locating places on maps in the UK (excepting one-off grid systems used on town plans for example), despite a lat-long grid appearing subtly on the edge of OS sheets. Yet, the dash to lat-long driven my pan-national web services seems to be quickly moving the amateur place-referencer quickly towards lat-long too, if the dreaded wikipedia is anything to go by (wash my mouth out with soap for mentioning that word!). I wonder if they realise the implications?

The Wikipedia Flaw?

There’s a lot more misunderstanding about lat-long than the good old simplicity-itself national grid. The oft-used precision (e.g.
55.12345678901234556 degrees north) - just because the computer can calculate to that number of decimal places - takes positioning to a ridiculous sub-atomic level. Even multimap - who should know better - have fallen into that trap. Also, most people don’t understand it is not possible to convert between lat-long and eastings/northings terribly accurately, and that, unlike eastings and northings, two numerically identical lat-long positions don’t necessarily refer to the same point. But let’s not pretend I’m a mathematical whizz who actually understands coordinate systems and transformations, but its all here if you want the details: A guide to coordinate systems in Great Britain.

But all of that only slightly undermines multimap’s achievement. I wonder if they had to get special permission from the OS to use their data in this way? It makes OS copyright a little easier to infringe, but also again (like google’s first) shows the remarkable usability of the slippy-interface and its underlying AJAX technology. This ‘blogger’ particularly likes multimap’s feature which identifies the ‘locality’ which continually updates as you move cross the map. They must have built a load of theissen polygons from the OS gazetteer to underpin this. The result works well in some places, but there’s no sign of the name on the map in others. Less good is the use of getmapping aerial imagery, which seems to have a lot of gaps, many more than it should have - people’s map wins there. And performance seems to be a struggle, you need big banks of powerful servers before you can offer a professional slippy / AJAX service - the demands of the clients are probably two orders of magnitude more than the previous technology.

Good try though - 6 out of 10.

Above Cartoon by Peter Steiner. The New Yorker, July 5, 1993 issue (Vol.69, no. 20) page 61

Fight spam with reCAPTCHA

When you submit details to a website via a web form, increasingly you’re asked to interpret a picture of a word and type the answer in a text box, this type of puzzle is known as a ‘CAPTCHA’ – if you’re not sure what I’m talking about, check out the comments section of this post for an example.

The CAPTCHA was created by Luis von Ahn, in an attempt to fight Spam…if the CAPTCHA is completed successfully you are assumed to be human and the web form is submitted, if the CAPTCHA is failed you are assumed to be a computer and web form submission is prevented. The CAPTCHA is a classic example of a Turing Test, as proposed by the eponymous researcher in his 1950 paper ‘Computing Machinery and Intelligence’ - indeed the name CAPTCHA coined by Luis von Ahn is an acronym for a “Completely Automated Public Turing test to tell Computers and Humans Apart”.

An example captcha

Ahn was initially proud of his highly effective solution for preventing Spam, but was subsequently frustrated with the cumulative amount of time being consumed by millions of people across the globe filling in CAPTCHA’s and producing very little in return. Personally I found the CAPTCHA mightily offensive and avoided them like the plague, until I discovered them to be the only effective way to stop computers spamming this blog. Fortunately, Ahn has recently worked with Ben Maurer to address his frustration and recently released reCAPTCHA. Here’s the deal…

The Internet Archive is attempting to automatically digitize old books using optical text recognition software, they are largely successful but the text recognition software struggles with recognizing ye olde English, meaning that roughly 8% of words are digitized incorrectly or not at all. reCAPTCHA addresses this problem by asking a user to interpret a picture of two words (instead of just one), the first word is a known word and the second word an unknown word. If you correctly interpret the first known word, it is assumed you have also correctly interpreted the second word that wasn’t recognized by the text recognition software. So when you fill in a reCAPTCHA, not only are you proving that you are human, but you are also helping to digitize old books! For me, application of technology in this way is poetry.

Obviously Luis von Ahn has applied this technique to digitizing old books, but it had me thinking of the potential to apply such an approach to digitizing maps…is there scope for a geoCAPTCHA?

The above cartoon is the copyrighted work of David Farley

AGI Conference 2007

AGI Conference 2007I’ve spent the last couple of days in Stratford-upon-Avon, birthplace of William Shakespeare, and home to the AGI Conference 2007. It’s been interesting, the organisers clearly worked hard to get a varied agenda…the impact of Neogeography on old skool GI seems to have generated the most discussion.

Vanessa Lawrence delivered an engaging keynote speech and noted the impact of the recently published ‘Power of Information’ report authored by Tom Steinberg and Ed Mayo. Apparently, on publication it went straight to Downing Street and was read cover to cover, as a consequence the OS will act on one of the report’s recommendations and release their web mapping API ‘OpenSpaces’ in time for Christmas. Good Stuff.

Chris Parker (Head of OS research labs) also gave an inviting talk and considered the inevitable tighter integration between social networking API’s (e.g. Facebook) and web mapping API’s. With this is mind, it was interesting to speak with Widr co-founder John Abbott. Widr is compiling a database of geocoded wireless networks and provides a free API that web developers can use to tailor content to the end users geographic location. With Internet TV gaining ever more momentum I think there is real opportunity for Widr. I suspect critical to the success of Widr will be building a community of users, and to this end they have an application for facebook in the pipeline.

Nick Black of ZXV Ltd presented a compelling pitch for OpenStreetmap, and flagged up an academic paper by Yale economist Yochai Benkler titled Coase’s Penguin. In the paper Benkler discusses the merits of the much in vogue ‘commons-based peer-production’ (crowdsourcing).

Did you attend?

Continue Previous page Next page



Free GIS Software...

 

Download ArcGIS Explorer, a free globe explorer from ESRI. Use ArcGIS Explorer to answer your everyday geography questions... “How do I drive to Birmingham?” and “Which river flows through London?”. For the GIS Professional, use ArcGIS Explorer to fuse your rich GIS datasets, with server-based geoprocessing applications, and distribute your geospatial activities throughout your organisation.

Before you go

Going so soon? Test your geography with the...

 

Do you support the campaign? Should government-funded and approved agencies such as the Ordnance Survey collect data with significant indirect contributions from the UK tax-payer, but then charge users and companies for access to it?

 

Download Flash plugin