About Linked Data
We live in exciting times. At least, if you are interested in data. And everyone even remotely interested in geospatial information should be interested in data. Without data there is no information. The usefulness of information is determined by quality, quantity and availability of data. And don’t we often notice that it is hard getting the data we need? Data can be hard to find, hard to transform to an appropriate format and of uncertain quality. Those kinds of problems are now addressed by the Linked Data concept.
Linked Data foresees a step forward by the current World Wide Web. The current web can be described as a web of documents – articles, pictures, movies,.. all that content is linked and findable. Linked Data is about doing the same for raw data. And that should be possible by following these simple rules, proposed by sir Tim Berners-Lee, the father of the current World Wide Web:
- Use URIs to identify things.
- Use HTTP URIs so that these things can be referred to and looked up (“dereferenced”) by people and user agents.
- Provide useful information about the thing when its URI is dereferenced, using standard formats such as RDF/XML.
- Include links to other, related URIs in the exposed data to improve discovery of other related information on the Web.
For further reading on the subject of Linked Data we can recommend this online book.

This diagram depicts the state of Linked Open Data as of September 2010. Each circle represents a data set. Note that a portion of the data sets in the LOD cloud are primarily of a geographical nature. Source: Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/.
Geospatial Linked Data
If the Linked Data principles are to be applied to geographical data, we can can see enormous potential improvements over the present day situation. Geographical data will no longer be a special kind of data only understood by specialized people or specialized software, but it will be fully interwoven in the global data fabric. Also, geospatial analysis will be much easier to do and yield far better results if the analyst can tap in to one global database.
With Linked Data we are moving into some uncharted waters. Much information can already be found on the web, but especially in the geospatial domain some things still need to be found out. Here are a few research issues that we will have to deal with:
Editing
We see many examples of read-only databases being published, for example by enlightened governments that want to share their data with the general public. Linked datasets that are open for editing are less available. In the growing domain of crowdsourcing, naturally datasets should be open for editing on the web. It seems some further research is needed before we have a web of data that is editable in a simple and standardized way.
Vocabularies for geometries
Linked Data are self-describing and use vocabularies to describe things. Everybody is free to add vocabularies to the web, but it makes sense to share these. For example, it is common practice to use the FOAF vocabulary when data apply to a people. So if a thing is known to be of type FOAF:person, we know we are dealing with a person.
One of the main characteristics of geospatial data is that they contain geometries. Typically, two-dimensional points, lines or polygons with a known coordinate reference system. What we need is a common way of expressing those geometries in RDF. A commonly used vocabulary already exists for the very simplest of geometry: a point having a WGS84 latitude and longitude is defined in http://www.w3.org/2003/01/geo/. There is a need for a commonly accepted vocabulary that can be used for more complex geometries.
Linking by location
In the world of Linked Data, linking things is greatly encouraged. This linking can take place within a data set or between resources in separate data sets. Linking does not necessarily have to be done by the data provider, it could be a task performed by some other agent to enrich common caches of linked data, or to enrich the original data set (in which case write access is required).
One of the interesting things about geospatial data is that location is a kind of implicit link. If two things exist at the same location, they might be related or even be the same thing. The quality and quantity of links between resources is will be a large factor in the success of Linked Data. It seems the GIS world can be of help there, by using knowledge of spatial relations to add linkage.
Consuming Linked Spatial Data
Nowadays we see two kinds of software being used for dealing with spatial data: web browsers and desktop applications. We can assume this will still be the case in the near future. On both fronts some adaptation is needed to effectively deal with Linked Spatial Data.
Traditional web browsers are not ready to deal with the web of data yet, they don’t know how to deal with RDF and RDF links yet. Several web browsers extension or web applications exist that do allow meaningful interaction with the web of data. But still we notice that there is a lot of room for of improvement, especially when it comes to geographical data. Typically, a data consumer will want to inspect geographical data in a map. Because geographical data are an intrinsic part of the data web, being able to display those data on a map should be an intrinsic part of web browsers.
Desktop GIS initially were build to handle local data. Nowadays, they are able to deal with data from the web, up to a certain extent. A self respecting desktop GIS should be able to interact with a web service that uses the OGC WFS specification. No desktop GIS that is able to interact with Linked Data exists yet. In the future, one should be able to issue meta data searches, find data and perform complex analyses with the data web.
Linked Data at Geodan
We are enthusiastic about the concept of Linked Data and its potential. That is why we are looking in to the matter. One thing that we intend to do is to start publishing some linked data. This should give us some much needed practical experience, and perhaps enable us to investigate the issues raised above. Also, we hope we can make meaningful contributions to the web of data.
Keep watching this space for more information on our Linked Data efforts (and some real Linked Data too)!