The LOD2 Open Government Data Stakeholder Survey 2010
The idea for the LOD2 Open Government Data Stakeholder Survey 2010 came into being in the course of the requirements elicitation and specification phase of the EU funded project LOD2 - Creating Knowledge out of Interlinked Data (see: http://www.lod2.eu). As one out of 3 use cases in the LOD2 project is publicdata.eu - the design and implementation of a single point of access, a centralised data portal / data catalogue for open data in EU 27 using linked data principles and technologies.
The objective of this survey was to involve as many relevant stakeholders as possible in European Union 27 countries (citizens, public administration, politicians, industry, media and science) in an online questionnaire and ask these stakeholders about their needs and requirements in the area of open data as well as for a data catalogue.
The goal of the LOD2 Open Government Data Stakeholder Survey was to reach a broad audience of the main stakeholders of open data: citizens, public administration, politics and industry. The survey has been designed and set up by the LOD2 project partners Open Knowledge Foundation (UK) and the Semantic Web Company (Austria). Additional support has been given by the LOD2 partners DERI Galway (Ireland), Wolters Kluwer Germany and the University of Leipzig (Germany).
The survey has been accomplished using the web based survey tool SurveyGizmo and has been promoted via blogs, mailings, mailing lists and additional viral marketing channels as well as at related events in Europe.
In the course of the survey - that was open for 5 weeks from 08.11.2010 to 15.12. 2010 - in total 340 participants filled the survey.
The results of the LOD2 Open Government Data Stakeholder Survey has been published in April 2011 at http://survey.lod2.eu in several ways and formats as follows:
- The survey results in the form of HTML pages (including several charts as well as cross-depending analysis) structured in areas as well as giving the results along the questionnaire
- In the form of a PDF for printing and download of the results
- The raw data of the survey results in open and machine readable CSV format for unlimited re-use as for own analysis and / or visualisations et al.
- The raw data in the form of linked open data for re-use, browsing as well as easy querying via a SPARQL endpoint (using the open source tools Virtuoso and OntoWiki) put in context by establishing links to DBpedia to allow more complex queries and richer results
The survey results have been promoted via several communication channels of the LOD2 project as mailing lists, blog posts, tweets, presentations and via direct mailing to participants. The HTML survey result area provides commenting functionality to enable feedback mechanisms to include this feedback into future work to enable lessons learned mechanisms.
All results are published using CC BY 3.0 to ensure unlimited re-use!
Results of the LOD2 Open Government Data Stakeholder Survey 2010 - a short summary
As follows the main results / most important results as a short summary - for detail results please use the left hand menu for browsing:
The questions regarding the preferences for the "format of data" implicit that going for RDF/XML and APIs is the right direction. But there are also some more formats mentioned especially in the open (other formats question) that should be taken in consideration (JSON).
Regarding the quality of data the results the survey shows a focus on the following issues that seem to be very important:
- provenance/source of data
- format of data
- completeness of meta data
In relation to the format preferences mentioned before the integration of data conversion mechanisms may be an issue that should be considered. A harmonised meta data structure seems to be already a step in the right direction for a EU27 data catalogue.
The top ranked topics regarding what users want to do with open government data are:
- research / analysis
- simply consuming the data
This can also be related to the 'expected to have features' of a data catalogue:
- providing raw datasets
- information about versions of data sets
- searching exploring, grouping and clustering of data sets
and the 'like to have features':
- crowd sourcing mechanisms
- alerts on (regional) information
- analysis an visualisation tools
Again these matches the format preferences mentioned above. There should be also a strong focus on search mechanism, the visualisation of search results and data, and a focus on features for data curation.
The question if a catalogue / portal should only store meta data (and link to the data sets) or should store also the data itself brought no clear result. At least it should be possible to store data on the platform if necessary and to provide a mechanism detecting links to datasets that are not available any more.
Finally users are looking for:
- white papers & best practice
- news on Open Government Data
- and use cases & success stories
Last but not least the support features (documentation, guides etc.) of a data portal / catalogue are of high relevance.