Ejeg Volume11 Issue2 Article296
Ejeg Volume11 Issue2 Article296
Abstract: Despite the development of Open Data platforms, the wider deployment of Open Data still faces
significant barriers. It requires identifying the obstacles that have prevented e-government bodies either from
implementing an Open Data strategy or from ensuring its sustainability.This paper presents the results of a
study carried out between June and November 2012, in which we analyzed three cases of Open Data
development through their platforms, in a medium size city (Rennes, France), a large city (Berlin, Germany),
and at national level (UK). It aims to draw a clear typology of challenges, risks, limitations, barriers, all terms
used by the different stakeholders with diverse meanings and based on different motivations. Indeed the
issues and constraints faced by re-users of public data differ from the ones encountered by the public data
providers. Through the analysis of the experiences in opening data, we attempt to identify how barriers were
overcome and how risks were managed. Beyond passionate debates in favor or against Open Data, we
propose to consider the development of an Open Data initiative in terms of risks, contingency actions, and
expected opportunities. We therefore present in the next sections the risks to Open Data organized in 7
categories: (1) governance, (2) economic issues, (3) licenses and legal frameworks, (4) data characteristics, (5)
metadata, (6) access, and (7) skills.
1 Introduction
Open Data has gained a lot of interest in the e-government communities over the last years, leading to the
implementation of many initiatives and platforms to publish Open Datasets in such areas as mobility (e.g., bus
timetables), security (e.g., crime rates), or economy (e.g., statistics on business creations). Open Data is an
essential tool for the dissemination of the open government principles. However its wider deployment
requires identifying the obstacles that have prevented e-government bodies either from implementing an
Open Data strategy or from ensuring its sustainability. We have therefore carried out a study between June
and November 2012, in which we analyzed three cases of Open Data development through their platforms, in
1 2 3
a medium size city (Rennes, France ), a large city (Berlin, Germany ), and at national level (UK ). In addition,
we have studied the context in which the Open Data movement has been developed across Europe, in
particular the type of data that have been opened, and the services that were developed by Open Data re-
users.
In this study, we aim to draw a clear typology of challenges, risks, limitations, barriers, all terms used by the
different stakeholders with diverse meanings and based on different motivations. Indeed the challenges and
constraints faced by re-users of public data differ from the ones encountered by the public data providers.
Through the analysis of the experiences in opening data in the UK and in the cities of Rennes in France and
Berlin in Germany, we attempt to identify how barriers were overcome and how risks were managed.
Beyond passionate debates in favor or against Open Data, we propose to consider the development of an
Open Data initiative in terms of risks, contingency actions, and expected opportunities. We therefore present
in the next sections the risks related to Open Data organized in 7 categories:
governance,
economic issues,
licenses and legal frameworks,
data characteristics,
1
http://www.data.rennes-metropole.fr/
2
http://daten.berlin.de/
3
http://data.gov.uk/
ISSN 1479-439X 348 ©Academic Publishing International Ltd
Reference this paper as: Sébastien Martin et al “Risk Analysis to Overcome Barriers to Open Data” Electronic
Journal of e-Government Volume 11 Issue 1 2013, (pp348 -359), available online at www.ejeg.com
Sébastien Martin et al
metadata,
access, and
skills.
2 Studied experiences
We analysed both Open Datasets and services created based on those datasets. We took into consideration
data catalogues from Rennes in France and Berlin in Germany. We used the conclusions drawn by Fraunhofer
(Both, 2012) which supported the creation of the Berlin platform. Analyses of services were only carried out
for Rennes and the United Kingdom, since the Berlin initiative was more recent.
Berlin (Germany) is a good example of Open Data at the local level, with the additional advantage of showing
the relationship between different administrative scales, the level of the city itself and the region (land). Berlin
platform has served as a model and prototype for the whole Germany and even beyond since it is integrated in
in a European research project. It was also prepared with a prospective study by the Fraunhofer Institute to
understand of the data opening process since its early stages. The first datasets are progressively added to the
portal, launched in September 2011.
Rennes, the administrative centre of the French region Bretagne, also provides an example of local Open Data.
The city led the opening of its data in a broader approach to innovation based on digital technologies. Among
the first initiatives in France, this is the first community to open a portal in 2010. It has set up an effective
support for reuse with a reuse competition and has a rather dense network of re-users.
The UK portal is open since January 2010 and centralizes data nationally. The British approach has also
resulted in the publication of reports and scientific articles. The British government has asked each
department to publish its Open Data strategy of opening, each one being inserted into the overall strategy
4
outlined in a White Paper . Data from the United Kingdom have also enabled the creation of a large number of
services that can be analysed to understand how the data were reused.
These case studies were chosen because they are exemplary Open Data initiatives at different geographical
levels and suggest paths to improvement the data opening process and the creation of new services.
Figure 1: Ishikawa diagram summarising risks and barriers related to data opening
4
http://data.gov.uk/library/open-data-white-paper
Finally we reviewed existing studies and open data experiences as presented in the literature. Overall we
gathered key information on obstacles to data opening, perceived risks, contingency, remediation actions.
Alonso (Alonso, 2011) also points out the resistance to change in government administrations. He proposes to
showcase the social and economic benefits of opening data and to identify champions.
5
« Open Data ist ein wichtiger Baustein des Open Government für eine transparente und bürgernahe Verwaltung. »
http://daten.berlin.de
6
http://www.etalab.gouv.fr/
7
on Legal Aspects of Public Sector Information ) and interoperability initiatives (e.g., SEMIC/JoinUp project) can
help overcome the fragmentation of projects in Europe.
Reluctance of civil servants Engage in the early mobilization of Ask officials to identify specific issues
internal and external stakeholders, that explain their reluctance.
and of civil society organizations.
Identify champions
Involve as many people as possible
Point out economic and social benefits
Integrate into day to day process
7
http://www.lapsi-project.eu/
8
http://www.lacantine-rennes.net/
9
http://berlin.opendataday.de/ueber/
10
http://news.bis.gov.uk/Press-Releases/New-funding-to-accelerate-benefits-of-open-data-684c1.aspx
Nantes Métropole France General Local € 100 000 (cost of the Portal)
Finally, the return on investment should take into consideration the benefits to the administrations and public
institutions themselves, including benefits related to semantic interoperability (Alonso, 2011) and the change
of behaviours of public employees by ingesting a data sharing culture which benefits first of all to
eGovernment itself.
11
Retrieved from http://www.w3.org/2012/06/pmod/pmod2012_submission_19.pdf
12
Retrieved from http://www.journaldunet.com/ebusiness/le-net/budget-etalab-1111.shtml
13
http://www.govtech.com/e-government/Do-Apps-for-Democracy-and-Other.html
The cost of opening data Assessing the costs of not opening Share part of the costs with other
Open Data platforms
Benefits and return on Use APIs which allow tracing reuse Adopt a realistic approach to costs
investment and benefits; Encourage stakeholders
who use Open Data to indicate that
use.
Take into consideration and measure
benefits to eGovernment and data
producers themselves
In Rennes, the whole datasets recorded were covered by the license Rennes Métropole V2 at the time of this
study. In Berlin, the datasets follow a slightly less homogeneous licensing model. Despite differences, in both
cases, licenses applied to datasets in Berlin are open, except for 4 datasets described as "keine Freie Lizenz"
(Table 4).
License Occurrences
While most efforts currently focus on public data, Deloitte in its analysis (Deloitte, 2012) suggests that the
main value of Open Data will result from the combination of public data, business data, and personal data. The
creation of most services in the coming years would be based on combinations of datasets. This will likely lead
to even more complex situations regarding rights and licenses over datasets.
Licence is not open enough Release data complying with the Collect the concerns of re-users and
definition of openness modify licenses if the barriers are too
constraining.
Heterogeneous licences across Awareness raising among Strengthen the role of the agency that
datasets stakeholders organizes Open Data
Enforce a clear policy to publish terms
of use
The choice of the datasets to open is also critical to ensure a high return on investment. An initial phase of
data analysis is necessary in an Open Data project in order to ensure that the data which will be open both are
of high quality and have a high value for reusers (see Martin, 2013). For instance the Bluenove study in France
showed that the data most expected by companies are economic and commercial data as well as geographical
data (Bluenove, 2011).
14
http://www.keolis.com/
15
http://www.europeana.eu/
16
http://data.gov.sg/
financial aspects. Entirely dependent on funds provided by the State, these were cut repeatedly over 1990’s,
which has led to a sharp deterioration in its quality (Uhlir, 2009).
Open Data advocates discard the risks regarding data quality by showing the opportunities of involving users in
the process of data improvement. By identifying errors and warning the data curators, re-users as well as any
citizen can contribute to maintain high quality datasets through crowdsourcing mechanisms.
Berlin 61 1 9 2.3
17
http://5stardata.info
Data accuracy and bias Clarify the context of the data creation
process
Data available in heterogeneous Publish datasets in various formats Develop guidelines and
formats but early to create incentive for later encourage standard formats
standardization across government bodies
Training in semantic
technologies
In Rennes, the catalogue is being enriched with the recent addition of metadata properties, for instance the
precision of geographic data and their original reference system.
In Berlin, Both (2012) demonstrates the key role of metadata for the future of Open Data and even suggests
tracing reuses through metadata.
Identified risk Mitigation Contingency actions
Lack of single standard to Start with standards as early as Participate in the harmonization of
describe datasets possible metadata between Open Data
catalogues
18
http://dublincore.org/documents/dces/
19
http://www.w3.org/TR/vocab-dcat/
20
http://opendata.montpelliernumerique.fr/Vers-une-harmonisation-des
21
http://www.w3.org/2011/prov/wiki/Main_Page
potential re-users by establishing tedious procedures. On the opposite if the platform does not impose any
identification, it becomes very difficult to know who is accessing what data and reusing it.
More and more, platforms enable access through APIs (e.g., data.gov in the United States) for re-users who
can then automatically access and update the datasets, instead of maintaining their own copy of the data.
They relieve service creators of the task of updating data. By ensuring that data used by service creators is up-
to-date, the data providers increase the quality of services. They also control better their reputation through
the accurate representation of their datasets. Nevertheless, the proportion of data accessible through APIs is
still low. For example, in Rennes, there are only five datasets opened through an API. These datasets are also
among the most used by applications created from public data. Although it is unclear that this is due to the
presence of an API (as they also happen to belong to the domain of mobility, highly popular among data re-
users), it suggests that APIs can indeed support the reusability of data.
Balance between free access Provide all the data through an API
and the need to know the use capable of reporting access and use
of data
In particular, analysing the skills of re-users can help understand how to facilitate the reuse and the type of
services that can be developed on top of the datasets.
The issue of skills is also related to the ability of stakeholders to generate profits from Open Datasets. It is also
represented in the concerns about the privatization of public data, with a few people grabbing what should be
a common wealth. In this regard, Chignard mentions genealogical data, which represent a very important
market (Chignard, 2012).
These risks are to a large extent beyond the scope of Open Data, in particular risks related to the level of
education and information literacy. However, they can be addressed through the development of data
visualisations, which can ease the understanding and interpretation of the phenomena described in datasets.
In addition, education can help improve the skills of users and re-users through initiatives led by re-user groups
22
which organize training sessions to present available data and the tools and methods to work with them .
This calls for a more pragmatic approach grounded in demonstrated benefits and a clear assessment of the
risks associated with the implementation of an Open Data strategy. By analyzing the barriers and potential
benefits of Open Data, without the ambition of being exhaustive, we propose prevention measures and
contingency actions which can be taken. As illustrated by Alonso (Alonso, 2011), barriers to Open Data are not
technical. They are rather 1) cultural, 2) economic, 3) legal, and 4) semantic.
However, while Open Data is often considered at the level of general public policies, we note that not all types
of data raise the same risks and opportunities. The sale of certain types of datasets is potentially very
profitable, whereas others do not have existing markets. Rennes has to a large extent focused on geographic
data, while Berlin has opened many economic datasets. The services developed based on the datasets can
therefore be of very different nature, making all analyses on costs and benefits very difficult to apply across
cases. As noted by Martin (Martin, 2013) who led a survey on barriers to Open Data, the current focus is on
Open Data supply. This only represents one aspect to be tackled to match the promise of Open Data. It is
necessary to also investigate the creation of services based on Open Data so as to maximize the return on
investment of data producers and publishers.
The analysis in terms of return on investment is very different according to the type of data. However, specific
actions, such as the definition of complete and standardized metadata can enhance the potential for reuse of
datasets and therefore increase the return on investment, whichever the type of data that is considered.
All the same, different types of actors may perceive risks in a different way, due in particular to their local
context. Engaging in a risk management framework tailored to the specific context of data providers can help
considering Open Data beyond the traditional barriers highlighted by opponents. Most importantly, it
demonstrates the need to consider the deployment of an Open Data initiative as a long term process whose
sustainability can be improved through the evolution of all stakeholders: users and re-users through the
enhancement of skills and the creation of efficient associations; data creators through the prediction and
selection of formats necessary to enhance the reuse of data and the release of multiple data formats; finally
intermediary platforms such as national aggregators which can help overcome risks related to the
fragmentation of datasets, in technical, semantic, as well as legal terms.
22
Retrieved from http://lemag.lacantine-rennes.net/2012/10/atelier-infolab-a-la-chasse-aux-donnees-rennaises-de-
mobilite-1752
23
Retrieved from http://www.tnr.com/article/books-and-arts/against-transparency?page=0,0#
Future work will be dedicated to the study of the different types of datasets and services developed and the
way in which it is possible to optimize the return on investment of Open Data initiatives by selecting relevant
datasets and understanding the process by which successful services can be built on top of those datasets.
References
Alonso J.-M., (2011) “Open Government Data (approaches, concerns and barriers, lessons learned)”
Share-PSI workshop, Brussels.
Benjamin, S., Bhuvaneswari, R., Rajan, P. (2007) “Bhoomi:'E–governance', or, an anti–politics machine necessary to
globalize Bangalore?”, CASUM–m Working Paper.
Bluenove (2011). “Open Data : quels enjeux et opportunites pour l’entreprise ? “
Both, W. & Schieferdecker, I. (2012) Berliner Open Data-Strategie, Fraunhofer Verlag.
Chignard, S. ( 2012) Open Data: comprendre l’ouverture des données publiques. FYP.
Davies, T. (2010) “Open Data, democracy and public sector reform”. Available at:
http://www.opendataimpacts.net/report/wp-content/uploads/2010/08/How-is-open-government-data-being-used-
in-practice.pdf
Davies, T. & Bawa, Z. (2012) “The Promises and Perils of Open Government Data (OGD)”. The Journal of Community
Informatics. Available at: http://ci-journal.net/index.php/ciej/article/view/929/926
Deloitte. (2 12) Open Data driving growth, ingenuity and innovation. Deloitte analytics paper. Available at:
http://www.deloitte.com/assets/Dcom-
UnitedKingdom/Local%20Assets/Documents/Market%20insights/Deloitte%20Analytics/uk-insights-deloitte-analytics-
open-data-june-2012.pdf
Gilbert, D., Balestrini, P. & Littleboy, D., 2004. “Barriers and benefits in the adoption of e-government”. International
Journal of Public Sector Management Available at:
http://www.emeraldinsight.com/journals.htm?articleid=868029&show=abstract
Houghton, J., 2011. Costs and benefits of data provision. Melborne: Centre for Strategic Economic Studies (Victoria
University) Available at: https://www.oerknowledgecloud.com/sites/oerknowledgecloud.com/files/houghton-cost-
benefit-study.pdf
Janssen, M., 2012. “Benefits, Adoption Barriers and Myths of Open Data and Open Government”. Information Systems
Management.
Klessmann, J. ; Denker, P. ; Schieferdecker, I. ; Schulz, S., 2012. Open government data Deutschland. Eine Studie zu Open
Government in Deutschland im Auftrag des Bundesministerium des Innern. Berlin: Bundesministerium des Innern
(Germany).
Martin, Ch. (2013). “Understanding Barriers to Open Government Data”. Open Knowledge Foundation Blog.
Pasquier, M. & Villeneuve, J., 2007. “Organizational barriers to transparency a typology and analysis of organizational
behaviour tending to prevent or restrict access to information”. International Review of Administrative Science.
Uhlir, P.F., 2009. The Socioeconomic Effects of Public Sector Information on Digital Networks: Toward a Better
Understanding of Different Access and Reuse Policies. OECD.
Yu, H. & Robinson, D., 2012. “The New Ambiguity of Open Government”. Princeton CITP/Yale ISP Working Paper. Available
at: http://papers.ssrn.com/sol3/papers.cfm?abstract_id=2012489&