06-26-2010 Open Data Briefing Paper
06-26-2010 Open Data Briefing Paper
THE COUNCIL
COMMITTEE ON TECHNOLOGY
Hon. Daniel R. Garodnick, Chair
Daniel R. Garodnick will hold a hearing on Int. No. 29. Int. No. 29 creates an open data
“Open government data” is the concept that data generated by public sector
entities that is legally accessible should be available to the public electronically via the
Internet in “open raw formats.”1 “Open,” refers to the use of non-proprietary software or
systems used to encode the data. For example, XML (extensible markup language) is a
especially via the Internet.2 XML is available to anyone, and information that is stored in
an XML format is may be accessed by anyone. “Raw,” refers to data that has not been
processed. For example, the demographic data collected by the United States Census
Bureau would be raw, but the reports generated by the United States Census Bureau
individuals into the governmental process by making access to information easy and
affordable; it would promote transparency and accountability by providing data in its raw
form, enabling members of the public to perform their own data analysis and draw their
1
Improving Access to Government through Better Use of the Web, W3C, 10 March 2009.
2
Extensible Markup Language (XML) 1.0 (Fourth Edition), W3C, 16 August 2006, edited 29 September
2006.
2
own conclusions; and it permits deeper and more varied analysis of government data by
“mashup”), often by aggregating data and displaying the information visually on a map.
listings obtained from Craigslist to visually represent the available housing in a city
The ingenuity of the public to create data mashups is limited only by the
Technology Officer (CTO) announced the Applications for Democracy program whereby
private citizens compete for $50,000 in total prize money by creating mashups that utilize
any of the 200 data feeds available through the office of the CTO website.3 Within thirty
days, the Office of the CTO received 47 completed applications valued at $2,300,000.4
Applications included a carpool matchmaker, and a real time alert notification that would
send alerts (crime reports, building permits, etc.) based on a user’s location if they have a
GPS-enabled device.
On June 29, 2009, the Economic Development Corporation and the Department
To use in the competition, the City created NYC Data Mine, an online repository of over
80 public data sets from 32 agencies. The competition received 85 eligible applications
3
47 Applications in 30 Days for $50K, Government Technology, November 13, 2008.
4
http://www.appsfordemocracy.org/
5
Mayor Bloomberg Announces Five Technology Initiatives to Accessibility, Transparency, and
Accountability across City Government. Press Release June 29, 2009.
3
by December 15, 2009, and on February 4, 2010, $20,000 in prizes were awarded to
thirteen different categories. The Best Overall Application Grand Prize was awarded for
the application WayFinder NYC, which helps users locate the nearest subway and PATH
stations. The Investor’s Choice and Popular Choice Grand Prizes were awarded for the
application NYC Way, a bundle of over thirty features that allow users to locate nearby
principles that define open government data.6 While these principles are not binding,
1. Complete – All public data is made available. Public data is data that is not
subject to valid privacy, security or privilege limitations.
2. Primary – Data is as collected at the source, with the highest possible level of
granularity, not in aggregate or modified forms.
4. Accessible – Data is available to the widest range of users for the widest range
of purposes.
6
http://resource.org/8_principles.html
4
The City of New York currently makes an enormous amount of public data
available on NYC.gov, from the Building Information System (BIS) operated by the
operated by the Department of Finance, NYC Stat, operated by the Mayor’s Office of
(CPR), neighborhood statistics, and the American Recovery and Reinvestment Act of
2009 (ARRA), and NYC Data Mine, which is the repository for the data sets made
available for the NYC BigApps contest.7 Unfortunately, much of this data is available
only in proprietary and/or visual formats, and not all of it is raw data. Some of the more
extensive databases, such as the Building Information System (BIS), specifically forbid
automated machine processing. Combined, these factors limit the usefulness of the data
3. INT. NO. 29
Section 1 of Int. No. 29 amends Title 23 of the Administrative Code of the City of
New York by adding a new Chapter 3, titled “Accessibility to Public Data Sets”.
Section 23-301 defines the terms “agency,” “consensus,” “data,” “data set,”
Section 23-302 requires that all public data sets maintained by City agencies shall
be made available on the Internet through a single web portal, formatted to enable
7
http://www.nyc.gov/
5
viewing by web browsers and mobile devices and also in their raw or unprocessed form.
These public data sets shall be updated online as often as necessary to preserve the
integrity and usefulness of the data sets using web syndication technology, which is a
service that would notify the public of all updates for each public data set and for each
agency. These public data sets shall be made available without any registration
requirement, license requirement or restrictions on their use, and shall be presented and
structured in a format that permits automated processing. Furthermore the City shall
programs to request and receive information from the City’s website in real time and pass
technical standards manual for the publishing of public data sets on the Internet by City
agencies for the City of New York by July 5, 2010. The Department of Information
consensus standards bodies, such as the World Wide Web Consortium, and to participate
with such bodies when such participation is feasible and in the public interest.
Information Technology and Telecommunications to review the public data sets under its
control and to develop and submit an agency compliance plan to the Mayor and the
Council no later than July 5, 2010. Such plan shall describe how such agency intends to
achieve full compliance with this chapter by December 2, 2013, and shall include an
accounting of all public data sets under the control of the agency. As a part of such plan,
6
agencies shall classify the public data sets under their control as “immediate”, “priority”
and “legacy”. “Immediate” includes any public data set that can be made available on the
Internet within thirty days of the agency acquiring or creating such data set. “Legacy”
includes any public data set that, due to its size or complexity, or due to technology
constraints, cannot be made available on the Internet by January 2, 2012. All other public
Section 23-305 exempts public data sets from the requirements of Section 23-302
for a limited period of time, based on which classification they were assigned in the
agency compliance plan created pursuant to Section 23-304. “Immediate” public data
sets are exempt for thirty days after enactment of the law. “Priority” public data sets are
exempt until January 2, 2012. “Legacy” public data sets are exempt until December 2,
2013.
Section 2 of this bill states that the bill becomes effective immediately upon
enactment.
7
8
Int. No. 29
By Council Members Brewer, Fidler, Gentile, Gonzalez, James, Lander, Palma, Nelson,
Lappin, Dromm and Garodnick
A LOCAL LAW
To amend the administrative code of the city of New York, in relation to creating open
data standards.
Section 1. Title 23 of the administrative code of the city of New York is amended
CHAPTER 3
§ 23-301 Definitions.
§ 23-302 Public data sets availability.
§ 23-303 Internet data set policy and technical standards.
§ 23-304 Agency compliance plan.
§ 23-305 Exemptions and timetable.
includes a process for attempting to resolve objections by interested parties, as long as all
determination.
9
d. “Data set” means a comprehensive collection of interrelated data on any subject
shall not refer to data collected and stored on computing devices where such data has not
been added to any data set or used to create a new data set, provided that the data stored
on such computing devices is routinely added to a data set. Such term shall not refer to
copies of the data set, or portions thereof, that may be stored on computing devices.
person, including, but not limited to: (1) eligibility for services or benefits; (2) issuing a
permit; (3) registration, certification and licensing; and (4) liability for civil and criminal
penalties.
h. “Public data set” means any data set that is maintained by an agency that must
be accessible for inspection by the public in accordance with any provision of law or that
an agency shall decide to make accessible, excluding any data to which an agency may
j. “Technical standard” means (1) the common and repeated use of rules,
methods, and related management systems practices; and (2) the definition of terms;
10
describing materials, processes, products, systems, services or practices; test methods and
including employees.
voluntary consensus standards bodies, both domestic and international. These standards
include provisions requiring that owners of relevant intellectual property agree to make
an application program may request services or information from a web page or library
§23-302 Public data set availability. a. All public data sets maintained by city
agencies shall be made available for inspection by the public on the Internet through a
single web portal that is linked to nyc.gov or any successor website maintained by or on
b. All public data sets shall be formatted to enable viewing by web browsers and,
where practicable, mobile devices. All public data sets shall also be made available in
c. All public data sets shall be updated as often as necessary to preserve the
11
integrity and usefulness of the data sets. The web portal hosting the public data sets shall
make use of web syndication technology to notify the public of all updates for each
d. All public data sets shall be presented and structured in a format that permits
automated processing.
e. All public data sets shall be made available without any registration
programming interface that shall permit application programs to request and receive
§23-303 Internet data set policy and technical standards. a. The department shall
promulgate rules establishing an Internet data set policy for the city of New York no later
than July 5, 2010, for the purposes of ensuring agency compliance with the provisions
and open data principles set forth in section 23-302 of this chapter. Such policy shall
include, but not be limited to, guidelines to assist agencies in classifying their public data
sets either as immediate, priority and legacy pursuant to section 23-304 of this chapter.
b. No later than July 5, 2010, the department shall prepare and publish a technical
standards manual for the publishing of public data sets on the Internet by city agencies
for the purpose of making public data available to the greatest number of users and for
the greatest number of applications and shall, whenever practicable, use non-proprietary
technical standards for web publishing and e-government that have been developed or
adopted by voluntary consensus standards bodies. Such manual shall identify the reasons
12
why each technical standard was selected and for which types of data it is applicable and
may recommend or require that data be published in more than one technical standard.
c. The department shall consult with voluntary consensus standards bodies and
shall, when such participation is feasible, in the public interest and is compatible with
agency and departmental missions, authorities and priorities, participate with such bodies
department, shall review all public data sets under its control and shall classify them as
immediate, priority and legacy, in accordance with subdivision c of this section and
chapter.
compliance plan to the mayor and the council no later than July 5, 2010. Such plan shall
describe how such agency intends to achieve full compliance with this chapter by
December 2, 2013, and shall include an accounting of all public data sets under the
control of the agency, how such data sets have been classified pursuant to subdivision a
of this section, and the rationale for each classification. No later than the first Monday in
January of each year beginning in 2011, each agency shall submit an annual update to its
compliance plan to the mayor and the council until such agency reports full compliance
13
date of the local law that added this chapter and any public data set such agency acquires
or creates after such effective date through July 5, 2010, that can be made available on
the Internet pursuant to section 23-302 of this chapter within thirty days of the agency
acquiring or creating such public data set shall be classified as immediate. Agencies
should make best efforts to classify as many public data sets as immediate as is possible.
Any public data set acquired or created after July 5, 2010, shall be classified as
immediate.
2. Legacy. Any public data set that, due to its size or complexity, or due to
technology constraints, cannot be made available on the Internet pursuant to section 23-
302 of this chapter by January 2, 2012, shall be classified as legacy. Agencies shall make
best efforts to avoid placing public data sets into the legacy classification.
3. Priority. Any public data set that does not meet the conditions of immediate or
pursuant to section 23-304 of this chapter shall be exempt from the provisions of section
23-302 of this chapter and from applicable rules promulgated by the department pursuant
b. Public data sets classified as priority pursuant to section 23-304 of this chapter
shall be exempt from the provisions of section 23-302 of this chapter and from applicable
rules promulgated by the department pursuant to such section until January 2, 2012.
c. Public data sets classified as legacy pursuant to section 23-304 of this chapter
shall be exempt from the provisions of section 23-302 of this chapter and from applicable
rules promulgated by the department pursuant to such section until December 2, 2013.
14
§2. This local law shall take effect immediately.
15