THIS GLOSSARY IS ADAPTED FROM A VARIETY OF SOURCES SUCH AS THE OPEN DATA HANDBOOK AND DATA.GOV.
Application Programming Interface (API) is a set of instructions and standards used by an application program to communicate with the operating system or some other control program such as a database management system.
An application or app is a software program that is designed to connect to large databases and often provides real-time information on a computer, mobile phone, and other similar platforms.
Acknowledging the source of data when consuming or re-publishing it. Data licenses may include this requirement when publishing open data.
The Comprehensive Knowledge Archive Network (CKAN) is an open source data management system for storage and distribution of data across the web, built and maintained by Open Knowledge. It serves as the official data publishing platform for about 20 national governments and powers the data publishing efforts of a variety of local, community, and scientific organizations. For more information click here.
A right for the creators of creative works to restrict others’ use of those works. A copyright holder is entitled to determine how others may use or restrict use of that work through a license.
‘Comma-separated values’, a standard format for spreadsheet data. Data is represented in a plain text file, with each data row on a new line and commas separating the values on each row. As a very simple open format it is easy to consume and is widely used for publishing open data.
Data can be thought of as facts, statistics, or other values systematically collected for reference, analysis, and calculation. Data becomes informative when it's structured, often combined with other data, and analyzed to extract meaning.
An authoritative listing of available open data organized in a manner that makes it easy to search and navigate.
An organized collection of data commonly presented in a spreadsheet (where columns represent variables and each row contains values for those variables) or in a form of a map.
The format of a file is associated with the last part of the file name or extension. For example, a CSV file could be called totalcars.csv.
Geospatial refers to data that has a geographic component to it such as coordinates, address, city, or zip code.
An organizational framework to group like datasets together to provide context and meaning to a user.
A social event that brings together programmers, subject experts, and advocates to share information and work together to build applications, visualizations, or prototypes to often address an issue or a set of inter-related issues.
Hyper Text Markup Language (HTML) is a language that describes the skeletal structure of a webpage. Internet browsers reference HTML to render the contents of a webpage to a user.
Keyhole Markup Language (KML) is a XML-based language for managing the display of three dimensional data on applications such as Google Earth. KML was developed for use with Google Earth and is accepted as an Open Geospatial Consortium Standard.
The license that accompanies the publication of a dataset to convey how a user can use or reference the data.
Information or data that is in a format that can be easily read or processed by a computer without human intervention. Machine readable data must be structured data and are often found in CSV, JSON, and XML file formats.
Provides descriptive information about data to give it context. Descriptive elements such as title, description, publisher, and license information are important to the discovery and usability of the data that is published.
Open data is the proactive release of government collected data that is made publicly available through an open license to enable citizens to freely access, reuse, and redistribute. From a technical perspective, open data is available in machine readable file formats and allows users to download data related to government operations or service delivery in bulk.
Open government is a governing principle that serves to support transparency, collaboration, and engagement with the public by implementing policies and utilizing technology to emphasize the sharing of government information.
Open source software is freely available to the public. Users are free to inspect it, modify it, and use the code for their own purposes.
No copyright exists over the work and users can utilize available data for their own purposes without restrictions.
A shapefile stores non-topological geometry and attribute information for the spatial features in a dataset. The geometry for a feature is stored as a shape comprising a set of vector coordinates. Shapefiles can support point, line, and area features.
Structured data refers to data where structural relationships between elements are retained and stored on a computer disk. PDFs and word processing documents are not structured forms of data because the logical structure cannot (or is nearly impossible) to extract automatically.
Keywords that help users discover datasets of interest.
Uniform resource locator (URL) when used with HTTP is a character string or web address that references a web page.
Extensible Markup Language (XML) defines rules or standards for encoding content in a format that is easily readable for both human and machine. XML can be used by any individual or group that wants to share information in a consistent way.
A computer file whose contents are compressed to facilitate storage or transmission. It often carries the .zip file extension.