London DataStore

The London Datastore is an initiative by the Greater London Authority (GLA) to provide free and open access to various datasets related to London.
What are the main aims and objectives?

The main aims and objectives of the London Datastore are to enhance transparency and accountability within public services by providing open access to a wide range of datasets related to London. This initiative, spearheaded by the Greater London Authority (GLA), seeks to empower citizens, developers, researchers, and businesses by offering free access to raw data that can be used to create more insightful and user-friendly applications or analyses. By facilitating such access, the Datastore encourages innovation and collaboration, fostering a data-driven approach to addressing urban challenges and improving public services.

How does the program work?

The London Datastore operates as a comprehensive open data platform, providing free access to a wide range of datasets related to London. It functions as a centralized repository where various public bodies, including the Greater London Authority (GLA), Transport for London, and the Metropolitan Police Service, contribute data on topics ranging from population demographics to crime statistics and public transport information. The Datastore hosts over 6,000 datasets covering various areas such as public services, transport, environment, health, housing, and demography

At its core, the Datastore works by collecting, organizing, and making available raw data in various formats, allowing users to access and utilize this information for research, analysis, or application development. The platform features a user-friendly interface that enables easy navigation and search functionality, helping users quickly find relevant datasets. Additionally, it offers an API (Application Programming Interface) for more advanced users, allowing programmatic access to the data and facilitating integration with other systems or applications.

The Datastore continually updates and expands its repository, ensuring that the information remains current and relevant. It adheres to open data principles, making the majority of its datasets freely available for public use, subject to specific licensing conditions. This approach not only promotes transparency in public services but also encourages innovation by enabling developers, researchers, and businesses to create new applications or conduct in-depth analyses based on the available data.

To support its users, the London Datastore provides documentation, guides, and resources to help individuals and organizations effectively utilize the data. It also actively seeks feedback from its user community, using this input to drive continuous improvements to the platform's functionality and content. Through this collaborative and open approach, the London Datastore serves as a powerful tool for fostering data-driven decision-making, enhancing public services, and driving innovation within the city.

What is the overall cost?

The initial budget for the setup of the Datastore was less than £100,000.

How was it implemented?

The London DataStore was created as part of a broader initiative to promote transparency and open data in the governance of London. Launched in January 2010 under the leadership of then-Mayor Boris Johnson, the DataStore was spearheaded by the Greater London Authority (GLA), with significant input from the Intelligence Unit's Assistant Director. This initiative aligned with a global movement towards open government data, drawing inspiration from similar projects like the US Government's 'data.gov' platform.

In 2014, the London Datastore underwent a significant transformation aimed at expanding its scope and improving its functionality. This major overhaul focused on enhancing the platform's ability to handle an increasing number of datasets and publishing organizations more efficiently. The revamp addressed several key issues that had been identified in the original Datastore's operation.

One of the primary objectives was to improve the searchability of the platform. The original Datastore's search functionality was considered inadequate, particularly for users who were not subject matter experts. To address this, the team implemented an improved search engine and better search filters, making it easier for users to find the information they needed.

Another challenge was the limited number of people who could publish data on the platform. Initially, only three GLA employees had publishing permissions, which created a bottleneck as the Datastore's capacity expanded. Additionally, the publishing process was largely manual, further constraining the platform's growth potential.

To overcome these challenges, the Datastore team introduced CKAN (Comprehensive Knowledge Archive Network), an open-source data portal software. This decision was made based on a recommendation from a partner in the European iCity project. CKAN offered improved management of publishing organizations, allowing each to have a superuser with administrator rights, thus streamlining the publishing process.

The implementation of CKAN was carried out through an open competition for a customized version of the software. DataPress, a company offering a flexible cloud instance of CKAN, won this competition. Subsequently, DataPress designed the basic architecture of the current Datastore and continues to host the website. This collaboration has resulted in a more robust and efficient platform, capable of meeting the growing demands of London's data ecosystem.

What impact has been measured?

The Datastore attracts around 60,000 unique users monthly, serving a diverse audience including local government, academics, private businesses, and individuals.

CURATED BY

Research Programme Coordinator – Digital Startups
Nesta
United Kingdom