CYOpenData_Lisbon_2018%20-%20Final.pdf


Open Data in Cyprus
The National Open Data Portal
Lisbon, 27th September 2018
Dimitris Michail (PAPD)
Christos Rodosthenous (OUC)

The Republic of Cyprus
• Population: 840,000
• Independent since 1960
• EU member since 2004
• Open Data Awareness: Medium
• First PSI re-use Law: 2006
• Current law: voted in Dec. 2015
• National Open Data Portal re-launched in 2018 (DKAN)

The Cyprus PSI Team
• Dimitris Michail (PAPD)
• Mantalena Tsoukka (PAPD)
• Christos Rodosthenous (OUC)
• Michalis Epiphaniou (OUC)

Main Challenges and Advantages of a Small Country

Main Challenges / Barriers
• Limited Resources
  o human
  o financial
• Low Demand
  o small population
  o small local market
  o few opportunities for commercial use
• Very small or no ecosystem around open data
• Absence of Local Initiatives / Portals
• Absence of comprehensive FOI Law (Cyprus)
  o coming into effect in 2020

Main Advantages
• Centralized Government System
  o ease of communication across government
  o relatively easy to form networks
• Small Population → easier to reach / inform
• Large number of universities (8) compared to population
• Mainly Services oriented economy
  o open to initiatives that enhance business environment

Fostering the use of open data | Hackathon 2.0

Open Data in Cyprus
o Legislation
o Infrastructure (portal)
o Provision of Training to PSBs
o Promotion of re-use

Timeline (years shown: 2006, 2011, 2012, 2014, 2015, 2016, 2017, 2018)
• 2006: PSI Directive (2003/98/EC) transposed to the “Re-use of public sector information Law N. 132(I)/2006”
• 2011: Socioeconomic Study. PAPD commissions Deloitte Cyprus to study the potential impact of PSI re-use in Cyprus and develop a road map for the practical implementation of the PSI Directive. The results of the study formed the basis for the 2012-2016 National Action Plan launched the following year.
• 2012: National Action Plan 2012-2016
• Launch of the 1st edition of the National OD Portal
• 2015: Revision of the National PSI Law in line with the revised PSI Directive (New National PSI Law)
• CY PSI Project: funded by the European Commission’s Structural Reform Support Service (SRSS) to assist the PSBs in implementing the revised directive 2013/37/EU and promote public sector information reuse to the public and private/business sector in Cyprus
• June 2016: Open Data Forum
• July 2016: Workshops for the public sector
• September 2016: 1st Open Data Hackathon in Cyprus
• 2017: National Action Plan 2017-2021
• May 2018: Revamped National OD portal launch
• September 2018: 2nd Open Data Hackathon in Cyprus

Developing the New Open Data Portal
• March 2017: Situation Analysis
• June 2017: Identification of Users’ Requirements
• September 2017: Partnership with OUC formed
• October 2017: Design based on OS Software Finalised
• January 2018: Implementation

Situation Analysis (2017)
www.data.gov.cy
• CY NODP developed in-house by DITS between 2014-2016
• Hosted on CY GOV Network
• In general it served the purpose it was built for
• As demand for open data increased various issues arose:
  o Relatively slow navigation
  o Limited Filters for data search
  o Absence of Datastore API (only metadata API)
  o Absence of data visualization tools

Identification of User Requirements

Improvements: Infrastructure
• Network speed
• Access from everywhere
• Flexibility
• Extendibility
• Scaling

Improvements: Functionality
• Accessibility
• Visualizations
• Publish Workflow Process
• APIs
• Supported metadata schema

Improvements: Training/Monitoring
• Workshops
• Webinars
• Surveys

Partnership with the OUC
The Open University of Cyprus
• Public University
• Distance Learning
• State-of-the-art eLearning Platform
• Support FOSS
• Open Technologies
• Access to Academic Network
• Advanced IT Infrastructure
September 2017: Memorandum of Understanding

Searching for a suitable solution
Selection criteria:
• Meet the requirements
• License-free
• Promote openness
• Successful deployments
• Technologies that can be supported by the team

Why DKAN
• Feature-rich
• Open source - GNU General Public License v2.0
• Based on a popular CMS (Drupal)
• Deployed in many Countries and Large Organizations
• Technologies that can be supported by the team
• Support team expertise
• Readymade API infrastructure
• Visualizations
• Modular architecture

DKAN Installations

92 documented worldwide installations Source: http://docs.getdkan.com/en/latest/introduction/dkan-sites.html

DKAN Open Data Platform-Features DKAN is a community-driven, free and open source open data platform that gives organizations and individuals ultimate freedom to publish and consume structured information.

DKAN Open Data Platform-Support Community

The OUC Infrastructure 100Mbps [1Gbps] CYNET Academic Network / GEANT

DKAN Installation
Current deployment state:
• DKAN 1.15.3
• Drupal 7.59
• Ubuntu 16.04 LTS
• Apache 2.4
• MariaDB
• PHP 7

The Cyprus Portal Development Process
• October 2017: Design Finalization
• November 2017: Implement Infrastructure
• December 2017: Setup Sandbox
• January 2018: Setup Production data.gov.cy
• May 2018: Launch
In parallel:
• December 2017: Data Migration
• January 2018: Data Validation

Migration of Data and Open Formats
Carried out by the PSI Team (not an automated process)
• Assessment of all published datasets: 1350 → 700 datasets
  o Approve / Discard
  o Organize (Merged to one or Split to many)
  o Checks for Metadata Completeness
• Format Transformation - Compatibility with Technical Framework: 480 datasets
  o Machine Readable
  o Open Formats: CSV instead of XLS, PDF, DOC
• Identification / Preparation & Publication of New Datasets: +300 datasets

The Cyprus National Open Data Portal
• Frontpage
  o Dataset Topics
  o Information
• Dataset
  o Visualization
  o Search
• APIs
  o Datastore
  o Other
• Backend
  o Administrative
  o Publish Workflow
• Support
  o Documentation
  o Training material

The Cyprus National Open Data Portal - FrontPage
• List of Topics (12)
• Datastories
• News Feed and social media
• Language Selection
• Direct links to training and support

The Cyprus National Open Data Portal - Datasets
• Datasets [1041]
  o From 87 Public Sector Organizations
• Resources [2691]
  o Majority CSV
• Machine-readable
  o JSON
  o XML

Organizing and searching datasets
Filter data per:
• Topic
• Organization
• Format
• Tag
• License Type

Dataset Metadata
• DCAT-AP
• Machine-readable
  o JSON
  o RDF

Visualizing Data Using the Portal Interface
• Grid format
• Graph format
• Map format

Share datasets and visualizations in Datastories

Access to Data and Metadata using APIs
The data.gov.cy provides access to the portal data and metadata using four (4) APIs:
• Datastore API: The DKAN Datastore houses any CSV files that have been uploaded and then imported into the Datastore and provides an endpoint to access them
• CKAN API: Harvest and index the Portal from other CKAN-based portals (CKAN harvester)
• Open Data APIs: Two Open Data Standards APIs: DCAT-AP and Project Open Data
• Dataset REST API: Access all Portal content nodes
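As a rough orientation for the four APIs listed above, the sketch below maps each one to a URL path. Only the Datastore search path appears in this presentation; the other paths are assumptions based on common DKAN 1.x defaults and may differ on data.gov.cy, so they should be checked against the portal's own API guides. The names `ASSUMED_ENDPOINTS` and `endpoint_url` are hypothetical helpers.

```python
from urllib.parse import urljoin

BASE_URL = "https://www.data.gov.cy/"

# Path for "datastore_search" is taken from the Datastore API example slide.
# All other paths are ASSUMED DKAN 1.x defaults, not confirmed for this portal.
ASSUMED_ENDPOINTS = {
    "datastore_search": "api/action/datastore/search.json",  # Datastore API
    "ckan_package_list": "api/3/action/package_list",        # CKAN API (assumed)
    "project_open_data": "data.json",                        # Project Open Data (assumed)
    "dcat_catalog": "catalog.xml",                           # DCAT (assumed)
    "dataset_rest": "api/dataset/node.json",                 # Dataset REST API (assumed)
}


def endpoint_url(name: str, base_url: str = BASE_URL) -> str:
    """Return the full URL for one of the endpoint names above."""
    return urljoin(base_url, ASSUMED_ENDPOINTS[name])


if __name__ == "__main__":
    for name in ASSUMED_ENDPOINTS:
        print(name, "->", endpoint_url(name))
```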

Datastore API - Endpoint

Datastore API - Parameters
Parameters
• resource_id (mixed) – id (string) or ids (array) of the resource(s) to be searched against
• filters (mixed) – array or string of matching conditions to select
• q (string) – fulltext search
• offset (int) – offset this number of rows
• limit (int) – maximum number of rows to return
• fields (array or comma separated string) – fields to return (default: all fields in original order)
• sort (string) – comma separated field names with ordering
• join (array) – array of fields to join from multiple tables
• group_by (array) – array of fields to group by
Aggregation functions
• sum (string) – field to compute the sum
• avg (string) – field to compute the average
• min (string) – field to compute the minimum
• max (string) – field to compute the maximum
• std (string) – field to compute the standard deviation
• variance (string) – field to compute the variance
Source: https://docs.getdkan.com/en/latest/apis/datastore-api.html
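A minimal sketch of how some of the parameters above could be combined into a Datastore search URL. It only builds the URL and makes no request. The helper `build_search_url` is hypothetical, and the `filters[field]=value` spelling is an assumption about how DKAN expects filter conditions to be written; consult the DKAN documentation cited above before relying on it.

```python
from urllib.parse import urlencode

SEARCH_URL = "https://www.data.gov.cy/api/action/datastore/search.json"


def build_search_url(resource_id, *, q=None, fields=None, filters=None,
                     offset=None, limit=None):
    """Assemble a Datastore search URL from a subset of the listed parameters."""
    params = [("resource_id", resource_id)]
    if q is not None:
        params.append(("q", q))
    if fields:
        # "fields" is documented as an array or comma separated string.
        params.append(("fields", ",".join(fields)))
    for field, value in (filters or {}).items():
        # ASSUMPTION: filter conditions are passed as filters[field]=value.
        params.append((f"filters[{field}]", value))
    if offset is not None:
        params.append(("offset", offset))
    if limit is not None:
        params.append(("limit", limit))
    return SEARCH_URL + "?" + urlencode(params)


if __name__ == "__main__":
    # Second page of 50 rows; "abc" is a placeholder resource id.
    print(build_search_url("abc", offset=50, limit=50))
```

Paging through a large resource then amounts to calling the helper repeatedly with an increasing `offset` and a fixed `limit`.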

Datastore API - Example https://www.data.gov.cy/api/action/datastore/search.json?resource_id=b0de1b05-88e7-4350-96d0-ad1266bf2b35&limit=5&language=en
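The example URL above could be consumed from a script along the following lines. This is a sketch under assumptions: the response is assumed to be a JSON object with a `success` flag and the rows under `result.records` (the CKAN-style layout DKAN is understood to follow), and the `SAMPLE_PAYLOAD` values are made up for illustration rather than taken from the portal.

```python
import json
from urllib.request import urlopen

EXAMPLE_URL = (
    "https://www.data.gov.cy/api/action/datastore/search.json"
    "?resource_id=b0de1b05-88e7-4350-96d0-ad1266bf2b35&limit=5&language=en"
)


def extract_records(payload: dict) -> list:
    """Return the rows from a Datastore search response (ASSUMED layout)."""
    if not payload.get("success", False):
        raise ValueError("datastore search reported failure")
    return payload.get("result", {}).get("records", [])


def fetch_records(url: str = EXAMPLE_URL, timeout: float = 30.0) -> list:
    """Download the URL and return its rows; requires network access."""
    with urlopen(url, timeout=timeout) as response:
        return extract_records(json.load(response))


# Made-up payload in the assumed layout, used to exercise extract_records offline.
SAMPLE_PAYLOAD = json.loads(
    '{"success": true, "result": {"records": [{"station": "A", "value": "1.2"}], "total": 1}}'
)

if __name__ == "__main__":
    print(extract_records(SAMPLE_PAYLOAD))
```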

Administration and Backend
• User Permission Controls
  o Administrators (OUC & PAPD)
  o Public sector users (publishers and moderators)
  o Public (Registered users)
• Simple forms to add/update Datasets and Resources

Publish Workflow
Roles
• Contributors (each organization)
• Moderators (currently very few)
• Super Moderator (PAPD)
Workflow (Organizations → www.data.gov.cy)
• Submit dataset and send for approval
• Approve and Publish
• Reject and Review

Monitoring and Logging
• Notification system (email)
• Error reporting
• Broken link reporting
• Custom reports (e.g., non-updated datasets)
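A custom report such as "non-updated datasets" could, in principle, compare each dataset's last modification date with its declared update frequency. The sketch below is illustrative only and is not the portal's implementation: the record layout, the `MAX_AGE` thresholds and the sample entries are all hypothetical.

```python
from datetime import date, timedelta

# HYPOTHETICAL thresholds: how old a dataset may be before it counts as non-updated.
MAX_AGE = {
    "Hourly": timedelta(days=1),
    "Daily": timedelta(days=2),
    "Monthly": timedelta(days=31),
    "bi-annual": timedelta(days=183),
}


def non_updated(datasets, today):
    """Return titles of datasets whose last update is older than allowed."""
    stale = []
    for ds in datasets:
        allowed = MAX_AGE.get(ds["frequency"])
        # Datasets with an unknown frequency are skipped rather than flagged.
        if allowed is not None and today - ds["modified"] > allowed:
            stale.append(ds["title"])
    return stale


# Made-up records for illustration; not real portal metadata.
SAMPLE_DATASETS = [
    {"title": "Air quality (historical)", "frequency": "Daily", "modified": date(2018, 9, 20)},
    {"title": "Companies registry", "frequency": "Monthly", "modified": date(2018, 6, 1)},
]

if __name__ == "__main__":
    print(non_updated(SAMPLE_DATASETS, date(2018, 9, 21)))
```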

Portal Statistics (Period: 5 May 2018 – 21 September 2018)
• Visitors: 14,600 (4,171/month)
• Visitors’ Countries: 90
• % Visitors from Abroad: 32%
• Page Views: 103,000
• Page Views per Day: 755
• % Access via API: 20%


Support, Documentation and Training

Guides for PSBs
• Data Selection
• Data Preparation
• Publishing Datasets
• Utilizing the DS APIs
• Roles and Levels of Authorization
• Creating CSVs

Developers Corner
• Data Cleaning Tools
• Programming Languages
• Base Maps
• Visualization Tools
• DKAN Guide
• DKAN API Guides

Technical Guidelines
• Data Selection
• Licensing
• Charges
• Formats
• Metadata
• Standards

Other Material
• Guides
• Information on DCAT-AP
• Presentations from Events / Conferences
• Videos

Support, Training and Promotion

Working with DKAN
• Powerful Tool
• Easy to set up/support/upgrade
• Try to stay close to the core
• Good Documentation
• Easily Extendable
• Bugs (found <10 in 1 year)
• Very good community support
• Issues Resolved Quickly (sometimes within days)
• Users engaged in prioritizing further DKAN development
• CY Team among contributors

What's Next
• Dataset statistics dashboard
• Extend search capabilities for datastores
• Translations (metadata, etc.)
• Create webinars for the public
• Create video guides for PSBs
• Increase engagement using datastories
• Challenges with prizes (substitute hackathons)

Examples of Datasets

Location and Measurements of E/M Radiation of Radio-communication Stations
Publisher: Department of Electronic Communications
Themes: Science and Technology
Formats: CSV, API (json)
Frequency: bi-annual
Tags: EM radiation

Real-Time Recordings of Meteorological Stations
Publisher: Department of Meteorology
Themes: Science and Technology
Formats: API (XML)
Frequency: Hourly
Related datasets: Weather Forecasts by Area

Real-Time and Historical Recordings of Air-Quality Stations (3 datasets)
Publisher: Department of Labour Inspection
Themes: Environment
Formats: API (xml) – real time data; CSV, API (json) – historical
Frequency: Hourly, Daily

Companies, Partnerships and Business Names Registry
Publisher: Department of Registrar of Companies and Official Receiver
Themes: Economy and Finance
Formats: CSV, API (json)
Frequency: Monthly
Related Datasets: statistics on registered companies, trade marks, etc.

Public Transport Datasets
Collection of 7 datasets (one per Bus Service Operator) that include:
• OSEL
• EMEL
• OSYPA
• OSEA
• ZINON
• Nicosia Small Buses
• INTERCITY
Publisher: Departments of Public Works
Themes: Transport
Formats: csv, shp, kml, json
Figure: EMEL Routes (shp)

Collection of Geospatial Data
Collection of 200+ Layers from the:
• CY Cadastral Map
• CY Topographic Map
• CY Hydrographic Map
• CY Administrative Map
+ CSS, hospitals, fishing shelters, natural trails, picnic sites, etc.
Publisher: mainly Department of Lands and Surveys
Themes: Regions, Cities and Zoning, Environment
Formats: wms, wfs, gml, shp, kml

CY NODP Satisfaction Survey (conducted between 15-24 September) System Usability Scale (SUS)=Good (72)

Dimitris Michail
Open Data Team
Department of Public Administration and Personnel
Ministry of Finance
Tel: +357 22 60 1523 / 1515 | Fax: +357 22 602763
Web: www.data.gov.cy | Twitter: @OpenDataCY

Christos Rodosthenous
Open Data Team
Information, Technology and Library Unit
Open University of Cyprus
Tel: +357 22 411783
Web: www.data.gov.cy | Twitter: @OpenDataCY

Thank you…