Linked Open Data in Czech Open Data Portal Jakub Klímek Project: Implementace strategií v oblasti otevřených dat II Reg. number: CZ.03.4.74/0.0/0.0/15_025/0004172
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 2
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 3
Open Data Portal (ODP) Web and Wiki Web for everyone interested in OD Wiki with info for organizations publishing OD 4 On GitHub, using Jekyll
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 5
Open Formal Norms (OFN) Standards Anchored in Czech FOIA (106/1999) 6 ● Standard for Linked Data (itself) ○ What is URI, IRI, URL ○ IRI patterns ○ IRIs of global entities ■ Addresses ■ Regions ■ Buildings ■ … ○ RDF serializations ○ Common vocabularies ■ SKOS, The RDF Data Cube, DCAT, VoID ■ RDFS, OWL ● On GitHub, using W3C’s ReSpec
Open Formal Norms (OFN) Standards Anchored in Czech FOIA (106/1999) 7 Linked Data supported standards ● Standards for specific datasets ○ e.g. Addresses, Invoices, ... ● Conceptual overview ● Serializations in CSV, JSON, XML, RDF ● Examples of schemas ● Examples of SPARQL queries ● On GitHub, using W3C’s ReSpec
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 8
SPARQL Endpoint https://data.gov.cz/sparql ● OpenLink Virtuoso Open-Source ● DCAT-AP v1.2 content of National Open Data Catalog ● Harvested by EDP via SPARQL 1.1 HTTP Graph Store Protocol ● Used for statistics and metadata quality measurements ● Used for dereference of IRIs 9
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 10
National Open Data Catalog LinkedPipes DCAT-AP Viewer 11 ● LP-DAV ● Viewer for people ● Open-source, GitHub ● Built around DCAT-AP ○ in contrast to CKAN, DKAN ● Czech and English UI ● Uses Apache Solr and Apache CouchDB for faster access
National Open Data Catalog LinkedPipes DCAT-AP Viewer ● LP-DAV ● Viewer for people ● Open-source, GitHub ● Built around DCAT-AP ○ in contrast to CKAN, DKAN ● Czech and English UI ● Uses Apache Solr and Apache CouchDB for faster access ● Some custom features ○ e.g. multilingual EuroVoc theme word cloud => 12
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 13
National Open Data Catalog LinkedPipes DCAT-AP Forms ● LP-DAF ● Forms for people ● Open-source, GitHub ● Built around DCAT-AP ○ in contrast to CKAN, DKAN ● Czech and English UI ● Autocomplete for DCAT-AP codelists (Solr) ● Thorough help for each item ● Producing DCAT-AP v1.2 JSON-LD files 14
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 15
National Open Data Catalog Local Catalog API Standard (DCAT-AP) OFN “Definition of data catalog API” ● Generic data structure definition ○ Based on DCAT-AP v1.2 ● 3 API kinds supported ○ (extended) CKAN API ■ JSON based ■ Mapping of JSON to DCAT-AP ○ DCAT-AP v1.2 static files ■ Dereferencable Catalog IRI ■ Dereferencable Datasets IRI ○ DCAT-AP v1.2 SPARQL endpoint 16
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 17
National Open Data Catalog LinkedPipes ETL - Overview 18
National Open Data Catalog LP-ETL - Data input 19
National Open Data Catalog LP-ETL - input forms processing 20
National Open Data Catalog LP-ETL - DCAT-AP SPARQL endpoint local catalogs 21
National Open Data Catalog LP-ETL - DCAT-AP static files local catalogs 22
National Open Data Catalog LP-ETL - CKAN APIs local catalogs 23
National Open Data Catalog LP-ETL - Data output - dumps (RDF, CSV), endpoint 24
National Open Data Catalog LP-ETL - Metadata 25
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 26
National Open Data Catalog Deployment and documentation ● Open-Source, on GitHub ● All software used for the catalog is open-source 27
Czech Open Data Portal (ODP) architecture and LOD OFN https://data.gov.cz Web Wiki SPARQL endpoint Catalog Viewer Forms Harvester Local catalog (CKAN) Local catalog Local catalog 28