What is Open Data?
Open data is data that anyone can access, use and share. The Open Data Institute's Open Data FAQ has more information about open data. You may be interested in reading these guides, watching these videos or booking onto a training course.
What is Linked Open Data?
What are DaPaaS and DataGraft?
DataGraft reduces the cost of linked open data consumption by providing services for data transformation and access, such as data cleaning and scalable hosting for data and datasets.
We have validated the DataGraft platform and DaPaaS project through a use case with a smart cities approach, which focuses on aggregation of data from various open data sources and sensor networks.
When will DataGraft become available?
Who will benefit from DataGraft, and how?
The DataGraft platform provides tools and services for both data publishers and data consumers.
Data publishers can use DataGraft to create, transform, publish and host high quality linked open data in an easy and economically feasible way.
Data consumers benefit from DaPaaS as it provides commercial grade tools, clean and efficient APIs and better documentation so that they can reliably work with large datasets.
Example users include open data publishers, linked data developers and data scientists.
Can you give a typical example of where the DataGraft platform can be used?
Our primary use case, chosen by work package leaders Saltlux, is based around the creation of PLUQI (Personalised and Localised Urban Quality Index), a customisable index model and mobile/web application that can represent and visualise the level of well-being and sustainability for given cities based on individual preferences.
It covers and demonstrates several use cases including destination evaluation for business and leisure travellers, informing citizens about their surroundings, assisting local government in prioritisation of services and promoting and informing environmental management. Essentially, PLUQI provides all kinds of city-related data to users. This use case is being used to assess the requirements that have been defined for the DaPaaS project and DataGraft platform.
How is this whole platform different from AWS, Azure, AppEngine, etc.?
DataGraft focuses specifically on data hosting and access, with particular emphasis on Linked Data, so it is tailored more closely to the needs of data publishers and consumers than AWS, EC2 and other similar platforms are.
Most importantly, DataGraft is free to use, with customisable options if you have additional requirements.
Doesn’t DaPaaS replicate what other research projects (for example, the LOD2 project) are doing?
As open and linked data increasingly become part of everyday life, there is likely to be some overlap with other projects. However, we are confident that no perfect combination of tools and functionality for linked data platforms exists yet, which leaves scope for experimentation. Different platforms will also be aimed at different audiences.
The combination of data transformation and linked data hosting in one platform is something we haven't come across before. We're also hoping to make it usable by people who are not linked data experts - it’s a common objective of many projects, but not one that has been solved yet.
Throughout the project’s lifetime, we continue to ensure that we support and collaborate with other research projects wherever possible.
Will it all be open source?
The majority of DataGraft tools and assets created by the DaPaaS project will be open source - to clarify, any new code that is produced by the project is open source.
However, our prototype platform uses GraphDB, which is provided by one of the consortium partners, Ontotext, and isn’t open source. It has been successful and popular over the years with a wide range of organisations and industries.
Interaction with GraphDB is based on standard protocols like SPARQL and the standard Sesame/Sail interfaces, so if someone wanted to run a version of the platform with a different triple store that should be possible.
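As a rough illustration of what relying on standard protocols means in practice, the sketch below builds a query request in the style of the SPARQL 1.1 Protocol and reads a SPARQL JSON results document, using only the Python standard library. The endpoint URL, the query and the sample response are assumptions made up for this example; they are not real DataGraft addresses or data. Nothing in it is specific to GraphDB, which is why a different triple store could sit behind the same endpoint.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Hypothetical endpoint, for illustration only; not a real DataGraft address.
ENDPOINT = "http://example.org/sparql"


def build_sparql_request(endpoint: str, query: str) -> Request:
    """Build a 'query via GET' request as described by the SPARQL 1.1 Protocol."""
    url = endpoint + "?" + urlencode({"query": query})
    # Ask for the standard SPARQL JSON results format.
    return Request(url, headers={"Accept": "application/sparql-results+json"})


def parse_bindings(payload: str) -> list:
    """Flatten a SPARQL JSON results document into a list of plain dicts."""
    doc = json.loads(payload)
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in doc["results"]["bindings"]
    ]


QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 5"
request = build_sparql_request(ENDPOINT, QUERY)

# Sending the request would look like:
#   urllib.request.urlopen(request).read().decode("utf-8")
# A hand-written sample response stands in here so the sketch runs offline.
SAMPLE = (
    '{"head": {"vars": ["s"]}, "results": {"bindings": '
    '[{"s": {"type": "uri", "value": "http://example.org/a"}}]}}'
)
rows = parse_bindings(SAMPLE)
```

Pointing `ENDPOINT` at any store that implements the SPARQL protocol should be the only change needed to reuse this sketch.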
You can tinker with our GitHub repository and contribute to DataGraft and DaPaaS.
Why are you using proprietary databases/triplestores rather than open source ones?
Ontotext are leading on the platform development, and are using the system which they know best. In addition, GraphDB has capabilities for storing large quantities of data that will be useful for the project. The platform won't be tightly tied to a specific triple store.