Getting Started
What is Linked Data?
The main difference between the web of hypertext and the Semantic Web is that while the first links in html pages or documents, the second calls for going beyond the concept document and link structured data. In this context, Linked Data is the set of best practices for publishing and connecting structured data on the Web. Its main objective is to liberate data from silos that are framed by proprietary database schemas following four rules, defined by Tim Berners-Lee in 2006:
- to use URIs (uniform resource identifiers) to identify resources uniquely;
- to use http URIs so people can access the information about the resource;
- to provide information about the resources using standard formats like RDF/XML; and
- to include links to other resources, URIs, enhancing the linking between different resources distributed on the web.
These principles are defined as rules, but in reality are rather recommendations or best practices for the development of the semantic web. You can publish data that meets only the first three principles, but the failure to implement the fourth makes data less visible and, therefore, less reusable.
What is RDF?
RDF is the Resource Description Framework for metadata on the Web developed by the W3C. It is based on the idea of declaring resources using the expression in the form subject-predicate-object. This expression is known as RDF triple. An RDF triple contains three components, all with its own URI:
- Subject, a URI, a person, or node, is the entity to which we refer;
- Predicate is the property or relationship you want to set about the subject;
- Object is the value of the property or another resource that establishes the relationship.
By using URIs to link data, the Semantic Web becomes a kind of large database that allows people and machines to explore the information referenced and interconnected. The Web-based on LOD is a breakthrough in content syndication, which uses external data sources to create new services.
What is Linked Open Data?
Linked Open Data (LOD) is Linked Data distributed under an open license that allows its reuse for free. In 2010, Tim Berners-Lee defined a 5-star rating scheme to encourage data providers to provide linked data under open licenses. The scheme uses gold stars to evaluate the availability of linked data as linked open data.
How to facilitate the linking between resources?
Simply transforming database schemas into RDF does not create linked data. There is a chance to get stuck at the 4th star in the 5-star rating scheme. To create automatic links between RDF triple stores on the web should be possible, otherwise there is a risk of creating RDF silos. The easiest way to facilitate the establishing of automatic linking between datasets is the use of standard vocabularies, including standard vocabularies for describing data/metadata elements and standard vocabularies for indicating values.
In order to give content providers with a set of recommendations that will support the selection of appropriate encoding strategies for producing LOD-enabled data, the AIMS Team plans to prepare a series of LODE recommendations that overarch a wide range of resource types including the encoding strategies for producing LOD-enabled bibliographical data as well as the encoding of value vocabularies used in describing agents, places, and topics in bibliographic data.
Further Reading
Adoption of the Linked Data Best Practices in Different Topical Domains
- Linked Data Competency Index: Mapping the field for teachers and learners (recorded Webinar@AIMS)
- Talk by Tim Berners-Lee at TED2009
- Linked Data: evolving the Web into a Global Data Space
- Tutorial: Introducing Linked Data and the Semantic Web
- Linguistic Linked Open Data Cloud
- The Linking Open Data Diagram
- Fabien Gandon. A Survey of the First 20 Years of Research on Semantic Web and Linked Data. Revue des Sciences et Technologies de l’Information - Série ISI : Ingénierie des Systèmes d’Information, Lavoisier, 2018