The main difference between the web of hypertext and the Semantic Web is that while the first links in html pages or documents, the second calls for going beyond the concept document and link structured data. In this context, Linked Data is the set of best practices for publishing and connecting structured data on the Web. Its main objective is to liberate data from silos that are framed by proprietary database schemas following four rules, defined by Tim Berners-Lee in 2006:
These principles are defined as rules, but in reality are rather recommendations or best practices for the development of the semantic web. You can publish data that meets only the first three principles, but the failure to implement the fourth makes data less visible and, therefore, less reusable.
RDF is the Resource Description Framework for metadata on the Web developed by the W3C. It is based on the idea of declaring resources using the expression in the form subject-predicate-object. This expression is known as RDF triple. An RDF triple contains three components, all with its own URI:
By using URIs to link data, the Semantic Web becomes a kind of large database that allows people and machines to explore the information referenced and interconnected. The Web-based on LOD is a breakthrough in content syndication, which uses external data sources to create new services.
Linked Open Data (LOD) is Linked Data distributed under an open license that allows its reuse for free. In 2010, Tim Berners-Lee defined a 5-star rating scheme to encourage data providers to provide linked data under open licenses. The scheme uses gold stars to evaluate the availability of linked data as linked open data.
Simply transforming database schemas into RDF does not create linked data. There is a chance to get stuck at the 4th star in the 5-star rating scheme. To create automatic links between RDF triple stores on the web should be possible, otherwise there is a risk of creating RDF silos. The easiest way to facilitate the establishing of automatic linking between datasets is the use of standard vocabularies, including standard vocabularies for describing data/metadata elements and standard vocabularies for indicating values.
In order to give content providers with a set of recommendations that will support the selection of appropriate encoding strategies for producing LOD-enabled data, the AIMS Team plans to prepare a series of LODE recommendations that overarch a wide range of resource types including the encoding strategies for producing LOD-enabled bibliographical data as well as the encoding of value vocabularies used in describing agents, places, and topics in bibliographic data.