METADATA for Description, Discovery & Contextualisation. Check RDA METADATA Catalog
Metadata is a part of the global conversation and is already recognized as a necessity for interoperability, discovery and contextualisation. Given that metadata is essential for data science endeavors, different communitites of practice keep sharing their knowledge regarding metadata development, harmonisation and adoption to help perform better throughout the research process...
METADATA ..."is data about data, information about information ... More comprehensive definitions address metadata as structured data supporting functions associated with an object, an object being any “entity, form, or mode” (Big Metadata, Smart Metadata, and Metadata Capital: Toward Greater Synergy Between Data Science and Metadata). |
INTEROPEOPERABILITY NEEDS METADATA THAT CAN COMMUNICATE CLEARLY
A major goal of Research Data Alliance (RDA) is sharing of research datasets. For this to scale beyond one researcher sending a dataset to another, interoperability is required using computer systems to discover, contextualise, select, access, transmit or process datasets.
This interoperability is achieved through the use of metadata characterising the objects (datasets, software, users, computing resources) and techniques to match and map those descriptions leading to generation of convertors for the underlying data instances.
Interoperation among many metadata models preserves the richness of the original schemes but uses techniques to establish relationships between attributes in the different schemes (matching and mapping).
The common schemes can be generalised across multiple domains for each required purpose such as discovery, contextualisation or connection of a dataset to a software service. The MIG (Metadata Interest Group) proposes that these general, canonical, common schemes are called ‘metadata packages’ used for various purposes (discovery, contextualisation, provenance…).
Below you can find an excerpt from the document entitled:
'METADATA PRINCIPLES & THEIR USE'...
... which is the result of a collaboration among the four ‘core’ metadata groups of the RDA, including:
(1.) Metadata Standards Catalog /Directory Working Group,
(2.) Data In Context Interest Group,
(3.) Research Data Provenance Interest Group, all coordinated by the above mentioned
(4.) Metadata Interest Group,
and which explains metadata in the perspective of Description, Discovery & Contextualisation of the most important elements of interoperability that define the syntax and the semantics for the research data exchange.
Metadata Principles | Illustration & Examples | |
1. | The only difference between METADATA and DATA is mode of use
| Consider a LIBRARY CATALOGUE stored electronically. |
2. | METADATA is not just for DATA, it is also for USERS, SOFTWARE SERVICES, COMPUTING RESOURCES
| In a VRE (Virtual Research Environment; e.g. BlueBridge VRE) the amount of work a researcher has to do manually just does not scale. Autonomic services are required. In order to achieve this DATA, SERVICES, USERS and COMPUTING RESOURCES need to be DESCRIBED to middleware which manages the scheduling, allocations, connection of the components etc. These descriptions are METADATA. |
3. | METADATA is not just for DESCRIPTION and DISCOVERY; it is also for CONTEXTUALISATION (relevance, quality, restrictions, rights, costs) and for coupling users, software and computing resources to data (to provide a VRE)
| Metadata for DISCOVERY (followed by manual selection and connection) is already achievable. [Check e.g., Meaningful Bibliographic Metadata (M2B): Recommendations of a set of metadata properties and encoding vocabularies; AIMS.FAO.ORG] However the selection of appropriate datasets (or software) is greatly enhanced by using CONTEXTUAL METADATA; that is metadata characterising the object of interest. Contextual metadata concerns PERSONS, ORGANISATIONS, PROJECTS, FUNDING, OUTPUTS (publications, products, patents), FACILITIES and EQUIPMENT – in short attributes which allow the end user (or software representing the end-user) to assess the relevance and quality of an object (dataset, software) for their current purpose. |
4. | METADATA must be MACHINE-understandable as well as HUMAN UNDERSTANDABLE for autonomicity (formalism)
| The mantra is formal SYNTAX and declared SEMANTICS. [Check e.g., LODE-BD Recommendations 2.0: How to select appropriate encoding strategies for producing Linked Open Data (LOD)-enabled bibliographic data; AIMS.FAO.ORG] This allows machine processing rather than manual processing. |
5. | MANAGEMENT (META)DATA is also relevant (research proposal, funding, project information, research outputs, outcomes, impact…) | Management metadata links with (3); the CONTEXTUAL METADATA can also be used for EVALUATION of research, policy-making and other management functions at institutional or funding organisation level.
|
To learn about certain implications (with illustrations and examples) the above principles lead to, and about several use cases describing utilisation of a VRE with information at each step on the metadata required, you are invited to access and Read the full version of the RDA Metadata Principles and accompanying commentary (check principles in the attached document).
You can track progress of a prototype RDA Metadata Standards Catalog by visiting the GitHub repository. The Catalog will be seeded with records migrated from the the Metadata Standards Directory (you can continue to add standards, profiles, tools and known users of standards). If you would like to help check, clean and expand the migrated records before they are loaded into the Catalog, you can find them in the 'db' folder on GitHub. |
META/DATA & INTEROPERABILITY : LOOKING FURTHER AHEAD : |
Submit an ABSTRACT at the IGAD RDA 13th pre-meeting (1 April 2019): 'AGRICULTURE DATA INTEROPERABILITY: Opportunities and Lessons Learnt from Sharing and Re-Using Data' (Deadline : 22 February 2019)
- Metadata Basics (Dublin Core Metadata Initiative)
Metadata Services & Metadata Quality Measurement and Improvement (Marcia Lei Zeng | Jian Qin)
Metadata as Standard: improving Interoperability through the Research Data Alliance (Recorded Webinar@AIMS)
- OpenAIRE Guidelines for data providers: new Metadata Application Profile for Literature Repositories (Webinars series for repository managers)
- OpenAIRE Guidelines for Literature Repositories (on OpenAIRE)
- OpenAIRE Guidelines for CRIS Managers (ZENODO)
- OpenAIRE Guidelines for CRIS Managers (on OpenAIRE)
- OpenAIRE Guidelines for Data Archives (on OpenAIRE)
- Building a Common Infrastructure: The Challenges of Modern Metadata (Recorded Webinar, Society for Scholarly Publishing / SSP)
- Metadata for 2020 and Beyond: Collaborative approaches to advancing metadata (from SciDataCon-IDW2018)
- METADATA 2020 : details and crosswalks of the recommendations. Can we agree?
- DCAT-AP: Promoting interoperability of data catalogues in Europe (slides, 21/03/2018, ISA2 PROGRAMME)
- Assigning METADATA as method to support DIGITAL DATA CURATION in trusted repositories
- DATACITE METADATA SCHEMA is a list of core metadata properties chosen for an accurate and consistent identification of a resource for citation and retrieval purposes, along with recommended use instructions
- GEOSPATIAL METADATA (FGDC.GOV: FEDERAL GEOGRAPHICAL DATA COMMITTEE)
- LRMI : Learning Resource Metadata Initiative (Dublin Core) has been adopted by and integrated within Schema.org under 'Creative Work' and includes things like learning resource type, intended audience, typical age range, and the very important AlignmentObject. This work has more recently been harmonized into other specifications/metadata models such as CEDS, and the new IMS Global LTI Resource Search standard. The LRMI taskforce is right now discussing specific vocabularies for resource types. Lastly, and in support of these initiative, the IMS Global CASE (Competencies and Academic Standards Exchange) address how to create and exchange a learning standards or competency framework (e.g. what a student must learn and demonstrate to provide learning or master)
- W3C OER (Open Education Resource) Schema Community
- Enabling FAIR Data across the Earth, Space, and Environmental Sciences (ESES)
- FAIR data: What and Why? Easier said than implemented?
- The Metadata Catalogue Explained: The metadata-body of each DMP entry contains the following information ... (LIBER)
- Machine-actionable Data Management Plans (maDMPs): Connecting the dots or...The Journey of a DMP
- Vocabularies for Semantic Interoperability in Agriculture: discover Agrisemantics map with up to 400 standards for data exchange
- Deposit your research data as FAIR Data with help of REPOSITORY FINDER!
- Find the DATA You need ... more easily with Google Dataset Search!
STANDING ON THE SHOULDERS OF GIANTS IS THE OPPOSITE OF REINVENTING THE WHEEL... |
You are invited to Sign up for AIMS News, follow @AIMS_Community on Twitter... And, thanks again for your interest!