Metadata is one of the most important tools needed to communicate with each other about science and scholarship. It tells the story of research that travels throughout systems and subjects and even to future generations. We have metadata for organising and describing content, metadata for provenance and ownership information, and metadata is increasingly used as signals of trust.
Following our panel discussion on the same subject at the ALPSP University Press Redux conference in May 2024, in this post we explore the idea that metadata, once considered important mostly for discoverability, is now a vital element used for evidence and the integrity of the scholarly record.
For the third year in a row, Crossref hosted a roundtable on research integrity prior to the Frankfurt book fair. This year the event looked at Crossmark, our tool to display retractions and other post-publication updates to readers.
Since the start of 2024, we have been carrying out a consultation on Crossmark, gathering feedback and input from a range of members. The roundtable discussion was a chance to check and refine some of the conclusions we’ve come to, and gather more suggestions on the way forward.
https://doi.org/10.13003/ief7aibi
In our previous blog post in this series, we explained why no metadata matching strategy can return perfect results. Thankfully, however, this does not mean that it’s impossible to know anything about the quality of matching. Indeed, we can (and should!) measure how close (or far) we are from achieving perfection with our matching. Read on to learn how this can be done!
How about we start with a quiz?
We’re in year two of the Resourcing Crossref for Future Sustainability (RCFS) research. This report provides an update on progress to date, specifically on research we’ve conducted to better understand the impact of our fees and possible changes.
Crossref is in a good financial position with our current fees, which haven’t increased in 20 years. This project is seeking to future-proof our fees by:
Making fees more equitable Simplifying our complex fee schedule Rebalancing revenue sources In order to review all aspects of our fees, we’ve planned five projects to look into specific aspects of our current fees that may need to change to achieve the goals above.
The ‘research nexus’ is the vision to which we aspire:
A rich and reusable open network of relationships connecting research organizations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.
The research nexus goes beyond the basic idea of just having persistent identifiers for content. Objects and entities such as journal articles, book chapters, grants, preprints, data, software, statements, dissertations, protocols, affiliations, contributors, etc. should all be identified and that is still an important part of the picture. But what is most important is how they relate to each other and the context in which they make up the whole research ecosystem.
The foundation of the research nexus is metadata; the richer and more comprehensive the metadata in Crossref records, the more value there is for our members and for others, including for future generations.
Metadata and relationships between research objects and entities can support the whole scholarly research ecosystem in many ways, including:
Research integrity: helping to provide signals about the trustworthiness of the work including provenance information such as who funded it (when and for how much), which organizations and people contributed what, whether something was updated or corrected, and whether it was checked for originality. All of these signals can be expressed through Crossref metadata.
Reproducibility: helping others to reproduce outcomes by adding relationships between literature, data, software, protocols and methods, and more. All of these relationships can be asserted through members’ ongoing stewardship of their Crossref metadata records.
Reporting and assessment: helping organizations such as universities, funders, governments, to track and demonstrate the outcomes of investment; provide benchmarking information; show compliance with funder mandates; and decide what other research to fund. This kind of information can be included in Crossref metadata.
Discoverability: helping people and systems identify work through multiple angles. Registering content with Crossref makes it possible for work to be found and used. Thousands of systems use Crossref metadata, therefore the richer the records are, the more visibility there is likely to be of your work. Including metadata like abstracts and references are very simple ways to increase the visibility of your records.
The importance of relationships
A big part of the research nexus is establishing connections between and among different research objects which establishes provenance over time. Adding relationships to your metadata records can convey much richer and more nuanced connections beyond traditional references.
These relationships may consist of versions, corrections, translations, data, formats, supplements, and components. There are no extra fees for including relationships in your metadata.
What types of resources and records can be registered with Crossref?
We are working to make our input schema more flexible so that almost any type of object can be registered and distributed openly through Crossref. At the moment, members tend to register the following:
Conference proceedings: information about a single conference and records for each conference paper/proceeding.
Datasets: includes database records or collections.
Dissertations: includes single dissertations and theses, but not collections.
Grants: includes both direct funding and other types of support such as the use of equipment and facilities.
Journals and articles: at the journal title and article level, and includes supplemental materials as components.
Peer reviews: any number of reviews, reports, or comments attached to any other work that has been registered with Crossref.
Pending publications: a temporary placeholder record with minimal metadata, often used for embargoed work where a DOI needs to be shared before the full content is made available online.
Preprints and posted content: includes preprints, eprints, working papers, reports, and other types of content that has been posted but not formally published.