Metadata

Publications

Blog posts

Metadata schema development plans

Patricia Feeney – 2024 July 22

It’s been a while, here’s a metadata update and request for feedback

In Spring 2023 we sent out a survey to our community with a goal of assessing what our priorities for metadata development should be - what projects are our community ready to support? Where is the greatest need? What are the roadblocks?

The intention was to help prioritize our metadata development work. There’s a lot we want to do, a lot our community needs from us, but we really want to make sure we’re focusing on the projects that will have the most immediate impact for now.

Celebrating five years of Grant IDs: where are we with the Crossref Grant Linking System?

Kornelia Korzec, Ginny Hendricks – 2024 July 01

In Research FundersGrant Linking SystemInfrastructureMetadataIdentifiers

We’re happy to note that this month, we are marking five years since Crossref launched its Grant Linking System. The Grant Linking System (GLS) started life as a joint community effort to create ‘grant identifiers’ and support the needs of funders in the scholarly communications infrastructure.

The system includes a funder-designed metadata schema and a unique link for each award which enables connections with millions of research outputs, better reporting on the research and outcomes of funding, and a contribution to open science infrastructure. Our first activity to highlight the moment was to host a community call last week where around 30 existing and potential funder members joined to discuss the benefits and the steps to take to participate in the Grant Linking System (GLS).

Some organisations at the forefront of adopting Crossref’s Grant Linking System presented their challenges and how they overcame them, shared the benefits they are reaping from participating, and provided some tips about their processes and workflows.

The anatomy of metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 June 27

In MetadataLinkingMetadata MatchingData Science

In our previous blog post about metadata matching, we discussed what it is and why we need it (tl;dr: to discover more relationships within the scholarly record). Here, we will describe some basic matching-related terminology and the components of a matching process. We will also pose some typical product questions to consider when developing or integrating matching solutions.

Basic terminology

Metadata matching is a high-level concept, with many different problems falling into this category. Indeed, no matter how much we like to focus on the similarities between different forms of matching, matching affiliation strings to ROR IDs or matching preprints to journal papers are still different in several important ways. At Crossref and ROR, we call these problems matching tasks.

Metadata matching 101: what is it and why do we need it?

Dominika Tkaczyk, Adam Buttrick – 2024 May 16

In MetadataLinkingMetadata MatchingData Science

At Crossref and ROR, we develop and run processes that match metadata at scale, creating relationships between millions of entities in the scholarly record. Over the last few years, we’ve spent a lot of time diving into details about metadata matching strategies, evaluation, and integration. It is quite possibly our favourite thing to talk and write about! But sometimes it is good to step back and look at the problem from a wider perspective. In this blog, the first one in a series about metadata matching, we will cover the very basics of matching: what it is, how we do it, and why we devote so much effort to this problem.

2024 public data file now available, featuring new experimental formats

Patrick Polischuk – 2024 May 14

In MetadataAPIs

This year’s public data file is now available, featuring over 156 million metadata records deposited with Crossref through the end of April 2024 from over 19,000 members. A full breakdown of Crossref metadata statistics is available here.

Like last year, you can download all of these records in one go via Academic Torrents or directly from Amazon S3 via the “requester pays” method.

Download the file: The torrent download can be initiated here. Instructions for downloading via the “requester pays” method, along with other tips for using these files, can be found on the “Tips for working with Crossref public data files and Plus snapshots” page.

Common views and questions about metadata across Africa

Johanssen Obanda – 2024 April 24

In MetadataCommunityMeetings

This past year has been a captivating journey of immersion within the Crossref community, a mix of online interactions and meaningful in-person experiences. From the engaging Sustainability Research and Innovation Conference in Port Elizabeth, South Africa, to the impactful webinars conducted globally, this has been more than just a professional endeavour; it has been a personal exploration of collaboration, insights, and a shared commitment to pushing the boundaries of scholarly communication.

Subject codes, incomplete and unreliable, have got to go

Patrick Polischuk – 2024 March 13

In MetadataAPIs

Subject classifications have been available via the REST API for many years but have not been complete or reliable from the start and will soon be deprecated. dfdfd

The subject metadata element was born out of a Labs experiment intended to enrich the metadata returned via Crossref Metadata Search with All Subject Journal Classification codes from Scopus. This feature was developed when the REST API was still fairly new, and we now recognize that the initial implementation worked its way into the service prematurely.

RORing ahead: using ROR in place of the Open Funder Registry

Rachael Lammey – 2024 January 30

In RORMetadataOpen Funder Registry

A few months ago we announced our plan to deprecate our support for the Open Funder Registry in favour of using the ROR Registry to support both affiliation and funder use cases. The feedback we’ve had from the community has been positive and supports our members, service providers and metadata users who are already starting to move in this direction.

We wanted to provide an update on work that’s underway to make this transition happen, and how you can get involved in working together with us on this.

Increasing Crossref Data Reusability With Format Experiments

Martin Eve – 2024 January 19

In MetadataAPIs

Every year, Crossref releases a full public data file of all of our metadata. This is partly a commitment to POSI and partly just what we do. We want the community to re-use our metadata and to find interesting ends to which they can be put!

However, we have also recognized, for some time, that 170GB of compressed .tar.gz files, spread over 27,000 items, is not the easiest of formats with which to work. For instance, there’s no indexing capacity on these files, meaning that it is virtually impossible simply to pull out the record for a DOI. Decompressing the .tar.gz files takes a good three hours or more even on high-end hardware, without any additional processing.

Open Funder Registry to transition into Research Organization Registry (ROR)

Amanda French, Ginny Hendricks, Rachael Lammey, Fabienne Michaud, Maria Gould – 2023 September 07

In Open Funder RegistryResearch FundersROROrganisation IdentifierIdentifiersMetadata

There is some overlap between the Open Funder Registry and the Research Organization Registry (ROR), and funders and publishers have been asking us whether they should use Open Funder Registry IDs or ROR IDs to identify funders when they appear in both registries. We aim to merge the two registries over time. We will ensure Crossref members can use ROR to simplify persistent identifier integrations, to register better metadata, and to help connect research outputs to research funders.

RSS for this topic

Get involved

Find a service

Documentation

About us

2026 July 20

Why PID strategies need more than PIDs: our first position paper

2026 July 09

Schema 5.5 now available: adding CRediT, new record types for blogs and posters, and more

2026 July 02

Take part in UX Research at Crossref

2026 June 30

Building, refining, and connecting: summary of our May 2026 community update

Publications

Blog posts

Metadata schema development plans

It’s been a while, here’s a metadata update and request for feedback

Celebrating five years of Grant IDs: where are we with the Crossref Grant Linking System?

The anatomy of metadata matching

Basic terminology

Metadata matching 101: what is it and why do we need it?

2024 public data file now available, featuring new experimental formats

Common views and questions about metadata across Africa

Subject codes, incomplete and unreliable, have got to go

RORing ahead: using ROR in place of the Open Funder Registry

Increasing Crossref Data Reusability With Format Experiments

Open Funder Registry to transition into Research Organization Registry (ROR)

Topics

Archives