Blog

How good is your matching?

Dominika Tkaczyk, Adam Buttrick – 2024 November 06

In MetadataLinkingMetadata MatchingData Science

In our previous blog post in this series, we explained why no metadata matching strategy can return perfect results. Thankfully, however, this does not mean that it’s impossible to know anything about the quality of matching. Indeed, we can (and should!) measure how close (or far) we are from achieving perfection with our matching. Read on to learn how this can be done!

How about we start with a quiz? Imagine a database of scholarly metadata that needs to be enriched with identifiers, such as ORCIDs or ROR IDs. Hopefully, by this point in our series this is recognizable as a classic matching problem. In searching for a solution, you identify an externally-developed matching tool that makes one of the below claims. Which of the following would demonstrate satisfactory performance?

Update on the Resourcing Crossref for Future Sustainability research

Kornelia Korzec, Amanda Bartell, Ginny Hendricks, Lucy Ofiesh, Ryan McFall – 2024 October 28

In FeesSustainability

We’re in year two of the Resourcing Crossref for Future Sustainability (RCFS) research. This report provides an update on progress to date, specifically on research we’ve conducted to better understand the impact of our fees and possible changes.

Crossref is in a good financial position with our current fees, which haven’t increased in 20 years. This project is seeking to future-proof our fees by:

Making fees more equitable
Simplifying our complex fee schedule
Rebalancing revenue sources

In order to review all aspects of our fees, we’ve planned five projects to look into specific aspects of our current fees that may need to change to achieve the goals above. This is an update on the research and discussions that have been underway with our Membership & Fees Committee and our Board, and what we’ve learned so far in each of these areas.

Meet the candidates and vote in our 2024 Board elections

Lucy Ofiesh – 2024 September 24

In BoardMember BriefingGovernanceElectionsMeetingsAnnual Meeting

On behalf of the Nominating Committee, I’m pleased to share the slate of candidates for the 2024 board election.

Each year we do an open call for board interest. This year, the Nominating Committee received 53 submissions from members worldwide to fill four open board seats.

We maintain a balanced board of 8 large member seats and 8 small member seats. Size is determined based on the organisation’s membership tier (small members fall in the $0-$1,650 tiers and large members in the $3,900 - $50,000 tiers). We have two large member seats and two small member seats open for election in 2024.

The myth of perfect metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 August 28

In MetadataLinkingMetadata MatchingData Science

In our previous instalments of the blog series about matching (see part 1 and part 2), we explained what metadata matching is, why it is important and described its basic terminology. In this entry, we will discuss a few common beliefs about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders. Spoiler alert: we are calling them myths because these beliefs are not true! Read on to learn why.

Re-introducing Participation Reports to encourage best practices in open metadata

Lena Stoll – 2024 July 25

In Participation ReportsMetadataBest Practices

We’ve just released an update to our participation report, which provides a view for our members into how they are each working towards best practices in open metadata. Prompted by some of the signatories and organizers of the Barcelona Declaration, which Crossref supports, and with the help of our friends at CWTS Leiden, we have fast-tracked the work to include an updated set of metadata best practices in participation reports for our members. The reports now give a more complete picture of each member’s activity.

Metadata schema development plans

Patricia Feeney – 2024 July 22

In Metadata

It’s been a while, here’s a metadata update and request for feedback

In Spring 2023 we sent out a survey to our community with a goal of assessing what our priorities for metadata development should be - what projects are our community ready to support? Where is the greatest need? What are the roadblocks?

The intention was to help prioritize our metadata development work. There’s a lot we want to do, a lot our community needs from us, but we really want to make sure we’re focusing on the projects that will have the most immediate impact for now.

Crossmark community consultation: What did we learn?

Martyn Rittman, Madhura Amdekar – 2024 July 02

In CrossmarkCommunity

In the first half of this year we’ve been talking to our community about post-publication changes and Crossmark. When a piece of research is published it isn’t the end of the journey—it is read, reused, and sometimes modified. That’s why we run Crossmark, as a way to provide notifications of important changes to research made after publication. Readers can see if the research they are looking at has updates by clicking the Crossmark logo. They also see useful information about the editorial process, and links to things like funding and registered clinical trials. All of this contributes to what we call the integrity of the scholarly record.

Celebrating five years of Grant IDs: where are we with the Crossref Grant Linking System?

Kornelia Korzec, Ginny Hendricks – 2024 July 01

In Research FundersGrant Linking SystemInfrastructureMetadataIdentifiers

We’re happy to note that this month, we are marking five years since Crossref launched its Grant Linking System. The Grant Linking System (GLS) started life as a joint community effort to create ‘grant identifiers’ and support the needs of funders in the scholarly communications infrastructure.

The system includes a funder-designed metadata schema and a unique link for each award which enables connections with millions of research outputs, better reporting on the research and outcomes of funding, and a contribution to open science infrastructure. Our first activity to highlight the moment was to host a community call last week where around 30 existing and potential funder members joined to discuss the benefits and the steps to take to participate in the Grant Linking System (GLS).

Some organisations at the forefront of adopting Crossref’s Grant Linking System presented their challenges and how they overcame them, shared the benefits they are reaping from participating, and provided some tips about their processes and workflows.

The anatomy of metadata matching

Dominika Tkaczyk, Adam Buttrick – 2024 June 27

In MetadataLinkingMetadata MatchingData Science

In our previous blog post about metadata matching, we discussed what it is and why we need it (tl;dr: to discover more relationships within the scholarly record). Here, we will describe some basic matching-related terminology and the components of a matching process. We will also pose some typical product questions to consider when developing or integrating matching solutions.

Basic terminology

Metadata matching is a high-level concept, with many different problems falling into this category. Indeed, no matter how much we like to focus on the similarities between different forms of matching, matching affiliation strings to ROR IDs or matching preprints to journal papers are still different in several important ways. At Crossref and ROR, we call these problems matching tasks.

Drawing on the Research Nexus with Policy documents: Overton’s use of Crossref API

Luis Montilla, Euan Adie – 2024 June 15

In APIsAPI Case Study

Update 2024-07-01: This post is based on an interview with Euan Adie, founder and director of Overton._

What is Overton?

Overton is a big database of government policy documents, also including sources like intergovernmental organisations, think tanks, and big NGOs and in general anyone who’s trying to influence a government policy maker. What we’re interested in is basically, taking all the good parts of the scholarly record and applying some of that to the policy world. By this we mean finding all the documents, finding what’s out there, collecting metadata for them consistently, fitting to our schema, extracting references from all the policy documents we find, adding links between them, and then we also do citation analysis.

RSS feed

Recent blog posts

Why PID strategies need more than PIDs: our first position paper

2026 July 20

Schema 5.5 now available: adding CRediT, new record types for blogs and posters, and more

2026 July 09

Take part in UX Research at Crossref

2026 July 02

Building, refining, and connecting: summary of our May 2026 community update

2026 June 30

Get involved

Find a service

Documentation

About us

2026 July 20

Why PID strategies need more than PIDs: our first position paper

2026 July 09

Schema 5.5 now available: adding CRediT, new record types for blogs and posters, and more

2026 July 02

Take part in UX Research at Crossref

2026 June 30

Building, refining, and connecting: summary of our May 2026 community update

How good is your matching?

Update on the Resourcing Crossref for Future Sustainability research

Meet the candidates and vote in our 2024 Board elections

The myth of perfect metadata matching

Re-introducing Participation Reports to encourage best practices in open metadata

Metadata schema development plans

It’s been a while, here’s a metadata update and request for feedback

Crossmark community consultation: What did we learn?

Celebrating five years of Grant IDs: where are we with the Crossref Grant Linking System?

The anatomy of metadata matching

Basic terminology

Drawing on the Research Nexus with Policy documents: Overton’s use of Crossref API

What is Overton?

Recent blog posts

Topics

Archives