<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Blog on Crossref</title><link>https://www.crossref.org/blog/</link><description>Recent content in Blog on Crossref</description><generator>Hugo 0.139.4</generator><language>en-us</language><managingEditor>support@crossref.org (Crossref/Cazinc/Benoît Benedetti)</managingEditor><webMaster>support@crossref.org (Crossref/Cazinc/Benoît Benedetti)</webMaster><lastBuildDate>Mon, 20 Jul 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://www.crossref.org/blog/" rel="self" type="application/rss+xml"/><item><title>Introducing board meeting summaries, starting with the January 2026 meeting</title><link>https://www.crossref.org/blog/introducing-board-meeting-summaries-starting-with-the-january-2026-meeting/</link><pubDate>Tue, 03 Mar 2026 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/introducing-board-meeting-summaries-starting-with-the-january-2026-meeting/</guid><description>&lt;h2 id="introducing-board-meeting-summaries">Introducing board meeting summaries&lt;/h2>
&lt;p>In an ongoing effort to make more of our operations transparent, we have decided to start sharing summaries of our board meetings on the blog. We already post our &lt;a href="https://www.crossref.org/board-and-governance#motions">board resolutions&lt;/a>, but the summaries will give a bit more information on what the board discusses that may or may not show up on the list of resolutions.&lt;/p>
&lt;p>We do observe the &lt;a href="https://en.wikipedia.org/wiki/Chatham_House_Rule" target="_blank">Chatham House Rule&lt;/a> for our board discussions, so we won&amp;rsquo;t disclose who says what, and there will still be executive sessions that discuss confidential matters that we can&amp;rsquo;t share. But those discussions constitute a minority of the time we spend together, so the summaries will cover much of what the board discusses.&lt;/p>
&lt;p>Let&amp;rsquo;s start with our recent meeting in January, which is online and lasts one hour.&lt;/p>
&lt;h2 id="crossref-board-of-directors-meeting-executive-summary-january-22nd-2026">Crossref Board of Directors Meeting Executive Summary, January 22nd, 2026&lt;/h2>
&lt;p>The January meeting is held primarily to conduct board business at the start of the board term.&lt;/p>
&lt;h3 id="board-business">Board business&lt;/h3>
&lt;p>The board conducted elections for this year’s board leadership, and approved each of the following positions:&lt;/p>
&lt;ul>
&lt;li>Board Chair: Marin Dacos&lt;/li>
&lt;li>Board Treasurer: James Phillpotts&lt;/li>
&lt;li>Executive Committee members (3 seats): Rose L’Huillier; Rebecca Wambua; and Aaron Wood&lt;/li>
&lt;li>Board Secretary and Assistant Secretary: Lucy Ofiesh, Ed Pentz&lt;/li>
&lt;li>Audit Committee Chair: Ashley Towne&lt;/li>
&lt;li>Nominating Committee Chair: Nick Lindsay&lt;/li>
&lt;/ul>
&lt;p>The board then turned to review and adopt the minutes from the November board meeting, December board meeting, and October executive committee meeting.&lt;/p>
&lt;p>The board discussed a proposal to update the financial authorization approval levels, a set of guidelines established by the board to define the limits on the approval of expenses that are part of Crossref’s Financial and Accounting Operating Policies. The board adopted changes to the policy that would bring it in line with the scale of Crossref’s business operations.&lt;/p>
&lt;h3 id="looking-ahead-to-2026">Looking ahead to 2026&lt;/h3>
&lt;p>Ed Pentz gave a brief overview of what the board can expect at the 2026 board meetings and reviewed topics that will be discussed at each meeting.&lt;/p>
&lt;p>The board next meets in early March.&lt;/p></description></item><item><title>Innovation in scientific publishing and its implications for Crossref DOI registration practices - MetaROR’s approach</title><link>https://www.crossref.org/blog/innovation-in-scientific-publishing-and-its-implications-for-crossref-doi-registration-practices-metarors-approach/</link><pubDate>Tue, 03 Feb 2026 00:00:00 +0000</pubDate><author>Ludo Waltman</author><guid>https://www.crossref.org/blog/innovation-in-scientific-publishing-and-its-implications-for-crossref-doi-registration-practices-metarors-approach/</guid><description>&lt;p>&lt;em>A couple of months ago, Ludo Waltman and André Brasil raised some questions about good practices for Crossref DOI registration, asking for input from the scholarly communication community. In this post, Ludo and André reflect on the input received and discuss the approach to DOI registration that the MetaROR publish-review-curate platform is going to take.&lt;/em>&lt;/p>
&lt;p>Practices for assigning DOIs and structuring the associated metadata are not merely technical details. They shape how scholarly outputs are discovered, cited, evaluated, indexed, and preserved over time. As new models of publishing emerge, especially those that decouple dissemination from evaluation, these infrastructural choices increasingly influence what counts as a scholarly object, as well as how credit and accountability mechanisms are organized.&lt;/p>
&lt;p>As editors of &lt;a href="https://metaror.org/" target="_blank">MetaROR (MetaResearch Open Review)&lt;/a>, a platform launched in 2024 and operating under the publish-review-curate model, we are interested in good practices for Crossref DOI registration in the context of innovative new approaches to scientific publishing. In the &lt;a href="https://doi.org/10.64000/76jhx-x7s23" target="_blank">earlier blog post&lt;/a>, we invited members of the broader scholarly communication community to share their perspective on the following two questions:&lt;/p>
&lt;ol>
&lt;li>For each article on the MetaROR platform, there is a corresponding article on a preprint server. Is it acceptable to have two Crossref DOIs, one registered by the preprint server and one registered by the MetaROR platform, for essentially the same article?&lt;/li>
&lt;li>If Crossref DOIs are registered for articles on the MetaROR platform, should the articles be assigned the type ‘journal-article’ or the type ‘preprint’ in their Crossref metadata, or something else entirely?&lt;/li>
&lt;/ol>
&lt;p>We were pleasantly surprised by the level of interest in these two questions. We received about 15 responses from colleagues in the scholarly communication community. Some colleagues posted a reply at the bottom of &lt;a href="https://doi.org/10.64000/76jhx-x7s23" target="_blank">our blog post&lt;/a>. Others responded on social media (&lt;a href="https://bsky.app/profile/ludowaltman.bsky.social/post/3lzpunhwv7k25" target="_blank">Bluesky&lt;/a>, &lt;a href="https://www.linkedin.com/posts/ludo-waltman-83a96a2_innovation-in-scientific-publishing-and-its-activity-7378017677300113408-6mJe?utm_source=share&amp;amp;utm_medium=member_desktop&amp;amp;rcm=ACoAAAB_ei4BJVfpY6PENFNnUrh2hpjTPZDmQdU" target="_blank">LinkedIn&lt;/a>) or shared their perspective by email.&lt;/p>
&lt;p>Below we reflect on the responses received and we outline the approach to Crossref DOI registration that MetaROR is going to take.&lt;/p>
&lt;h2 id="doi-registration-for-articles-on-the-metaror-platform">DOI registration for articles on the MetaROR platform&lt;/h2>
&lt;p>Colleagues offered mixed opinions on the question of whether articles on the MetaROR platform should have their own DOI, in addition to the DOI these articles have on the preprint server on which they were originally published. Some colleagues argued there is no good reason for registering DOIs for articles on the MetaROR platform and suggested this may cause confusion. &lt;a href="https://bsky.app/profile/richardsever.bsky.social/post/3lzvkzeuxbk2h" target="_blank">One colleague&lt;/a> reasoned that “if we want peer review to be something more ongoing and evolve beyond a single point in time judgment”, our approach should be to “better map the connections between events” rather than registering a new DOI each time an article has been peer-reviewed.&lt;/p>
&lt;p>However, other colleagues expressed support for registering DOIs for articles on the MetaROR platform. One colleague pointed out that this “allows the user to reference the exact artefact they have consulted”. This colleague also reminded us that in the past “people were worried about having a different DOI for a preprint and another for a VoR (version of record)”, while nowadays this is a generally accepted practice. &lt;a href="https://bsky.app/profile/samuelmoore.org/post/3lzvrebhxc22d" target="_blank">Another colleague&lt;/a> emphasized the value of decentralization and suggested to “let a thousand DOIs bloom”. &lt;a href="https://www.openscience.nl/en/cases/the-metaror-publish-review-curate-model-our-experience-as-authors" target="_blank">Authors of an article peer-reviewed by MetaROR&lt;/a> argued in favor of “an overarching DOI for the full package (preprint, reviews, author response and link to updated preprint)”, which in their view would make MetaROR’s “process more coherent”.&lt;/p>
&lt;p>Having considered the various arguments in favor of or against registering DOIs for articles on the MetaROR platform, we feel the arguments in favor are more compelling. Our perspective is that an article on the MetaROR platform differs in a meaningful way from the corresponding article on a preprint server, since the article on the MetaROR platform has been enriched with an evaluation by peer reviewers and editors. MetaROR provides a carefully curated package that includes not only the article itself, but also review reports and an editorial assessment. In our view, this justifies registering DOIs for articles on the MetaROR platform. We also see DOI registration for articles on the MetaROR platform as a way to promote appropriate recognition for authors of articles peer-reviewed by MetaROR, similar to the way authors get recognition for articles published in traditional peer-reviewed journals.&lt;/p>
&lt;p>Of course, when an article has multiple versions, each with their own DOI, it is important to establish a link between the different DOIs, indicating that the DOIs are associated with the same work. This is important for articles published first on a preprint server and then on a platform such as MetaROR just like it is important for articles published first on a preprint server and then in a peer-reviewed journal. In practice, we establish these links by registering relationships between DOIs in the associated metadata. In this way, we ensure that indexing services, discovery systems, and research analytics tools are able to recognize that the DOIs refer to different manifestations of the same work rather than independent outputs.&lt;/p>
&lt;h2 id="record-type-for-articles-on-the-metaror-platform">Record type for articles on the MetaROR platform&lt;/h2>
&lt;p>Our second question is about the record type to be used when registering a Crossref DOI for an article on the MetaROR platform. Many colleagues who provided input on this question argued there is a need for a new Crossref record type for ‘reviewed preprints’.&lt;/p>
&lt;p>We feel the idea of such a new record type is interesting and its pros and cons deserve further consideration. However, any solution that requires changes in Crossref’s metadata schema will take time to realize, while for MetaROR we need a solution in the short term. At the moment, the most obvious options for MetaROR therefore seem to be to use either the record type ‘journal-article’ or the record type ‘preprint’ (which is in fact a subtype of the record type ‘posted-content’).&lt;/p>
&lt;p>The use of the record type ‘preprint’ seems somewhat problematic to us, because preprints are typically understood to be articles that have not yet been formally peer-reviewed. In a way, articles on the MetaROR platform are the opposite of this, since these articles have undergone formal peer review. An article on the MetaROR platform is part of a package that also includes review reports and an editorial assessment. Such a package provides readers with a more informed understanding of an article than what they get from reading only the article itself. For this reason, we do not consider the record type ‘preprint’ to be suitable for articles on the MetaROR platform.&lt;/p>
&lt;p>Instead of the record type ‘preprint’, we have decided to use the record type ‘journal-article’ for articles on the MetaROR platform. The record type ‘journal-article’ is intended for articles published in journals. To be clear, MetaROR considers itself a ‘platform’, not a ‘journal’. However, the distinction between ‘platforms’ and ‘journals’ is not very well defined and the choice of terminology therefore involves a certain degree of arbitrariness. Moreover, articles on the MetaROR platform have been formally evaluated, and in that sense they resemble articles in traditional peer-reviewed journals. Although the nature of the evaluation is different (i.e., MetaROR provides a narrative assessment, while traditional journals provide a ‘stamp of approval’), we feel the resemblance justifies the use of the record type ‘journal-article’. We also hope that the use of this record type will help to ensure that articles evaluated by &lt;a href="https://doi.org/10.31222/osf.io/h7swt" target="_blank">publish-review-curate (PRC) platforms&lt;/a> are treated similarly to articles evaluated by traditional journals, advancing beyond &lt;a href="https://www.coalition-s.org/blog/how-the-web-of-science-takes-a-step-back/" target="_blank">more conservative ways&lt;/a> of dealing with articles on PRC platforms.&lt;/p>
&lt;p>There is a precedent for using the Crossref record type ‘journal-article’ for articles evaluated by PRC platforms. For over a decade, this approach has been used by &lt;a href="https://www.f1000.com/resources-for-researchers/where-to-publish-your-research/f1000-publishing-venues/" target="_blank">platforms operated by F1000&lt;/a>, such as F1000Research, Gates Open Research, Open Research Europe, and Wellcome Open Research. The approach we are taking at MetaROR is similar to the approach taken by these platforms. At the same time, our approach is different from the approach of &lt;a href="https://elifesciences.org/" target="_blank">eLife&lt;/a>, another prominent PRC platform. eLife uses the record type ‘preprint’ for all versions of an article on its platform except for the version that the authors consider to be final and that they choose to designate as the ‘version of record’. This version has the record type ‘journal-article’.&lt;/p>
&lt;h2 id="summary-of-metarors-approach-to-crossref-doi-registration">Summary of MetaROR’s approach to Crossref DOI registration&lt;/h2>
&lt;p>Figure 1 summarizes MetaROR’s approach to Crossref DOI registration. The figure considers the situation in which an article went through two rounds of peer review by MetaROR. Both rounds of peer review involved two reviewers. After two rounds of peer review by MetaROR, the article was published in a journal. We emphasize that journal publication is optional in MetaROR’s PRC approach. It is included in Figure 1 for the sake of completeness.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2026/metaror-crossref-doi-process.png"
alt="MetaROR’s approach to Crossref DOI registration" width="80%">&lt;figcaption>
&lt;p>Figure 1: MetaROR’s approach to Crossref DOI registration&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Each element in Figure 1 represents an item that has its own Crossref DOI. The shape of an element indicates the Crossref record type of an item (‘preprint’, ‘journal-article’, ‘peer-review’). MetaROR is responsible for the blue elements in the figure. The gray elements are the responsibility of other actors, either a preprint server or a journal. Arrows represent &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/relationships/">relationships between items&lt;/a>. These relationships are captured in the Crossref metadata of the various items.&lt;/p>
&lt;p>Figure 1 shows how MetaROR treats articles, review reports, editorial assessments, and author responses as first-class research objects. Each object has its own DOI, while the objects are linked through structured metadata. Assigning DOIs to review reports, editorial assessments, and author responses is central to our commitment to transparency, recognition, and reuse of evaluative contributions.&lt;/p>
&lt;p>We note that Figure 1 assumes each version of an article on a preprint server has its own DOI. This is indeed how DOI registration is handled by many preprint servers, such as the &lt;a href="https://www.cos.io/blog/doi-versioning-and-metaror" target="_blank">OSF servers&lt;/a> (e.g., MetaArXiv, PsyArXiv, SocArXiv), ChemRxiv, Research Square, and Preprints.org. However, some preprint servers use a single DOI for all versions of an article. This is the case for &lt;a href="https://openrxiv.org/dois-for-preprints/" target="_blank">bioRxiv and medRxiv&lt;/a> and also for &lt;a href="https://blog.arxiv.org/2022/02/17/new-arxiv-articles-are-now-automatically-assigned-dois/" target="_blank">arXiv&lt;/a>, which registers DOIs with DataCite rather than Crossref. In the future, we hope these preprint servers will also adopt versioned DOIs.&lt;/p>
&lt;h2 id="outlook">Outlook&lt;/h2>
&lt;p>Over the past 25 years, practices for registering DOIs and associated metadata have evolved along with broader developments in the scholarly communication landscape. Inevitably, DOI registration practices will always be lagging behind the most recent developments in scholarly communication. From this point of view, the lack of agreement on good practices for DOI registration in the context of PRC platforms is not surprising. This lack of agreement can in fact be seen as part of a larger discussion about the pros and cons of different infrastructural approaches for handling &lt;a href="https://doi.org/10.31222/osf.io/yu4sm" target="_blank">‘preprint review metadata’&lt;/a>, including for instance the &lt;a href="https://coar-notify.net/" target="_blank">COAR Notify approach&lt;/a> and the &lt;a href="https://docmaps.knowledgefutures.org/" target="_blank">DocMaps approach&lt;/a>.&lt;/p>
&lt;p>MetaROR’s approach to DOI registration demonstrates both the power and richness of Crossref’s metadata schema and its limitations. As discussed above, several colleagues who responded to &lt;a href="https://doi.org/10.64000/76jhx-x7s23" target="_blank">our earlier blog post&lt;/a> consider the lack of a record type for ‘reviewed preprints’ to be a significant limitation. With the &lt;a href="https://asapbio.org/reimagining-scholarly-publishing-outcomes-from-a-public-forum-to-discuss-the-publish-review-curate-prc-publishing-model/" target="_blank">growing interest in PRC models for scientific publishing&lt;/a>, there appears to be a need to systematically evaluate possible improvements that can be made to Crossref’s metadata schema to offer better support for new approaches to scientific publishing.&lt;/p>
&lt;p>We see this not only as a technical challenge but also as an issue of infrastructure governance. We therefore invite further dialogue between DOI registration agencies, other metadata infrastructures, preprint servers, PRC platforms, and indexing services to explore pathways for improving metadata standards, whether through new record types, extended relationship vocabularies, or shared best practices. We hope our experiences with MetaROR will contribute to the collective effort needed to ensure that emerging models of scholarly communication are represented accurately, transparently, and responsibly in the scholarly record.&lt;/p>
&lt;div class="shortcode-divwrap lightgrey-highlight">
&lt;span>&lt;strong>Crossref note:&lt;/strong> This discussion chimes with related plans for extending our schemas: more granular vocabulary for items within journal articles, preprints, reviews, and others; clearer relationship types; and support for the forthcoming NISO JAV recommendations. Our Preprint Advisory Group will discuss the topic this year, and our Metadata Advisory Group has both &amp;lsquo;journal article type vocab&amp;rsquo; and &amp;lsquo;relationships&amp;rsquo; on its radar for 2026. We look forward to engaging further on this topic as we work towards more flexible schemas in support of the Research Nexus.&lt;/span>
&lt;/div></description></item><item><title>A spotlight on our community in Indonesia</title><link>https://www.crossref.org/blog/a-spotlight-on-our-community-in-indonesia/</link><pubDate>Wed, 28 Jan 2026 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/a-spotlight-on-our-community-in-indonesia/</guid><description>&lt;p>&lt;a href="#translation-in-bahasa-indonesia">&lt;em>Click here for the translation in Bahasa Indonesia&lt;/em>&lt;/a>&lt;/p>
&lt;p>As Crossref celebrated its 25th anniversary last year, we are highlighting some of the most active and engaged regions in our global community.&lt;/p>
&lt;p>Over the past 25 years, the makeup of Crossref membership has evolved significantly; founded by a handful of large publishers, we now have more than 24,000 members representing 165 countries. Nearly two-thirds of them self-identify as universities, libraries, government agencies, foundations, scholar publishers, and research institutions.&lt;/p>
&lt;p>The Crossref community in Indonesia is by far the most dynamically growing region. Each year since 2017, we’ve seen the highest number of new members joining from the country. There are now over 4,400 members based in Indonesia who have registered the metadata for more than 2.6 million works, connecting their research to the global community.&lt;/p>
&lt;p>Indonesia also happens to be the &lt;a href="https://rpubs.com/saurabh90/ojs-stats-2022" target="_blank">largest user of OJS globally&lt;/a>, with close to 20,000 journals publishing on the platform. Most journals are published by universities, research institutions, and government agencies.&lt;/p>
&lt;p>There is a strong emphasis on publishing as part of completing a university degree. The Ministry of National Education policy requires all students to publish their research before graduation. To provide opportunities and accessible platforms for publication, Indonesian universities and faculties have established journals to help their students meet these requirements for graduation.&lt;/p>
&lt;p>Most journals in Indonesia are indexed in SINTA (Science and Technology Index), which is managed by the Ministry of Higher Education, Science, and Technology (MoHEST). The aim of SINTA is to improve journal quality, facilitate assessment, and increase the competitiveness of Indonesian journals. The use of DOIs is a requirement for indexing on the platform.&lt;/p>
&lt;p>Members know the value of persistent identifiers for their content, but many also realise the value of Crossref’s commitment to open metadata and the open scholarly record. Being a member of Crossref means being part of a larger community. While DOIs may be required for national indexing, organisations have various reasons for becoming Crossref members. One of the most important factors is to increase the global visibility of their content and, therefore, increase the impact of their publications.&lt;/p>
&lt;blockquote>
&lt;p>“We feel like we&amp;rsquo;re part of the Crossref community because we don&amp;rsquo;t just use your service; we contribute to it. By providing DOIs and metadata, we&amp;rsquo;re helping to build the open scholarly record that benefits everyone. Being a part of the Crossref network is more than just being a member—it&amp;rsquo;s about a shared vision. We see ourselves as active contributors. Every time we register a DOI and provide metadata, we add a new link to the global chain of knowledge. This helps ensure our research can be easily found, cited, and connected to other works, which benefits everyone.” — Nita Nurdiana, Universitas PGRI Palembang&lt;/p>
&lt;/blockquote>
&lt;p>We have very &lt;a href="https://www.crossref.org/community/ambassadors/">dedicated ambassadors&lt;/a> based in Indonesia who advocate for Crossref’s mission, Fauji Nurdin ST. Mudo and Zulidyana Rusnalasari. Each has been instrumental in organising in-person events and webinars for members, as well as in representing Crossref at events throughout the region.&lt;/p>
&lt;p>In October, as part of our 25th Anniversary celebration, the ambassadors, with the support of our Sponsor Relawan Jurnal Indonesia (RJI), held a &lt;a href="https://zenodo.org/records/17465236" target="_blank">satellite event&lt;/a> in Medan, which brought together participants from universities, publishers, government agencies, research institutes, non-governmental organisations, libraries, and museums. It provided a forum for dialogue around key topics in scholarly publishing.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2026/medan-group-satellite.png"
alt="Group photo of participants" width="100%">&lt;figcaption>
&lt;p>Crossref 25th Anniversary Satellite Event, Medan, October 2025&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>The majority of members in Indonesia work through one of our &lt;a href="https://www.crossref.org/membership/about-sponsors">regional sponsors&lt;/a>. Sponsors provide support to smaller organisations that often face financial, technical, and language barriers, making membership challenging. Their knowledge of the unique needs of their local publishing community and extensive networks help organisations learn more about Crossref in a more accessible way.&lt;/p>
&lt;p>Our first sponsor in Indonesia, Relawan Jurnal Indonesia (RJI), joined in 2017; we now have &lt;a href="https://www.crossref.org/membership/about-sponsors/">eight sponsors&lt;/a> that together support over 3,900 members in Indonesia.&lt;/p>
&lt;p>Our sponsors are also key partners in helping us engage with the community, facilitating webinars and supporting our in-person meetings. In August 2024, in collaboration with RJI, we held a two-day in-person event &lt;a href="https://zenodo.org/records/13380587" target="_blank">in Jakarta&lt;/a>, attended by over 100 members, and joined by our sponsors and ambassadors. Along with discussions on the fundamentals of Crossref and the role of quality metadata, we&amp;rsquo;ve heard from Ahmad Saefudin Surapermana, a sub-coordinator from ISSN Indonesia. Because so many members in Indonesia use the OJS publishing platform, colleagues from the Public Knowledge Project (PKP) joined us for a session on OJS plugins and an upgrade workshop for OJS system administrators. We continue to receive feedback from members that more regular in-person and online events should be held to facilitate connections and share developments.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2026/crossref-jakarta-group-photo.png"
alt="Group photo of participants" width="100%">&lt;figcaption>
&lt;p>Crossref Jakarta, August 2024&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>While interest in Crossref among this community is ever-growing, there are still painpoints for Indonesian members. Though many join through a Sponsor, some report challenges with metadata deposits, errors, and submission failures, and others struggle to navigate the documentation when technical issues arise. Some members have noted that our metadata requirements can be complex and that they struggle to achieve metadata completeness in their records. These concerns can be particularly challenging for institutions with limited resources.&lt;/p>
&lt;p>To provide additional support, we developed a &lt;a href="https://zenodo.org/records/17133113" target="_blank">series of webinars&lt;/a> in Bahasa Indonesia, covering topics such as using our &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> to assess metadata completeness and workshops on best practices for &lt;a href="https://www.youtube.com/watch?v=w5Ui5FYzrnc" target="_blank">using OJS&lt;/a>. These webinars have been some of the most attended by our members. The strong interest reflects the value these sessions bring to our community, and we continue to receive requests for additional training opportunities. In total, we welcomed 1,044 registrants and 501 attendees across our webinars last year. This level of participation highlights the importance of ongoing training and the enthusiasm of our members to engage, learn, and grow together.&lt;/p>
&lt;p>Despite some challenges, many members feel there is significant value in being a Crossref member. Including their metadata in Crossref enhances the visibility and accessibility of their journals globally. Because Crossref provides the infrastructure of persistent identifiers and open metadata, this ensures scholarly outputs are discoverable, connected, and part of a global research record.&lt;/p>
&lt;blockquote>
&lt;p>“Crossref&amp;rsquo;s vision of creating open, connected scholarly infrastructure directly supports our university&amp;rsquo;s core mission of advancing knowledge and research impact. As an academic institution, we rely on Crossref&amp;rsquo;s DOI system to ensure our faculty publications and institutional repository content remain permanently accessible and properly cited. This infrastructure is essential for maximizing the visibility and impact of our research output, which directly contributes to our university&amp;rsquo;s reputation and ranking. Additionally, Crossref&amp;rsquo;s commitment to open scholarly communication aligns with our values of making knowledge freely accessible, supporting our open access initiatives and helping us demonstrate research impact to funding bodies and stakeholders. The persistent linking system also supports our students and researchers in conducting reliable literature reviews and building upon existing scholarship with confidence that their citations will remain valid over time.” — Anggota dari STIS Darul Falah, Indonesia&lt;/p>
&lt;/blockquote>
&lt;p>Ratna Galuh Manika Trisista, from Universitas Islam Jakarta, has also illustrated how joining Crossref and stewardship of rich metadata supports the development of Indonesian journals in her presentation, &lt;a href="https://www.youtube.com/watch?v=0NPqLrPHhYA&amp;amp;t=9639s" target="_blank">Our Metadata Story: Improving Citation Visibility through Reference Linking&lt;/a> during the Crossref2025 Annual Meeting.&lt;/p>
&lt;p>As membership growth in Indonesia continues, we look forward to building relationships within the community, supported by our ambassadors, sponsors, and members&amp;rsquo; contributions.&lt;/p>
&lt;p>Much of the information in this report comes from a survey sent to our members, sponsors, and ambassadors in Indonesia. We appreciate all the feedback, comments, and suggestions we received, and we look forward to continuing our collaborations and increasing our engagement with the community.&lt;/p>
&lt;h3 id="translation-in-bahasa-indonesia">Translation in Bahasa Indonesia&lt;/h3>
&lt;p>Tahun lalu Crossref merayakan usia ke-25, dan momen ini menjadi kesempatan istimewa untuk menyoroti wilayah-wilayah yang paling aktif dan berperan penting dalam komunitas global Crossref. Salah satunya adalah Indonesia.&lt;/p>
&lt;p>Dalam perjalanan 25 tahun tersebut, keanggotaan Crossref telah berkembang pesat. Yang awalnya hanya digagas oleh beberapa penerbit besar, kini Crossref menaungi lebih dari 24.000 anggota dari 165 negara. Menariknya, hampir dua pertiga anggota Crossref saat ini berasal dari perguruan tinggi, perpustakaan, lembaga pemerintah, yayasan, penerbit ilmiah, serta institusi riset, menunjukkan semakin kuatnya peran komunitas akademik dalam ekosistem publikasi global.&lt;/p>
&lt;p>Indonesia menjadi wilayah dengan pertumbuhan komunitas paling dinamis di Crossref. Sejak tahun 2017, Indonesia secara konsisten mencatat jumlah anggota baru terbanyak setiap tahunnya. Saat ini, lebih dari 4.400 anggota Crossref berbasis di Indonesia telah mendaftarkan metadata untuk lebih dari 2,6 juta karya ilmiah. Kontribusi ini tidak hanya memperkuat visibilitas riset nasional, tetapi juga menghubungkan pengetahuan yang dihasilkan di Indonesia dengan komunitas ilmiah global.&lt;/p>
&lt;p>Pertumbuhan ini tentu tidak terjadi begitu saja. Ia lahir dari kerja kolektif para pengelola jurnal, penerbit perguruan tinggi, editor, dan komunitas akademik di Indonesia yang terus belajar, beradaptasi, dan saling berbagi praktik baik dalam tata kelola publikasi ilmiah. Semakin banyak institusi yang menyadari pentingnya metadata yang berkualitas, transparansi dalam publikasi, serta keterhubungan riset melalui DOI sebagai fondasi visibilitas dan keberlanjutan ilmu pengetahuan.&lt;/p>
&lt;p>Di berbagai forum, pelatihan, dan pendampingan komunitas, semangat kolaborasi ini terus tumbuh. Komunitas Crossref di Indonesia tidak hanya berkembang secara kuantitas, tetapi juga menunjukkan peningkatan kualitas dalam pengelolaan metadata, kepatuhan terhadap standar internasional, serta komitmen terhadap praktik publikasi ilmiah yang etis dan terbuka. Inilah yang menjadikan Indonesia bukan sekadar pengguna, melainkan kontributor aktif dalam ekosistem pengetahuan global.&lt;/p>
&lt;p>Indonesia juga dikenal sebagai pengguna Open Journal Systems (OJS) terbesar di dunia, dengan hampir 20.000 jurnal yang dikelola dan diterbitkan melalui platform ini. Sebagian besar jurnal tersebut diterbitkan oleh perguruan tinggi, lembaga riset, dan instansi pemerintah, yang menunjukkan kuatnya peran institusi akademik dan publik dalam ekosistem publikasi ilmiah nasional.&lt;/p>
&lt;p>Budaya publikasi ilmiah di Indonesia sangat erat kaitannya dengan dunia pendidikan tinggi. Kebijakan Kementerian Pendidikan Tinggi, Sains, dan Teknologi mewajibkan mahasiswa untuk mempublikasikan hasil penelitiannya sebagai salah satu syarat kelulusan. Untuk menjawab kebutuhan tersebut sekaligus menyediakan ruang publikasi yang inklusif dan mudah diakses, banyak universitas dan fakultas di Indonesia membentuk serta mengelola jurnal ilmiah mereka sendiri sebagai wadah bagi karya mahasiswa.&lt;/p>
&lt;p>Sebagian besar jurnal di Indonesia terindeks dalam SINTA (Science and Technology Index) yang dikelola oleh Kementerian Pendidikan Tinggi, Sains, dan Teknologi (MoHEST). SINTA bertujuan untuk meningkatkan kualitas jurnal, memfasilitasi proses penilaian, serta mendorong daya saing jurnal ilmiah Indonesia. Dalam konteks ini, penggunaan DOI menjadi salah satu persyaratan penting agar jurnal dapat terindeks di platform tersebut.&lt;/p>
&lt;p>Para anggota Crossref di Indonesia memahami pentingnya persistent identifiers untuk memastikan keberlanjutan dan keterlacakan karya ilmiah mereka. Namun, semakin banyak pula yang menyadari nilai lebih dari komitmen Crossref terhadap metadata terbuka dan rekam jejak ilmiah yang terbuka. Menjadi anggota Crossref bukan sekadar memenuhi kewajiban teknis, melainkan juga menjadi bagian dari komunitas global yang lebih besar. Meski DOI dibutuhkan untuk kepentingan pengindeksan nasional, banyak organisasi memilih bergabung dengan Crossref demi meningkatkan visibilitas global konten mereka—dan pada akhirnya, memperluas dampak dari publikasi yang dihasilkan.&lt;/p>
&lt;blockquote>
&lt;p>“Kami merasa menjadi bagian dari komunitas Crossref karena kami tidak hanya menggunakan layanannya, tetapi juga berkontribusi di dalamnya. Melalui pendaftaran DOI dan penyediaan metadata, kami ikut membangun rekam jejak keilmuan terbuka yang bermanfaat bagi semua. Menjadi bagian dari jejaring Crossref bukan sekadar status keanggotaan—ini adalah tentang visi bersama. Kami melihat diri kami sebagai kontributor aktif. Setiap kali mendaftarkan DOI dan metadata, kami menambahkan satu mata rantai baru dalam jejaring pengetahuan global. Hal ini memastikan riset kami dapat ditemukan, disitasi, dan terhubung dengan karya lain, sehingga memberi manfaat bagi semua pihak.”
— Nita Nurdiana, Universitas PGRI Palembang&lt;/p>
&lt;/blockquote>
&lt;p>Semangat kontribusi ini juga diperkuat oleh peran para &lt;a href="https://www.crossref.org/community/ambassadors/">ambassador&lt;/a> Crossref di Indonesia yang dengan penuh dedikasi mengadvokasi misi Crossref. Fauji Nurdin ST. Mudo dan Zulidyana Rusnalasari telah menjadi penggerak penting dalam penyelenggaraan berbagai kegiatan, mulai dari acara luring hingga webinar untuk para anggota, sekaligus mewakili Crossref dalam beragam forum di berbagai wilayah Indonesia.&lt;/p>
&lt;p>Pada bulan Oktober lalu, sebagai bagian dari perayaan ulang tahun ke-25 Crossref, para ambassador ini—dengan dukungan sponsor dari Relawan Jurnal Indonesia (RJI)—menyelenggarakan sebuah acara &lt;a href="https://zenodo.org/records/17465236" target="_blank">satelit di Medan&lt;/a>. Kegiatan ini mempertemukan peserta dari perguruan tinggi, penerbit, instansi pemerintah, lembaga riset, organisasi non-pemerintah, perpustakaan, hingga museum. Acara tersebut menjadi ruang dialog yang hidup untuk membahas isu-isu kunci dalam dunia publikasi ilmiah dan memperkuat jejaring kolaborasi lintas sektor.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2026/medan-group-satellite.png"
alt="Group photo of participants" width="100%">&lt;figcaption>
&lt;p>Crossref 25th Anniversary Satellite Event, Medan, October 2025&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Sebagian besar anggota Crossref di Indonesia bergabung dan beraktivitas melalui &lt;a href="https://www.crossref.org/membership/about-sponsors">sponsor regional&lt;/a>. Para sponsor ini berperan penting dalam mendampingi organisasi-organisasi kecil yang kerap menghadapi berbagai tantangan—mulai dari keterbatasan finansial, kendala teknis, hingga hambatan bahasa—yang membuat proses keanggotaan menjadi tidak selalu mudah. Dengan pemahaman yang kuat terhadap kebutuhan khas komunitas penerbitan lokal serta jejaring yang luas, para sponsor membantu organisasi mengenal dan memanfaatkan Crossref dengan cara yang lebih ramah dan mudah diakses.&lt;/p>
&lt;p>Sponsor pertama Crossref di Indonesia, Relawan Jurnal Indonesia (RJI), bergabung pada tahun 2017. Hingga kini, Indonesia telah memiliki &lt;a href="https://www.crossref.org/membership/about-sponsors/">delapan sponsor&lt;/a> yang secara kolektif mendukung lebih dari 3.900 anggota di seluruh Indonesia. Peran ini menjadikan para sponsor sebagai tulang punggung pertumbuhan dan keberlanjutan komunitas Crossref di tanah air.&lt;/p>
&lt;p>Lebih dari sekadar pendamping teknis, para sponsor juga menjadi mitra strategis dalam membangun keterlibatan komunitas—mulai dari memfasilitasi webinar hingga mendukung pertemuan luring. Pada Agustus 2024, misalnya, Crossref bekerja sama dengan RJI menyelenggarakan acara luring selama dua hari &lt;a href="https://zenodo.org/records/13380587" target="_blank">di Jakarta&lt;/a>, yang dihadiri oleh lebih dari 100 anggota. Selain diskusi mengenai dasar-dasar Crossref dan pentingnya metadata berkualitas, kegiatan ini juga menghadirkan Ahmad Saefudin Surapermana dari ISSN Indonesia, serta para sponsor dan ambassador Crossref. Mengingat banyaknya anggota di Indonesia yang menggunakan platform OJS, rekan-rekan dari Public Knowledge Project (PKP) turut bergabung untuk memberikan sesi khusus tentang plugin OJS serta lokakarya peningkatan versi bagi para administrator sistem OJS. Hingga kini, Crossref terus menerima masukan dari para anggota bahwa kegiatan luring dan daring yang lebih rutin sangat dibutuhkan—tidak hanya untuk memperkuat jejaring, tetapi juga untuk berbagi perkembangan terbaru dalam dunia publikasi ilmiah.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2026/crossref-jakarta-group-photo.png"
alt="Group photo of participants" width="100%">&lt;figcaption>
&lt;p>Crossref Jakarta, August 2024&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Seiring dengan meningkatnya minat komunitas ini terhadap Crossref, masih terdapat sejumlah tantangan (pain points) yang dirasakan oleh anggota di Indonesia. Meskipun banyak yang bergabung melalui sponsor, sebagian anggota melaporkan kendala dalam proses deposit metadata, munculnya error, hingga kegagalan pengiriman data. Ada pula yang merasa kesulitan menavigasi dokumentasi teknis ketika menghadapi permasalahan sistem. Beberapa anggota juga menilai bahwa persyaratan metadata Crossref cukup kompleks, sehingga mereka mengalami tantangan dalam mencapai kelengkapan metadata pada rekaman mereka. Kondisi ini tentu menjadi lebih berat bagi institusi dengan sumber daya yang terbatas.&lt;/p>
&lt;p>Untuk memberikan dukungan tambahan, Crossref kemudian mengembangkan rangkaian &lt;a href="https://zenodo.org/records/17133113" target="_blank">webinar&lt;/a> dalam Bahasa Indonesia, yang membahas topik-topik praktis seperti pemanfaatan &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> untuk menilai kelengkapan metadata, serta lokakarya praktik terbaik dalam &lt;a href="https://www.youtube.com/watch?v=w5Ui5FYzrnc" target="_blank">penggunaan OJS&lt;/a>. Webinar-webinar ini menjadi salah satu kegiatan dengan tingkat kehadiran tertinggi. Minat yang kuat mencerminkan nilai yang dibawa sesi ini bagi komunitas kami, dan Crossref terus menerima permintaan untuk pelatihan tambahan. Secara keseluruhan, kami menyambut 1.044 pendaftar dan 501 peserta dalam webinar sepanjang tahun 2025. Tingkat partisipasi ini menegaskan pentingnya pelatihan berkelanjutan serta antusiasme anggota kami untuk terlibat, belajar, dan berkembang bersama.&lt;/p>
&lt;p>Di balik berbagai tantangan tersebut, banyak anggota tetap merasakan nilai strategis dari keanggotaan Crossref. Penyertaan metadata jurnal ke dalam Crossref secara signifikan meningkatkan visibilitas dan aksesibilitas jurnal Indonesia di tingkat global. Melalui infrastruktur persistent identifiers dan metadata terbuka yang disediakan Crossref, keluaran ilmiah menjadi lebih mudah ditemukan, saling terhubung, dan tercatat sebagai bagian dari rekam jejak riset global.&lt;/p>
&lt;blockquote>
&lt;p>“Visi Crossref dalam membangun infrastruktur keilmuan yang terbuka dan saling terhubung sangat mendukung misi utama universitas kami dalam memajukan pengetahuan dan dampak riset. Sebagai institusi akademik, kami mengandalkan sistem DOI Crossref untuk memastikan publikasi dosen dan konten repositori institusi kami tetap dapat diakses secara permanen dan disitasi dengan tepat. Infrastruktur ini sangat penting untuk memaksimalkan visibilitas dan dampak luaran riset kami, yang secara langsung berkontribusi pada reputasi dan peringkat universitas. Selain itu, komitmen Crossref terhadap komunikasi ilmiah terbuka sejalan dengan nilai-nilai kami dalam membuka akses pengetahuan seluas-luasnya, mendukung inisiatif open access, serta membantu kami menunjukkan dampak riset kepada lembaga pendanaan dan para pemangku kepentingan. Sistem keterhubungan yang berkelanjutan ini juga mendukung mahasiswa dan peneliti kami dalam melakukan tinjauan pustaka yang andal, dengan keyakinan bahwa sitasi yang digunakan akan tetap valid dalam jangka panjang.”&lt;br>
— Anggota dari STIS Darul Falah, Indonesia&lt;/p>
&lt;/blockquote>
&lt;p>Pengalaman serupa juga disampaikan oleh Ratna Galuh Manika Trisista dari Universitas Islam Jakarta, yang memaparkan bagaimana keikutsertaan di Crossref dan pengelolaan metadata yang kaya dapat mendukung pengembangan jurnal Indonesia. Hal ini ia sampaikan dalam presentasinya berjudul “&lt;a href="https://www.youtube.com/watch?v=0NPqLrPHhYA&amp;amp;t=9639s" target="_blank">Our Metadata Story: Improving Citation Visibility through Reference Linking&lt;/a>” pada Crossref Annual Meeting 2025.
Seiring pertumbuhan keanggotaan Crossref di Indonesia yang terus berlanjut, kami menantikan penguatan relasi dengan komunitas—dengan dukungan para ambassador, sponsor, serta kontribusi aktif dari para anggota itu sendiri.&lt;/p>
&lt;p>Sebagian besar informasi dalam laporan ini bersumber dari survei yang dikirimkan kepada anggota, sponsor, dan ambassador Crossref di Indonesia. Kami sangat menghargai seluruh umpan balik, komentar, dan saran yang telah diberikan, dan berharap dapat terus melanjutkan kolaborasi serta meningkatkan keterlibatan bersama komunitas di masa mendatang.&lt;/p></description></item><item><title>Insights from a roundtable on author affiliation metadata</title><link>https://www.crossref.org/blog/insights-from-a-roundtable-on-author-affiliation-metadata/</link><pubDate>Thu, 22 Jan 2026 00:00:00 +0000</pubDate><author>Amanda French</author><guid>https://www.crossref.org/blog/insights-from-a-roundtable-on-author-affiliation-metadata/</guid><description>&lt;p>It’s been said that Americans are unusual in tending to ask “Where do you work?” as an initial question upon introduction to a new acquaintance, indicating a perhaps unhealthy preoccupation with work as identity. But in the context of published research, “What is this author&amp;rsquo;s affiliation?” is a question of global importance that goes beyond just wanting to know the name &amp;ndash; and perhaps prestige level &amp;ndash; of the place a researcher works.&lt;/p>
&lt;p>When collected, used, and analyzed at scale, data about author affiliations can provide intriguing insights about international collaboration trends, signal trust and lack of trust in particular research institutions, generate business intelligence for publishers, help universities track the work their researchers do, help funders demonstrate the impact of their funding, and much more.&lt;/p>
&lt;p>In November we partnered with &lt;a href="https://oaswitchboard.org/" target="_blank">OA Switchboard&lt;/a> to organize a roundtable on author affiliation metadata for the Crossref community, service and infrastructure providers, production vendors, data scientists, researchers, and librarians. We aimed to bring together scholarly information professionals with many diverse perspectives; ultimately, participants from more than 40 organizations joined the roundtable to share their experiences and their thoughts.&lt;/p>
&lt;p>In focusing on a single type of metadata, we hoped to focus our discussions, as well. Similarly, in October the Barcelona Declaration on Open Research Information organized &lt;a href="https://barcelona-declaration.org/news/20251023_community_roundtable/" target="_blank">a roundtable on &amp;ldquo;Moving Funding Metadata Forward&amp;rdquo;&lt;/a> in which it became clear that “improving the quality and coverage of funding metadata was on the agenda of many organisations and there was a strong interest in collaborating on practical next steps.”&lt;/p>
&lt;p>While many of the issues and solutions discussed at both roundtables are similar, in the course of the author affiliation metadata roundtable we identified some unique challenges as well as benefits related to this particular flavor of information. In this blog post, I’ll share these insights.&lt;/p>
&lt;h2 id="insights-from-presenters">Insights from presenters&lt;/h2>
&lt;p>I opened the roundtable with a brief introduction and a working definition of affiliation metadata: names and/or identifiers such as &lt;a href="https://www.crossref.org/community/ror/" target="_blank">Research Organization Registry (ROR)&lt;/a> IDs for organizations where research was conducted or with which authors and contributors are associated, usually officially, as in their place of employment.&lt;/p>
&lt;p>Next, to create a shared context for discussion, we heard four presentations on the current state of author affiliation metadata, its importance, and Crossref’s ongoing initiative to enhance it automatically.&lt;/p>
&lt;p>Nees Jan van Eck of Leiden University’s &lt;a href="https://www.universiteitleiden.nl/en/social-behavioural-sciences/cwts" target="_blank">Center for Science and Technology Studies (CWTS)&lt;/a> shared observations on the state of author affiliations from a preprint titled “&lt;a href="https://doi.org/10.31222/osf.io/smxe5_v2" target="_blank">Crossref as a source of open bibliographic metadata&lt;/a>” that presents the findings of an analysis performed annually since 2021. Nees’s key points:&lt;/p>
&lt;ul>
&lt;li>Crossref is a foundational data source for bibliographic metadata.&lt;/li>
&lt;li>Affiliation metadata is available for only 1 out of 3 journal articles in Crossref for the period 2023-2024.&lt;/li>
&lt;li>There is considerable variation in the extent to which Crossref members deposit affiliation metadata.&lt;/li>
&lt;li>Downstream sources try to fill gaps using suboptimal approaches, leading to missing, inaccurate, and inconsistent linking of publications to institutions.&lt;/li>
&lt;li>Publications lacking affiliation metadata in Crossref are less visible in bibliometric applications, analyses, studies, and tools (such as the &lt;a href="https://open.leidenranking.com/" target="_blank">open edition of the Leiden Ranking&lt;/a> of over 2800 universities).&lt;/li>
&lt;/ul>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2026/aff-roundtable-01.png"
alt="Crossref as a source of open bibliographic metadata.">
&lt;/figure>
&lt;br />&lt;/p>
&lt;p>Next, Yvonne Campfens of OA Switchboard reiterated the desirability of the Crossref community providing complete and accurate author affiliation metadata at the source. Yvonne called upon publishers to “Integrate metadata creation in your systems and workflows before publication and relay it throughout the editorial, production, and publication processes.”&lt;/p>
&lt;p>Yvonne pointed out that in the context of managing Open Access agreements, publishers ought to keep in mind that providing good affiliation metadata improves customer satisfaction, since institutions and consortia need to have that information in order to connect research to the correct organization. In closing, Yvonne featured best practices from &lt;a href="https://www.oaswitchboard.org/dqc-publisherbestpractices" target="_blank">OA Switchboard’s Data Quality Challenge&lt;/a>:&lt;/p>
&lt;ul>
&lt;li>eLife captures affiliations at submissions with “author select,” ensuring that ROR IDs are introduced early and verified before publication, coupled with a quality assurance process during proofing. (See also our piece on &lt;a href="https://doi.org/10.64000/3gcdf-23s29" target="_blank">Metadata Excellence Award winner eLife&lt;/a>.)&lt;/li>
&lt;li>EMS Press captures metadata via manuscript extraction as early as at submission, building on globally valid identifiers whenever possible (ROR IDs, DOIs, ORCIDs).&lt;/li>
&lt;li>Pensoft Publishers uses AI-assisted metadata extraction with human review and in-house metadata validation.&lt;/li>
&lt;li>Beilstein-Institut performs post-acceptance metadata quality assurance through automation and expert review.&lt;/li>
&lt;li>The Royal Society embeds metadata in OA payment and agreement workflows.&lt;/li>
&lt;li>American Chemical Society (ACS) has a multi-method persistent identifier matching strategy with near-complete coverage.&lt;/li>
&lt;li>The American Society for Microbiology (ASM) combines AI-powered submission tools with editorial oversight via expert manual checks. (See also our piece on &lt;a href="https://doi.org/10.64000/xxwy3-xhf38" target="_blank">Metadata Excellence Award winner ASM&lt;/a>.)&lt;/li>
&lt;li>Rockefeller University Press (RUP) maintains ROR IDs across the full publishing workflow with “author select” at submission through metadata deposits upon publication. (See also the &lt;a href="https://doi.org/10.71938/t63t-g186" target="_blank">ROR case study on RUP&lt;/a>.)&lt;/li>
&lt;/ul>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2026/aff-roundtable-02.png"
alt="Having great metadata improves your operational excellence.">
&lt;/figure>
&lt;br />&lt;/p>
&lt;p>Adam Day of &lt;a href="https://clear-skies.co.uk/" target="_blank">Clear Skies Ltd&lt;/a> began his talk by wryly framing the first and second rules of data science as contradictory: “Never fix data: always use sources that produce high-quality data in the first place,” but also “Get good at fixing data, because you will have to.” Adam went on to demonstrate the central role author affiliation metadata plays in research integrity investigations, displaying anonymized data for institutions with a high number of alerts. In conclusion, Adam reiterated the importance of author affiliation metadata to research integrity efforts:&lt;/p>
&lt;ul>
&lt;li>Data analysis is critical to research integrity.&lt;/li>
&lt;li>Quality data helps enormously by giving oversight, saving time, and assisting investigations.&lt;/li>
&lt;/ul>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2026/aff-roundtable-03.png"
alt="Value comes from data.">
&lt;/figure>
&lt;br />&lt;/p>
&lt;p>Lastly, our own Director of Technology &lt;a href="https://www.crossref.org/people/dominika-tkaczyk" target="_blank">Dominika Tkaczyk&lt;/a> gave an account of our plans to enrich author affiliation metadata by matching organization name text strings to &lt;a href="https://ror.org" target="_blank">ROR IDs&lt;/a> as part of our &lt;a href="https://www.crossref.org/community/special-programs/metadata-matching/" target="_blank">metadata matching&lt;/a> initiative. A strategy for performing such matching has already been developed and tested and an &lt;a href="https://doi.org/10.5281/zenodo.15254993" target="_blank">open dataset of results made available&lt;/a>. Tests on a set of 3,000 affiliations sampled from our metadata show that the strategy can be expected to match 95 million ROR IDs to organization names with 97.35% precision, an astronomical increase over the less than 1 million ROR IDs deposited in Crossref records to date.&lt;/p>
&lt;p>Dominika concluded the presentation portion of the session by reiterating that our planned enrichment of author affiliation metadata&lt;/p>
&lt;ul>
&lt;li>Will use flexible and transparent matching strategies (and &lt;a href="https://gitlab.com/crossref/labs/marple/-/tree/main/strategies_available/affiliation_single_search" target="_blank">open code&lt;/a>),&lt;/li>
&lt;li>Will welcome community participation in developing new strategies, and&lt;/li>
&lt;li>Will be available in the REST API.&lt;/li>
&lt;/ul>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2026/aff-roundtable-04.png"
alt="Matching affiliation strings to ROR IDs.">
&lt;/figure>
&lt;br />&lt;/p>
&lt;p>Automatic matching of organization names to ROR IDs in author affiliations cannot solve the problem of missing organization names, of course, but it represents a huge leap forward in addressing metadata quality issues.&lt;/p>
&lt;p>All of our speakers&amp;rsquo; presentations are available on Zenodo at &lt;a href="https://doi.org/10.13003/661591chqlyw" target="_blank">https://doi.org/10.13003/661591chqlyw&lt;/a>.&lt;/p>
&lt;h2 id="insights-into-challenges">Insights into challenges&lt;/h2>
&lt;p>In the next stage of the event, participants broke into six breakout groups to identify factors contributing to incomplete or inaccurate affiliation metadata. Participants were pre-assigned to groups randomly by role to ensure a variety of perspectives in every discussion.&lt;/p>
&lt;p>At least two participants, it should be noted, pointed out that it would be helpful to agree on a definition of “complete” and “accurate” affiliation metadata, which in itself is a challenge, and one we did not address in this roundtable. For instance, practices most recently have trended away from defining a complete author affiliation in open metadata as including an institutional address, although many internal databases might include such information separately.&lt;/p>
&lt;p>Even without such definitions, however, all six groups were able to identify several general areas for attention, and one participant provided a particularly helpful categorization of these areas that is largely reused here.&lt;/p>
&lt;h3 id="inherent-data-complexity">Inherent data complexity&lt;/h3>
&lt;ul>
&lt;li>Research organizations have names in different languages, abbreviations, and many other name variants.&lt;/li>
&lt;li>Research organizations have frequent name changes, mergers, and rebranding.&lt;/li>
&lt;li>Research organizations have different degrees, levels, and complexity of hierarchical granularity, and authors, publishers, and software systems are often misaligned as to which level in an organization&amp;rsquo;s structure is appropriate to use in a particular instance.&lt;/li>
&lt;li>Research organizations often lack official policies on how affiliations should be written, leading to hundreds of variations for a single institution.&lt;/li>
&lt;/ul>
&lt;h3 id="author-related-issues">Author-related issues&lt;/h3>
&lt;ul>
&lt;li>Corresponding authors often submit information for all co-authors, which can lead to inaccuracies.&lt;/li>
&lt;li>Many authors have multiple profiles across multiple submission systems, which can introduce errors.&lt;/li>
&lt;li>Authors may have “octopus affiliations,” claiming affiliations with many institutions that are difficult to verify.&lt;/li>
&lt;li>Authors may fail to update affiliations when changing institutions between manuscript acceptance and publication.&lt;/li>
&lt;li>Authors may demonstrate &amp;ldquo;apathy&amp;rdquo; when repeatedly filling out submission forms, sometimes providing incomplete, inconsistent, or incorrect information.&lt;/li>
&lt;li>On occasion, authors might even provide false or purchased affiliations, which of course is a significant research integrity concern.&lt;/li>
&lt;/ul>
&lt;h3 id="technical-barriers">Technical barriers&lt;/h3>
&lt;ul>
&lt;li>Many manuscript tracking and peer review systems, especially legacy systems, lack structured fields for affiliations or don&amp;rsquo;t support open organization identifiers like ROR.&lt;/li>
&lt;li>Some systems limit authors to a single affiliation, despite many researchers having multiple institutional connections.&lt;/li>
&lt;li>Some systems only collect affiliation information for the corresponding author.&lt;/li>
&lt;li>Some systems link affiliations to user accounts instead of to publications.&lt;/li>
&lt;li>Different systems use competing identifier registries, including proprietary identifier registries, creating interoperability challenges.&lt;/li>
&lt;/ul>
&lt;h3 id="publisher-practices">Publisher practices&lt;/h3>
&lt;ul>
&lt;li>Even when publishers improve current metadata collection practices, historical data correction is resource-intensive and often not prioritized.&lt;/li>
&lt;li>Publishers collect affiliation information at submission but don’t ensure that it is maintained throughout all stages of the publication process and deposited in metadata.&lt;/li>
&lt;li>Some publishers are unaware of the importance of author affiliation metadata or do not prioritize its improvement.&lt;/li>
&lt;li>Some publishers deliberately choose not to deposit affiliation metadata to Crossref, viewing it as value-added information they&amp;rsquo;ve invested in curating.&lt;/li>
&lt;/ul>
&lt;h2 id="insights-into-solutions">Insights into solutions&lt;/h2>
&lt;p>Naturally, we didn’t rest at identifying challenges: after a break, we gathered in the same groups to brainstorm approaches to improving author affiliation metadata.&lt;/p>
&lt;h3 id="adopt-collective-approaches">Adopt collective approaches&lt;/h3>
&lt;ul>
&lt;li>Collective action, where corrections and improvements made by various stakeholders flow back into shared systems, has historically worked for proprietary systems and could be even more powerful with open infrastructure.&lt;/li>
&lt;li>Since those who do not provide metadata “upstream” will inevitably have it provided for them “downstream” by multiple separate entities using multifarious methods, provenance metadata indicating who asserted author affiliations and how (whether automatically or with the author’s or editor’s input) would help metadata users assess trust levels.&lt;/li>
&lt;/ul>
&lt;h3 id="engage-authors-and-institutions">Engage authors and institutions&lt;/h3>
&lt;ul>
&lt;li>Reach out to authors and institutions to educate them on the need for more consistent affiliation reporting, especially in terms of language, name format, and degree of hierarchical granularity.&lt;/li>
&lt;li>Demonstrate the benefit to institutions of maintaining accurate records in registries like ROR, including abbreviations and name variants.&lt;/li>
&lt;li>Publishers and/or software systems should allow authors to review (though not necessarily edit) affiliation information during the proofing process to verify accuracy. Authors should not, however, need to know, see, or use ROR IDs.&lt;/li>
&lt;/ul>
&lt;h3 id="improve-the-tech">Improve the tech&lt;/h3>
&lt;ul>
&lt;li>Publishers would welcome submission systems that incorporate structured fields for author affiliations with well-designed auto-suggestions linked to ROR or other organization identifiers.&lt;/li>
&lt;li>Making affiliation data mandatory at submission could significantly improve capture rates, although it would be important to ensure that independent researchers can use these systems as well.&lt;/li>
&lt;li>Enable collection of affiliations for all authors, not just the corresponding author.&lt;/li>
&lt;li>Pull in &lt;a href="https://info.orcid.org/trust-markers-in-orcid-records-verified-email-domains/" target="_blank">verified affiliation information from ORCID&lt;/a>.&lt;/li>
&lt;li>Increasingly, intelligent matching systems can be implemented to reduce author burden and perhaps also increase accuracy and completeness of metadata.&lt;/li>
&lt;li>Better crosswalks between different organization identifier systems would make it vastly easier for publishers to maintain better metadata. Since open registries cannot include proprietary information, proprietary registries should provide their customers with crosswalks to all standard open identifiers.&lt;/li>
&lt;/ul>
&lt;h3 id="encourage-publisher-best-practices">Encourage publisher best practices&lt;/h3>
&lt;ul>
&lt;li>Publishers can use already-available tools to help assess and improve the quality of both new and legacy author affiliation metadata.
&lt;ul>
&lt;li>&lt;a href="https://crossref.org/members/prep/" target="_blank">Crossref’s Participation Reports&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://graph.openaire.eu/docs/10.5.1/graph-production-workflow/enrichment-by-mining/affiliation_matching/" target="_blank">OpenAIRE&amp;rsquo;s affiliation matching methods and validation systems&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://ror.readme.io/docs/api-affiliation" target="_blank">ROR API affiliation matching service&lt;/a>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Share the benefits of improved author affiliation metadata for internal and external analytics, customer satisfaction, and research integrity.&lt;/li>
&lt;li>Identify best practices in collecting and structuring author affiliation metadata.&lt;/li>
&lt;li>Understand that the entire research ecosystem would benefit from publishers sharing collected affiliation data with Crossref.&lt;/li>
&lt;/ul>
&lt;p>It’s worth mentioning that these solutions are heterogeneous: not all strategies can be implemented by any one actor nor even by any one sector of our profession. Clearly, collaborative action is necessary for substantive change.&lt;/p>
&lt;h2 id="moving-forward">Moving forward&lt;/h2>
&lt;p>The affiliations metadata roundtable represented an important step in addressing affiliation metadata challenges in a productive and collaborative way. If there was a consensus, it was that while perfect completeness and accuracy of author affiliation metadata may not be achievable (or even definable), incremental improvements can substantially enhance the quality and availability of affiliation metadata for the entire scholarly information community.&lt;/p>
&lt;p>Here at Crossref, we intend to use the insights from this roundtable to inform our support of the Crossref community, including publishers, service providers, and metadata users. We welcome your comments, questions, and suggestions on this issue! &lt;strong>Share your thoughts with Amanda French at &lt;a href="mailto:alfrench@crossref.org">alfrench@crossref.org&lt;/a>.&lt;/strong>&lt;/p>
&lt;h3 id="references">References&lt;/h3>
&lt;ol>
&lt;li>van Eck, N. J., &amp;amp; Waltman, L. (2025). Crossref as a source of open bibliographic metadata (No. smxe5_v2). MetaArXiv. &lt;a href="https://doi.org/10.31222/osf.io/smxe5_v2" target="_blank">https://doi.org/10.31222/osf.io/smxe5_v2&lt;/a>&lt;/li>
&lt;li>Tkaczyk, D. (2025). Crossref relationships involving research organisations [Dataset]. Zenodo. &lt;a href="https://doi.org/10.5281/zenodo.15254993" target="_blank">https://doi.org/10.5281/zenodo.15254993&lt;/a>&lt;/li>
&lt;li>French, A., van Eck, N. J., Campfens, Y., Day, A., &amp;amp; Tkaczyk, D. (2026, January 19). Affiliations Metadata Roundtable 2025—All Presentations. &lt;a href="https://doi.org/10.13003/661591chqlyw" target="_blank">https://doi.org/10.13003/661591chqlyw&lt;/a>&lt;/li>
&lt;/ol>
&lt;h3 id="participating-organizations">Participating organizations&lt;/h3>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: center">&lt;/th>
&lt;th>&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: center">Africa PID Alliance / TCC Africa&lt;/td>
&lt;td style="text-align: center">Frontiers Media SA&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">American Association of Cancer Research (AACR)&lt;/td>
&lt;td style="text-align: center">Iowa State&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">American Chemical Society (ACS)&lt;/td>
&lt;td style="text-align: center">Kriyadocs / Exeter Premedia Services&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">American Physical Society (APS)&lt;/td>
&lt;td style="text-align: center">MDPI&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">American Society for Microbiology (ASM)&lt;/td>
&lt;td style="text-align: center">Noyam Publishers&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Aptara&lt;/td>
&lt;td style="text-align: center">OpenAIRE / OpenOrgs&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Australian Research Data Commons (ARDC)&lt;/td>
&lt;td style="text-align: center">Optica Publishing Group&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Atypon&lt;/td>
&lt;td style="text-align: center">ORCID&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Beilstein-Institut&lt;/td>
&lt;td style="text-align: center">Oxford University Press&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">California Digital Library (CDL)&lt;/td>
&lt;td style="text-align: center">Public Knowledge Project (PKP)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Cambridge University Press&lt;/td>
&lt;td style="text-align: center">Public Library of Science (PLOS)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Carnegie Mellon University&lt;/td>
&lt;td style="text-align: center">River Valley Technologies&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">CHORUS&lt;/td>
&lt;td style="text-align: center">Rockefeller University Press&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Clarivate / Web of Science &lt;/td>
&lt;td style="text-align: center">SAGE Publications&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Copernicus GmBH&lt;/td>
&lt;td style="text-align: center">Barcelona Declaration on Open Research Information &lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Curtin University / Curtin Open Knowledge Initiative (COKI)&lt;/td>
&lt;td style="text-align: center">Silverchair / ScholarOne&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">De Gruyter Brill&lt;/td>
&lt;td style="text-align: center">Springer Science &amp;amp; Business&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Digital Science / Figshare&lt;/td>
&lt;td style="text-align: center">TNQTech&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Digital Science / Symplectic Elements&lt;/td>
&lt;td style="text-align: center">University of Laval&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">eLife&lt;/td>
&lt;td style="text-align: center">University of Chicago Press&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Elsevier BV&lt;/td>
&lt;td style="text-align: center">University of Split&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">Enago&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table></description></item><item><title>The GEM program - Year Three and program expansion for 2026</title><link>https://www.crossref.org/blog/the-gem-program-year-three-and-program-expansion-for-2026/</link><pubDate>Wed, 14 Jan 2026 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/the-gem-program-year-three-and-program-expansion-for-2026/</guid><description>&lt;p>As Crossref membership continues to grow, finding ways to help organisations participate is an important part of our mission. Although Crossref membership is open to all organisations that produce scholarly and professional materials, cost and technical challenges can be barriers to joining for many.&lt;/p>
&lt;p>Our &lt;a href="https://www.crossref.org/gem/">Global Equitable Membership (GEM) Program&lt;/a> aims to provide greater membership equity and accessibility to organisations in the world&amp;rsquo;s least economically advantaged countries. Eligible members pay no membership or record registration fees. Eligibility for the program is based on a member&amp;rsquo;s country. Seeing its effectiveness in increasing participation in the research nexus from previously underrepresented regions, this year we are expanding the GEM program to include 18 new countries.&lt;/p>
&lt;h3 id="overview-of-the-first-3-years-of-gem">Overview of the first 3 years of GEM&lt;/h3>
&lt;p>The program began in January 2023 with 214 existing members. By the end of 2025, we had 628 organisations under the GEM program. Of these, 535 are independent members, and 89 members work through one of our &lt;a href="https://www.crossref.org/membership/about-sponsors">sponsors&lt;/a>. To date, GEM program members have contributed approximately 334,000 works to the &lt;a href="https://www.crossref.org/documentation/research-nexus">Research Nexus&lt;/a>.&lt;/p>
&lt;center>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Global equitable membership&lt;/th>
&lt;th>2023&lt;/th>
&lt;th>2024&lt;/th>
&lt;th>2025&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>New members joining&lt;/td>
&lt;td>129&lt;/td>
&lt;td>127&lt;/td>
&lt;td>151&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Total member count&lt;/td>
&lt;td>327&lt;/td>
&lt;td>458&lt;/td>
&lt;td>628&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;/center>
&lt;p>Total number of Crossref GEM members by country until the end of 2025:&lt;/p>
&lt;center>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>GEM country – alphabetically&lt;/th>
&lt;th>Total no. of members&lt;/th>
&lt;th>GEM country – alphabetically&lt;/th>
&lt;th>Total no. of members&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Afghanistan&lt;/td>
&lt;td>29&lt;/td>
&lt;td>Malawi&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bangladesh&lt;/td>
&lt;td>167&lt;/td>
&lt;td>Maldives&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Benin&lt;/td>
&lt;td>6&lt;/td>
&lt;td>Mali&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bhutan&lt;/td>
&lt;td>6&lt;/td>
&lt;td>Marshall Islands&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burkina Faso&lt;/td>
&lt;td>7&lt;/td>
&lt;td>Mauritania&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burundi&lt;/td>
&lt;td>3&lt;/td>
&lt;td>Micronesia&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Cambodia&lt;/td>
&lt;td>14&lt;/td>
&lt;td>Mozambique&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Central African Republic&lt;/td>
&lt;td>1&lt;/td>
&lt;td>Myanmar&lt;/td>
&lt;td>3&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Chad&lt;/td>
&lt;td>0&lt;/td>
&lt;td>Nepal&lt;/td>
&lt;td>60&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Comoros&lt;/td>
&lt;td>1&lt;/td>
&lt;td>Nicaragua&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Congo, Democratic Republic&lt;/td>
&lt;td>24&lt;/td>
&lt;td>Niger&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Côte d&amp;rsquo;Ivoire&lt;/td>
&lt;td>3&lt;/td>
&lt;td>Rwanda&lt;/td>
&lt;td>9&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Djibouti&lt;/td>
&lt;td>0&lt;/td>
&lt;td>Samoa&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Eritrea&lt;/td>
&lt;td>0&lt;/td>
&lt;td>São Tomé and Principe&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ethiopia&lt;/td>
&lt;td>17&lt;/td>
&lt;td>Senegal&lt;/td>
&lt;td>7&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Gambia&lt;/td>
&lt;td>0&lt;/td>
&lt;td>Sierra Leone&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ghana&lt;/td>
&lt;td>38&lt;/td>
&lt;td>Solomon Islands&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guinea&lt;/td>
&lt;td>0&lt;/td>
&lt;td>Somalia&lt;/td>
&lt;td>10&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guinea-Bissau&lt;/td>
&lt;td>0&lt;/td>
&lt;td>South Sudan&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guyana&lt;/td>
&lt;td>3&lt;/td>
&lt;td>Sri Lanka&lt;/td>
&lt;td>31&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Haiti&lt;/td>
&lt;td>2&lt;/td>
&lt;td>Sudan&lt;/td>
&lt;td>14&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Honduras&lt;/td>
&lt;td>3&lt;/td>
&lt;td>Tajikistan&lt;/td>
&lt;td>8&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kiribati&lt;/td>
&lt;td>0&lt;/td>
&lt;td>Tanzania, United Republic of&lt;/td>
&lt;td>28&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kosovo&lt;/td>
&lt;td>9&lt;/td>
&lt;td>Togo&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kyrgyz Republic&lt;/td>
&lt;td>27&lt;/td>
&lt;td>Tonga&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Lao, People&amp;rsquo;s Democratic Rep.&lt;/td>
&lt;td>5&lt;/td>
&lt;td>Tuvalu&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Lesotho&lt;/td>
&lt;td>0&lt;/td>
&lt;td>Uganda&lt;/td>
&lt;td>23&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Liberia&lt;/td>
&lt;td>1&lt;/td>
&lt;td>Vanuatu&lt;/td>
&lt;td>0&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Madagascar&lt;/td>
&lt;td>5&lt;/td>
&lt;td>Yemen&lt;/td>
&lt;td>37&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Zambia&lt;/td>
&lt;td>8&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;/center>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2026/gem-expansion-map.png"
alt="world map with GEM countries highlighted in red" width="75%">&lt;figcaption>
&lt;p>Membership Density in GEM Program Countries until the end of 2025&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;h3 id="program-expansion-in-2026">Program expansion in 2026&lt;/h3>
&lt;p>Starting on 1st of January 2026, we’re excited to invite organisations from Angola, Belize, Cabo Verde, Cameroon, Republic of Congo, Dominica, Eswatini, Fiji, Grenada, Kenya, Nigeria, Pakistan, Papua New Guinea, Saint Lucia, Saint Vincent and the Grenadines, Suriname, Timor Leste, and Uzbekistan to join Crossref and register their content and metadata with us without membership or record registration fees. There are 711 existing Crossref members based in these countries who are now eligible for the program, bringing the overall number of GEM members to 1339 across 77 countries (that’s close to 5% of all Crossref members).&lt;/p>
&lt;p>In creating our eligibility list, we refer to existing sources. For the first three years of the program, our list was predominantly based on the World Bank’s &lt;a href="https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups" target="_blank">International Development Association (IDA)&lt;/a> classification. In 2026, we leveraged additional sources to curate our list, resulting in the inclusion of 18 new countries in the program. Following community feedback, we now refer to the IDA, the &lt;a href="https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups" target="_blank">IDA Blend List&lt;/a>, and the &lt;a href="https://www.un.org/ohrlls/content/list-ldcs" target="_blank">United Nations Least Developed Countries&lt;/a> list. In our choices, we also keep abreast of the global situation and conversations about supporting equitability in scholarly publishing and in the future, we may consider other factors too.&lt;/p>
&lt;p>We will review our lists and the eligibility criteria annually and note any changes on our &lt;a href="https://www.crossref.org/gem/">website&lt;/a>. Members whose country moves on or off the GEM Program will be notified of any upcoming fees (or the removal of fees) with adequate time to plan and budget accordingly.
Although the GEM program reduces financial barriers, many small organisations may still need administrative, technical, and language support provided by our Sponsors, and we will continue working with suitable organisations to make participation in Crossref easier.&lt;/p></description></item><item><title>Reduction of Grant DOI registration fees: a boost for the Research Nexus</title><link>https://www.crossref.org/blog/reduction-of-grant-doi-registration-fees-a-boost-for-the-research-nexus/</link><pubDate>Thu, 08 Jan 2026 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/reduction-of-grant-doi-registration-fees-a-boost-for-the-research-nexus/</guid><description>&lt;p>We are pleased to announce that&amp;mdash;effective 1st January 2026&amp;mdash;we have made two changes to grant record registration fees that aim to accelerate adoption of Crossref&amp;rsquo;s &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System (GLS)&lt;/a> and provide a two-year window of opportunity to increase the number and availability of open persistent grant identifiers and boost the matching of relationships with research objects.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/community-images/gls/gls-benefits.png"
alt="High-level benefits of the Crossref Grant Linking System (GLS)" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
Launched in 2019 with close input from several funders and other infrastructure organisations, the GLS primarily offers the ability to create and steward Crossref Grant DOIs, along with several benefits such as dedicated grant/award metadata like funding type, value, contributors, and projects, as well as hosted landing pages, tools to create and update metadata, and of course both member-asserted and Crossref-automatic matching of relationships within the global corpus of 180 million other research objects. Essentially, we need to identify what research objects are produced as a result of the award, and these objects could be articles, preprints, data, code, blogs, posters, and more.
&lt;p>This connected network is what we call the Research Nexus, essential for exploring research activity in general, as well as evaluating reach and return on funding and other support like use of facilities/equipment.&lt;/p>
&lt;h3 id="a-fee-reduction-and-a-two-year-fee-waiver-pilot">A fee reduction and a two-year fee waiver pilot&lt;/h3>
&lt;p>Following a review by our &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership &amp;amp; Fees Committee&lt;/a>, the &lt;a href="https://www.crossref.org/board-and-governance/#board-members">Board&lt;/a> met in December and passed two related motions:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Current-Year (CY) grant registration fee has been cut in half to match other record types&lt;/strong>: The board approved the adjustment of the Current-Year (CY) grant registration fee down from $2.00 to $1.00 USD, effective 1st January 2026.&lt;/li>
&lt;li>&lt;strong>Back-Year (BY) grant registration fee is waived through 2027&lt;/strong>: The board approved a time-limited fee waiver as a pilot for Back-Year (BY) grant registration fees, bringing that per-record fee down from $0.30 to $0.00 for 2026 and 2027.&lt;/li>
&lt;/ol>
&lt;p>We aim to boost registration of Back-Year (BY) records and accelerate the growth of the Research Nexus with millions more grant&amp;lt;-&amp;gt;output matches. During the course of the two-year pilot, the Membership &amp;amp; Fees Committee and our &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">fee project work&lt;/a> that started in 2023 and also &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">brought in other fee reductions&lt;/a>, will consider more adjustments across BY registration fees for the benefit of members beyond just funders and beyond just grants.&lt;/p>
&lt;p>All &lt;a href="https://www.crossref.org/board-and-governance/#motions">Board motions are publicly available&lt;/a> and we encourage questions from the community about our governance processes and the decisions on our members&amp;rsquo; behalf; &lt;a href="mailto:feedback@crossref.org">email us via feedback@crossref.org&lt;/a> anytime, or &lt;a href="https://community.crossref.org/c/strategy/" target="_blank">post on the forum&lt;/a>.&lt;/p>
&lt;h3 id="supercharging-the-grant-linking-system">Supercharging the Grant Linking System&lt;/h3>
&lt;p>Leading up to the &lt;a href="https://www.crossref.org/services/grant-linking-system/">GLS&lt;/a> launch in 2019, we worked with a group of funders and metadata experts to inform the design and implementation of the new service, including a funder governance and fees working group. That was seven years ago, and our &lt;a href="https://www.crossref.org/working-groups/funders/">Funder Advisory Group&lt;/a> now includes nearly 100 funding community representatives the GLS has grown to almost 50 funder members that have registered more than 185,000 open grant metadata records. But they are mostly research councils and agencies or charities from Europe and North America, and we know that for a truly comprehensive and interconnected Research Nexus, more needs to be done to include organisations from all parts of the world. The other key driver is simply to boost more metadata connections; the more grant metadata we gather, the better we can match it to all kinds of research outputs, and &lt;a href="">this metadata directly feeds thousands of services&lt;/a> available in our community, from Dimensions and Scopus, to OA.Report and OpenAlex, as well as funders&amp;rsquo; own analytics tools. See our &lt;a href="https://doi.org/10.64000/607z6-1nh09" target="_blank">recent report about the latest dataset&lt;/a> and of course use &lt;a href="htps://api.crossref.org">api.crossref.org&lt;/a> directly.&lt;/p>
&lt;p>Relatedly, we just added a &lt;a href="https://doi.org/10.64000/x7d4h-x3r11" target="_blank">new Grant DOI field&lt;/a> to our schema for all record types, to give our members a precise and accurate way of capturing funding metadata for all research outputs. With the new lower CY registration fee and a pilot waiver of BY fees for grant records, we hope to boost the creation of more Grant DOIs by more funders from more parts of the world&amp;mdash;so that others also see and can build on the momentum and reuse the data in their own tools and services. &lt;a href="https://barcelona-declaration.org/news/20251023_community_roundtable/" target="_blank">All actors need to play their role&lt;/a>, and Crossref’s part is in running the global linking infrastructure at scale, connecting research objects and making them openly available while ensuring that the barriers for the registration, use and reuse of metadata remain as low as possible.&lt;/p>
&lt;p>We feel we&amp;rsquo;re at a tipping point that only needs a small nudge to truly scale the Grant Linking System.&lt;/p>
&lt;p>By waiving BY fees entirely for two years, we&amp;rsquo;re hoping to see members fill in historical data and create more comprehensive grant&amp;lt;-&amp;gt;outcome connections. There is often a long period of time between funding being awarded, and the resulting research objects being generated and communicated. That is why historical grant metadata is so important; we think that there will be many funding outcome relationships and insights just waiting to be uncovered!&lt;/p>
&lt;h3 id="why-give-funders-a-fee-break-and-not-others">Why give funders a fee break and not others?&lt;/h3>
&lt;p>We&amp;rsquo;re not ruling out this kind of fee incentive in future for other members and other object types, but that needs more analysis (which we plan to do) and right now, the relatively small number of grant records, combined with a growing need for this kind of metadata, means the changes are small enough to have almost no impact on Crossref&amp;rsquo;s healthy financial position.&lt;/p>
&lt;p>This decision is consistent with the goals of our &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">Resourcing Crossref for Future Sustainability (RCFS)&lt;/a> to review our fees to make sure they are equitable and clear, while ensuring Crossref retains a sustainable business model. Our fees can encourage or discourage the community to participate in Crossref. The RCFS project has also resulted in the creation of a &lt;a href="https://doi.org/10.64000/j2bgz-v7h50/" target="_blank">lower membership fee tier&lt;/a> for the very lowest-resourced members, and the tidying up of &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">things like outlier volume discounts&lt;/a>.&lt;/p>
&lt;p>The BY fee waiver is positioned as a pilot to allow us to measure its impact over the next two years and feed into the Membership &amp;amp; Fees Committee and RCFS project. We will evaluate the pilot results (i.e. does it indeed supercharge funding metadata connections and adoption?) and consider additional adjustments to other BY registration fees and whether such fee incentives might be extended to other members.&lt;/p>
&lt;p>We encourage all funders to take advantage of these reduced rates to contribute to the Research Nexus and help us build a more complete picture of the relationship between research funding and outcomes.&lt;/p>
&lt;p>Take a look at the recent case studies from early GLS adopters &lt;a href="https://doi.org/10.64000/n9n69-y5b75" target="_blank">FWF&lt;/a> (Austria), &lt;a href="https://doi.org/10.64000/dvqke-j4v69" target="_blank">NWO&lt;/a> (The Netherlands), &lt;a href="https://doi.org/10.64000/9gjfp-5p698" target="_blank">FCCN|FCT&lt;/a> (Portugal), and &lt;a href="https://doi.org/10.64000/c1dh8-qn968" target="_blank">Wellcome/EuropePMC&lt;/a>, reach out to them or us with any questions, or peruse the &lt;a href="https://community.crossref.org/c/crossref-services/grant-linking-system/" target="_blank">GLS community forum&lt;/a>!&lt;/p></description></item><item><title>The best way of acknowledging research funding in the metadata: Crossref Grant ID</title><link>https://www.crossref.org/blog/the-best-way-of-acknowledging-research-funding-in-the-metadata-crossref-grant-id/</link><pubDate>Tue, 06 Jan 2026 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/the-best-way-of-acknowledging-research-funding-in-the-metadata-crossref-grant-id/</guid><description>&lt;p>We are very pleased to kick off the New Year with another important schema update and the news that a Grant DOI field is now supported for all record types. This means that Crossref members can explicitly include the Crossref Grant IDs as part of their DOI metadata records for publications and any other output type, accurately linking research outputs to the funding that made it possible, all through metadata. We hope that our members will leverage this to respond to recent calls for &lt;a href="https://council.science/statements/isc-position-on-research-funding-transparency/" target="_blank">stronger funding transparency&lt;/a> and best practices for &lt;a href="https://publicationethics.org/guidance/discussion-document/declaring-funding-sources-research" target="_blank">reporting funding sources in research outputs&lt;/a>. &lt;/p>
&lt;p>Funding information is very important for the research community. As explored by &lt;a href="https://blogs.lse.ac.uk/impactofsocialsciences/2025/12/11/open-funder-metadata-is-essential-for-true-research-transparency/" target="_blank">some key European funder representatives&lt;/a>, providing mechanisms to clearly link funding with its outputs is essential for the community to have a full picture of the research endeavour.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>When funders systematically register grants with persistent identifiers and make this information openly available, they create a foundation that publishers and infrastructure providers such as repositories can reliably build upon when depositing output metadata.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Hans de Jonge, Katharina Rieck and Zoé Ancion&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Up until now, if a Crossref member wanted to include a Crossref Grant ID to unambiguously identify the output funding source, they would need to use other available fields, such as for an award number. While it was an important step towards increasing transparency and is heavily used for reporting and impact assessment, being an unstructured field, it was prone to errors, and of course, funders’ internal award identifiers are not unique, persistent, or necessarily open. This limited our ability to create unambiguous relationships with the Crossref Grant DOIs registered by our now ~50 funder members. As the new field becomes increasingly populated by our members, this rich metadata will pave the way for capturing and representing the funding relationships in a more accurate and complete way and fulfilling one of our commitments at the &lt;a href="https://barcelona-declaration.org/news/20251023_community_roundtable/" target="_blank">recent funding metadata workshop with the Barcelona Declaration.&lt;/a> &lt;/p>
&lt;p>&lt;strong>The Crossref Grant ID field in the schema is a clear signal of the growing demand for these persistent Grant IDs (Crossref DOIs), and the relationships these help us create.&lt;/strong> Those connections can in turn enable streamlined reporting for the grantees, as well as compliance tracking and programme evaluation for funders. &lt;/p>
&lt;p>As part of our work to enable the research nexus, Crossref has been proactively identifying funding information and prototyping metadata enrichment processes &lt;a href="https://www.crossref.org/community/special-programs/metadata-matching/">through matching projects&lt;/a>, ensuring that as many &lt;a href="https://doi.org/10.64000/607z6-1nh09" target="_blank">relationships as possible are established and made discoverable&lt;/a>. With this schema update, we aim to lower barriers and encourage more members to register output-funding relationships at source. This will facilitate the links that make the research nexus a connected, interoperable, and an important source of information that ensures a transparent and trustworthy research process. &lt;/p>
&lt;p>&lt;strong>We encourage all Crossref members to start incorporating Grant DOIs when available into your metadata submissions.&lt;/strong> By taking advantage of this new field, you&amp;rsquo;ll help build a more complete and transparent record of research funding, making it easier for the community to understand and trace the impact of funded research. &lt;/p>
&lt;p>When collecting funding information for your publication, please consider asking the authors for the Grant DOI (Crossref Grant ID) as well as the funder’s details (such as their name and identifier). Here’s how the U.S. Department of Energy, Office of Scientific and Technical Information&amp;rsquo;s (OSTI-DOE) grant &lt;a href="https://doi.org/10.46936/aps-182101/60010611" target="_blank">https://doi.org/10.46936/aps-182101/60010611&lt;/a> can be included in the metadata for related works, from datasets, to preprints, conference proceedings, journal articles, and more:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JS" data-lang="JS">&lt;span class="line">&lt;span class="cl">&lt;span class="o">&amp;lt;&lt;/span>&lt;span class="nx">assertion&lt;/span> &lt;span class="nx">name&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s2">&amp;#34;fundgroup&amp;#34;&lt;/span>&lt;span class="o">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="o">&amp;lt;&lt;/span>&lt;span class="nx">assertion&lt;/span> &lt;span class="nx">name&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s2">&amp;#34;ror&amp;#34;&lt;/span>&lt;span class="o">&amp;gt;&lt;/span>&lt;span class="nx">https&lt;/span>&lt;span class="o">:&lt;/span>&lt;span class="c1">//ror.org/04qxsr837&amp;lt;/assertion&amp;gt;
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="c1">&lt;/span> &lt;span class="o">&amp;lt;&lt;/span>&lt;span class="nx">assertion&lt;/span> &lt;span class="nx">name&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s2">&amp;#34;grant_doi&amp;#34;&lt;/span>&lt;span class="o">&amp;gt;&lt;/span>&lt;span class="mf">10.46936&lt;/span>&lt;span class="o">/&lt;/span>&lt;span class="nx">aps&lt;/span>&lt;span class="o">-&lt;/span>&lt;span class="mi">182101&lt;/span>&lt;span class="o">/&lt;/span>&lt;span class="mi">60010611&lt;/span>&lt;span class="o">&amp;lt;&lt;/span>&lt;span class="err">/assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="o">&amp;lt;&lt;/span>&lt;span class="err">/assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Similarly, a grant &lt;a href="https://doi.org/10.3030/732489" target="_blank">https://doi.org/10.3030/732489&lt;/a> from European Union H2020-EU.2.1.1. - INDUSTRIAL LEADERSHIP, would be represented in related work’s metadata as follows:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JS" data-lang="JS">&lt;span class="line">&lt;span class="cl">&lt;span class="o">&amp;lt;&lt;/span>&lt;span class="nx">assertion&lt;/span> &lt;span class="nx">name&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s2">&amp;#34;fundgroup&amp;#34;&lt;/span>&lt;span class="o">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="o">&amp;lt;&lt;/span>&lt;span class="nx">assertion&lt;/span> &lt;span class="nx">name&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s2">&amp;#34;funder_name”&amp;gt;H2020 LEIT Information and Communication Technologies
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2"> &amp;lt;assertion name=&amp;#34;&lt;/span>&lt;span class="nx">funder_identifier&lt;/span>&lt;span class="err">”&lt;/span>&lt;span class="o">&amp;gt;&lt;/span>&lt;span class="mf">10.13039&lt;/span>&lt;span class="o">/&lt;/span>&lt;span class="mi">100010669&lt;/span>&lt;span class="o">&amp;lt;&lt;/span>&lt;span class="err">/assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="o">&amp;lt;&lt;/span>&lt;span class="err">/assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="o">&amp;lt;&lt;/span>&lt;span class="nx">assertion&lt;/span> &lt;span class="nx">name&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s2">&amp;#34;grant_doi&amp;#34;&lt;/span>&lt;span class="o">&amp;gt;&lt;/span>&lt;span class="mf">10.3030&lt;/span>&lt;span class="o">/&lt;/span>&lt;span class="mi">732489&lt;/span>&lt;span class="o">&amp;lt;&lt;/span>&lt;span class="err">/assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="o">&amp;lt;&lt;/span>&lt;span class="err">/assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>For more technical documentation and implementation guidance, please visit &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/funding-information/">our funding data documentation&lt;/a>. If you have questions or need support integrating Grant IDs into your workflow, our support team is here to help!&lt;/p></description></item><item><title>Highlights of a very busy year: our 2025 annual report</title><link>https://www.crossref.org/blog/highlights-of-a-very-busy-year-our-2025-annual-report/</link><pubDate>Thu, 18 Dec 2025 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/highlights-of-a-very-busy-year-our-2025-annual-report/</guid><description>&lt;p>As we finish &lt;a href="https://www.crossref.org/community/special-programs/25years/">celebrating our 25th anniversary&lt;/a>, we can look back on a truly transformational year, defined by the successful delivery of several long-planned, foundational projects&amp;mdash;as well as updates to our teams, services, and fees&amp;mdash;that position Crossref for success over the next quarter century as essential open scholarly infrastructure. In our &lt;a href="https://doi.org/10.64000/bm6g0-gvy36" target="_blank">update at the end of 2024&lt;/a>, we highlighted that we had restructured our leadership team and paused some projects. The changes made in 2024 positioned us for a year of getting things done in 2025. We launched cross-functional programs, modernised our systems, strengthened connections with our growing global community, and streamlined a bunch of technical and business operations while continuing to grow our staff, members, content, relationships, and community connections.&lt;/p>
&lt;p>Read on for the highlights of a very busy year, grouped around our &lt;a href="https://www.crossref.org/strategy/">four strategic themes&lt;/a>.&lt;/p>
&lt;h2 id="strategic-theme-1-contribute-to-an-environment-where-the-community-identifies-and-co-creates-solutions-for-broad-benefit">Strategic theme 1: Contribute to an environment where the community identifies and co-creates solutions for broad benefit&lt;/h2>
&lt;h3 id="enhanced-tools-and-services">Enhanced tools and services&lt;/h3>
&lt;p>In October, we released an &lt;a href="https://doi.org/10.64000/8d5ga-2n897" target="_blank">enhanced Participation Reports dashboard&lt;/a> that shows metadata coverage across all 180 million records and provides individual member organisations with actionable gap reports to guide them to improve metadata completeness. The new tool provides more complete coverage of all members and resource types, now including funders and grants, with up to 11 best-practice metadata elements publicly tracked.&lt;/p>
&lt;p>We launched support for journal articles in the &lt;a href="https://manage.crossref.org/" target="_blank">New Metadata Manager record registration form&lt;/a> (initially only for grants), which includes built-in reference and relationships deposit capabilities. In the New Metadata Manager, it’s now also possible to search for previously registered DOIs to edit your metadata records. In the coming years, we are planning to expand the new Metadata Manager to support all the many different content types that you can register with Crossref DOIs.&lt;/p>
&lt;p>After a long break between regular updates, we have fixed our process for and &lt;a href="https://gitlab.com/crossref/open_funder_registry" target="_blank">just released v.1.63 of the Open Funder registry&lt;/a>. With the updated process, we&amp;rsquo;re now able to resume more frequent updates to the registry (while of course still working towards the transition to ROR for funders).&lt;/p>
&lt;p>Throughout 2025, we conducted a website information architecture review to improve the information we provide to our members and the wider community. Based on the recommendations from this review, we will be renewing our website and documentation in 2026.&lt;/p>
&lt;h3 id="deprecations-and-modernisation">Deprecations and modernisation&lt;/h3>
&lt;p>‘Old’ Metadata Manager is to be &lt;a href="https://doi.org/10.64000/ys7s6-pwn71" target="_blank">retired at the end of 2025&lt;/a>, with users transitioning to the &amp;lsquo;New&amp;rsquo; version or to our other helper tools for registering and updating DOIs. All users have been contacted during 2025 and received &lt;a href="https://www.youtube.com/watch?v=WN3M90LKNqs" target="_blank">training on how to use the New Metadata Manager&lt;/a>.&lt;/p>
&lt;p>We also &lt;a href="https://doi.org/10.64000/w6pw6-c7y02" target="_blank">announced the deprecation of Co-access&lt;/a>, which will end in 2026, bringing an end to the service that allowed duplicate DOIs for book content. Users of co-access have been informed and are in the process of transitioning to &lt;a href="https://www.crossref.org/documentation/register-maintain-records/creating-and-managing-dois/multiple-resolution/">multiple resolution&lt;/a>.&lt;/p>
&lt;p>Together with Turnitin and our members, we are working to transition all subscribers to our Similarity Check service to a new version of iThenticate 2.0. We are happy to report that all platforms with integrations with us transitioned to 2.0 during 2025, and we will continue working with our members to get everyone transitioned during 2026.&lt;/p>
&lt;h3 id="eating-our-own-doi-dogfood">Eating our own DOI dogfood&lt;/h3>
&lt;p>In June this year, we were particularly pleased to finally &lt;a href="https://doi.org/10.64000/552ec-b8g03" target="_blank">support the registration of DOIs for our own content, this very blog&lt;/a>, through partnering with Rogue Scholar. Blogs are a growing format for scholarly discourse and our own blog is no different as it’s the main way that we share guidelines and best practices, as well as news and stories from the scholarly community. With a Crossref DOI for all blogs going back to 2006, we’re setting ourselves up to ensure better future preservation of the discussion and information about Crossref.&lt;/p>
&lt;h3 id="community-connections">Community connections&lt;/h3>
&lt;p>We delivered 29 metadata health-check webinars over the course of the year, in French, Indonesian, Spanish, and English, reaching 2,166 participants with practical advice on identifying gaps in journal metadata using &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a>.&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.64000/x38ew-0n632" target="_blank">Crossref Accra&lt;/a> took place in March as our first in-person event in a &lt;a href="https://www.crossref.org/gem/">GEM&lt;/a> country. We also held similar events in Ecuador and Türkiye with &lt;a href="https://zenodo.org/records/17251274" target="_blank">Crossref Quito&lt;/a> in September and &lt;a href="https://zenodo.org/records/17952555" target="_blank">Crossref Ankara&lt;/a> in November. At these three events, we welcomed key figures from each country&amp;rsquo;s library, government, publishing, and academic communities and we learned so much about the thriving communities there, and also that even more dedicated workshops on the specifics of metadata quality improvements would be appreciated.&lt;/p>
&lt;p>Our &lt;a href="https://www.crossref.org/events/metadata-sprint/">metadata sprint in Madrid&lt;/a> in April brought together community members to tackle specific problems collaboratively, with teams exploring coding, documentation, translation, and research using our open metadata. We&amp;rsquo;re already planning our &lt;a href="https://www.crossref.org/events/metadata-sprint/">next sprint in São Paulo&lt;/a> for March 2026, and it will be held in three languages: Portuguese, Spanish, and English.&lt;/p>
&lt;p>A strategic goal for Crossref is to grow research funders’ adoption of the &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System&lt;/a>, and we produced the first in a series of interviews with funder members this year to highlight how and why Crossref DOIs are fulfilling goals to assess the reach and return of their research support for &lt;a href="https://doi.org/10.64000/n9n69-y5b75" target="_blank">FWF&lt;/a> (Austria), &lt;a href="https://doi.org/10.64000/dvqke-j4v69" target="_blank">NWO&lt;/a> (Netherlands), &lt;a href="https://doi.org/10.64000/9gjfp-5p698" target="_blank">FCCN|FCT&lt;/a> (Portugal), and &lt;a href="https://doi.org/10.64000/c1dh8-qn968" target="_blank">Wellcome&lt;/a>. This year, we welcomed more funders including Fonds de recherche du Québec (Canada) and Independent Research Fund Denmark as part of their national research platform NORA; we look forward to reporting on their experiences and outcomes next year and others as they work towards Crossref Grant DOI adoption.&lt;/p>
&lt;p>We continued working closely with PKP and &lt;a href="https://doi.org/10.64000/r2zgm-99706" target="_blank">renewed our partnership to help drive better experience for OJS users&lt;/a> registering metadata with Crossref. We also delivered a proportion of the metadata health-checks together to maximise the learning opportunities for our members using OJS; and we joined PKP&amp;rsquo;s Sprint in Oslo to help make improvements to OJS and OMP.&lt;/p>
&lt;p>Crossref staff members serve on almost 50 committees, boards, and other community bodies alongside our own direct work. These include in the areas of research integrity, metascience, metadata and PID standards, open science policy or monitoring, development of new models (such as Diamond OA), editorial production, library and institutional publishing, and citation and other metadata analyses. We also work with other DOI Registration Agencies and support the sustainability of the DOI Foundation with an additional annual subsidy. Many DOI RAs are also Crossref Sponsors so that their members can access our unique reference matching service. While we often might advise, we also learn a huge amount from collaborating with the numerous systems and initiatives that make up the wider research community.&lt;/p>
&lt;p>Our involvement with developing the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration on Open Research Information&lt;/a> led us to become the fiscal host and to participate in most of the working groups on open metadata. Of particular note this year was the Funding Metadata Working Group round table about &lt;a href="https://barcelona-declaration.org/news/20251023_community_roundtable/" target="_blank">moving forward the state of funding metadata&lt;/a>, which we co-hosted with Barcelona Declaration colleagues, and three funding bodies, NWO (Netherlands), FWF (Austria), and ANR (France) as we heard from publishers and their vendors about challenges and how to overcome them to increase the quantity and quality of available open funding metadata.&lt;/p>
&lt;p>All our community engagement activities have been enthusiastically supported and enriched by our indispensable &lt;a href="https://www.crossref.org/community/our-ambassadors/">Ambassadors&lt;/a> and our group of now 130 &lt;a href="https://www.crossref.org/community/sponsors/">Sponsors&lt;/a>, organisations that help thousands of Crossref members with local language and technical support and lower cost access to our membership.&lt;/p>
&lt;h2 id="strategic-theme-2-a-sustainable-source-of-complete-open-and-global-scholarly-metadata-and-relationships">Strategic theme 2: A sustainable source of complete, open, and global scholarly metadata and relationships&lt;/h2>
&lt;h3 id="schema-developments">Schema developments&lt;/h3>
&lt;p>The &lt;a href="https://www.crossref.org/documentation/schema-library/grants-schema/">grant schema version 0.2.0 was released in January&lt;/a>, adding support for ROR identifiers to identify funders and new funding types for in our taxonomy, including APC, BPC, and infrastructure. All of these funding types can be specified in the metadata of our grant-giving members alongside the existing types such as use of facilities or salary/training awards, etc.&lt;/p>
&lt;p>Version &lt;a href="https://doi.org/10.13003/325070" target="_blank">5.4 of our publications schema was released in March&lt;/a>, marking our first update in many years and a great opportunity to learn how to do this and make the process more efficient. This release introduced typed references to denote the type of object referenced (dataset, blog, software, etc.), preprint status indicators, and version numbering.&lt;/p>
&lt;p>Just last week, we also added a dedicated field for &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/funding-information">grant DOIs to our publications schema&lt;/a>. This means it’s now possible to indicate in an article&amp;rsquo;s metadata which grant(s) funded the research using the persistent identifier. This is an essential step toward better alignment between grant funding and research, enriching the Research Nexus.&lt;/p>
&lt;p>We also launched our new &lt;a href="https://www.crossref.org/working-groups/metadata-advisory/">Metadata Advisory Group&lt;/a> and they have already devised sub-working groups in three focus topic areas:&lt;/p>
&lt;ul>
&lt;li>Multilingual metadata&lt;/li>
&lt;li>Subjects and keywords&lt;/li>
&lt;li>Relationships&lt;/li>
&lt;/ul>
&lt;h3 id="public-data-file">Public data file&lt;/h3>
&lt;p>We released the &lt;a href="https://doi.org/10.13003/614659" target="_blank">2025 public data file&lt;/a> in March, containing metadata for (at the time) over 165 million research outputs from more than 22,000 organisations.&lt;/p>
&lt;h3 id="inaugural-metadata-awards">Inaugural Metadata Awards&lt;/h3>
&lt;p>In May, we launched the &lt;a href="https://doi.org/10.64000/xh94q-w7335" target="_blank">first-ever Metadata Awards&lt;/a> to recognise members demonstrating excellence in metadata completeness and enrichment. Winners included &lt;a href="https://doi.org/10.64000/v2v2s-r9037" target="_blank">Noyam Publishers&lt;/a> (Ghana), &lt;a href="https://doi.org/10.64000/z2qhj-7nd90" target="_blank">GigaScience Press&lt;/a> (Hong Kong), &lt;a href="https://doi.org/10.64000/3gcdf-23s29" target="_blank">eLife&lt;/a> (UK), &lt;a href="https://doi.org/10.64000/xxwy3-xhf38" target="_blank">American Society for Microbiology&lt;/a> (USA), &lt;a href="https://doi.org/10.64000/51bv6-89j85" target="_blank">Universidad La Salle Arequipa&lt;/a> (Peru), and &lt;a href="https://doi.org/10.64000/hkxmk-5qe50" target="_blank">Instituto Geologico y Minero de España&lt;/a> (Spain). The awards will be held biennially going forward.&lt;/p>
&lt;h3 id="metadata-matching-project">Metadata Matching project&lt;/h3>
&lt;p>In April, we launched the &lt;a href="https://www.crossref.org/community/special-programs/metadata-matching/">metadata matching&lt;/a> project with the aim of building a more complete picture of &lt;a href="https://www.crossref.org/documentation/research-nexus/">the research nexus&lt;/a> over time by automatically identifying missing relationships between entities across the scholarly record. The project’s goal is to modernise Crossref’s enrichment workflows by rebuilding them using modern software development and data science practices.&lt;/p>
&lt;p>We are in the throws of developing a consolidated matching workflow that will eventually replace all existing production matching processes, with results exposed through the REST API. All new matching strategies will be rigorously evaluated, and the resulting data will be accompanied by clear provenance information. This project covers six matching tasks:&lt;/p>
&lt;ul>
&lt;li>bibliographic reference matching&lt;/li>
&lt;li>funder name matching&lt;/li>
&lt;li>preprint matching&lt;/li>
&lt;li>affiliation matching&lt;/li>
&lt;li>grant matching&lt;/li>
&lt;li>title matching&lt;/li>
&lt;/ul>
&lt;p>In the meantime, while work continues on integrating matching results into the REST API, we’ve been releasing standalone matching datasets for separate download and analysis. These include &lt;a href="https://doi.org/10.5281/zenodo.15124417" target="_blank">relationships between preprints and journal articles&lt;/a>, &lt;a href="https://doi.org/10.5281/zenodo.15254993" target="_blank">relationships involving research organisations&lt;/a>, and &lt;a href="https://doi.org/10.13003/waej1een" target="_blank">relationships between grants and research outputs&lt;/a>.&lt;/p>
&lt;h3 id="data-infrastructure-and-research-nexus-participation-dashboard">Data infrastructure and Research Nexus participation dashboard&lt;/h3>
&lt;p>Staying on the data science front, we’ve established an internal data environment that combines all relevant data sources (scholarly metadata, logs and usage data, and external datasets) in their raw forms into a single place. This environment is supported by a suite of modern tools and data processing techniques, enabling data science experiments and analytics pipelines to run effectively at scale.&lt;/p>
&lt;p>Building on this foundation, we plan to develop a series of dashboards to monitor the state of the scholarly record over time. These dashboards will feature both work-level and member-level statistics (for example, how many works of a given type have been registered, or how many members are registering grant IDs) as well as more detailed insights at the relationship level (for example, how many bibliographic references have been automatically matched, or how many times ROR IDs are included in funder assertions). Some of these &lt;a href="https://docs.google.com/spreadsheets/d/1jYXAILYgGWth-1lJhsJZPJJVSpyydenjK6E8fL4r1q0/edit?gid=2029795659#gid=2029795659" target="_blank">statistics are already available&lt;/a> in a public spreadsheet for now, pending the dashboard.&lt;/p>
&lt;h3 id="retraction-watch-integration">Retraction Watch integration&lt;/h3>
&lt;p>In 2023, Crossref &lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">acquired the Retraction Watch database&lt;/a> to make it open data. Initially, this was done through sharing simple CSV files, but this year we have set up a pipeline to feed this information into our REST API, which means that Retraction Watch data is now fully available through the REST API, integrated with Crossref member-supplied retraction and correction metadata. This is the first example of Crossref integrating third-party metadata, and we&amp;rsquo;re learning a lot about how to best incorporate other datasets in future.&lt;/p>
&lt;h3 id="metadata-api-and-services-improvements">Metadata API and services improvements&lt;/h3>
&lt;p>From 1 December 2025, we &lt;a href="https://doi.org/10.64000/wadve-3tj60" target="_blank">revised rate limits for the REST API&lt;/a> to ensure system stability whilst maintaining free access to metadata for everyone. Changes were made to the rate limits for our ‘public’ and ‘polite’ APIs, while the limits for our Metadata Plus users stayed the same. We continue to make all metadata openly available to the whole community.&lt;/p>
&lt;p>We also improved how information from our content system feeds into the REST API. A tool we call ‘pusher’&amp;mdash;because it pushes information from the content system to the REST API&amp;mdash;was rebuilt so that we now have a more reliable transfer of information between our two systems.&lt;/p>
&lt;p>While adding to technical improvements, we’ve also worked to better understand the use of and streamline the service offering for paid options. We’ll share more about this year’s Metadata Plus consultation soon. And based on feedback, we have already retired the ‘Query Affiliate’ service, where a handful of organisations still paid us a fee to access our XML API, whereas no credentials have been required for some time.&lt;/p>
&lt;h2 id="strategic-theme-3-manage-crossref-openly-and-sustainably-modernising-and-making-transparent-all-operations-so-that-we-are-accountable-to-the-communities-that-govern-us">Strategic theme 3: Manage Crossref openly and sustainably, modernising and making transparent all operations so that we are accountable to the communities that govern us&lt;/h2>
&lt;h3 id="infrastructure-modernisation">Infrastructure modernisation&lt;/h3>
&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2025/data-centre-out.jpg"
alt="Saying goodbye to the Crossref data centre" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
&lt;p>One of our biggest projects of 2025&amp;mdash;if not &lt;strong>the&lt;/strong> biggest&amp;mdash;was the move from our data centre into the cloud (AWS). For 25 years, Crossref had been running a physical data centre in Massachusetts, USA, but as part of modernising our systems, it was high time to move everything into the cloud. The move to AWS took several months, but &lt;a href="https://doi.org/10.64000/wd6rx-vpq73" target="_blank">we successfully completed this move to the cloud&lt;/a> in July this year. We’re spending these last weeks of 2025 fully decommissioning our data centre, which means that we are removing all the equipment we had there and locking the door for the last time.&lt;/p>
&lt;p>A part of the move to AWS included moving onto an open-source database solution, PostgreSQL. This reduced our reliance on closed, costly licensed solutions, while also aligning with our POSI commitment to open-source. Running our entire system in AWS provides a more stable, modern approach to our infrastructure, but it also is expensive. We expect to spend about 2 million USD on AWS fees next year, with the majority of this cost coming from REST API usage. Some of the improvements described above will help us manage those costs and better observe traffic patterns.&lt;/p>
&lt;p>Our new cloud infrastructure is a bittersweet milestone: while we are happy to not have to rely on a physical presence to support a 24/7 global infrastructure, we also say a sad farewell to our much-loved and long-suffering Sys Admin, &lt;a href="https://www.crossref.org/people/tim-pickard/">Tim Pickard&lt;/a>, who has been with Crossref since 2002, and has contributed significantly and unwaveringly to keeping our system up and running in the data centre. Tim will be leaving Crossref at the end of the year; we’re grateful to Tim for all his years of dedication, and we will greatly miss his impressive Hawaiian shirt game on our all-staff calls.&lt;/p>
&lt;p>After 25 years, it was also time to get serious about modernising our core content system, because even though it serves our community well, an older system with legacy code is a constant risk and frustration. We’ve therefore embarked on a multi-year modernisation project where we are replacing our old code piece by piece. We no longer want to have one big content system (a monolith), but are planning to identify different pieces of functionality and rebuild these as separate services (a modular, flexible, and robust approach). This year, we already managed to reconstruct some smaller pieces (for example, the ‘pusher’ mentioned above), and next year we will tackle larger projects, such as Metadata Matching and Authentication.&lt;/p>
&lt;p>We continue to prioritise open, timely communication for planned or unplanned service interruptions and encourage everyone to monitor our status page at &lt;a href="https://status.crossref.org" target="_blank">status.crossref.org&lt;/a>. We’ll further hone our incident response processes in 2026, including openly posting incident reviews, and we’ll also centre system maintenance and documentation clarity in everything we do.&lt;/p>
&lt;h3 id="rcfs-projects">RCFS Projects&lt;/h3>
&lt;p>The &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">Resourcing Crossref for Future Sustainability projects (RCFS)&lt;/a> and the work of our &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership &amp;amp; Fees Committee&lt;/a> resulted in deciding not to change some things (such as the &lt;em>basis&lt;/em> for annual membership fees), but to &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">change three things about our fees, as reported in July&lt;/a>:&lt;/p>
&lt;ul>
&lt;li>A new lower membership fee tier of 200 USD for members with annual revenues/expenses of under 1000 USD - so far, this includes around 3000 members. &lt;a href="#membership-growth-efficiencies-and-accessibility">See below&lt;/a> for more info.&lt;/li>
&lt;li>A removal of volume discounts to reduce complexity in our billing code; they were little used, and those who did use these were fine with the loss of the discount.&lt;/li>
&lt;li>A removal of the rule that only publishers of a title could register peer review reports (including comments and annotations) at the lower 0.25 USD fee for the first review; this lower fee is now available to any member to register any reviews of any other members’ works.&lt;/li>
&lt;/ul>
&lt;p>A new late-breaking addition to these fee decisions is the &lt;a href="https://doi.org/10.64000/g6vyx-1tn51" target="_blank">reduction of fees for members registering grants&lt;/a>. As of January 1st 2026, there will be no fee for back-year (BY) grant registration, to encourage the faster adoption of older grants, which are more likely to have research outputs to be matched. This will be a two-year pilot to trial how a reduced fee incentivises adoption and boosts metadata connections, and could be extended to other record types as we monitor its success and sustainability. In addition, the 2 USD fee per current-year (CY) grant record is being reduced to 1 USD in line with the next-nearest fee, this is a permanent change for the foreseeable future.&lt;/p>
&lt;h3 id="membership-growth-efficiencies-and-accessibility">Membership growth, efficiencies, and accessibility&lt;/h3>
&lt;p>In March, the board voted to &lt;a href="https://doi.org/10.64000/dtrvw-8cm10" target="_blank">update membership terms and bylaws&lt;/a> to clarify processes for suspending and revoking membership, and to be more explicit about &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/member-practices/">member practices that preserve the integrity of the scholarly record&lt;/a>. A short-term &lt;a href="https://www.crossref.org/working-groups/member-practices/">Member Practices Working Group&lt;/a> will be meeting in the first half of 2026 to draft these.&lt;/p>
&lt;p>Crossref now serves 23,600 members across 164 countries, with continued growth particularly in Asia and Latin America. We&amp;rsquo;ve continued our ongoing member onboarding activities to support new members joining the community. We see around 230 new members join each month, and have welcomed 2,700 this year so far. We recently reported on how the &lt;a href="https://doi.org/10.64000/tch5n-9px70" target="_blank">shape of membership has evolved over our 25 years&lt;/a> of operation.&lt;/p>
&lt;p>From January 2026, we&amp;rsquo;re introducing a &lt;a href="https://doi.org/10.64000/j2bgz-v7h50" target="_blank">new lower membership fee tier&lt;/a> of 200 USD for organisations with annual revenue or expenses of 1,000 USD or less, making membership more accessible to low-resourced organisations. Already, over 3000 members have been eligible to move into or join under that fee, and the idea is to monitor how this affects Crossref’s financial sustainability and potentially adjust the 200 USD annual fee down again in future years.&lt;/p>
&lt;p>From 1 January 2026, the &lt;a href="https://www.crossref.org/gem">GEM program, which offers fee-free membership and content registration for all members from certain countries&lt;/a>, will expand to include 18 additional countries, further reducing financial barriers to participation in the scholarly record, so we expect several hundred further members to join the existing 600 organisations in this category. &lt;a href="https://doi.org/10.64000/wbrxx-ftc39" target="_blank">More information about the GEM program expansion here&lt;/a>.&lt;/p>
&lt;p>As our membership base continues to grow, the Membership and Finance teams are constantly exploring ways to make shared processes more efficient. A key component in this work has been the efforts to automate several tasks within both teams to help us manage the additional work caused by our growth and allow our teams to focus more on providing the best quality service we can.&lt;/p>
&lt;p>Our membership team continues to support our members, sponsors, service providers, metadata users and the wider community by email and through our &lt;a href="https://community.crossref.org/" target="_blank">community forum&lt;/a>. The membership team includes staff members who focus on member support, and staff members who focus on technical support. During 2025 so far, we’ve received 36.8k member enquiries through our support system, a 17% increase from last year. This includes 22.6k inquiries related to general membership and 13k technical support enquiries. We’ve received 3.8k membership applications, and welcomed 2.7k new members.&lt;/p>
&lt;h3 id="growth-by-the-numbers">Growth by the numbers&lt;/h3>
&lt;p>Crossref continues its steady revenue growth in 2025 due to the expansion of our membership base. With the addition of new members and the general growth of Crossref, comes an increase in the transaction-based tasks our Finance team handles.&lt;/p>
&lt;p>So far in 2025 we have issued 14,833 invoices, which is a 9% increase since last year. We’ve seen an 11% increase in the number of payments received and applied, and a 12% increase in the amount of credit and debit memos applied over the same time last year. We have also seen a 42% increase in the number of billing-related tickets, totalling 20,723. A large segment of these tickets are related to fee updates associated with the new $200 membership tier.&lt;/p>
&lt;p>Not all transactional work in Finance has increased as steadily, with increased revenue of 8% we have also seen a 14% increase in operating expenses. Through the strategic consolidation of vendors and use of financial tools, we have only seen a 1% increase in Accounts Payable invoices processed.&lt;/p>
&lt;h3 id="organisational-sustainability">Organisational sustainability&lt;/h3>
&lt;p>Finance-wise, we’re doing well. We’re projecting to finish this year with revenue of 14,200,000 USD and expect revenue next year of 14,500,000 USD. We’re budgeting 2% growth in overall revenue, accounting for some of the changes to fees that will reduce our earnings on membership dues, but anticipating continued growth of content registration revenue.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/financials/2024-YE-overall.jpg"
alt="A chart showing Crossref&amp;#39;s Revenue and expenses over the years" width="100%">&lt;figcaption>
&lt;p>Revenue and expenses trends&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>About 67% of our expenses come from personnel costs, and the other 33% include non-personnel costs like AWS, travel, legal fees, etc. As we continue to build out the team, we have ten new positions planned for the next year (recruitment for many of these is already underway or done). With additional staff roles and AWS expenses, we’re expecting expense growth of 16%. We post our financial statements and Form 990 filings on the &lt;a href="https://www.crossref.org/operations-and-sustainability/financials">financials page on our website&lt;/a>.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/financials/2024-rev-by-tier.jpg"
alt="A chart showing revenue per member size (by tier) with smallest members providing highest revenue" width="100%">&lt;figcaption>
&lt;p>Revenue per member size (by tier)&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>As the chart above shows, we still see &amp;rsquo;the long tail&amp;rsquo; of smaller members in the lowest fee category (275 USD) contributing more revenue than those in the largest category (50,000 USD) at 5.8 million USD versus 5 million USD.&lt;/p>
&lt;p>Another aspect of sustainability is our impact on the world around us. And this year we were able to publish a second &lt;a href="https://doi.org/10.64000/4yc7f-4h586" target="_blank">report on Crossref’s carbon footprint&lt;/a>, having monitored and controlled for several carbon-heavy activities, primarily staff travel. Our reported emissions went up 40% from 2023 to 2024, due to more travel given our growth in staff and members, better recording our emissions (for example, with hotel stays), and including travel that we support for our partners, ambassadors and board members. In terms of travel spending, we are still well below 2019 when we were smaller, demonstrating that we are following through on not going back to the pre-pandemic norm.&lt;/p>
&lt;p>We were one of the first open infrastructure organisations to adopt the POSI Principles and now have a few years’ experience in trying to meet them. Together with other adopters, we &lt;a href="https://doi.org/10.13003/6148078" target="_blank">proposed updates and additions to the principles&lt;/a>, based on real-world practice, and gathered a lot of community comment, resulting in the group &lt;a href="https://doi.org/10.14454/G8WV-VM65" target="_blank">publishing POSI v2&lt;/a> in October. We conduct a &lt;a href="https://www.crossref.org/categories/posi">self-assessment&lt;/a> every other year and we’ll be involving all our staff in the next self-assessment, due later in 2026.&lt;/p>
&lt;h3 id="open-governance-through-board-election-and-annual-meeting">Open governance through board election and annual meeting&lt;/h3>
&lt;p>We continued our commitment to being member-led and community-driven. This year’s &lt;a href="https://doi.org/10.64000/0team-dyy285" target="_blank">anniversary Annual Meeting&lt;/a> in October brought together members to discuss strategy, metadata developments, and hear the results of their voting in our board election. It comprised two half-days of online conferencing and several in-person satellite meetings spread across five continents, gathering close to 500 members of our community. It was a platform to reflect together on the past quarter of the century of building community infrastructure and connections underpinning the progress of scholarship, and to share plans for the future.&lt;/p>
&lt;p>Each member has one vote, and together they elected the following organisations to serve a three-year term alongside the &lt;a href="https://www.crossref.org/board-and-governance/#board-members">rest of the board&lt;/a>:&lt;/p>
&lt;p>Tier 1 candidates (electing one seat):&lt;/p>
&lt;ul>
&lt;li>Rebecca Wambua, Distance, Open and e-Learning Practitioners&amp;rsquo; Association of Kenya&lt;/li>
&lt;/ul>
&lt;p>Tier 2 candidates (electing four seats):&lt;/p>
&lt;ul>
&lt;li>Damian Bird, CABI&lt;/li>
&lt;li>Rose L&amp;rsquo;Huillier, Elsevier*&lt;/li>
&lt;li>Anjalie Nawaratne, Springer Nature*&lt;/li>
&lt;li>Nick Lindsay, The MIT Press*&lt;/li>
&lt;/ul>
&lt;p>*returning board member&lt;/p>
&lt;p>Congratulations to the remaining and incoming board members as we start their new term in January 2026. Have a look at &lt;a href="https://doi.org/10.13003/431937misogo" target="_blank">all the outputs from our Annual Meeting&lt;/a>.&lt;/p>
&lt;h2 id="strategic-theme-4-foster-a-strong-teambecause-reliable-infrastructure-needs-committed-people-who-contribute-to-and-realise-the-vision-and-thrive-doing-it">Strategic theme 4: Foster a strong team—because reliable infrastructure needs committed people who contribute to and realise the vision, and thrive doing it&lt;/h2>
&lt;h3 id="team-structure">Team structure&lt;/h3>
&lt;p>We reorganised the team heading into 2025 because we had ambitious goals that required a more structured, collaborative approach. We reorganised the work around three strategic, mission-driven areas of focus described above. This was our first full year with the &lt;a href="https://doi.org/10.64000/4s2ee-wkr84" target="_blank">cross-functional program groups&lt;/a> in place, and the activities reported here make it evident that our team members, both existing and new, are firing on all cylinders.&lt;/p>
&lt;h3 id="new-staff-and-new-roles">New staff and new roles&lt;/h3>
&lt;p>We welcomed eight new team members in 2025. In February, we welcomed our new Director of Programs &amp;amp; Services, &lt;a href="https://www.crossref.org/people/helena-cousijn">Helena Cousijn&lt;/a>, and a new member of the Technical Support team, &lt;a href="https://www.crossref.org/people/arley-soto">Arley Soto&lt;/a>. In March, we welcomed our new Community Manager for funders, &lt;a href="https://www.crossref.org/people/roc%C3%ADo-gaudioso-pedraza">Rocío Gaudioso Pedraza&lt;/a>. In April, we &lt;a href="https://doi.org/10.64000/6e4f8-3yj41" target="_blank">launched our new Data Science team&lt;/a> by welcoming &lt;a href="https://www.crossref.org/people/jason-portenoy">Jason Portenoy&lt;/a> and &lt;a href="https://www.crossref.org/people/alex-b%C3%A9dard-vall%C3%A9e">Alex Bédard-Vallée&lt;/a>. In November, we welcomed our new DevOps Engineer, &lt;a href="https://www.crossref.org/people/thelma-laryea">Thelma Laryea,&lt;/a> and our new Program Technical Lead for the OSO program, &lt;a href="https://www.crossref.org/people/bharath-govindarajan">Bharath Govindarajan.&lt;/a> In December, we welcomed another member of the Technical Support team, &lt;a href="https://www.crossref.org/people/natali-giorgobiani">Natali Giorgobiani&lt;/a>.&lt;/p>
&lt;p>We also had team members step up into new roles. &lt;a href="https://www.crossref.org/people/dominika-tkaczyk">Dominika Tkaczyk&lt;/a> completed the new leadership team by taking on the Director of Technology role, &lt;a href="https://www.crossref.org/people/paul-davis">Paul Davis &lt;/a>has started his new role as Product Manager, and &lt;a href="https://www.crossref.org/people/michelle-cancel/">Michelle Cancel&lt;/a> has taken on the Head of Human Resources role. And there’s more to come! As next year begins, two team members will step into Program Technical Lead roles: &lt;a href="https://www.crossref.org/people/carlos-del-ojo-elias">Carlos del Ojo Elias&lt;/a> for the CRN program and &lt;a href="https://www.crossref.org/people/patrick-vale">Patrick Vale&lt;/a> for the CCT program. Together with the Program Technical Lead for the OSO program and the Head of Infrastructure Services, these roles will complete the new structure of the technology team. This structure is more closely aligned with how our work is organised and will enable stronger coordination both within and across cross-functional programs.&lt;/p>
&lt;h3 id="supporting-a-thriving-global-culture">Supporting a thriving global culture&lt;/h3>
&lt;p>As our team grows in different aspects within our new &lt;a href="https://www.crossref.org/people/org-chart/">org structure&lt;/a> to meet the needs of the community, we remain committed to supporting a thriving culture through training, conducting regular temperature checks, and organising our annual staff retreat. This year, we continued our work on psychological safety and introduced workshops on giving and receiving feedback and on consensus building. We were able to put some of this training into practice at our in-person all-staff event in Split, Croatia, where we all came together to &lt;a href="https://roadmap.productboard.com/e6fdeba8-a5b3-4aef-8104-d48863ba975e" target="_blank">build our roadmap&lt;/a>.&lt;/p>
&lt;p>We are ending the year with 51 staff in 14 countries and look forward to diversifying and evolving even further as a team in 2026&amp;mdash;we’re currently hiring in UX, Communications, and Membership&amp;mdash;and keep an eye on our &lt;a href="https://www.crossref.org/jobs">jobs&lt;/a> page for forthcoming opportunities in Software, DevOps, Metadata, and Operations!&lt;/p>
&lt;hr>
&lt;p>Thank you to our community of members, partners, board, ambassadors, sponsors, metadata users, service providers, integrators—and of course our team—for making 2025 such a productive year. Together, we&amp;rsquo;re building a richer, more connected research ecosystem for the benefit of society. We can’t wait to continue the work together in 2026.&lt;/p></description></item><item><title>Twenty-five years of Crossref: reflections from the 2025 annual meeting and board election</title><link>https://www.crossref.org/blog/twenty-five-years-of-crossref-reflections-from-the-2025-annual-meeting-and-board-election/</link><pubDate>Wed, 17 Dec 2025 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/twenty-five-years-of-crossref-reflections-from-the-2025-annual-meeting-and-board-election/</guid><description>&lt;p>Crossref turned twenty-five this year, and our 2025 Annual Meeting became more than a celebration—it was a shared moment to reflect on how far open scholarly infrastructure has come and where we, as a community, are heading next.&lt;/p>
&lt;p>Over two days in October, hundreds of participants joined online and in local satellite meetings in Madrid, Nairobi, Medan, Bogotá, Washington D.C., and London––a reminder that our community spans the globe. The meetings offered updates, community highlights, and a look at what’s ahead for our shared metadata network––including plans to connect funders, platforms, and AI tools across the global research ecosystem.&lt;/p>
&lt;p>Ed Pentz opened with thanks and perspective. He reflected on how it all began: twelve members, one shared goal — to make research easier to find and verify. 25 years later, the same goal underpins 174 million open metadata records, 1.9 billion citation links, and roughly 1.3 billion DOI resolutions each month. What started as reference linking is now a global network of relationships among people, institutions, and research outputs. Ed also reaffirmed &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">the Principles of Open Scholarly Infrastructure (POSI)&lt;/a> as the foundation of our operations and our collaborations with other community-governed infrastructures.&lt;/p>
&lt;blockquote>
&lt;p>“Each number represents shared effort, trust, and long-term commitment,” Ed reminded us. “Open infrastructure works because people keep showing up.”&lt;/p>
&lt;/blockquote>
&lt;p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/ed-intro.png"
alt="Black-and-white road image symbolizing scholarly progress with the words: &amp;amp;quot;To promote the development and cooperative use of new and innovative technologies to speed and facilitate scientific and other scholarly research.&amp;amp;quot;" width="100%">&lt;figcaption>
&lt;p>&lt;em>Crossref&amp;rsquo;s purpose as per the Certificate of Incorporation.&lt;/em>&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Following up Ed’s talk, we showed a video timeline, ‘25 years of Crossref’, tracing milestones from the first DOIs to today’s connected Research Nexus.&lt;/p>
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.2500%;
padding-bottom: 0; box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); margin-top: 1.6em; margin-bottom: 0.9em; overflow: hidden;
border-radius: 8px; will-change: transform;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0;margin: 0;"
src="https://www.canva.com/design/DAG7wb4NXhc/uC4PVxNEY7alr3x16gscSQ/watch?embed" allowfullscreen="allowfullscreen" allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;a href="https:&amp;#x2F;&amp;#x2F;www.canva.com&amp;#x2F;design&amp;#x2F;DAG7wb4NXhc&amp;#x2F;uC4PVxNEY7alr3x16gscSQ&amp;#x2F;watch?utm_content=DAG7wb4NXhc&amp;amp;utm_campaign=designshare&amp;amp;utm_medium=embeds&amp;amp;utm_source=link" target="_blank" rel="noopener">Crossref 25th anniversary timeline&lt;/a>
&lt;h3 id="shared-perspectives-from-the-community">Shared perspectives from the community&lt;/h3>
&lt;p>We featured perspectives from organizations that have built key scholarly infrastructure alongside Crossref over the years. A shared message ran through their talks: open infrastructure only works when it’s interoperable, community-led, and practical for the people who use it.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=ZGOsAXGkryd-LKWy&amp;amp;t=1953" target="_blank">Urooj Nizami (PKP)&lt;/a>&lt;/strong> described PKP and Crossref as “independent and interdependent,” using the archipelago metaphor to show how open software and shared metadata services connect local publishing to a global network.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=wTyUw6m-DBbh9w2D&amp;amp;t=2633" target="_blank">Todd Carpenter (NISO)&lt;/a>&lt;/strong> emphasized standards being a social, and technical contract, noting how persistent identifiers and reliable metadata underpin a broader knowledge graph—and why provenance and linking matter even more as AI systems remix content.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=A9EdjLpip3m-2xrF&amp;amp;t=3233" target="_blank">Abel Packer (SciELO)&lt;/a>&lt;/strong> highlighted Latin America’s strong DOI coverage while pointing out where multilingual versions and preprint–article–data links still break visibility—arguing for metadata that connects versions, not splits them. [data point]&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=_lCBwuJ9T0M37q7O&amp;amp;t=2048" target="_blank">Soichi Kubota (J-STAGE/JST)&lt;/a>&lt;/strong> showed how Crossref services (from citation linking, Cited-by, metadata, to Similarity Check) anchor Japan’s national platform and how deeper cooperation (e.g., Crossmark) will support richer, more reliable metadata.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=wzUIHxHKf2j2J6n0&amp;amp;t=2449" target="_blank">Leena Shah (DOAJ)&lt;/a>&lt;/strong> outlined DOAJ’s open index, renewed POSI commitment, and hands-on collaboration with Crossref—from the MoU and PLACE to help-desk coordination, gap analyses, and plans to boost DOAJ records via Crossref’s API and open references.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=w5w4IolcyaSI_N04&amp;amp;t=2894" target="_blank">Susan Murray (AJOL)&lt;/a>&lt;/strong> spoke of capacity building: with 900+ journals across 40 countries, benefiting from AJOL’s support in registering identifiers and metadata , and of their long-standing partnership with Crossref making it possible for journals with limited resources to take part.&lt;/p>
&lt;p>These voices echoed a common call: Build bridges, not silos.&lt;/p>
&lt;h3 id="governance-and-election-results">Governance and election results&lt;/h3>
&lt;p>Leading off the formal annual meeting, &lt;a href="https://youtu.be/csDj_YkNuG8?si=Gcjw0XERzNc45pRQ&amp;amp;t=3826" target="_blank">Lisa Schiff, Chair of the Crossref Board, looked back on our 25th anniversary&lt;/a> as one marked by progress and problem-solving. She talked about moving all our systems to the cloud—a big step that makes the organization’s work faster and more reliable. She also spoke about ongoing efforts to maintain the research record&amp;rsquo;s trustworthiness, including adding Retraction Watch data and updating member terms. Lisa noted new ways we are making membership more accessible, &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">like the lower $200 tier&lt;/a> and the expansion of &lt;a href="https://www.crossref.org/gem/">the GEM program&lt;/a>.&lt;/p>
&lt;p>Lucy Ofiesh brought it back to the role of the members themselves, reminding everyone that success still rests with its members. The annual meeting is when members directly influence Crossref’s direction––when each vote helps shape how we move forward together.&lt;/p>
&lt;p>We extend our thanks to the Board members whose terms have concluded, and we congratulate the newly elected members who will carry the work forward.&lt;/p>
&lt;p>Five directors were elected: Rebecca Wambua (Distance, Open and e-Learning Practitioners’ Association of Kenya), Damian Bird (CABI), Rose L’Huillier (Elsevier), Anjalie Nawaratne (Springer Nature), and Nick Lindsay (MIT Press).&lt;/p>
&lt;p>&lt;a
href="https://youtu.be/csDj_YkNuG8?si=VdlglVWW2n7HJzP-&amp;t=4406"
style="display:block; text-align:center;">
&lt;img
src="https://www.crossref.org/images/blog/2025/new board members 2026.jpg"
style="width:75%;"
alt="Welcome! Newly elected board members with images of Rebecca Wambua (Distance, Open and e-Learning Practitioners’ Association of Kenya), Damian Bird (CABI), Rose L’Huillier (Elsevier), Anjalie Nawaratne (Springer Nature), and Nick Lindsay (MIT Press)"
/>
&lt;/a>&lt;/p>
&lt;p>We also thank the 2025 Nominating Committee for their thoughtful work guiding this year&amp;rsquo;s process and slate selection.&lt;/p>
&lt;p>The Board plays an important role in making sure our governance remains community-led, transparent, and accountable. The volunteer members bring experience from research funders, publishers, and libraries, giving a balance of perspectives that help steer our long-term strategy and sustainability.&lt;/p>
&lt;h3 id="tools-in-practice">Tools in practice&lt;/h3>
&lt;p>Then our attention turned to the tools that many members use every day. Patrick Vale walked participants through updates to Participation Reports and the Record Registration Form— designed to make working with metadata simpler.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/prep-la-salle.png"
alt="Screenshot of a participation report for Universidad La Salle Arequipa in Peru, showing percentages per metadata element." width="100%">&lt;figcaption>
&lt;p>Updated Participation Report for Universidad La Salle Arequipa (Peru), showing metadata element coverage percentages.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>&lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a>, first launched in 2018, have now been completely rebuilt as version 1.2. &lt;a href="https://doi.org/10.64000/8d5ga-2n897" target="_blank">The refreshed interface&lt;/a> runs on a new technology stack and supports morecontent types, and offers a new “download gap report” feature that generates a CSV list of records missing key fields—so members can identify and fix gaps directly.&lt;/p>
&lt;p>Patrick then demonstrated improvements to the Record Registration Form, now streamlined for creating as well as editing records. The form includes real-time validation, auto-fill options for journals previously used, and the ability to edit existing records directly. Members can now easily add abstracts, funding data, licenses, and affiliations linked to ORCID and ROR—all within one place.&lt;/p>
&lt;p>In the final demonstration, Luis Montilla, shared a “short research story”. He showed how anyone can explore Crossref metadata to uncover global participation patterns—turning what might seem like a mass of disconnected records into something meaningful once you start asking questions. He also shared a workflow that automatically retrieves and enriches data with country and regional information, then visualises member contributions and metadata coverage.&lt;/p>
&lt;p>Luis also demonstrated an interactive notebook that lets users explore participation trends through radar charts and other visuals—illustrating how open data can help the community understand and improve the completeness of the scholarly record.&lt;/p>
&lt;h3 id="crossref-then--now">Crossref then &amp;amp; now&lt;/h3>
&lt;p>Amanda Bartell walked through how the community has changed over 25 years.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/summary-growth-over-years-2025.png"
alt="Image of statistics" width="100%">
&lt;/figure>
&lt;p>The membership has broadened dramatically: universities and scholar-led groups now form the largest share, and more organizations in Asia and Latin America have joined (with big growth in Indonesia and Brazil). Most members are small: 98% qualify for the lowest fee tier, and 57% participate via a Sponsor. In support of including members from smaller economies, Crossref launched a GEM programme, which will be expanding to 19 new countries in 2026.&lt;/p>
&lt;p>She expanded her presentation later &lt;a href="https://doi.org/10.64000/tch5n-9px70" target="_blank">with a blog post to share insights about the changes in the Crossref global community&lt;/a>.&lt;/p>
&lt;p>With our growing membership, the needs of the community are evolving too, including expectations about Crossref’s role in preserving the integrity of the scholarly record.&lt;/p>
&lt;blockquote>
&lt;p>“Our role in preserving the integrity of the scholarly record is focused on enriching the metadata to provide fuller and better trust signals while keeping barriers to participation low.” —Amanda Bartell, Crossref&lt;/p>
&lt;/blockquote>
&lt;p>In response to the growing membership across the globe, we launched our Ambassadors program in 2018. Johanssen Obanda highlighted the activities of what is now 50 volunteers across 38 countries. Ambassadors act as local contacts—running training sessions, organizing events, translating materials, and providing feedback from their regions. Over the past year, they’ve led 41 activities reaching around 1,200 people. Many also contribute to GEM outreach, metadata health checks, and regional events—often in local languages.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/map-ambassadors-2025.png"
alt="Slide titled &amp;amp;quot;Ambassador highlights: supporting GEM program&amp;amp;quot; with left-side collage of conference photos and a world map of Crossref Ambassadors with location pins." width="100%">
&lt;/figure>
&lt;h3 id="roadmap-highlights">Roadmap highlights&lt;/h3>
&lt;p>Helena Cousijn outlined progress across three programs—Co-creation and Community Trends, Contributing to the Research Nexus, and Open and Sustainable Operations.
Along with already showcased progress with Participation Reports and the new Record Registration Form, the Community Trends program involves working in partnership with others on DSpace integration and OJS plug-ins consolidation. In the near future there&amp;rsquo;s also a consideration for piloting AI detection tools.
The Contributing to Research Nexus program carried out a consultation with Metadata Plus subscribers, and develops a new data citations endpoint for the Crossref REST API. This team is also developing further matching services, in the first instance looking to match funder metadata to ROR IDs.&lt;/p>
&lt;p>Finally, Helena discussed the recent accomplishment of the Open and Sustainable Operations program, the migration of our database from the data centre to the cloud with Amazon Web Services. Other projects in this program involve ravamping resolution reports, rebuilding the Crossref authentication system, and launching new metadata schema.&lt;/p>
&lt;h3 id="resourcing-crossref-for-future-sustainability-rcfs">Resourcing Crossref for Future Sustainability (RCFS)&lt;/h3>
&lt;p>RCFS program is focused on equity, simplicity, and revenue balance. Kora shared recent developments and next steps:
:
A new $200 membership tier (for organizations with ≤$1,000 in publishing revenue/expenses) takes effect on January 1, 2026; more than 3,000 members have already moved into it.
We will keep “publishing revenue/expenses” as the sizing basis for publishers while funder sizing is still under review.
Volume discounts for content registration end on January 1, 2026.
Backfile discounts for theses/dissertations and conference proceedings are under review.
Peer-review fees are normalized at $0.25 for the first review of a work, with subsequent reviews (same member, same work) for free&lt;/p>
&lt;h3 id="behind-the-scenes-metadata-data-science">Behind the scenes: metadata, data science&lt;/h3>
&lt;p>Patricia Feeney reviewed recent and upcoming changes to our metadata schemas. Earlier this year, we began accepting ROR IDs as funder identifiers and released schema 5.4, which added versioning across all record types, a new status field for preprints, and a way to label citation types (like data sets, software, or blog posts).&lt;/p>
&lt;p>Coming soon, Crossref will add grant DOIs to funding metadata and release schema 5.5, which supports the CRediT contributor vocabulary and allows multiple contributor roles. A new grant schema will follow, including support for beneficiaries, project identifiers (like RAiD), and repeatable roles. Looking ahead to 2026, our plans to overhaul how names and organizations are modeled, add richer funding and data-availability statements, and expand abstract and multilingual metadata support. A new Metadata Advisory Group has also been formed to guide work on multilingual fields, subjects, keywords, and relationship modeling.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/ror-grant-schema.png"
alt="Slide explaining that ROR can be supplied as a funder identifier, and listing updates to the Grants schema and schema 5.4." width="100%">
&lt;/figure>
&lt;p>Finally, Patricia announced &lt;a href="https://doi.org/10.64000/zppnw-1se30" target="_blank">plans to deprecate older schemas&lt;/a>—a gradual, multi-year process—to simplify and modernize our metadata structure. She highlighted the importance of stronger relationships, richer records, and practical improvements that make metadata more useful across the community. That focus on connection carried directly into the next session about building through data science.&lt;/p>
&lt;h3 id="data-science-at-crossref">Data science at Crossref&lt;/h3>
&lt;p>Dominika Tkaczyk introduced the new data science team, formed a few months ago as part of the technology group. The team was created because of the growing scale and complexity of the data Crossref manages, driven by the expanding scholarly community. Their role is to use data science to assess, improve, and enrich scholarly metadata.&lt;/p>
&lt;p>Their work falls into two areas: data analysis and insights—to help Crossref understand the scholarly record and guide decisions—and data services and workflows—to apply data science in building and maintaining production systems. Examples include studying overlap between scholarly databases and improving metadata quality. The session then focused on two projects: creating an internal data processing environment and developing metadata matching services.&lt;/p>
&lt;p>Alex Bédard-Vallée described the team’s first project: building a data lake to bring together fragmented data from different systems. Previously, data were split across silos like the REST API, internal logs, and production databases. It enables tracking of reference deposits, closing 718M citation gaps. The system already enables analyses that were previously impractical, such as tracking how many members include reference metadata in deposits. It will also power new dashboards, monitoring tools, and other data-driven initiatives that support the integrity of the scholarly record.&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/two-flavors.jpg"
alt="Slide summarizing recent data science work at Crossref, including metadata analysis and matching services." width="75%">
&lt;/figure>
&lt;br>&lt;/p>
&lt;p>Jason Portenoy then outlined the metadata matching project, which links pieces of information (like citations, funder names, or affiliations) to their identifiers such as DOIs or ROR IDs. He gave examples including reference-to-DOI, funder-to-ROR ID, affiliation-to-ROR ID, grant-to-DOI, and preprint-to-published-article matching.&lt;/p>
&lt;p>He explained that much metadata is already deposited by members but large gaps remain. For example, among more than a billion citation links, about 843 million already include DOIs, while another 718 million references can’t yet be matched. The goal is to close these gaps to build a more complete and connected scholarly record—the “research nexus.”&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/matching-why-bother.png"
alt="Two “Matching — Why bother?” slides with pie charts showing gaps in DOI and ROR ID metadata. Each chart highlights deposited IDs, automatically matchable items, and items with no identifier. Crossref 25th logo included." width="100%">
&lt;/figure>
&lt;h3 id="community-highlights">Community highlights&lt;/h3>
&lt;p>Martyn Rittman, Program Lead, and Kora each opened the community highlights over the two days by noting that everyone presenting is sharing how they use metadata and contribute to the broader ecosystem.&lt;/p>
&lt;blockquote>
&lt;p>Crossref does not exist without our members and the broader community—people who provide metadata and people who use the metadata. That’s why we’re here.” ~ Martyn Rittman&lt;/p>
&lt;/blockquote>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=m_bOdQ0UrDekhHCd&amp;amp;t=8158" target="_blank">Antoine Drouin (Fonds de Recherche du Québec)&lt;/a>&lt;/strong> shared that FRQ joined Crossref earlier this year and created 22,000+ grant and scholarship DOIs, linking grants to outputs and improving interoperability with ORCID, ROR, and Crossref grant IDs.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=l9euyQIB8tcIrDR5&amp;amp;t=8928" target="_blank">Agon Memeti (University of Tetova)&lt;/a>&lt;/strong> shared findings of his analysis of abstract metadata coverage across 2024 articles from 13 university journals.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=AQ0Y6pDEUY2saoj3&amp;amp;t=9528" target="_blank">Charlie Rapple (Kudos)&lt;/a>&lt;/strong> presented a Crossref-supported study on how researchers engage with the UN SDGs and described Kudos’ work explaining research for wider audiences. A survey of ~4,500 researchers showed strong awareness, regional differences in SDG priorities, and some targeted budgets for promotion, alongside challenges in publishing SDG-focused local research in prestige venues.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=fg9M2VzEzeZtyUc&amp;amp;t=10264" target="_blank">Pia Kretschmar (SCOAP3)&lt;/a>&lt;/strong> outlined integrating Crossref metadata into new SCOAP³ open science elements in Phase 4; SCOAP³ funds OA publishing in high-energy physics and has covered 78,000+ articles. Publishers are scored on elements such as metadata provision to Crossref, identifiers, and links to datasets/software; completeness was checked via the Crossref API, results varied, and evaluation continues next year.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/csDj_YkNuG8?si=fBmVBqAXG0Uy0ULr&amp;amp;t=10923" target="_blank">Barbara Rivera (Barcelona Declaration)&lt;/a>&lt;/strong> introduced the Declaration, its four commitments, and its community of 125 signatories and 52 supporters, including Crossref. Working groups are executing a joint roadmap, with recent actions such as a funding-metadata roundtable and upcoming surveys on metadata frameworks and repository workflows.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=WybCBaSxzh-Wnoa3&amp;amp;t=7753" target="_blank">Hans de Jonge (Dutch Research Council, NWO)&lt;/a>&lt;/strong> presented his and Bianca Kramer’s &lt;a href="https://doi.org/10.31222/osf.io/ndx3f_v1" target="_blank">recent study (as of 10/23/25 Preprint, not yet reviewed)&lt;/a> of metadata completeness in Crossref among publishers using different manuscript submission systems. They compared six metadata types across major publishers and found that differences had more to do with workflow choices, customization, and policy than with the system itself.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=DIs2W6wIgUHHYy2h&amp;amp;t=8350" target="_blank">Audrey Kenni (Pan African Medical Journal)&lt;/a>&lt;/strong> shared PAMJ’s journey with Crossref to increased visibility.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=8LT4O1xOYA9W1Hva&amp;amp;t=8721" target="_blank">Nurul Ain Mohd Noor (UMT Press, Malaysia)&lt;/a>&lt;/strong> described UMT Press’s evolution since 2003, rebranding in 2007 and joining Crossref in 2020. Nurul explained how registering their metadata with Crossref increased citation visibility and indexing across databases.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=BzmyKx8i0g4SYjPl&amp;amp;t=9123" target="_blank">Achal Agrawal (PostPub)&lt;/a>&lt;/strong> introduced PostPub’s dashboard providing retraction statistics by country and institution, supported by a Catalyst Grant from Digital Science, and shared their journey through disambiguation challenges.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://youtu.be/0NPqLrPHhYA?si=sOPvYU5VV5p_bL5H&amp;amp;t=9641" target="_blank">Ratna Galuh Manika Trisista (Universitas Islam Jakarta)&lt;/a>&lt;/strong> presented how enabling reference linking transformed her law journal’s citation visibility.&lt;/p>
&lt;h3 id="closing-reflections">Closing reflections&lt;/h3>
&lt;p>We closed the meeting with a panel discussion on the &lt;strong>Research Nexus in the real world&lt;/strong>: What is the impact and potential of open scholarly metadata. Ginny Hendricks, Crossref; Dominika Tkaczyk, Crossref; Bianca Kramer, Barcelona Declaration on Open Research Information; David Oliva Uribe, UNESCO; Amber Osman, XploreOpen; Mariángela Nápoli, CONICET-IICE UBA-FFYL; Crossref; Kazuhiro Hayashi, National Institute of Science and Technology Policy; Science Council of Japan, shared a diversity of perspectives, which we’ll share in an upcoming blog.&lt;/p>
&lt;p>You can also learn more about the in-person satellite events across the world from &lt;a href="https://community.crossref.org/t/celebrating-crossref-s-25th-anniversary-at-our-annual-meeting-satellite-event-highlights/14959" target="_blank">their organisers on our Community Forum&lt;/a>.&lt;/p>
&lt;p>You will find outputs from #Crossref2025 on our website, which you can cite as `#Crossref2025 Annual Meeting and Board Election, 22-23 October 2025 retrieved [date], &lt;a href="https://doi.org/10.13003/431937misogo" target="_blank">https://doi.org/10.13003/431937misogo&lt;/a> &amp;lsquo;.&lt;/p></description></item><item><title>Wellcome and Europe PMC: supporting Open Research through open metadata</title><link>https://www.crossref.org/blog/wellcome-and-europe-pmc-supporting-open-research-through-open-metadata/</link><pubDate>Mon, 15 Dec 2025 00:00:00 +0000</pubDate><author>Rocío Gaudioso Pedraza</author><guid>https://www.crossref.org/blog/wellcome-and-europe-pmc-supporting-open-research-through-open-metadata/</guid><description>&lt;p>In my latest conversations with research funders, I talked with Hannah Hope, Open Research Lead at Wellcome, and Melissa Harrison, Team Leader of Literature Services at Europe PMC. Wellcome and Europe PMC are working together to realise the potential of funding metadata and the Crossref Grant Linking System for, among other things, programmatic grantee reporting. In this blog, we explore how this partnership works and how the &lt;a href="https://www.crossref.org/blog/wellcome-explains-the-benefits-of-developing-an-open-and-global-grant-identifier/" target="_blank">Crossref Grant Linking System is supporting Wellcome&lt;/a> in realising their Open Science vision.&lt;/p>
&lt;h3 id="what-motivated-you-to-join-crossref">What motivated you to join Crossref?&lt;/h3>
&lt;p>Hannah: The motivation for Crossref Grant IDs is to be able to disaggregate research outputs between funders. Funders’ grant identifiers come in a range of formats, funders might change them over time, and there are also similarities between funders’ names, which is a challenge. Permanent identifiers, in this case, Crossref Grant IDs, are an opportunity to avoid some of the confusion if we were able to implement them throughout the research ecosystem.&lt;/p>
&lt;p>This is also being discussed in different contexts, for example, within the Barcelona Declaration working groups, &lt;a href="https://barcelona-declaration.org/news/20251023_community_roundtable/" target="_blank">funders and other stakeholders&lt;/a> are exploring the diverse motivations that exist to implement changes into our workflows, as well as the challenges that funding metadata and persistent grant IDs can help solve.&lt;/p>
&lt;h3 id="the-way-wellcome-implemented-the-grant-linking-system-is-a-bit-unique-given-that-it-partnered-with-europe-pmc-for-the-technical-implementation-and-metadata-registration-with-crossref-can-you-tell-us-more-about-how-it-works">The way Wellcome implemented the Grant Linking System is a bit unique, given that it partnered with Europe PMC for the technical implementation and metadata registration with Crossref. Can you tell us more about how it works?&lt;/h3>
&lt;p>Hannah: The collaboration between Wellcome and Europe PMC in the implementation of Crossref’s Grant Linking System started because they already had the grants &lt;a href="https://europepmc.org/grantfinder/grantdetails?query=pi%3A%22%7BDr%7D%7BFritz%7D%7BZoe%7D%7BZ%7D%22%20gid%3A%22208213%22%20ga%3A%22Wellcome%20Trust%22" target="_blank">landing page feature&lt;/a> ready and available to us.&lt;/p>
&lt;p>There was an initial hope that other funders of Europe PMC, which also have these grant landing pages, could then leverage that same system to make Crossref grant IDs more broadly available to the research community, but I am not sure if that has materialised yet.&lt;/p>
&lt;p>Melissa: Currently we are supporting Wellcome’s implementation of Crossref grant IDs, but the infrastructure remains available to other Europe PMC funders should they decide to take advantage of it. We already have funding metadata for Europe PMC funders because it is a requirement for grantees to select their grant identifier when submitting their accepted manuscripts for indexing and archiving. As we already have that metadata, naturally we can pull it together and send it to Crossref, along with the link to the Europe PMC grant landing pages!&lt;/p>
&lt;p>An additional benefit of partnering with Europe PMC is the comprehensive metadata we deliver to Crossref with the grant IDs. For example, we have invested in supplementing affiliation data with ROR iDs and we deliver to Crossref all the data we have that matches their schema for grant data.&lt;/p>
&lt;h3 id="how-is-wellcome-leveraging-the-funding-metadata-and-crossref-grants-ids-that-are-being-shared-and-registered-with-crossref">How is Wellcome leveraging the funding metadata and Crossref grants IDs that are being shared and registered with Crossref?&lt;/h3>
&lt;p>Hannah: We are discussing internally how we can better socialise the Crossref grant DOIs among the grantees, either via our grant management system or through Europe PMC. One place where the Crossref grant DOIs are being used and shared is through our publishing platform, &lt;a href="https://wellcomeopenresearch.org/" target="_blank">Wellcome Open Research&lt;/a>. The Crossref grant DOI is included in the publication metadata, ensuring that the research output is linked to the funding via the open metadata registered.&lt;/p>
&lt;p>However, as we use Europe PMC as our repository for funded written research outputs, these outputs are aggregated alongside the grant records which includes the Crossref grant DOI, facilitated by Europe PMC APIs. So we have the means to link the two things together.&lt;/p>
&lt;p>Melissa: There are some UX and technical blockers to fully integrate Crossref grant IDs within the Europe PMC grant system currently that are detrimental to the utility of these IDs, for example, you can’t search for a specific grant in &lt;a href="https://europepmc.org/grantfinder" target="_blank">Europe PMC grant finder&lt;/a> using a Crossref grant ID. We are partnering with Crossref to solve these challenges and offer users more functionality in this space next year.&lt;/p>
&lt;p>Hannah: Beyond eLife and Wellcome Open Research, I am not sure which publishers use Crossref grants DOIs in their workflows.&lt;/p>
&lt;p>Rocio: That’s an interesting question, as we aren’t seeing a massive flow of Crossref grant IDs in the works metadata records just yet. We are exploring with publishers and their service providers how to make this business-as-usual, and in the meantime, we are running a series of matching projects to ensure that, when possible, we &lt;a href="https://www.crossref.org/blog/piecing-together-the-research-nexus-uncovering-relationships-with-open-funding-metadata/" target="_blank">make those connections ourselves to enrich the metadata with funding information&lt;/a>. We already insert reciprocal relationships where one record asserts a link with another (in this case, where either a grant &lt;code>Finances&lt;/code> a work or a work &lt;code>isFinancedBy&lt;/code> a grant record, Crossref adds in the reverse). Improving and enriching these relationships directly in the metadata makes sure that metadata provided by funders can make their way to the research outputs that originate from the grant.&lt;/p>
&lt;h3 id="wellcome-is-streamlining-the-way-of-asking-grantees-to-report-on-their-publications-facilitated-by-europe-pmc-can-you-tell-us-a-bit-more-about-how-this-will-work-and-what-role-metadata-will-play">Wellcome is streamlining the way of asking grantees to report on their publications, facilitated by Europe PMC. Can you tell us a bit more about how this will work and what role metadata will play?&lt;/h3>
&lt;p>Hannah: We will stop asking researchers to report their publications directly to us as part of progress and end-of-grant reporting. We believe there is sufficient open metadata with high-quality tagging in the ecosystem for us to collect written research outputs programmatically from this public data. Under our new system, we will be directing researchers to look at their grant record within Europe PMC and make sure that their written research outputs are properly linked there; otherwise, we won’t see them. We are trying to leverage open data, existing infrastructure, and a route that enables us to improve the completeness of open metadata.&lt;/p>
&lt;p>There aren’t many mechanisms that enable our researchers to add assertions to funding and research output records retrospectively, and Europe PMC offers us that opportunity, and that is really critical for us. Rather than collecting information in our own system, we can contribute to enhancing the global corpus of knowledge and the quality of open metadata more broadly. Since correcting metadata at source isn’t easy, Europe PMC presents us with an opportunity to contribute to that system.&lt;/p>
&lt;p>Melissa: We are thinking broadly about this problem; many institutions curate their research information in spreadsheets or closed CRIS systems and struggle to make it publicly available. We are thinking about how Europe PMC can be leveraged to be a public home for that data. EMBL-EBI hosts Europe PMC and utilises it as the institutional repository, so we have started a pilot project to add ROR IDs for affiliations to EMBL-authored publications within Europe PMC. This is manually curated, high-quality metadata that would otherwise be lost from the public ecosystem.&lt;/p>
&lt;h3 id="if-you-look-into-the-future-what-would-your-hopes-be-for-the-gls-and-greater-transparency-in-funding-metadata-in-general-what-do-you-think-that-we-could-achieve-collectively-as-a-community">If you look into the future, what would your hopes be for the GLS and greater transparency in funding metadata in general? What do you think that we could achieve collectively as a community?&lt;/h3>
&lt;p>Melissa: It would be amazing (!) if everybody, from funders to publishers, to institutions and authors, would coalesce around the Crossref Grant Linking System, and add to metadata exchange workflows – you would potentially have a very clean and clear picture of where the money is going, what the outputs are, and how they relate.&lt;/p>
&lt;p>Currently, even with the Open Funder Registry, there is ambiguity around funder names - for example, different geographical national funders sharing the same exact name as their counterpart in another country - so even with the best will in the world, funder institutions could be misidentified in systems and assigned the wrong identifier. The Crossref Grant Linking System facilitates complete disambiguation because grants are associated with the issuing funder’s correct identifier, ensuring traceability of outputs and funding and enabling more precise, cleaner metadata.&lt;/p>
&lt;p>Hannah: I think that is a bit of the Holy Grail and in reality, its a bit messy, there isn’t just one system! We need to be able to move past the chicken and egg discussion, where we talk about the use of different identifiers, with sometimes competing priorities. For me, the real challenge for the metadata community is how do we enrich metadata, correct errors, and develop greater interoperability between PID systems. So that multiple parties can contribute towards the creation of a greater whole record, rather than relying on a single owner of the record to provide all the information. If we could all, funders included, connect information from individual partners to create a unified record at the end of it, we could have better records and probably save time by distributing the workload.&lt;/p>
&lt;h3 id="what-would-you-say-to-colleagues-in-other-funders-about-investing-in-open-metadata">What would you say to colleagues in other funders about investing in open metadata?&lt;/h3>
&lt;p>We all need information from other partners in the ecosystem and investing in our own internal system &lt;strong>will not give us the same return as collectively investing in opening up that information&lt;/strong> wherever possible.&lt;/p>
&lt;p>—&amp;mdash;-&lt;/p>
&lt;p>We are very grateful to Hannah Hope and Melissa Harrison for their perspectives on open funding metadata and the role of the community in ensuring a complete and comprehensive Research Nexus.&lt;/p></description></item><item><title>Some things are big because they are small – the new fee tier for Crossref members takes effect</title><link>https://www.crossref.org/blog/some-things-are-big-because-they-are-small-the-new-fee-tier-for-crossref-members-takes-effect/</link><pubDate>Thu, 11 Dec 2025 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/some-things-are-big-because-they-are-small-the-new-fee-tier-for-crossref-members-takes-effect/</guid><description>&lt;p>&lt;a href="#version-in-espa%c3%b1ol">Haz clic aquí para ver la versión en español&lt;/a>&lt;/p>
&lt;p>In January 2026, our new annual membership fee tier takes effect. The new tier is US$200 for member organisations that operate on publishing revenue or expenses (whichever is higher) of up to US$1,000 annually. We &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">announced the Board’s decision&lt;/a>, making it possible in July, and––as you can infer from &lt;a href="https://doi.org/10.64000/tch5n-9px70" target="_blank">Amanda’s latest blog&lt;/a>––this is the first such change to the annual membership fee tiers in close to 20 years!&lt;/p>
&lt;p>The new fee tier resulted from the consultation process and fees review undertaken as part of the &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">Resourcing Crossref for Future Sustainability program&lt;/a>, carried out with the help of our &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees Committee&lt;/a> (made up of representatives from member organisations and community partners). The program is ongoing, and the new fee tier, intended to make Crossref membership more accessible, is one of the first changes it helped us determine.&lt;/p>
&lt;p>When our membership renewal invoices are sent out in January 2026, the new fee tier will apply to 3,194 of our existing members, who will receive annual membership invoices 27% lower than previously. Surveys preceding the introduction of the new fee tier have shown that it might be applicable to between 30-60% of the organisations in what used to be our lowest fee tier (US$275 fee for organisations with publishing revenue or expenses of up to US$1 million).&lt;/p>
&lt;p>We received positive feedback from members affected by the change.&lt;/p>
&lt;blockquote>
&lt;p>We are very grateful for the new lowest membership fee tier. The Crossref fee is indeed a significant expense for our organisation, but we accept it given its importance. This new fee structure will make it easier for us to cover the cost.” – said Marina Pérez, Análisis Filosófico.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>This initiative by Crossref to reduce membership fees is a welcome step toward achieving a truly global and connected research ecosystem. This will undoubtedly help our journal&amp;rsquo;s mission in fostering inclusive, open, and accessible publishing.” – said Dev Roychowdhury, Journal of Psychological Experience.&lt;/p>
&lt;/blockquote>
&lt;p>Following the feedback provided in the consultations and a number of prompts over the months after the original announcement, our Membership Team gathered information necessary to transition 3,194 members into the new fee tier. That’s 14.5% of all Crossref members (please note that in the graph below the number of members in $200 tier is higher due to recent influx of new members who didn&amp;rsquo;t need to transition, further – &amp;ldquo;$0&amp;rdquo; denotes all our sponsored members, who don’t pay membership fees to us, and those included in the &lt;a href="https://www.crossref.org/gem/">GEM program&lt;/a>).&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/newfee-pie.png"
alt="pie chart showing proportion of Crossref members on each membership fee tier" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Any members out there who think their organisation should be moved to the new lowest membership fee tier and haven’t already informed us – please contact us as soon as possible, before the end of the year, so we can make the change before invoices are raised in January.&lt;/p>
&lt;p>We know – from speaking with our community (and thank you SO MUCH, for everyone’s feedback in surveys and discussions!) that this change makes participation in Crossref more accessible to smaller organisations communicating research. This will result in a continued flow of new records and associated metadata into the &lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">research nexus&lt;/a>, helping us to make it easier to find and assess research, achieve greater transparency in the scientific process, and continue building trust in its outputs.&lt;/p>
&lt;p>We’re not done reviewing our fees, and we don’t think the new fee tier addresses all the needs of the growing and evolving scholarly community. We continue working with Sponsors and Ambassadors, and we have upcoming changes to the Global Equitable Membership program to facilitate participation by all types and sizes of organisations sharing research.&lt;/p>
&lt;h3 id="version-in-español">Version in Español&lt;/h3>
&lt;h2 id="algunas-cosas-son-grandes-porque-son-pequeñas-la-nueva-tarifa-para-los-miembros-de-crossref-entra-en-vigencia">Algunas cosas son grandes porque son pequeñas: la nueva tarifa para los miembros de Crossref entra en vigencia&lt;/h2>
&lt;p>En enero de 2026 entrará en vigencia nuestra nueva tarifa anual. Será de 200 dólares americanos (US$) para las organizaciones miembro que operen con ingresos o gastos editoriales de hasta 1000 US$ al año. &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">Tras anunciar esta decisión de la Junta Directiva&lt;/a>, se hizo realidad en julio y, como se puede inferir &lt;a href="https://doi.org/10.64000/tch5n-9px70" target="_blank">del último blog de Amanda&lt;/a>, este es el primer cambio en las tarifas anuales de membresía en casi 20 años.&lt;/p>
&lt;p>Esta nueva tarifa fue resultado de consultas y revisiones de tarifas que hicimos y que hacen parte del programa de &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">financiación para la sostenibilidad a futuro de Crossref&lt;/a> y que fue elaborada con la ayuda del comité de membresía y tarifas (compuesto por miembros representantes y aliados de la comunidad). El programa sigue en curso y la nueva tarifa, pensada para hacer más accesible la membresía de Crossref, es uno de los primeros cambios que nos ayudó a definir.&lt;/p>
&lt;p>Cuando se envíen las facturas de renovación de membresía en enero de 2026, la nueva tarifa se aplicará a 3.194 de nuestros miembros actuales, quienes notarán que esta será un 27 % más económica que en otros años. Por otro lado, queremos que tengan en cuenta que las encuestas realizadas antes de la introducción de la nueva tarifa demostraron que esta podría ser aplicable a entre el 30 y el 60 % de las organizaciones que anteriormente se encontraban en nuestro nivel de tarifa más bajo (275 US$ para organizaciones con ingresos o gastos de publicación de hasta 1 millón de US$).
Ya hemos recibido retroalimentación positiva de miembros que han sido beneficiados con el cambio:&lt;/p>
&lt;blockquote>
&lt;p>Estamos agradecidos por la nueva tarifa más baja. El costo de Crossref es, sin duda, un gasto significativo para nuestra organización, pero lo aceptamos dada su importancia. Esta nueva estructura de tarifa hará que cubrir el costo sea más fácil.”, dijo Mariana Pérez, de Análisis Filosófico.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>La iniciativa de Crossref de reducir las tarifas de membresía es la bienvenida a lograr un verdadero ecosistema de investigación global y conectado. Sin duda, esto va a ayudar en la misión de nuestra revista de fomentar una publicación inclusiva, abierta y accesible.”, dijo Dev Roychowdhury, del Journal of Psychological Experience.&lt;/p>
&lt;/blockquote>
&lt;p>Siguiendo los comentarios proporcionados en las consultas y una serie de indicaciones a lo largo de los meses posteriores al anuncio original, nuestro equipo de membresías recopiló la información necesaria para trasladar a 3.194 miembros al nuevo nivel de tarifas, lo que representa el 14,5 % de todos los miembros de Crossref (el gráfico a continuación muestra que el número de miembros en el nivel de $200 es mayor debido a la reciente afluencia de nuevos miembros que no necesitaron hacer la transición; además, “$0” denota a todos nuestros miembros patrocinados, que no pagan cuotas de membresía, y a aquellos incluidos en el programa &lt;a href="https://www.crossref.org/gem/">Global Equitable Membership (GEM))&lt;/a>.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/newfee-pie.png"
alt="pie chart showing proportion of Crossref members on each membership fee tier" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Nota: los miembros que consideren que su organización debería pasar a esta nueva tarifa de cuota de membresía y que aún no nos lo hayan comunicado, por favor, contáctenos antes de que termine el año para que podamos hacer el cambio antes de que se emitan las facturas en enero.&lt;/p>
&lt;p>Dicho lo anterior, por medio de las conversaciones que tenemos con nuestra comunidad (y GRACIAS por todos sus comentarios en encuestas y debates), sabemos que este cambio hace que la participación en Crossref sea más accesible para organizaciones pequeñas que comunican investigación. Estamos seguros de que esto promoverá un flujo continuo de nuevos registros y metadatos asociados que sumarán al &lt;a href="https://www.crossref.org/documentation/research-nexus/">nexo de la investigación&lt;/a>, lo que nos ayudará a facilitar la búsqueda y evaluación de la investigación, lograr una mayor transparencia en el proceso científico y seguir construyendo confianza en sus resultados.&lt;/p>
&lt;p>Aún no terminamos de revisar nuestras tarifas y no creemos que este nuevo nivel de tarifas considere todas las necesidades de la comunidad académica, que está en crecimiento y evolución. Seguimos trabajando con nuestros patrocinadores y embajadores y tenemos próximos cambios en el programa GEM para facilitar la participación de organizaciones, de todo tipo y tamaño, que comparten investigación.&lt;/p>
&lt;p>&lt;em>Traducido por: Nicolás Mejía Torres&lt;/em>&lt;/p></description></item><item><title>It's Time: Planning for Metadata Schema Deprecation</title><link>https://www.crossref.org/blog/its-time-planning-for-metadata-schema-deprecation/</link><pubDate>Wed, 10 Dec 2025 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/its-time-planning-for-metadata-schema-deprecation/</guid><description>&lt;p>It has been 18 (!) years since Crossref last deprecated a metadata schema. In that time, we&amp;rsquo;ve released numerous schema versions, some major updates, and some interim releases that never saw wide adoption. Now, with 27 different schemas to support, we believe it&amp;rsquo;s time to streamline and move forward.&lt;/p>
&lt;p>Starting next year, we plan to begin the process of deprecating lightly-used schemas, with the understanding that this will be a multi-year effort involving careful planning and plenty of communication.&lt;/p>
&lt;h2 id="which-schema-will-be-deprecated">Which schema will be deprecated?&lt;/h2>
&lt;p>There are two types of schema used to register content metadata records: a full metadata input schema, which follows the pattern &lt;em>crossrefX.X.X.xsd&lt;/em>, and resource schema, which follows the pattern &lt;em>doi_resourcesX.X.X.xsd&lt;/em>. The resource schema are used to &lt;a href="https://www.crossref.org/documentation/register-maintain-records/maintaining-your-metadata/resource-only-deposit/">append metadata&lt;/a>, such as references or funding data, to an existing metadata record.&lt;/p>
&lt;p>I&amp;rsquo;ve categorized our schemas by usage levels to help prioritize the deprecation process:&lt;/p>
&lt;p>Light usage (planned for initial deprecation):&lt;/p>
&lt;ul>
&lt;li>crossref4.3.1.xsd&lt;/li>
&lt;li>crossref4.3.2.xsd&lt;/li>
&lt;li>crossref4.8.1.xsd&lt;/li>
&lt;li>doi_resources4.3.2.xsd&lt;/li>
&lt;li>doi_resources4.3.4.xsd&lt;/li>
&lt;li>doi_resources4.3.5.xsd&lt;/li>
&lt;li>doi_resources4.4.2.xsd&lt;/li>
&lt;/ul>
&lt;p>Moderate usage:&lt;/p>
&lt;ul>
&lt;li>crossref4.3.3.xsd&lt;/li>
&lt;li>crossref4.3.4.xsd&lt;/li>
&lt;li>crossref4.3.5.xsd&lt;/li>
&lt;/ul>
&lt;p>High usage:&lt;/p>
&lt;ul>
&lt;li>crossref4.3.0.xsd&lt;/li>
&lt;li>crossref4.3.6.xsd&lt;/li>
&lt;li>crossref4.3.7.xsd&lt;/li>
&lt;li>crossref4.4.0.xsd&lt;/li>
&lt;li>crossref4.4.1.xsd&lt;/li>
&lt;li>crossref4.4.2.xsd&lt;/li>
&lt;li>doi_resources4.3.0.xsd&lt;/li>
&lt;li>doi_resources4.3.6.xsd&lt;/li>
&lt;/ul>
&lt;p>We currently support 5 versions of our grants-specific schema and will be working with our funder members to move to new versions of that schema over time - this will follow a different timeline and process as there are fewer schemas to navigate.&lt;/p>
&lt;p>If you don&amp;rsquo;t know which version you&amp;rsquo;re currently using, now would be a good time to check. Many of our members are still using 4.3.0, the earliest supported version.&lt;/p>
&lt;h3 id="why-deprecate-now">Why deprecate now?&lt;/h3>
&lt;ul>
&lt;li>&lt;em>Supporting 27 schema is unsustainable&lt;/em>: Each schema version we maintain adds complexity to our systems and makes it harder to implement improvements that benefit everyone.&lt;/li>
&lt;li>&lt;em>Existing schema need modernization&lt;/em>. Some fundamental elements, like names and titles, need to be modeled differently to fully capture variations in language and usage patterns across different cultures and contexts. We also have too many bespoke record types. Consolidating these will create a simpler, more coherent structure. We may retain certain specialized structures for journal articles and books, but overall, simplification will benefit everyone.&lt;/li>
&lt;/ul>
&lt;p>Most importantly:&lt;/p>
&lt;ul>
&lt;li>&lt;em>Our current requirements are too minimal&lt;/em>. For most record types, we only require a title and publication year. While this low barrier has made registration accessible, it hasn&amp;rsquo;t served metadata quality well. We know you can do better, and we&amp;rsquo;d like to ask for more to improve the richness and utility of Crossref metadata.&lt;/li>
&lt;/ul>
&lt;h3 id="what-happens-next">What happens next?&lt;/h3>
&lt;p>This won&amp;rsquo;t be an abrupt change. We would like to deprecate the schema flagged ‘light usage’ by the end of 2026 and will be reaching out to impacted members early next year. For other schema, we&amp;rsquo;re planning a multi-year effort with clear communication at every stage. We&amp;rsquo;ll provide ample notice before any schema is deprecated, along with migration guidance and support.&lt;/p>
&lt;p>With the exception of recent changes to affiliation metadata, we&amp;rsquo;ve primarily been building on existing schema structures. This means upgrading should be straightforward for most users. As mentioned, we&amp;rsquo;ll judiciously making some breaking changes to names, titles, and requirements, and would like to consolidate schema as we move forward.&lt;/p>
&lt;p>Our goal is to create a more robust, modern metadata framework that better serves the scholarly community while reducing the maintenance burden that comes with supporting decades of schema versions. Stay tuned for more details on timelines and migration paths. In the meantime, if you&amp;rsquo;re unsure which schema version you&amp;rsquo;re using, we encourage you to check your current implementation.&lt;/p></description></item><item><title>Metadata in editorial workflows</title><link>https://www.crossref.org/blog/metadata-in-editorial-workflows/</link><pubDate>Wed, 03 Dec 2025 00:00:00 +0000</pubDate><author>Madhura Amdekar</author><guid>https://www.crossref.org/blog/metadata-in-editorial-workflows/</guid><description>&lt;h2 id="background">Background&lt;/h2>
&lt;p>Scholarly metadata, deposited by thousands of our members and made openly available &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">can act as “trust signals” for the publications&lt;/a>. It provides information that helps others in the community to verify and assess the integrity of the work. Despite having a central responsibility in ensuring the integrity of the work that they publish, editorial teams tend not be fully aware of the value of metadata for integrity of the scholarly record. How can we change that?&lt;/p>
&lt;p>Thousands of publishers and institutions from all over the world, big and small, are Crossref members, providing us rich metadata for their publications. &lt;a href="https://doi.org/10.64000/3b445-2zr32" target="_blank">During our discussion with the community on this topic&lt;/a>, it has surfaced that it is usually the technical or production teams, which interact closely with Crossref, where the appreciation of benefits and value of metadata remain confined.&lt;/p>
&lt;p>Although editors may interact with some aspects of metadata when they screen manuscripts that come their way, it is not evident whether they see metadata as useful for signalling trust. In the last couple of years, we have been specifically engaging with editors, &lt;a href="https://community.crossref.org/t/crossref-at-the-ismte-2025-annual-conference-editors-without-borders-breaking-silos-in-a-technological-world/14375" target="_blank">meeting them&lt;/a>, &lt;a href="https://ease.org.uk/event/ease-germany-webinar-metadata-research-integrity-and-reproducibility/" target="_blank">speaking to them&lt;/a>, and &lt;a href="https://www.csescienceeditor.org/article/scholarly-metadata-as-trust-signals-opportunities-for-journal-editors/" target="_blank">writing for them&lt;/a> on this topic. As next steps in this effort, we are now keen to engage with the diverse editorial community to understand where metadata fits in their workflows, and to identify opportunities for providing visibility to the importance of rich metadata.&lt;/p>
&lt;p>To get a better grasp on this subject, I reached out to Christine Ferguson, to share her rich experience across many editorial roles with me, and to try and paint a better picture of the mutual gaps in understanding when it comes to publication metadata. Here’s what we discovered about the different editorial roles and some ideas for how Crossref might better engage with editors.&lt;/p>
&lt;h2 id="we-know-that">We know that&lt;/h2>
&lt;p>Our members come in all shapes and sizes, and that is also reflected in the diversity of editorial functions that may exist within their organisations. Some of our publishing members have editorial staff whose role is to screen submissions, which includes checking them to make sure that the manuscripts are formatted correctly, and have all the required information e.g. on ethics approvals, or ORCIDs (Open Researcher and Contributor ID) of authors. They then pass these manuscripts on to an external or an academic editor, who is usually a subject matter expert and is responsible for the editorial oversight of the content, to manage the rest of the peer review process, such as assessing the novelty and scope of the work, inviting and securing reviewers, and making a final decision on the manuscript. The academic editors make up a vast majority of the editorial community, variously serving as the editor-in-chief, section editors, and members of the editorial board. They usually volunteer their time as an editor, while having another primary job function.&lt;/p>
&lt;p>Other publishers may have in-house editors who are subject matter experts themselves and manage the peer review process. Manuscripts can come to these editors after initial checks have been performed on them or the editors may also perform these checks, following which selected manuscripts undergo the peer review process.&lt;/p>
&lt;p>Production editors assume responsibility for the manuscripts that are accepted. Their role is to make the manuscript production and publication ready, often liaising with the authors to finalise the formatting, and finally assigning it to an issue.&lt;/p>
&lt;p>Then there are editorial roles that may be a combination of one or more of the above. The size and operational structure of an organisation may determine how editorial and other responsibilities are delegated within the organisation. For some of our medium or smaller members, it may be that the same individual or team is responsible for one or more tasks related to assessing the scientific content of the manuscript, managing the peer review process, as well as being in charge of the post-production workflows such as registering metadata with Crossref.&lt;/p>
&lt;p>There are also emerging publishing workflows involving solicited peer-reviews of preprints or other types of works, which sometimes retain a form of editorial oversight.&lt;/p>
&lt;p>In summary, editorial roles and responsibilities may vary quite a lot within our member organisations and we have less clarity about editorial roles and responsibilities within member organisations.&lt;/p>
&lt;p>All of these different flavors of editors also interact with metadata at various stages in their workflows. For example, the title of the manuscripts, names of authors, whether they have ORCIDs and what is reflected in their ORCID records, and the abstracts may be used to assess the novelty and integrity of the work under consideration. The names of authors, especially if they are not known personally to the editor, can be verified in part by an ORCID check, ensuring the individuals exist, are affiliated to the organisations as claimed, that they have the relevant expertise to write or contribute to the manuscript, and to be able to find what they have written previously on the subject.
Making sure that whether all or some of the authors (e.g. the corresponding author) have provided their ORCIDs, or if the link to where the dataset has been deposited in a repository resolves correctly, is usually a part of the pre-screening or post-acceptance checklists. &lt;a href="https://doi.org/10.64000/xxwy3-xhf38" target="_blank">As our recent metadata awardee, ASM has highlighted&lt;/a> that having this metadata can be hugely beneficial during the peer-review management process, such as for identifying conflicts of interest, to ensure data policy compliance, and even for carrying out systematic analyses.&lt;/p>
&lt;h2 id="wed-like-to-know-more-about">We’d like to know more about…&lt;/h2>
&lt;p>whether all editors interact with metadata in their workflows, and whether they are sufficiently informed about the power of rich metadata. It is evident that there is a lot of diversity in editorial roles and functions. Editors, whether they are mostly concerned with scientific content or with the manuscript peer-review process, are closely connected to the researcher community and the latest research topics and trends. By virtue of this, they are in an excellent position to ascertain the important metadata elements most relevant in their scholarly community. If we have a better understanding of how editors are using metadata in their workflows, we’d be able to identify specific opportunities for engaging with this key community to create greater recognition of the role of metadata in preserving the integrity of the scholarly record.&lt;/p>
&lt;p>What we have in mind is to engage systematically with editorial community members and understand from them how, where, and which metadata are they using in their workflows. We’d like to do so by talking to editors who represent different Crossref members, perhaps in small groups, where participants will be able to share which metadata elements they interact with. We’d also like to share with them information about the use of metadata for research integrity. We’d like to understand whether they have been leveraging metadata in this context and the relevance of this information for them. Via this exercise, we hope to pick out some commonalities about the use of metadata in editorial workflows. Ultimately, we’d like to use this information to create resources that can be used for educating editors (and ultimately the researchers who submit their work for publication) about the importance of metadata, especially in signalling trust and preserving the integrity of the scholarly record.&lt;/p></description></item><item><title>Crossref members over the years: a journey through space and time</title><link>https://www.crossref.org/blog/crossref-members-over-the-years-a-journey-through-space-and-time/</link><pubDate>Wed, 26 Nov 2025 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/crossref-members-over-the-years-a-journey-through-space-and-time/</guid><description>&lt;p>Crossref was created back in 2000 by 12 forward-thinking scholarly publishers from North America and Europe, and by 2002, these members had registered 4 million DOI records. At the time of writing, we have over 23,600 members in 164 different countries. Half of our members are based in Asia, and 35% are universities or scholar-led. These members have registered over 176 million open metadata records with DOIs (as of today). What a difference 25 years makes!&lt;/p>
&lt;p>In our 25th anniversary year, I thought it would be time to take a look at how we got here. And so—hold tight—we’re going to go on an adventure through space and time&lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>, stopping every 5 years through Crossref history to check in on our members. And we’re going to see some really interesting changes over the years.&lt;/p>
&lt;h2 id="2005">2005&lt;/h2>
&lt;p>Let’s go back twenty years to 2005. Crossref has been running for five years, and at this point, we have just 318 members from 31 countries, with 18 million DOI records already registered. These members and the Crossref infrastructure are supported by five Crossref employees based in just two countries—the US and the UK.&lt;/p>
&lt;p>In 2005, the majority of our members are based in North America, Northern Europe and Western Europe, and they are mostly publishers or societies. &lt;a href="https://www.crossref.org/community/sponsors/">Our sponsor program&lt;/a> doesn’t yet exist, so all members pay a membership fee directly to Crossref. Our membership fee structure is the same as it is today—we have tiered membership fees so our members can contribute to our infrastructure based on their capacity to pay. At this point, half of our members are eligible for our lowest fee tier.&lt;/p>
&lt;h3 id="2005-at-a-glance">2005 at a glance&lt;/h3>
&lt;ul>
&lt;li>318 members from 31 countries.&lt;/li>
&lt;li>18 million DOI records registered.&lt;/li>
&lt;li>Supported by five Crossref employees based in two countries - the US and the UK.&lt;/li>
&lt;li>The majority (89%) are based in North America or Northern &amp;amp; Western Europe.&lt;/li>
&lt;li>Half are eligible for our lowest fee tier.&lt;/li>
&lt;li>Mostly societies (40%) and publishers (33%).&lt;/li>
&lt;/ul>
&lt;h2 id="2010">2010&lt;/h2>
&lt;p>Let’s move on by five years to 2010. By this stage, Crossref membership had grown to 1101 members from 69 countries, and these members have now registered 44 million DOI records. They are now supported by 14 Crossref employees, still all located in either the US or the UK.&lt;/p>
&lt;p>We’re starting to see some changes in where our members are based. You’ll remember that back in 2005, 89% of Crossref members were based in North America, Northern Europe or Western Europe. By 2010, that percentage has dropped to 63%, and we&amp;rsquo;re seeing the number of members based in Asia starting to grow. In 2005, only 4% of our members were based in Asia, but by 2010, 18% of our members are based there, with 93 members in the Republic of Korea alone.&lt;/p>
&lt;p>By 2010, the percentage of members who are eligible for our lowest fee tier has grown to 78%, so we are seeing smaller and less well-funded organisations starting to join. The types of organisations joining hasn&amp;rsquo;t changed significantly—members are still mostly societies and publishers. However, we are starting to see universities and scholar-led organisations beginning to join.&lt;/p>
&lt;h3 id="2010-at-a-glance">2010 at a glance&lt;/h3>
&lt;ul>
&lt;li>1,101 members from 69 countries.&lt;/li>
&lt;li>44 million DOI records registered.&lt;/li>
&lt;li>Supported by 14 Crossref employees based in two countries - the US and the UK.&lt;/li>
&lt;li>Growth of members based in Asia (18%).&lt;/li>
&lt;li>Smaller, less well-funded organisations starting to join - 78% eligible for our lowest fee tier.&lt;/li>
&lt;li>Still mostly societies (37%) and publishers (28%), but universities and scholar-led members starting to emerge (23%).&lt;/li>
&lt;/ul>
&lt;h2 id="2015">2015&lt;/h2>
&lt;p>Jumping ahead another five years to 2015, we see Crossref membership has grown to over 3,000 members from 93 countries, with registered DOI records exceeding 77 million. These members and the Crossref infrastructure are supported by 28 employees, still all based in the US and UK.&lt;/p>
&lt;p>Membership in Asia has now really taken off, and Asian organisations now account for 38% of all Crossref members. We also see membership in Latin America emerging, representing 12% of our membership. We have members from 12 different countries in Latin America in 2015, but the most significant number are from Brazil, with 274 members.&lt;/p>
&lt;p>Our formal Sponsor program started to emerge from 2012 onwards. Our Sponsor program supports members who are otherwise eligible for our lowest fee tier and provides financial, technical and language support to organisations that would otherwise face barriers to membership. By 2015, we have 26 sponsors in 14 countries, and 20% of all members are working with us through a Sponsor. This is one of the drivers behind smaller, less well-funded members joining Crossref. We really see a leap here in 2015 with over 90% of members now eligible for our lowest fee tier.&lt;/p>
&lt;p>Around 2015, we also begin to see an interesting shift in the types of organisations that are becoming members. Increasingly, our new members are university-based, and that type of member organisation has overtaken the publisher group in number for the first time. However, societies still make up the largest number of members.&lt;/p>
&lt;h3 id="2015-at-a-glance">2015 at a glance&lt;/h3>
&lt;ul>
&lt;li>3,134 members from 93 countries.&lt;/li>
&lt;li>77 million DOI records.&lt;/li>
&lt;li>Supported by 28 Crossref employees based in two countries - US and UK.&lt;/li>
&lt;li>Growth in Asia (38%) and members in Latin America (12%) starting to emerge.&lt;/li>
&lt;li>Leap in smaller, less well-funded members - 92% eligible for the lowest fee tier.&lt;/li>
&lt;li>Sponsor program emerges - 26 sponsors in 14 countries.&lt;/li>
&lt;li>Rise of university and scholar-led members (29%) - overtaking publishers (21%). Societies (31%) are still the largest group.&lt;/li>
&lt;/ul>
&lt;h2 id="2020">2020&lt;/h2>
&lt;p>Can you believe we’re already in 2020? Crossref now has almost 12,000 members in 133 countries, with registered DOI records totalling over 120 million! These members and the Crossref infrastructure are now supported by 43 employees across five countries, with Ireland, Germany, and France added to our staff locations.&lt;/p>
&lt;p>Almost half of our members are based in Asia at this time, driven by growth from Indonesia, where we have 1681 members in 2020. Our sponsor program now contains 77 sponsors across 32 countries, including our first sponsor in North Africa.&lt;/p>
&lt;p>We can now really see how membership is weighted towards smaller, less well-funded organisations: 97% of members are eligible for the lowest fee tier, and 57% choose to work with a sponsor.&lt;/p>
&lt;p>By 2020, we also see a fundamental change in the types of organisations that are Crossref members. Societies no longer account for the largest share of our members, with both universities and publishers overtaking them. In 2016, we updated our schema to enable members to register records for preprints (and connect them to an article where relevant). By 2020, 65 members are registering preprints, and many preprint repositories have already become members.&lt;/p>
&lt;h3 id="2020-at-a-glance">2020 at a glance&lt;/h3>
&lt;ul>
&lt;li>11,976 members from 133 countries.&lt;/li>
&lt;li>120 million DOI records.&lt;/li>
&lt;li>Supported by 43 Crossref employees in five countries - France, Germany, Ireland, the UK, and the US.&lt;/li>
&lt;li>46% of members based in Asia.&lt;/li>
&lt;li>77 sponsors in 32 countries, first sponsor in N Africa.&lt;/li>
&lt;li>Membership heavily weighted to smaller, less well-funded organisations - 97% eligible for the lowest fee tier and 57% working through a sponsor.&lt;/li>
&lt;li>Universities and scholar-led are now the largest group (37%), followed by publishers (29%) and societies (24%).&lt;/li>
&lt;/ul>
&lt;h2 id="2025">2025&lt;/h2>
&lt;p>And so we find ourselves back in the present day.&lt;/p>
&lt;p>With such steady growth, it’s pretty easy to predict almost exactly how many members we will have by 31st December 2025. By year-end, we would expect to have 23,800 members in 164 countries, with registered DOI records totalling around 177 million. With recent hiring, these members and our infrastructure will be supported by 52 Crossref employees in 14 different countries.&lt;/p>
&lt;p>Member organisations are now a real mix, with museums, hospitals, botanic gardens, banks, and many more joining. The largest proportion remains those at a university or scholar-led (35%), but interestingly, we see the percentage who consider themselves to be societies starting to fall (19%) and publishers starting to grow again (29%).&lt;/p>
&lt;p>And we see the arrival of a new type of member - since the launch of the &lt;a href="https://www.crossref.org/services/grant-linking-system/" target="_blank">Grant Linking System&lt;/a> in 2019, we now see Research Funders joining Crossref in order to register identifiers for individual grants. These grant identifiers can then be included in the metadata for published content to uniquely identify the funding source, providing context and trust signals for the content, and fleshing out the Research Nexus. We currently have 45 funders who have registered over &lt;a href="https://doi.org/10.64000/607z6-1nh09" target="_blank">175,000 grant records&lt;/a>.&lt;/p>
&lt;p>By 2025 we have 129 sponsors in 51 countries - including our first sponsors in East and West Africa who joined in 2024 and 2025 respectively. Half of all members are now based in Asia. 98% of members are now eligible for our lowest fee tier and 57% are working with us through a sponsor.&lt;/p>
&lt;p>In 2023, we launched our Global Equitable Membership (GEM) program, which offers relief from any membership and content registration fees for organisations in the least economically advantaged countries in the world. We use the World Bank’s International Development Association (IDA) list as our data source for countries to include in the program. When we launched the program, 187 existing members moved under the program. Since the program’s focus is to enable participation for those who would otherwise find Crossref unaffordable, we are happy that we now have 583 organisational members in the &lt;a href="https://www.crossref.org/gem/">GEM Program&lt;/a>, showing the growth in participation from lower-income nations. Most members in the GEM Program are based in Southern Asia (48%) and Sub-Saharan Africa (33%).&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/Location-GEM-members.png"
alt="pie chart showing location of GEM members: Southern Asia (48.8%), Western Asia (4.8%), Northern Africa (1.9%), Sub Saharan Africa (33%), Latin America and the Caribbean (1.7%), Central Asia (4.5%), South Eastern Asia (3.8%)" width="75%">
&lt;/figure>
&lt;/div>
&lt;h3 id="november-2025-at-a-glance">November 2025 at a glance&lt;/h3>
&lt;ul>
&lt;li>23,622 members in 164 countries.&lt;/li>
&lt;li>175 million DOI records.&lt;/li>
&lt;li>Supported by 52 Crossref employees from 14 countries - Armenia, Austria, Canada, Ecuador, Germany, Ghana, Hong Kong, Ireland, Kenya, the Netherlands, Nigeria, Spain, the UK, and the US.&lt;/li>
&lt;li>51% of members are based in Asia.&lt;/li>
&lt;li>129 sponsors in 51 countries - first sponsors in East and West Africa.&lt;/li>
&lt;li>98% of members are eligible for the lowest fee tier, and 57% working through a sponsor.&lt;/li>
&lt;li>Real mix of organisation types - universities and scholar-led (35%), publishers (29%), societies (19%), but also research funders, museums, pharmaceutical companies, news agencies, and more!&lt;/li>
&lt;/ul>
&lt;h2 id="changes-over-the-years">Changes over the years&lt;/h2>
&lt;p>Here are some of that data over time, depicted in charts.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/Member_numbers.png"
alt="line graph showing growth of member numbers from 2005 (300 members) to 2025 (23,000 members)" width="75%">
&lt;/figure>
&lt;/div>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/Country_counts.png"
alt="line graph showing growth of countries that our member organizations come from, from 2005 (31 countries) to 2025 (164 countries)" width="75%">
&lt;/figure>
&lt;/div>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/Organization_types.png"
alt="line graph showing changes in the types of organizations that our members represent between 2005 and 2025." width="75%">
&lt;/figure>
&lt;/div>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/Members-per-staff.png"
alt="bar chart showing the number of Crossref members per Crossref staff member from 2005 (63), 2010 (78), 2015 (112), 2020 (278) and 2025 (449)." width="75%">
&lt;/figure>
&lt;/div>
&lt;h2 id="2026-and-beyond">2026 and beyond&lt;/h2>
&lt;p>As you can see from our adventure through space and time, the types of organisations that work with Crossref have changed significantly over the years as the scholarly communications world has evolved. Our members now tend to be university-based research-performing organisations or scholar-led journals, based in Asia, and with low or zero publishing revenues (and volumes).&lt;/p>
&lt;p>To meet our mission of a truly global and connected research ecosystem, it is essential to ensure that participation in Crossref and all our services and metadata is accessible to everyone involved in documenting scholarly progress.&lt;/p>
&lt;p>We want to ensure that access to the Crossref infrastructure is equitable, so we are making two key changes in 2026: we’re extending eligibility for the GEM Program (more to follow), and we are introducing a new, lower-fee tier as an outcome of the RCFS projects &lt;a href="https://doi.org/10.64000/cvvj8-tax10" target="_blank">more here&lt;/a>.&lt;/p>
&lt;p>We’re excited to see how our members will change as we head into our next 25 years—we hope you’ll continue with us on our journey and welcome all kinds of new members to the expansive and vibrant Crossref community.&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>Technically, this is only an adventure through time. At the time of writing, we have no members based in space. Unless you count the European Space Agency, NASA, et al.&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Crossref at the Frankfurt Book Fair 2025</title><link>https://www.crossref.org/blog/crossref-at-the-frankfurt-book-fair-2025/</link><pubDate>Wed, 19 Nov 2025 00:00:00 +0000</pubDate><author>Helena Cousijn</author><guid>https://www.crossref.org/blog/crossref-at-the-frankfurt-book-fair-2025/</guid><description>&lt;p>The Frankfurt Book Fair is the largest book fair in the world, and therefore a key event on our calendar. Held annually in Frankfurt, Germany, the 77th Frankfurt Book Fair (October 15–19, 2025) saw 118,000 trade visitors and 120,000 private visitors from 131 countries. The Crossref booth was located, as usual, in Hall 4.0 where all the stands with information about academic publishing can be found. Four Crossref colleagues attended the Book Fair this year, and in this blog post, you can read more about their meetings, experiences, and plans. &lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/fbm25booth.png"
alt="photo of table with giveaways" width="75%">
&lt;/figure>
&lt;/div>
&lt;h3 id="first-timer-fun-at-the-frankfurt-fair---helena">First timer fun at the Frankfurt Fair - Helena&lt;/h3>
&lt;p>Even though I’ve been working in scholarly comms for over 10 years, I’d never had a chance to visit the Frankfurt Book Fair. I was therefore really excited to have an opportunity to attend this year, and it didn’t disappoint! I arrived on Monday, October 13, in time for the STM dinner, which proved a great opportunity to meet with Crossref members and collaborators. On Tuesday, I attended the &lt;a href="https://stm-assoc.org/events/stm-dinner-conference-2025-frankfurt/" target="_blank">STM conference&lt;/a> with the exciting theme of ‘The role of publishers in science diplomacy’. I think my favorite part of the day was the last panel, where the panelists realised that even though they represent different groups, in the end, they all have the same goals and are all working towards better science and dissemination. &lt;/p>
&lt;p>On Wednesday, it was time to head over to our booth, where we prepared for the interesting conversations ahead. My meetings were mainly focused on collaborations in the area of &lt;a href="https://www.crossref.org/community/special-programs/research-integrity/">research integrity&lt;/a>, as Crossref plans to run pilots with potential partners next year. In-person meetings at the fair were a good opportunity to discuss in more detail which kinds of integrity checks could be useful to our members. I also had several meetings with organizations functioning as &lt;a href="https://www.crossref.org/community/service-providers/">Service Providers&lt;/a> –– depositing content on behalf of members –– who are eagerly awaiting the launch of our renewed Service Providers program next year. In these conversations, we shared our thinking about requirements for Crossref Service Providers and got input from organizations with experience serving our member community. Overall, it was a great opportunity to see members and collaborators in person, and I’ve already put the 2026 Frankfurt Book Fair in my calendar!&lt;/p>
&lt;h3 id="an-exciting-comeback---maryna">An exciting comeback - Maryna&lt;/h3>
&lt;p>If last year, I was a debutante at the Frankfurt Book Fair, 2025 marked an exciting comeback. It&amp;rsquo;s always a pleasure to spend time chatting with people you usually only meet through email or Zoom. Working remotely as part of a global team is something I truly value about Crossref, but it also makes those in-person moments even more special. You get to solve issues that have been sitting on your to-do list over lunch, brainstorm ideas while walking to the venue, get immediate advice in a meeting—and, of course, talk about dogs over dinner.&lt;/p>
&lt;p>Frankfurt was busy but well organised. Our booth was lively with a mix of planned and spontaneous meetings. It was nice to reconnect with members and sponsors I&amp;rsquo;ve worked with over the years. We even gave an early look at the new &lt;a href="https://www.crossref.org/members/prep/">Participation Reports&lt;/a> before the official release (what a thrill!). There were good conversations about &lt;a href="https://www.crossref.org/blog/deprecating-co-access-crossref-plans-and-timelines/">deprecating co-access&lt;/a>, the importance of &lt;a href="https://www.crossref.org/documentation/register-maintain-records/creating-and-managing-dois/transferring-responsibility-for-dois/">title transfers&lt;/a>, and how we can keep improving the member experience. One highlight: I spoke with a prospective member about our membership model and fee structure, and they joined the following week! Their account is already active, with a prefix assigned, which was great to see.&lt;/p>
&lt;p>Another key topic was the importance of &lt;a href="https://ror.org/" target="_blank">ROR IDs&lt;/a>. I talked with several publishers about how they could be implemented across other systems. At one point, I spotted an issue with unregistered DOIs and was able to fix it on the spot by finalising a title transfer—we&amp;rsquo;d had permission but never received the formal request—so it was satisfying to close that loop in real time.&lt;/p>
&lt;p>Being a relatively small team serving a global membership of more than 23,000 and growing, it&amp;rsquo;s not possible to meet with every member face-to-face to respond to every question. Our team works hard to respond to all queries by email, but it&amp;rsquo;s undeniably faster and more productive in person. That&amp;rsquo;s why we keep returning to the Frankfurt Book Fair year after year—you can definitely count on seeing us again next year! &lt;/p>
&lt;h3 id="third-time-at-bat---luis">Third time at bat - Luis&lt;/h3>
&lt;p>The Frankfurt Book Fair is always an incredible opportunity to connect with our community. We come prepared with highlights of the year, plans for developments and upcoming releases, and remind the members we meet to participate and vote in the annual elections. But most of what we learn happens during the informal moments––meetings, drop-ins, and chats over coffee and tea––where people discuss what they’re working on, trends, and interests of the scholarly and publishing community. &lt;/p>
&lt;p>This year, some of those conversations included meeting someone working with groups from Egypt and the UAE who are developing tools around our metadata. They wanted to talk through REST API use, recent Crossref updates, and how retraction metadata could fit into their systems. Another person opened their participation report with us and were surprised to see their metadata showing 0% despite the team believing they were sending complete metadata, which led to a discussion about getting their internal workflows running again. &lt;/p>
&lt;p>Booth days always fly by, but they’re deeply informative and insightful for teams that participate in person, as we can “cross-check” (pun intended) how our different support mechanisms help the community and how well we&amp;rsquo;re delivering our communications. There is a good mix of problem-solving and catching-up; often, we see members who prepare a list of questions because they find it easier to sit and navigate through them with our support or membership colleagues. Sometimes it’s about refreshing their understanding of what Crossref is and what we do, especially during team changes. We also spoke with a publisher &lt;a href="https://www.crossref.org/services/crossmark">preparing to adopt Crossmark&lt;/a>. They wanted to check they were handling updates and relationships correctly, and mentioned that increasing transparency is becoming a priority for them. Someone else, working closely with a repository, asked about using the REST API or Metadata Plus to enrich their records. &lt;/p>
&lt;p>A few visitors simply needed clarity––one was pleased to learn they could register reports and datasets after being told otherwise. Another visitor who registers a small number of book DOIs each year asked whether &lt;a href="https://www.crossref.org/documentation/register-maintain-records/web-deposit-form/">the Web Deposit Form&lt;/a> was still the best fit. We walked through the Record Registration Form together, and its new editing features helped them plan for upcoming changes.&lt;/p>
&lt;p>Personally, I enjoy seeing the cultural and organisational diversity of existing and potential Crossref members, ambassadors, sponsors, allies and colleagues from all over the world at our booth. If you have the opportunity to attend the Book Fair next year, please visit our booth and say hello!&lt;/p>
&lt;h3 id="this-years-frankfurt-veteran---paul">This year’s Frankfurt Veteran - Paul&lt;/h3>
&lt;p>I think this is my 5th (?) Frankfurt book fair,  and each year I come away thinking how much I appreciate the opportunity to speak with our members face to face, and I get to see and hear the impact that Crossref has, which is always such a pleasure.&lt;/p>
&lt;p>This year, there were only four of us in attendance, and it felt busier than ever. We had a lot of pre-booked meetings at our wonderfully designed booth again (thanks to the amazing work of our colleague Rosa) but we also had lots of ad-hoc meetings, where members came up to say “hello”, “thank you” or ask about that really knotty, niche problem that they have, which they are not sure how to explain over email. From a technical support perspective, this is great, as we can go through these issues and get a resolution––or a solid background––without the delay and confusion of long email threads. I also worked with a member who got their IT department to send over a file there and then for us to work through and try to navigate a difficult question regarding reference matching and whether the &lt;a href="https://doi.crossref.org/simpleTextQuery" target="_blank">simple text query form&lt;/a> worked using an API, which others could use. These were just two examples of many in which it was much easier to sit down and work through issues directly at the fair.&lt;/p>
&lt;p>So I would always say that if you are at the Frankfurt book fair, and you have one of these issues then it is a great opportunity to come by, say hello and work through it with us. We will send out a reminder before the fair in 2026 to get any meetings booked, or just come find us at the fair.&lt;/p>
&lt;p>A highlight for me this year was also showing some of our members our &lt;a href="https://www.crossref.org/members/prep/" target="_blank">new Participation Report&lt;/a>. It’s had a visual update as well as some new functionality: you can download a gap report that lists DOI numbers of records that are missing the metadata element you choose, making it easier to identify and update missing metadata. I always like attending the Frankfurt Book Fair and so might be there next year. It&amp;rsquo;s an important opportunity for all Crossref colleagues to engage and meet our members––many for the first time. &lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/fbm25-team.png"
alt="photo of Paul, Maryna, Luis, and Helena at the table at the booth" width="100%">
&lt;/figure>
&lt;/div>
&lt;h3 id="next-year">Next year&lt;/h3>
&lt;p>Feeling inspired after all the great meetings and conversations we had this year, we immediately started planning for next year! We’ll definitely be in Frankfurt in 2026, where you can find our team at the Crossref booth. We’re also planning to organize another roundtable on the Monday before the fair, so put October 5-9, 2026, in your calendars and stay tuned for more details.&lt;/p></description></item><item><title>The sunset is on the horizon for Metadata Manager. What's next?</title><link>https://www.crossref.org/blog/the-sunset-is-on-the-horizon-for-metadata-manager.-whats-next/</link><pubDate>Thu, 06 Nov 2025 00:00:00 +0000</pubDate><author>Lena Stoll</author><guid>https://www.crossref.org/blog/the-sunset-is-on-the-horizon-for-metadata-manager.-whats-next/</guid><description>&lt;p>TL;DR. Metadata Manager will be retired at the end of 2025. Over the past four years, we have been developing a new helper tool to replace it, and that tool has now reached a stage of maturity that means we will be able to switch off Metadata Manager by the end of the year.&lt;/p>
&lt;h2 id="how-did-we-get-here">How did we get here?&lt;/h2>
&lt;p>In 2021, we &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">said&lt;/a> that we would be retiring the deprecated &lt;a href="https://www.crossref.org/documentation/register-maintain-records/metadata-manager/">Metadata Manager&lt;/a> as soon as we can offer members a suitable replacement for registering their journal content. So this news has been a long time coming - Metadata Manager has been very challenging for us to support, and we have found it impossible to develop additional features. However, we did not want to take the final step of switching off the interface until we were able to offer a suitable replacement for members who rely on manual helper tools to register their journal content.&lt;/p>
&lt;p>That replacement, our new &lt;a href="https://www.crossref.org/documentation/register-maintain-records/record-registration-form/">record registration form&lt;/a>, has now been used by many members for over a year to register their journal content. The feedback so far has been positive, and we have been able to add functionality to the tool at a pace that we are happy with.&lt;/p>
&lt;p>In July 2025, we contacted those members who are still using Metadata Manager to let them know that the tool will no longer be available after December 2025. So if you are affected by this news, you were probably already aware of it. But we wanted to go into a little more detail on the sunsetting of Metadata Manager, why we are doing it, and what’s next for Crossref’s content registration helper tools.&lt;/p>
&lt;h2 id="what-has-happened-since-2021">What has happened since 2021?&lt;/h2>
&lt;p>We have been developing the record registration form ever since that announcement in 2021. It began its life as a helper tool for registering grant records, but we knew we wanted to expand it to cover journal articles and other record types as soon as we could.&lt;/p>
&lt;p>To see whether the concept behind the grants form could be applied to journal content, we first built an initial prototype and tested it with a number of Crossref ambassadors and volunteers. We wanted to ensure that the tool was intuitive to use, and to understand what functionality it would need to support for it to be truly useful to our members. Following some iteration on the invaluable feedback we received from our testers, we finally released the tool to production in September 2024 and began encouraging members to use it for their real-life article deposits.&lt;/p>
&lt;p>We have been continuously adding new functionality since then, from additional fields for registering richer metadata to a feature that allows members to edit their articles’ metadata without having to re-enter everything into the form.&lt;/p>
&lt;p>Now, about two months from the target date for retiring Metadata Manager, the record registration form is used by members to register about 200 articles per day, while Metadata Manager still sees about double that volume of submissions. So we have some way left to go.&lt;/p>
&lt;h2 id="why-is-now-the-right-time-to-retire-metadata-manager">Why is now the right time to retire Metadata Manager?&lt;/h2>
&lt;p>2025 has been a year of addressing technical debt for Crossref. My colleague Sara wrote about this co-ordinated push towards modernising our system in her post about our &lt;a href="https://doi.org/10.64000/wd6rx-vpq73" target="_blank">cloud migration&lt;/a> in the summer.&lt;/p>
&lt;p>Having the long-awaited replacement for Metadata Manager in place will allow us to free up the resources that have been tied up for years by troubleshooting Metadata Manager, in terms of both technology and user support, so that we can focus on projects and initiatives that align with our longer-term &lt;a href="https://www.crossref.org/strategy/">strategy&lt;/a>.&lt;/p>
&lt;h2 id="how-will-we-avoid-the-new-tool-developing-the-same-problems-as-metadata-manager">How will we avoid the new tool developing the same problems as Metadata Manager?&lt;/h2>
&lt;p>As stated above, Metadata Manager has caused us many issues and headaches in different ways - but we have also learned a lot from dealing with these problems. As Bryan Vickery &lt;a href="https://doi.org/10.64000/1a52b-7pf27" target="_blank">wrote in 2020&lt;/a>, Metadata Manager is “not flexible enough to easily add other record types, like books/book chapters, or to include any changes we may make to our input schema.” To address this, we built the record registration form in a &lt;a href="https://doi.org/10.64000/cvq2e-q8t24" target="_blank">schema-driven way&lt;/a>, which makes it adaptable to any future schema changes. It also means that we can spin up prototypes of new forms for additional record types quite quickly.&lt;/p>
&lt;p>So while Metadata Manager was custom-built in a way that could only ever work for journal content, the record registration form already supports two record types and will support more in future. This is key for our goal of building a complete &lt;a href="https://www.crossref.org/documentation/research-nexus/">research nexus&lt;/a>, which extends far beyond journal content, and even beyond “content” as such (did someone say &lt;a href="https://www.crossref.org/services/grant-linking-system/">grants&lt;/a>?).&lt;/p>
&lt;h2 id="what-happens-next">What happens next?&lt;/h2>
&lt;ul>
&lt;li>Metadata Manager will no longer be available from January 2026.&lt;/li>
&lt;li>Starting next year, if you attempt to access Metadata Manager at &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">https://www.crossref.org/metadatamanager/&lt;/a>, you will be redirected to a deprecation note on &lt;a href="https://www.crossref.org/deprecated/" target="_blank">https://www.crossref.org/deprecated/&lt;/a> which will link out to the new tool.&lt;/li>
&lt;/ul>
&lt;h2 id="what-options-do-i-have-for-registering-my-journal-content-going-forward">What options do I have for registering my journal content going forward?&lt;/h2>
&lt;p>If your organisation still uses Metadata Manager to register metadata for your journal articles, now is a good time to begin familiarising yourself with the &lt;a href="https://www.crossref.org/documentation/register-maintain-records/choose-content-registration-method/">alternatives&lt;/a> available to you from 2026 forward - these include, but are not limited to, the new record registration form.&lt;/p>
&lt;h3 id="if-your-journal-has-an-issn">If your journal has an ISSN&lt;/h3>
&lt;p>We recommend you begin using the record registration form as soon as possible. Simply go to &lt;a href="https://manage.crossref.org/records" target="_blank">https://manage.crossref.org/records&lt;/a> and sign in with your Crossref account credentials to register a journal article. You can also see a list of all the journal article records you have previously registered using our manual helper tools at &lt;a href="https://manage.crossref.org/records/edit" target="_blank">https://manage.crossref.org/records/edit&lt;/a> and edit their metadata using the form.&lt;/p>
&lt;p>To help you make the switch from Metadata Manager, we will be hosting an interactive webinar on 13 November about how to transition to the new tool. &lt;a href="https://crossref.zoom.us/webinar/register/7317600554084/WN_WF1Ykk-4SKeih4ucpTeesA" target="_blank">Register here&lt;/a> or look out for the recording, which will be shared in our &lt;a href="https://www.crossref.org/events/archive/" target="_blank">events archive&lt;/a>.&lt;/p>
&lt;h3 id="if-your-journal-does-not-have-an-issn">If your journal does not have an ISSN&lt;/h3>
&lt;p>The record registration form currently only supports ISSNs as journal identifiers. Title-level and volume/issue-level DOIs, which are at the core of how Metadata Manager handles journal metadata, have been the cause for some of the problems we have had over the years with that particular tool. Also, Crossref DOIs have always been intended primarily as citation identifiers, and entire journals/volumes/issues are very rarely cited. For that reason, we built the Record Registration Form such that it doesn&amp;rsquo;t support registering or using journal-level DOIs.&lt;/p>
&lt;p>With that being said, if you do not (yet) have an ISSN for your journal for whatever reason, you can use our &lt;a href="https://www.crossref.org/documentation/register-maintain-records/web-deposit-form/">web deposit form&lt;/a> to register your articles with journal DOI. If you do obtain an ISSN for your title later on, you can then simply begin using the record registration form from that point onward.&lt;/p>
&lt;h2 id="how-will-the-new-tool-continue-to-be-developed">How will the new tool continue to be developed?&lt;/h2>
&lt;p>We will continue to work with our members and community to develop additional functionalities for the journal article form. Currently we are working on allowing &lt;a href="https://www.crossref.org/documentation/principles-practices/best-practices/relationships/">relationships metadata&lt;/a> to be registered using the form.&lt;/p>
&lt;p>Ultimately, the goal is for the record registration form to become the one-stop shop for members who manually register and update their metadata. To this end, we are working on expanding the tool to cover additional record types - we have recently developed a prototype for registering books and chapters, and we will be looking to test this in the coming months with volunteers who are currently registering their book metadata via other avenues such as the web deposit form.&lt;/p>
&lt;p>If you would like to support these efforts, or you have begun using the new tool and would like to share your feedback, come join the discussion in our &lt;a href="https://community.crossref.org/t/feedback-on-new-helper-tool/1721" target="_blank">community forum&lt;/a>.&lt;/p>
&lt;h3 id="references">References&lt;/h3>
&lt;ol>
&lt;li>Bowman, S. (2021). Next steps for Content Registration. Crossref. &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">https://doi.org/10.64000/30vzx-r5x16&lt;/a>&lt;/li>
&lt;li>Bowman, S. (2025). We’ve migrated to the cloud; we hope you didn’t notice (but maybe you did). Crossref. &lt;a href="https://doi.org/10.64000/wd6rx-vpq73" target="_blank">https://doi.org/10.64000/wd6rx-vpq73&lt;/a>&lt;/li>
&lt;li>Vale, P. (2022). Forming new relationships: Contributing to Open source. Crossref. &lt;a href="https://doi.org/10.64000/cvq2e-q8t24" target="_blank">https://doi.org/10.64000/cvq2e-q8t24&lt;/a>&lt;/li>
&lt;/ol></description></item><item><title>Announcing changes to REST API rate limits</title><link>https://www.crossref.org/blog/announcing-changes-to-rest-api-rate-limits/</link><pubDate>Wed, 05 Nov 2025 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/announcing-changes-to-rest-api-rate-limits/</guid><description>&lt;p>Our &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/">REST API&lt;/a> makes all of the metadata we hold publicly available. It receives the majority of our API traffic, with around 1 billion hits per month. It’s one of the key ways that we fulfil our mission to make research objects easy to find, cite, link, assess, and reuse. From 1 December 2025, we will be revising the rate limits for the public and polite pools of the REST API to ensure that we can maintain a stable and reliable system, and that metadata is freely available to everyone.&lt;/p>
&lt;p>We haven’t changed the rate limits since the REST API was launched in 2013. In the past five years, the number of requests to the REST API has tripled and the number of metadata records has increased by a third, from 120 million to around 180 million. This means an increase in the resources needed to run it, and we’ve seen periods of instability where we haven’t been able to keep the API available for all users. We have decided that it is the right time to revisit rate limits to check that they’re in line with what our technology can provide and what our community needs. As a result, we will apply the following for the public and polite pools:&lt;/p>
&lt;p>Public pool:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Request type&lt;/th>
&lt;th>Rate limit&lt;/th>
&lt;th>Concurrency limit&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Single record&lt;/td>
&lt;td>5&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>List of records (queries, filters, etc.)&lt;/td>
&lt;td>1&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Polite pool:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Request type&lt;/th>
&lt;th>Rate limit&lt;/th>
&lt;th>Concurrency limit&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Single DOI record&lt;/td>
&lt;td>10&lt;/td>
&lt;td>3&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>List of records (queries, filters, etc.)&lt;/td>
&lt;td>3&lt;/td>
&lt;td>3&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>The rate limit is the number of total requests that can be made per second. The concurrency limit is how many requests can be running at the same time. This means that for longer-running requests you may need to wait for previous requests to finish before you can make a new one.&lt;/p>
&lt;p>Here are some examples of single records requests:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://api.crossref.org/v1/works/10.1002/cphy.cp010129" target="_blank">https://api.crossref.org/v1/works/10.1002/cphy.cp010129&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/v1/journals/0266-612X&amp;amp;mailto=my@email.com" target="_blank">https://api.crossref.org/v1/journals/0266-612X&amp;mailto=my@email.com&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>The second case here will be directed to the polite pool because an email is included using the ‘mailto’ parameter. And here are examples of requests that return lists of records:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://api.crossref.org/v1/works?filter=from-created-date:2025-10-21T16:20,until-created-date:2025-10-21T17:00" target="_blank">https://api.crossref.org/v1/works?filter=from-created-date:2025-10-21T16:20,until-created-date:2025-10-21T17:00&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/v1/members/13/works&amp;amp;mailto=my@email.com" target="_blank">https://api.crossref.org/v1/members/13/works&amp;mailto=my@email.com&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/v1/works?query.bibliographic=linear&amp;#43;dichroism&amp;amp;mailto=my@email.com" target="_blank">https://api.crossref.org/v1/works?query.bibliographic=linear+dichroism&amp;mailto=my@email.com&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>The second and third examples here will use the polite pool.&lt;/p>
&lt;p>Our guiding principle in making these changes is to keep all of the metadata available to everyone, all of the time. These changes to rate limits won’t restrict current users from accessing the metadata they want to retrieve, but it will make it easier for us to maintain the system now and in the future.&lt;/p>
&lt;h2 id="which-use-cases-do-we-support">Which use cases do we support?&lt;/h2>
&lt;p>Our metadata has a broad range of applications. If you’re someone who uses the REST API, we’re glad that you are part of our community! Our mission includes making it easier to find, reuse, and assess scholarly research outputs. By using metadata, you’re helping us to fulfil that goal.&lt;/p>
&lt;p>The main uses of the REST API fit into several categories. The new rate limits will continue to support these, among many others:&lt;/p>
&lt;ul>
&lt;li>I have some metadata, what is the DOI?&lt;/li>
&lt;li>I have a DOI, what is its metadata?&lt;/li>
&lt;li>I want all of the metadata, just give me everything.&lt;/li>
&lt;li>Research on a specific topic or subset of metadata, often refreshing the results every few weeks or months.&lt;/li>
&lt;/ul>
&lt;p>Rate limits can encourage responsible usage. The majority of API users make requests at a low rate and will not need to make any changes, however a few send spikes of large numbers of requests in a short space of time, sometimes making it difficult for others to access the service. These can be smoothed out by lower rate limits. Complex requests that search across large numbers of items put more pressure on our systems than requests for a single content item, so we have decided to set different rate limits for different types of request.&lt;/p>
&lt;h2 id="who-will-be-affected">Who will be affected?&lt;/h2>
&lt;p>We estimate that the changes might affect around 40 users per week across the public and polite pools, and this is only for some of their requests. In all of the cases we’ve seen, the rate of requests could be slowed down and users would still be able to get the same results. In other words, the aim of these changes is to make the load on the API more predictable, not to reduce the total number of requests or amount of metadata transferred. No changes are being made to the &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus&lt;/a> service or other APIs, such as the XML API and OAI-PMH endpoint.&lt;/p>
&lt;h2 id="do-i-need-to-change-how-i-use-the-api">Do I need to change how I use the API?&lt;/h2>
&lt;p>If you’re reading this, thank you! It’s clear that you want to be a considerate user of our services. Almost all users can continue to use the REST API in exactly the same way, you won’t need to change anything. Here is some general advice that will help you make the most of the service and ensure that you won’t encounter issues.&lt;/p>
&lt;ol>
&lt;li>Use a mailto parameter. This gives you access to the polite pool meaning higher rate limits and meaning we can get in touch with you if needed. We’ll only use your address to contact you about your API requests.&lt;/li>
&lt;li>Check the HTTP response status for your requests. This is always good practice and can help you identify malformed requests and where you reach rate limits.&lt;/li>
&lt;li>Cache results to avoid repeatedly making the same requests. Most records don’t change on a regular basis. How often you update the cache will depend on what you are interested in, but most metadata fields rarely change.&lt;/li>
&lt;li>If you are making a very high volume of requests or have very complex analysis to carry out, consider downloading the &lt;a href="https://www.crossref.org/learning/public-data-file/">public data file&lt;/a> which is made available once a year and contains all of our metadata. You can update it with recent additions using the REST API.&lt;/li>
&lt;li>If you are relying on our metadata in a production service, &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus&lt;/a> can provide more stability, support, and access to monthly snapshots of our entire database.&lt;/li>
&lt;/ol>
&lt;p>We have more &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-the-crossref-rest-api/">tips and tricks&lt;/a> for the REST API in our documentation. If you have questions, please join the conversation on our Community Forum.&lt;/p></description></item><item><title>Celebrating Noyam Journals’ Metadata Award</title><link>https://www.crossref.org/blog/celebrating-noyam-journals-metadata-award/</link><pubDate>Tue, 04 Nov 2025 00:00:00 +0000</pubDate><author>Johanssen Obanda</author><guid>https://www.crossref.org/blog/celebrating-noyam-journals-metadata-award/</guid><description>&lt;p>&lt;a href="https://noyam.org/" target="_blank">Noyam Journals&lt;/a>, based in Accra, Ghana, was recently recognised for the completeness of its metadata through the &lt;a href="https://www.crossref.org/blog/crossref-metadata-awards/" target="_blank">Crossref Metadata Award&lt;/a>, part of our 25th anniversary celebrations. Noyam was one of six publishers worldwide to receive the award and stood out as a leader among members of our &lt;a href="https://www.crossref.org/gem/" target="_blank">Global Equitable Membership&lt;/a> (GEM) Program.&lt;/p>
&lt;p>The GEM Program supports publishers and organisations in low- and middle-income countries to participate in the global scholarly community by reducing barriers to membership and services.&lt;/p>
&lt;p>Earlier this year, at our &lt;a href="https://www.crossref.org/blog/reflections-from-crossref-accra-2025-strengthening-open-science-and-partnerships-in-ghana/" target="_blank">Crossref Accra event&lt;/a>, representatives from Noyam spoke about how registering metadata with Crossref has expanded their readership worldwide. They also encouraged other publishers and institutions in Africa to utilise Crossref’s infrastructure to enhance the visibility and impact of their work.&lt;/p>
&lt;p>Following their award, we spoke with Naa Kai Amanor-Mfoafo from Noyam Journals about their approach to metadata quality. She shares her reflections in the Q&amp;amp;A below.&lt;/p>
&lt;h2 id="what-motivates-your-team-to-work-towards-high-quality-metadata">What motivates your team to work towards high-quality metadata?&lt;/h2>
&lt;p>Our commitment towards high-quality metadata stems from our organisational goal to promote the dissemination of usable knowledge by publishing innovative and high-quality research content. Over the last five years, registering our metadata with Crossref has strengtheed authors&amp;rsquo; trust as their institutions can verify quality through tools like Crossmark. For instance, many institutions use the Crossmark feature on our published articles to access the latest information about a scholarly article, including updates, corrections, or retractions.&lt;/p>
&lt;h2 id="do-you-have-a-strategy-for-complete-metadata">Do you have a strategy for complete metadata?&lt;/h2>
&lt;p>We prioritise inclusion of ORCID IDs, Abstracts, and References as these increase visibility of our articles. We also include Affiliations, Licenses, and Crossmark, and we use Similarity Check to help ensure research integrity.&lt;/p>
&lt;p>As part of our team structure, we have a dedicated staff member responsible for ensuring that every article is assigned a Crossref DOI on the same day it is published online. Our in-house system supports this process, allowing us to capture and register all the key metadata efficiently.&lt;/p>
&lt;h2 id="what-impact-of-good-metadata-can-you-see-on-your-organisation">What impact of good metadata can you see on your organisation?&lt;/h2>
&lt;p>Good metadata has made a real difference for our organisation. It has helped increase the visibility and discoverability of our journal articles, making it easier for researchers and readers around the world to find and cite our work. We’ve noticed more engagement with our publications since improving our metadata, which encourages us to keep strengthening the quality of the information we register.&lt;/p>
&lt;h2 id="have-you-encountered-any-challenges-in-curating-or-improving-your-metadata-and-how-did-you-address-those">Have you encountered any challenges in curating or improving your metadata, and how did you address those?&lt;/h2>
&lt;p>One major challenge we’ve faced is discovering errors in previously uploaded metadata, and we haven’t yet established a systematic process for correcting them. We’re currently working to improve our workflow to help ensure the correctness of our metadata to follow Crossref’s recommended best practices.&lt;/p>
&lt;h2 id="have-your-efforts-around-metadata-led-to-real-benefits-for-your-community">Have your efforts around metadata led to real benefits for your community?&lt;/h2>
&lt;p>Our authors appreciate the fact that their ORCID profiles are automatically updated with their published articles once they are assigned DOIs from Crossref. They are, of course, also enjoying increased visibility of our published articles globally.&lt;/p>
&lt;h2 id="looking-ahead-how-are-you-planning-to-build-on-your-metadata-quality">Looking ahead, how are you planning to build on your metadata quality?&lt;/h2>
&lt;p>We need to stay informed about developments at Crossref. Once in a while, we visit the Crossref website or participate in a webinar to stay informed. For example, a few months ago, we got to know that a new record registration form had been initiated for metadata uploads through the documentation section on the Crossref website.&lt;/p>
&lt;p>We advise others who are new to Crossref to focus on consistency. Ensure your organisational system includes staff dedicated to keeping your metadata up to date. Secondly, feel free to seek technical support from the Crossref team when the need arises.&lt;/p></description></item><item><title>New tool to report on completeness of open research information globally</title><link>https://www.crossref.org/blog/new-tool-to-report-on-completeness-of-open-research-information-globally/</link><pubDate>Tue, 21 Oct 2025 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/new-tool-to-report-on-completeness-of-open-research-information-globally/</guid><description>&lt;p>&lt;em>Wednesday 22nd October 2025&lt;/em>&amp;mdash;Crossref, the open scholarly infrastructure nonprofit, today releases an enhanced dashboard showing metadata coverage and individual organisations’ contributions to documenting the process and outputs of scientific research in the open. The tool helps research-performing, funding, and publishing organisations identify gaps in open research information, and provides supporting evidence for movements like the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration for Open Research Information&lt;/a>, which encourages more substantial commitment to stewarding and enriching the scholarly record through open metadata.&lt;/p>
&lt;p>Crossref’s &lt;a href="https://www.crossref.org/members/prep" target="_blank">Participation Reports&lt;/a> now offer expanded features and provide full coverage of all members and all resource types registered with Crossref DOIs (Digital Object Identifiers)—over 175 million records representing a significant share of global research production from organisations in 164 countries. Each of Crossref’s 23,000 members has a dashboard to visualise their metadata contributions, display coverage of key information for scholarly works, and get actionable feedback via a gap report that specifies records that need enrichment, all helping to make more transparent the work that goes into creating and curating the scholarly record.&lt;/p>
&lt;p>For any Crossref member—whether journal publisher, research funder, university, or museum—coverage of up to 11 key elements is public and visible to everyone, including: references, abstracts, ORCID iDs, affiliation strings, ROR IDs, Open Funder Registry IDs, funding award numbers, text-mining URLs, licence URLs, Similarity Check URLs (for text-based plagiarism checking) and the presence of a Crossmark policy, indicating the organisation’s commitment to declare corrections and retractions. These metadata elements provide greater context and visibility for research objects such as journal articles and preprints, grants and awards, books and book chapters, standards, datasets, conference papers and various ‘other’ content such as scholarly blogs, images, and even physical museum artefacts.&lt;/p>
&lt;p>&lt;a href="https://orcid.org/0000-0002-6373-1199" target="_blank">Mochammad Tanzil Multazam&lt;/a>, Library Director of Universitas Muhammadiyah Sidoarjo, and Secretary of the Supervisory Board of Relawan Jurnals, says, “As a sponsoring organisation for several thousand small publishers across Indonesia, we support Crossref members to register complete metadata for their works. Despite time and resource constraints, this new actionable open report on key metadata elements will help drive improvements in the information they share for their publications. This has wide-reaching implications for the visibility of that research and trust among the community, and therefore has the potential to support Indonesian scholarship in the global context.”&lt;/p>
&lt;p>&lt;a href="https://orcid.org/0009-0008-8562-7748" target="_blank">Lena Stoll&lt;/a>, Program Lead at Crossref, explains, “We are happy to have extended participation reports to cover more diverse record types, including grants, datasets, dissertations, and more, and to make it easier for our members to act on their ongoing improvements to enrich their records and build towards the vision of an open and more complete Research Nexus.”&lt;/p>
&lt;p>&lt;a href="https://orcid.org/0000-0001-8249-1752" target="_blank">Ludo Waltman&lt;/a>, Scientific Director and Professor of Quantitative Science Studies at the Centre for Science and Technology Studies (CWTS) at Leiden University, comments, “As a representative of the researcher and metascience communities, this data is of great importance for us to analyse the trends and effects of global research activity. Crossref is one of the main driving forces in open infrastructure, and its commitment to supporting metadata completeness through this open reporting dashboard is a significant step for the open research information movement.”&lt;/p>
&lt;p>Access Crossref &lt;a href="https://www.crossref.org/members/prep" target="_blank">Participation Reports&lt;/a> and search for any Crossref member organisation.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/prep-la-salle.png"
alt="screenshot of participation report for a typical Crossref member, Universidad La Salle Arequipa in Peru, showing percentages per metadata element" width="100%">&lt;figcaption>
&lt;p>Participation report for a typical Crossref member, Universidad La Salle Arequipa in Peru&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h4 id="about-crossref">About Crossref&lt;/h4>
&lt;p>Crossref runs an open infrastructure to link research objects, entities, and actions, creating a lasting and reusable scholarly record that underpins open science. Together with their 23,000 members in 4 Crossref drives metadata exchange and supports nearly 2 billion monthly API queries, facilitating global research communication, for the benefit of society.&lt;/p></description></item><item><title>Integrating grant metadata for seamless research interconnectivity at FCCN|FCT</title><link>https://www.crossref.org/blog/integrating-grant-metadata-for-seamless-research-interconnectivity-at-fccnfct/</link><pubDate>Wed, 15 Oct 2025 00:00:00 +0000</pubDate><author>Rocío Gaudioso Pedraza</author><guid>https://www.crossref.org/blog/integrating-grant-metadata-for-seamless-research-interconnectivity-at-fccnfct/</guid><description>&lt;p>&lt;a href="#version-in-portuguese">&lt;em>Click here for the version in Portuguese&lt;/em>&lt;/a>&lt;/p>
&lt;p>Welcome back to our series of case studies of research funders using the Grant Linking System. In this interview, I talk with Cátia Laranjeira, PTCRIS Program Manager at FCCN|FCT, Portugal’s main public funding agency, about the agency’s approach to metadata, persistent identifiers, Open Science and Open Infrastructure.
With a holistic approach to the management, production and access to information on science, FCCN|FCT&amp;rsquo;s decision to implement the Grant Linking System within their processes was not simply a technical upgrade, but a coordinated effort to continue building a strong culture of openness. With the mantra “register once, reuse always”, FCCN|FCT efforts to embrace open funding metadata was only logical.&lt;/p>
&lt;h3 id="could-you-introduce-your-organisation">Could you introduce your organisation?&lt;/h3>
&lt;p>We are FCCN, the digital services of the FCT, the Foundation for Science and Technology, which is the main public funding agency in Portugal. FCT supports research and innovation in Portugal through multiple funding instruments targeting researchers, projects, institutions and international partnerships. FCCN is focused on providing digital services to the scientific and academic community in Portugal.&lt;/p>
&lt;p>I am the manager of a program called &lt;a href="http://www.ptcris.pt" target="_blank">PTCRIS&lt;/a>, part of the FCCN, within the ‘Scientific Knowledge’ pillar of the unit. PTCRIS is a broad program, whose main goal is to fulfill the mantra ‘register once, reuse always’. We aim to develop an integrated ecosystem of scientific information, so all the projects we run have this main goal and that’s what we work towards. We develop infrastructure and added-value services, such as the &lt;a href="https://www.cienciavitae.pt/" target="_blank">scientific curriculum vitae management platform&lt;/a> and an indicator system that exposes information of all the funding that supports research and innovation in Portugal.&lt;/p>
&lt;h3 id="what-motivated-you-to-join-crossref">What motivated you to join Crossref?&lt;/h3>
&lt;p>We had already adopted ORCID and we also developed a national PID, connected to the citizen card additional to ORCIDs. In 2015 we adopted the &lt;a href="https://isni.org/page/what-is-isni/" target="_blank">ISNI&lt;/a> and we also had DOIs for research outputs. So we were clearly missing one piece, which was metadata for funding.
At the same time we started developing a national infrastructure on science and technology funding, to have an aggregated and holistic view of the funding that is distributed in Portugal.&lt;/p>
&lt;p>Before that the information was scattered across different databases and websites from many different funders, so we organised and aggregated this information into a platform called &lt;a href="https://www.fccn.pt/en/atualidade/portal-sciproj-o-novo-servico-da-fct-para-a-pesquisa-do-financiamento-cientifico-em-portugal/?utm_source=chatgpt.com" target="_blank">SciPROJ&lt;/a>, which brings together all the information on scientific funding in one place, with quick and flexible access. But we didn’t have persistent identifiers for grants, and this was at the same time that Crossref started to build the Grant Linking System, so we were actually one of the first organisations to join, and in 2023 we had a pilot, where we registered 6000 grants, and we have been registering funding metadata ever since.&lt;/p>
&lt;h3 id="can-you-tell-us-about-your-experience-using-the-grant-linking-system">Can you tell us about your experience using the Grant Linking System?&lt;/h3>
&lt;p>The beginning of the pilot was the most critical stage of the process; some effort was needed to map our data models to the &lt;a href="https://www.crossref.org/documentation/schema-library/grants-schema/" target="_blank">Crossref grant metadata schema&lt;/a>. FCCN wasn’t in a bad position to do this since we already had all that information in a registry and it was well organised, we just had to map them to make sure that the information we had could be shared following the Crossref metadata schema and &lt;a href="https://www.crossref.org/documentation/principles-practices/" target="_blank">best practices&lt;/a>.
It has been two years since the pilot, which puts us in phase 2 of the implementation of the system. During the pilot we concentrated on registering both historical and current grants&amp;rsquo; metadata, in the current phase, we are focusing on current grants’ metadata.&lt;/p>
&lt;h3 id="what-do-you-find-useful-about-registering-grant-metadata-with-crossref">What do you find useful about registering grant metadata with Crossref?&lt;/h3>
&lt;p>&lt;strong>Although this is the very beginning of this journey, we envision a world where we have the ability to link grants to any other object and entity that comprises the ecosystem: people that execute that funding, projects, institutions, outputs.&lt;/strong> Outputs are something particularly important to us, like for many other funders, because we want to be able to monitor the impact of our funding and that is something that is always at the back of our mind.&lt;/p>
&lt;p>We are actually developing more and more services that aim to show how these links can be very useful to retrieve information from the system. For example, we are developing an indicator system that is focusing on the funding but also on the outputs and the links between the two. We are also monitoring OA trends, to see how FCT funding is contributing to Open Science initiatives.&lt;/p>
&lt;p>Additionally, our &lt;a href="https://www.fct.pt/en/sobre/estudos-e-planeamento-estrategico/politicas-de-ciencia-aberta/acesso-aberto-a-publicacoes-cientificas/" target="_blank">OA policy was recently launched&lt;/a> but we currently don’t have any system that allows us to track policy compliance. We are working towards that, but to achieve this &lt;strong>it is absolutely fundamental that grants are linked to the outputs through metadata.&lt;/strong>&lt;/p>
&lt;h3 id="what-are-your-hopes-for-the-gls-and-greater-transparency-in-funding-metadata-in-general">What are your hopes for the GLS and greater transparency in funding metadata in general?&lt;/h3>
&lt;p>The interconnectivity and interoperability of entities and objects, which is something that the field of scientific information management has always wanted to do, but that it’s very difficult to do. There have been attempts in the past to achieve this using information from the acknowledgement sections of publications, but this is fairly inefficient and there needs to be more structure to it. &lt;strong>A critical piece of this puzzle would be to influence publishers, manuscript submission platforms to facilitate the systematic sharing of grant IDs and grant metadata by design.&lt;/strong> I think this is something that is still missing and that I would like to see happening soon.&lt;/p>
&lt;h3 id="has-anything-surprised-you-while-implementing-the-grant-linking-system">Has anything surprised you while implementing the Grant Linking System?&lt;/h3>
&lt;p>Something that we have seen that was surprising was that researchers, who in general are not that concerned about PIDs, when it came to grant IDs, they would ask us proactively what the Crossref grant ID for their award was! It was very refreshing to see that we didn’t need to do any advertising to socialize Crossref grant IDs among our grant holders. I think that tells you about the high level of awareness there is within our community of the importance of the Crossref grant ID, using it and putting it in the acknowledgment section of their publications.&lt;/p>
&lt;h3 id="based-on-your-experience-what-would-be-your-advice-for-colleagues-from-other-research-funders">Based on your experience, what would be your advice for colleagues from other research funders?&lt;/h3>
&lt;p>I would say go for it! The more the merrier! This is like any other similar information system – &lt;strong>it only works if there are enough people using it&lt;/strong>, registering grants metadata that facilitate the links between objects.&lt;/p>
&lt;p>It is a very easy process to get into. Once you map the metadata schema to your own data it’s not a technically difficult thing to do. For us it’s an automated process that runs very smoothly, from grant registration to communicating this information to grant holders. We can see this in action in this example: the grantee published &lt;a href="https://doi.org/10.3390/agriculture14020298" target="_blank">an article&lt;/a> that acknowledges their funding through &lt;a href="https://sciproj.ptcris.pt/157479UID" target="_blank">Crossref’s grants IDs&lt;/a> or funding received being acknowledged in the &lt;a href="https://www.citab.utad.pt/the-centre/welcome-to-citab" target="_blank">website of a Research Center&lt;/a>.&lt;/p>
&lt;h3 id="if-you-could-change-something-about-the-gls-or-how-the-grant-metadata-you-register-is-used-what-would-it-be">If you could change something about the GLS or how the grant metadata you register is used, what would it be?&lt;/h3>
&lt;p>I would love to have access to a visualization of grants’ metadata, how many outputs are linked to, and how they relate to other objects and entities. That would really give us a clearer understanding of the impact that our funding is having.
We’d also love to see better integration between Crossref and ORCID for grants—just like it works for publications. Ideally, when a grant is registered and linked to a researcher, they’d be notified and could easily add it to their ORCID record. This would allow the information to flow seamlessly into their national CV via &lt;strong>PTCRISsync&lt;/strong>, ensuring consistency and reducing manual work.&lt;/p>
&lt;hr>
&lt;p>We are grateful to Cátia Laranjeira and FCT|FCCN for sharing their perspective and long-standing experience in this space. Their experience highlights the role that funding metadata plays in an interconnected and complete research and funding ecosystem.&lt;/p>
&lt;h3 id="version-in-portuguese">Version in Portuguese&lt;/h3>
&lt;p>&lt;em>Translation by Edilson Damasio&lt;/em>&lt;/p>
&lt;h3 id="integração-de-metadados-de-financiamento-pela-fccnfct-para-reforçar-a-interoperabilidade-da-informação-sobre-a-atividade-científica">Integração de metadados de financiamento pela FCCN|FCT para reforçar a interoperabilidade da informação sobre a atividade científica&lt;/h3>
&lt;p>Bem-vindo(a) de volta à nossa série de estudos de caso sobre instituições financiadoras de investigação que utilizam o Grant Linking System. Nesta entrevista, conversamos com Cátia Laranjeira, gestora do programa PTCRIS na FCCN|FCT, a principal agência pública de financiamento à ciência em Portugal, sobre a abordagem da instituição aos metadados, identificadores persistentes, Ciência Aberta e Infraestruturas Abertas.&lt;/p>
&lt;p>Com uma abordagem holística à gestão, produção e acesso à informação científica, a decisão da FCCN|FCT de integrar o Grant Linking System nos seus processos não representou apenas uma evolução técnica, mas sim um esforço coordenado para consolidar uma forte cultura de abertura. Sob o lema “registar uma vez, reutilizar sempre”, a adoção de metadados abertos de financiamento pela FCCN|FCT foi um passo natural e coerente com essa visão.&lt;/p>
&lt;h3 id="poderia-apresentar-a-sua-organização">Poderia apresentar a sua organização?&lt;/h3>
&lt;p>A FCCN é a unidade de serviços digitais da FCT — Fundação para a Ciência e a Tecnologia, a principal agência pública de financiamento à ciência em Portugal. A FCT apoia a investigação e a inovação através de diversos instrumentos de financiamento dirigidos a investigadores, projetos, instituições e parcerias internacionais. A FCCN dedica-se a disponibilizar serviços digitais à comunidade científica e académica portuguesa.&lt;/p>
&lt;p>Na FCCN|FCT, sou gestora do PTCRIS, um programa integrado no pilar do Conhecimento Científico. O PTCRIS é um programa abrangente que tem como objetivo central concretizar o princípio “registar uma vez, reutilizar sempre”. Trabalhamos para desenvolver um ecossistema integrado de informação científica, e todos os projetos que conduzimos convergem nesse propósito. Desenvolvemos infraestruturas e serviços de valor acrescentado, como a plataforma de gestão do currículo científico &lt;a href="https://www.cienciavitae.pt/" target="_blank">CIÊNCIAVITAE&lt;/a> e um sistema de indicadores que disponibiliza informação sobre todos os financiamentos que apoiam a investigação e a inovação em Portugal.&lt;/p>
&lt;h3 id="o-que-motivou-a-adesão-à-crossref">O que motivou a adesão à Crossref?&lt;/h3>
&lt;p>A FCCN tinha já adotado o ORCID e desenvolvido um identificador nacional persistente (PID), ligado ao cartão de cidadão, como complemento aos ORCIDs. Em 2015, adotámos o &lt;a href="https://isni.org/page/what-is-isni/" target="_blank">ISNI&lt;/a> e também tínhamos DOIs para a produção científica. Ficava claramente em falta um elemento: os metadados de financiamento.&lt;/p>
&lt;p>Ao mesmo tempo, iniciámos o desenvolvimento de uma infraestrutura nacional de financiamentos de ciência e tecnologia, com o objetivo de ter uma visão agregada e holística do financiamento que suporta a investigação e inovação em Portugal.&lt;/p>
&lt;p>Antes disso, a informação estava dispersa por diferentes bases de dados e websites de múltiplos financiadores. Organizámos e agregámos esta informação numa plataforma chamada &lt;a href="https://www.fccn.pt/en/atualidade/portal-sciproj-o-novo-servico-da-fct-para-a-pesquisa-do-financiamento-cientifico-em-portugal/?utm_source=chatgpt.com" target="_blank">SciPROJ&lt;/a>, que reúne toda a informação sobre financiamentos científicos num único local, com acesso rápido e flexível. No entanto, ainda não existiam identificadores persistentes para os financiamentos, coincidindo com o momento em que a Crossref começou a desenvolver o Grant Linking System. Fomos, assim, uma das primeiras organizações a aderir. Em 2023, realizámos um piloto com 6.000 financiamentos registados, e desde então temos vindo a registar continuamente os metadados de financiamento.&lt;/p>
&lt;h3 id="pode-falar-nos-sobre-a-sua-experiência-com-o-grant-linking-system">Pode falar-nos sobre a sua experiência com o Grant Linking System?&lt;/h3>
&lt;p>A FCCN iniciou a utilização do Grant Linking System com um piloto, que constituiu a fase mais crítica do processo. Foi necessário algum esforço para mapear os nossos modelos de dados para o &lt;a href="https://www.crossref.org/documentation/schema-library/grants-schema/" target="_blank">esquema de metadados de financiamentos da Crossref&lt;/a>. A FCCN estava, no entanto, bem posicionada para isso, uma vez que já dispunha de toda a informação num registo organizado; o passo necessário foi apenas assegurar que esta informação pudesse ser partilhada de acordo com o esquema de metadados da Crossref e as &lt;a href="https://www.crossref.org/documentation/principles-pra" target="_blank">melhores práticas&lt;/a>.&lt;/p>
&lt;p>Já passaram dois anos desde o piloto, o que nos coloca na fase 2 de implementação do sistema. Durante o piloto, focámo-nos no registo de metadados de financiamentos históricos e atuais; na fase atual, estamos focados no registo de metadados de financiamentos atuais.&lt;/p>
&lt;h3 id="o-que-considera-útil-no-registo-de-metadados-de-financiamento-na-crossref">O que considera útil no registo de metadados de financiamento na Crossref?&lt;/h3>
&lt;p>&lt;strong>Embora este seja ainda o início deste percurso, a FCCN idealiza um ecossistema em que seja possível ligar financiamentos a qualquer outro objeto ou entidade do sistema científico — projetos, pessoas que executam esses financiamentos, instituições onde são executados e produções científicas que dele resultam.&lt;/strong> Estes últimos são particularmente importantes para nós, como para muitos outros financiadores, pois queremos monitorizar o impacto do financiamento — uma preocupação que está sempre presente no nosso trabalho.&lt;/p>
&lt;p>Estamos, de facto, a desenvolver serviços que demonstram o valor dessas ligações para a recuperação de informação no sistema. Um exemplo é o sistema de indicadores em desenvolvimento, que se centra nos financiamentos, nas produções científicas e nas relações entre ambos. Estamos também a acompanhar as tendências de Ciência Aberta, para perceber de que forma o financiamento da FCT está a contribuir para as iniciativas de Open Science.&lt;/p>
&lt;p>Além disso, a &lt;a href="https://www.fct.pt/en/sobre/estudos-e-planeamento-estrategico/politicas-de-ciencia-aberta/acesso-aberto-a-publicacoes-cientificas/" target="_blank">política de Acesso Aberto da FCT&lt;/a> foi recentemente lançada, mas ainda não dispomos de um sistema que permita monitorizar a conformidade com essa política. Estamos a trabalhar nesse sentido, mas para o concretizar é &lt;strong>absolutamente essencial que consigamos associar inequivocamente os financiamentos às produções científicas através de metadados.&lt;/strong>&lt;/p>
&lt;h3 id="quais-são-as-suas-expectativas-para-o-gls-e-para-uma-maior-transparência-dos-metadados-de-financiamento-em-geral">Quais são as suas expectativas para o GLS e para uma maior transparência dos metadados de financiamento em geral?&lt;/h3>
&lt;p>A interconectividade e interoperabilidade entre entidades e objetos é algo que a área da gestão de informação científica sempre procurou alcançar — embora seja um objetivo difícil de concretizar. No passado, houve várias tentativas nesse sentido, recorrendo à informação presente nas secções de agradecimentos das publicações, mas esse método revelou-se pouco eficiente e carece de uma estrutura mais sistemática.&lt;/p>
&lt;p>&lt;strong>Uma peça essencial deste puzzle seria influenciar as editoras e as plataformas de submissão de manuscritos a facilitarem a partilha sistemática de identificadores e metadados de financiamento.&lt;/strong> Este é um elemento que ainda falta concretizar, mas que gostaríamos de ver implementado em breve.&lt;/p>
&lt;h3 id="algo-o-surpreendeu-durante-a-implementação-do-grant-linking-system">Algo o surpreendeu durante a implementação do Grant Linking System?&lt;/h3>
&lt;p>Algo que nos surpreendeu durante a implementação do Grant Linking System foi a reação dos investigadores. Normalmente, os investigadores não demonstram grande preocupação com identificadores persistentes (PIDs), mas, neste caso, começaram a procurar ativamente o identificador Crossref do seu financiamento! Foi muito positivo perceber que não foi necessário fazer qualquer esforço de divulgação para promover o uso dos Grant IDs da Crossref entre os beneficiários dos financiamentos. Isso mostra o nível de consciência existente na comunidade científica sobre a importância destes identificadores — usá-los e incluí-los na secção de agradecimentos das publicações.&lt;/p>
&lt;h3 id="com-base-na-sua-experiência-qual-seria-o-seu-conselho-para-colegas-de-outros-financiadores-de-investigação">Com base na sua experiência, qual seria o seu conselho para colegas de outros financiadores de investigação?&lt;/h3>
&lt;p>Com base na nossa experiência, o conselho para outros financiadores seria simples: avancem! Quanto mais, melhor! Este tipo de sistema de informação só é verdadeiramente eficaz quando há muitas entidades a utilizá-lo, a registar metadados de financiamento e a criar ligações entre objetos.&lt;/p>
&lt;p>É também um processo simples de implementar. Uma vez feito o mapeamento entre o esquema de metadados e os dados internos da instituição, não há grandes desafios técnicos. No nosso caso, o processo é totalmente automatizado e flui de forma eficiente, desde o registo do financiamento até à comunicação dessa informação aos beneficiários. É possível ver isso em prática em vários exemplos — desde &lt;a href="https://doi.org/10.3390/agriculture14020298" target="_blank">artigos&lt;/a> que reconhecem o financiamento através dos &lt;a href="https://sciproj.ptcris.pt/157479UID" target="_blank">Grant IDs da Crossref&lt;/a> até ao reconhecimento do apoio financeiro nos &lt;a href="https://www.citab.utad.pt/the-centre/welcome-to-citab" target="_blank">sites dos centros de investigação&lt;/a>.&lt;/p>
&lt;h3 id="se-pudesse-alterar-algo-no-gls-ou-na-forma-como-os-metadados-dos-subsídios-que-regista-são-utilizados-o-que-seria">​​Se pudesse alterar algo no GLS ou na forma como os metadados dos subsídios que regista são utilizados, o que seria?&lt;/h3>
&lt;p>Se pudéssemos mudar algo no Grant Linking System ou na forma como os metadados de financiamento são utilizados, gostaríamos de ter acesso a uma visualização interativa que mostrasse quantas produções científicas estão ligadas a cada financiamento e como esses se relacionam com outras entidades e objetos. Isso permitiria compreender de forma muito mais clara o impacto real dos financiamentos.&lt;/p>
&lt;p>Gostaríamos também de ver uma melhor integração entre a Crossref e o ORCID no que respeita aos financiamentos — tal como já acontece com as publicações. Idealmente, quando um financiamento fosse registado e associado a um investigador, este seria notificado e poderia adicioná-lo facilmente ao seu registo ORCID. Assim, a informação fluiria automaticamente para o currículo nacional via &lt;strong>PTCRISsync&lt;/strong>, garantindo consistência e reduzindo o trabalho manual.&lt;/p>
&lt;hr>
&lt;p>Agradecemos à Cátia Laranjeira e à FCT|FCCN por partilharem a sua perspetiva e longa experiência neste domínio. A sua experiência destaca o papel que os metadados de financiamento desempenham num ecossistema de investigação e financiamento interligado e completo.&lt;/p></description></item><item><title>Enhancing repository integration with Crossref</title><link>https://www.crossref.org/blog/enhancing-repository-integration-with-crossref/</link><pubDate>Mon, 13 Oct 2025 00:00:00 +0000</pubDate><author>Johanssen Obanda</author><guid>https://www.crossref.org/blog/enhancing-repository-integration-with-crossref/</guid><description>&lt;p>Repositories are home to a wide range of scholarly content; they often archive theses, dissertations, preprints, datasets, and other valuable outputs. These records are an important part of the research ecosystem and should be connected to the broader scholarly record. But to truly serve their purpose, repository records need to be connected to each other, to the broader research ecosystem, and to the people behind the research. Metadata is what makes that possible. Enhancing metadata is a way to tell a fuller, more accurate story of research. It helps surface relationships between works, people, funders, and institutions, and allows us as a community to build and use a more connected, more useful network of knowledge - what Crossref calls the ‘&lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">Research Nexus&lt;/a>’.&lt;/p>
&lt;p>The challenge many repositories face is that metadata can be incomplete, inconsistent, or disconnected. Think of references without DOIs, authors without ORCID iDs, or research outputs that aren&amp;rsquo;t linked to funding. To address this, Crossref provides a range of services that repositories can use to improve the quality and interoperability of their metadata. Our REST API, which is openly and publicly accessible, allows repositories to retrieve structured metadata, such as DOIs, references, abstracts, contributors, ORCID iDs, and funder information, that can be used to enrich and update their local records. For repository members, with the Cited-by service and reference linking, repositories can also show how works are being cited and interconnect related content. The Grant Linking System (GLS) enables the clear indication of which research outputs are linked to specific grants, and funding bodies themselves are connected using Open Funder Registry and ROR, adding another layer of context. With Crossmark, repositories can flag updates, corrections, or retractions to ensure transparency and trust in the scholarly content they host.&lt;/p>
&lt;p>Enriching repository metadata using Crossref is a practical and empowering step toward making your records more discoverable, complete, and connected. The process is simple, and you don’t need to be a developer to get started. Repositories can query the Crossref REST API using a DOI or basic metadata like a title or author name, and receive structured, reliable information. This can include full author lists, ORCID iDs, reference lists, funding data, and licensing terms. You can then match and merge this data into your repository records. Adding Crossref DOIs to your metadata enables persistent linking, helping users trace research outputs back to their stewards. It also helps create rich relationships between articles, datasets, software, grants, and other research objects. All of this supports the FAIR principles and contributes to a more connected and reusable scholarly record. And because Crossref’s infrastructure is open, any repository can access and use this metadata to improve the quality, visibility, and long-term value of their collections.&lt;/p>
&lt;h3 id="steps-to-enrich-repository-metadata-with-crossref">Steps to enrich repository metadata with Crossref:&lt;/h3>
&lt;ul>
&lt;li>Query the REST API using DOIs or basic metadata (visit our &lt;a href="https://www.crossref.org/learning/" target="_blank">API learning hub&lt;/a> to learn how to use the Crossref API)&lt;/li>
&lt;li>Retrieve structured metadata like authors, ORCID iDs, funders, affiliations, ROR IDs, licenses, grants, and references&lt;/li>
&lt;li>Map and merge with your local records&lt;/li>
&lt;li>Display persistent links to all kinds of research objects using Crossref DOIs&lt;/li>
&lt;li>Support FAIR by including open, structured, and complete metadata&lt;/li>
&lt;/ul>
&lt;p>Across the repository community, several institutions are already integrating Crossref metadata in meaningful ways to enrich their records and improve discoverability. DSpace users can enrich their deposits by using the platform’s &lt;a href="https://wiki.lyrasis.org/display/DSDOC7x/Live&amp;#43;Import&amp;#43;from&amp;#43;external&amp;#43;sources" target="_blank">“Live Import” feature&lt;/a>, which allows them to pull in Crossref metadata, such as titles, authors, and DOIs, directly into items during the submission process. A deeper integration between DSpace and Crossref is currently in development. HAL in France uses the Crossref API to complete and standardise references, making its content more consistent and connected (hal.archives-ouvertes.fr). SciELO, a key open access platform in Latin America, leverages Crossref DOI links and citation metadata to strengthen the visibility of its journals (&lt;a href="https://scielo.org" target="_blank">scielo.org&lt;/a>). In Canada, the University of Saskatchewan’s eCommons repository queries the Crossref API to enhance metadata accuracy and link records to the broader scholarly graph (ecommons.usask.ca). The Apollo repository at the University of Cambridge uses Crossref to connect theses and articles to their published versions, creating a clearer picture of research outcomes (repository.cam.ac.uk). Zenodo, hosted by CERN, draws on Crossref metadata to link deposited datasets and software with related publications, supporting transparency and reuse (&lt;a href="https://zenodo.org/" target="_blank">zenodo.org&lt;/a>).&lt;/p>
&lt;p>These examples show how even modest integrations with Crossref can lead to substantial gains in metadata quality, interoperability, and global discoverability. Altogether, these activities and organisations are enhancing the Research Nexus, enriching a scholarly graph for the benefit of all.&lt;/p>
&lt;p>Want to learn more? You can explore the &lt;a href="https://www.crossref.org/pdfs/enhancing-repository-integration-with-crossref-services.pdf">presentation slides (PDF)&lt;/a> from &lt;strong>Open Repositories 2025&lt;/strong>, which cover the Crossref API and its capabilities, how repositories can use it to query and enrich metadata, the benefits for repository managers, researchers, and funders, as well as recent updates to our metadata schema.&lt;/p></description></item><item><title>Piecing together the Research Nexus: uncovering relationships with open funding metadata</title><link>https://www.crossref.org/blog/piecing-together-the-research-nexus-uncovering-relationships-with-open-funding-metadata/</link><pubDate>Wed, 01 Oct 2025 00:00:00 +0000</pubDate><author>Rocío Gaudioso Pedraza</author><guid>https://www.crossref.org/blog/piecing-together-the-research-nexus-uncovering-relationships-with-open-funding-metadata/</guid><description>&lt;p>The Crossref &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System (GLS)&lt;/a> has been facilitating the registration, sharing and re-use of open funding metadata for six years now, and we have reached some important milestones recently! What started as an interest in identifying funders through the Open Funder Registry evolved to a more nuanced and comprehensive way to share and re-use open funding data systematically. That’s how, in collaboration with the funding community, the Crossref Grant Linking System was developed. Open funding metadata is fundamental for the transparency and integrity of the research endeavour, so we are happy to see them included in the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>.&lt;/p>
&lt;p>As emphasised recently by &lt;a href="https://doi.org/10.64000/dvqke-j4v69" target="_blank">Hans de Jonge from NWO&lt;/a>, funding metadata’s value is in the transparency of the relationships it enables. The system is powered by the collective action of the research community– including research funders – that registers open metadata with Crossref, making these relationships possible. With close to 180,000 grant records in our corpus we wanted to know how far they reach and what story they tell.&lt;/p>
&lt;p>In March 2022, we &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">developed an approach for linking grants to research outputs&lt;/a> and analysed how many such relationships could be established. Now we’re able to present &lt;a href="https://doi.org/10.13003/aexidu9f" target="_blank">the latest dataset&lt;/a> that contains relationships between grants and research outputs, both those deposited by Crossref members and discovered by an automated matching strategy. It includes data deposited up to the end of July 2025.&lt;/p>
&lt;p>This work is part of our ongoing &lt;a href="https://www.crossref.org/community/special-programs/metadata-matching/">Metadata Matching&lt;/a> project.&lt;/p>
&lt;h2 id="what-exactly-is-in-this-new-open-dataset-of-grantoutput-relationships">What exactly is in this new open dataset of grant&amp;lt;&amp;gt;output relationships?&lt;/h2>
&lt;ul>
&lt;li>The dataset contains 250,163 total funding relationships between grants and research outputs. &lt;/li>
&lt;li>We welcomed a number of funders, such as the &lt;a href="https://doi.org/10.64000/dvqke-j4v69" target="_blank">Dutch Research Council&lt;/a> and &lt;a href="https://frq.gouv.qc.ca/en/persistent-unique-identifiers-doi/" target="_blank">Fonds de Recherche du Quebec&lt;/a>, which together registered almost 27,000 grants in the past year. &lt;/li>
&lt;li>It’s clear that the more grant metadata is registered the more funding relationships we can uncover. &lt;/li>
&lt;li>The percentage of relationships that are registered explicitly by Crossref members providing grants IDs in funding information has grown from less than 0.1% in 2023 to 1% (modest numbers but amazing growth!).&lt;/li>
&lt;/ul>
&lt;h3 id="the-methodology">The methodology &lt;/h3>
&lt;p>We created &lt;a href="https://doi.org/10.13003/aexidu9f" target="_blank">a dataset of relationships between grants and research outputs&lt;/a> by analysing their metadata in several ways. A relationship is included in the dataset if at least one of the following conditions is met:&lt;/p>
&lt;ul>
&lt;li>A relationship was explicitly deposited by a Crossref member through a &lt;em>finances&lt;/em> or &lt;em>isFinancedBy&lt;/em> &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/relationships/">relationship&lt;/a>: 488 (0.2%) relationships&lt;/li>
&lt;li>The research output contains the grant DOI within the award number in the funding metadata: 2,003 (0.8%) relationships&lt;/li>
&lt;li>The award numbers in the grant and the research output are similar, and the associated funding organisations are either the same, or one is the sub-organisation of the other: 247,672 (99%) relationships &lt;/li>
&lt;/ul>
&lt;p>The dataset includes data deposited until the end of July 2025 and contains 250,163 total relationships.&lt;/p>
&lt;p>The code used to generate the dataset is available &lt;a href="https://gitlab.com/crossref/data-science/matching-tools/-/tree/main/grant_matching/offline_dataset?ref_type=heads" target="_blank">in our GitLab repository&lt;/a>.&lt;/p>
&lt;h3 id="the-results">The results&lt;/h3>
&lt;p>As you can see in the graph below, the number of relationships grant-research output continues to grow as the number of grants records Crossref members register with us increases.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/graph-grant-research-output.png"
alt="Graph of the number of relationships grant-research output" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>&lt;strong>Figure 1:&lt;/strong> Cumulative totals of grants, linked grants, research outputs, and grant–research output relationships from 2019 to 2025. Stepwise increases correspond to the addition of major funder datasets, including Wellcome (2020), OSTI (2021), JST (2022), the European Union (2022), the Austrian Science Fund (2023), and the Fonds de recherche du Québec (2025).&lt;/p>
&lt;p>Looking at the numbers broken down by grant registrants we can see that the more grants registered the more relationships can be uncovered. The table below shows funders who have at least 1,000 total grants registered and for whom at least 10% of their registered grants are linked to research outputs, showing the number of relationships, grants, linked grants and linked research outputs (sorted by the percentage of linked grants), and compared with the data from the 2023 analysis (where available) to see how the uptake of open funding metadata is evolving.&lt;/p>
&lt;table style="border-collapse:collapse; width:100%; font-family:system-ui, -apple-system, Segoe UI, Roboto, Arial, sans-serif;">
&lt;thead>
&lt;tr style="background-color:#006d87; color:#fff;">
&lt;th rowspan="2" style="border:1px solid #ccc; padding:6px; text-align:left;">Funder&lt;/th>
&lt;th colspan="2" style="border:1px solid #ccc; padding:6px; text-align:center;">Relationships&lt;/th>
&lt;th colspan="2" style="border:1px solid #ccc; padding:6px; text-align:center;">Linked research outputs&lt;/th>
&lt;th colspan="2" style="border:1px solid #ccc; padding:6px; text-align:center;">Grants&lt;/th>
&lt;th colspan="2" style="border:1px solid #ccc; padding:6px; text-align:center;">Number of linked grants&lt;/th>
&lt;th colspan="2" style="border:1px solid #ccc; padding:6px; text-align:center;">Percentage of linked grants&lt;/th>
&lt;/tr>
&lt;tr style="background-color:#006d87; color:#fff;">
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2023&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2025&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2023&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2025&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2023&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2025&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2023&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2025&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2023&lt;/th>
&lt;th style="border:1px solid #ccc; padding:6px; text-align:center;">2025&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">European Union&lt;/th>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">86,979&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">128,572&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">78,576&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">114,491&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">39,703&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">53,473&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">14,860&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">21,402&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">37.4%&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">40%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">Japan Science and Technology Agency&lt;/th>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">19,549&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">30,728&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">16,265&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">25,003&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">9,923&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">11,866&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">2,609&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">3,900&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">26.3%&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">32.9%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">Wellcome&lt;/th>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">34,254&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">45,596&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">25,720&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">33,783&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">17,547&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">19,929&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">5,238&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">6,206&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">29.9%&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">31.1%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">American Cancer Society&lt;/th>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">50&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">604&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">49&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">586&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">380&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">1,162&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">34&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">277&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">8.9%&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">23.8%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">American Heart Association (AHA)&lt;/th>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">40&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">1,040&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">38&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">935&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">598&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">2,764&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">30&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">621&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">5%&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">22.5%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">Fundacao para a Ciencia e a Tecnologia&lt;/th>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">0&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">27,915&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">0&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">15,681&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">5&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">17,422&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">0&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">3,793&lt;/td>
&lt;td style="text-align:center; border:1px solid #000; padding:6px;">–&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">21.8%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;th scope="row" style="text-align:left; border:1px solid #000; padding:6px;">Austrian Science Fund (FWF)&lt;/th>
&lt;td style="text-align:center; border:1px solid #000; padding:6px;">–&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">10,387&lt;/td>
&lt;td style="text-align:center; border:1px solid #000; padding:6px;">–&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">7,459&lt;/td>
&lt;td style="text-align:center; border:1px solid #000; padding:6px;">–&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">19,576&lt;/td>
&lt;td style="text-align:center; border:1px solid #000; padding:6px;">–&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">2,712&lt;/td>
&lt;td style="text-align:center; border:1px solid #000; padding:6px;">–&lt;/td>
&lt;td style="text-align:right; border:1px solid #000; padding:6px;">13.9%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;strong>Table 1:&lt;/strong> Comparison between data from 2023-07-31 and 2025-07-31 of a number of Crossref members registering grants. It shows the number of relationships, grants, linked grants and linked research outputs, sorted by the percentage of linked grants.&lt;/p>
&lt;p>We encourage funders to join as members once they have determined the means of effective implementation of the GLS within their processes. By further analysing metadata of matched outputs, funders have the opportunity to monitor compliance with their policies and learn more about the impact of their programs.&lt;/p>
&lt;h3 id="following-through-funders-open-science-commitments">Following through funders’ Open Science commitments&lt;/h3>
&lt;p>The relationships showcased above and in the recent analysis are powered by open funding metadata. Open funding metadata plays a central role in building a transparent, accountable and high integrity research environment by making visible the connections between the funding, grantees, research outputs, and their impact. Funders’ openness mandates and Open Science commitments emphasize the importance of traceability in the research process, so ensuring that the support given-whether financial or otherwise-can be systematically recorded and shared is instrumental. Openness is also part of the strategic plans of institutions such as the International Science Council, who has &lt;a href="https://council.science/blog/fighting-disinformation-with-sunshine-promoting-funding-transparency-in-science/" target="_blank">explicitly called for greater transparency in funding&lt;/a> as a way to strengthen trust in science and counter misinformation. At the same time, initiatives such as the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration on Open Research Information&lt;/a> underscores the benefits of open, reusable funding metadata for monitoring, evaluation and assessment of research and researchers.&lt;/p>
&lt;p>Crossref’s Grant Linking System offers funders’ a way to demonstrate a commitment to openness, modeling the standards they expect of the research community they support, while creating a more robust, trustworthy and collaborative research ecosystem.&lt;/p>
&lt;h3 id="economy-of-scale-unlocking-relationships-with-crossref">Economy of scale: unlocking relationships with Crossref&lt;/h3>
&lt;p>Crossref houses millions of records, from the ubiquitous research articles and preprints, to books, peer review records, technical reports, datasets – you name it. Our members not only register, but also regularly update their metadata as new or corrected information becomes available. Our matching workflows allow us to make visible the hidden relationships and complete and improve the metadata records by adding new and reciprocal assertions.&lt;/p>
&lt;p>This analysis shows the unique value of registering funding metadata with Crossref and adding an essential piece to the Research Nexus puzzle. &lt;strong>The relationship metadata allows the funding that underpins the research process to be connected, and contextualise scattered data points, acting as an anchor that links publications, people, and other research outputs.&lt;/strong> This is made possible by the impressive number of records continuously being registered by more than 23,000 member organisations, and by the increasing availability of funding information in the system with more research funders joining in and registering their grant metadata with us.&lt;/p>
&lt;h3 id="next-steps">Next steps&lt;/h3>
&lt;p>As we welcome more and more funders to the GLS, we, collectively, continue to complete the Research Nexus, record by record, field by field. The more awards we have in our corpus the more relationships we’ll uncover, so we’ll keep making these analyses periodically to make sure we don’t miss them.&lt;/p>
&lt;p>But it is not all on us. We are working towards a vision where Crossref Grant IDs are business as usual – where funders register their awards, grantees are aware of them and share them with publishers, and those publishers share them back with us when registering their content – closing the loop organically. We continue working on making this easier. In the upcoming works schema update a specific Crossref Grant ID field will be added in the funding information, alongside Award ID (for an internal identifier).&lt;/p>
&lt;p>Crucially, as the momentum of adoption among funders increases, and thousands of Crossref Grant IDs are available in the system, we are working with all members to raise their attention to the importance and desirability of funding metadata, so inclusion of that information in metadata of all works increases and consequently, the percentage of relationships asserted by Crossref members can grow.&lt;/p>
&lt;p>This matching analysis is just one example of what we do to enrich metadata to highlight relationships among works, individuals, institutions, and actions. Earlier this year, &lt;a href="https://www.crossref.org/community/special-programs/metadata-matching/">we launched the Metadata Matching project&lt;/a>, which is a major effort to rebuild our matching workflows using modern software development and data science practices. As part of the project, we plan to expose additional matched relationships between grants and research outputs in our REST API, alongside those deposited by our members. We’ll keep you updated as we go along!&lt;/p>
&lt;p>Read more about metadata matching in the blog series:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://doi.org/10.13003/aewi1cai" target="_blank">Metadata matching 101: what is it and why do we need it?&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.13003/zie7reeg" target="_blank">The anatomy of metadata matching&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.13003/pied3tho" target="_blank">The myth of perfect metadata matching&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.13003/ief7aibi" target="_blank">How good is your matching?&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.13003/axeer1ee" target="_blank">Metadata matching: beyond correctness&lt;/a>&lt;/li>
&lt;/ul></description></item><item><title>Innovation in scientific publishing and its implications for Crossref DOI registration practices - Request for input</title><link>https://www.crossref.org/blog/innovation-in-scientific-publishing-and-its-implications-for-crossref-doi-registration-practices-request-for-input/</link><pubDate>Thu, 25 Sep 2025 00:00:00 +0000</pubDate><author>Ludo Waltman</author><guid>https://www.crossref.org/blog/innovation-in-scientific-publishing-and-its-implications-for-crossref-doi-registration-practices-request-for-input/</guid><description>&lt;p>&lt;em>Lots of exciting innovations are being made in scientific publishing, often raising fundamental questions about established publishing practices. In this guest post, Ludo Waltman and André Brasil discuss the recently launched MetaROR publish-review-curate platform and the questions it raises about good practices for Crossref DOI registration in this emerging landscape.&lt;/em>&lt;/p>
&lt;p>Digital Object Identifiers (DOIs) are unique identifiers commonly assigned to research outputs such as journal articles, preprints, peer review reports, and datasets. The DOI of a research output allows the output to be identified online in a persistent way, even when the underlying publishing infrastructure changes (e.g., a journal moving from one publisher to another).&lt;/p>
&lt;p>There are several DOI registration agencies. Most of the larger scientific publishers work with Crossref, and so do many preprint servers, and therefore our focus in this post is on Crossref. Crossref also keeps track of &lt;a href="https://doi.org/10.31222/osf.io/smxe5_v2" target="_blank">metadata associated with research outputs&lt;/a>, such as the title, authors, and publication date of an output, and it makes this metadata openly available via APIs for all kinds of services to ingest and reuse. Because indexing, discovery, and evaluation tools rely heavily on this metadata, &lt;a href="https://www.crossref.org/services/content-registration/">content registration practices&lt;/a> and metadata design choices can have major effects on the visibility and findability of research outputs and on analytics used to monitor and assess research outputs and their contributors.&lt;/p>
&lt;p>For the most common types of research outputs, such as journal articles and preprints, a broad consensus has emerged over the past decades on good practices for DOI registration. Such consensus means that articles are assigned the record type ‘article’ in their Crossref metadata. Likewise, many preprint servers register DOIs for preprints at Crossref, with the record type ‘preprint’ in the metadata. (The arXiv preprint server is an exception; it registers DOIs for preprints with DataCite rather than Crossref.)&lt;/p>
&lt;p>For innovative new publication platforms, however, good practices for DOI registration are less clear. The approaches to scientific publishing offered by these platforms often do not fit neatly into established ways of working. For instance, for some of these platforms, the traditional distinction between peer-reviewed articles published in scientific journals and non-peer-reviewed articles posted on preprint servers is no longer applicable. This raises fundamental questions about suitable DOI registration practices for new approaches to scientific publishing.&lt;/p>
&lt;h2 id="metaror">MetaROR&lt;/h2>
&lt;p>The &lt;a href="https://metaror.org/" target="_blank">MetaROR (MetaResearch Open Review) platform&lt;/a>, launched in November 2024 by the Research on Research Institute (RoRI) and the Association for Interdisciplinary Meta-Research and Open Science (AIMOS), offers an example of the challenge of developing appropriate DOI registration practices for new publishing models.&lt;/p>
&lt;p>Inspired by similar initiatives such as &lt;a href="https://elifesciences.org/" target="_blank">eLife&lt;/a> and others, MetaROR adopts the so-called &lt;a href="https://doi.org/10.31222/osf.io/h7swt" target="_blank">publish-review-curate model&lt;/a>. Authors first publish their article on a preprint server and then submit it to MetaROR. MetaROR then organizes an open peer review process for the article. Review reports are published on the MetaROR platform, along with a copy of the preprinted article and an editorial assessment. Rather than a simple binary decision (accept vs. reject), an editorial assessment is a short one-paragraph statement summarizing the strengths and weaknesses of an article. Each review report and each editorial assessment has its own DOI registered at Crossref. In this way, review reports are treated as first-class research outputs that can, for instance, be indexed in scientific literature databases and can be cited in other research outputs.&lt;/p>
&lt;p>For an article submitted to MetaROR, the publication of the review reports, the editorial assessment, and a copy of the article itself concludes MetaROR’s publish-review-curate process. The authors of the article may revise their work in light of the feedback received, and MetaROR may review the revised article. However, there is no requirement that revisions must be made. The primary aim of the review reports and the editorial assessment published on the MetaROR platform is to offer context for readers of the article, helping readers understand the strengths and weaknesses of the article.&lt;/p>
&lt;h2 id="crossref-doi-registration">Crossref DOI registration&lt;/h2>
&lt;p>&lt;a href="https://www.crossref.org/documentation/research-nexus/peer-reviews/" target="_blank">Registration of DOIs&lt;/a> for open peer review reports is &lt;a href="https://www.leidenmadtrics.nl/articles/the-growth-of-open-peer-review" target="_blank">increasingly common&lt;/a>. By registering Crossref DOIs for review reports and editorial assessments, MetaROR enables reviewers and editors to be recognized for their contributions. But what about recognition for authors?&lt;/p>
&lt;p>A crucial element in MetaROR’s philosophy is that authors of articles peer-reviewed by MetaROR deserve to be recognized in a similar way as authors of articles published in traditional peer-reviewed journals. One way to promote appropriate recognition for authors of articles peer-reviewed by MetaROR is to ensure that articles on the MetaROR platform, just like articles in peer-reviewed journals, &lt;a href="https://www.openscience.nl/en/cases/the-metaror-publish-review-curate-model-our-experience-as-authors" target="_blank">have their own DOI&lt;/a>. While this may seem straightforward to arrange, it actually raises two non-trivial questions about good practices for Crossref DOI registration:&lt;/p>
&lt;ol>
&lt;li>For each article on the MetaROR platform, there is a corresponding article on a preprint server. Is it acceptable to have two Crossref DOIs, one registered by the preprint server and one registered by the MetaROR platform, for essentially the same article?&lt;/li>
&lt;li>If Crossref DOIs are registered for articles on the MetaROR platform, should the articles be assigned the type ‘article’ or the type ‘preprint’ in their Crossref metadata, or something else entirely?&lt;/li>
&lt;/ol>
&lt;p>On the first question, it could be argued that having &lt;a href="https://doi.org/10.64000/xjgnq-a3p05" target="_blank">two Crossref DOIs for the same article&lt;/a> is problematic and that MetaROR, therefore, should not register DOIs for articles on its platform. Alternatively, one could argue that an article on the MetaROR platform differs in a meaningful way from the corresponding article on a preprint server, since the article on the MetaROR platform has been enriched with peer review reports and an editorial assessment, similar to the way an article in a peer-reviewed journal may be seen as an enriched version of the corresponding article on a preprint server. This line of reasoning would justify registering DOIs for articles on the MetaROR platform.&lt;/p>
&lt;p>On the second question, the argument could be made that articles on the MetaROR platform should be assigned the type ‘preprint’ in their Crossref metadata, since the type ‘article’ is intended for articles in journals and MetaROR does not consider itself to be a journal (in fact, MetaROR works with &lt;a href="https://cms.metaror.org/partner-journals/" target="_blank">partner journals&lt;/a> to enable articles peer-reviewed by MetaROR to be published in journals) and does not certify articles in the way journals do (i.e., MetaROR does not make accept/reject decisions). On the other hand, one could argue that articles on the MetaROR platform should be assigned the type ‘article’, since the peer-reviewed nature of articles in journals is typically seen as the key factor distinguishing these articles from articles on preprint servers. Articles on the MetaROR platform have been peer-reviewed, and in that sense, they resemble articles in journals. A third line of reasoning could be that neither the ‘preprint’ nor the ‘article’ type is fully appropriate for articles on the MetaROR platform and, consequently, that there is a need for a new Crossref record type.&lt;/p>
&lt;h2 id="what-is-your-take">What is your take?&lt;/h2>
&lt;p>The MetaROR team, in consultation with Crossref, will need to decide how to deal with the two questions discussed in this blog post. After some preliminary conversations between the MetaROR team and Crossref, we decided to share these questions more widely to solicit input from the broader community. We invite you to share your thoughts on the two questions, either by posting a comment on this blog post or by reaching out to us on social media or by email. Community perspectives will help shape good practices not only for MetaROR but also for other publish-review-curate initiatives facing similar questions. We look forward to hearing from you!&lt;/p>
&lt;p>&lt;em>Ludo Waltman and André Brasil are members of the editorial team of MetaROR. Ludo and André are grateful to Ginny Hendricks at Crossref for valuable discussions about the issues raised in this blog post.&lt;/em>&lt;/p></description></item><item><title>Crossref and PKP enter new partnership phase to support richer and more inclusive metadata</title><link>https://www.crossref.org/blog/crossref-and-pkp-enter-new-partnership-phase-to-support-richer-and-more-inclusive-metadata/</link><pubDate>Mon, 22 Sep 2025 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/crossref-and-pkp-enter-new-partnership-phase-to-support-richer-and-more-inclusive-metadata/</guid><description>&lt;p>Crossref and the Public Knowledge Project (PKP) have been working closely together for many years, sharing resources and supporting our overlapping communities of organisations involved in communicating research. Now we’re delighted to share that we have agreed on a new set of objectives for our partnership, centred on further development of the tools that our shared community relies upon, as well as building capacity to enable richer metadata registration for organisations using the Open Journal Systems (OJS).&lt;/p>
&lt;p>Crossref is working towards the vision of a rich and open network underpinning global scholarship, making relationships between works, people, institutions, and actions visible, thanks to the thread of metadata – the research nexus. This vision depends upon participation of research communication organisations coming from all parts of the world, disciplines, and languages. Working with PKP towards making tools for metadata registration more comprehensive, accessible, and easier to use is a big step towards supporting our community to participate in &lt;a href="https://www.crossref.org/documentation/research-nexus/">the research nexus&lt;/a>.&lt;/p>
&lt;p>The renewed partnership has three main goals:&lt;/p>
&lt;ul>
&lt;li>Developments to improve experience and support metadata registration workflows in OJS, bringing relevant functionalities together under the Crossref plug-in, and developing an OMP Crossref plug-in.&lt;/li>
&lt;li>Joint community engagement in support of transitioning OJS users to the future Long-Term Support (LTS) version of OJS, which will enable richer metadata registration.&lt;/li>
&lt;li>Creation of a PKP School self-paced training course for system administrators.&lt;/li>
&lt;/ul>
&lt;p>&lt;a href="https://doi.org/10.59350/n3pa8-x3548" target="_blank">Crossref and PKP have a rich history of collaboration&lt;/a>, including previous investment in tools development in 2020, which resulted in some vital improvements to Crossref metadata management in OJS and a more streamlined experience for Crossref members on the platform, as well as many collaborative community events and training.&lt;/p>
&lt;p>We know that thousands of Crossref members use OJS to register their metadata. Many are based in resource-constrained institutions, so the training provided by Crossref and PKP will be key to building their capacity to participate in the research nexus. With OJS 3.5 empowering organisations to register richer metadata, we look forward to opening up more opportunities for members to enhance their participation.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>At PKP, we’re excited to deepen our longstanding collaboration with Crossref, supporting our global community in amplifying the visibility and impact of their research through streamlined integration for robust metadata management. By working together on both technological innovation and capacity-building initiatives, we anticipate even greater outcomes that will strengthen open scholarship throughout the duration of this partnership and well into the future.” – said Kevin Stranack, PKP Director of Operations.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Kevin Stranack, PKP Director of Operations&lt;/cite>&lt;/p>
&lt;/div>
&lt;h3 id="about-crossref">About Crossref&lt;/h3>
&lt;p>Crossref runs an open infrastructure to link research objects, entities, and actions, creating a lasting and reusable scholarly record that underpins open science. Together with their 23,000 members in 164 countries, Crossref drives metadata exchange and supports nearly 2 billion monthly API queries, facilitating global research communication, for the benefit of society.&lt;/p>
&lt;h3 id="about-pkp">About PKP&lt;/h3>
&lt;p>Public Knowledge Project (PKP) seeks to improve the scholarly and public quality, reach, and diversity of academic research through the research, development, implementation, and support of innovative open source software to support scholarly publishing and communication.&lt;/p></description></item><item><title>Raising the standard: GigaScience Press on metadata and discoverability</title><link>https://www.crossref.org/blog/raising-the-standard-gigascience-press-on-metadata-and-discoverability/</link><pubDate>Wed, 17 Sep 2025 00:00:00 +0000</pubDate><author>Scott Edmunds</author><guid>https://www.crossref.org/blog/raising-the-standard-gigascience-press-on-metadata-and-discoverability/</guid><description>&lt;p>To mark Crossref’s 25th anniversary, we launched our first Metadata Awards to highlight members with the best metadata practices.
&lt;a href="https://www.gigasciencepress.org/" target="_blank">GigaScience Press&lt;/a>, based in Hong Kong, was the leader among small publishers, defined as organisations with less than USD 1 million in publishing revenue or expenses. We spoke with Scott Edmunds, Ph.D., Editor-in-Chief at GigaScience Press, about how discoverability drives their high metadata standards.&lt;/p>
&lt;h3 id="what-motivates-your-organisationteam-to-work-towards-high-quality-metadata-what-objectives-does-it-support-for-your-organisation">What motivates your organisation/team to work towards high-quality metadata? What objectives does it support for your organisation?&lt;/h3>
&lt;p>Our objective is to communicate science openly and collaboratively, without barriers, to solve problems in a data- and evidence-driven manner through Open Science publishing. High-quality metadata helps us address these objectives by improving the discoverability, transparency, and provenance of the work we publish. It is an integral part of the &lt;a href="https://www.go-fair.org/fair-principles/" target="_blank">FAIR principles&lt;/a> and UNESCO Open Science Recommendation, playing a role in increasing the accessibility of research for both humans and machines. As one of the authors of the FAIR principles paper and an advisor of the &lt;a href="https://makedatacount.org/" target="_blank">Make Data Count&lt;/a> project, I’ve also personally been very conscious to practice what I preach.&lt;/p>
&lt;h3 id="do-you-have-a-strategy-for-complete-metadata-which-elements-did-you-prioritise-what-workflows-tools-or-collaborations-helped-you-get-there">Do you have a strategy for complete metadata? Which elements did you prioritise? What workflows, tools, or collaborations helped you get there?&lt;/h3>
&lt;p>We’ve been privileged to work with our technical partners at &lt;a href="https://rivervalley.io/" target="_blank">River Valley Technologies&lt;/a>, and the novel XML-first publishing platform they have developed has made it particularly easy to integrate and collect persistent identifiers and other metadata, embedding it into the resulting rich-XML. As Open Access advocates, licensing and machine readability were early focuses when launching our journals. We ensured that we provided a text and data mining portal, allowing bulk downloads of our content to encourage reuse. Many specific metadata elements highlighted by the FAIR principles and &lt;a href="https://doi.org/10.54677/MNMH8546#:~:text=The%20UNESCO%20Recommendation%20on%20Open,openly%20available%2C%20accessible%20and%20reusable" target="_blank">UNESCO Open Science&lt;/a> recommendations, and so these have also helped guide what should be prioritised. If there’s one specific tool to mention, we’ve been big fans of the &lt;a href="https://www.crossref.org/documentation/reports/participation-reports/" target="_blank">Crossref participation reports&lt;/a>, as this has helped highlight what is missing and what we need to improve upon.&lt;/p>
&lt;h3 id="how-have-you-integrated-these-into-your-metadata-processes">How have you integrated these into your metadata processes?&lt;/h3>
&lt;p>The participation reports, in particular, have been useful for this, and by regularly checking them, we’ve managed to spot when processes have broken, for example. When you’ve added new fields to the reports like &lt;a href="https://ror.org/" target="_blank">ROR IDs (Research Organization Registry)&lt;/a>, this has also motivated us to prioritise integrating these, so having a curated list of metadata fields like this definitively helps users focus on what should be the most important. River Valley Technologies has been very responsive to this type of feedback, and being able to see the participation report data in real-time has helped drive them to fix and update our metadata. So I thank them for being so patient and quick to respond to our very demanding standards.&lt;/p>
&lt;h3 id="what-impact-of-good-metadata-can-you-see-for-your-organisation">What impact of good metadata can you see for your organisation?&lt;/h3>
&lt;p>From an Editorial side, our technical partners at River Valley Technologies have found having this metadata information available very useful in the Research Integrity tools they have developed and integrated into our publication platform. Things like &lt;a href="https://orcid.org/" target="_blank">ORCID IDs&lt;/a>, RORs, and other identifiers are very useful for tracking provenance and increasing trust.&lt;/p>
&lt;p>From a business side, putting the effort into collecting rich metadata has paid off in the long run by making it easier to integrate our publishing data into new platforms. Making it easier and quicker to integrate and track our data via &lt;a href="https://oaswitchboard.org/" target="_blank">OA Switchboard&lt;/a>, for example. It also helps us more easily mirror and list our content in indexes like &lt;a href="https://www.ncbi.nlm.nih.gov/pmc/" target="_blank">PMC&lt;/a>, &lt;a href="https://www.scopus.com/" target="_blank">Scopus&lt;/a>, &lt;a href="https://clarivate.com/webofsciencegroup/solutions/web-of-science/" target="_blank">Web of Science&lt;/a>, and others.&lt;/p>
&lt;h3 id="have-you-encountered-any-challenges-in-curating-or-improving-your-metadata">Have you encountered any challenges in curating or improving your metadata?&lt;/h3>
&lt;p>One of the main metadata areas that has currently let us down, funding and registries, is because our publishing model is so affordable. The automated production processes from RVT&amp;rsquo;s novel publishing platform have allowed us to publish very cost-effectively (the APC of GigaByte is $535). We’ve also received sponsorship from the WHO to publish a series of public health papers, particularly supporting authors from the Global South who may not have sources of funding listed in these registries. Because of this, we’ve published numerous papers from independent researchers, students, and self-financed projects that may not have funding IDs or grant numbers. We’d like to push to get “unfunded” counted as a metadata field to address this.&lt;/p>
&lt;h3 id="have-your-efforts-regarding-metadata-yielded-tangible-benefits-for-your-community-is-this-something-your-editors-authors-or-readers-are-aware-of-and-appreciate-if-so-why">Have your efforts regarding metadata yielded tangible benefits for your community? Is this something your editors, authors, or readers are aware of and appreciate? If so, why?&lt;/h3>
&lt;p>We’d like to think our authors find this useful, but we’ve not had any specific feedback on this. Our readers, both human and machine, should hopefully appreciate finding our work more easily, and from a purely selfish perspective, should get us higher access and citations. This is difficult to measure, but as evidence nerds, we have &lt;a href="https://doi.org/10.31222/osf.io/xv7tk" target="_blank">attempted to conduct RCTs&lt;/a> examining this for Data Citations. One anecdote I can give is about the author who told us they pasted their paper into ChatGPT and asked it which was the best journal for their work, and it suggested our journal. I’d like to think that putting in this effort in making our papers more machine-readable and comprehensible pays off at times like this to make the discoverability and visibility of our journals greater.&lt;/p>
&lt;h3 id="looking-ahead-how-are-you-planning-to-build-on-your-metadata-quality-are-there-new-elements-or-practices-youre-exploring-and-what-advice-would-you-give-to-others-just-starting-to-strengthen-their-metadata">Looking ahead, how are you planning to build on your metadata quality? Are there new elements or practices you’re exploring? And what advice would you give to others just starting to strengthen their metadata?&lt;/h3>
&lt;p>We still need to update older content with RORs, and improve it for the datasets
linked to our papers. To do this, we’ve had interns working to improve our &lt;a href="https://datacite.org/" target="_blank">DataCite&lt;/a> metadata.&lt;/p>
&lt;p>We encourage others to think about metadata issues when setting up their workflows. While it may seem like additional work, it will be increasingly important to future-proof and get journals ready for our increasingly AI-centric age. And as we show here, we can more easily carry out important tasks like getting your content more quickly and widely indexed and disseminated.&lt;/p>
&lt;p>Strong metadata ties open science, integrity, and discoverability together. GigaScience Press shows how consistent identifiers, machine-readable formats, and continuous checks deliver real benefits. As discovery becomes more AI-assisted, the priority is clear: keep metadata complete, open, and usable.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>While it may seem like additional work, it will be increasingly important to future-proof and get journals ready for our increasingly AI-centric age.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Scott Edmunds, GigaScience&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Now, a few words from Scott.&lt;/p>
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.2500%;
padding-bottom: 0; box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); margin-top: 1.6em; margin-bottom: 0.9em; overflow: hidden;
border-radius: 8px; will-change: transform;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0;margin: 0;"
src="https://www.canva.com/design/DAGyCZEp_QA/dzAi4Azoz_k2-3_B-Z7v8g/watch?embed" allowfullscreen="allowfullscreen" allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;a href="https:&amp;#x2F;&amp;#x2F;www.canva.com&amp;#x2F;design&amp;#x2F;DAGyCZEp_QA&amp;#x2F;dzAi4Azoz_k2-3_B-Z7v8g&amp;#x2F;watch?utm_content=DAGyCZEp_QA&amp;amp;utm_campaign=designshare&amp;amp;utm_medium=embeds&amp;amp;utm_source=link" target="_blank" rel="noopener">Metadata Awards video - Gigascience&lt;/a></description></item><item><title>Meet the candidates and cast your vote in our 2025 Board elections</title><link>https://www.crossref.org/blog/2025-board-election/</link><pubDate>Tue, 16 Sep 2025 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2025-board-election/</guid><description>&lt;p>On behalf of the Nominating Committee, I’m pleased to share the slate of candidates for the 2025 board election.&lt;/p>
&lt;p>Each year we do an open call for board interest. This year, the Nominating Committee received 51 submissions from members worldwide to fill five open board seats.&lt;/p>
&lt;p>We have four large member seats and one small member seat open for election in 2025. We maintain a balanced board of 8 large member seats and 8 small member seats. Size is determined based on the organization&amp;rsquo;s membership tier (small members fall in the $0-$1,650 tiers and large members in the $3,900 - $50,000 tiers).&lt;/p>
&lt;p>We were pleased to see the diversity in candidates, with applicants from 19 countries. The committee was keen to prepare a diverse slate of organization types, individual skills and perspectives, and global representation.&lt;/p>
&lt;h2 id="tier-1-small-member-seats-electing-one-candidate">Tier 1, Small member seats (electing one candidate)&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>Rebecca Wambua&lt;/strong>, Distance, Open and e-Learning Practitioners&amp;rsquo; Association of Kenya&lt;/li>
&lt;li>&lt;strong>Oscar Donde&lt;/strong>, Pan Africa Science Journal&lt;/li>
&lt;li>&lt;strong>Nwachukwu Egbunike&lt;/strong>, Pan-Atlantic University Press&lt;/li>
&lt;/ul>
&lt;h2 id="tier-2-large-member-seats-electing-four-candidates">Tier 2, Large member seats (electing four candidates)&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>Damian Bird&lt;/strong>, CABI&lt;/li>
&lt;li>&lt;strong>Rose L&amp;rsquo;Huillier&lt;/strong>, Elsevier&lt;/li>
&lt;li>&lt;strong>John Sivo&lt;/strong>, IEEE&lt;/li>
&lt;li>&lt;strong>Nick Lindsay&lt;/strong>, The MIT Press&lt;/li>
&lt;li>&lt;strong>Anjalie Nawaratne&lt;/strong>, Springer Nature&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="please-read-the-candidates-statementsboard-and-governanceelections2025-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2025-slate/">Please read the candidates&amp;rsquo; statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="every-member-has-a-vote">Every member has a vote&lt;/h2>
&lt;p>If your organisation is a voting member in good standing as of September 5th, 2025, you are eligible to vote.&lt;/p>
&lt;p>The voting contact for your organisation will receive a ballot from eBallot, a third party election platform. You should receive your ballot by Wednesday, September 17th, and you will have until 12:00 UTC on October 22nd to submit your ballot.&lt;/p>
&lt;p>The election results will be announced at &lt;a href="https://www.crossref.org/crossref-annual-meeting/">Crossref2025&lt;/a>, our annual online meeting on October 22nd, 2025.&lt;/p>
&lt;p>Special thanks to the committee: James Phillpotts of Oxford University Press, Wendy Patterson of Beilstein Institut, Abiodun Falodun of University of Benin, Amanda Ward of Taylor &amp;amp; Francis, and Chaerul Umam of the National Library of Indonesia for the time they dedicated to reviewing the expressions of interest and participating in committee meetings.&lt;/p>
&lt;p>If you have any questions about our election process, please &lt;a href="mailto:lofiesh@crossref.org">contact me&lt;/a>&lt;/p>
&lt;p>Happy voting!&lt;/p></description></item><item><title>A second look at Crossref's carbon footprint - the 2024 report</title><link>https://www.crossref.org/blog/a-second-look-at-crossrefs-carbon-footprint-the-2024-report/</link><pubDate>Mon, 15 Sep 2025 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/a-second-look-at-crossrefs-carbon-footprint-the-2024-report/</guid><description>&lt;p>In 2022, we wrote a blog post &lt;a href="https://doi.org/10.64000/bnv9r-q8f86" target="_blank">“Rethinking staff travel, meetings, and events”&lt;/a> outlining our new approach to staff travel, meetings, and events with the goal of not going back to ‘normal’ after the pandemic and said that in the future we would report on our efforts to balance online and virtual events, work life balance for staff, and track our carbon emissions. In December 2024, we wrote a blog post, &lt;a href="https://doi.org/10.64000/jjr25-es246" target="_blank">“Summary of the environmental impact of Crossref,”&lt;/a> that gave an overview of 2023 and provided the first report on our carbon emissions. Our report on 2023 only just made it into 2024, so we are happy to report on 2024 a little sooner in the year.&lt;/p>
&lt;p>On the positive side, there are a few things:&lt;/p>
&lt;ul>
&lt;li>Our spending on travel and meetings (a proxy for emissions) in 2024 was 56% of what it was in 2019, keeping below the target of not more than 60% of our 2019 spend&lt;/li>
&lt;li>We were better at tracking hotel nights in 2024 compared to 2023&lt;/li>
&lt;li>We managed to balance in-person, regional, and online meetings to engage with our global community while still not having returned to the pre-pandemic “normal”&lt;/li>
&lt;/ul>
&lt;p>In practice, our approach means thinking carefully about how to make the most of each trip. For example, when organising our Crossref Jakarta event, we travelled via Singapore and used the opportunity to meet with members there. Once in Jakarta, we combined our two-day event with an OJS workshop with colleagues from PKP, and another event with Universitas Indonesia. Similarly, when our colleague travelled by train to a conference in Amsterdam, they combined it with a day of visits to members in the area. These kinds of combinations reduce the need for separate trips and maximise the value of in-person travel.&lt;/p>
&lt;p>Some of the less positive things were:&lt;/p>
&lt;ul>
&lt;li>As our membership continues to grow globally and we expand our staff, (which are both great things in themselves), our emissions have also increased. Not only do we have more staff, but some staff travelled more in 2024 than in 2023. We’ll keep a close eye on this to avoid ever-increasing travel.&lt;/li>
&lt;li>Taking a train instead of flying can take longer, and clashes with our desire for staff needing to be away from home as little time as possible.&lt;/li>
&lt;li>It is difficult to find reliable data for some calculations - for example, we have decided not to try to calculate the impact of our Zoom use because there is no reliable way to do this.&lt;/li>
&lt;li>We don’t have good options for offsetting our emissions, and it’s unclear whether we would want to do this even if they were available.&lt;/li>
&lt;/ul>
&lt;p>There is also the issue of whether it is worth it, or possible, to collect certain data, or whether it would change what we do. An example is Zoom. The estimate for the emissions from Zoom meetings in 2024 was 100 kg (that’s kilograms, not tonnes), but the calculations were made using a tool from 2020 that made many assumptions and estimates. We have no way of verifying whether the tool we used is accurate, so we decided not to update our previous calculation. In any case, we aren’t going to ration or reduce our teleconferencing, since it’s an essential tool, and especially if we want to fly less, have fewer in-person meetings, and operate effectively as a distributed organisation in multiple countries with no offices.&lt;/p>
&lt;p>In summary, our total reported carbon emissions increased 40% from 105 tCO2e in 2023 to 147 tCO2e in 2024 (see below for the details). The positive aspect of this is that the increase is partly due to our improved ability to track our travel and hotel stays. The more concerning side of this is that we are travelling more. This enables us to engage with our growing community. We are still thinking strategically about our travel and meetings, following the approach outlined in our 2022 blog post. However, we need to carefully consider air travel in 2026, as it is our largest source of emissions (93%).&lt;/p>
&lt;h2 id="total-travel-and-carbon-spending">Total travel and carbon spending&lt;/h2>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Year&lt;/th>
&lt;th>Amount&lt;/th>
&lt;th>Percentage of 2019&lt;/th>
&lt;th>Total carbon spent&lt;/th>
&lt;th>Total hotel nights covered&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>2019 actuals&lt;/td>
&lt;td>$585,482&lt;/td>
&lt;td>100%&lt;/td>
&lt;td>did not record&lt;/td>
&lt;td>did not record&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2020 actuals&lt;/td>
&lt;td>$91,700&lt;/td>
&lt;td>16%&lt;/td>
&lt;td>did not record&lt;/td>
&lt;td>did not record&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2021 actuals&lt;/td>
&lt;td>$19,066&lt;/td>
&lt;td>3%&lt;/td>
&lt;td>did not record&lt;/td>
&lt;td>did not record&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2022 actuals&lt;/td>
&lt;td>$74,416&lt;/td>
&lt;td>13%&lt;/td>
&lt;td>did not record&lt;/td>
&lt;td>did not record&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2023 actuals&lt;/td>
&lt;td>$305,737&lt;/td>
&lt;td>52%&lt;/td>
&lt;td>105 tCO2e&lt;/td>
&lt;td>did not record&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2024 actuals&lt;/td>
&lt;td>$327,939&lt;/td>
&lt;td>56%&lt;/td>
&lt;td>147 tCO2e&lt;/td>
&lt;td>415&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2025 budget&lt;/td>
&lt;td>$417,767 (reforecast)&lt;/td>
&lt;td>71%&lt;/td>
&lt;td>68 tCO2e (YTD)&lt;/td>
&lt;td>256 (YTD)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2026 budget&lt;/td>
&lt;td>$439,817&lt;/td>
&lt;td>75%&lt;/td>
&lt;td>TBD&lt;/td>
&lt;td>TBD&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>In 2024, we met the target of keeping our travel expenses below 60% of our 2019 level. In 2025, we will exceed this. There are a number of reasons for this. We have more staff, more members, inflation has been high, and we are subsidising a lot more travel for others, such as our ambassadors, speakers, and collaborators at local events, and some board members (since 2019, we reduced from three to one in-person board meeting per year). This aligns with our goals of inclusivity for Crossref meetings, but we have to recognise there is a trade-off. The cost of travel, particularly airfare, has increased since 2019. Using US Bureau of Labor Statistics data from 2019 to 2025 the inflation multiplier for a dollar is 1.26 so adjusted for inflation the comparison figure for 2025 spending is $737,000 and forecasted 2025 spending is 60% of this. While we use cost as a proxy for travel volume, now that we’re better at tracking actual carbon emissions, we can try to set targets of keeping under a certain carbon tonne equivalent total instead of (only) a financial target.&lt;/p>
&lt;h2 id="total-carbon-emissions-for-2024">Total Carbon Emissions for 2024&lt;/h2>
&lt;p>Our total reported carbon emissions increased 40% from 105 tCO2e in 2023 to 147 tCO2e in 2024. In 2023, we didn’t report on the estimated emissions from hotel stays, but for 2024, we have. We recorded 415 hotel nights in 2024 for 4 tCO2e using an average of Europe/US hotel per night emissions estimates &lt;a href="https://circularecology.com/news/the-carbon-emissions-of-staying-in-a-hotel" target="_blank">(Circular Ecology)&lt;/a>. The most carbon-intensive activity was flying. There were about 215 flights in 2024, accounting for emissions of 138 tCO2e - 93% of our total. Crossref staff and community members we covered took 88 train journeys with carbon emissions of .47 tCO2e - so the more travel by train, the better, but this isn’t always possible or feasible. We haven’t included estimates of the impact of home working (Crossref is fully distributed), but we have an initial estimate below and will look to improve this analysis for the 2025 analysis and going forward.&lt;/p>
&lt;h2 id="estimate-of-carbon-footprint-for-distributed-staff">Estimate of carbon footprint for distributed staff&lt;/h2>
&lt;p>Crossref is fully distributed with staff in 11 countries. We used Claude from Anthropic to calculate the emissions from home working for our staff in 2024 and asked for sources to be cited. It provided some approaches for how to go about the calculations but the results were not reliable - for our 46 staff in 10 countries (this is for 2024 - we now have 49 staff in 11 countries) estimates ranged from 5 tCO2e to 28 tCO2e depending on various assumptions such as whether to account for the grid intensity of the countries where staff are based &lt;a href="https://ourworldindata.org/grapher/carbon-intensity-electricity" target="_blank">(Our World in Data has grid intensity figures)&lt;/a> and what estimate is used for the amount of energy an employee working from home uses each day. &lt;a href="https://circularecology.com/news/the-carbon-emissions-of-homeworking-and-office-working" target="_blank">Circular Ecology&lt;/a> uses UK DEFRA figures to come up with 2.67 kgCO2e/day for home working. So a simple calculation of 46 staff working 230 days per year arrives at the 28 tCO2e amount. This is much less than the equivalent figure for office-based work, which is 70 tCO2e. A number of things aren’t factored into these calculations: staff with green energy tariffs, staff with solar panels and home batteries, or other renewable energy sources, and the different needs for heating and air conditioning in different countries.&lt;/p>
&lt;p>We decided not to include these figures in our overall emissions for 2024, but we are looking at a more reliable way to estimate this for 2025. However, we need to consider what we would do with the information and whether we would, or could, do anything to reduce this.&lt;/p>
&lt;h2 id="hosting-services">Hosting services&lt;/h2>
&lt;p>We use AWS for hosting our REST APIs, Metadata Search, and the website. In 2024, our main metadata registry was in a data centre in Massachusetts, which is not included in our calculations. In July 2025, we transitioned fully to AWS, so from 2025 onwards, our emissions from AWS will be higher and will encompass our entire system.&lt;/p>
&lt;p>In 2023, Amazon reports Crossref’s carbon emissions were 0.216 tCO2e compared with 0.266 tCO2e in 2022. In 2024, emissions were 0.132 tCO2e.&lt;/p>
&lt;p>Compared to travel, the footprint from AWS is minimal.&lt;/p>
&lt;h2 id="online-meetings">Online meetings&lt;/h2>
&lt;p>As a distributed, remote-first organisation, Crossref is a heavy Zoom user––it’s essential for staff and for engaging with our community. However, Zoom doesn’t provide tools or estimates of the carbon impact of Zoom meetings. We used a tool last year to provide an estimate, but we aren’t confident it’s accurate or meaningful. The tool was built in 2020 and made a lot of assumptions and guestimates.&lt;/p>
&lt;h2 id="tools-we-used">Tools we used&lt;/h2>
&lt;p>To calculate emissions for flights and train journeys, we chose to use &lt;a href="https://www.carbonfootprint.com/calculator.aspx" target="_blank">Carbon Calculator&lt;/a>. For hotel stays and home working estimates, we used &lt;a href="https://circularecology.com/news/the-carbon-emissions-of-staying-in-a-hotel" target="_blank">Circular Ecology&lt;/a>. For AWS, we used the &lt;a href="https://aws.amazon.com/aws-cost-management/aws-customer-carbon-footprint-tool/" target="_blank">Customer Carbon Footprint Tool (CCFT)&lt;/a> provided by AWS.&lt;/p>
&lt;h2 id="offsetting">Offsetting&lt;/h2>
&lt;p>We don’t offset our emissions from travel or other operations and don’t have plans to do this. Offsetting emissions is problematic in a number of different ways, so we don’t feel confident in doing it.&lt;/p>
&lt;h2 id="in-conclusion">In conclusion&lt;/h2>
&lt;p>In general, it feels good to have had a few years of tracking this, learning more, finding the right tools, and trying to stick to a target to limit our increases. While of course there are always reasons for the target to increase—as we grow and are able to subsidise others beyond our staff more—we remain committed to not just monitoring our carbon spend but also maintaining it at a reasonable level and finding ways to limit and mitigate our impact on the environment. This kind of sustainability isn’t included in the POSI Principles for open scholarly infrastructures, but we’d love to see other similar organisations share their tips and measurements so that, as a community, we can learn how to do even better.&lt;/p></description></item><item><title>Deprecating co-access: Crossref plans and timelines</title><link>https://www.crossref.org/blog/deprecating-co-access-crossref-plans-and-timelines/</link><pubDate>Thu, 11 Sep 2025 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/deprecating-co-access-crossref-plans-and-timelines/</guid><description>&lt;p>To date, there are about 100 Crossref members who have made use of our co-access service for one or more of their books. The service was designed to be a last-resort measure when multiple parties - book publishers, aggregators, and other members - had rights to register book content. Unfortunately, the service allowed members to register multiple DOIs for shared books and book chapters, thereby violating our own core tenet of one DOI per content item. We should not have created a service that violated that tenet, resulting in duplicate DOIs. As we are able to offer an alternative in the form of the &lt;a href="https://www.crossref.org/documentation/register-maintain-records/creating-and-managing-dois/multiple-resolution/">multiple resolution service&lt;/a>, it is time to switch co-access off. Among other benefits – for the publisher and the authors, creation of a single DOI for each item, regardless of where it might be hosted, will result in more accurate citation counts and usage statistics. We’re retiring co-access at the end of 2026.&lt;/p>
&lt;h3 id="an-idiom-to-start">An idiom to start&lt;/h3>
&lt;p>There&amp;rsquo;s an idiom used in technology circles called &amp;lsquo;&lt;a href="https://en.wikipedia.org/wiki/Eating_your_own_dog_food" target="_blank">eating your own dog food&lt;/a>.&amp;rsquo; It&amp;rsquo;s used to describe an organization that tests or uses its own products in the real world. I&amp;rsquo;m no developer and only have a handful of years of exposure to this phrase, but I&amp;rsquo;ve always wanted to work it into one of my blog posts. The visceral reactions I have observed when it&amp;rsquo;s been used on internal calls are just too tempting. That, and I think it applies to our own rollout of and missteps with a service we call co-access. The decision to enable co-access reflected the priorities of that period, but we can now improve on it with an upgraded multiple resolution service. That rickety footing for co-access doomed it from the start. Now&amp;rsquo;s the time to face the music and swallow our own kibble.&lt;/p>
&lt;p>Always meant as a last-resort measure, co-access allows multiple Crossref members to register metadata for shared book and book chapter content. Thus, use of co-access results in multiple, duplicate DOIs registered for the same book content. There are well over 500,000 DOIs in co-access within our corpus today. At least half of those are duplicates (more on this below).&lt;/p>
&lt;p>This is far from ideal and has adverse consequences for the integrity of the scholarly record and the community. As we are able to offer an alternative in the form of the &lt;a href="https://www.crossref.org/documentation/register-maintain-records/creating-and-managing-dois/multiple-resolution/">multiple resolution service&lt;/a>, it is time to switch co-access off.&lt;/p>
&lt;p>Among other benefits &amp;ndash; for the publisher and the authors, creation of a single DOI for each item, regardless of where it might be hosted, will result in more accurate citation counts and usage statistics.&lt;/p>
&lt;h3 id="duplicate-dois">Duplicate DOIs&lt;/h3>
&lt;p>We frequently receive questions from members, metadata users, and others in the community, like &lt;a href="https://community.crossref.org/t/duplicate-dois-keep-being-minted/3554" target="_blank">this one&lt;/a>, asking us what we are doing to combat the very real problem of registration and propagation of duplicate DOIs. We do &lt;a href="https://community.crossref.org/t/the-problem-with-duplicate-dois-and-how-you-can-help/2634" target="_blank">take measures&lt;/a> to prevent the registration of duplicate DOIs, including flagging registration of potential duplicate records to our members using what we call conflicts and &lt;a href="https://www.crossref.org/documentation/reports/conflict-report/">conflict reports&lt;/a>. As you might expect, this has been a sensitive topic for us, because we have one glaring service, yes, co-access, that has been actively exacerbating the issue of duplicate DOIs.&lt;/p>
&lt;p>So, while we have been actively trying to counter the rise of duplicate DOIs, co-access enabled duplicate registrations of book DOIs. For every prefix that we configured for the service, we knew we were contributing to the problem (&lt;a href="https://doi.org/10.21428/785a6451.fb6181fd" target="_blank">our members noticed too&lt;/a>. As I said above, co-access allows multiple members to register their own DOI for shared book content. That means that book content in co-access has at least two DOIs registered. In some cases, there is book content with five or more registered DOIs for a single book. That&amp;rsquo;s a great many duplicates that this service is responsible for.&lt;/p>
&lt;h3 id="replacing-co-access">Replacing co-access &lt;/h3>
&lt;p>We plan to replace co-access with an existing tool, multiple resolution, which allows for more than one resolution URL to be registered to a single DOI. A user resolving the DOI is presented with an &lt;a href="https://doi.org/10.54675/cklg5881" target="_blank">interim page&lt;/a>, allowing them to choose from the various content sources registered with this DOI. We&amp;rsquo;ve made some &lt;a href="https://community.crossref.org/t/upcoming-changes-to-multiple-resolution-and-co-access-templates/3908" target="_blank">progress&lt;/a> toward making multiple resolution simpler for members to implement, but we still have more to do.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/multiple-resolution-ui.png"
alt="Screenshot of multiple resolution UI" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>We&amp;rsquo;re aware that the technical steps involved in adopting multiple resolution might present a barrier to implementation for some of our members. To help with the transition, we are working on a basic tool (currently in beta) that simplifies the &lt;a href="https://www.crossref.org/documentation/register-maintain-records/creating-and-managing-dois/multiple-resolution/#00118">process&lt;/a>. We will make it available to members between now and the middle of 2026.&lt;/p>
&lt;h3 id="our-timeline">Our timeline&lt;/h3>
&lt;p>We are not going to make these changes tomorrow. We&amp;rsquo;re going to give members who have been using co-access time to adjust. Right now, we trigger co-access when a secondary DOI is registered by a secondary registrant (member) that: 1) is already in a co-access group within our system with the DOI prefix that registered the original DOI, 2) has at least one shared ISBN with the metadata of that original DOI, and 3) has a title (in the title element of the book or chapter XML) that exactly matches the title of the original DOI. We&amp;rsquo;re going to stop triggering co-access for book and book chapter registrations &lt;strong>starting 2026 July 1&lt;/strong>. No new DOIs will be placed in co-access starting then.&lt;/p>
&lt;p>From there, there will be six months to clean up records already in co-access. One definitive DOI should be selected by the parties in a co-access group; the DOIs that will no longer be maintained for those books and book chapters should be aliased to the primary (definitive) DOI that will be maintained going forward. The primary DOI should be the DOI used on all landing pages for that book (or, book chapter).&lt;/p>
&lt;p>In January 2027, if co-access DOIs have not been aliased to one another, we will force alias the DOIs in the record to the DOI registered by the organization identified as the publisher in the metadata records already in our system. At any point in this timeline, our team will be happy to help with the registration of secondary URLs in order to move books from co-access to multiple resolution. As a result, we will encourage members, end users, and the broader community to move back to using a single, definitive source of truth for these books and book chapters.&lt;/p>
&lt;h3 id="what-will-registration-of-books-and-book-chapters-look-like-post-co-access">What will registration of books and book chapters look like post-co-access?&lt;/h3>
&lt;p>Coordinated. We expect that our members and their publishing partners will define the single DOI for each book and book chapter well upstream of Crossref, so all entities and their systems will use that one definitive DOI.&lt;/p>
&lt;p>As for the registration process and our system, the first member to register the book (and its ISBNs) will establish the DOI for that book and its chapters. Following attempts to register the same content, with a duplicate book-level DOI(s), will fail the registration. Multiple DOIs for the same book or book chapter should be avoided starting &lt;strong>2026 July 01&lt;/strong>, as we will no longer be able to place books and book chapters into co-access.&lt;/p>
&lt;p>We believe this will result in increased cited-by and usage metrics for that single DOI, and a cleaner, more accurate scholarly record.&lt;/p>
&lt;p>We&amp;rsquo;d love to hear your reaction to this news in our &lt;a href="https://community.crossref.org/tag/blog" target="_blank">Community Forum&lt;/a>.&lt;/p></description></item><item><title>Celebrating one year of Crossref Grant IDs at NWO</title><link>https://www.crossref.org/blog/celebrating-one-year-of-crossref-grant-ids-at-nwo/</link><pubDate>Tue, 09 Sep 2025 00:00:00 +0000</pubDate><author>Hans de Jonge</author><guid>https://www.crossref.org/blog/celebrating-one-year-of-crossref-grant-ids-at-nwo/</guid><description>&lt;p>This month marks one year since the Dutch Research Council (NWO) introduced grant IDs—an important milestone in our journey toward more transparent and trackable research funding. We created over &lt;a href="https://api.crossref.org/members/47652/works?" target="_blank">1,600 Crossref Grant IDs&lt;/a> with associated metadata. We are beginning to see them appear in publications. These early examples show the enormous potential Grant IDs have. They also highlight that publishers could extend their efforts to improve the quality of funding metadata of publications.&lt;/p>
&lt;h2 id="the-promise-of-grant-linking">The promise of grant linking&lt;/h2>
&lt;p>For decades, funders have struggled with a seemingly simple challenge: tracking the research outputs that arise from their funding. The traditional approach—requiring grantees to cite their grants in acknowledgement sections of their papers—has all kinds of problems. Authors make many errors in providing this information, and even when funding organizations and schemes are cited correctly, there is no guarantee that a grant number is globally unique and not already in use by another funding council in the world.&lt;/p>
&lt;p>To address these issues, and in collaboration with the research funding community, Crossref introduced the &lt;a href="https://www.crossref.org/services/grant-linking-system/" target="_blank">Grant Linking System (GLS)&lt;/a> six years ago. The system allows funding organizations to assign globally unique and persistent identifiers to their grants, but - more importantly - the system allows connecting these grants with the outputs arising from them. The vision is straightforward: authors include Grant IDs (which are Crossref DOIs) in the funding acknowledgements of their research articles. Publishers either take these IDs from the acknowledgement or proactively ask authors for these IDs in their submission system. Next, when a publisher registers their publication with Crossref, it includes the grant identifier in the metadata of that publication, creating an unambiguous link between the publication and the grants from which the research was funded.&lt;/p>
&lt;p>This last step—including the Grant ID in the metadata of the article when registering the publication with Crossref—is a crucial part of the system as it enables anyone to automatically retrieve all publications arising from a given grant over time via the Crossref API. Funding organizations interested in tracking the impact of their funding could then stop asking their grantees to manually report on the outputs of their funding, as most still do today. Instead, this information would become open data that funding organizations harvest directly themselves, reducing administrative burden on researchers while enhancing the ability to track the impact of their funding.&lt;/p>
&lt;p>As &lt;a href="https://doi.org/10.64000/xqr28-ee750" target="_blank">Robert Kiley&lt;/a>, former head of Open Research at the Wellcome Trust, which piloted the GLS in 2018, put it: &amp;ldquo;&amp;hellip;if every funder were to adopt such a system and expose their grant metadata in a consistent, machine-readable way, it would facilitate the development of applications to help funders get a greatly enhanced picture of the global funding landscape, which in turn would inform strategic planning and resource allocation.&amp;rdquo;&lt;/p>
&lt;h2 id="nwos-implementation-journey">NWO&amp;rsquo;s implementation journey&lt;/h2>
&lt;p>NWO joined Crossref&amp;rsquo;s Grant Linking System in 2024. It reflects our broader commitment to open science and aligns with our &lt;a href="https://doi.org/10.5281/zenodo.4674512" target="_blank">Persistent Identifier Strategy&lt;/a> published in 2021, and our support for the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration on Open Research Information&lt;/a>. Since August 2024, all new grants awarded from July 2024 onward receive a Crossref Grant ID that persistently resolves to the information about the grant on our website, displaying all basic award information including project titles, summaries, grantee names, and affiliations. NWO is &lt;a href="https://api.crossref.org/types/grant/works?facet=publisher-name:*&amp;amp;rows=0" target="_blank">one of the 44 funding organizations&lt;/a> worldwide that have introduced Crossref Grant IDs for their collective &lt;a href="https://api.crossref.org/types/grant/works?facet=funder-name:*&amp;amp;rows=0" target="_blank">111 funding programs&lt;/a>. Other organizations include the European Commission, OSTI-DOE, the Wellcome Trust, Moore Foundation, Fonds de Recherche du Québec, CSIRO, Japan Science and Technology Agency, and the &lt;a href="https://www.fwf.ac.at/en/news/detail/neue-identifikations-nummer-fuer-fwf-projekte" target="_blank">Austrian Science Fund&lt;/a>.&lt;/p>
&lt;p>Although it took time, implementation at NWO in general proceeded smoothly. Over the course of a year, we&amp;rsquo;ve registered &lt;a href="https://api.crossref.org/members/47652/works?" target="_blank">over 1,500 grant records&lt;/a> without experiencing difficulties or complaints from researchers. On the contrary, after we announced the introduction of Grant IDs, some researchers expressed disappointment on our decision—for practical reasons—to only register DOIs for new grants instead of the entire historical record. This shows that researchers understand the importance of persistent identifiers. Already, a year after its introduction, we are seeing the first NWO Grant IDs appearing in publications— showing that researchers are taking the extra step to look up their Crossref Grant ID and include it in their articles, &lt;a href="https://www.nwo.nl/en/acknowledgement-in-publications" target="_blank">as we are asking them to do&lt;/a>.
However, publishers don’t always manage to handle these identifiers in the way we expect them to.&lt;/p>
&lt;h2 id="linking-grants-to-publications-in-real-life">Linking grants to publications in real life&lt;/h2>
&lt;p>One of the first publications to include an NWO Grant ID is a paper by &lt;a href="https://doi.org/10.1103/PhysRevX.15.021080" target="_blank">Weile et al.&lt;/a>, published by the American Physical Society (APS) in the journal Physical Review X. On the left, we see the funding information provided by the authors, as included in the acknowledgement section of the published article. Funding by NWO from its Talent Scheme VIDI is identified with a Grant ID &lt;a href="https://doi.org/10.61686/YDRHT18202" target="_blank">https://doi.org/10.61686/YDRHT18202&lt;/a>.&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/jk.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/18202.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;/div>
&lt;/div>
&lt;p>On the right, we see how APS has included this information &lt;a href="https://api.crossref.org/works/https://www.doi.org/10.1103/PhysRevX.15.021080" target="_blank">in the metadata&lt;/a> of the publication: NWO is identified with its Funder ID and the grant with the Grant ID - forging an unambiguous link between funding and publication, initially between this particular grant and this particular publication, but potentially in the future between this grant and all other outputs arising from it. This works so long as all publishers include this information in the metadata of their publications; we need to encourage more publishers and other Crossref members (e.g., preprint services, repositories, blog platforms) to follow the APS example and do the same.&lt;/p>
&lt;h2 id="where-publishers-fall-short">Where publishers fall short&lt;/h2>
&lt;p>There are big differences among publishers in their ability to include funding metadata. Many have been including funder IDs in the metadata for more than a decade, but some are still struggling to do that. Most are yet to catch up to start including Crossref Grant IDs, too.&lt;/p>
&lt;p>Let’s demonstrate that in an example. On the left, we see the acknowledgements section of a paper by &lt;a href="https://www.doi.org/10.1002/smll.202502496" target="_blank">Van Zundert et al&lt;/a>. in the journal Small, published by Wiley. The authors acknowledge a host of funding organizations and grants, including NWO with Grant ID &lt;a href="https://doi.org/10.61686/LVZRW92421" target="_blank">https://doi.org/10.61686/LVZRW92421&lt;/a>. On the right, we see that the publisher has correctly included NWO &lt;a href="https://api.crossref.org/works/https://doi.org/10.1002/smll.202502496" target="_blank">in the metadata&lt;/a> as the funder with our Funder ID, but there’s no reference to our Grant ID, instead mentioning an award number, which seems to refer to a Marie Sklodowska-Curie grant for the same research with their internal award identifier.&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/ivz.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/api-code-blog.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;/div>
&lt;/div>
&lt;p>Likewise, a publication by &lt;a href="https://www.doi.org/10.1016/j.plrev.2025.07.003" target="_blank">Criscuolo et al&lt;/a> in Physics of Life Reviews (a journal published by Elsevier) correctly identified NWO using our Funder ID, but omitted our Grant ID &lt;a href="https://api.crossref.org/works/https://www.doi.org/10.1016/j.plrev.2025.07.003" target="_blank">in the metadata&lt;/a>, despite its clear inclusion by the author in the acknowledgements (left). Apparently, this persistent link and open metadata is being thrown out of the infrastructure at a crucial time, when the article record could be connecting up with the grant record and making it easy and open for us all to track and report on the connection.&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/acknowledgement-blog-2.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/api-funder.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;/div>
&lt;/div>
&lt;p>Several publishers do not seem to register funding data at all, despite the opportunity existing for almost 15 years, and sometimes even when comprehensive funding information is provided by authors.&lt;/p>
&lt;h2 id="the-broader-implications">The broader implications&lt;/h2>
&lt;p>It has &lt;a href="https://doi.org/10.1162/qss_a_00210" target="_blank">been known for some time&lt;/a> that publishers struggle with registering complete, high-quality funding metadata for their publications. They sometimes &lt;a href="https://doi.org/10.64000/3f63f-yt393" target="_blank">blame authors&lt;/a> for not providing the required information or making errors in reporting their funding. Or they call on funders to identify their funding more precisely by introducing persistent Grant IDs for their grants. While these are legitimate issues, and it’s true that more funders could also do this, the examples presented here suggest this narrative is incomplete—when authors provide clear, standardized funding information using persistent identifiers, many publishers still fail to capture it accurately.&lt;/p>
&lt;p>The Grant Linking System is still relatively new in terms of open infrastructure and open metadata development, and adoption from funders is still in the tens rather than the tens of thousands, with publishers being more accustomed to creating and providing millions of open metadata records for their publications. Most participating funders, like us, have only started registering grants in the past couple of years. Now that Crossref Grant IDs are becoming more widespread, and with publishers’ experience in creating open metadata, we would love to see publishers prioritise collecting and including Grant IDs in their Crossref metadata. By updating their production practices, they would be supporting the community at large in reaping the benefits of open grant metadata.&lt;/p>
&lt;p>To address these challenges, we are organizing &lt;a href="https://barcelona-declaration.org/working_groups/" target="_blank">a roundtable session under the Barcelona Declaration&lt;/a> in October to discuss concrete solutions for these issues. We invite publishers who are interested in participating &lt;a href="mailto:contact@barcelona-declaration.org">to contact us&lt;/a>. This follows a &lt;a href="https://doi.org/10.64000/3f63f-yt393" target="_blank">2023 workshop&lt;/a> where many publishers were very open in discussing the challenges and working towards improving the process together with funders.&lt;/p>
&lt;h2 id="looking-ahead">Looking ahead&lt;/h2>
&lt;p>The introduction of Crossref Grant IDs represents just the first step in a longer journey toward more open research information for NWO. We are happy to see how quickly researchers are adopting the system by including Crossref Grant IDs in their work. For Grant IDs to truly become a Grant Linking System and fulfil its promise, however, publishers must act on the need to collect and process funding information in their publishing workflows, just as they do for other joint efforts, such as for ORCID iDs for contributors. The information is there—authors are providing it in the acknowledgement sections of their articles (and probably would too if asked directly in a submission form). The question now is: can we encourage more publishers to take up the request to capture and transmit this information accurately and register it with Crossref?&lt;/p>
&lt;p>We’re hopeful. This first year has demonstrated the enormous potential of Crossref Grant IDs in action for NWO. We call on publishers to do their bit in ensuring this vital infrastructure reaches its full potential for the research community.&lt;/p></description></item><item><title>An eLife filled with possibility thanks to great metadata</title><link>https://www.crossref.org/blog/an-elife-filled-with-possibility-thanks-to-great-metadata/</link><pubDate>Thu, 28 Aug 2025 00:00:00 +0000</pubDate><author>Frederick Atherden</author><guid>https://www.crossref.org/blog/an-elife-filled-with-possibility-thanks-to-great-metadata/</guid><description>&lt;p>&lt;a href="https://elifesciences.org/" target="_blank">eLife&lt;/a> recently &lt;a href="https://doi.org/10.64000/xh94q-w7335" target="_blank">won a Crossref Metadata Award&lt;/a> for the completeness of its metadata, showing itself as the clear leader among our medium-sized members. In this post, the eLife team answers our questions about how and why they produce such high-quality open metadata. For eLife, the work of creating and sharing excellent metadata aligns with their mission to foster open science and supports their preprint-centred publication model, but it also lays the groundwork for all kinds of exciting potential uses.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Having complete and rich metadata puts you in the best position to fulfil future, as-yet-undetermined requirements.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Fred Atherden, eLife&lt;/cite>&lt;/p>
&lt;/div>
&lt;h3 id="what-motivates-your-organisationteam-to-work-towards-high-quality-metadata-what-objectives-does-it-support-for-your-organisation">What motivates your organisation/team to work towards high-quality metadata? What objectives does it support for your organisation?&lt;/h3>
&lt;p>eLife is a mission-driven organisation tasked by its founders to help scientists accelerate discovery and encourage responsible behaviours in science. As such, we’re passionate about open science and metadata, and we&amp;rsquo;re vocal advocates of the benefits these provide to academic communities and beyond.&lt;/p>
&lt;p>Given Crossref’s position as a hub at the centre of scholarly communication, providing Crossref with complete metadata furthers our mission. It facilitates the discovery and reuse of research and enables linkage to key but often overlooked outputs such as datasets and software. As signatories of &lt;a href="https://sfdora.org/" target="_blank">DORA&lt;/a> and supporters of the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration&lt;/a>, we are keenly aware of the wider context - that these efforts enable research assessment and policy decisions to be derived from open and transparent information, moving beyond closed systems that have proliferated the damaging use of anachronistic metrics.&lt;/p>
&lt;h3 id="do-you-have-a-strategy-for-complete-metadata-which-elements-did-you-prioritise-what-workflows-tools-or-collaborations-helped-you-get-there">Do you have a strategy for complete metadata? Which elements did you prioritise? What workflows, tools, or collaborations helped you get there?&lt;/h3>
&lt;p>There are plenty of existing guidelines that provide a great skeleton to follow. For example, we follow &lt;a href="https://www.go-fair.org/fair-principles/" target="_blank">FAIR data&lt;/a> and &lt;a href="https://doi.org/10.7717/peerj-cs.86" target="_blank">FORCE11 software citation principles&lt;/a>, which ensure the capture of metadata for supporting datasets and software packages. There’s not any one particular element that we’ve prioritised, although we’re keen to ensure we follow best practices while also exploring the bleeding edge.&lt;/p>
&lt;p>We’ve collaborated with and relied on the advice of many organisations over the years, including (but not limited to) Crossref, Research Organization Registry &lt;a href="https://ror.org" target="_blank">(ROR)&lt;/a>, &lt;a href="https://jats4r.niso.org/" target="_blank">JATS4R&lt;/a>, &lt;a href="https://force11.org" target="_blank">FORCE11&lt;/a>, &lt;a href="https://www.softwareheritage.org/" target="_blank">Software Heritage&lt;/a>, &lt;a href="https://openrxiv.org/" target="_blank">openRxiv&lt;/a>, and our production vendors &lt;a href="https://www.kriyadocs.com/exeterpremedia" target="_blank">Exeter Premedia&lt;/a>.&lt;/p>
&lt;p>We’ve developed our own &lt;a href="https://github.com/elifesciences/elife-crossref-xml-generation" target="_blank">open source Crossref metadata generation library&lt;/a>. Keeping this process in-house has proven really fruitful. It allows us to quickly and continuously improve upon the metadata we provide.&lt;/p>
&lt;p>And we have a data team that has created a centralised data hub, serving as a really useful authoritative resource that can be queried, instead of always making use of disparate systems.&lt;/p>
&lt;h3 id="how-have-you-integrated-these-into-your-metadata-processes">How have you integrated these into your metadata processes?&lt;/h3>
&lt;p>At submission, we collect ROR IDs for (a subset of) affiliations, and structured data for funding, datasets, and other information. Our publication model is centred around preprints, so it’s necessary to capture related information such as the preprint DOI, preprint posted date, the version that pertains to each specific revision (and so on). Without this information, we could not post public reviews to the correct preprint version on the preprint server, or indeed ensure the article we publish is the correct iteration of that work.&lt;/p>
&lt;p>The systems that enable the publication of eLife Reviewed preprints are dependent on &lt;a href="https://docmaps.knowledgefutures.org/" target="_blank">DocMaps&lt;/a>, a framework for a machine-readable representation of the processes involved in the creation of a document. These are provided by our Data Hub and enable us to capture structured information about the peer review process and accompanying metadata for each article.&lt;/p>
&lt;p>Our proofing system for journal articles only permits login via &lt;a href="https://orcid.org" target="_blank">ORCID authentication&lt;/a>, and we don’t capture unauthenticated ORCID IDs that have been copied or keyed (see &lt;a href="https://info.orcid.org/whats-so-special-about-signing-in/" target="_blank">‘What’s So Special About Signing In?’)&lt;/a>. It also makes use of both the &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/">Crossref API&lt;/a> and the &lt;a href="https://pmc.ncbi.nlm.nih.gov/tools/developers/" target="_blank">PubMed Central API&lt;/a> to ensure we have persistent identifiers where possible for references. We have an in-house content validator, which uses &lt;a href="https://ror.readme.io/docs/rest-api" target="_blank">ROR’s API&lt;/a> to ensure we have ROR IDs for affiliations and funders where possible. We use Software Heritage to archive author-generated code, and include their persistent ID &lt;a href="https://www.softwareheritage.org/software-hash-identifier-swhid/" target="_blank">(SWHID)&lt;/a> in software references.&lt;/p>
&lt;p>All our published content is captured as &lt;a href="https://jats.nlm.nih.gov/index.html" target="_blank">JATS XML&lt;/a> (the industry standard format for journal articles), which our metadata generation library uses as its input.&lt;/p>
&lt;h3 id="what-impact-of-good-metadata-can-you-see-for-your-organisation-is-it-supporting-the-business-andor-editorial-side-of-your-work">What impact of good metadata can you see for your organisation? Is it supporting the business and/or editorial side of your work?&lt;/h3>
&lt;p>Persistent identifiers are very useful for reporting. Creating a report that, for example, includes publication volumes from a particular institution is trivial when content is enriched with persistent identifiers. It’s more complex when all you have are messy author-supplied strings of text. They’re also useful for content validation. For example, when we have a persistent ID and a method to retrieve the related metadata, we can confirm that the information we’ve been provided is complete and correct.&lt;/p>
&lt;p>There are, of course, many other benefits, some of which are &amp;ldquo;unknown unknowns.&amp;rdquo; Having complete and rich metadata puts you in the best position to fulfil future, as-yet-undetermined requirements.&lt;/p>
&lt;h3 id="have-you-encountered-any-challenges-in-curating-or-improving-your-metadata-if-so-what-were-they-and-how-did-you-address-those">Have you encountered any challenges in curating or improving your metadata? If so, what were they, and how did you address those?&lt;/h3>
&lt;p>In 2024, we started introducing &lt;a href="https://www.crossref.org/documentation/research-nexus/grants/">persistent grant IDs&lt;/a> for our content. While we updated our submission system to collect these from authors, it’s apparent that many authors aren’t aware when/if these have been registered by funders, and they still provide us with the (internal) grant numbers instead.&lt;/p>
&lt;p>Our workaround was to pull grant data from Crossref and then replace the grant numbers with the persistent IDs when we’re confident of a match. Since the grant number registered at Crossref might not exactly match the grant number the authors have given us, potential matches are confirmed by a team member or our production vendors. Since many organisations do a great job of creating informative landing pages (for example, &lt;a href="https://europepmc.org/" target="_blank">EuropePMC&lt;/a> for Wellcome funding), this is feasible, but we’re investigating ways we can make this less manual while remaining careful that we don’t introduce false positives.&lt;/p>
&lt;h2 id="have-your-efforts-around-metadata-led-to-real-benefits-for-your-community-is-this-something-your-editors-authors-or-readers-are-aware-of-and-appreciate-if-so-why">Have your efforts around metadata led to real benefits for your community? Is this something your editors, authors, or readers are aware of and appreciate? If so, why?&lt;/h2>
&lt;p>Yes, I think this is something that is becoming increasingly visible. Authors are very mindful of the benefits that good metadata can bring for discoverability and promotion. And much is lost without the increased interoperability it brings, both for publishers themselves but also the wider ecosystem. For example, we’ve had some great feedback from numerous organisations that appreciate that the outputs we publish directly link to the preprints they are based on.&lt;/p>
&lt;p>In recent years, there’s been an increased focus on research integrity, and this is likely to remain the case. Metadata has an obvious and key role in providing trust and transparency, whether that’s through the presence of trust markers like ORCID IDs or through the inclusion of complete post-publication metadata such as correction, retraction, or withdrawal information.&lt;/p>
&lt;h3 id="looking-ahead-how-are-you-planning-to-build-on-your-metadata-quality-are-there-new-elements-or-practices-youre-exploring-and-what-advice-would-you-give-to-others-just-starting-to-strengthen-their-metadata">Looking ahead, how are you planning to build on your metadata quality? Are there new elements or practices you’re exploring? And what advice would you give to others just starting to strengthen their metadata?&lt;/h3>
&lt;p>Several years ago, &lt;a href="https://elifesciences.org/about/peer-review" target="_blank">we introduced a &amp;ldquo;publish, review, curate&amp;rdquo; model of publishing&lt;/a>, where we publish ‘Reviewed preprints’ following each stage of review. We don’t collect the same level of structured information from authors at submission for these as we do for Versions of Record. This presents a challenge for retrieving and disseminating complete metadata for Reviewed preprints. We aim to start moving this forward so that comprehensive metadata is available at earlier stages of the publication process. For example, we recently started depositing (some) funding metadata for these.&lt;/p>
&lt;p>We’re also keen to explore the ways in which we can make our &lt;a href="https://elifesciences.org/about/elife-assessments" target="_blank">eLife Assessments&lt;/a> more discoverable. Our Editors use a common vocabulary to describe the significance of the findings and strength of evidence in a paper. Other publishers moving beyond accept/reject publication models use different rubrics and taxonomies, so having one restrictive field in a schema for the entire corpus of research won’t cut it. But nevertheless making these terms more discoverable and interoperable would be preferential.&lt;/p>
&lt;p>We’ve found that the integration of public APIs/data within systems (such as ROR’s, Crossref’s, PubMed’s, and OpenAlex’s) to be really helpful in validating the correctness and completeness of content/metadata. The effort in adding these integrations will pay dividends in the future.&lt;/p>
&lt;p>Time to enjoy Fred’s acceptance video.&lt;/p>
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.2500%;
padding-bottom: 0; box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); margin-top: 1.6em; margin-bottom: 0.9em; overflow: hidden;
border-radius: 8px; will-change: transform;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0;margin: 0;"
src="https://www.canva.com/design/DAGwnlQ6L28/02yxOhbLOdze9aVKwMwf5w/watch?embed" allowfullscreen="allowfullscreen" allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;a href="https:&amp;#x2F;&amp;#x2F;www.canva.com&amp;#x2F;design&amp;#x2F;DAGwnlQ6L28&amp;#x2F;02yxOhbLOdze9aVKwMwf5w&amp;#x2F;watch?utm_content=DAGwnlQ6L28&amp;amp;utm_campaign=designshare&amp;amp;utm_medium=embeds&amp;amp;utm_source=link" target="_blank" rel="noopener">Metadata Awards video - eLife&lt;/a></description></item><item><title>Mejorando la visibilidad a través de los metadatos: una mirada desde Editorial CSIC</title><link>https://www.crossref.org/blog/mejorando-la-visibilidad-a-trav%C3%A9s-de-los-metadatos-una-mirada-desde-editorial-csic/</link><pubDate>Thu, 14 Aug 2025 00:00:00 +0000</pubDate><author>Nacho Pérez Alcalde</author><guid>https://www.crossref.org/blog/mejorando-la-visibilidad-a-trav%C3%A9s-de-los-metadatos-una-mirada-desde-editorial-csic/</guid><description>&lt;p>&lt;a href="#version-in-english">&lt;em>Click here for the version in English&lt;/em>&lt;/a>&lt;/p>
&lt;p>Hablamos con Nacho Pérez Alcalde, Vicedirector Técnico de Editorial CSIC, la editorial al mando de ´Boletín Geológico y Minero’, ganadora del Crossref Metadata Award en la categoría de Metadata Enrichment. Miembro de Crossref desde 2008, Editorial CSIC publica 41 revistas en acceso abierto Diamante, y juega un papel esencial en la diseminación del conocimiento científico a nivel internacional. Exploramos lo que este premio ha significado para Editorial CSIC y qué planes para el futuro tienen para seguir mejorando la calidad y uso de sus metadatos.&lt;/p>
&lt;h3 id="el-boletín-geológico-y-minero-ha-recibido-el-primer-premio-de-crossref-al-enriquecimiento-de-vuestros-metadatos-ya-que-en-tan-solo-dos-años-ha-visto-la-cobertura-de-los-metadatos-pasar-del-1-al-40-cuáles-han-sido-las-motivaciones-que-han-llevado-a-esta-revista-a-ver-una-mejora-tan-grande-en-sus-metadatos">El ‘Boletín Geológico y Minero’ ha recibido el primer premio de Crossref al enriquecimiento de vuestros metadatos ya que en tan solo dos años, ha visto la cobertura de los metadatos pasar del 1 al 40%. ¿Cuáles han sido las motivaciones que han llevado a esta revista a ver una mejora tan grande en sus metadatos?&lt;/h3>
&lt;p>Editorial CSIC publica 41 revistas científicas, todas ellas presentes en los principales indexadores. Son revistas de prestigio que ofrecen, desde hace muchos años, contenidos revisados de alta calidad. Sin embargo, hoy en día, no es ya suficiente para una revista científica ofrecer contenidos de calidad, hoy en día es necesario ofrecer también una alta calidad en los metadatos generados por esas publicaciones. Algo que hace no muchos años veíamos como un servicio de valor añadido se ha convertido en algo imprescindible.&lt;/p>
&lt;p>En un entorno de trabajo electrónico y en Internet, los metadatos son claves para la difusión de los contenidos, la identificación de revistas, autores/as, instituciones editoras, entidades financiadoras… Para un editor es fundamental poder transmitir esa información según unos procedimientos técnicos y unos protocolos estandarizados para garantizar su compatibilidad con las máquinas que cosechan, almacenan y distribuyen datos favoreciendo la visibilidad y la descubribilidad de nuestras revistas.&lt;/p>
&lt;h3 id="seguis-alguna-estrategia-cómo-decidís-qué-elementos-priorizar">¿Seguis alguna estrategia? ¿Cómo decidís qué elementos priorizar?&lt;/h3>
&lt;p>Llevamos años trabajando con metadatos y, de forma periódica, vamos revisando y ampliando el número de elementos que convertimos en metadatos. Damos prioridad siempre a lo que es ya un estándar claramente identificado (por ejemplo el ORCID) y también a aquellos metadatos alineados con las políticas editoriales que consideramos prioritarias (por ejemplo la licencia CC by que aplicamos).&lt;/p>
&lt;p>El flujo de trabajo requiere como primer paso la identificación, por parte del editor, de los datos que se quieren obtener y de cómo se van a pedir. Una vez se integran todos ellos en la política de envío de originales a la revista, es imprescindible la colaboración de los autores que son los que aportan los datos que, en una fase posterior son revisados por un editor técnico especializado en metadatos (diferente al revisor de texto). Por último, es imprescindible contar con una herramienta que permita automatizar la transferencia de metadatos y aquí es muy importante contar con personal técnico especializado. Nosotros trabajamos con la plataforma OJS, yo he pasado años depositando metadatos en Crossref con los archivos XML que generábamos, uno a uno. Con 1.000 artículos publicados de media al año, la creación del Módulo de exportación CrossRef XML de OJS para el depósito automatizado desde la plataforma fue de gran ayuda para nosotros porque aligera bastante el trabajo, asegura una mayor fiabilidad y nos permite dedicar nuestro tiempo a mejorar otras cosas.&lt;/p>
&lt;p>También nos da una mayor flexibilidad a la hora de revisar nuestras políticas de datos, por ejemplo, nos ha permitido abordar un depósito masivo para actualizar todas nuestras referencias para corregir errores recurrentes.&lt;/p>
&lt;h3 id="cómo-habéis-integrado-esto-en-vuestra-estrategia-de-metadatos">¿Cómo habéis integrado esto en vuestra estrategia de metadatos?&lt;/h3>
&lt;p>El Crossref Metadata Enrichment Award ha sido concedido en concreto a la revista Boletín Geológico y Minero por haber experimentado una gran mejora en sus metadatos en los últimos años. Esta revista era editada por otra institución y cuando Editorial CSIC se hizo cargo de ella le aplicamos los mismos estándares que venimos utilizando en el resto de nuestras revistas desde hace años. Nos sentimos por ello especialmente orgullosos, porque entendemos este premio como el aval a una política de metadatos que llevamos años desarrollando y que ha permitido una mejora importante para esta revista en un tiempo relativamente corto.&lt;/p>
&lt;p>&lt;strong>Para ello fue clave la colaboración de la dirección científica de la revista&lt;/strong>, nosotros explicamos primero qué datos deben ser solicitados a los autores, por qué y para qué, y luego nos ocupamos de confirmar que se han ido integrando en los artículos y de implementarlos en la plataforma OJS para proceder después a su depósito en Crossref pero también a su integración en otras vías de difusión de metadatos.&lt;/p>
&lt;h3 id="a-nivel-de-impacto-cómo-veis-que-una-buena-cobertura-de-los-metadatos-afecta-a-vuestra-organización-beneficia-de-alguna-manera-vuestro-trabajo-editorial-o-cualquier-otro-aspecto-de-vuestra-actividad">A nivel de impacto, ¿cómo veis que una buena cobertura de los metadatos afecta a vuestra organización? ¿Beneficia de alguna manera vuestro trabajo editorial? O cualquier otro aspecto de vuestra actividad?&lt;/h3>
&lt;p>Más allá de sus beneficios obvios como potenciar la visibilidad de nuestras publicaciones y contribuir a manejar una información controlada y de calidad, en última instancia deberían ayudarnos a posicionarnos como grupo profesional. Nuestra función esencial es publicar contenido científico revisado y de calidad y transmitirlo a la comunidad científica y, cada vez más, a toda la sociedad. Sin embargo, hoy en día, deberíamos aspirar a ser identificados también como proveedores de datos. Y eso, en “la era del dato”, es mucho decir. Debemos ser capaces de extraer los metadatos de nuestras publicaciones aportados por los autores (palabras claves, filiación, bibliografías&amp;hellip;) pero también debemos ser capaces de generar nosotros otros metadatos y de transmitirlos y difundirlos.&lt;/p>
&lt;p>Las revistas científicas deben seguir contando con un editor que haga una revisión ortotipografía y de pruebas, pero también deben contar con un editor de metadatos, alguien que sepa qué es FundRef y sepa dónde y cómo hay que introducir los datos en la plataforma para garantizar que se conservan y transfieren de manera correcta y eficiente.&lt;/p>
&lt;p>Por ello, quiero aprovechar esta ocasión para &lt;strong>reivindicar el papel del editor como generador y proveedor de datos&lt;/strong>. Los editores somos la fuente de datos, hay agentes como las bibliotecas e indexadores que los cosechan, archivan, transmiten y procesan para, por ejemplo, generar nuevos contenidos o servicios, pero solo nosotros tenemos la capacidad de generarlos.&lt;/p>
&lt;p>¿Habéis encontrado dificultades a la hora de mejorar y manejar vuestros metadatos?
En ocasiones los autores se quejan de que se les piden muchos datos, por ejemplo, el uso de ORCID es obligatorio en nuestras publicaciones y muchos autores, sobre todo de ámbitos no europeos, se han quejado porque no saben qué es y para qué sirve o, por motivos personales, no quieren registrar ese identificador personal. Son motivos respetables, por supuesto, pero para nosotros prima la necesidad de identificar correctamente a cada autor y creemos que el ORCID ayuda a ello.&lt;/p>
&lt;p>Otro problema habitual es que muchos autores, al citar una fuente de financiación, utilizan el nombre de la entidad financiadora pero a veces no lo ponen completo, o no incluyen el acrónimo o lo que es peor, ponen el nombre pero no el código de la institución o del proyecto. Los autores están acostumbrados a escribir pensando en los lectores “humanos” y no en las máquinas que van a procesar después toda esa información. Nuestro papel, como editores de metadatos, pasa por informarles, de forma didáctica, de la importancia de aportar esos códigos y pedírselos si vemos que no los han incluido en su manuscrito.&lt;/p>
&lt;h3 id="y-con-respecto-a-vuestra-comunidad-se-ha-visto-beneficiada-de-vuestro-esfuerzo-para-tener-unos-metadatos-completos-y-de-alta-calidad-están-los-autores-editores-o-lectores-al-tanto-de-estos-esfuerzos-o-lo-valoran">Y con respecto a vuestra comunidad, ¿se ha visto beneficiada de vuestro esfuerzo para tener unos metadatos completos y de alta calidad? ¿Están los autores, editores o lectores al tanto de estos esfuerzos o lo valoran?&lt;/h3>
&lt;p>Para el editor técnico es más sencillo valorarlo, nosotros sabemos cómo funciona el entorno, lo importante que es la interoperabilidad de las plataformas, la rapidez y amplitud de transmisión que puede alcanzar un dato y lo importante que es que esté correcto desde su origen porque luego puede ser muy, muy difícil corregirlo y controlarlo. Somos conscientes también de su posible impacto porque sabemos cómo los sistemas de información se alimentan unos de otros y comparten información, una información que generamos nosotros.&lt;/p>
&lt;p>Los editores científicos, autores y lectores suelen valorarlo menos y no siempre son conscientes de su relevancia, aunque no se puede generalizar. Y de hecho, aunque creo que todos deberían tener al menos unas nociones básicas de cómo funciona, creo que los autores ya están bastante saturados con todos los requerimientos que les pedimos para entregar sus manuscritos como para que les pidamos, además, formación específica en metadatos. Para eso (entre otras cosas) estamos los editores, para indicarles qué datos y cómo los deben aportar.&lt;/p>
&lt;p>No obstante, hoy en día todo el mundo está familiarizado con lo que son y lo que se puede hacer con los datos, todos consumimos productos muy diversos a través de internet y tenemos al menos nociones de lo que son los metadatos, los datos personales, los algoritmos… Hace años era mucho más complejo hacer didáctica de esto, pero hoy en día cualquiera lo entiende fácilmente y más en un ámbito científico y tecnológico como el de nuestras publicaciones.&lt;/p>
&lt;h3 id="con-la-vista-puesta-en-el-futuro-tenéis-algún-plan-para-seguir-construyendo-sobre-lo-ya-creado-algún-elemento-que-queráis-seguir-implementando-o-prácticas-que-queráis-incorporar-en-vuestra-manera-de-trabajar">Con la vista puesta en el futuro, tenéis algún plan para seguir construyendo sobre lo ya creado? ¿Algún elemento que queráis seguir implementando o prácticas que queráis incorporar en vuestra manera de trabajar?&lt;/h3>
&lt;p>En editorial CSIC, desde que comenzamos a publicar en formato electrónico y a distribuir nuestras revistas electrónicas en línea, hace ya casi 20 años, siempre estamos tratando de innovar en diseños, plataformas de gestión, formatos de archivo… Hablando de cosas concretas, hemos ampliado el uso obligatorio de ORCID y DOI a las contribuciones que no son puramente artículos científicos (hasta ahora nuestras reseñas, obituarios y textos similares no los tenían) y estamos valorando la implementación de identificadores ROR para organizaciones de investigación.&lt;/p>
&lt;h3 id="qué-consejos-darías-a-aquellas-organizaciones-que-están-comenzando-a-mejorar-la-calidad-de-sus-metadatos">¿Qué consejos darías a aquellas organizaciones que están comenzando a mejorar la calidad de sus metadatos?&lt;/h3>
&lt;p>Para aquellos editores que están empezando a reforzar sus metadatos me atrevería a indicar algo aparentemente lógico y sencillo pero que creo que no siempre se hace: que planifiquen con calma y en detalle una política editorial de datos basada en identificar y seleccionar los datos que consideren prioritarios e implementar, después, protocolos para solicitarlos a sus autores e integrarlos en las plataformas editoriales y, por último, configurar correctamente dichas plataformas para asegurar una correcta exportación.&lt;/p>
&lt;p>El metadato requiere de una cadena en la que trabajan diversas personas con distintos perfiles, hay que tener recursos para afianzar esa cadena y hay que tener en cuenta que no basta con pedir los datos a los autores, hay que seguir el recorrido de los datos desde su origen hasta donde podamos y eso no termina cuando los depositamos en Crossref: podemos depositarlos de manera adicional en otros sitios, podemos darles otras salidas y, además, debemos volver sobre ellos si detectamos algún error sistemático que podamos corregir.&lt;/p>
&lt;p>Los Metadata Excellence Awards fueron entregados en mayo de 2025, en el contexto del encuentro anual de Crossref con su comunidad. Os dejamos el vídeo de aceptación del premio por parte de la revista Boletín Geológico y Minero, editada por Editorial CSIC.&lt;/p>
&lt;p>Y ahora disfruta de este vídeo de aceptación.&lt;/p>
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.2500%;
padding-bottom: 0; box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); margin-top: 1.6em; margin-bottom: 0.9em; overflow: hidden;
border-radius: 8px; will-change: transform;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0;margin: 0;"
src="https://www.canva.com/design/DAGssqadV1c/w5SHN3-tL2wZz6Y10WVHOQ/watch?embed" allowfullscreen="allowfullscreen" allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;a href="https:&amp;#x2F;&amp;#x2F;www.canva.com&amp;#x2F;design&amp;#x2F;DAGsxAyXmXs&amp;#x2F;rOVOK6z99_UlaclRJHPekw&amp;#x2F;watch?utm_content=DAGsxAyXmXs&amp;amp;utm_campaign=designshare&amp;amp;utm_medium=embeds&amp;amp;utm_source=link" target="_blank" rel="noopener">&lt;/a>
&lt;h3 id="version-in-english">Version in English&lt;/h3>
&lt;h2 id="improving-visibility-through-metadata-a-look-from-csic-editorial">Improving visibility through metadata: a look from CSIC Editorial&lt;/h2>
&lt;p>We spoke with Nacho Pérez Alcalde, Technical Deputy Director of Editorial CSIC, the publisher behind ‘Boletín Geológico y Minero’, recipient of the Crossref Metadata Award in the Metadata Enrichment category. A Crossref member since 2008, Editorial CSIC publishes 41 Diamond Open Access journals and plays a key role in scholarly communication at the international level. We explore what this award has meant for Editorial CSIC and what plans they have for the future to continue improving the quality and use of their metadata.&lt;/p>
&lt;h3 id="what-motivates-your-team-to-work-towards-high-quality-metadata-what-objectives-does-it-support-for-your-organisation">What motivates your team to work towards high-quality metadata? What objectives does it support for your organisation?&lt;/h3>
&lt;p>Editorial CSIC publishes 41 scientific journals, all of which are included in major indexing databases. These are prestigious journals that have offered high-quality, peer-reviewed content for many years. &lt;strong>However, today, it is no longer enough for a scientific journal to provide quality content alone; it is now also essential to deliver high-quality metadata associated with those publications.&lt;/strong> What just a few years ago was considered a value-added service has now become indispensable.&lt;/p>
&lt;p>In an electronic and internet-based working environment, metadata is key to content dissemination and to the identification of journals, authors, publishing institutions, and funding organizations. For a publisher, it is crucial to be able to transmit this information through technical procedures and standardised protocols to ensure compatibility with the systems that harvest, store, and distribute data, enhancing the visibility and discoverability of our journals.&lt;/p>
&lt;h3 id="do-you-have-a-strategy-for-complete-metadata">Do you have a strategy for complete metadata?&lt;/h3>
&lt;p>We’ve been working with metadata for years and, periodically, we review and expand the number of elements we convert into metadata. We always prioritise what is already a clearly established standard (for example, ORCID), as well as metadata aligned with editorial policies we consider a priority (such as the CC BY license we apply).&lt;/p>
&lt;p>The workflow begins with the editor identifying the data to be collected and how it will be requested. Once this is integrated into the journal&amp;rsquo;s submission guidelines, the collaboration of authors becomes essential, as they are the ones who provide the data. In a later phase, the data is reviewed by a technical editor specialising in metadata (different from the content reviewer). Finally, it&amp;rsquo;s crucial to have a tool that enables the automated transfer of metadata, and here, having specialised technical staff is very important.&lt;/p>
&lt;p>We work with the OJS platform; I spent years depositing metadata in Crossref using XML files that we generated manually, one by one. With an average of 1,000 articles published per year, the creation of the Crossref XML export module in OJS for automated deposit from the platform was a huge help for us – it significantly lightened the workload, ensured greater reliability, and allowed us to focus our time on improving other aspects.&lt;/p>
&lt;p>It also gives us more flexibility when reviewing our data policies. For example, it allowed us to carry out a bulk deposit to update all our references in order to correct a recurring error.&lt;/p>
&lt;h3 id="how-have-you-integrated-these-into-your-metadata-processes">How have you integrated these into your metadata processes?&lt;/h3>
&lt;p>The Crossref Metadata Enrichment Award was specifically granted to the journal Boletín Geológico y Minero for having shown significant improvement in its metadata in recent years. This journal was previously published by another institution, and when Editorial CSIC took over, we applied the same standards we have been using for our other journals for many years. We are especially proud of this because we see the award as recognition of a metadata policy we’ve been developing over the years, one that has led to significant improvements for this journal in a relatively short time.&lt;/p>
&lt;p>&lt;strong>The collaboration of the journal’s scientific leadership was key to achieving this.&lt;/strong> We first explained which data should be requested from authors, why, and for what purpose. Then we ensured that the data was being properly integrated into the articles and implemented it within the OJS platform. From there, we proceeded with depositing the metadata in Crossref and also integrating it into other metadata dissemination channels.&lt;/p>
&lt;h3 id="what-impact-of-good-metadata-can-you-see-for-your-organisation-is-it-supporting-the-business-andor-editorial-side-of-your-work">What impact of good metadata can you see for your organisation? Is it supporting the business and/or editorial side of your work?&lt;/h3>
&lt;p>Beyond their obvious benefits, such as increasing the visibility of our publications and contributing to the management of controlled, high-quality information, they should ultimately help us position ourselves as a professional group. Our essential role is to publish peer-reviewed, high-quality scientific content and deliver it to the scientific community and, increasingly, to society at large.
However, today, we should also aim to be recognised as data providers. And that, in the “age of data,” is a significant shift. We must be able to extract metadata from our publications-supplied by authors (keywords, affiliations, bibliographies&amp;hellip;). We also need to generate other metadata ourselves, and transmit and disseminate those effectively. Scientific journals must still have editors who perform copy editing and proofreading, but they must also have metadata editors, people who understand what FundRef is, and know where and how to input data into the platform to ensure it is preserved and transferred correctly and efficiently.&lt;/p>
&lt;p>That’s why I want to take this opportunity to highlight the role of the editor as a generator and provider of data. Editors are the source of data. There are other actors-like libraries and indexers-who harvest, archive, transmit, and process that data to, for example, create new content or services. But only we have the capacity to generate it.&lt;/p>
&lt;h3 id="have-you-encountered-any-challenges-in-curating-or-improving-your-metadata">Have you encountered any challenges in curating or improving your metadata?&lt;/h3>
&lt;p>Sometimes authors complain about being asked for too much information. For example, the use of ORCID is mandatory in our publications, and many authors, especially those from non-European regions, have complained because they don’t know what it is or what it’s for, or – for personal reasons – they don’t want to register for a personal identifier. These reasons are, of course, valid and understandable, but for us, the priority is to correctly identify each author, and we believe ORCID helps achieve that.
Another common issue is that when authors cite a funding source, they often include the name of the funding body, but sometimes don’t write it in full, or they omit the acronym, or worse – they include the name but not the institution or project code. Authors are used to writing with “human” readers in mind, not the machines that will later process all that information. Our role, as metadata editors, involves educating them about the importance of providing these codes and requesting them when we see they’ve been left out of the manuscript.&lt;/p>
&lt;h3 id="have-your-efforts-around-metadata-led-to-real-benefits-for-your-community-is-this-something-your-editors-authors-or-readers-are-aware-of-and-appreciate-if-so-why">Have your efforts around metadata led to real benefits for your community? Is this something your editors, authors, or readers are aware of and appreciate? If so, why?&lt;/h3>
&lt;p>For the technical editor, it&amp;rsquo;s easier to assess the value of metadata. We understand how the ecosystem works, how important platform interoperability is, how quickly and widely data can be transmitted, and how crucial it is for data to be correct from the very beginning. Once it&amp;rsquo;s out there, it can be very, very difficult to correct or control. We’re also aware of its potential impact because we know how information systems feed off each other and share information – information that we generate.&lt;/p>
&lt;p>Scientific editors, authors, and readers tend to value it less and aren’t always aware of its importance, though of course there are exceptions. While I believe everyone should at least have a basic understanding of how it works, I also think authors are already overwhelmed with all the requirements we ask of them when submitting manuscripts. Editors are here to guide them on what data to provide and how to provide it.&lt;/p>
&lt;p>That said, today, everyone is at least somewhat familiar with what data is and what can be done with it. We all consume a wide variety of digital content online and have at least a basic idea of what metadata, personal data, and algorithms are. A few years ago, explaining all this was much more difficult, but nowadays, it’s much easier for people to grasp, especially within the scientific and technological environment in which we publish.&lt;/p>
&lt;h3 id="looking-ahead-how-are-you-planning-to-build-on-your-metadata-quality-are-there-new-elements-or-practices-youre-exploring-and-what-advice-would-you-give-to-others-just-starting-to-strengthen-their-metadata">Looking ahead, how are you planning to build on your metadata quality? Are there new elements or practices you’re exploring? And what advice would you give to others just starting to strengthen their metadata?&lt;/h3>
&lt;p>At Editorial CSIC, ever since we began publishing in electronic format and distributing our journals online, almost 20 years ago, we have consistently sought to innovate in design, management platforms, and file formats. Speaking of specific actions, we have extended the mandatory use of ORCID and DOI to contributions that are not strictly scientific articles (until now, our book reviews, obituaries, and similar texts didn’t have them), and we are currently considering the implementation of ROR identifiers for research organizations.&lt;/p>
&lt;h3 id="do-you-have-any-advice-for-organisations-that-are-making-an-effort-to-improve-the-quality-of-their-metadata">Do you have any advice for organisations that are making an effort to improve the quality of their metadata?&lt;/h3>
&lt;p>For editors who are just beginning to strengthen their metadata, I would suggest something that seems logical and simple, but is not always put into practice: take the time to calmly and thoroughly plan a data policy. This should be based on identifying and selecting which data elements are most important, then implementing protocols to request them from authors and integrate them into editorial platforms, and finally, configuring those platforms correctly to ensure proper export.&lt;/p>
&lt;p>Metadata involves a chain of tasks carried out by people with different profiles. You need to have resources to strengthen that chain. It’s good to remember that it’s not enough to simply ask authors for data – you have to follow the data along its entire path from the source as far as possible. That journey doesn’t end when we deposit it in Crossref: we can also deposit it in other repositories, find additional ways to disseminate it, and we must revisit it if we detect any recurring errors that can be corrected.&lt;/p>
&lt;p>And now enjoy this acceptance video.&lt;/p>
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.2500%;
padding-bottom: 0; box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); margin-top: 1.6em; margin-bottom: 0.9em; overflow: hidden;
border-radius: 8px; will-change: transform;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0;margin: 0;"
src="https://www.canva.com/design/DAGssqadV1c/w5SHN3-tL2wZz6Y10WVHOQ/watch?embed" allowfullscreen="allowfullscreen" allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;a href="https:&amp;#x2F;&amp;#x2F;www.canva.com&amp;#x2F;design&amp;#x2F;DAGsxAyXmXs&amp;#x2F;rOVOK6z99_UlaclRJHPekw&amp;#x2F;watch?utm_content=DAGsxAyXmXs&amp;amp;utm_campaign=designshare&amp;amp;utm_medium=embeds&amp;amp;utm_source=link" target="_blank" rel="noopener">&lt;/a></description></item><item><title>We’ve migrated to the cloud; we hope you didn’t notice (but maybe you did)</title><link>https://www.crossref.org/blog/weve-migrated-to-the-cloud-we-hope-you-didnt-notice-but-maybe-you-did/</link><pubDate>Tue, 12 Aug 2025 00:00:00 +0000</pubDate><author>Sara Bowman</author><guid>https://www.crossref.org/blog/weve-migrated-to-the-cloud-we-hope-you-didnt-notice-but-maybe-you-did/</guid><description>&lt;p>TLDR: We&amp;rsquo;ve successfully moved the main Crossref systems to the cloud! We’ve more to do, with several bugs identified and fixed, and a few still ongoing. However, it’s a step in the right direction and a significant milestone, as, whilst it is a much larger financial investment, it addresses several risks and limitations and shores up the Crossref infrastructure for the future.&lt;/p>
&lt;h3 id="some-background">Some background&lt;/h3>
&lt;p>We have been doing a lot of thinking, planning, and working on paying down our technical debt and modernising our systems. It’s not fun and flashy work, but it is vital for sustaining our infrastructure, meeting the demand on existing services, and developing new services.&lt;/p>
&lt;p>Just about a year ago, we completed phase one, migrating our main database from Oracle to PostgreSQL, an open-source database. This move brought us more in line with our commitment to the &lt;a href="https://www.crossref.org/categories/posi/">POSI principles&lt;/a>, reduced our dependencies on costly private licenses, and opened up the possibility to use and offer additional and more contemporary features. With the transition to PostgreSQL we made upgrades to the operating system, the database software, and the underlying hardware, resulting in significant improvements to the overall throughput and capacity of the deposit system. Previously, we typically maintained a queue of more than 10,000 deposits waiting to be processed; now, the queue holds fewer than 100 deposits on average. Consequently, the average latency – the elapsed time from submission to deposit – has reduced from hours to seconds.&lt;/p>
&lt;p>During phase one, a total of 35 new servers were created, and for the first time, the entire system configuration was defined through &lt;a href="https://en.wikipedia.org/wiki/Infrastructure_as_code" target="_blank">infrastructure-as-code&lt;/a>, enabling the infrastructure to be recreated as necessary. This effort not only enabled the migration but also established a solid foundation for our cloud migration strategy, as the code was leveraged to configure our infrastructure on AWS. Additionally, it serves as a critical component of our disaster recovery planning.&lt;/p>
&lt;p>Most importantly, phase one set us up for phase two and our next migration: moving the system into the cloud.&lt;/p>
&lt;h3 id="why-we-moved-to-the-cloud">Why we moved to the cloud&lt;/h3>
&lt;p>We had been running most of our services in a physical data centre near Boston, MA, USA (there are a few exceptions: the &lt;a href="api.crossref.org">REST API&lt;/a> and our test system (test.crossref.org) were already in the cloud, as was the Crossref website). We’ve been planning to move to the cloud for &lt;em>ahem&lt;/em> quite some time, but as always, competing priorities and limited resources have thwarted us, and the data centre was mainly serving us well.&lt;/p>
&lt;p>But… with staff across 12 countries, and increased global use of our system, operating our own hardware in a physical data centre was becoming increasingly challenging and risky, not to mention, frustrating.&lt;/p>
&lt;p>Moving to the cloud has solved several pain points for us:&lt;/p>
&lt;ul>
&lt;li>Physical access to the data centre was required for various tasks (e.g., hardware upgrades, troubleshooting, general maintenance), but as Crossref grew as an organisation and became more distributed, we had fewer staff in the area. Hosting services in the cloud means staff around the world can access our servers remotely from anywhere (and we can leave the hardware upgrades to our vendor).&lt;/li>
&lt;li>Scalability in the data centre required installing new hardware or upgrading connections, which also meant a good amount of time. In the cloud, we can scale up almost instantly.&lt;/li>
&lt;li>We can maintain copies of our databases and services in distributed places, providing insurance against natural or other disasters.
Upgrades now don’t involve buying physical hardware and installing it; it’s a much quicker and more straightforward process.&lt;/li>
&lt;/ul>
&lt;p>Moving from a physical data centre to the cloud also has some trade-offs; for instance, the cost will be approximately five times higher than running the system in the data centre; with initial data, it’s not unlikely the annual cost may be up to 2,000,000 USD. We aim to optimise and control this cost going forward.&lt;/p>
&lt;h3 id="what-we-did">What we did&lt;/h3>
&lt;p>The size of the undertaking was partly due to leaving it so long; technical debt has accumulated over many years of running the system in the data centre.&lt;/p>
&lt;p>The whole plan was hugely detailed, but we can distil it to a few bullets:&lt;/p>
&lt;ul>
&lt;li>We conducted an analysis of components, considered risks and sequencing, and created a test plan and timeline, including comms.&lt;/li>
&lt;li>While most of the drive and work was on the shoulders of two infrastructure services colleagues, our software engineers were heavily involved too, and we had weekly check-ins with a cross-team group to review progress, reassess risks, and adjust timelines as we got closer to the migration date (or decided to move it once or twice).&lt;/li>
&lt;li>We first created the deposit system in the cloud.&lt;/li>
&lt;li>We then created other parts of our services that aren’t in the deposit system code base, but run alongside it, such as reports, querying, and other tools.
We replicated our databases (of which there are several, in a few different flavours - PostgreSQL, MySQL).&lt;/li>
&lt;li>We gave 14 days’ notice to our members, via email, and kept this maintenance notice up to date.&lt;/li>
&lt;li>We commenced the migration on 8th July, which involved taking the whole system down and rejecting deposits for up to 24 hours.&lt;/li>
&lt;li>In the process, we scripted the process to create CS and the other services using Terraform and Ansible, so that going forward, bringing up a whole new instance of CS (should we need to) won’t be a manual process.&lt;/li>
&lt;li>We moved the DNS to point at our new system in the cloud, rather than the data centre. We brought the system back up on 9th July, after 14 hours of downtime, and watched the first few deposits come in, while testing thoroughly.&lt;/li>
&lt;li>Alongside the technical team, the membership and support team was at the ready to work through the testing in the new live production environment.&lt;/li>
&lt;/ul>
&lt;p>The message we sent to members, Metadata Plus subscribers, and key integrators like PKP and Turnitin, listed which services would be down and described what changes they might see, such as:&lt;/p>
&lt;ul>
&lt;li>The system timezone shifted from EST to UTC (universal coordinated time), which would be noticeable in the timestamps reported back to members after metadata deposits&lt;/li>
&lt;li>Our IP address became dynamic and is no longer static. If members had hardcoded our previous IP static address to connect to our services, that would no longer work.&lt;/li>
&lt;li>We previously allowed connections using the HTTP/1.0 protocol, but now require HTTP/1.1.
Likewise, we previously allowed TLS version 1.1, but now require at least version 1.2. Older ciphers will not work. A list of accepted ciphers can be found on &lt;a href="https://docs.aws.amazon.com/elasticloadbalancing/latest/application/describe-ssl-policies.html#tls-security-policies" target="_blank">this page&lt;/a> for “ELBSecurityPolicy-TLS13-1-2-2021-06”.&lt;/li>
&lt;/ul>
&lt;h3 id="how-it-went-and-whats-next">How it went and what’s next&lt;/h3>
&lt;p>We still have more to do, with both expected and unexpected issues arising from the migration. There are a couple of functions that still route through the data centre, configuration changes to wrangle, and processes to iron out, so we’ll be keeping that open for another couple of months.&lt;/p>
&lt;p>Those were the known issues…&lt;/p>
&lt;p>…we also uncovered a few bugs along the way, and we’ve been reporting those (and our progress toward fixing them) on our status page. &lt;a href="https://status.crossref.org/history" target="_blank">See history&lt;/a>.&lt;/p>
&lt;p>A few diligent members also alerted us to problems they were having. In some cases, we could tell why, and in many cases, their systems needed to be upgraded to work with ours. Thanks go to mEDRA, Spandidos Publications, and Stichting SciPost who helped us identify gaps that resulted in configuration improvements and lessons learned (that we then shared with other members).&lt;/p>
&lt;p>There were three issues that we were contacted about more than others:&lt;/p>
&lt;ol>
&lt;li>&lt;a href="https://status.crossref.org/incidents/scr3rtr8f4pm" target="_blank">Delayed delivery of notification emails&lt;/a> which is partly due to the volume of backlogged notification emails in the system.
&lt;ul>
&lt;li>Mostly solved: We have repaired delivery of notification emails for all metadata deposits and are working on a fix for the delivery of messages associated with very large queries.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;a href="https://status.crossref.org/incidents/nyr3g5b3h05v" target="_blank">A small percentage of registered records not being indexed in the REST API&lt;/a> - this can cause downstream issues for a number of other services (e.g., Crossref metadata search - search.crossref.org, Participation Reports, ORCID auto-update, and for external services that make use of the metadata from our REST API).
&lt;ul>
&lt;li>Mostly solved: All records in July are now indexed in the REST API, albeit we have new reports of a few records missing in the last week, which we are actively investigating.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;a href="https://status.crossref.org/incidents/9cdhzzvt1xt3" target="_blank">Delayed delivery of July’s resolution reports&lt;/a>.
&lt;ul>
&lt;li>Solved - not only has July’s resolution report run completed, but we also completed August’s ahead of schedule.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol>
&lt;p>This migration was a significant effort, and 2025’s top priority project for the Open and Sustainable Operations (OSO) &lt;a href="https://doi.org/10.64000/4s2ee-wkr84" target="_blank">program&lt;/a> team. Overall, we’re happy with our progress toward making Crossref infrastructure more robust, reliable, and future-proof. And judging by the messages of support we received, you are too! Onwards to the next infrastructure project… &lt;a href="https://roadmap.productboard.com/e6fdeba8-a5b3-4aef-8104-d48863ba975e" target="_blank">check out our roadmap to see what’s up next&lt;/a>.&lt;/p>
&lt;h3 id="references">References&lt;/h3>
&lt;ol>
&lt;li>&amp;lsquo;Infrastructure as code&amp;rsquo; (2025) &lt;em>Wikipedia&lt;/em>, 12 August. Available at: &lt;a href="https://en.wikipedia.org/wiki/Infrastructure_as_code" target="_blank">https://en.wikipedia.org/wiki/Infrastructure_as_code&lt;/a> (Accessed: 12 August 2025).&lt;/li>
&lt;li>&amp;lsquo;The programs approach: our experiences during the first quarter of 2025&amp;rsquo; (2025) &lt;em>Crossref&lt;/em>. Available at: &lt;a href="https://doi.org/10.64000/4s2ee-wkr84" target="_blank">https://doi.org/10.64000/4s2ee-wkr84&lt;/a> (Accessed: 12 August 2025).&lt;/li>
&lt;/ol></description></item><item><title>From storage closet to metadata champions: ASM's journey toward a smarter scholarly infrastructure</title><link>https://www.crossref.org/blog/from-storage-closet-to-metadata-champions-asms-journey-toward-a-smarter-scholarly-infrastructure/</link><pubDate>Mon, 04 Aug 2025 00:00:00 +0000</pubDate><author>David Haber</author><guid>https://www.crossref.org/blog/from-storage-closet-to-metadata-champions-asms-journey-toward-a-smarter-scholarly-infrastructure/</guid><description>&lt;p>The &lt;a href="https://asm.org/" target="_blank">American Society for Microbiology (ASM)&lt;/a> has earned recognition in &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Crossref&amp;rsquo;s Participation Reports&lt;/a> for its exceptional metadata coverage among large publishing members––an achievement built on intentional change, technical investment, and collaborative work. In this Q&amp;amp;A, the ASM team shares what that journey looked like, the challenges they&amp;rsquo;ve tackled, and how centering metadata has helped them better connect research with the global scientific community.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>A key lesson we learned is that meaningful progress doesn&amp;rsquo;t require perfection from day one. Start small, find manageable wins, refine as you go, and build a shared understanding across all your teams.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; David Haber, ASM&lt;/cite>&lt;/p>
&lt;/div>
&lt;h3 id="since-we-first-featured-your-metadata-efforts-in-2022httpsdoiorg1064000nhmg5-3ra76-what-developments-or-improvements-have-you-madeand-how-does-this-new-recognition-reflect-the-journey-so-far">&lt;a href="https://doi.org/10.64000/nhmg5-3ra76" target="_blank">Since we first featured your metadata efforts in 2022&lt;/a>, what developments or improvements have you made—and how does this new recognition reflect the journey so far?&lt;/h3>
&lt;p>Once we completed our initial metadata cleanup of our backfile and made sure that we were producing good, clean, and consistent Crossref metadata (no small feat), we realized that each new policy, process, or even style change should be viewed through a metadata capture lens. By looking at our publishing goals through that lens, we are better able to see the right time and method to help enrich and &amp;ldquo;grow&amp;rdquo; both our article metadata breadth and depth. Much of the metadata work is invisible or an afterthought. But the recognition of ASM&amp;rsquo;s coverage in the participation reports has affirmed that our change in perspective — shifting from viewing Crossref metadata as something produced as an afterthought to centering our processes around the creation of that metadata — has put us on the right path.&lt;/p>
&lt;h3 id="have-any-of-your-goals-around-metadata-changed-or-grown-since-then-what-feels-different-about-your-work-now-compared-to-when-you-were-first-featured">Have any of your goals around metadata changed or grown since then? What feels different about your work now compared to when you were first featured?&lt;/h3>
&lt;p>When we first started on our various metadata cleanup projects, it felt like there were just a few of us, arguing, agreeing, and arguing some more about obscure tagging structures and proper XML modeling in a closet––literally&amp;hellip; My office actually was an old storage closet, and my pre-pandemic whiteboard still has that ghostly blue haze of angle brackets scribbled with dry-erase markers.&lt;/p>
&lt;p>Since then, our goals have shifted significantly. Early on, we just wanted all our content mapped to DOIs; then we thought, &amp;ldquo;Oh wait. Let&amp;rsquo;s include as many abstracts as possible. And references. If we have the data, let&amp;rsquo;s send it.&amp;rdquo; Now that we have a strong metadata foundation, we can think proactively about what to capture and transmit, how we want to prioritize our efforts, and how to make research we publish more discoverable to those who need it.&lt;/p>
&lt;h3 id="looking-back-were-there-any-changes-in-internal-collaboration-or-external-partnerships-that-influenced-your-progress">Looking back, were there any changes in internal collaboration or external partnerships that influenced your progress?&lt;/h3>
&lt;p>Over the past three to four years, we have made some significant changes to our partnerships. We migrated to a new online platform (Atypon), a new production partner (Kriyadocs), a new submission platform (Chronoshub), and a new billing system (RLSC). Each of these partnerships allowed us to evaluate how we were capturing metadata, when that capture occurred, and how best to improve the QC process to ensure accuracy and quality. These partnerships accelerated all our efforts to improve hidden metadata and finally brought them out of the storage closet into the light.&lt;/p>
&lt;h3 id="have-you-adopted-any-new-tools-standards-or-technologies-since-your-last-blog">Have you adopted any new tools, standards, or technologies since your last blog?&lt;/h3>
&lt;p>Our production software (Kriyadocs) has centered metadata capture as a core function. We have processes and procedures that match all affiliations to Ringgold and &lt;a href="https://www.crossref.org/community/ror/">ROR IDs&lt;/a>. We have invested heavily in partnerships with organizations like Chronoshub to utilize natural language processing, automating the identification of authors and affiliations, so that users no longer have to fill out tedious forms. We embraced &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a> and strongly encourage all authors to register for one if they don&amp;rsquo;t already have it. We have also adopted &lt;a href="https://www.niso.org/publications/z39104-2022-credit" target="_blank">the CRediT taxonomy&lt;/a> as a contributor framework and have built processes to make it easy for authors to stay within that taxonomy.&lt;/p>
&lt;h3 id="have-you-encountered-any-challenges-in-curating-or-improving-your-metadata-if-so--what-were-they-and-how-did-you-address-those">Have you encountered any challenges in curating or improving your metadata? If so – what were they and how did you address those?&lt;/h3>
&lt;p>The core problem (from our perspective) has always been the difference between author profile information and what is actually submitted in manuscripts. Auto-extraction of manuscript data into submission forms is one small step toward unifying author identity with manuscript data. One of our biggest pain points now is reconciling the chaotic data on author affiliations in manuscripts with institutional identifiers. Over the next year, this will be one of our main initiatives.&lt;/p>
&lt;h3 id="can-you-share-any-examples-where-high-quality-metadata-clearly-benefited-your-organization-community-or-publishing-processes">Can you share any examples where high-quality metadata clearly benefited your organization, community, or publishing processes?&lt;/h3>
&lt;p>The capture of ORCID IDs has improved our ability to match papers to editors and identify hidden conflicts of interest. ORCID IDs have also helped us expand our reviewer pool, as they enable us to better disambiguate individuals with similar names.&lt;/p>
&lt;p>Because we now capture CRediT roles in a controlled manner (rather than as loose text in the acknowledgments section), we are better able to identify when authors are contributing equally and how authors determine author order in the byline when this occurs. &lt;a href="https://journals.asm.org/doi/10.1128/mbio.00646-24" target="_blank">This analysis&lt;/a> was undertaken by one of our Editors-in-Chief to study gender bias when authors contributed equally to a work. Now that we capture CRediT roles as structured data, we can build on his research.&lt;/p>
&lt;p>In the last two years, we have also begun capturing Data Availability Statements and Ethics Statements in unique metadata fields (rather than as unstructured text in the body of an article or in the acknowledgments sections) because some of our editors are curious about open data policy compliance and whether there is higher uptake of open science initiatives in certain microbiology fields.&lt;/p>
&lt;p>&lt;strong>RC: These are very interesting and quite profound results, especially for integrity and equality in the publishing process! Good to see how useful you find this information as we’re approaching our schema updates to include contributor roles, among other things. I see that editors are already on board and taking advantage of high quality metadata. Are authors more engaged with metadata now than before?&lt;/strong>&lt;/p>
&lt;p>Our authors likely are engaged too––though we have tried to build author metadata QC into our proofing and typesetting process in such a way that they wouldn&amp;rsquo;t even notice.&lt;/p>
&lt;h3 id="what-challenges-have-you-encountered-while-sustaining-or-scaling-your-metadata-work">What challenges have you encountered while sustaining or scaling your metadata work?&lt;/h3>
&lt;p>In the realm of metadata, there are two standard solutions: 1) hire vendors to clean data at the end (the throw-people-at-the-problem philosophy); or 2) trust a black-box technical solution. The problem with the first method is that it is inefficient and can become expensive. The issue with the second is that, in my experience, most technical solutions have an 80% success rate. That may be acceptable for certain types of data, but it can fail spectacularly at the worst possible moment.&lt;/p>
&lt;p>For example, let&amp;rsquo;s say you find a technical solution that parses affiliation data in such a way as to assign a PID. Great, wonderful. Let&amp;rsquo;s say your parser is the best natural language processor in the world and makes matches 90% of the time (if you have one that does this, I&amp;rsquo;m all ears). You announce that you are including these IDs. Everyone cheers. It is great, right? Now, imagine you want to use those IDs to identify subscribing institutions to offer discounts or fee-less publishing for authors. You also want to use those IDs to send alerts to institutional admins of publishing activity. In both situations, achieving 90% accuracy simply won&amp;rsquo;t work. What we&amp;rsquo;ve learned is that black-box technology and &amp;rsquo;throw people at it&amp;rsquo; philosophies cannot work alone. Metadata curation must be a collaborative effort among authors, publishers, funders, and institutions, where the information grows throughout the research process.&lt;/p>
&lt;h3 id="whats-next-are-you-exploring-any-new-metadata-elements-or-areas-eg-funding-data-peer-review-metadata-preprints">What&amp;rsquo;s next? Are you exploring any new metadata elements or areas (e.g., funding data, peer review metadata, preprints)?&lt;/h3>
&lt;p>Over the next year, we will focus on CRediT identifiers and pass them to Crossref, along with institutional PIDs (ROR, Ringgold, and ISNI). We are also exploring various ways to capture peer reviewer activity and contributions, which will inevitably lead us down new and interesting paths.&lt;/p>
&lt;h3 id="anything-else-you-want-to-share">Anything else you want to share?&lt;/h3>
&lt;p>Here&amp;rsquo;s the thing about metadata that I wish I&amp;rsquo;d known when I started: it&amp;rsquo;s not a project with a finish line. It&amp;rsquo;s more like tending a garden that keeps growing in unexpected directions. Every time you think you&amp;rsquo;ve got it figured out, someone invents a new identifier, or your authors start doing something creative with their affiliations, or a funder changes their requirements, and suddenly you&amp;rsquo;re back to the drawing board.&lt;/p>
&lt;p>But what I&amp;rsquo;ve also learned from our journey out of that metaphorical (and literal) storage closet: the best metadata work happens when you start thinking of it as infrastructure. Good metadata is like good plumbing; when it&amp;rsquo;s working, nobody notices it, but when it&amp;rsquo;s not, everything backs up and gets messy fast.&lt;/p>
&lt;p>If you&amp;rsquo;re just starting this journey, my advice is this: don&amp;rsquo;t try to boil the ocean (gosh, I still need to remember that one). Pick one thing. Perhaps it could be ORCID IDs or institutional identifiers. Do it really, really well. Then build on that success. And please, for the love of all that is holy, invest in good partnerships. We couldn&amp;rsquo;t have done any of this without partners who understood that metadata isn&amp;rsquo;t just data entry; it&amp;rsquo;s the connective tissue of scholarly communication.&lt;/p>
&lt;p>Of course, even with the best partners and aligned teams, there will still be moments when you&amp;rsquo;ll sit dumbfounded in front of a screen where an author&amp;rsquo;s affiliation that was listed as &amp;ldquo;Bloomberg School of Public Health&amp;rdquo; matched to the identifier linked to the &amp;ldquo;Escuela Nacional de Sanidad.&amp;rdquo; On those days, just remember: at least you&amp;rsquo;re not still working in a storage closet with a haunted whiteboard.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Good metadata is more than just a technical specification, and it&amp;rsquo;s not just for those XML wonks and nerds. It&amp;rsquo;s a service to science, and its core mission is to help us understand the world around us.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; David Haber, ASM&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>ASM&amp;rsquo;s story is a reminder that building a strong metadata infrastructure isn&amp;rsquo;t just about meeting technical requirements—it&amp;rsquo;s about aligning people, tools, and values around the idea that clean, connected, and consistent metadata is foundational to open and discoverable research. Whether you&amp;rsquo;re starting small or overhauling major systems, their experience shows what&amp;rsquo;s possible when you treat metadata not as a checkbox, but as a core part of scholarly publishing.&lt;/p>
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.2500%;
padding-bottom: 0; box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); margin-top: 1.6em; margin-bottom: 0.9em; overflow: hidden;
border-radius: 8px; will-change: transform;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none; padding: 0;margin: 0;"
src="https://www.canva.com/design/DAGsxAyXmXs/rOVOK6z99_UlaclRJHPekw/watch?embed" allowfullscreen="allowfullscreen" allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;a href="https:&amp;#x2F;&amp;#x2F;www.canva.com&amp;#x2F;design&amp;#x2F;DAGsxAyXmXs&amp;#x2F;rOVOK6z99_UlaclRJHPekw&amp;#x2F;watch?utm_content=DAGsxAyXmXs&amp;amp;utm_campaign=designshare&amp;amp;utm_medium=embeds&amp;amp;utm_source=link" target="_blank" rel="noopener">&lt;/a>
&lt;p>Thank you, David, for taking the time to share your insights. Again, congratulations!&lt;/p></description></item><item><title>Changing fees to increase equity and reduce complexity</title><link>https://www.crossref.org/blog/changing-fees-to-increase-equity-and-reduce-complexity/</link><pubDate>Mon, 28 Jul 2025 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/changing-fees-to-increase-equity-and-reduce-complexity/</guid><description>&lt;p>The Crossref Board recently approved three recommendations for changes to our fees: introduction of a new lowest membership fee tier, removal of volume discounts for record registration, and normalisation of registration fees for peer reviews. The changes will be applied from January 2026.&lt;/p>
&lt;p>This is the first outcome of &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">the Resourcing Crossref for Future Sustainability (RCFS)&lt;/a> program, launched in 2023, as a comprehensive effort to review all aspects of Crossref revenue and how we&amp;rsquo;re adapting to growth and the diversification of our membership. The program aims to make fees more equitable, simplify our complex fee schedule, and rebalance revenue sources.&lt;/p>
&lt;p>Following two rounds of member surveys, feedback gathered from the community in polls, open discussions, and emails, the &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees (M&amp;amp;F) Committee&lt;/a> (made up of 30+ representatives from members, service providers, sponsors, and community partners) discussed evidence and made the first round of recommendations to the Board this month. We&amp;rsquo;re very thankful for their time spent reviewing data and sharing their experiences to get to this point.&lt;/p>
&lt;h2 id="goal-1-more-equitable-fees">GOAL 1: More equitable fees&lt;/h2>
&lt;p>Our membership has changed over the years - members now tend to be less well-resourced, more likely to be based in Asia or Latin America, and more likely to be much smaller operations, some of which may not even be organisations but volunteer groups. We are seeing more universities join as members, and fewer members now consider themselves publishers first and foremost. With our mission of creating a complete global research nexus, this growing diversity is excellent news.&lt;/p>
&lt;p>While new member growth is steady (2.3k members per year), over half join via a &lt;a href="https://www.crossref.org/community/sponsors/">Sponsor&lt;/a> (that makes membership more accessible both financially and technically), and close to 300 members have their membership revoked due to unpaid invoices each year, indicating that the current fee may be a barrier to participation for some.&lt;/p>
&lt;h3 id="area-of-focus-define-a-new-basis-for-sizing-and-tiering-members-for-their-capacity-to-pay">Area of focus: Define a new basis for sizing and tiering members for their capacity to pay&lt;/h3>
&lt;p>&lt;a href="https://www.crossref.org/fees/#annual-membership-fees">Our annual membership fees&lt;/a> are currently tiered according to the publishing revenue or expenses (whichever is higher) of each member. This enables each member to contribute to the community infrastructure according to their capacity to pay.&lt;/p>
&lt;p>One of the first areas under consideration throughout 2024 was an option to change the basis of our membership fees from the publishing revenue (or expenses) of each organization to their &lt;em>overall&lt;/em> organisational revenue (or expenses) instead.&lt;/p>
&lt;p>Through surveys, discussions with the M&amp;amp;F committee, and at the Crossref 2024 Annual Meeting, we received strong feedback, particularly from those based at institutions and/or following a diamond open-access model, that making this change would put Crossref beyond their reach.&lt;/p>
&lt;p>&lt;strong>It became clear therefore that we should NOT change the basis for sizing and tiering members.&lt;/strong>&lt;/p>
&lt;p>Instead, we will maintain the current basis for sizing and tiering members by considering their &lt;em>publishing&lt;/em> revenue or expenses, whichever is higher. For non-publisher members, we advise taking &amp;lsquo;publishing&amp;rsquo; to mean &amp;lsquo;producing&amp;rsquo;, so taking their cost of producing the works being registered with us, whether that is data, software, imagery, physical objects, etc.&lt;/p>
&lt;h3 id="area-of-focus-evaluate-the-usd-275-annual-membership-fee-tier-and-propose-a-more-equitable-pricing-structure-which-might-entail-breaking-this-down-into-two-or-more-different-tiers">Area of focus: Evaluate the USD 275 annual membership fee tier and propose a more equitable pricing structure, which might entail breaking this down into two or more different tiers.&lt;/h3>
&lt;p>We also looked into making our fees more equitable. It&amp;rsquo;s been long recognised that our lowest fee tier (an annual fee payment of USD 275 for all members with publishing revenue up to USD 1 million) represents a huge diversity of organisations operating within a range of financial contexts - over 95% of our non-sponsored members are in this category, and this is the category almost all new members join in. Throughout the project, we ran various surveys with our members to learn more about the makeup and factors affecting the capacity to pay for this group.&lt;/p>
&lt;p>&lt;strong>From January 2026, we will create a new annual membership tier for members whose publishing revenue/expenses (whichever is higher) is equal to or lower than USD 1,000 per year.&lt;/strong>&lt;/p>
&lt;p>Based on survey data, we expect 30-60% of our current members in the current USD 275 tier to move to this new category. This new membership fee tier will be set at USD 200 in 2026, which is 27% lower than the current 275 membership fee. We will monitor the uptake in this category, with a view to identify necessary adjustments in future years. As a result, we expect a decrease in revenue of between USD 174k (if 30% of current lowest tier members move into the new tier) and USD 348k (if 60% of those members move into the new tier).&lt;/p>
&lt;p>Our Membership team will reach out to help qualifying members change to the new tier well before January 2026. If your publishing revenue or expenses are equal to or lower than USD 1,000 per year, look out for our email in the next couple of weeks to help you transition to the lower USD 200 tier.&lt;/p>
&lt;h2 id="goal-2-simplify-complex-fees">GOAL 2: Simplify complex fees&lt;/h2>
&lt;h3 id="area-of-focus-address-and-adjust-volume-discounts-for-content-registration">Area of focus: Address and adjust volume discounts for Content Registration&lt;/h3>
&lt;p>We currently offer volume discounts for several of our record types. These are calculated at the end of each quarter.&lt;/p>
&lt;p>&lt;strong>In order to reduce the complexity of our pricing, we will eliminate all volume discounts.&lt;/strong>&lt;/p>
&lt;p>They are underused, accessible only to a small percentage of members, and the financial impact of making the change is small. These discounts contribute to complexity in our billing process and block our ability to offer members a running total or provide leaving members with a timely final invoice.&lt;/p>
&lt;p>Having consulted with affected organisations, we&amp;rsquo;re reassured that the change will not adversely affect their ability to register their works with us. We appreciate their understanding of the overall positive impact of this change for Crossref and their support for our sustainability.&lt;/p>
&lt;h3 id="area-of-focus-reduce-complexity-in-peer-review-fees">Area of focus: Reduce complexity in peer review fees&lt;/h3>
&lt;p>Finally, prompted by feedback from our members, we looked into normalising fees for peer review registration. We currently have two sets of fees for peer reviews based on whether the review is registered by the owner of the item being reviewed. There is a charge for the first review for a specific article, and a different charge for subsequent reviews for the same article by the same member. This charge for the subsequent reviews also varies depending on who registered the review. Very few members register peer reviews for records that they do not own, so having a separate, higher set of fees just adds complexity to the fee schedule with no financial or strategic benefit.&lt;/p>
&lt;p>&lt;strong>Starting from January 2026, we will consolidate all peer review fees, regardless of who registers it, to USD 0.25 for the first review for an article, and free registration for any subsequent reviews of that same record by the same member.&lt;/strong>&lt;/p>
&lt;h3 id="area-of-focus-address-and-adjust-back-year-discounts-for-record-registration">Area of focus: Address and adjust back-year discounts for record registration&lt;/h3>
&lt;p>Another recommendation, related to the removal of back-year discounts for select record types (conference proceedings, technical reports and working papers, theses and dissertations, and posted content/preprints) due to under use, hasn&amp;rsquo;t been approved yet. Based on feedback from the board, more research will be conducted on trends related to specific record types, such as theses and dissertations, so we can better understand potential unintended consequences of such changes.&lt;/p>
&lt;p>We&amp;rsquo;re looking to retain back-year discounts for record types where they continue to be well-used, including those for journal articles and book titles. We&amp;rsquo;re also looking to retain back-year discounts for grants, as these are at an early stage of adoption, and new funders coming on board naturally start with a backlog of grants to register in the Grant Linking System.&lt;/p>
&lt;h3 id="what-happens-next">What happens next?&lt;/h3>
&lt;p>The Resourcing Crossref for Future Sustainability (RCFS) initiative is very broad, and in the coming months and years you can expect progress with other aspects of our fees and resourcing. There is more work to come, including the rebalancing of revenue from the use of our metadata, the future of fees for our funder members, and further changes to record registration fees.&lt;/p>
&lt;p>We&amp;rsquo;re glad to see the first changes progressing to implementation, and would like to thank our Membership and Fees Committee and all members who took part in the consultations so far for your continued support.&lt;/p></description></item><item><title>Metadata excellence among new members: La Salle University, Perú</title><link>https://www.crossref.org/blog/metadata-excellence-among-new-members-la-salle-university-per%C3%BA/</link><pubDate>Fri, 25 Jul 2025 00:00:00 +0000</pubDate><author>Yasiel Pérez Vera</author><guid>https://www.crossref.org/blog/metadata-excellence-among-new-members-la-salle-university-per%C3%BA/</guid><description>&lt;p>&lt;a href="#version-in-english">Click here for the version in English&lt;/a>&lt;/p>
&lt;p>En 2025, lanzamos los Premios Crossref a los Metadatos, con el objetivo de destacar el rol de nuestra comunidad en la gestión y el enriquecimiento del registro académico. En esta publicación, destacamos a la Universidad La Salle, Perú, ganadora del premio a la excelencia entre los nuevos miembros, y contamos con la participación de Yasiel Pérez, Responsable Técnico y Editor de la Revista, quien comparte sus ideas:&lt;/p>
&lt;h2 id="por-qué-los-metadatos-importan-para-nosotros">Por qué los metadatos importan para nosotros&lt;/h2>
&lt;p>La Universidad La Salle se convirtió en miembro de Crossref hace relativamente poco tiempo, en 2023. Gestionamos nuestras revistas usando Open Journal Systems (OJS), y una vez que nos unimos a esta comunidad, los diferentes Consejos Editoriales compartimos la motivación de lograr una mayor visibilidad global, y vimos una oportunidad de mejora al proporcionar más metadatos y más completos.&lt;/p>
&lt;h2 id="el-lado-técnico-de-subsanar-las-deficiencias">El lado técnico de subsanar las deficiencias&lt;/h2>
&lt;p>Nuestras revistas, que llevan activas entre dos y cuatro años, han comenzado a enriquecer sus metadatos faltantes a niveles aceptables (¡creemos que aún podemos mejorar a niveles excelentes!). Gracias a mi formación como ingeniero de software, adaptamos el plugin de OJS para que admita campos de metadatos adicionales que no están disponibles en las versiones anteriores. El plugin requiere actualizaciones, por lo que realizamos modificaciones personalizadas para que sea compatible con los esquemas Crossref más recientes. Debido a limitaciones de tiempo, recursos humanos y financieros, consideramos más eficiente adaptar el plugin en lugar de adaptar nuestras instalaciones de OJS a las últimas versiones. Con estas modificaciones, depositamos los ROR ID, las licencias, las páginas de políticas y las actualizaciones de las revistas en Crossmark.&lt;/p>
&lt;p>Por otro lado, hemos probado la versión con Soporte a Largo Plazo actual y la versión 3.5 de OJS, y recomiendo encarecidamente a cualquier usuario que actualice a cualquiera de estas versiones más recientes. Incluyen importantes parches de seguridad y, además, los plugins de Crossref son compatibles con los esquemas más recientes. Desafortunadamente, para nosotros, actualizar los sistemas desde una versión anterior a la 3.3 requiere tiempo adicional y soporte técnico, dada la importancia de los cambios de la v3.2 a la v3.3.&lt;/p>
&lt;h2 id="haciendo-las-políticas-sobre-metadatos-una-prioridad">Haciendo las políticas sobre metadatos una prioridad&lt;/h2>
&lt;p>Tenemos un compromiso institucional con la provisión de metadatos enriquecidos. Contamos con políticas que exigen metadatos lo más completos posible como parte de nuestros flujos de trabajo, y lo convertimos en un requisito estricto. Naturalmente, existen algunos desafíos. Los metadatos abiertos y transparentes aún están relativamente poco valorados. A veces, los editores no comprenden completamente las implicaciones de proporcionar metadatos enriquecidos; mostrar su nombre en el sitio web no es lo mismo que tenerlo en los metadatos, por lo que la conexión entre la versión de registro y su visibilidad no siempre es evidente para autores y editores. Los apoyamos proporcionando directrices y capacitación a los consejos editoriales y equipos de las revistas. Por ejemplo, si una afiliación no está disponible en ROR, animamos a los autores a solicitar su inclusión en el registro.&lt;/p>
&lt;p>Por otro lado, esto también nos motiva. Nos estamos preparando para empezar a incluir metadatos de subvenciones y financiación en nuestros flujos de trabajo. También apuntamos a utilizar estos datos para estudiar el impacto de nuestras políticas editoriales en la visibilidad, el uso, las citas, la indexación y otras métricas institucionales. La Universidad La Salle es una organización interesante porque formamos una red de universidades de todo el mundo, lo que provoca errores en la identificación adecuada.&lt;/p>
&lt;p>Creemos que ciertamente otras organizaciones pueden lograr altos niveles de enriquecimiento de metadatos. Esto tiene dos aspectos fundamentales: uno técnico y otro organizativo. Desde nuestra perspectiva, el primer paso es obtener el apoyo de la organización y establecer políticas a nivel de toda la organización. Las soluciones técnicas pueden seguir después y no son fundamentalmente difíciles en comparación con conseguir que la comunidad proporcione metadatos buenos y completos.&lt;/p>
&lt;p>Una vez que se consigue la asignación de recursos, se planifica la hoja de ruta para recopilar más metadatos. Es mejor tenerlos y no usarlos que necesitarlos y no tenerlos. Por ejemplo, ya estamos recopilando los roles de autor utilizando la taxonomía CRediT, por lo que una vez que sea totalmente compatible con el esquema de Crossref, queremos estar preparados para enviarlos. Idealmente, nos gustaría ver compatibilidad con identificadores alternativos y más tipos de fechas. Recopilamos las fechas de envío y aceptación a través de Crossmark y asignamos simultáneamente DOI, PURL y ARK. Con el tiempo suficiente, también planeamos implementar la revisión por pares abierta en nuestras revistas.&lt;/p>
&lt;h2 id="lo-que-el-reconocimiento-nos-ayudó-a-lograr">Lo que el reconocimiento nos ayudó a lograr&lt;/h2>
&lt;p>Recibir este premio ha tenido un profundo impacto en nuestra organización; nos ayuda a reforzar el mensaje que intentamos transmitir a nuestra comunidad. Abrió los ojos de las autoridades y los gestores de presupuesto, y también está aumentando la visibilidad de la organización en la región. Queremos ser vistos como un ejemplo en la comunidad local y regional: «Si una institución provincial puede hacerlo, otras también». Hemos comenzado a recibir llamadas solicitando capacitación para otras organizaciones. Por lo tanto, este premio ha sido sin duda fundamental para nosotros.&lt;/p>
&lt;h3 id="version-in-english">Version in English&lt;/h3>
&lt;p>In 2025, we launched the Crossref Metadata Awards, aiming to highlight our community’s role in stewarding and enriching the scholarly record. In this post, we put the spotlight on La Salle University, Perú, winner of the award for excellence among new members, and have Yasiel Pérez, Technical Head and Journal Editor, sharing his insights:&lt;/p>
&lt;h2 id="why-metadata-matters-to-us">Why metadata matters to us&lt;/h2>
&lt;p>La Salle University became a Crossref member relatively recently, in 2023. We manage our journals using Open Journal Systems (OJS), and once we became part of this community, the different Editorial Boards had as a common motivation achieving more global visibility, and we saw an opportunity for improvement by providing more and richer metadata.&lt;/p>
&lt;h2 id="technical-side-of-filling-the-gaps">Technical side of filling the gaps&lt;/h2>
&lt;p>Our journals that have been active for two to four years started enriching their missing metadata to acceptable levels (we still think we can improve to excellent levels!). Because of my background as a software engineer, we adapted the OJS plugin to support additional metadata fields not yet available in the older versions. The plugin requires updates, so we made custom modifications to support the latest Crossref schemas. Because of time, human, and financial constraints, we found it most efficient to adapt the plugin rather than to adapt our OJS installations to the latest versions. With these modifications, we deposit ROR IDs, licences, and the journals&amp;rsquo; policy pages and updates to Crossmark.&lt;/p>
&lt;p>On the other hand, we have tested the current Long-term support and the 3.5 versions of OJS and I fully recommend to any user to upgrade to any of these more recent versions, there are important security patches and also the Crossref plugins are compatible with the latest schemas. Unfortunately, for us, upgrading the systems from a version older than 3.3 requires additional time and technical support, given the importance of changes from v3.2 to v3.3.&lt;/p>
&lt;h2 id="making-metadata-a-policy-priority">Making metadata a policy priority&lt;/h2>
&lt;p>We have an institutional commitment to the provision of rich metadata. We have policies in place to require metadata as complete as possible as part of our workflows and we make this a strict requirement. Naturally, there are some challenges. Open and transparent metadata is still relatively underappreciated. Sometimes editors don’t fully understand the implications of providing rich metadata; displaying your name in the website is not the same as having it on the metadata so the connection between the version of record and its visibility is not always evident for authors and editors. We support them by providing guidelines and training to the editorial boards and journal teams. E.g. if an affiliation is not available in ROR we encourage authors to request their inclusion in the registry.&lt;/p>
&lt;p>On the other hand, this is also a motivational push for us. We are preparing to start including grant and funding metadata in our workflows. We also aim to use this data to study the impact of our editorial policies on the visibility, use, citations, indexation, and other institutional metrics. La Salle University is an interesting organization because we are a network of universities across the world, leading to mistakes in proper identification.&lt;/p>
&lt;p>We certainly think that other organizations can achieve high levels of metadata enrichment. There are two fundamental aspects to it: A technical aspect and an organizational aspect. From our point of view, the first step is gaining organizational support, establishing organization-wide policies. The technical solutions can follow and are not fundamentally difficult compared with having the community provide good and complete metadata.&lt;/p>
&lt;p>Once you manage to secure the assignment of resources, then you plan the roadmap for collecting more metadata. It&amp;rsquo;s better to have it and not use it than to need it and not have it. For example, we already collect author roles using the CRediT taxonomy, so once it is fully supported by Crossref’s schema, we want to be prepared to submit them. Ideally, we would like to see support for alternative identifiers and more types of dates. We collect submission and acceptance dates via Crossmark and we simultaneously assign DOI, PURL, and ARK. Given enough time, we are also planning to implement open peer review in our journals.&lt;/p>
&lt;h2 id="what-the-recognition-helped-us-achieve">What the recognition helped us achieve&lt;/h2>
&lt;br>
&lt;div style="float: right; width: 50%; max-width: 480px; margin-left: 1em; margin-bottom: 1em;">
&lt;div style="position: relative; width: 100%; height: 0; padding-top: 56.25%;
box-shadow: 0 2px 8px 0 rgba(63,69,81,0.16); overflow: hidden;
border-radius: 8px;">
&lt;iframe loading="lazy" style="position: absolute; width: 100%; height: 100%; top: 0; left: 0; border: none;"
src="https://www.canva.com/design/DAGsskStCs8/_S2x99q3DaD8UdscVeKfew/watch?embed"
allowfullscreen allow="fullscreen">
&lt;/iframe>
&lt;/div>
&lt;/div>
&lt;p>
Receiving this award has been profoundly impactful for our organization; it helps us reinforce the message that we are trying to deliver to our community. It opened the eyes of the authorities and budget managers, and it is also increasing the organization’s visibility in the region. We want to be seen as an example in the local and regional community—“if a provincial institution can do it, others can too.” We have started receiving calls requesting training for other organizations. So, this award has certainly become pivotal for us.
&lt;/p></description></item><item><title>Crossref at Beijing International Book Fair 2025</title><link>https://www.crossref.org/blog/crossref-at-beijing-international-book-fair-2025/</link><pubDate>Thu, 24 Jul 2025 00:00:00 +0000</pubDate><author>Johanssen Obanda</author><guid>https://www.crossref.org/blog/crossref-at-beijing-international-book-fair-2025/</guid><description>&lt;p>This June, we presented at the Beijing International Book Fair (BIBF) and connected directly with our growing community in China. With a surge of interest from Chinese publishers and partners, it was clear: there’s a strong and rising curiosity around how metadata plays a vital role in maintaining the integrity of the scholarly record.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/bibf-booth.png"
alt="Visitors at Crossref Booth" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>And we were not alone: our incredible Crossref Ambassadors based in the region joined us at the booth, and together we hosted visitors and answered questions. Throughout the fair, we engaged in passionate conversations, provided metadata guidance, and shared our knowledge as part of a panel session focused on how metadata supports scholarship. Ms. Ran Dang, Editorial Director at Atlantis Press (Springer Nature), supports Crossref outreach and advocates for Open Access and Open Science. Ms. Xiaofeng Guo, Director at Sin-Chn Scientific Press, leads DOI infrastructure efforts in China and supports Crossref members across the region. Mr. Gantulga Lkhagva, Founder and CEO of Mongolian Digital Knowledge Solutions and MongoliaJOL, works to strengthen local scholarly publishing and promote metadata best practices.&lt;/p>
&lt;div style="margin:10px;">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/bibf-group-photo.JPG"
alt="Photo: Crossref Ambassadors and Staff" width="100%">&lt;figcaption>
&lt;p>Photo: Crossref Ambassadors and Staff&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>This was the first time some of us had met in person after years of online collaboration, and the sense of connection and shared purpose was energising. Our Ambassadors also contributed to this post, sharing their favourite moments, key takeaways, and stories from the fair.&lt;/p>
&lt;h3 id="a-snapshot-from-the-panel-discussion">A snapshot from the panel discussion&lt;/h3>
&lt;p>During BIBF, we hosted a panel session focused on the role of metadata in supporting scholarship. Ms. Alicia Wang, Vice President - CNPIEC Kexin Technology Co., Ltd, Robbykha Rosalien, Membership Support Specialist - Crossref, Johanssen Obanda - Community Engagement Manager - Crossref, and our Ambassadors joined the panel, and we were glad to have a mix of Crossref members, Metadata Plus users, and curious participants join the discussion.&lt;/p>
&lt;div style="margin:10px;">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/bibf-panel-1.JPG"
alt="Photo: Panel session - Ms. Alicia Wang, Mr. Gantulga Lkhagva, Ms. Robbykha Rosalien, Mr. Johanssen Obanda, Ms. Xiaofeng Guo, Ms. Ran Dang." width="100%">&lt;figcaption>
&lt;p>Photo: Panel session - Ms. Alicia Wang, Mr. Gantulga Lkhagva, Ms. Robbykha Rosalien, Mr. Johanssen Obanda, Ms. Xiaofeng Guo, Ms. Ran Dang.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;div style="margin:10px;">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/bibf-panel-2.2.jpg"
alt="Ms Xiaofeng Guo making a presentation about how metadata supports scholarship" width="100%">&lt;figcaption>
&lt;p>Ms Xiaofeng Guo making a presentation about how metadata supports scholarship&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>Key questions from the session included the status of open abstracts in Crossref, how retracted articles affect citation tracking and research integrity, and what happens when DOIs no longer resolve due to unmaintained landing pages.&lt;/p>
&lt;p>Robbykha explained our DOI resolution and archival systems, clarifying that DOIs are designed to always resolve, even when the original content moves or becomes unavailable. We also touched on the work Crossref is doing to support transparency around retractions, and the goals of The &lt;a href="https://i4oa.org/#" target="_blank">Initiative for Open Abstracts&lt;/a>, which aims to make research summaries more accessible.&lt;/p>
&lt;h3 id="metadata-plus-use-cases-from-china">Metadata Plus use cases from China&lt;/h3>
&lt;p>Two of our &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/" target="_blank">Metadata Plus&lt;/a> users were present during the panel and generously shared how they are leveraging Crossref metadata in their work.&lt;/p>
&lt;p>Jie He from &lt;a href="https://www.scienceing.com/en" target="_blank">ScienceRiver&lt;/a> described how their team translates Crossref metadata from English into Chinese, making it possible for users in China to search for relevant academic literature originally published outside the mainland. Their efforts open up global research to local audiences, bridging language and accessibility gaps. This conversation also led to broader discussions about multilingual metadata and the work our &lt;a href="https://www.crossref.org/blog/metadata-advisory-group-call-for-applications/" target="_blank">Metadata Advisory Group&lt;/a> hopes to support in this area.&lt;/p>
&lt;p>&lt;a href="https://eaapublishing.org/" target="_blank">Eurasia Academic Publishing Group&lt;/a>, based in Hong Kong, talked about using Crossref metadata coupled with AI approaches to develop a tool for readers, editors, and institutions to help assess the integrity of research articles and detect paper mills.&lt;/p>
&lt;h3 id="reflections-from-our-ambassadors-and-the-community">Reflections from our Ambassadors and the community&lt;/h3>
&lt;p>One common thread throughout our time at BIBF was the recognition that many of our resources, documentation, and support materials are still primarily in English. For Chinese-speaking community members who are new to Crossref or metadata concepts, this creates a pretty steep learning curve. We heard this clearly, and we know there’s work to do in making our services more accessible across languages.&lt;/p>
&lt;p>From personal highlights to fascinating conversations, here’s what some of our Ambassadors had to say:&lt;/p>
&lt;p>I am very happy to have met with colleagues from Crossref and several Ambassadors from Asia! We have met many times online, but this was the first time we met face-to-face and worked together to engage with our members and host events! I learned a great deal from our face-to-face exchanges, including updates on Crossref&amp;rsquo;s latest use cases, industry development trends, and even information about my colleagues&amp;rsquo; hometowns. We built friendships and successfully participated in the first BIBF event for Crossref, which was the biggest takeaway!&lt;/p>
&lt;p>我非常高兴，能够与Crossref的同事和亚洲的几位大使见面！我们曾经多次在网络会议中见面，但是这是第一次面对面，并且共同面对用户、举办活动！在我们面对面的交流中我也学到了很多，包括Crossref的最新应用案例，行业发展情况，甚至同事们自己家乡的情况！我们建立了友谊，成功举办了第一次BIBF活动，这是最大的收获！&lt;/p>
&lt;p>At the BIBF exhibition and events, we had good conversations with our Chinese partners and some members, and learned about actual application needs and use cases, which was very helpful to me. Most of the people I met spoke Chinese, but their publishers or institutions may have come from countries and regions outside mainland China, such as Singapore, Hong Kong, and Taiwan.&lt;/p>
&lt;p>在此次BIBF展览和活动中，我们与中国的合作伙伴以及很多用户面对面交流，了解到实际的应用需求和应用案例，这对我帮助很大。我接触的客户多半讲华语，但是他们的出版社或机构可能来自新加坡、香港、台湾等中国大陆以外的国家和地区。&lt;/p>
&lt;p>I also participated in the BIBF Forum events held before the exhibition, including the PubTech Conference, the first STM Asia-Pacific Conference, and the networking dinner. These three events were jointly organised by China National Publications Import and Export (Group) Corporation (CNPIEC), STM, and the Chinese Society of China University Journals (SCUJ). During the events, I heard about the latest developments in the publishing industry and gained valuable insights into hot topics. I also met many new and old friends and partners, some from China and others from around the world. Interacting with them not only allowed me to reminisce about the past but also provided me with new perspectives and expanded my professional network.&lt;/p>
&lt;p>我这次也参加了在展览之前举办的BIBF论坛活动，包括的PubTech论坛，以及首界STM亚太会议和交流晚宴。这三个活动是由中国图书进出口公司（CNPIEC）、STM和中国高校科技期刊研究会（SCUJ）联合举办的。在活动中我听到了很多出版行业的最新发展以及针对热点问题的真知灼见，见到了很多新老朋友和伙伴，他们部分来自中国，部分来自世界各地。与他们交流不仅让我重温旧时光，也获得了新的见解、新的人脉。&lt;/p>
&lt;p>Discussion with Ms. Bo Li from China Education Publication Import &amp;amp; Export Corporation (CEPIEC) on matching papers with their funding grants from China. This is an excellent use case for Crossref&amp;rsquo;s &lt;a href="https://www.crossref.org/services/grant-linking-system/" target="_blank">Grant Linking System (GLS)&lt;/a> service and related metadata. We introduced the GLS service and Crossref metadata to Ms. Bo Li and will follow up with her and her colleagues to help them use Crossref&amp;rsquo;s metadata to complete this task more easily.&lt;/p>
&lt;p>与中国教育图书进出口公司的李博女士讨论为科研基金匹配项目资助的论文元数据。这是一个非常好的应用案例，可以利用Crossref的GLS服务以及相关元数据。我们向李博介绍了GLS服务以及元数据的相关情况，之后还将与她和她的同事进行深入讨论，帮助他们利用Crossref的元数据更快捷地完成此项工作。&lt;/p>
&lt;p>Discussion with Dr. Zhu Xuefeng. Their team has developed an application that identifies research integrity issues in journals and articles. They primarily utilise Crossref metadata (including article metadata and retraction observation data), withdrarXiv, ORCID and Research Organization Registry (ROR) data, among others. By linking and integrating these data, they calculate the research integrity risk of relevant journals and articles, providing a reference for authors submitting manuscripts, editors reviewing manuscripts, and institutions monitoring research integrity issues.&lt;/p>
&lt;p>与朱学峰博士的讨论。他们的团队开发了一款应用程序，识别期刊/文章的科研诚信问题。他们主要利用了Crossref元数据（包括文章元数据和撤稿观察数据），arXiv的撤回数据集，以及ORCID和ROR数据等，通过关联、集成这些数据计算相关期刊/文章的科研诚信风险，为作者投稿、编辑审稿、机构监测科研诚信问题等提供参考。&lt;/p>
&lt;p>At the Crossref BIBF event, Ms. Wang Xuan, Vice President of CNPIEC Kexin Technology Co., Ltd, a Crossref sponsor in China, discussed the strong demand for reliable data sources when applying AI in the field of scientific research, as well as how Crossref metadata can provide strong support. She proposed that all AI products focusing on scientific research should show the original DOIs for the academic resources they cite in the results they provide to users, to enhance the reliability and traceability of data sources. She committed that her company, Ke Xin, as a provider of research AI assistants, will implement this functionality in its products and hopes to promote this as a best practice to all research AI application developers and providers. This reflects that, as cutting-edge technology advances and requirements for research integrity and compliance continue to rise, Crossref metadata continues to play an important role in scholarship and will become increasingly extensive and indispensable.&lt;/p>
&lt;p>在Crossref BIBF活动上，中图科信公司（Crossref中国赞助机构）副总经理王轩女士在讨论中阐述了关于AI在科研领域应用时对于可信数据来源的强烈需求，以及Crossref元数据如何能提供有力支撑的想法。她倡议所有的科研AI产品在为用户提供结果时，应对引用的学术资源提供原始的DOI标识，以增强数据来源的可信度和可追踪性。她承诺中图科信公司作为科研AI助手的提供者将在其产品中实现这一功能，并希望能将此作为最佳实践向所有科研AI应用的开发者、提供者进行推广。这反映了随着前沿科技发展以及科研诚信与合规要求不断提升，Crossref元数据对于学术研究提供的支撑作用将越来越广泛、越来越重要。&lt;/p></description></item><item><title>Connecting the dots: FWFs transition to linked grant metadata to support a thriving culture of openness</title><link>https://www.crossref.org/blog/connecting-the-dots-fwfs-transition-to-linked-grant-metadata-to-support-a-thriving-culture-of-openness/</link><pubDate>Wed, 23 Jul 2025 00:00:00 +0000</pubDate><author>Rocío Gaudioso Pedraza</author><guid>https://www.crossref.org/blog/connecting-the-dots-fwfs-transition-to-linked-grant-metadata-to-support-a-thriving-culture-of-openness/</guid><description>&lt;p>&lt;a href="#version-in-german">&lt;em>Click here for the version in German&lt;/em>&lt;/a>&lt;/p>
&lt;p>As a new Community Engagement Manager at Crossref, dedicated to working with the funders community, I frequently hear requests for examples and case studies of adopting Crossref&amp;rsquo;s &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System (GLS)&lt;/a> by &amp;lsquo;funders like us&amp;rsquo;. This has spurred me to start a series of blog posts presenting funders&amp;rsquo; perspectives on joining Crossref and using our system &amp;ndash; to demonstrate how it&amp;rsquo;s done. &lt;/p>
&lt;p>In the first case study of a series, I speak with Katharina Rieck, Open Science Manager at the Austrian Science Fund (FWF), Austria&amp;rsquo;s national funding agency for basic research, about the agency&amp;rsquo;s approach to research metadata, transparency and openness, and the role that the Grant Linking System plays in it. &lt;/p>
&lt;p>With a strong track record in Open Access and Open Science, the FWF&amp;rsquo;s decision to implement grant IDs represents more than a mere technical upgrade. What began as an initiative to enhance the openness and interoperability of grant information illustrates that truly open research infrastructure is not solely a matter of systems, but about people, policies and collaboration.&lt;/p>
&lt;p>Katharina was also elected to the &lt;a href="https://www.crossref.org/board-and-governance">Crossref Board&lt;/a> at our November 2024 Annual Meeting, and started her three-year term in January 2025.&lt;/p>
&lt;h3 id="could-you-introduce-your-organisation-and-what-is-your-role">Could you introduce your organisation? And what is your role?&lt;/h3>
&lt;p>The Austrian Science Fund (FWF) is Austria&amp;rsquo;s national funding agency for basic research. The FWF funds all disciplines, from Social Sciences and Humanities to Life Sciences and Natural Sciences and Technology. As Open Science Manager, I am responsible for developing the FWF&amp;rsquo;s Open Science strategy, including the development of the &lt;a href="https://www.fwf.ac.at/en/about-us/what-we-do/open-science/open-access-policy/open-access-policy-for-peer-reviewed-publications" target="_blank">Open Access Policy for Peer-Reviewed Publications&lt;/a>, the &lt;a href="https://www.fwf.ac.at/en/about-us/what-we-do/open-science/open-access-policy/open-access-policy-for-research-data" target="_blank">Open Access Policy for Research Data&lt;/a> as well as the FWF &lt;a href="https://www.fwf.ac.at/en/about-us/what-we-do/open-science/research-data-management" target="_blank">Research Data Management Policy&lt;/a>. I am also responsible for the development and implementation of funding instruments such as the FWF &lt;a href="https://www.fwf.ac.at/en/funding/portfolio/communication/open-access-block-grant" target="_blank">Open-Access Block Grant&lt;/a> and support for &lt;a href="https://www.fwf.ac.at/en/about-us/what-we-do/open-science/open-science-infrastructures" target="_blank">Open Science infrastructures&lt;/a>.&lt;/p>
&lt;h3 id="what-motivated-you-to-join-crossref">What motivated you to join Crossref?&lt;/h3>
&lt;p>For more than two decades, the FWF has actively promoted and supported various aspects of Open Science. In 2004, it published its first Open Access Policy, making it one of the first funding organizations worldwide to adopt an Open Access policy for publications. In line with the commitment to open research information as a core pillar of Open Science, the FWF has taken further steps to strengthen openness and transparency: it joined &lt;a href="https://api.crossref.org/funders/501100002428/works?filter=type:grant" target="_blank">Crossref to register grant DOIs&lt;/a> and became a signatory of the &lt;a href="https://www.fwf.ac.at/en/news/detail/fwf-signs-barcelona-declaration-on-open-research-information" target="_blank">Barcelona Declaration on Open Research Information&lt;/a> and joined &lt;a href="https://api.crossref.org/funders/501100002428/works?filter=type:grant" target="_blank">Crossref to register grant DOIs.&lt;/a>&lt;/p>
&lt;p>While funding metadata––information about projects funded by the FWF––has long been freely available on our website, the launch of the &lt;a href="https://www.fwf.ac.at/en/discover/research-radar" target="_blank">Research Radar&lt;/a> in 2023 marked a significant step forward. Our goal was not only to maintain accessibility but to ensure that the data published in the Research Radar is interoperable and aligned with the FAIR principles. By implementing the Grant Linking System from Crossref, we assign each FWF funded project a unique, persistent identifier with associated metadata, helping to make FWF grant information open, interoperable and sustainable.&lt;/p>
&lt;h3 id="can-you-tell-us-about-your-experience-using-the-grant-linking-system">Can you tell us about your experience using the Grant Linking System?&lt;/h3>
&lt;p>We have been using the Grant Linking System since November 2023. With the launch of the FWF&amp;rsquo;s new website and the introduction of the Research Radar, we began registering Crossref grant IDs (DOIs) for all grants included in the Research Radar database. As a result, all FWF-funded projects dating back to 1995 are now uniquely identifiable. The process of registering grant metadata with Crossref is straightforward, and we have set up a smooth internal workflow that enables the registration of DOIs after the FWF&amp;rsquo;s funding decision.&lt;/p>
&lt;p>It is important to note that implementing Crossref grant IDs involved more than just a technical setup––it required the development of new internal processes and coordination through a dedicated Crossref grant DOI implementation group. The implementation process also resulted in a revised structure for grant numbers (DOI suffixes) for FWF-funded projects, establishing a sustainable and future-proof system.&lt;/p>
&lt;h3 id="how-was-your-journey-to-socialise-the-grant-linking-system-within-your-research-community-how-did-you-communicate-the-importance-of-identifiers-and-grant-metadata-to-your-grant-holders">How was your journey to socialise the Grant Linking System within your research community? How did you communicate the importance of identifiers and grant metadata to your grant holders?&lt;/h3>
&lt;p>The introduction of grant DOIs was supported by a comprehensive communication strategy, including dedicated online resources (e.g., &lt;a href="https://www.fwf.ac.at/en/news/detail/neue-identifikations-nummer-fuer-fwf-projekte" target="_blank">New Identification Numbers for FWF Projects –– FWF&lt;/a>), updates across multiple pages of the FWF website (such as &lt;a href="https://www.fwf.ac.at/en/funding/steps-to-your-fwf-project/carrying-out-your-project" target="_blank">Carrying out Your Project –– FWF&lt;/a>), and presentations at various events. This communication strategy aimed to explain the purpose and value of the &amp;ldquo;new numbers&amp;rdquo; ensuring that researchers and stakeholders understood how this contributes to greater visibility, traceability, and openness of funded research.&lt;/p>
&lt;p>As a funding organisation, we require grant recipients to acknowledge FWF support in all research outputs resulting from their projects. With the integration of grant DOIs into FWF&amp;rsquo;s metadata, the standardised acknowledgment text was updated to ensure that the DOIs are now included in outputs. The new required wording is: &amp;lsquo;This research was funded in whole or in part by the Austrian Science Fund (FWF) [grant DOI],&amp;rsquo; and is now a requirement in the FWF funding agreement. Including the grant DOI both in the output metadata and the acknowledgment text enhances traceability and supports more effective analysis of FWF-funded outputs.&lt;/p>
&lt;h3 id="what-do-you-find-useful-about-registering-grant-metadata-with-crossref">What do you find useful about registering grant metadata with Crossref?&lt;/h3>
&lt;p>One of the key benefits of registering grant metadata is the enhanced interconnectivity and the unique identification of FWF&amp;rsquo;s grant information. By registering our grants with Crossref, funding information becomes more than just information on the FWF website––it becomes interoperable data that is accessible and reusable. This not only increases visibility but also enables us to better analyse the outcomes of funded projects and ensures that the data is accessible as well as (re)usable by the broader research community.&lt;/p>
&lt;p>In addition to assigning Crossref Grant IDs and registering grant metadata, the FWF has required &lt;a href="https://orcid.org/" target="_blank">ORCID IDs&lt;/a> for researchers since 2016 and mandates the use of &lt;a href="https://ror.org/" target="_blank">ROR IDs&lt;/a> for institutions. The consistent use of persistent identifiers in metadata ensures the interoperability of FWF grant information and facilitates seamless integration with external data sources.&lt;/p>
&lt;h3 id="what-are-your-hopes-for-the-gls-and-greater-transparency-in-funding-metadata-in-general">What are your hopes for the GLS and greater transparency in funding metadata in general?&lt;/h3>
&lt;p>The FAIRness and openness of research information––including metadata on funding information, research outputs, researchers, and institutions––are fundamental to a well-functioning research ecosystem. I hope to see a broader adoption of persistent identifiers in metadata, particularly in grant information, as well as a broader commitment to openly sharing research information as expressed in the Barcelona Declaration. Moreover, a key objective should be to ensure the highest possible accuracy of metadata at the point of entry. This entails, for instance, that publication metadata accurately includes funding metadata.&lt;/p>
&lt;h3 id="what-were-the-key-challenges-you-encountered-when-embracing-the-gls-and-how-did-you-overcome-them">What were the key challenges you encountered when embracing the GLS, and how did you overcome them?&lt;/h3>
&lt;p>One of the key challenges we encountered when adopting the GLS was ensuring seamless integration in our existing IT infrastructure and workflows. Integrating the new number across different systems required considerable coordination. We overcame this challenge by establishing a dedicated implementation team that included IT experts.&lt;/p>
&lt;p>Another challenge involved communicating and disseminating information regarding the grant DOI, ensuring that researchers and other relevant stakeholders were adequately informed. This was successfully managed through targeted and comprehensive communication efforts.&lt;/p>
&lt;h3 id="based-on-your-experience-what-would-be-your-advice-for-colleagues-from-other-research-funders">Based on your experience, what would be your advice for colleagues from other research funders?&lt;/h3>
&lt;p>It is important to recognise that registering grant identifers and metadata goes beyond a mere technical implementation. This is an opportunity to engage with diverse stakeholders, rethink processes and highlight the value of open funding metadata for the entire research community.&lt;/p>
&lt;p>We are grateful to Katharina Rieck and FWF for generously sharing their insights and know-how. Their experience highlights the importance of seeing metadata not just as information, but as a shared resource that connects and empowers the research community.&lt;/p>
&lt;h3 id="version-in-german">Version in German&lt;/h3>
&lt;p>&lt;em>The title has been changed slightly from the original version. Translation by Lena Stoll.&lt;/em>&lt;/p>
&lt;h2 id="connecting-the-dots-wie-der-fwf-durch-die-umstellung-auf-vernetzte-fördermetadaten-eine-kultur-der-offenheit-fördert">Connecting the Dots: Wie der FWF durch die Umstellung auf vernetzte Fördermetadaten eine Kultur der Offenheit fördert&lt;/h2>
&lt;p>Als neue Community-Engagement-Managerin bei Crossref, die sich der Zusammenarbeit mit Fördergebern widmet, werde ich häufig gefragt, ob ich Beispiele und Fallstudien von „Förderern wie uns“ geben kann, die Crossrefs &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System (GLS)&lt;/a> bereits eingeführt haben. Dies hat mich dazu veranlasst, eine Blogreihe zu starten, in der ich die Perspektiven von Fördergebern auf eine Crossref-Mitgliedschaft und die Nutzung unseres Systems vorstelle – um zu zeigen, wie es funktioniert.&lt;/p>
&lt;p>In der ersten Fallstudie dieser Reihe spreche ich mit &lt;strong>Katharina Rieck&lt;/strong>, Open-Science-Managerin beim Österreichischen Wissenschaftsfonds FWF, Österreichs nationaler Förderagentur für Grundlagenforschung, über den Ansatz des FWF zu Forschungsmetadaten, Transparenz und Offenheit sowie über die Rolle, die das &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System&lt;/a> dabei spielt.&lt;/p>
&lt;p>Mit seiner langjährigen Erfahrung im Bereich Open Access und Open Science stellt die Entscheidung des FWF, Grant-IDs (DOIs für Fördermittel) einzuführen, mehr als nur eine technische Verbesserung dar. Die Initiative begann mit dem Ziel, die Offenheit und Interoperabilität von Förderinformationen zu verbessern, aber schon bald wurde klar, dass eine wirklich offene Forschungsinfrastruktur nicht nur eine Frage der Systeme ist, sondern auch Menschen, Regelwerke, Abläufe und die Zusammenarbeit betrifft.&lt;/p>
&lt;p>Katharina Rieck wurde auf unserer Jahresversammlung im November 2024 außerdem in Crossrefs Board of Directors gewählt und ist im Januar 2025 ihre dreijährige Amtszeit angetreten.&lt;/p>
&lt;h3 id="bitte-stellen-sie-den-fwf-kurz-vor-und-erklären-sie-unseren-leserinnen-was-ihre-rolle-dort-ist">Bitte stellen Sie den FWF kurz vor und erklären Sie unseren Leser:innen, was Ihre Rolle dort ist.&lt;/h3>
&lt;p>Der Österreichische Wissenschaftsfonds FWF ist Österreichs nationale Förderorganisation für Grundlagenforschung. Der FWF fördert alle Disziplinen, von den Sozial- und Geisteswissenschaften über die Lebenswissenschaften bis hin zu Naturwissenschaften und Technik. Als Open-Science-Managerin bin ich für die Entwicklung der Open-Science-Strategie des FWF verantwortlich, einschließlich der Entwicklung der Open-Access-Policy für begutachtete Publikationen, der Open-Access-Policy für Forschungsdaten sowie der FWF-Richtlinie zum Forschungsdatenmanagement. Darüber hinaus bin ich verantwortlich für die Entwicklung und Umsetzung von Förderinstrumenten wie der Open-Access-Pauschale des FWF sowie die Unterstützung von Open-Science-Infrastrukturen.&lt;/p>
&lt;h3 id="was-hat-sie-dazu-bewogen-crossref-beizutreten">Was hat Sie dazu bewogen, Crossref beizutreten?&lt;/h3>
&lt;p>Der FWF fördert und unterstützt seit mehr als zwei Jahrzehnten aktiv verschiedene Aspekte von Open Science. 2004 veröffentlichte er seine erste Open-Access-Policy und war damit eine der ersten Förderorganisationen weltweit, die eine Open-Access-Policy für Publikationen eingeführt haben. Im Einklang mit seinem Engagement für offene Forschungsinformationen als zentrale Säule von Open Science hat der FWF weitere Schritte unternommen, um Offenheit und Transparenz zu stärken: Der FWF ist Crossref beigetreten, um Grant-DOIs zu registrieren, und ist Unterzeichner der &lt;a href="https://www.coalition-s.org/Barcelona-declaration/" target="_blank">Barcelona Declaration on Open Research Information&lt;/a>.&lt;/p>
&lt;p>Zwar sind Metadaten zur Forschungsförderung – also Informationen über FWF-geförderte Projekte – schon seit Langem über unsere Website frei verfügbar. Doch die Einführung des &lt;a href="https://www.fwf.ac.at/en/discover/research-radar" target="_blank">Research Radar&lt;/a> im Jahr 2023 war nochmal ein bedeutender Fortschritt. Unser Ziel war es nicht nur, den offenen Zugang zu den Metadaten aufrechtzuerhalten, sondern auch sicherzustellen, dass die im Forschungsradar veröffentlichten Daten interoperabel und mit den FAIR-Prinzipien vereinbar sind. Durch die Anwendung von Crossrefs &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System&lt;/a> bekommt jetzt jedes vom FWF geförderte Projekt eine eindeutige, unveränderliche ID mit dazugehörigen Metadaten – und die Informationen zu FWF-Fördermitteln sind somit offen, interoperabel und nachhaltig verfügbar.&lt;/p>
&lt;h3 id="können-sie-uns-mehr-über-ihre-erfahrungen-mit-dem-grant-linking-system-erzählen">Können Sie uns mehr über Ihre Erfahrungen mit dem Grant Linking System erzählen?&lt;/h3>
&lt;p>Wir nutzen das &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System&lt;/a> seit November 2023. Mit dem Launch der neuen FWF-Website und des &lt;a href="https://www.fwf.ac.at/en/discover/research-radar" target="_blank">Research Radar&lt;/a> begannen wir damit, Crossref-Grant-IDs (DOIs) für alle in der Forschungsradar-Datenbank enthaltenen Förderungen zu registrieren. Dadurch sind nun alle FWF-geförderten Projekte seit 1995 eindeutig identifizierbar. Die Registrierung von Grant-Metadaten bei Crossref ist unkompliziert, und wir haben einen reibungslosen internen Workflow entwickelt, um DOIs nach der Förderentscheidung des FWF zu registrieren.&lt;/p>
&lt;p>Es ist wichtig zu erwähnen, dass es für die Einführung von Crossref-Grant-IDs mehr als nur den Aufbau technischer Prozesse brauchte – wir haben auch neue interne Abläufe entwickelt und eine eigene Arbeitsgruppe für die Koordination von Crossref-Grant-DOIs gebildet. Im Zuge dieses Prozesses haben wir auch die Struktur der Projektnummern für FWF-geförderte Projekte (also der DOI-Suffixe) überarbeitet und somit ein nachhaltiges und zukunftssicheres System aufgebaut.&lt;/p>
&lt;h3 id="welche-erfahrungen-haben-sie-damit-gemacht-das-grant-linking-system-in-ihrer-forschungscommunity-zu-bewerben-wie-haben-sie-ihren-fördernehmerinnen-die-wichtigkeit-von-identifiern-und-metadaten-vermittelt">Welche Erfahrungen haben Sie damit gemacht, das Grant Linking System in Ihrer Forschungscommunity zu bewerben? Wie haben Sie Ihren Fördernehmer:innen die Wichtigkeit von Identifiern und Metadaten vermittelt?&lt;/h3>
&lt;p>Wir haben die Einführung der Grant-DOIs mit einer umfassenden Kommunikationsstrategie unterstützt, inklusive spezieller Online-Ressourcen (z. B. &lt;em>Neue Identifikationsnummern für FWF-Projekte&lt;/em>), der Aktualisierung mehrerer Seiten auf der FWF-Website (z. B. &lt;em>Projekt durchführen&lt;/em>) sowie Vorträgen bei diversen Veranstaltungen. Ziel dieser Kommunikationsstrategie war es, Zweck und Nutzen der „neuen Nummern“ zu erläutern und sicherzustellen, dass Forschende und Stakeholder verstehen, wie diese zu mehr Sichtbarkeit, Nachvollziehbarkeit und Offenheit der geförderten Forschung beitragen.&lt;/p>
&lt;p>Als Förderorganisation verlangen wir von unseren Fördernehmer:innen, die Unterstützung durch den FWF in allen Forschungsergebnissen zu erwähnen, die aus dem Projekt resultieren. Mit der Integration der Grant-DOIs in die Metadaten des FWF haben wir den standardisierten Acknowledgement-Text aktualisiert, um sicherzustellen, dass die DOIs in den Ergebnissen erwähnt werden. Der neue erforderliche Wortlaut ist: &lt;em>„Diese Forschung wurde gänzlich oder teilweise durch den Wissenschaftsfonds FWF finanziert [Grant-DOI].“&lt;/em> und ist in jedem FWF-Fördervertrag festgeschrieben. Die Angabe von Grant-DOIs sowohl in den Metadaten als auch im Acknowledgement-Text von wissenschaftlichem Output verbessert die Rückverfolgbarkeit und ermöglicht eine genauere Analyse der vom FWF geförderten Ergebnisse.&lt;/p>
&lt;h3 id="was-finden-sie-an-der-registrierung-von-fördermetadaten-bei-crossref-am-hilfreichsten">Was finden Sie an der Registrierung von Fördermetadaten bei Crossref am hilfreichsten?&lt;/h3>
&lt;p>Einer der Hauptvorteile der Registrierung von Fördermetadaten ist die verbesserte Vernetzung und die eindeutige Identifizierung der Förderinformationen des FWF. Durch die Registrierung unserer Projekte bei Crossref werden Förderinformationen zu mehr als nur Informationen auf unserer Website – sie werden zu interoperablen Daten, die abrufbar und wiederverwendbar sind. Dies erhöht nicht nur die Sichtbarkeit, sondern ermöglicht uns auch eine bessere Analyse der Ergebnisse geförderter Projekte und stellt sicher, dass die Daten für die allgemeine Forschungsgemeinschaft zugänglich und (wieder-)verwendbar sind.&lt;/p>
&lt;p>Neben der Vergabe von Crossref-Grant-IDs und der Registrierung von Fördermetadaten schreibt der FWF seit 2016 &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a> für Forschende sowie die Verwendung von &lt;a href="https://ror.org/" target="_blank">ROR IDs&lt;/a> für Institutionen vor. Die konsequente Verwendung persistenter IDs in den Metadaten gewährleistet die Interoperabilität der FWF-Förderinformationen und erleichtert die nahtlose Integration mit externen Datenquellen.&lt;/p>
&lt;h3 id="was-erhoffen-sie-sich-vom-gls-und-von-mehr-transparenz-bei-fördermetadaten-im-allgemeinen">Was erhoffen Sie sich vom GLS und von mehr Transparenz bei Fördermetadaten im Allgemeinen?&lt;/h3>
&lt;p>Die FAIRness und Offenheit von Forschungsinformationen – einschließlich der Metadaten zu Förderinformationen, Forschungsergebnissen, Forschenden und Institutionen – sind für ein gut funktionierendes Forschungsökosystem wesentlich. Ich hoffe auf eine weiterreichende Anwendung von persistenten IDs in Metadaten, insbesondere in Förderinformationen, und auf ein größeres Engagement für den offenen Austausch von Forschungsinformationen, wie es zum Beispiel in der &lt;a href="https://www.coalition-s.org/Barcelona-declaration/" target="_blank">Barcelona Declaration on Open Research Information&lt;/a> gefordert wird. Darüber hinaus sollte sichergestellt werden, dass die Metadaten bereits bei der Eingabe und damit bei ihrer Generierung möglichst korrekt sind. Das bedeutet unter anderem, dass die Metadaten von Publikationen die korrekten Fördermetadaten enthalten sollten.&lt;/p>
&lt;h3 id="welche-herausforderungen-sind-bei-der-einführung-des-gls-aufgetreten-und-wie-haben-sie-diese-gemeistert">Welche Herausforderungen sind bei der Einführung des GLS aufgetreten und wie haben Sie diese gemeistert?&lt;/h3>
&lt;p>Eine der größten Herausforderungen bestand darin, das &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System&lt;/a> nahtlos in unsere bestehende IT-Infrastruktur und Arbeitsabläufe zu integrieren. Die „neue Nummer“ in die unterschiedlichen Systeme zu integrieren, bedeutete einen hohen Koordinationsaufwand. Gemeistert haben wir diese Herausforderung durch die Bildung einer eigenen Arbeitsgruppe für die Anwendung von Crossref-Grant-DOIs, in der auch IT-Expert:innen vertreten waren.&lt;/p>
&lt;p>Eine weitere Herausforderung bestand in der Kommunikation und Verbreitung von Informationen zu Grant-DOIs, um Forschende und andere Stakeholder angemessen zu informieren. Das haben wir durch gezielte und umfassende Kommunikationsmaßnahmen erreicht.&lt;/p>
&lt;h3 id="basierend-auf-ihrer-eigenen-erfahrung-welchen-ratschlag-würden-sie-kolleginnen-bei-anderen-fördergebern-mitgeben">Basierend auf Ihrer eigenen Erfahrung, welchen Ratschlag würden Sie Kolleg:innen bei anderen Fördergebern mitgeben?&lt;/h3>
&lt;p>Es ist wichtig zu verstehen, dass die Registrierung von Grant-IDs und Metadaten über eine bloße technische Umsetzung hinausgeht. Der Prozess bietet die Gelegenheit, mit verschiedenen Stakeholdern in Kontakt zu treten, Abläufe zu überdenken und den Wert offener Fördermetadaten für die gesamte Forschungsgemeinschaft zu unterstreichen.&lt;/p>
&lt;p>Wir danken Katharina Rieck und dem FWF für ihre Bereitschaft, ihre Erkenntnisse und ihr Know-how so großzügig zu teilen. Ihr Erfahrungsbericht hat uns gezeigt, wie wichtig es ist, Metadaten nicht nur als Informationen zu betrachten, sondern als eine gemeinsame Ressource, die die gesamte Forschungsgemeinschaft vernetzen und stärken kann.&lt;/p></description></item><item><title>Data Science @Crossref</title><link>https://www.crossref.org/blog/data-science-@crossref/</link><pubDate>Mon, 07 Jul 2025 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/data-science-@crossref/</guid><description>&lt;p>To address the growing scale and complexity of scholarly data, we&amp;rsquo;ve launched a new data science function at Crossref. In April, we were excited to welcome our first data scientists, &lt;a href="https://www.crossref.org/people/jason-portenoy/">Jason Portenoy&lt;/a> and &lt;a href="https://www.crossref.org/people/alex-b%C3%A9dard-vall%C3%A9e/">Alex Bédard-Vallée&lt;/a>, to the team. With their arrival, the Data Science team is now fully up and running. In this blog post, we&amp;rsquo;re sharing our vision and what&amp;rsquo;s ahead for data science at Crossref.&lt;/p>
&lt;h2 id="new-approach-to-achieve-our-mission">New approach to achieve our mission&lt;/h2>
&lt;p>Over the last few years, we have witnessed substantial growth of the scholarly community in general, and Crossref in particular. This has been reflected in the increase in the volume and variety of the data we collect, store and process, including scholarly metadata and Crossref operational data related to membership, DOI registrations, billing, usage measurement, and other activities.&lt;/p>
&lt;p>On the one hand, this growth opens new possibilities for using the data to better understand the scholarly landscape, serve our community, develop services, and make informed decisions. On the other hand, it forces us to address a set of challenges related to the scale and complexity of the data.&lt;/p>
&lt;p>The new Data Science team, created as part of &lt;a href="https://doi.org/10.64000/bm6g0-gvy36" target="_blank">last year&amp;rsquo;s broader organisational changes&lt;/a>, will address these challenges and fulfil our data-related ambitions. As part of our &lt;a href="https://www.crossref.org/strategy/">strategic mission&lt;/a>, we created the following vision for the Data Science team within Crossref and our community:&lt;/p>
&lt;p>&lt;strong>The Data Science team uses scientific research and data science to deliver, assess, improve, and enrich scholarly metadata.&lt;/strong>&lt;/p>
&lt;p>The work of the Data Science team broadly entails two types of projects: 1) data analysis &amp;amp; insights; and 2) data services &amp;amp; workflows.&lt;/p>
&lt;p>&lt;strong>Data analysis &amp;amp; insights&lt;/strong>: The goal of these kinds of projects is to broaden our understanding of the scholarly record and our community and help Crossref make decisions in a data-driven way, without trying to create any specific application or product. They will help Crossref explore new strategic directions, make more informed decisions, monitor the trends and outcomes of certain decisions and policies, and discover and share new insights with the community. This category also involves large and small data assessments and analyses, measuring and monitoring certain metrics, verifying hypotheses, answering questions using data, monitoring trends in the metadata, forecasting, data visualisation, reporting, and interpreting results.&lt;/p>
&lt;p>&lt;strong>Data services &amp;amp; workflows&lt;/strong>: The goal of these kinds of projects is to apply scientific knowledge and data analysis to build and maintain Crossref services, tools, and workflows. The Data Science team collaborates with other Crossref teams on the research, design and implementation of the Crossref system and its various components. This will involve modelling across different data stores and APIs, as well as designing efficient and robust data workflows for various processes, including metadata deposit, validation, and dissemination. Furthermore, the team will investigate and implement modern tools and techniques for efficient data processing, storage and analysis, and strategies for data enrichment. Finally, the Data Science team is involved in planning and implementing comprehensive monitoring and reporting for various features and services.&lt;/p>
&lt;h2 id="connecting-with-the-community">Connecting with the community&lt;/h2>
&lt;p>Crossref exists as part of a diverse, global community of 22,000 members from 160 countries, plus countless systems that rely on our metadata. Launching the new Data Science function gives us a great opportunity to connect more deeply and in new ways with the wider scholarly community. We&amp;rsquo;re keen to engage with Crossref members, users of our services, and partner organisations to better understand trends and needs, and to contribute to others&amp;rsquo; community initiatives and awareness.&lt;/p>
&lt;p>One area we&amp;rsquo;re particularly interested in is the growing range of initiatives in the &lt;a href="https://en.wikipedia.org/wiki/Metascience" target="_blank">metascience&lt;/a> space. We&amp;rsquo;re looking to expand and solidify our understanding of how researchers use our data and services, and to learn more about their needs and perspectives. These insights will help inform the design and functionality of our data workflows and APIs over the long term.&lt;/p>
&lt;p>We&amp;rsquo;re also committed to supporting the scholarly community&amp;rsquo;s efforts to preserve the &lt;a href="https://www.crossref.org/community/special-programs/research-integrity/">integrity of the scholarly record (ISR)&lt;/a>. By applying modern, scalable data processing techniques, we aim to help detect and investigate potential issues affecting metadata quality, including both intentional manipulation and unintentional errors or inconsistencies.&lt;/p>
&lt;p>More broadly, we&amp;rsquo;re looking forward to engaging with our community on scalable data processing approaches, as well as best practices and standards for processing and enriching scholarly metadata.&lt;/p>
&lt;h2 id="introducing-new-members-of-the-team">Introducing new members of the team&lt;/h2>
&lt;p>We couldn&amp;rsquo;t pursue our ambitious goals without the dedication and passion of our team. In April, we were thrilled to welcome two data scientists, Jason Portenoy and Alex Bédard-Vallée, to the Crossref team.&lt;/p>
&lt;p>Alex Bédard-Vallée brings over six years of experience extracting meaningful insights from data within the research and scholarly publishing sector, applying it to large-scale bibliometric data, aiming to better serve the scholarly community. Prior to Crossref, during his tenure at Elsevier, he was instrumental in modernising data infrastructure, significantly enhancing the efficiency of massive research data pipelines. His contributions included developing automated data quality checks, creating reusable Python tools to streamline data access, and leveraging machine learning techniques to uncover research trends. Alex provided key insights for major reports, contributing to evaluations for the Canada Research Chairs Program and the NSF Science and Engineering Indicators between 2020 and 2024. Alex holds an M.Sc. in Quantum Physics (2018) and a B.Sc. in Physics (2016) from the Université de Sherbrooke.&lt;/p>
&lt;p>&lt;a href="https://www.jasport.org/" target="_blank">Jason Portenoy&lt;/a> is a New York-based data scientist with a background in bibliometric research and building applications using scholarly data. Through his work, he has become a passionate advocate for the maintenance and improvement of high-quality scholarly metadata. He holds a PhD in Information Science from the University of Washington where he studied how scholarly metadata can offer insights into scientific activity and help develop tools to address information overload. He brings experience working at OpenAlex, Semantic Scholar, and other organisations concerned with scholarly communication. Most recently, he was the Senior Data Engineer at OpenAlex, and he is now excited to continue his work using data science to support and strengthen crucial open scholarly infrastructure.&lt;/p>
&lt;h2 id="whats-next-for-us">What&amp;rsquo;s next for us?&lt;/h2>
&lt;p>In the short term, we are focusing on two main projects: analysing how reliably DOIs resolve, and detecting discrepancies in bibliographic references at scale.&lt;/p>
&lt;p>&lt;strong>DOI resolutions&lt;/strong>: DOIs are persistent identifiers and links that are meant to consistently resolve to landing pages that represent the object they identify and Crossref has certain obligations that members have to adhere to, one of which is that if the location of the landing page changes, it is the responsibility of the member to update the metadata so the DOI continues to resolve correctly. &lt;a href="https://doi.org/10.64000/hv6t0-0h481" target="_blank">Some prior work&lt;/a> has suggested this doesn&amp;rsquo;t always happen, so there are some gaps in the scholarly record. We&amp;rsquo;re now analysing metadata from a broad sample of members to better understand the scale of the issue, and to identify cases where members may need to update their metadata records.&lt;/p>
&lt;p>&lt;strong>Detecting discrepancies in bibliographic references&lt;/strong>: Following &lt;a href="https://doi.org/10.48550/arXiv.2501.03771" target="_blank">last year&amp;rsquo;s reports&lt;/a> of discrepancies between bibliographic references in metadata records and those found in full-text PDFs, we&amp;rsquo;ve explored ways to run broader, systematic checks across a larger set of members and metadata records. The goal was to understand how widespread these inconsistencies are and to identify cases where members may need support in correcting references in their metadata records. Ultimately, we aim to create a collaborative process that improves the accuracy and reliability of bibliographic references across the scholarly record, enhancing research discovery and reproducibility and ensuring impact assessments are reliable.&lt;/p>
&lt;p>Look out for forthcoming blog posts with more details on these projects!&lt;/p>
&lt;p>Looking further ahead, Crossref has two big projects for which the Data Science team will serve central roles: developing dashboards, and improving metadata matching.&lt;/p>
&lt;p>&lt;strong>Data dashboards&lt;/strong>: We are planning to develop a series of dashboards to monitor the state of the scholarly record over time. These will include both work-level statistics (e.g., how many works of a given type have been registered?) and more detailed insights at the relationship level (e.g., how many bibliographic references have been automatically matched? How often are ROR IDs included in funder assertions?). Upstream, this will require us to build an environment where all relevant data sources can be combined, as well as adopting a suite of scalable tools and data processing techniques.&lt;/p>
&lt;p>&lt;strong>Metadata matching&lt;/strong>: In April, we commenced the matching project. It is a major effort to rebuild Crossref&amp;rsquo;s metadata matching workflows using modern software development and data science practices. The goal is to create a dedicated consolidated matching workflow that will eventually replace all existing production matching processes, with results made available through the REST API. This project covers six matching tasks: bibliographic reference matching, funder name matching, preprint matching, affiliation matching, grant matching, and title matching.&lt;/p>
&lt;p>(In the meantime, as we do not have a good mechanism to add matching results to the REST API yet, we separately released two datasets with relationships discovered by automated matching strategies: &lt;a href="https://doi.org/10.5281/zenodo.15124417" target="_blank">a dataset of relationships between preprints and journal articles&lt;/a>, and &lt;a href="https://doi.org/10.5281/zenodo.15254993" target="_blank">a dataset of relationships involving research organisations&lt;/a>.)&lt;/p>
&lt;p>As you can tell, we are very excited about Crossref&amp;rsquo;s role in the modern, open, community-focused future of scholarly infrastructure. The new Data Science team is a crucial component of this vision. If you&amp;rsquo;re interested in collaborating or learning more about data science at Crossref, we&amp;rsquo;d love to hear from you!&lt;/p></description></item><item><title>Scholarly blogs and their place in the research nexus</title><link>https://www.crossref.org/blog/scholarly-blogs-and-their-place-in-the-research-nexus/</link><pubDate>Tue, 24 Jun 2025 00:00:00 +0000</pubDate><author>Lena Stoll</author><guid>https://www.crossref.org/blog/scholarly-blogs-and-their-place-in-the-research-nexus/</guid><description>&lt;p>If you are reading this blog on our website, you may have noticed that alongside each post we now list a Crossref DOI link, which was not the case a few months ago (though we have retroactively added DOIs to all older posts too). You can find the persistent link for this post right above this paragraph. Go on, click on it, we’ll wait.&lt;/p>
&lt;p>Are you back here? Good. As you probably expected, the DOI link for this post resolves to the post itself, and you should use it anytime you want to &lt;a href="https://www.crossref.org/documentation/reference-linking/">cite this post&lt;/a>. But the DOI does more than just point readers to this page––it is part of a rich metadata record that includes the authors’ ORCID iDs, the publication date, and more. In other words, the posts on this blog are part of what we call the &lt;a href="https://www.crossref.org/documentation/research-nexus/">research nexus&lt;/a>: the open network of relationships connecting research outputs, people, organisations, and actions.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2022/research--nexus-2021.png"
alt="Crossref research nexus vision" width="75%">&lt;figcaption>
&lt;p>Crossref research nexus vision&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;h3 id="why-blogs-deserve-a-place-in-the-scholarly-record">Why blogs deserve a place in the scholarly record&lt;/h3>
&lt;p>A blog post may not be the first thing that comes to mind when you think of scholarly outputs. But scholarly blogs have been around since at least the early 2000s and have carved out a niche for themselves as a type of “grey literature” that allows researchers to write about research in a way that may not fit neatly into more traditional, peer-reviewed publishing venues, but also is too long-form for social media. Science blogs can give readers a window into ongoing work that isn’t ready to publish yet, serve as a self-publishing venue, or allow researchers to comment on others’ work and recent developments in science and science communication. These kinds of perspectives add crucial context to the scholarly record that should not be overlooked.&lt;/p>
&lt;p>However, as Martin Fenner &lt;a href="https://doi.org/10.13003/t8azz4brot" target="_blank">explained&lt;/a> at the #Crossref2023 annual meeting, blogs have largely not benefitted from the metadata and long-term archiving solutions that tend to be applied to more “traditional” forms of publishing. As a result, most blogs have been left out of the scholarly record. But in recent years, there have been some efforts in the community to change this. Earlier this year, ORCID added support for the work type &lt;code>blog post&lt;/code>, &lt;a href="https://info.orcid.org/new-work-types/" target="_blank">among others&lt;/a>, to align more closely with the Confederation of Open Access Repositories (COAR) vocabulary of resource types.&lt;/p>
&lt;p>At our &lt;a href="https://doi.org/10.5281/zenodo.15389087" target="_blank">2025 midyear community update&lt;/a>, we asked our community what content types they saw as growing in importance. Blog posts were mentioned several times as a ‘trending’ record type, and as one that members would like to see support for in the Crossref system.&lt;/p>
&lt;h3 id="eating-our-own-dog-food">Eating our own dog food&lt;/h3>
&lt;p>We had already been thinking for a while about how our own blog should be a part of the research nexus. We started out by &lt;a href="https://www.crossref.org/documentation/register-maintain-records/direct-deposit-xml/admin-tool/">manually uploading XML files through our Admin tool&lt;/a> for each post. We did this for a few months and quickly found, like many of our members do, that this can be a laborious and error-prone process.&lt;/p>
&lt;p>In the product management world, the process of using the products you usually spend your time building and maintaining is often referred to as &lt;a href="https://doi.org/10.1109/MS.2006.72" target="_blank">dogfooding&lt;/a>. The idea is that firsthand experience makes it easier to understand your end users’ needs and feel their pain - and we have certainly found that registering metadata for our blog posts has reinforced the importance of &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">making manual registration easier for our members&lt;/a>, but also of supporting and enabling machine-to-machine integrations.&lt;/p>
&lt;h3 id="what-did-we-do">What did we do?&lt;/h3>
&lt;p>The Crossref website, which includes this blog, uses an open-source static site generator named &lt;a href="https://gohugo.io/" target="_blank">Hugo&lt;/a>. Rather than using a content management system (CMS), we edit the website content in Markdown format using code editors. Whenever we start working on a post for this blog, we not only write the content of the post itself, but also include some front matter for the page, which contains some key metadata about the post.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2025/blog-front-matter-example.png"
alt="Screenshot of the front matter of a Crossref blog post in Hugo" width="65%">&lt;figcaption>
&lt;p>The front matter of a &lt;a href="https://doi.org/10.64000/x8xqg-95792" target="_blank">recent post&lt;/a> on this blog&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>We wanted this metadata to be part of the research nexus. But then there was also the question of archiving. Our &lt;a href="https://www.crossref.org/membership/terms/">membership terms&lt;/a> state that:&lt;/p>
&lt;blockquote>
&lt;p>The Member shall use best efforts to contract with a third-party archive or other content host (an &amp;ldquo;Archive&amp;rdquo;) (a list of which can be found &lt;a href="https://keepers.issn.org/keepers" target="_blank">here&lt;/a>) for such Archive to preserve the Member’s Content and, in the event that the Member ceases to host the Member’s Content, to make such Content available for persistent linking.&lt;/p>
&lt;/blockquote>
&lt;p>So we knew that if this blog was to be part of the scholarly record, we would need to ensure that it would be available in perpetuity, even if &lt;a href="https://www.crossref.org" target="_blank">www.crossref.org&lt;/a> were to go offline one day.&lt;/p>
&lt;p>Doing this properly was starting to look like a sizeable project!&lt;/p>
&lt;p>Fortunately, we knew that others had already done some great work in this field, so we would not have to start from scratch. After considering our options, we opted to integrate our blog with an established workflow for registering blog metadata: the &lt;a href="https://rogue-scholar.org" target="_blank">Rogue Scholar&lt;/a> service.&lt;/p>
&lt;p>The Rogue Scholar was launched in 2023 by Martin Fenner as an archive for scholarly blog posts, hosted by &lt;a href="https://front-matter.io" target="_blank">Front Matter&lt;/a>. Rogue Scholar improves science blogs in important ways, including full-text search, long-term archiving, and DOIs and metadata, such as versions and relationships along with identifiers such as ORCID iDs and ROR IDs. It provides the necessary tools to treat blog posts as research outputs through better attribution, preservation, and discoverability.&lt;/p>
&lt;h3 id="how-did-we-do-it">How did we do it?&lt;/h3>
&lt;p>Rogue Scholar works on the basis of consuming RSS and ATOM feeds (you may remember them from the days of getting headlines direct to your browser or feed reader). We created a &lt;a href="https://www.crossref.org/blog/feed.xml" target="_blank">new feed&lt;/a>, including the proposed DOI as each entry’s &lt;code>id:&lt;/code> and taking full advantage of the ATOM format by listing the post’s authors and including their ORCID iDs. We also provide the entire post as the entry’s &lt;code>&amp;lt;content&amp;gt;&lt;/code> to allow for full-text indexing and archiving.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2025/blog-xml-feed-entry.png"
alt="Screenshot of the XML feed entry for a Crossref blog post" width="120%">&lt;figcaption>
&lt;p>The XML feed entry for a &lt;a href="https://doi.org/10.64000/x8xqg-95792" target="_blank">recent post&lt;/a> on this blog&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>For each post, we generate and assign a unique DOI under the Crossref prefix &lt;code>10.64000&lt;/code>. The Rogue Scholar integration then registers the DOI along with the metadata of the post as &lt;a href="https://www.crossref.org/documentation/research-nexus/posted-content-includes-preprints/">posted content&lt;/a>. If you are interested in getting a similar workflow set up for your blog, you can read more in the Rogue Scholar &lt;a href="https://doi.org/10.53731/fz73s-sv368" target="_blank">blog&lt;/a> and &lt;a href="https://docs.rogue-scholar.org/" target="_blank">documentation&lt;/a>.&lt;/p>
&lt;h3 id="what-does-the-future-hold-for-scholarly-blogs">What does the future hold for scholarly blogs?&lt;/h3>
&lt;p>Researchers are increasingly sharing their early work, or commenting on others’ work, in less formal ways, and if you look at the growth in the number of blogs covered in the Rogue Scholar platform in just a couple of years, it seems like science blogging is here to stay and will only increase. We believe that this practice is an integral part of a healthy scholarly ecosystem, and it needs to be represented in the research nexus.&lt;/p>
&lt;p>The Crossref input schema does not include a &lt;code>blog&lt;/code> work type, but we are planning to add it as a subtype of posted content in our next schema update. We will discuss this and other plans and ideas in the &lt;a href="https://doi.org/10.64000/n23nw-3d593" target="_blank">metadata advisory group&lt;/a> that we are currently forming.&lt;/p>
&lt;p>If you have thoughts on the role of blogs in the public discourse around science and science communications, or you would like to share your experience of registering metadata for your blog, let us know by commenting below. Your comments will be threaded in our &lt;a href="https://community.crossref.org/" target="_blank">community forum&lt;/a> for discussion.&lt;/p></description></item><item><title>Sprinting to Progress: Behind the scenes of our first metadata sprint</title><link>https://www.crossref.org/blog/sprinting-to-progress-behind-the-scenes-of-our-first-metadata-sprint/</link><pubDate>Mon, 23 Jun 2025 00:00:00 +0000</pubDate><author>Luis Montilla</author><guid>https://www.crossref.org/blog/sprinting-to-progress-behind-the-scenes-of-our-first-metadata-sprint/</guid><description>&lt;p>If you take a peek at our blog, you’ll notice that metadata and community are the most frequently used categories. This is not a coincidence – community is central to everything we do at Crossref. Our first-ever Metadata Sprint was a natural step in strengthening both. &lt;em>Cue fanfare!&lt;/em>. And what better way of celebrating 25 years of Crossref?&lt;/p>
&lt;p>We designed the Crossref Metadata Sprint as a relatively short event where people can form teams and tackle short problems. What kind of problems? While we expected many to involve coding, teams also explored documenting, translating, researching—anything that taps into our open, member-curated metadata. Our motivation behind this format was to create a space for networking, collaboration, and feedback, centered on co-creation using the scholarly metadata from our REST API, the Public Data File, and other sources.&lt;/p>
&lt;h2 id="what-have-we-learned-in-planning">What have we learned in planning&lt;/h2>
&lt;p>The journey towards the event was filled with valuable lessons and learnings from our community. Our initial call received submissions from 71 people, which was exciting but presented the first challenge: we felt our event would work better with a relatively smaller group. An additional challenge we faced was the enthusiasm from people from different regions of the world who were eager to join, but needed support to attend in person. It reminded us how global our community is, and how important it is to think about different ways of making participation possible, especially in future events.&lt;/p>
&lt;p>We also wanted to make sure that participation wasn’t limited by technical background. The selection process included a preliminary review by several members of our team to bring in a mix of perspectives and reduce bias. The event welcomed participants from all kinds of expertise levels, including colleagues who had never worked with APIs before. We sought to provide common ground for all with several group calls, where we presented introductions to our tools and used the opportunity to collect requests about tools, specific data, and questions from the participants that could enhance their preparation during the sprint.&lt;/p>
&lt;h2 id="at-the-crossref-metadata-sprint">At the Crossref Metadata Sprint&lt;/h2>
&lt;p>I’ve recently stumbled upon the following quote from a recognized data scientist:&lt;/p>
&lt;blockquote>
&lt;p>Numbers have an important story to tell. They rely on you to give them a clear and convincing voice. (Stephen Few) &lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>&lt;/p>
&lt;/blockquote>
&lt;p>It made me think that we can replace &lt;em>numbers&lt;/em> for &lt;em>metadata&lt;/em> and the idea still holds. Surrounded by the paleontological collections of the National Museum of Natural History, on 8th of April in Madrid, 21 participants and 5 Crossref staff came together to work on twelve different projects. These ranged from improvements to our Public Data file formats and exploring metadata completeness, to tackling multilingual metadata challenges, understanding citation impact for retracted works, and connecting Retraction Watch metadata with other knowledge graphs metadata.
&lt;br>&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/sprint_collage.jpg"
alt="A mosaic of pictures depicting groups of people working on their laptops" width="70%">
&lt;/figure>
&lt;/div>
&lt;center>The different teams that participated in the first Crossref Metadata Sprint. &lt;/center>
&lt;br>
The initial hours were the most energetic (but not chaotic!) as most of the participants had the chance to interact in person for the first time, ideas were exchanged, and pre-formed groups became more stable (however, one of the advantages of the format is that teams don't have to be rigid). Twelve coffee- and tea-powered projects started taking shape, a few of which are part of larger ideas under development. By the end of the second day, we saw:
&lt;ul>
&lt;li>Author changes between preprints and published articles.&lt;/li>
&lt;li>Coverage of funding information by publisher.&lt;/li>
&lt;li>Enriching citations with Crossref metadata.&lt;/li>
&lt;li>Funding metadata completeness.&lt;/li>
&lt;li>Improvement to the Public Data File.&lt;/li>
&lt;li>Interoperability between Crossref DOIs and hash-based identifiers.&lt;/li>
&lt;li>University of Tetova’s metadata coverage.&lt;/li>
&lt;li>Retraction Watch data mash-up.&lt;/li>
&lt;li>Perspective about AI-driven multilingual metadata.&lt;/li>
&lt;li>Public Data File in Google Big Query.&lt;/li>
&lt;li>Visibility of retractions across citations.&lt;/li>
&lt;li>Visualising Crossref geographic member data.&lt;/li>
&lt;/ul>
&lt;p>Our team worked as part of some of these projects, providing valuable insights and feedback to the participants. We ended the first session with a group dinner and re-energised for the second day, which started with everybody fully immersed in their tasks. As we approached the conclusion, the groups started preparing some quick slides for a short presentation (that you can find &lt;a href="https://doi.org/10.13003/gpvx-dbde" target="_blank">here&lt;/a>).&lt;/p>
&lt;p>Our team and the participants left excited and looking forward to the next opportunity to collaborate. We certainly see the potential of recreating these spaces, and we&amp;rsquo;ll work on future editions in a different location. All of the project summaries and notes will remain stored in our &lt;a href="https://gitlab.com/crossref/metadata-sprints/sprint-2025" target="_blank">metadata sprint Gitlab repo&lt;/a>. Would you like to know more about any of these ideas? Let us know in the comments.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/sprint_hex.jpg"
alt="An arragement of hexagons summarizing key facts about the 1st Metadata Sprint." width="70%">&lt;figcaption>
&lt;p>The first Crossref Metadata Sprint in a nutshell&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;h2 id="participants">Participants&lt;/h2>
&lt;p>None of this would’ve been possible without our enthusiastic participants. Huge thanks to everyone! Here is the full list of those who attended our inaugural Sprint:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Name&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0009-0000-8076-8420" target="_blank">Blessing Abumere&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ana Bermejo&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Robert Bianchi&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0003-1507-1031" target="_blank">Adam Buttrick&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0009-0007-7718-4126" target="_blank">María de la Paz&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-1150-3469" target="_blank">Nicoleta Roxana Dinu&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0009-0002-7388-2166" target="_blank">Jack Ekinsmyth&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-5014-4809" target="_blank">Castedo Ellerman&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Álvaro Hontanar&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-5965-6560" target="_blank">Bianca Kramer&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0003-1353-5584" target="_blank">Anne L&amp;rsquo;Hôte&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0003-4855-7038" target="_blank">Cyril Labbe&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0009-0003-9439-1443" target="_blank">Alexandra Malaga&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-6824-3856" target="_blank">Agon Memeti&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0001-8739-5823" target="_blank">Kaitlin Newson&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-2843-8990" target="_blank">Yağmur Öztürk&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0003-3331-9889" target="_blank">Dietrich Rordorf&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0003-1615-1471" target="_blank">Mohamed Selim&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-8976-3404" target="_blank">Sajad Sepehri&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0002-7958-9828" target="_blank">Ramazan Turgut&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://orcid.org/0000-0001-6403-5550" target="_blank">Iñaki Úcar&lt;/a>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>&lt;a href="https://www.forbes.com/sites/brentdykes/2016/03/31/data-storytelling-the-essential-data-science-skill-everyone-needs/" target="_blank">https://www.forbes.com/sites/brentdykes/2016/03/31/data-storytelling-the-essential-data-science-skill-everyone-needs/&lt;/a>&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Evolving the preprint evaluation world with Sciety</title><link>https://www.crossref.org/blog/evolving-the-preprint-evaluation-world-with-sciety/</link><pubDate>Tue, 17 Jun 2025 00:00:00 +0000</pubDate><author>Luis Montilla</author><guid>https://www.crossref.org/blog/evolving-the-preprint-evaluation-world-with-sciety/</guid><description>&lt;p>This post is based on an interview with Sciety team at eLife.&lt;/p>
&lt;h2 id="what-is-sciety">What is Sciety?&lt;/h2>
&lt;p>Sciety is a community-led initiative developed by a team within eLife, that brings together expert evaluations of papers in one place. It is focused on preprints, preprint review and curation.&lt;/p>
&lt;h2 id="can-you-tell-us-more-about-how-sciety-works">Can you tell us more about how Sciety works?&lt;/h2>
&lt;p>Sciety aggregates preprints from different sources to facilitate the processes of discovery and evaluation. Groups can triage the content and offer preprint reviews and endorsements, and individual researchers can learn about and share preprints of interest and their evaluations. We see the value of increasing trust in preprints, and transparency around the process of peer review, and we are trying to highlight this value and encourage more people to take part.&lt;/p>
&lt;p>There are two key angles to Sciety: first, as preprints proliferate, we’re helping to make people more productive in their research by only surfacing the content they might be interested in and that they know they can trust. Second, we are also trying to get more people involved in the public review and curation of preprints. Contributors on Sciety are part of ‘groups’, representing organisations and other communities that facilitate some form of preprint evaluation. We&amp;rsquo;re broadly talking about peer review, but we also see the highlighting and summarisation of research. eLife, Biophysics Colab, MetaROR and Gigabyte, for example, are all providing some kind of review summary which Sciety shows as a ‘curation statement’. There’s also this additional layer of individual curation on top of it: we have people creating their own highlights in lists which they curate by topic; for example, ‘preprints by authors in the Global South’ or ‘Papers we want to discuss in our lab’. There is also an update feed available to users to help them keep track of all the reviews and endorsements from the groups they follow. We post these assessments and reviews alongside the preprint, which others can then use as an indicator of trust: why should one care about this particular study? As a given group – let’s say GigaByte – and its reviewers highlight the specific strengths of a preprint or reference an updated version, this feedback offers essential context for readers.&lt;/p>
&lt;p>By making this evaluation and curation activity visible, Sciety clarifies who has reviewed the work and which groups have added it to their lists. These signals are invaluable for readers seeking reliable, curated research. The activity feed, which at present shows you all the added value in the form of comments, reviews and curation we are bringing from diverse sources, could be expanded to show different forms of curation activity in the future. Furthermore, other providers ingest and surface this information on their own platforms, such as Europe PMC and bioRxiv.&lt;/p>
&lt;h2 id="what-is-your-main-use-of-crossref-resources">What is your main use of Crossref resources?&lt;/h2>
&lt;p>We started using the Crossref API to pull in the front matter of articles. Originally, these were only bioRxiv preprints, and then we expanded to various other preprint servers. We would aggregate reviews and build on top of all the preprint servers that have put the authors&amp;rsquo; content out there.&lt;/p>
&lt;p>We were mostly after a representation of the papers that we could link to: titles, authors, abstracts, publication dates, and, to have a way to go from the DOI of a paper, a classic Crossref entry point. Initially, we used the public API, but the performance wasn&amp;rsquo;t high enough for what we needed and we switched to Metadata Plus. This immediately increased the speed at which we got data to the point where we could compose pages on the fly and talk to Crossref simultaneously. Even if we needed to pull 10 or 20 different paper titles at the same time to show a list of articles, it stayed that way for a long time. Next, we implemented caching – that is, we started storing temporal local copies to improve performance further. Eventually, we expanded the set of preprint servers we were interested in. It&amp;rsquo;s always been quite a good experience to be able to put in a DOI and use the same code, essentially, to pull out titles, author information and so on. Crossref does this great job of aggregating the world of content so that we don&amp;rsquo;t have to. The metadata standardisation via Crossref’s API saves us the need to write special code for every new preprint server.&lt;/p>
&lt;p>By the end of 2023, we were interested in multiple revisions and versions of a single preprint. Because the scholarly world is moving on, we can now see cases where the updates to a manuscript produce multiple versions in bioRxiv, and these might eventually evolve into an article in eLife, Nature, or another journal. The publication history complexity of papers has been increasing and we started relying a lot on Crossref to trace the relationships and the different versions of a paper across time. There is some good support on the relationship metadata on Crossref APIs, where you can see that a preprint has a new version with a different DOI, or conversely, that a preprint has an older version. Or you can see that a preprint has become a journal article, or the journal article was originally a preprint – along with all the dates that accompany these different versions. And we can establish the time it took for a preprint to become a journal article. In some cases it can take years, which is not great, right? We don&amp;rsquo;t want science to be stuck and not relied upon for years. So it helps us to make our case that preprints are the evolution of publishing, that authors publish them and then the preprints evolve rather than being stuck between gates kept by journals.&lt;/p>
&lt;h2 id="what-can-you-tell-us-about-the-use-of-preprints">What can you tell us about the use of preprints?&lt;/h2>
&lt;p>We have noticed an increase in the interest in how a paper evolves over time and the cross-links between different preprint expressions or journal articles. We&amp;rsquo;re now seeing enthusiasm from those who are trying alternative publishing models to bring reviewed scientific preprints to people faster, and there is also interest in the transparency of a journal. And I think that&amp;rsquo;s part of what the Crossref relationship metadata gives us.&lt;/p>
&lt;p>For example, we collaborated on a paper aimed at enhancing the culture of preprint peer review. One of the things we observed was that it was published on an OSF preprint server, and then went on to be published in PLOS Biology. As we&amp;rsquo;d started this project to show the relationships between something that had originally been a preprint, we noticed that the connection between PLOS and OSF for that specific preprint was not explicit. So, we asked a colleague if this was something that could be done. And our contact at PLOS said, “yes, we&amp;rsquo;ll do this”. At the time, we were aware of Crossref’s intention to either make this more manageable or to do it in bulk. This also prompted another group on Sciety to explore whether they could do the same. Consequently, GigaByte and GigaScience, two other reviewing communities on Sciety, inquired with their publishing platform, Riverview, if they could do the same. Eventually, they realised there was a way to connect the dots through Crossref, and they also started doing it. So, there seems to be a lot of enthusiasm around this idea of making the relationships more explicit: we should show if something has been a preprint, because it&amp;rsquo;s important to the authors, and it’s important to show the transparency in the journey. That was a real-world example of something that we&amp;rsquo;re able to service through Sciety by using the Crossref metadata, and the community is responding in a very positive manner to that.&lt;/p>
&lt;h2 id="how-has-your-experience-been-using-crossref-services-what-are-you-looking-forward-to-seeing-in-the-future">How has your experience been using Crossref services? What are you looking forward to seeing in the future?&lt;/h2>
&lt;p>The &lt;em>works&lt;/em> endpoint is really the 99% of what we have been historically interested in. We generally experiment by putting DOIs in the public API or trying to discover content in the API itself. The amount of data is so big that there are always different examples of what we seek. And we don&amp;rsquo;t have many performance problems now because we have adopted some aggressive caching. So anything that comes from Crossref is typically cached for 24 hours.&lt;/p>
&lt;p>For example, take a bioRxiv preprint that might have multiple versions available on bioRxiv itself, because it&amp;rsquo;s quite common for authors to update the preprint as they make new changes to it. With this context, an example of something we would like to see is supporting &lt;strong>the preprint version number&lt;/strong>. So this is something that we could implement for bioRxiv over some specific preprint servers on Sciety. But in the end, as we expanded our set of preprint servers, we had to get rid of that, because there wasn&amp;rsquo;t a sustainable way to aggregate it across most servers, like we would do with Crossref. So there&amp;rsquo;s probably a space there for papers as living documents. And we certainly have an interest in preprint-specific metadata – that&amp;rsquo;s where we will place our bets.&lt;/p>
&lt;p>Also, as part of the preprint review metadata group, which is something that formed out of the recent meeting with EMC Europe and ASAPbio, we&amp;rsquo;re trying to drive forward a &lt;strong>recommendation and prototypes for more consistency in preprint review metadata&lt;/strong>. It&amp;rsquo;s quite exciting to be involved in this and, as you can see, Sciety is a place where we&amp;rsquo;re starting to pull all this stuff together. And like I say, it is a bit of a Wild West. &lt;strong>There are so many things that are called a review, but in metadata, we know there are different terminologies.&lt;/strong> As people are saying, everyone should be commenting on preprints, everyone should be curating them, and we&amp;rsquo;re trying to make some sense of that.&lt;/p>
&lt;p>Working on Sciety and exploring Crossref metadata to make preprint review more open and valuable has been a rewarding experience.&lt;/p>
&lt;p>&lt;em>With thanks to Giorgio Sironi, former Tech Lead Manager, and Mark Williams, Product Manager, at eLife&lt;/em>&lt;/p></description></item><item><title>Destacando nuestra comunidad en Colombia</title><link>https://www.crossref.org/blog/destacando-nuestra-comunidad-en-colombia/</link><pubDate>Thu, 05 Jun 2025 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/destacando-nuestra-comunidad-en-colombia/</guid><description>&lt;p>&lt;a href="#english">&lt;em>English version&lt;/em>&lt;/a>&lt;/p>
&lt;p>Dado que Crossref celebra su 25º aniversario este año, nos gustaría destacar algunas de las regiones activas y comprometidas en nuestra comunidad global.&lt;/p>
&lt;p>Durante los primeros 25 años, la composición de los miembros de Crossref ha evolucionado significativamente. De un puñado de grandes editoriales fundadoras, ahora tenemos más de 22.000 miembros de 160 países. Casi dos tercios de ellos se identifican como universidades, bibliotecas, entidades gubernamentales, fundaciones, editoriales académicas, e institutos de investigación.&lt;/p>
&lt;p>Una de las regiones de mayor crecimiento es Latinoamérica, con más de 3.200 miembros, la mitad de los cuales se unió en los pasados cinco años. Colombia fue uno de los primeros miembros de Crossref en Latinoamérica y continua siendo uno de los países más activos con 242 organizaciones.&lt;/p>
&lt;p>&lt;em>&amp;ldquo;Creo que las organizaciones en Colombia siempre están abiertas a nuevos cambios, y a implementar nuevas estrategias que permitan mejorar o generar vínculos entre diversos actores, el programa Nexo podría verse de gran utilidad puesto que Colombia está uno de los grandes generadores de investigación en la región, y el poder conectar de una manera ágil y rápida toda una red de investigación va a representar grandes ventajas en los procesos&amp;rdquo;&lt;/em>, &amp;ndash; dice nuestro Embajador Juan Felipe Vargas Martínez, Cofundador y Director de Journals &amp;amp; Authors, en Medellín.&lt;/p>
&lt;p>Una de las razones del aumento en la participación en Colombia es nuestro &lt;a href="https://www.crossref.org/community/sponsors/">programa de patrocinadores&lt;/a>. Los patrocinadores proveen apoyo a organizaciones más pequeñas que a menudo enfrentan barreras financieras, técnicas, y linguísticas que les dificultan convertirse en miembros de Crossref. Uno de los primeros patrocinadores en Colombia, Journals &amp;amp; Authors, se unió en 2016, siendo de los primeros en Latinoamérica. Ahora tenemos cinco patrocinadores ubicados en Colombia, apoyando 114 miembros.&lt;/p>
&lt;p>Nuestros patrocinadores también han sido aliados clave en ayudarnos a interactuar con la comunidad, facilitando numerosos webinars y apoyando nuestras reuniones presenciales en Colombia en 2019 y 2024. Su conocimiento de la comunidad editorial a lo largo del país y sus extensas redes ayudan a las organizaciones nuevas a aprender más sobre Crossref de manera accesible, y a crecer continuamente la participación con nosotros.&lt;/p>
&lt;p>También tenemos embajadores altamente dedicados ubicados en Colombia que son fuertes promotores de la misión de Crossref: Nicolás Mejía Torres y Juan Felipe Vargas Martínez. A lo largo de los años, ellos han sido instrumentales en ayudar a organizar eventos presenciales y webinarios para miembros, así como también en representar a Crossref en eventos a en Latinoamérica. Puedes aprender sobre nuestras discusiones en el resumen de los eventos más recientes en nuestro &lt;a href="https://community.crossref.org/" target="_blank">Foro Comunitario&lt;/a>. Recientemente Juan Felipe y Nicolás participaron en la Feria Internacional del Libro en Bogotá donde presentaron una charla sobre los beneficios de los &lt;a href="https://drive.google.com/file/d/1B1u4SpSjsJRydfcBSfplonU6GMNjMyrc/view?usp=drive_link" target="_blank">metadatos académicos abiertos&lt;/a>.&lt;/p>
&lt;p>Nuestra membresía en Colombia está conformada fundamentalmente por universidades, sociedades, e instituciones públicas. Casi todas las revistas dejan su contenido disponible abiertamente. La mayoría del contenido de revistas se publica usando la plataforma de publicación OJS de PKP - Colombia es &lt;a href="https://rpubs.com/saurabh90/ojs-stats-2022" target="_blank">el 8vo mayor usuario de OJS globalmente&lt;/a>, y el segundo mayor en Latinoamérica.&lt;/p>
&lt;p>&lt;em>&amp;ldquo;Entendemos que hay todavía mucho margen de uso de editoriales colombianas de Crossref.&amp;rdquo;&lt;/em> Jaime Iván Hurtado, CEO &amp;amp; Fundador de Hipertexto-Netizen, un patrocinador de Crossref, reporta que &lt;em>&amp;ldquo;algunas hacen uso del DOI pero centradas en revistas tímidamente en los libros y poco en los capítulos de libros,&amp;rdquo;&lt;/em> Hipertexto ha estado contribuyendo al incremento en el uso de identificadores persistentes para libros y capítulos de libros a través de sus herramientas y manejo estadarizado de metadatos.&lt;/p>
&lt;p>Los miembros de Crossref a menudo conocen la importancia de los identificadores persistentes para su contenido, pero hay una necesidad de incrementar la conciencia sobre los beneficios y la importancia de incluír metadatos adicionales. Estamos concientes que muchos editores ofrecen su tiempo de manera voluntaria lo cual puede limitar su disponibilidad para entrenamiento adicional y participación en eventos relacionados con la edición y las buenas practicas para el manejo de metadatos. Queremos aumentar las oportunidades para el entrenamiento tanto presencial como remotamente, y nuestros patrocinadores y embajadores han sido aliados clave en la facilitación de estos eventos. En febrero de 2024 nos aliamos con nuestro patrocinador Biteca en un &lt;a href="https://zenodo.org/records/10728097" target="_blank">evento de dos días&lt;/a> en Bogotá, en el que participaron más de 100 miembro. Hubo diuscusiones activas sobre los fundamentos de Crossref y el rol de los metadatos de calidad en la visibilidad de contenido, así como también presentaciones sobre la integridad y ética en la investigación y la publicación, con compañeros clave como COPE, PKP, Scielo, y DOAJ.&lt;/p>
&lt;p>En Colombia no hay un requerimiento de usar identificadores persistentes (o no específicamente el DOI). Cada institución decide si usarlos de manera independiente, así que vemos con agrado tantos miembros de Crossref activos, registrando su contenido, y cada mes se unen más. Ellos reconocen el beneficio de los metadatos, así como también el ser parte de la comunidad de Crossref en general: &lt;em>&amp;ldquo;En Colombia, Crossref es un referente gracias al uso del DOI. Si bien en sus inicios este identificador se veía como otro requisito más que complicaba el trabajo de las editoriales, hoy es reconocido como una herramienta clave para mejorar la visibilidad y el impacto de las publicaciones. Asimismo, Crossref, a través de sus encuentros y recursos, brinda apoyo a los equipos editoriales al ofrecer pautas, herramientas e información valiosa que facilita la adopción de buenas prácticas y el cumplimiento de estándares de calidad&amp;rdquo;&lt;/em> reporta Luz Ayda Becerra, Consultora de Innovación con nuestro patrocinador Biteca.&lt;/p>
&lt;p>Las organizaciones tienen varias razones para convertirse en miembros de Crossref - la principal motivación es incrementar la visbilidad global de su contenido y, por lo tanto, incrementar el impacto de sus publicaciones. Los metadatos de Crossref son accesibles de manera abierta para todos en la comunidad. Cada mes tenemos millones de búsquedas en nuestra base de datos por parte de investigadores, bibñiotecas, herramientas que perfilan autores, servicios de búsqueda, y muchos más. Otras partes usan estos metadatos para crear herramientas y servicios que incrementan la visibilidad y la recuperabilidad del contenido de los miembros.&lt;/p>
&lt;p>Sin embargo, existen desafíos que los miembros aún enfrentan cuando trabajan con nosotros. El obstáculo más frecuentemente mencionado al trabajar con Crossref es el lenguaje. La mayoría de nuestros correos electrónicos, documentación y herramientas están en inglés, y a los miembros les gustaría tener la oportunidad de recibir soporte, recursos y correspondencia en español. Aquellos que trabajan con patrocinadores se benefician de soporte de esta manera. Estamos aumentando el número de oportunidades de&lt;a href="https://zenodo.org/records/14902103" target="_blank"> entrenamiento remoto&lt;/a> y webinarios en español, y nuestros embajadores han estado interactuando con la comunidad local para proveer &lt;a href="https://www.crossref.org/services/">recursos adicionales&lt;/a>. A principios de este año, el primer miembro de nuestro equipo ubicado en un país de Latinoamérica se unió a nuestro equipo de soporte técnico, y ahora podemos proveer soporte en español (&lt;a href="https://community.crossref.org/t/ticket-of-the-month-april-2025-como-hacer-consultas-en-la-rest-api-de-crossref/13740" target="_blank">recursos como este&lt;/a> aparecerán más frecuentemente ahora). Reconocemos que aun tenemos trabajo por hacer para que Crossref sea más accesible a las comunidades globalmente.&lt;/p>
&lt;p>Nuestros miembros han sugerido que más eventos locales y presenciales serian beneficiosos. Y estamos de acuerdo que las interacciones cara a cara son una manera clave para nosotros construir relaciones e incrementar la representación y visibilidad en las comunidades, y aspiramos a crear oportunidades de interacturas con nuestros miembros en todos los rincones del mundo.&lt;/p>
&lt;p>Mostrar como se utilizan los metadatos puede resaltar los beneficios y la importancia de incluir metadatos adicionales. Varios de nuestros miembros y Patrocinadores han solicitado entrenamiento adicional en español sobre el uso de nuestras APIs, lo cual les permitiría obtener y analizar &lt;a href="https://drive.google.com/file/d/1PY1LtIWGktRD4IRpTV1EZSeR2OPTKDSS/view?usp=drive_link" target="_blank">elementos clave de los metadatos&lt;/a>.&lt;/p>
&lt;p>&lt;em>&amp;ldquo;Al especializarse en este tipo de tecnologías, puedo analizar y estructurar la información de manera efectiva, generando informes útiles para los editores. Esto facilita la toma de decisiones informadas sobre sus publicaciones, optimizando la gestión editorial y asegurando una mejor visibilidad e impacto de los contenidos académicos.&amp;rdquo;&lt;/em> (Luz Ayda Becerra)&lt;/p>
&lt;p>En años anteriores Crossref ha sido invitado a participar en webinars y eventos presenciales en Colombia, dado el interés en crecimiento y la conciencia de la importancia de los metadatos para la comunidad de investigadores y la visibilidad de las publicaciones.&lt;/p>
&lt;p>Gran parte de la información en este reporte proviene de encuestas enviadas a nuestros miembros, patrocinadores, y embajadores en Colombia. Apreciamos toda la retroalimentación, comentarios y sugerencias que hemos recibido, y queremos continuar la colaboración e incrementar la interacción con la comunidad.&lt;/p>
&lt;hr>
&lt;p>&lt;a id="english">&lt;/a>&lt;/p>
&lt;h2 id="english-version">English Version&lt;/h2>
&lt;h2 id="a-spotlight-on-our-community-in-colombia">A spotlight on our community in Colombia&lt;/h2>
&lt;p>As Crossref celebrates its 25th anniversary this year, we would like to highlight some of the active and engaged regions in our global community.&lt;/p>
&lt;p>Over the past 25 years, the makeup of Crossref membership has evolved significantly; from a handful of founding large publishers, we now have more than 22,000 members from 160 countries. Nearly two-thirds of them self-identify as universities, libraries, government agencies, foundations, scholar publishers, and research institutions.&lt;/p>
&lt;p>One of our fastest-growing regions is Latin America, with over 3,200 members, half of whom joined us in the past five years. Colombia was one of the early adopters of Crossref from Latin America and remains one of our most active countries with 242 organisations.&lt;/p>
&lt;p>&lt;em>&amp;ldquo;I believe that organisations in Colombia are always open to new changes and to implementing new strategies that allow for improvement or the creation of connections between diverse actors. The Research Nexus program could be very useful since Colombia is one of the largest producers of research in the region, and being able to connect an entire research network quickly and efficiently will represent significant advantages in the processes&amp;rdquo;,&lt;/em> &amp;ndash; says our Ambassador Juan Felipe Vargas Martínez, Co-founder and Director, Journals &amp;amp; Authors, in Medellín.&lt;/p>
&lt;p>One of the reasons for increased participation in Colombia is our &lt;a href="https://www.crossref.org/community/sponsors/">sponsor program&lt;/a>. Sponsors provide support for smaller organisations that often face financial, technical, and language barriers that make becoming a member difficult.  Our first sponsor in Colombia, Journals &amp;amp; Authors, joined in 2016, one of our first in Latin America. We now have five sponsors based in Colombia, supporting 114 members.&lt;/p>
&lt;p>Our sponsors have also been key partners in helping us engage with the community, facilitating numerous webinars and supporting our in-person meetings in Colombia in 2019 and 2024. Their knowledge of the publishing community across the country and extensive networks help new organisations learn more about Crossref in an accessible way, and continuously grow participation with us.&lt;/p>
&lt;p>We also have very dedicated &lt;a href="https://www.crossref.org/community/ambassadors/">ambassadors&lt;/a> based in Colombia who are strong advocates for Crossref&amp;rsquo;s mission: Nicolás Mejía Torres and Juan Felipe Vargas Martínez. Over the years, they have been instrumental in helping to organise in-person events and webinars for members, as well as representing Crossref at events throughout Latin America. You can learn more about our discussions from the summary of the latest event on our &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>. Most recently, Juan Felipe and Nicolás attended the Bogotá International Book Fair, where they gave a presentation on the &lt;a href="https://drive.google.com/file/d/1B1u4SpSjsJRydfcBSfplonU6GMNjMyrc/view?usp=drive_link" target="_blank">benefits of open academic metadata&lt;/a>.&lt;/p>
&lt;p>Our membership in Colombia is made up primarily of universities, societies, and public institutions. Almost all journals make their content openly available. Most of the journal content is published using the OJS publishing platform from PKP. Colombia is the&lt;a href="https://rpubs.com/saurabh90/ojs-stats-2022" target="_blank"> eighth-largest user of OJS globally&lt;/a> and the second-largest in Latin America.&lt;/p>
&lt;p>&lt;em>&amp;ldquo;There is still considerable scope for Colombian publishers to utilise Crossref&amp;rdquo;&lt;/em>  Jaime Iván Hurtado, CEO &amp;amp; Founder of Hipertexto-Netizen, a Crossref sponsor, reports that &lt;em>&amp;ldquo;while organisations use DOIs most commonly for journals, there&amp;rsquo;s potential for greater use for books and chapters.&amp;rdquo;&lt;/em> Hipertexto has been contributing to the increased use of persistent identifiers for books and book chapters through their tools and standardised metadata management.&lt;/p>
&lt;p>Members often know the importance of persistent identifiers for their content, but there is a need to increase awareness of the benefits and importance of including additional metadata. We&amp;rsquo;re aware that many editors volunteer their time, which can limit their availability for additional training and participation in events related to publishing and metadata best practices. We aim to increase opportunities for training, both in-person and online, and our sponsors and ambassadors have been key partners in facilitating these events. In February 2024, we partnered with our Sponsor, Biteca, on a &lt;a href="https://zenodo.org/records/10728097" target="_blank">two-day event&lt;/a> in Bogotá, attended by over 100 members. There were lively discussions on the fundamentals of Crossref and the role of quality metadata for content discovery, as well as additional presentations on research integrity and publication ethics, with key partners including COPE, PKP, Scielo, and DOAJ.&lt;/p>
&lt;p>There is no requirement to use persistent identifiers (or specifically DOIs) in Colombia. Each institution decides whether to use them independently, so we&amp;rsquo;re delighted to see so many are active Crossref members, registering their content, and more are joining every month. They recognise the benefit of metadata, as well as being part of the Crossref community at large: &lt;em>&amp;ldquo;In Colombia, Crossref is a benchmark thanks to its use of the DOI. While initially viewed as yet another requirement that complicated the work of publishers, this identifier (and related metadata) is now recognised as a key tool for improving the visibility and impact of publications. Furthermore, through its meetings and resources, Crossref supports editorial teams by offering guidelines, tools, and valuable information that facilitate the adoption of best practices and compliance with quality standards,&amp;rdquo;&lt;/em> reports Luz Ayda Becerra, Innovation Advisor with our sponsor, Biteca.&lt;/p>
&lt;p>Organisations have various reasons for becoming members with Crossref &amp;ndash; the main motivation is to increase the global visibility of their content and, therefore, to increase the impact of their publications. Crossref&amp;rsquo;s metadata is openly accessible and free for everyone in the community. Each month, we have millions of queries to our database from researchers, libraries, author profiling tools, discovery services and many more. Third parties use this metadata to create tools and services that increase visibility and discoverability of members&amp;rsquo; content.&lt;/p>
&lt;p>There are, however, challenges that members still face when working with us. The most frequently listed obstacle in working with Crossref is language. Most of our emails, documentation and tools are in English, and members would like the opportunity for support, resources, and correspondence in Spanish. Those working with sponsors benefit from their support in this way. For all, we are increasing the number of Spanish language &lt;a href="https://zenodo.org/records/14902103" target="_blank">online training opportunities &lt;/a>and webinars, and our ambassadors have been engaging with the local community to provide &lt;a href="https://www.crossref.org/services/">additional resources&lt;/a>. Earlier this year, the first staff member based in Latin America joined our technical support team, and we can now provide Spanish language support (&lt;a href="https://community.crossref.org/t/ticket-of-the-month-april-2025-como-hacer-consultas-en-la-rest-api-de-crossref/13740" target="_blank">resources like this&lt;/a> will appear more frequently now). We recognise that we still have work to do to make Crossref more accessible to global communities.&lt;/p>
&lt;p>Members have suggested that more local in-person events would be beneficial. And we agree - face-to-face interactions are a key way for us to build relationships and increase representation and visibility in communities, and we aspire to create opportunities to engage with members in all corners of the world.&lt;/p>
&lt;p>Showing how metadata is utilised can show the benefits and importance of including additional metadata. Several of our members and sponsors have requested additional Spanish language training on using our APIs, which would enable them to obtain and analyse &lt;a href="https://drive.google.com/file/d/1PY1LtIWGktRD4IRpTV1EZSeR2OPTKDSS/view?usp=drive_link" target="_blank">key metadata elements&lt;/a>.&lt;/p>
&lt;p>&lt;em>&amp;ldquo;By specialising in these technologies, I can effectively analyse and structure information, generating useful reports for editors. This facilitates informed decision-making regarding their publications, optimising editorial management, and ensuring greater visibility and impact of scholarly content.&amp;rdquo;&lt;/em> (Luz Ayda Becerra)&lt;/p>
&lt;p>Over the past several years, Crossref has been invited to participate in webinars and in-person events in Colombia, as there is an increased interest and awareness of the importance of metadata for the research community and the visibility of publications.&lt;/p>
&lt;p>Much of the information in this report is taken from a survey sent to our members, sponsors, and ambassadors in Colombia. We appreciate all the feedback, comments, and suggestions we received, and we look forward to continuing our collaborations and increasing our engagement with the community.&lt;/p></description></item><item><title>Our annual open call for expressions of interest to join our board</title><link>https://www.crossref.org/blog/our-annual-open-call-for-expressions-of-interest-to-join-our-board/</link><pubDate>Wed, 14 May 2025 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/our-annual-open-call-for-expressions-of-interest-to-join-our-board/</guid><description>&lt;p>The Crossref Nominating Committee invites expressions of interest to join the Board of Directors of Crossref for the term starting in January 2026. The committee will gather responses from those interested and create the slate of candidates that our membership will vote on in an election in September.&lt;/p>
&lt;p>&lt;strong>Expressions of interest will be due Monday, June 9th, 2025&lt;/strong>&lt;/p>
&lt;p>This is an exciting time to join the board, as we have a number of active projects underway. Our focus is on how our community and metadata can contribute to ensuring the integrity of the scholarly record. We are redesigning our content system to better serve the changing needs of our community. We’re broadening our metadata record to capture richer funding and institutional affiliations. New board members will be part of on-going discussions about how to make our fees simpler and more equitable. Additionally, we envision a future where the scholarly record prioritizes relationships between research outputs to build a holistic research nexus. The board helps guide this work.&lt;/p>
&lt;h3 id="about-our-board-elections">About our board elections&lt;/h3>
&lt;p>The board is elected through the “one member, one vote” policy wherein every member organisation of Crossref has a single vote to elect representatives to the Crossref board. Board terms are for three years, and this year, there are five seats open for election.&lt;/p>
&lt;p>The board maintains a balance of seats, with eight seats for smaller members and eight seats for larger members (based on total revenue to Crossref). This is an effort to ensure that the scholarly community&amp;rsquo;s diversity of experiences and perspectives is represented in decisions made at Crossref.&lt;/p>
&lt;p>This year, we will elect four of the larger member seats (membership tiers $3,900 and above) and one of the smaller member seats (membership tiers $1,650 and below). You don’t need to specify which seat you are applying for; we will provide that information to the nominating committee.&lt;/p>
&lt;p>The online election will open in September, with results announced at the annual meeting scheduled for October 22nd. New members will begin their term in January 2026.&lt;/p>
&lt;h3 id="about-the-nominating-committee">About the Nominating Committee&lt;/h3>
&lt;p>The Nominating Committee reviews the expressions of interest and selects a slate of candidates for election. The slate put forward will exceed the total number of open seats. The committee considers the statements of interest, organisational size, geography, and experience.&lt;/p>
&lt;ul>
&lt;li>James Phillpotts*, Oxford University Press, committee chair&lt;/li>
&lt;li>Abiodun Falodun, University of Benin&lt;/li>
&lt;li>Wendy Patterson*, Beilstein Institut&lt;/li>
&lt;li>Chaerul Umam, National Library of Indonesia&lt;/li>
&lt;li>Amanda Ward*, Taylor &amp;amp; Francis&lt;/li>
&lt;/ul>
&lt;p>(*) indicates Crossref board member&lt;/p>
&lt;h3 id="board-roles-and-responsibilities">Board roles and responsibilities&lt;/h3>
&lt;p>Crossref’s services provide a central infrastructure for scholarly communications. Crossref’s board helps shape the future of our services and by extension, impacts the broader scholarly ecosystem. We are looking for board members to contribute their experience and perspective.&lt;/p>
&lt;p>The role of the board at Crossref is to provide strategic and financial oversight of the organisation, as well as guidance to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ul>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ul>
&lt;p>The board represents of our membership base and guides the staff leadership team on trends affecting scholarly communications.&lt;/p>
&lt;p>The work of the board takes place in board meetings and board committees. The board sets strategic directions for the organisation while also providing oversight into policy changes and implementation. Board members join four meetings each year that typically take place in January, March, July, and November. The July meeting is in-person and may take place in a variety of international locations; travel support is provided when needed. January, March, and November board meetings are held virtually, and all committee meetings take place virtually. Each board member should sit on at least one Crossref committee. Care is taken to accommodate the wide range of time zones in which our board members live.&lt;/p>
&lt;p>While the expressions of interest are specific to an individual, the seat that is elected to the board belongs to the member organisation. The primary board member also names an alternate who may attend meetings in the event that the primary board member is unable to. There is no personal financial obligation to sit on the board. The member organisation must remain in good standing.&lt;/p>
&lt;p>Board members are expected to be comfortable assuming the responsibilities listed above and to prepare and participate in board meeting discussions.&lt;/p>
&lt;h3 id="who-can-apply-to-join-the-board">Who can apply to join the board?&lt;/h3>
&lt;p>Any active member of Crossref can apply to join the board. Crossref membership is open to organisations that produce content, such as academic presses, commercial publishers, standards organisations, and research funders.&lt;/p>
&lt;h3 id="what-does-the-committee-look-for">What does the committee look for?&lt;/h3>
&lt;p>The committee looks for skills and experience that will complement the rest of the board. Candidates from countries and regions not currently reflected on the board are strongly encouraged to apply. Successful candidates often have some or all of these characteristics:&lt;/p>
&lt;ul>
&lt;li>Demonstrate a commitment to or understanding of our &lt;a href="https://www.crossref.org/strategy/" target="_blank">strategic agenda&lt;/a> or the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a>&lt;/li>
&lt;li>Have expertise that may be underrepresented on the board currently;&lt;/li>
&lt;li>Hold decision-making positions in their organisations;&lt;/li>
&lt;li>Have experience with governance or community involvement;&lt;/li>
&lt;li>Represent member organisations that are active in the scholarly communications ecosystem;&lt;/li>
&lt;li>Demonstrate metadata best practices as shown in the member’s &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>The board is also encouraging Crossref members who are research funders to apply.&lt;/p>
&lt;h3 id="what-does-the-application-evaluation-process-look-like">What does the application evaluation process look like?&lt;/h3>
&lt;p>&lt;strong>Open call for board interest, May 14 to June 9th&lt;/strong>: Any active member in good standing can apply for a seat on the board. This includes direct members, sponsored members, and GEM members. Sponsoring organisations, service providers, and Metadata Plus subscribers who are not also members are not eligible to sit on the board.&lt;/p>
&lt;p>&lt;strong>Application review, June through August&lt;/strong>: Applications will be reviewed by our Nominating Committee. We also gather internal information about the member organisation, such as metadata habits, history with Crossref, any previous experience in Crossref working groups or community initiatives.&lt;/p>
&lt;p>We might also refer to external information to help the committee’s review including LinkedIn profiles or member organisation websites and publications.&lt;/p>
&lt;p>&lt;strong>Brief interviews with final candidates, August&lt;/strong>: The committee will hold brief virtual interviews with the top candidates before finalising the slate of nominations.&lt;/p>
&lt;p>&lt;strong>Announcement of the slate and election, September&lt;/strong>: The committee will announce the final slate of candidates in September and the online election will begin, culminating at the annual meeting at the end of October.&lt;/p>
&lt;h3 id="how-to-apply">How to apply&lt;/h3>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSc_4uF4kSEPy6GN6p2LjLAMWF2YY7g_NEmTNXPXqZM_NkbhOQ/viewform?usp=dialog" target="_blank">click here to submit your expression of interest&lt;/a> by Monday, June 9th. We ask for a brief statement about how your organisation could enhance the Crossref board and a brief personal statement about your interest and experience with Crossref.&lt;/p>
&lt;p>Please contact me with any questions at &lt;a href="mailto:voting@crossref.org">voting@crossref.org&lt;/a>&lt;/p></description></item><item><title>Notice of amendments to Crossref membership terms and bylaws</title><link>https://www.crossref.org/blog/notice-of-amendments-to-crossref-membership-terms-and-bylaws/</link><pubDate>Sun, 11 May 2025 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/notice-of-amendments-to-crossref-membership-terms-and-bylaws/</guid><description>&lt;p>In its March 2025 meeting, the Crossref board unanimously voted to update both the Crossref bylaws and the Crossref membership terms to:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Provide more clarity and alignment between our bylaws and membership terms, where they had become out of sync over the years.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Reflect previous board motions and bring both documents up-to-date with current processes for suspending and revoking membership, and reviewing those decisions.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Work towards being more explicit about what &amp;ldquo;Member Practices&amp;rdquo; should look like in terms of preserving the integrity of the scholarly record.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h4 id="link-to-updated-membership-termsmembershiptermsmember-terms-2025-and-link-to-updated-bylawsboard-and-governancebylaws">&lt;a href="https://www.crossref.org/membership/terms/member-terms-2025">Link to updated membership terms&lt;/a> and &lt;a href="https://www.crossref.org/board-and-governance/bylaws/">link to updated bylaws&lt;/a>&lt;/h4>
&lt;br>
&lt;p>The bylaw changes are effective immediately, and the updated version of the membership terms will come into effect on 11th July 2025.&lt;/p>
&lt;p>In accordance with the 60-day notice period, we have emailed the Primary contact on all our active member accounts today. Note: Members do not need to do anything in response to these changes - by continuing to use our services after 11th July, they are accepting the latest version of the terms.&lt;/p>
&lt;h3 id="changes-to-the-membership-terms">Changes to the membership terms&lt;/h3>
&lt;p>The membership terms will be updated on 11th July to be clearer on, among other things, the importance of accurate metadata, using DOI links everywhere, the all-important reference linking obligation, and the process for suspending and revoking/terminating membership. It also introduces the new concept of &amp;ldquo;&lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/member-practices">Member Practices&lt;/a>&amp;rdquo;, which a dedicated community committee will propose for board approval. More information about this will follow soon.&lt;/p>
&lt;p>You can find the specific changes below, or take a look at &lt;a href="https://www.crossref.org/pdfs/compare-crossref-member-terms-revisions-july-2025.pdf">this marked-up PDF&lt;/a> showing the changes between the current (from June 2022) terms and the revised (July 2025) terms.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;strong>Topic&lt;/strong>&lt;/th>
&lt;th>&lt;strong>Section&lt;/strong>&lt;/th>
&lt;th>&lt;strong>Summary of Change(s)&lt;/strong>&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Terminology&lt;/td>
&lt;td>Various sections (e.g., 1, 2(i), 2(k), 5)&lt;/td>
&lt;td>Streamlines some legal language to enhance clarity and readability.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Member Practices&lt;/td>
&lt;td>2(a)&lt;/td>
&lt;td>Establishes an obligation of Members to comply with Member Practices, to be established soon through a dedicated committee.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Unauthorised use of metadata&lt;/td>
&lt;td>2(d)&lt;/td>
&lt;td>Highlights the harmful impact of unauthorised use or deposit of metadata on Crossref, its Members, and the integrity of the scholarly record.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Reference linking&lt;/td>
&lt;td>2(f), (g)&lt;/td>
&lt;td>Updates the language referring to reference linking, and makes explicit Members’ obligation to maintain reference linking throughout membership, not only upon first joining Crossref. It also makes it clear that members should use DOI links wherever they communicate about any item with a DOI.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Displaying identifiers&lt;/td>
&lt;td>2(h)&lt;/td>
&lt;td>Strengthens Members’ obligation to display DOIs in accordance with Crossref’s Display Guidelines (by eliminating the “commercially reasonable efforts” qualifier).&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Fees&lt;/td>
&lt;td>3&lt;/td>
&lt;td>Expands the definition of “Fees” to include all usage fees and fees for optional services, in addition to annual fees and Content Registration fees. Crossref’s right to suspend or terminate a Member’s account for non-payment extends to any of these fees.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Termination of Membership&lt;/td>
&lt;td>9&lt;/td>
&lt;td>Significantly revises the provision regarding termination of a Member’s membership by Crossref:&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Updates the bases for ‘for-cause’ termination, to include ongoing misrepresentations in a Member’s practices; misleading use or creation of DOIs; and failure to pay fees due (without the former 120-day minimum duration of nonpayment);&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Clarifies the distinction between suspension and termination (also referred to as revocation or expulsion) of a Member’s Crossref membership;&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Eliminates the existing procedures for automatic Board review of a termination or extended suspension. (Crossref’s bylaws have been amended to prescribe a new suspension/termination process and right to request Board review);&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Adds a termination trigger for cases where a Sponsor cancels its agreement with a Sponsored Member. (The member, of course, has the option to move to a new Sponsor, or re-join Crossref as an independent member).&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Notice contacts&lt;/td>
&lt;td>8(d)&lt;/td>
&lt;td>Updates Crossref’s Notice contact; updates the list of required Member contacts.&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;h3 id="changes-to-the-bylaws">Changes to the bylaws&lt;/h3>
&lt;p>Our bylaws have needed updating for a while, but since these seldom change, we&amp;rsquo;ve saved up a few changes, also to bring them in line with the revised membership terms.&lt;/p>
&lt;p>We&amp;rsquo;ve now modernised the language, ensured that the bylaws match what&amp;rsquo;s in the membership terms, and we&amp;rsquo;ve added in motions that have been agreed by the board but not updated in the bylaws over the last few years. We&amp;rsquo;ve also updated the bylaws in line with the new membership revocation process in the new July 2025 membership terms. The new bylaws also allow for a new group of members to be created to help Crossref define Member Practices.&lt;/p>
&lt;p>You can find a summary of the changes below, or take a look at &lt;a href="https://www.crossref.org/pdfs/compare-crossref-bylaws-revisions-march-2025.pdf">this marked-up PDF&lt;/a> showing all the changes to the bylaws.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;strong>Topic&lt;/strong>&lt;/th>
&lt;th>&lt;strong>Section&lt;/strong>&lt;/th>
&lt;th>&lt;strong>Summary of Changes&lt;/strong>&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Terminology&lt;/td>
&lt;td>Various sections&lt;/td>
&lt;td>Eliminates gender-specific terminology, e.g. replaces “Chairman” with “Chair”.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Makes minor clean-up edits (e.g. deletion of unused “Reserved” section and renumbering).&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Membership Qualification&lt;/td>
&lt;td>Art. I Sec. 1&lt;/td>
&lt;td>Replaces “publishes” professional and scholarly materials with “produces” professional and scholarly materials to match the language in the already-current membership terms.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Non-Voting Membership&lt;/td>
&lt;td>Art. I Sec. 2; Art. IV Secs. 7, 8; Art. VII Sec. 4&lt;/td>
&lt;td>Reflects the establishment of a non-voting Member category as previously approved by the Board.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Membership Procedures&lt;/td>
&lt;td>Art. I Sec. 3; Art. I Sec. 5&lt;/td>
&lt;td>Clarifies that acceptance of new Members is delegable to Crossref personnel generally, replacing a narrow reference to the Executive Director.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Eliminates superfluous procedural steps regarding Member resignation.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Suspension and Termination of Membership&lt;/td>
&lt;td>Art. I Sec. 6&lt;/td>
&lt;td>Significantly revises the provision regarding termination of a Member’s membership by Crossref:&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Updates the bases for ‘for-cause’ termination, to include various specific prongs (matching those already in the Member Terms), while maintaining the catch-all for conduct prejudicial to Crossref’s best interests.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Authorises the Board to define standards and procedures for &amp;lsquo;for-cause&amp;rsquo; terminations, or establish a committee (which can be comprised of both Board members and non-Board members) for that purpose.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Specifies that Crossref staff is responsible for implementing the ‘for-cause’ termination standards.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Eliminates the existing procedures for automatic Board review of a termination or extended suspension; specifies the Board’s authority to delegate discretionary appeals/review to the ExCo or other committee of Board members.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Restates that temporary suspension may be used in lieu of, or in advance of, termination.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Annual Meeting&lt;/td>
&lt;td>Art. IV Sec. 1&lt;/td>
&lt;td>Updates language around the timing of the annual Member meeting:&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Replaces reference to the “second week of November” with “during the month of October or November”.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Eliminates language regarding avoiding legal or religious holidays; given Crossref’s global footprint, this is not feasible.&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Thanks for reading this far!&lt;/p>
&lt;p>Don’t forget, members do not need to do anything in response to these changes - by continuing to register metadata after 11th July, they are accepting the latest version of the terms. But do let us know if you have any questions by emailing &lt;a href="mailto:member@crossref.org?subject=Membership%20Terms%20and/or%20bylaws">&lt;a href="mailto:member@crossref.org">member@crossref.org&lt;/a>&lt;/a>.&lt;/p></description></item><item><title>Meet six winners of the first ever Crossref Metadata Awards</title><link>https://www.crossref.org/blog/crossref-metadata-awards/</link><pubDate>Wed, 07 May 2025 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/crossref-metadata-awards/</guid><description>&lt;p>Marking our 25th anniversary, we launch the Crossref Metadata Awards to emphasise our community’s role in stewarding and enriching the scholarly record.&lt;/p>
&lt;p>We are pleased to recognise Noyam Publishers, GigaScience Press, eLife, American Society for Microbiology, and Universidad La Salle Arequipa Perú with the Crossref Metadata Excellence Awards, and Instituto Geologico y Minero de España wins the Crossref Metadata Enrichment Award. These inaugural awards highlight the leadership of members who show dedication to the best metadata practices.&lt;/p>
&lt;p>Crossref exists to make scholarly communications better by making research objects easy to find, cite, link, assess, and reuse. Our members weave the &lt;a href="https://www.crossref.org/documentation/research-nexus/">research nexus&lt;/a>: a rich and reusable open network of connections between works resulting from the scholarly process and the people and institutions engaged in it.&lt;/p>
&lt;p>Rich metadata improves discoverability of and trust in published works. Many institutions now strive to turn towards open research information in their reporting, assessment and evaluation. And so we believe it’s time to give credit to members that are doing the best work in supporting others across the scholarly ecosystem with their metadata.&lt;/p>
&lt;p>The awards presented today will be followed by a series of blog interviews, where the winners will share how they achieved their high level of metadata completeness.&lt;/p>
&lt;p>Starting in 2025, we will hold the awards every other year.&lt;/p>
&lt;p>Read on to get more acquainted with the winners, learn about other high performing organisations and overall trends in metadata practices we see at Crossref.&lt;/p>
&lt;h2 id="recognising-metadata-excellence">Recognising Metadata Excellence&lt;/h2>
&lt;p>&lt;a href="https://noyam.org/" target="_blank">Noyam Publishers&lt;/a> is based in Ghana. Colleagues had the pleasure of meeting them in person, during the &lt;a href="https://doi.org/10.64000/x38ew-0n632" target="_blank">Crossref Accra&lt;/a> event this March. Striving for visibility motivates Noyam&amp;rsquo;s high performance when it comes to metadata. With 57% coverage of key metadata elements across their records, they are a leader among the members in our Global Equitable Membership (GEM) program.&lt;/p>
&lt;p>Among other GEM members who show high participation in the research nexus, we see more than 40% coverage of key metadata elements for the records registered by University of Sierra Leone Teaching Hospitals Complex in Sierra Leone, Queen Arwa University in Yemen, Kathmandu University School of Education in Nepal, and International Journal for Innovation Education and Research in Bangladesh.&lt;/p>
&lt;p>&lt;a href="http://gigasciencejournal.com/" target="_blank">GigaScience Press&lt;/a>, based in Hong Kong, is the leader among small members (organisations of less than USD 1 mln of publishing revenue or expenses). Discoverability drives their high metadata standards, and GigaSciencePress sees those having advantages in terms of service integrations and development too. They are quick to credit the expertise of their technology partner, River Valley Technologies as the strategic contributor to them achieving 82% coverage of key metadata elements across their records.&lt;/p>
&lt;p>It&amp;rsquo;s worth highlighting that the competition among our small members was much closer than in any other category! Stichting SciPost (Netherlands) also show more than 80% coverage across their records, followed by Life Science Alliance, LLC (United States), National Institute for Health and Care Research (United Kingdom), and Universidad La Salle Arequipa (Peru), each of which achieved more than 70% metadata coverage across their registered works.&lt;/p>
&lt;p>&lt;a href="https://elifesciences.org" target="_blank">eLife&lt;/a> leads among our medium members (organisations between USD 1 mln and 10 mln of publishing revenue or expenses) with 85% coverage of key metadata elements. They have shown dedication to metadata quality and consistently high performance over the years. They are also the first publisher to include Crossref grant IDs in their records, adopting the Grant Linking System.&lt;/p>
&lt;p>Other medium-sized organisations to note are MDPI AG in Switzerland, and XMLink in South Korea &amp;ndash; while there&amp;rsquo;s a significant gap to the leader, each of these organisations has more than 50% coverage of key metadata elements across their records.&lt;/p>
&lt;p>It appears that large members (organisations with more than USD 10 mln of publishing revenue or expenses) struggle to achieve consistency in metadata quality across all of their records. Yet, we are delighted to recognise the &lt;a href="https://asm.org" target="_blank">American Society for Microbiology&lt;/a> in the United States, who have embarked on a large metadata quality improvement project several years ago, and it continues to bear fruit as we see 56% of metadata coverage across ASM&amp;rsquo;s records. They&amp;rsquo;ve &lt;a href="https://doi.org/10.64000/nhmg5-3ra76" target="_blank">shared their experience on our blog already&lt;/a>, so this time we&amp;rsquo;ll invite them to follow up with the latest updates on their metadata practices.&lt;/p>
&lt;p>American Geophysical Union (AGU), Public Library of Science (PLOS), SAGE Publications, and Wiley, all based in the United States, are ASM&amp;rsquo;s closest runners up. While the gap is significant &amp;ndash; still each of these organisations has more than 40% of metadata coverage across their records. PLOS has an impressive proportion of Crossmark-enabled works (99%), and American Geophysical Union and Wiley are registering a significant proportion of abstracts for their records (87% and 59% respectively).&lt;/p>
&lt;p>It often takes time to hone new processes and learn about metadata practices, so we decided to recognise metadata excellence among our new members: organisations that joined Crossref within the past two years. Our inaugural award for excellence among new members goes to &lt;a href="https://www.ulasalle.edu.pe/" target="_blank">Universidad La Salle Arequipa Perú&lt;/a>, who joined Crossref in May 2023, and have 71% metadata coverage across their records.&lt;/p>
&lt;h2 id="rewarding-metadata-enrichment">Rewarding Metadata Enrichment&lt;/h2>
&lt;p>Our members don&amp;rsquo;t just register their records with us &amp;ndash; they also steward and maintain their metadata over time. As new technical capabilities and metadata elements become available, members have the ability to update their metadata. We decided to recognise the member who achieved the biggest transformation to their records in the past two years: Instituto Geologico y Minero de España, based in Spain, jumped from just over 1% to more than 40% metadata coverage for their records in the space of the past two years.&lt;/p>
&lt;p>Others who made more than 30% jump in their metadata completeness in the past two years are Cabrera Research Lab (United States), Centro de Investigaciones Sociologicas (Spain), Bon View Publishing PTE (Singapore), Asociacion Colombiana de Neurologia (Colombia), Instituto Superior Tecnológico Almirante Illingworth (Ecuador), and Tashkent State University of Economics (Uzbekistan).&lt;/p>
&lt;h2 id="how-did-we-select-the-winners">How did we select the winners?&lt;/h2>
&lt;p>Our Metadata Excellence Awardees have been selected on the basis of the overall highest coverage of metadata elements included in &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> as of March 2025, and the Metadata Enrichment Award was based on the comparison between performance on the same criteria between March 2023 and March 2025. Participation Reports are openly available and provide information about the proportion of a given member&amp;rsquo;s records that include the following high-value metadata elements:&lt;/p>
&lt;ul>
&lt;li>References&lt;/li>
&lt;li>Abstracts&lt;/li>
&lt;li>ORCID iDs&lt;/li>
&lt;li>Affiliations&lt;/li>
&lt;li>ROR IDs&lt;/li>
&lt;li>Funder Registry IDs&lt;/li>
&lt;li>Funding award numbers&lt;/li>
&lt;li>Crossmark enabled&lt;/li>
&lt;li>Text mining URLs&lt;/li>
&lt;li>License URLs&lt;/li>
&lt;/ul>
&lt;p>The report also includes Similarity Check URLs. However, since Similarity Check is an optional service that attracts a separate fee &amp;ndash; it wouldn&amp;rsquo;t be equitable to include it in our analysis.&lt;/p>
&lt;p>We encourage all members to periodically monitor their participation reports, and we offer frequent &lt;a href="https://www.crossref.org/events/metadata-health-check-webinars/">drop-in metadata health-check sessions&lt;/a>, where we review the reports together and offer advice on making improvements in areas where our members experience challenges.&lt;/p>
&lt;p>In a membership of more than 22,000 organisations, it&amp;rsquo;s difficult to recognise just one organisation as a model of best practices. There are many nuances that influence the performance and we would like to be transparent about some considerations we made in our awarding process.&lt;/p>
&lt;p>First of all, we considered volume of publishing as a key variable, and decided to qualify organisations with a minimum of 20 items of registered content.&lt;/p>
&lt;p>We also recognise that size matters &amp;ndash; and decided to award our Metadata Excellence Awards in four categories corresponding with organisational size and resourcing.&lt;/p>
&lt;h2 id="beyond-the-winners----overview-of-good-metadata-practice-across-different-types-of-works">Beyond the winners &amp;ndash; overview of good metadata practice across different types of works&lt;/h2>
&lt;p>The scholarly communications landscape is always evolving, and new types of content arise all the time. Crossref schema enables rich metadata collection about journal articles, books, book chapters, preprints, conference proceedings, technical reports, as well as grants, and more.&lt;/p>
&lt;p>At this point, the most prolific way of sharing scholarship - at least judging by the number of records registered with Crossref &amp;ndash; is a journal article. There are 112,982,290 journal articles in the Crossref database, and in 2024 alone our members created 6,747,031 journal articles records with us.&lt;/p>
&lt;p>When it comes to books (2,212,221 total records) and book chapters (22,892,785 total records), publishers with the richest metadata records include Universitatsbibliothek Kiel (Germany) with more than 50% coverage of key metadata elements across their book records, and 70% for their book chapters. RTI Press (US) also has strong metadata for books (52%), while Firenze University Press (Italy) has 56% of metadata coverage across their book chapters. Incidentally, Universitatsbibliothek Kiel (Germany) are also leaders in metadata for conference proceedings (53% metadata coverage of those records).&lt;/p>
&lt;p>Preprints and posted content (including preprints, eprints, working papers, reports) are relatively new on the scene and growing rapidly &amp;ndash; Crossref has 1,683,351 preprint records (413,742 registered in 2024). The richest metadata records for preprints belong to eLife (UK) - they cover more than 50% of key metadata elements across their preprints records in Crossref. Springer Science and Business Media LLC (Netherlands) have 48% metadata coverage for their preprints, American Chemical Society (ACS; United States) with 46%, and UNISA Press (South Africa) and PeerJ (US) follow with 44% coverage.&lt;/p>
&lt;p>The newest of record types that can be registered with us are grants. At present this is an early adopters domain, with 152,810 registered grants so far. The European Union (represented in Crossref by the Publications Office of the European Union) registered the most grants to date.&lt;/p>
&lt;h2 id="beyond-the-winners----overview-of-coverage-in-key-metadata-elements">Beyond the winners &amp;ndash; overview of coverage in key metadata elements&lt;/h2>
&lt;p>When speaking about key metadata elements reflected in our Participation Report, the coverage varies widely. For example, overall 21% of records in Crossref have abstract metadata; 2,000 members have a full coverage of their records with abstracts, while 1,000 don&amp;rsquo;t include any. Deposition of ORCID iDs is growing but still very low, with only 10% of records including ORCID iDs.&lt;/p>
&lt;p>Affiliation metadata, broadly sought after by many stakeholders in the scholarly ecosystem - not least because of its role as a key marker of trust - is growing steadily but slowly: only 16% of records included it at the end of March 2025. With recent improvements in our helper tools (especially the latest version of the &lt;a href="https://www.crossref.org/documentation/register-maintain-records/record-registration-form/">record registration &lt;/a>form), and the upcoming developments in other publishing software (notably the upcoming 3.5 version of OJS), which support affiliation metadata better &amp;ndash; we&amp;rsquo;re expecting a significant improvement in the coming months.&lt;/p>
&lt;p>As with affiliations, when research integrity judgements are concerned, another key element is the funding information. The growing interest in metadata among funders further strengthens the case for increasing inclusion of funder information in this way, ideally including Crossref grant DOIs that funders are registering in the hope of using the Grant Linking System to help their assessment and evaluation work. At the moment the space for improvement is vast, with only 6% of Crossref metadata including funder IDs and award numbers.&lt;/p>
&lt;p>We support ROR IDs in both affiliation and funding metadata, but adoption among our members is slow. So far the top five contributors of ROR IDs to Crossref are Fonds de recherche du Québec, eLife, American Physical Society (APS), Optica Publishing Group, and Wellcome.&lt;/p>
&lt;p>Licence metadata is currently included for 43% of records in Crossref, and we see that thousands of members don&amp;rsquo;t include it. Not all members realise that this is a practical challenge for their authors, as it hinders institutions and funders who seek to monitor compliance with their openness mandates.&lt;/p>
&lt;p>Finally, references metadata is the lifeblood of the research nexus, supporting transparency and discoverability of scholarship. We&amp;rsquo;ve got 44% coverage of reference metadata across records registered in Crossref. While &lt;a href="https://www.crossref.org/documentation/reference-linking/">reference linking&lt;/a> is a member obligation, including references in the metadata is a recommended best practice. The way references are recognised and included in works varies by publication type and discipline, which makes it harder for some members to provide it.&lt;/p>
&lt;p>There&amp;rsquo;s an ongoing need to raise awareness about the role of metadata among the wider community, including editors and researchers. We have collaborated with practitioners, supporters, and users of metadata to develop relevant resources as part of the &lt;a href="https://metadata2020.org" target="_blank">Metadata 20/20 initiative&lt;/a>.&lt;/p>
&lt;p>We make efforts to educate our members about best practices when it comes to registering their metadata with us and offer a range of support options, including &lt;a href="https://community.crossref.org/c/tech-support/8" target="_blank">technical support on our Community Forum&lt;/a>. Recognising the leaders in metadata participation is part of that process too. With the upcoming blog series from our awardees, we hope to spur peer-to-peer learning to facilitate widespread improvements and to raise the profile of metadata quality among the community.&lt;/p></description></item><item><title>Metadata Advisory Group call for applications</title><link>https://www.crossref.org/blog/metadata-advisory-group-call-for-applications/</link><pubDate>Fri, 02 May 2025 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/metadata-advisory-group-call-for-applications/</guid><description>&lt;p>We’ve been accelerating our metadata development efforts and recently released &lt;a href="https://doi.org/10.13003/325070" target="_blank">version 5.4&lt;/a> of our metadata schema, and are planning to release version 5.5 (including support for multiple contributor roles and the &lt;a href="https://credit.niso.org/" target="_blank">CRediT&lt;/a> taxonomy) this summer. We will also extend our grants schema based on the Funders Advisory Group work, and make progress on other changes as set out on our new &lt;a href="https://roadmap.productboard.com/e86bfb0f-1a13-49bc-b72d-f8e893041fb4" target="_blank">metadata development roadmap&lt;/a>.&lt;/p>
&lt;p>As we work towards the vision of the rich and reusable open network of relationships connecting research organisations, people, things, and actions, dubbed the Research Nexus, our schemas need to change to accommodate the evolving landscape of research processes and communications.&lt;/p>
&lt;p>In the past we convened the Metadata Interest Group that helped shape the current set of updates we’re now working through, including changes to names, expansion of support for abstracts, dates, and multilingual metadata. As we’ll soon move into new territory (support for subjects, keywords, and other metadata essential to developing a robust research nexus), we want to further enlist the support of our community to help shape the metadata we collect and the metadata best practices we promote.&lt;/p>
&lt;p>We are inviting Crossref members, metadata users, and others with an interest in shaping metadata development at Crossref to apply to join our new Metadata Advisory Group.&lt;/p>
&lt;p>The purpose of the group is to contribute your advice and insight to help shape our metadata development as we broaden the metadata we collect and outputs we support to better align with the Research Nexus. Group participants will help shape metadata development at Crossref, and will discuss potential new metadata to adopt, best practices, and the overall needs of metadata providers and users.&lt;/p>
&lt;p>We’re looking for participants with experience with XML, JSON, and other metadata formats. We’ll cover a range of topics but we would particularly like to engage with those of you with an interest in emerging content types.&lt;/p>
&lt;p>The Metadata Advisory Group will meet quarterly and we’ll accommodate multiple time zones as needed as we want participation to reflect the regional diversity of our membership.&lt;/p>
&lt;p>If you’re interested, please &lt;a href="https://forms.gle/D9xYn7Y72hzXnDa18" target="_blank">submit an application&lt;/a>!&lt;/p></description></item><item><title>Reflections from Crossref Accra 2025 - Strengthening open science and partnerships in Ghana</title><link>https://www.crossref.org/blog/reflections-from-crossref-accra-2025-strengthening-open-science-and-partnerships-in-ghana/</link><pubDate>Tue, 29 Apr 2025 00:00:00 +0000</pubDate><author>Johanssen Obanda</author><guid>https://www.crossref.org/blog/reflections-from-crossref-accra-2025-strengthening-open-science-and-partnerships-in-ghana/</guid><description>&lt;p>Crossref is a membership organisation, and it’s the global community of members that creates the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a> together. Meeting our community locally is a highlight and an important learning experience. This year, we started by connecting with a growing community in Accra, Ghana - our first in-person event in the country included in &lt;a href="https://www.crossref.org/gem/">our GEM program&lt;/a>. From 14 members in 2023 to 31 in 2025, our community in Ghana is blooming.&lt;/p>
&lt;p>At its core, Crossref Accra 2025 was about showing up for the community in Ghana - listening, learning, and building together. On the 20th of March, we welcomed 66 participants: journal editors, university staff, librarians, and researchers. People who are doing the real work of making scholarly publishing happen in the region.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/group-photo-crossref-accra.jpeg"
alt="Photo: Participants from across Ghana’s research and publishing landscape." width="100%">&lt;figcaption>
&lt;p>Photo: Participants from across Ghana’s research and publishing landscape.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>We started the day with a &lt;a href="https://www.crossref.org/services/">walkthrough of Crossref’s services&lt;/a>, then shifted into more tailored conversations - talking metadata quality, improving discoverability, and making Crossref tools work for the local context. The panel featuring &lt;a href="https://www.ajol.info/index.php/ajol" target="_blank">AJOL&lt;/a>, &lt;a href="https://wacren.net/en/" target="_blank">WACREN&lt;/a>, and &lt;a href="https://www.carligh.org/" target="_blank">CARLIGH&lt;/a> was a key moment. We heard honest reflections about journal sustainability, the barriers to indexing, and how Open Access can grow if local infrastructure is supported. Each organisation shared how they’re working to strengthen research communities and where they see Crossref fitting into that bigger picture.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/panel-session-crossref-accra.jpeg"
alt="Photo: Crossref Ambassador Richard Lamptey moderates a panel with WACREN’s Effah Amponsah, CARLIGH’s Mac Anthony Cobblah, and AJOL’s Kylie van Zyl on sustaining journals and advancing Open Access in the region." width="70%">&lt;figcaption>
&lt;p>Photo: Crossref Ambassador Richard Lamptey moderates a panel with WACREN’s Effah Amponsah, CARLIGH’s Mac Anthony Cobblah, and AJOL’s Kylie van Zyl on sustaining journals and advancing Open Access in the region.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>During the dedicated listening session, participants spoke candidly about the cost burden of APCs, the over-reliance on foreign journals for recognition, and the uphill battle local journals face, from limited resources to slow workflows. There was a clear push for stronger local publishing platforms and more training around &lt;a href="https://www.crossref.org/documentation/register-maintain-records/ojs-plugin/">tools like OJS&lt;/a>. People want technical clarity: How does Crossref fit into their workflows? What’s involved in registering metadata and DOIs? What’s the actual value? Many also voiced interest in strengthening relationships with indexing services, and connecting university presses more directly with Crossref. The afternoon breakout sessions were hands-on. One group explored how to use the &lt;a href="https://www.crossref.org/documentation/reports/participation-reports/">Participation Reports&lt;/a> to check metadata completeness, while the other dove into using &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/">the Crossref API&lt;/a>. People started swapping tips, asking questions, and brainstorming ways to improve how their institutions handle metadata. Several wanted to know how to automate more of their workflows through OJS, boost reference linking, and pull better reports from the Crossref system.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/snapshots-crossref-accra.jpeg"
alt="Photo: A collage of snapshots capturing activities at the Crossref Accra event." width="50%">&lt;figcaption>
&lt;p>Photo: A collage of snapshots capturing activities at the Crossref Accra event.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>Outside the main event, we also visited some of our members and stopped by the &lt;a href="https://aau.org/" target="_blank">Association of African Universities&lt;/a>. These visits gave us more time for deeper conversations about publishing workflows, ORCID uptake, metadata visibility, and the bigger picture of Open Access in Ghana. We heard a lot about the potential for more equitable partnerships and stronger local ownership of publishing infrastructure.&lt;/p>
&lt;p>Post-event feedback made one thing clear: people want more opportunities to learn - more practical workshops, more guidance on using Crossref tools, and more support navigating the technical side of things. There’s growing interest in forming a local user group, a space to keep sharing, troubleshooting, and moving forward together. And the desire to improve indexing and visibility was a recurring theme. People see registering identifiers for content as an essential step on that journey. There’s also a broader concern about long-term sustainability and ethical publishing practices. Many journals are doing their best in tough conditions, and there’s a real appetite for honest conversations about quality, trust, and resilience.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2025/staff-ambassadors-crossref-accra.jpg"
alt="Photo: Crossref staff and ambassadors with member Amy Asimah from Regional Maritime University. Pictured: Johanssen Obanda, Oumy Ndiaye, Evans Atoni, Patience Mbum, Audrey Kenni Nganmeni, Ginny Hendricks, and Richard Lamptey." width="75%">&lt;figcaption>
&lt;p>Photo: Crossref staff and ambassadors with member Amy Asimah from Regional Maritime University. Pictured: Johanssen Obanda, Oumy Ndiaye, Evans Atoni, Patience Mbum, Audrey Kenni Nganmeni, Ginny Hendricks, and Richard Lamptey.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>Crossref Accra 2025 reminded us how valuable these local gatherings are - not just for sharing tools and workflows, but for building lasting connections. We’re grateful to &lt;a href="https://www.crossref.org/community/ambassadors/">our Ambassadors&lt;/a> and team who helped make it happen, and we’re committed to deepening our support across the region. There’s so much potential in Ghana’s scholarly community, and in West Africa more broadly, as we’ve seen again at WACREN in Senegal a couple of weeks later. We’re committed to working with local partners to help it grow.&lt;/p></description></item><item><title>Enhancing DOI Accessibility for All Users</title><link>https://www.crossref.org/blog/enhancing-doi-accessibility-for-all-users/</link><pubDate>Mon, 28 Apr 2025 00:00:00 +0000</pubDate><author>Patrick Vale</author><guid>https://www.crossref.org/blog/enhancing-doi-accessibility-for-all-users/</guid><description>&lt;h2 id="2025-update">2025 Update&lt;/h2>
&lt;p>In 2022, we set out to update our DOI display guidelines with the intention to adopt &lt;a href="https://doi.org/10.64000/gyw3h-trd87" target="_blank">the proposals&lt;/a> in 2025. It’s important to note from the outset that we are not mandating any immediate changes to the &lt;a href="https://www.crossref.org/display-guidelines/" target="_blank">DOI display guidelines&lt;/a>. Instead, we are working with our community to co-create a solution that addresses the diverse needs of all users, rather than imposing technical changes that may not suit everyone.&lt;/p>
&lt;h2 id="background">Background&lt;/h2>
&lt;p>DOI links are the lifeblood of scholarly communication. They’re the canonical identifiers that enable researchers to find, cite, and assess academic work. In essence, they’re stable, reliable, and easy to use—provided you can see them. But what happens when a user can’t rely on visual cues?&lt;/p>
&lt;h2 id="the-accessibility-challenge">The Accessibility Challenge&lt;/h2>
&lt;p>For users of screen readers and other assistive technologies, the full value of a DOI link can be lost. While sighted users benefit from the context surrounding a DOI link—such as the title, abstract, and other metadata—screen reader users often hear just the bare URL. This means they might not know what content the DOI link represents, leading to confusion and a diminished browsing experience.&lt;/p>
&lt;p>The problem is compounded by the technical nature of DOI links. Being URIs (Uniform Resource Identifiers), they don&amp;rsquo;t naturally lend themselves to the same accessibility techniques as standard URLs. When we attempted to tweak DOI links directly, every change that improved accessibility for one group inadvertently hindered another. Whether it was a WCAG (Web Content Accessibility Guidelines) rule or an ARIA (Accessible Rich Internet Applications) attribute, a solution that worked in one area would break in another.&lt;/p>
&lt;h2 id="a-community-driven-approach">A Community-Driven Approach&lt;/h2>
&lt;p>Realizing that a one-size-fits-all fix wouldn’t work, we took a different approach - one that involved the community from the outset. After consulting with early adopters and attending an insightful session with the &lt;a href="https://jats4r.niso.org/" target="_blank">JATS4R&lt;/a> accessibility group, it became clear that the answer lay in experimentation and iteration. Rather than modifying the DOI display guidelines immediately, we are developing a tool that enhances the user experience without disrupting the current standards.&lt;/p>
&lt;p>It’s worth noting that this solution places the responsibility on the end user rather than on publishers and platform providers. However, by doing so, users can have a consistent browsing experience regardless of the platform they use to access scholarly content. This approach also serves as an important stepping stone toward a future publisher-provided solution—be it via accessibility-focused JavaScript or a mandated dual-link implementation—and any efforts to recommend or mandate such changes will benefit greatly from concrete evidence of the effectiveness and scalability of this approach.&lt;/p>
&lt;h2 id="introducing-the-doi-accessibility-enhancer">Introducing the DOI Accessibility Enhancer&lt;/h2>
&lt;p>First &lt;a href="https://www.youtube.com/watch?v=VBnfkOxVr6s&amp;amp;t=1916s" target="_blank">demonstrated at the recent Crossref Annual Meeting&lt;/a>, here we share our DOI Accessibility Enhancer browser extension. Available now on the &lt;a href="https://addons.mozilla.org/en-GB/firefox/addon/crossref-doi-a11y-tool/" target="_blank">Firefox Add-on Store&lt;/a> and the &lt;a href="https://chromewebstore.google.com/detail/crossref-doi-accessibilit/nmpnpkdfcdnbnpiekngokijfoilpfpbc" target="_blank">Chrome Web Store&lt;/a>, this extension is designed to improve the experience of DOI links for screen reader users without altering the default behavior for sighted users.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2025/doi-a11y-enhancer-2025.png" width="60%" alt="The Crossref DOI Accessibility Enhancer browser extension running in Firefox" >
&lt;/div>
&lt;h2 id="how-it-works">How It Works&lt;/h2>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>Scanning for DOI Links:&lt;/strong> The extension scans any webpage for DOI links.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Querying Metadata:&lt;/strong> Once a DOI is detected, it queries the Crossref REST API to retrieve the title of the corresponding scholarly work.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Enhancing the Link:&lt;/strong> The title is then injected as a screen-reader–only link. This means that when a screen reader user navigates to the DOI, they hear the title of the paper rather than the opaque URL.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Maintaining Visual Integrity:&lt;/strong> For sighted users, the original DOI link remains unchanged—visible, clickable, and easy to copy-and-paste.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Highlighting for Testing:&lt;/strong> An optional feature highlights updated links, making it easier for developers and testers to see the changes in action.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="get-involved">Get Involved&lt;/h2>
&lt;p>This project is very much a community effort. The extension is open-source, and we welcome feedback and contributions via our &lt;a href="https://gitlab.com/crossref/doi-accessibility-enhancer" target="_blank">GitLab repository&lt;/a>, email, or &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>. Your real-life experiences and insights will drive future improvements, ensuring that our solution meets the diverse needs of all users.&lt;/p>
&lt;h2 id="try-it-out">Try It Out&lt;/h2>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2025/doi-a11y-enchancer-firefox-addons-2025.png" width="60%" alt="The Crossref DOI Accessibility Enhancer browser extension in Firefox Add-ons" >
&lt;/div>
&lt;p>If you’re using Firefox, head over to the Firefox Add-on Store and install the &lt;a href="https://addons.mozilla.org/en-GB/firefox/addon/crossref-doi-a11y-tool/" target="_blank">DOI Accessibility Enhancer&lt;/a> today. If you’re a Chrome user, you can find the extension directly in the &lt;a href="https://chromewebstore.google.com/detail/crossref-doi-accessibilit/nmpnpkdfcdnbnpiekngokijfoilpfpbc" target="_blank">Chrome Web Store&lt;/a>. If you use a screen reader you’ll experience the difference firsthand - and if you don’t, give it a try with VoiceOver enabled (Command-F5 on a Mac).&lt;/p>
&lt;p>Together, we can advance scholarly accessibility and ensure that critical research remains discoverable for everyone.&lt;/p></description></item><item><title>Supporting Membership through the Sponsor Program</title><link>https://www.crossref.org/blog/supporting-membership-through-the-sponsor-program/</link><pubDate>Fri, 18 Apr 2025 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/supporting-membership-through-the-sponsor-program/</guid><description>&lt;p>Sponsors make Crossref membership accessible to organisations that would otherwise face barriers to joining us. They also provide support to facilitate participation, which increases the amount and diversity of metadata in the global Research Nexus. This in turn improves discoverability and transparency of scholarship behind the works.&lt;/p>
&lt;h3 id="growing-number-of-sponsors">Growing number of sponsors&lt;/h3>
&lt;p>Our first sponsors joined in 2008, but the program started to grow rapidly between 2012-2014, with the addition of sponsors in South Korea, Türkiye, Russia, India, and Ukraine. In 2015, we welcomed our first South American sponsor from Brazil, followed by more sponsors in Latin America starting in 2016, and our first sponsor in Indonesia in 2017.&lt;/p>
&lt;blockquote>
&lt;p>As of December 2024, Crossref works with 124 sponsoring organisations that support 12,195 sponsored members.&lt;/p>
&lt;/blockquote>
&lt;p>In 2021, we updated the criteria for organisations to be accepted as sponsors, raising the bar to ensure that potential sponsors accurately and successfully represent Crossref in the community. We also &lt;a href="https://doi.org/10.64000/yjcny-cbd06" target="_blank">paused the acceptance of new Sponsors&lt;/a> from regions where such organisations are already prolific. By doing so, we can focus on growing the program in areas with the greatest need.&lt;/p>
&lt;p>In 2024, we added eight new sponsors to the program; these included our first sponsor in Bangladesh &lt;a href="https://www.crossref.org/gem/">(our first GEM sponsor)&lt;/a>, as well as sponsors in China, Kazakhstan, Pakistan, Türkiye, Tunisia, Iraq, and Kenya.&lt;/p>
&lt;h3 id="sponsor-growth-by-country-by-year">Sponsor growth by country by year&lt;/h3>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2025/sponsor-growth-by-country-by-year-graph.png"
alt="graph showing growth by country" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Our five largest sponsors, based on the number of members they support (as of the end of 2024) are:&lt;/p>
&lt;ul>
&lt;li>Relawan Jurnal Indonesia, Indonesia - 3076 members&lt;/li>
&lt;li>Associacao Brasileira de Editores Cientificos do Brasil (ABEC Brasil) - 1312&lt;/li>
&lt;li>Tubitak Ulakbim DergiPark, Türkiye - 1248&lt;/li>
&lt;li>NEICON ISP, Russia- 713&lt;/li>
&lt;li>Kyobobook Center, South Korea - 419&lt;/li>
&lt;/ul>
&lt;p>The majority of sponsors are much smaller than this, looking after 25 or fewer Sponsored Members.&lt;/p>
&lt;p>Each sponsor has specific criteria for what kind of organisations they work with. Some are dedicated to supporting organisations in a specific country or region, while others may be based on geography, language, subject area, or usage of a specific platform, e.g. OJS.&lt;/p>
&lt;p>Our sponsors are distributed across all regions of the world, and we’re continuously working to forge networks with organisations in regions with the least coverage, to ensure scholarly communicators anywhere can join Crossref and contribute to the Research Nexus.&lt;/p>
&lt;ul>
&lt;li>Asia Pacific: 22&lt;/li>
&lt;li>Central and Eastern Europe: 29&lt;/li>
&lt;li>Central and South Asia: 25&lt;/li>
&lt;li>Latin America and the Caribbean: 24&lt;/li>
&lt;li>North Africa and the Middle East: 3&lt;/li>
&lt;li>Sub-Saharan Africa: 2&lt;/li>
&lt;li>​US and Canada: 5&lt;/li>
&lt;li>Western Europe: 14&lt;/li>
&lt;/ul>
&lt;p>Currently, sponsored members represent 115 different countries, with the largest proportions from Latin America, South-eastern Asia, and Eastern Europe. Nearly two-thirds of sponsored members self-identify as universities, libraries, government agencies, foundations, scholar publishers, and research institutions.&lt;/p>
&lt;blockquote>
&lt;p>To date, sponsored members have contributed 6.5 million works to the Research Nexus.&lt;/p>
&lt;/blockquote>
&lt;p>Importantly, the sponsored members have the ability to fully participate in Crossref – they are stewards of their records (even if some choose to delegate this activity to their sponsor), they can vote, stand in for elections to our Board of Directors, and collaborate with others in the Crossref community, just as any other member.&lt;/p>
&lt;h3 id="benefits-of-the-sponsor-program">Benefits of the Sponsor Program&lt;/h3>
&lt;p>Sponsors are key partners for us in making participation easier for organisations in their communities. They work with us to provide administrative, billing, technical, and local language support to the members they work with. Depending on the financial model, they may charge members for their services.&lt;/p>
&lt;p>Technical support they provide for members makes it more tailored and often quicker than the Crossref team could offer. For example, sponsors can provide service in their local language using their preferred method (helpdesk, WhatsApp, phone, email), which varies widely by region; or, where they charge any fees – they tend to collect those in the local currency. Some sponsors even take care of all the records registration for the members they support.&lt;/p>
&lt;p>It’s important to note that sponsors can only support the participation of organisations that would otherwise be in the current $275 fee tier (or up to $500 for funders) if these organisations were to join independently. Regardless of the number of sponsored members, the sponsor pays one membership fee on behalf of them all, and then they also pay all the registration fees that are due on behalf of their sponsored members, which alleviates challenges related to paying in foreign currency. Overall, sponsors make Crossref membership more economical for the organisations that participate this way, and Crossref benefits from billing efficiencies.&lt;/p>
&lt;p>In a recent survey of sponsored members (carried out in July 2023, with 204 responses from members working with 53 sponsors), the majority of sponsored members (88%) said that sponsors met their expectations and 85% are likely or very likely to recommend their sponsors to another organisation.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2025/sponsor-survey-results-graph.png"
alt="graph showing survey results" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Respondents indicated that the aspects of working with a sponsor that were most valued are technical support (72%), financial assistance/no annual fee (37.3%), ability to pay in local currency (43%), and local language support (44%).&lt;/p>
&lt;p>It’s important to note that sponsors often offer many non-Crossref services to members too, including anything from website design, copy editing, typesetting, set up of publishing platform, XML-JATS markup, to assistance with submitting content to third-party databases.&lt;/p>
&lt;p>Sponsors represent Crossref in the community. They also assist us in connecting with their communities locally. In 2024, we collaborated with Biteca for an event in Bogotá, and Relawan Jurnal Indonesia for a two-day event in Jakarta. Both sponsors advised on venues, promoted the event to the members they support, coordinated local guest speakers, and provided translation services as needed. We also collaborated with Hipertexto-Netizen on engaging our community at the Guadalajara Book Fair. The success of these events was in part due to our collaboration with each sponsor.&lt;/p>
&lt;h3 id="ensuring-quality-experience-for-our-members">Ensuring quality experience for our members&lt;/h3>
&lt;p>We try to make sure that every sponsor we work with will be able to commit to helping our members long-term. We offer training too, with an expectation that they can disseminate the learning to their members. The majority of sponsored members report receiving some training from their Sponsors (with 70% in our survey saying they’ve received adequate training on all services, while only 3% haven’t received any so far). Most recently we engaged sponsors with the &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> to help them improve metadata completeness for their members.&lt;/p>
&lt;p>In 2024, we’ve been meeting sponsors individually to review how things are going for them and their members – assessing member metadata quality, and additional services, as well as inviting their feedback about the program and suggestions for improvements that Crossref could make.&lt;/p>
&lt;p>We’ve learnt a lot about practices related to record registration and training, business models and especially – a whole range of attitudes and approaches related to metadata completeness. Some sponsors register content for all or some of their members, while others provide technical support but do not register the content directly for members.&lt;/p>
&lt;p>Members who used OJS often had higher scores because of the ease of use and availability of the plugins. Some sponsors noted that many journal editors are volunteers and don’t have the time or financial resources to collect extra metadata or update existing metadata records; they collect only what is required to register an item. Several sponsors also reported a barrier with authors&amp;rsquo; mindset – they don’t tend to see the value of including ORCiDs or ROR IDs in their submissions. Somewhat surprisingly, we learned that not all members see the value in including references in their deposits or don’t wish to take the time to add them – this is a concern, as relationships created by references are a cornerstone of the Research Nexus, and markedly support discoverability of the content.&lt;/p>
&lt;p>Sometimes, sponsors are unable to continue to provide services, or they are unable to meet the obligations of being a sponsor and their accounts are closed. In the cases where a sponsor account is closed, we will work with their members to find an alternative sponsor when possible.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a> is an external service provided in partnership with iThenticate, that’s available to Crossref members at a more competitive price, and it is in demand among the sponsored members too. Currently, 78 Sponsors offer Similarity Check to their members (however, not all sponsored members working with these sponsors have elected to use the service).&lt;/p>
&lt;p>Sponsor LIBCOM Piotr Karwasinski was pleased that “All the rules of Crossref are unified. Everything is the same for everyone - the same for big publishers as well as small. Equal for everyone.”&lt;/p>
&lt;p>Costs can sometimes be a concern; sponsors in India and Algeria both noted that &lt;a href="https://www.crossref.org/fees/">$1USD&lt;/a> is a lot of money for some. We mentioned the fee review being conducted with the RCFS project.&lt;/p>
&lt;h3 id="in-summary">In summary&lt;/h3>
&lt;p>As we move toward realizing our vision of a connected Research Nexus, building a network for the global community must include input from all of the global community. When Crossref began 25 years ago our first members were mainly from the United States and Western Europe, but today our membership is much more global and diverse. Though our membership has grown to more than 22,000 organisations around the world, we are not seeing significant membership growth from all regions.&lt;/p>
&lt;p>In the last few years, almost half of our members came from Southeastern Asia, Eastern Europe, and Latin America combined. However, there is much slower growth in other regions, mostly notably Northern and Sub-Saharan Africa, and parts of Central Asia, with only 5% of new member applications coming from these regions collectively. We know there are organisations in those areas contributing to the scholarly record, however, many continue to face financial, technical, and administrative barriers to become members.&lt;/p>
&lt;p>The Sponsor Program is one of the avenues established to address and reduce barriers and to help facilitate membership and participation to all knowledge-sharing organisations worldwide. Ensuring it remains strong and successful requires collaboration, communication, and comprehensive training.&lt;/p></description></item><item><title>Request for proposals: Crossref website information architecture review</title><link>https://www.crossref.org/blog/request-for-proposals-crossref-website-information-architecture-review/</link><pubDate>Thu, 17 Apr 2025 00:00:00 +0000</pubDate><author>Lena Stoll</author><guid>https://www.crossref.org/blog/request-for-proposals-crossref-website-information-architecture-review/</guid><description>&lt;p>We are looking for an organisation to perform an audit of, and propose changes to, the structure and information architecture underlying &lt;a href="https://www.crossref.org/">our website&lt;/a>, with the aim of making it easier for everyone in our community to navigate the website and find the information they need.&lt;/p>
&lt;div class="shortcode-divwrap yellow-highlight">
&lt;span>UPDATE, August 2025: We are partnering with &lt;a href="https://cazinc.co.uk/" target="_blank">Cazinc&lt;/a> and &lt;a href="https://aisolutions.cactusglobal.com/" target="_blank">Cactus AI Solutions&lt;/a> on this work. Stay tuned for updates on the progress of this project over the coming months.&lt;/span>
&lt;/div>
&lt;h3 id="about-crossref">About Crossref&lt;/h3>
&lt;p>Crossref is a nonprofit membership organisation that exists to make scholarly communications better. We run open infrastructure to link research objects, entities, and actions, creating a lasting and reusable scholarly record that underpins open science and makes research outputs easy to find, cite, link, assess, and reuse.&lt;/p>
&lt;p>Together with our 22,000 members in 160 countries, we drive metadata exchange and support nearly 2 billion monthly API queries, facilitating global research communication, for the benefit of society. Our members include research institutions, publishers, libraries, funders, government bodies, and other stakeholders in the scholarly communications ecosystem.&lt;/p>
&lt;h3 id="about-the-crossref-website">About the Crossref website&lt;/h3>
&lt;p>We launched the current website in 2016. A few years later, we custom-developed the current Documentation section, moving from a separate site (Zendesk, and prior to that HelpIQ). We subsequently launched a Discourse &lt;a href="https://community.crossref.org/" target="_blank">community forum&lt;/a> and actively encourage self-service there. Despite these efforts, we still answered about 50,000 support emails in 2024.&lt;/p>
&lt;p>We use the &lt;a href="https://gohugo.io/" target="_blank">Hugo&lt;/a> static site generator, and all the content, assets, and code are open in &lt;a href="https://gitlab.com/crossref/crossref-website/" target="_blank">GitLab&lt;/a>. We have dedicated staging and sandbox branches, and use staging for editing instead of the usual git merge requests, and sandbox for testing more substantial code or navigation changes.&lt;/p>
&lt;p>We share the responsibility for editing across the teams, with a page owner/author denoted for each page. Most staff use &lt;a href="https://code.visualstudio.com/" target="_blank">VSCode&lt;/a> for editing; we don’t have or need a CMS. We deploy changes to the live site around twice a week. Several custom shortcodes are in place, such as for tables and displaying related information based on tags, or for presentation elements like highlight boxes or columns. We host (many) images and files directly in the repository, rather than using a CDN. We use Algolia for site search, which was chosen because it can support multiple languages.&lt;/p>
&lt;h4 id="current-website-structure">Current website structure&lt;/h4>
&lt;p>There are currently four main sections of the website:&lt;/p>
&lt;ol>
&lt;li>&lt;a href="https://www.crossref.org/community/">Get involved&lt;/a>: this landing page is the most up-to-date with our current positioning and messaging. The section includes how to join as a member and the ways you can participate, obligations and benefits; a welcome page for new members to get started; events and webinars like our annual meeting; special projects or campaigns that need landing pages; fees; programs such as for service providers and ambassadors; global equitable membership; code of conduct; and working groups (which are different from board committees).&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/services/">Find a service&lt;/a>: listing the purpose and value/benefits for each service, such as content registration, metadata retrieval/APIs/Search, Crossmark, Similarity Check, Grant Linking System, and some other quasi-services that require members to develop or enable something, like reference linking or the Open Funder Registry or ROR.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/documentation/">Documentation&lt;/a>: following more-or-less our “managed member journey” pathway, this includes getting set up, how to create DOI suffixes, how to select the right tool for content registration, how to interpret the various reports that members receive, what to expect in terms of invoicing, schema library and best practices for metadata sharing incl. guidance on principles to follow and sample XML files to edit. Each ‘service’ then has it’s own documentation section too.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/about/">About&lt;/a>: governance, including information about our board, committees, and bylaws. Financial information and annual reports. Staff pages, org chart, jobs, and policies incl. employee handbooks. History of Crossref and mission. Under the sub-heading “Operations &amp;amp; sustainability”, there is also detailed information about membership processes such as revocations, managing legal sanctions, member practices, and member offboarding.&lt;/li>
&lt;/ol>
&lt;p>Additionally, the website hosts our &lt;a href="https://www.crossref.org/blog/">blog&lt;/a> and allows users to sign up for our &lt;a href="https://www.crossref.org/subscribe-newsletter/">newsletter&lt;/a>, which are two key ways in which we keep our community informed.&lt;/p>
&lt;h3 id="project-overview">Project overview&lt;/h3>
&lt;h4 id="end-goal">End goal&lt;/h4>
&lt;p>We want to allow our community to self-serve with information about what Crossref does, how to become a member, how to use our tools, and how to participate in our programs and services. The &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a> are central to how we operate, and we want the information about the how, what, and why of Crossref to not only be openly available, but also easy to discover and reuse.&lt;/p>
&lt;p>Visitors to &lt;a href="https://www.crossref.org" target="_blank">www.crossref.org&lt;/a> should be offered the information that they are looking for quickly and intuitively. A reduction in the number of help-desk tickets we receive (in 2024 we answered 50,000 of them) would be an indication of an improved self-service website, as would lower bounce rates.&lt;/p>
&lt;h4 id="scope-and-deliverables">Scope and deliverables&lt;/h4>
&lt;p>At the end of this information architecture review project, we expect to have agreed on a set of recommendations for tackling the problem statements laid out in the appendix of this document, as well as a plan for how the recommendations should be implemented. This plan will form the basis for an implementation project in 2026. We encourage applications both from organisations who would also be comfortable taking on the implementation project and from those who feel their expertise is specific to the review project described herein.&lt;/p>
&lt;p>Specifically, we expect the following deliverables:&lt;/p>
&lt;ul>
&lt;li>Assessment of key user needs (through analytics and/or user interviews incl. editors)&lt;/li>
&lt;li>Audit and analysis of current site structure and how it serves key user pathways&lt;/li>
&lt;li>Recommendations for content re-architecture, navigation and search improvements&lt;/li>
&lt;li>Strategy for taxonomy and/or tagging system&lt;/li>
&lt;li>Strategy for documentation site setup&lt;/li>
&lt;li>Strategy for information pathways between website, docs, community forum, ticketing systems&lt;/li>
&lt;li>Recommended roadmap for 2026 implementation project&lt;/li>
&lt;li>Nice to have: Wireframes or annotated sitemaps for future site layout&lt;/li>
&lt;/ul>
&lt;h4 id="problem-statements">Problem statements&lt;/h4>
&lt;ol>
&lt;li>&lt;strong>It is difficult to find information about our services&lt;/strong>. Even Crossref staff often use search engines to find a page on our website rather than navigating to it or using the built-in search on the website. It’s often not clear whether the information you are looking for is on the “Find a service” page or the “Documentation” page for a given service, and there is no consistent cross-linking between the two groups of pages. There is a search bar prominently placed on the home page, but the search currently only looks for direct matches between the search terms and page contents (with some declensions, stopwords, and fuzziness to allow for typos). We have limited tracking available in Algolia, but can see that in a 7-day span in March 2025, a large portion of searches (78%) returned no results.&lt;/li>
&lt;li>&lt;strong>It is difficult to navigate our website&lt;/strong>. The home page contains some quick links to key pages, but they are not very visible. In order to navigate the website from the home page, users have to expand a hamburger menu which takes up the whole page, and are then presented with an overwhelming amount of options. Once users have left the home page, the way they navigate depends on which section of the website a user finds themselves in: all pages have breadcrumbs going back to Home, while only Documentation pages have a hierarchical sidebar. In order to switch between the basic groups of pages (&lt;em>Get involved&lt;/em>, &lt;em>Find a service&lt;/em>, &lt;em>Documentation&lt;/em>, &lt;em>About us&lt;/em>), users have to use the global hamburger menu.&lt;/li>
&lt;li>&lt;strong>Our home page doesn’t do a very good job of explaining who we are and what we do&lt;/strong>. A lot of real estate is taken up by images and recent news items without much context. Bounce rates from the home page are high (65% as of March 2025).&lt;/li>
&lt;li>&lt;strong>Our user interfaces and reports are not easily accessible from our website&lt;/strong>. While we are not a SaaS organisation, there is an established pattern of being able to access an organisation’s services directly from its website (often via a login button at the top right). This is complicated by the fact that we don’t have one single frontend “platform”. In fact we don’t have a single page linking out to the various frontends and interfaces, nor do we have a consistent pattern of linking out to an interface from the documentation page describing how to use it.&lt;/li>
&lt;li>&lt;strong>Some of the pages and grouping of pages are outdated and don’t reflect our current priorities or ways of working anymore&lt;/strong>. For example, the &lt;em>Get involved&lt;/em> section still features &lt;em>Special programs&lt;/em> and &lt;em>Service providers&lt;/em> quite prominently, but the cross-functional programs that shape most of our strategic work now (&lt;em>Co-creation and Community Trends&lt;/em>, &lt;em>Contributing to the Research Nexus&lt;/em>, &lt;em>Open and Sustainable Operations&lt;/em>, &lt;em>Metadata Development&lt;/em>) are not represented. &lt;em>Find a service&lt;/em> strongly suggests we’re a service provider, whereas most of our services are enabling infrastructure, requiring members to build or act on something. Some more recently created pages don’t fit neatly into any of the current groupings: e.g., &lt;em>API Learning Hub&lt;/em> can be found under &lt;em>Get involved&lt;/em> and in the home page footer, but doesn’t really belong in either. We also have time-limited, special projects or campaigns like the 25th anniversary of Crossref or the Resourcing Crossref for Future Sustainability project, for which there isn’t a great home. Lastly, we want to host additional content on our website in future, such as our own staff publications; instructions on how to find our codebases and how to contribute to them; how to build technical integrations; how to report bugs; and general best practices in scholarly communications (e.g. in the context of our work on the &lt;a href="https://www.crossref.org/community/special-programs/research-integrity/">integrity of the scholarly record&lt;/a>), which is not really part of the documentation of our services.&lt;/li>
&lt;/ol>
&lt;h3 id="project-budget-and-timeline">Project budget and timeline&lt;/h3>
&lt;p>We have a maximum budget of $20,000 allocated to the information architecture review project. The projected timeline is as follows:&lt;/p>
&lt;ul>
&lt;li>RFP issued: April 17, 2025&lt;/li>
&lt;li>Final deadline for proposals: May 15, 2025&lt;/li>
&lt;li>Shortlisted applicant interviews: May 2025&lt;/li>
&lt;li>Appointment made: June 2025&lt;/li>
&lt;li>Project kick-off: July 2025&lt;/li>
&lt;li>Final deliverables due: October 2025&lt;/li>
&lt;/ul>
&lt;p>If you are interested in applying but don’t think this timeline is deliverable for you, please contact us to suggest what would be realistic for you or your organisation before applying.&lt;/p>
&lt;h3 id="proposal-submission-requirements">Proposal submission requirements&lt;/h3>
&lt;p>Proposals, as well as any questions, should be submitted to &lt;a href="mailto:lstoll@crossref.org">Lena Stoll&lt;/a> by 15 May 2025.&lt;/p>
&lt;p>Please include the following in your proposal:&lt;/p>
&lt;ul>
&lt;li>Company background and relevant experience with open-source static sites and mission-driven communications&lt;/li>
&lt;li>Case studies or examples of comparable work&lt;/li>
&lt;li>Your approach to the proposed project and how you would structure it&lt;/li>
&lt;li>Team bios and roles incl. typical timezones&lt;/li>
&lt;li>Timeline and milestone estimates&lt;/li>
&lt;li>Proposed budget, including breakdown&lt;/li>
&lt;li>Proposed cadence of check-ins, communications, milestones, and deliverables&lt;/li>
&lt;li>Contact information&lt;/li>
&lt;/ul>
&lt;h3 id="proposal-evaluation-criteria">Proposal evaluation criteria&lt;/h3>
&lt;p>We will evaluate proposals based on:&lt;/p>
&lt;ul>
&lt;li>Demonstrated understanding of our mission and community needs&lt;/li>
&lt;li>Proven experience designing for multilingual and multinational audiences&lt;/li>
&lt;li>Expertise in mission-driven business-to-business communications and information architecture&lt;/li>
&lt;li>Quality of previous work and case studies&lt;/li>
&lt;li>Value for money&lt;/li>
&lt;/ul>
&lt;h3 id="we-look-forward-to-hearing-from-you">We look forward to hearing from you!&lt;/h3></description></item><item><title>The programs approach: our experiences during the first quarter of 2025</title><link>https://www.crossref.org/blog/the-programs-approach-our-experiences-during-the-first-quarter-of-2025/</link><pubDate>Tue, 08 Apr 2025 00:00:00 +0000</pubDate><author>Helena Cousijn</author><guid>https://www.crossref.org/blog/the-programs-approach-our-experiences-during-the-first-quarter-of-2025/</guid><description>&lt;p>At the end of last year, we were excited to announce &lt;a href="https://doi.org/10.64000/bm6g0-gvy36" target="_blank">our renewed commitment to community &lt;/a>and the launch of three cross-functional programs to guide and accelerate our work. We introduced this new approach to work towards better cross-team alignment, shared responsibility, improved communication and learning, and make more progress on the things members need.&lt;/p>
&lt;p>In line with the Crossref strategic agenda, the three programs focus on:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Co-creation and Community Trends (CCT)&lt;/strong>: This program is responsible for interfaces such as reports/dashboards, record registration interfaces, connections and collaborations such as &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a>, &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a>, &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a> auto-update, as well as &lt;a href="https://pkp.sfu.ca/ojs/" target="_blank">OJS&lt;/a> and other partner integrations. This program also includes the Crossref website and any front-end interfaces to support other programs. It includes initiatives aimed at upholding the integrity of the scholarly record and our tools in this area, such as &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> and retraction/correction tooling, and &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a> for text comparisons.&lt;/li>
&lt;li>&lt;strong>Contributing to the Research Nexus (CRN)&lt;/strong>: This program manages and oversees all activities relating to contributing to the Research Nexus. A lot of the work in this program revolves around our REST API, but also includes our other APIs, incorporating external data sources like &lt;a href="https://retractionwatch.com/" target="_blank">Retraction Watch&lt;/a> and Event Data, building out metadata matching services with the new data science team, supporting the community of metadata users with API sprints and more modern options for retrieving metadata based on usage and need.&lt;/li>
&lt;li>&lt;strong>Open and Sustainable Operations (OSO):&lt;/strong> This program manages and oversees all activities related to making our operations more open, transparent, and sustainable. This program focuses on supporting and strengthening the core functions our members rely on and enabling future growth. It includes metadata deposit and processing, most apps for e.g. managing titles, authentication, and architectural and infrastructural projects like moving from the data centre to the AWS cloud service. This program also includes modernising our operations in general, which is not just technology but also finance and human resources, so projects like membership process automation, financial analyses, and business system integrations.&lt;/li>
&lt;/ul>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/strategic-themes-programs-landscape-slide.png"
alt="screenshot from Strategy page showing Crossref strategic themes." width="75%">
&lt;/figure>
&lt;/div>
&lt;p>The approach we are taking is to support the work within the programs through (internal) cross-functional steering groups. Led by three program leads (who share updates on their programs below), three program steering groups meet regularly to discuss the topics and work that fall within the scope of each program. The steering groups consist of representatives from all teams within Crossref, which means every steering group has people from the community team, membership team, technical team, data science team, and operations and finance team, bringing all the perspectives and expertise needed to prioritise the next steps for Crossref and fostering broad knowledge sharing and shared responsibility.&lt;/p>
&lt;p>Although the whole organisation contributes to these programs, they are coordinated by the Programs and Services team. The team was formed towards the end of 2024, and on the 1st of February, &lt;a href="https://www.crossref.org/people/helena-cousijn">Helena Cousijn&lt;/a> joined Crossref in the new role of Director of Programs and Services. Helena has a background in both product management and community engagement and is very excited to help Crossref shape the programs approach and work with all teams across the organisation to drive the strategic agenda forward!&lt;/p>
&lt;p>If you’d like to keep an eye on the work that is happening within each program, you can find more information on the &lt;a href="https://roadmap.productboard.com/e6fdeba8-a5b3-4aef-8104-d48863ba975e" target="_blank">Crossref productboard&lt;/a>.&lt;/p>
&lt;h2 id="co-creation-and-community-trends-cct">Co-creation and Community Trends (CCT)&lt;/h2>
&lt;p>The mission of the CCT program is to build and foster relationships with our community and other services and organisations within it, so that Crossref can meet and anticipate community needs. Curiosity and listening are at the core of how we co-create to tackle emerging challenges, develop best practices, and explore new ideas for building the Research Nexus. We want our work to benefit all of Crossref’s diverse stakeholders - from our own colleagues and members to underrepresented communities in the wider scholarly ecosystem.&lt;/p>
&lt;p>In the first quarter of 2025, our focus areas have been:&lt;/p>
&lt;ul>
&lt;li>Improvements to our &lt;a href="https://manage.crossref.org/records" target="_blank">new record registration form for journal articles&lt;/a>, which already supports grants, and was launched in beta for articles in 2024. For example, the form now has a built-in reference deposit feature. Join the conversation on the &lt;a href="https://community.crossref.org/t/feedback-on-new-helper-tool/1721" target="_blank">community forum&lt;/a> for updates and feedback on this new helper tool.&lt;/li>
&lt;li>Running a series of multilingual &lt;a href="https://www.crossref.org/events/metadata-health-check-webinars/">metadata health check webinars&lt;/a>. There are more of these coming up throughout Q2, so it’s not too late to sign up for one if you are interested.&lt;/li>
&lt;li>Integrating with &lt;a href="https://rogue-scholar.org/" target="_blank">Rogue Scholar&lt;/a> to automate the assignment of DOIs to, and the archiving of, posts on this very blog.&lt;/li>
&lt;li>Planning for the inaugural Crossref Metadata Awards - join our upcoming &lt;a href="https://www.crossref.org/events/crossref-community-call-2025/" target="_blank">community call&lt;/a> on 7 May to find out what this is all about.&lt;/li>
&lt;/ul>
&lt;p>In the coming months, we are hoping to tackle the following:&lt;/p>
&lt;ul>
&lt;li>Kick off a project to review the information architecture of this website and look into how we can make our documentation and related information more helpful and easier to navigate.&lt;/li>
&lt;li>Expand the record registration form for journal articles to allow easy editing of previously submitted records. This will allow us to sunset the long-deprecated Metadata Manager tool, as was first &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">announced&lt;/a> in 2021.&lt;/li>
&lt;li>Begin building new record registration forms for more work types. Watch this space.&lt;/li>
&lt;li>Explore options for supporting the integration of additional software systems with Crossref, building on our existing approach with OJS plugins, with a focus on open-source tools relied upon by our members for registering metadata.&lt;/li>
&lt;li>Restore faceted search on &lt;a href="https://search.crossref.org/" target="_blank">Crossref Metadata Search&lt;/a>. This feature &lt;a href="https://community.crossref.org/t/some-changes-to-crossref-metadata-search-search-crossref-org/2529" target="_blank">was disabled in 2022&lt;/a> following intermittent performance issues. We believe recent improvements to Metadata Search will allow us to bring some filters back, although we will need to start small so as not to overload our systems with these more complex queries.&lt;/li>
&lt;/ul>
&lt;h2 id="contributing-to-the-research-nexus-crn">Contributing to the Research Nexus (CRN)&lt;/h2>
&lt;p>The research nexus is a rich and reusable open network that represents scholarly activity. It consists of connections between research organisations, people, things, and actions; it’s an evolving model of the scholarly record that the global community can build on forever for the benefit of society.&lt;/p>
&lt;p>Our metadata is already a contribution to the research nexus, however, there is much more we would like to do. Our next steps will be to consolidate our existing data and services, and build the technical capacity, partnerships, and knowledge to enhance our contribution with new relationships. Some parts of our data storage and workflows don’t yet have the flexibility to fully capture all types of research objects and how they are connected.&lt;/p>
&lt;p>To support this process, the main priorities in the program are:&lt;/p>
&lt;ol>
&lt;li>Collaborate with our community. We want to get to know users of our metadata better and work more collaboratively alongside them. Also, we seek partners to contribute new data sources that will enhance our metadata with additional relationships.&lt;/li>
&lt;li>Share the research nexus vision. We know that we aren’t alone in developing the research nexus, so we will reach out to others with a similar vision and identify where we have common goals.&lt;/li>
&lt;li>Maintain our technology. We have already identified technical improvements we can make to our REST API, and we need to keep on top of monitoring and fixing bugs. We also need to build capacity for new types of data and relationships. Our other endpoints, such as the XML API and forwardlinks (for citations), need maintenance and are likely to be affected by a planned redesign of our core architecture.&lt;/li>
&lt;li>Building a new matching service. Identifying relationships between metadata records is a key part of the research nexus. We have already improved reference matching over the years, and we’re looking to implement funding, affiliations, and version matching next. We’ve carried out &lt;a href="https://www.crossref.org/categories/metadata-matching/" target="_blank">research&lt;/a> on several types of matching and are looking at building a new service to handle it in production.&lt;/li>
&lt;/ol>
&lt;p>In the first quarter of 2025, we’ve been working on:&lt;/p>
&lt;ul>
&lt;li>Schema changes, making the first significant updates to our schema for several years, including adding the capacity for depositing ROR IDs for funding organisations in funding metadata.&lt;/li>
&lt;li>Delivering Retraction Watch retractions via the REST API, integrated with member-supplied retraction/correction data.&lt;/li>
&lt;li>Getting the community involved and understanding needs, planning a &lt;a href="https://www.crossref.org/events/metadata-sprint/" target="_blank">sprint&lt;/a> and various workshops.&lt;/li>
&lt;li>Plenty of under-the-hood updates to the REST API, and more significant upgrades to come later this year.&lt;/li>
&lt;/ul>
&lt;p>Next up, we will:&lt;/p>
&lt;ul>
&lt;li>Plan and build out the new matching service.&lt;/li>
&lt;li>Improve representation of some metadata in the REST API, including Crossref members, journals, and typed citations such as data citations.&lt;/li>
&lt;li>Update the grants schema to extend the award types and respond to new funder member requests&lt;/li>
&lt;li>Add contributor roles to the schema, including CRediT.&lt;/li>
&lt;li>Ask our community about metadata retrieval, including the various APIs and the Metadata Plus subscription service.&lt;/li>
&lt;li>Upgrade elements of the REST API and optimise the underlying technical infrastructure.&lt;/li>
&lt;/ul>
&lt;h2 id="open-and-sustainable-operations-oso">Open and Sustainable Operations (OSO)&lt;/h2>
&lt;p>The OSO program is centered on transparency and sustainability of our technical systems and our business and people operations. We focus on maintaining critical systems and operations and ensuring their security, addressing technical and operational debt, and controlling or reducing costs - to Crossref, our community, or the environment. We’re always keen to tackle projects to automate repetitive and manual tasks – of which we have many – and pay down technical debt, being as open and transparent as possible along the way.&lt;/p>
&lt;p>Our most recently completed work includes:&lt;/p>
&lt;ul>
&lt;li>Moving from Oracle to an open-source database, PostgreSQL. This work aligns with the POSI principles and sets us up for a more robust, reliable, and modern infrastructure.&lt;/li>
&lt;li>Implementing metadata schema changes for deposit submission and processing, so we can now accept &lt;a href="https://doi.org/10.13003/156081" target="_blank">ROR IDs in funding metadata&lt;/a>, as well as the changes in &lt;a href="https://doi.org/10.13003/325070" target="_blank">latest schema version&lt;/a> (5.4.0) which includes the new ability to label references with a type (such as dataset, software, blog post, article, etc.).&lt;/li>
&lt;li>Automating parts of the process to keep &lt;a href="https://www.crossref.org/community/sponsors/" target="_blank">Sponsor information&lt;/a> on our website up to date and make it easier to search, so our community can find relevant and accurate information about our Sponsors and how to work with them, and our membership team spends less time keeping the website current.&lt;/li>
&lt;/ul>
&lt;p>Ongoing work in our program includes:&lt;/p>
&lt;ul>
&lt;li>Moving from a physical data center into the cloud (AWS). The PostgreSQL migration was the first step needed to enable our move to the cloud, which will allow us to operate more sustainably and efficiently.&lt;/li>
&lt;li>Automating new member setup in our systems, which is largely a manual process now.&lt;/li>
&lt;/ul>
&lt;p>And coming up are:&lt;/p>
&lt;ul>
&lt;li>Making changes in our core system to accept the upcoming 5.5 metadata schema version.&lt;/li>
&lt;li>Extracting billing code from our main codebase, to set up as its own service. This will allow us to simplify our code and make it easier to maintain. We’ll also be implementing the changes to billing enacted as part of the &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/" target="_blank">Resourcing Crossref for Future Sustainability&lt;/a> program (TBD!).&lt;/li>
&lt;li>Holding a “systems workshop” in April, to understand how our current system(s) are and aren’t meeting staff, member, and community needs, and how we might go about building the open, sustainable Crossref system of the future.&lt;/li>
&lt;/ul>
&lt;h2 id="what-have-we-learned-so-far">What have we learned so far?&lt;/h2>
&lt;h3 id="internal-communication">Internal communication&lt;/h3>
&lt;p>One of the reasons to implement a programs approach was to improve internal communication across the organisation. With all teams being represented on all steering groups, everyone is in the loop when decisions are taken. We see that this way, people feel more connected to the strategic agenda and, importantly, the ‘why’ is clearer to people. It is easier to get perspectives from across the organisation because contributing to these conversations is now part of people’s day jobs and so it’s easier to ask for their time. We are still looking to improve how we facilitate group discussions and decision-making to ensure we make the most of the program steering groups.&lt;/p>
&lt;h3 id="planning-and-delivery">Planning and delivery&lt;/h3>
&lt;p>Working closely with people from across the organisation has helped with more effective planning. A closer collaboration between program leads and developers makes the delivery of new features and functionality more accurate and predictable. With the community and support teams also being part of the conversation, they can plan related comms and support/documentation efforts in a timely manner. So far, it has also been easier to get more things delivered. We have some big projects coming up this year that will be a good test for the programs approach!&lt;/p>
&lt;h3 id="cross-cutting-topics">Cross-cutting topics&lt;/h3>
&lt;p>The implementation of a cross-functional approach facilitates discussions around cross-cutting topics, but also leads to the question of how cross-cutting topics fit within a specific program! Maybe you already noticed that work on metadata schema 5.4 and the planned work on 5.5 is included under both Contributing to the Research Nexus and Open and Sustainable Operations in the update above. Because metadata development impacts many of our systems, work was needed within all programs to enable these changes - the input, the output and the interfaces. Later this year, we’re planning to share some visuals that better explain which projects sit with which program and how we deal with cross-cutting topics.&lt;/p>
&lt;h3 id="alignment">Alignment&lt;/h3>
&lt;p>One of the most important things for the approach to be successful is that people are bought in and willing to participate and communicate. For cross-organisational alignment, a culture needs to be in place (or developed) where people are willing to collaborate and be open and transparent about their work. In a practical sense, we are still looking at how we can better align our code bases with the current programs so that it is easier to develop the relevant expertise within the programs.&lt;/p>
&lt;p>We hope to see many of you at our upcoming &lt;a href="https://www.crossref.org/events/crossref-community-call-2025/">community call on 7 May&lt;/a>. Please register to join as we discuss some of the work included in this update.&lt;/p></description></item><item><title>Version 5.4.0 metadata schema update now available</title><link>https://www.crossref.org/blog/version-5.4.0-metadata-schema-update-now-available/</link><pubDate>Wed, 19 Mar 2025 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/version-5.4.0-metadata-schema-update-now-available/</guid><description>&lt;p>This year, metadata development is one of our key priorities and we’re making a start with the release of version 5.4.0 of our input schema with some long-awaited changes. This is the first in what will be a series of metadata schema updates.&lt;/p>
&lt;h2 id="what-is-in-this-update">What is in this update?&lt;/h2>
&lt;h3 id="publication-typing-for-citations">Publication typing for citations&lt;/h3>
&lt;p>This is fairly simple; we’ve added a ‘type’ attribute to the citations members supply. This means you can identify a journal article citation as a journal article, but more importantly, you can identify a dataset, software, blog post, or other citation that may not have an identifier assigned to it. This makes it easier for the many thousands of metadata users to connect these citations to identifiers. We know many publishers, particularly journal publishers, do collect this information already and will consider making this change to deposit citation types with their records.&lt;/p>
&lt;h3 id="support-for-version-numbering">Support for version numbering&lt;/h3>
&lt;p>Members can now supply a version number across all relevant record types, including journal articles, book chapters, conference papers, posted content/preprints, datasets, reports, standards, and dissertations. The versioning update also includes an optional description field.&lt;/p>
&lt;p>Members who version content are encouraged to register a new DOI with each version and supply the &lt;code>isVersionOf&lt;/code>’ relationship to connect versions to each other, facilitating the Research Nexus and allowing members to avoid additional content registration fees, which don&amp;rsquo;t apply for versions.&lt;/p>
&lt;h3 id="preprint-status">Preprint status&lt;/h3>
&lt;p>This is specific to the &amp;lsquo;posted content&amp;rsquo; record type and comes as a result of the &lt;a href="https://doi.org/10.31222/osf.io/qzusj" target="_blank">recommendations&lt;/a> of the Preprints Advisory Group. The new status field allows repositories to flag a preprint as ‘withdrawn’ or ‘removed,’ a situation specific to posted content.&lt;/p>
&lt;p>There are some other minor updates as well, including:&lt;/p>
&lt;ul>
&lt;li>An expansion of the language codes supported by a language attribute.&lt;/li>
&lt;li>Additions to the archive locations we collect. Our membership terms ask members to archive their content where possible, ensuring their DOIs are able to resolve to the content persistently, and we ask that the archive(s) they use are identified in the metadata records registered with us.&lt;/li>
&lt;li>We’ve increased the number of ISBNs supported per item from 6 to 100.&lt;/li>
&lt;/ul>
&lt;p>If you would like to begin using this schema, a brief transition guide is &lt;a href="https://docs.google.com/document/d/16peqiXX66l9w-VCsieiuNjzrXGNchNAhy9g7Q4dtlpA/edit" target="_blank">here&lt;/a>. A full set of schema files are in our &lt;a href="https://gitlab.com/crossref/schema/-/releases/0.3.3" target="_blank">GitLab repository,&lt;/a> and more information is available in our &lt;a href="https://www.crossref.org/documentation/schema-library/metadata-deposit-schema-5-4-0/">website documentation for schema 5.4.0&lt;/a>.&lt;/p>
&lt;h2 id="whats-next">What’s next?&lt;/h2>
&lt;p>We’ve already begun working on our next update, which will be an expansion of contributor roles. We’ll allow multiple contributor roles instead of the single role we currently support, we’ll add ‘corresponding author’ and ‘other’ to the Crossref role vocabulary. We will be also adding full support for &lt;a href="https://credit.niso.org/" target="_blank">CRediT&lt;/a>.&lt;/p>
&lt;p>We&amp;rsquo;re also hoping to fit in a remodeling of our group contributor (currently labeled ‘organisation’ in our input schema) in the next update, and I would appreciate &lt;a href="https://docs.google.com/document/d/1NJlNS2DqlWgns-2pdNQ3xALmQasBFrujHgViu5t5Lz0/edit" target="_blank">feedback on this planned update&lt;/a>.&lt;/p>
&lt;p>More changes are planned, including an update to our grants schema, and expanded support for abstracts. We’ll be circulating details about those updates soon.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/events/crossref-community-call-2025/">Join us for the Mid-year Community Call on 7th May to hear more&lt;/a>!&lt;/p></description></item><item><title>2025 public data file now available</title><link>https://www.crossref.org/blog/2025-public-data-file-now-available/</link><pubDate>Wed, 12 Mar 2025 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/2025-public-data-file-now-available/</guid><description>&lt;p>Every year we release metadata for the full corpus of records registered with us, which can be downloaded for free in a single compressed file. This is one way in which we fulfil our mission to make metadata freely and widely available. By including the metadata of over 165 million research outputs from over 22,000 members worldwide and making them available in a standard format, we streamline access to metadata about scholarly objects such as journal articles, books, conference papers, preprints, research grants, standards, datasets, reports, blogs, and more.&lt;/p>
&lt;p>Our metadata is used by thousands of services, researchers, and other organisations. We make it openly available &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/">through our APIs&lt;/a>, which can be used to obtain a subset of records. If you want to work with our full corpus, the best way is to get a copy of the public data file and update it via the &lt;a href="https://api.crossref.org" target="_blank">REST API&lt;/a> with any new records created or changed since its release.&lt;/p>
&lt;p>By providing an annual copy of the full corpus, we also expand the ways in which the metadata can be used and interrogated. It is ideal for groups using large samples of the scholarly record, such as metaresearchers or research integrity experts. You can find examples of the public data file used in &lt;a href="https://doi.org/10.5281/zenodo.14766414" target="_blank">research on journal editorial practices&lt;/a> and in &lt;a href="https://doi.org/10.5281/zenodo.4734512" target="_blank">projects investigating gaps in the scholarly record&lt;/a>.&lt;/p>
&lt;h2 id="how-to-access-the-public-data-file">How to access the public data file&lt;/h2>
&lt;p>The total size of the file is 197 GB and it is available in JSON-lines format. We also provide an experimental tool to convert the file &lt;a href="https://gitlab.com/crossref/labs/dois2sqlite" target="_blank">to an Sqlite database&lt;/a>. Before downloading the full dataset, you may wish to download the sample dataset containing 100 files (with 100 records in each, around 24 MB). This is a randomly sampled subset of metadata records and can be used for prototyping and development.&lt;/p>
&lt;p>To get a copy of the annual data file you can access it directly via &lt;a href="https://doi.org/10.13003/87bfgcee6g" target="_blank">https://doi.org/10.13003/87bfgcee6g&lt;/a>, or get the sample dataset and previous public data files from &lt;a href="https://academictorrents.com/browse.php?search=Crossref" target="_blank">Academic Torrents&lt;/a>. We make a donation to Academic Torrents to support their work, which allows the data to be accessible in this way. Some organisations have reported policies that prevent access to torrents, so we provide a copy that can be downloaded from AWS, which requires an AWS account and a small payment to cover the data transfer costs. You can find the details about access &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-public-data-files-and-plus-snapshots/" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>We have some &lt;a href="https://www.crossref.org/learning/public-data-file/">tips for working with the public data file&lt;/a>. If you would like to have access to monthly snapshots of the whole corpus, along with higher API rate limits and other benefits, you can subscribe to &lt;a href="https://www.crossref.org/documentation/metadata-plus/">Metadata Plus&lt;/a>.&lt;/p>
&lt;h2 id="whats-different-this-year">What’s different this year?&lt;/h2>
&lt;p>This year&amp;rsquo;s public data file contains an additional 9 million records, and many updates to previously deposited records. The formats and method of access are the same as last year, except that it uses JSON lines, meaning that each metadata record is on a single line and the file suffix is jsonl instead of json. The records have been sorted by DOI, meaning it should be easier to navigate.&lt;/p>
&lt;p>A change this year is that the file does not contain aliased DOIs, which are DOI that are redirected to another DOI. Aliasing is necessary on rare occasions, for example when two DOIs are registered for the same content. Previously we haven’t indicated aliasing in the REST API and public data files; this year only the prime DOIs (the ones to which they are redirected) are included. This makes statistical analysis of the metadata more accurate, but beware that it may give different results in cases where many aliased DOIs were previously counted. See &lt;a href="https://community.crossref.org/t/adding-redirects-for-aliased-dois-in-the-rest-api/13138" target="_blank">this community forum post&lt;/a> for more details.&lt;/p>
&lt;p>The file also contains retractions from the Retraction Watch database, which was &lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">acquired by Crossref in September 2023&lt;/a> and recently &lt;a href="https://doi.org/10.13003/692016" target="_blank">integrated into the REST API&lt;/a>.&lt;/p>
&lt;p>If you have questions, want to let us know how you will use the metadata, or want to discuss anything on the topic of retrieving Crossref metadata, head to our &lt;a href="https://community.crossref.org/c/metadata-retrieval/27" target="_blank">community forum&lt;/a>. From there, you can also keep updated about changes to our schema and APIs.&lt;/p></description></item><item><title>Come ROR with us: Using ROR IDs in place of Funder IDs</title><link>https://www.crossref.org/blog/come-ror-with-us-using-ror-ids-in-place-of-funder-ids/</link><pubDate>Wed, 05 Mar 2025 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/come-ror-with-us-using-ror-ids-in-place-of-funder-ids/</guid><description>&lt;p>Today, we&amp;rsquo;re delighted to let you know that Crossref members can now use ROR IDs to identify funders in any place where you currently use Funder IDs in your metadata. Funder IDs remain available, but this change allows publishers, service providers, and funders to streamline workflows and introduce efficiencies by using a single open identifier for both researcher affiliations and funding organisations.&lt;/p>
&lt;p>As you probably know, the &lt;a href="https://ror.org" target="_blank">Research Organisation Registry (ROR)&lt;/a> is a global, community-led, carefully curated registry of open persistent identifiers for research organisations, including funding organisations. It’s a joint initiative led by the California Digital Library, Datacite and Crossref launched in 2019 that fulfills the long-standing need for an open organisation identifier.&lt;/p>
&lt;p>In 2023, we shared our plan to &lt;a href="https://doi.org/10.64000/v3429-p7810" target="_blank">transition the Open Funder Registry into ROR&lt;/a>. More recently, we announced that we were planning to &lt;a href="https://doi.org/10.64000/6tzsa-7dj24" target="_blank">update our schema so that it is possible to collect ROR IDs where we currently collect Funder IDs&lt;/a> such as in the funding metadata section for works and funder section for grants. Now that we have completed this work, Crossref members can start depositing ROR IDs where they would normally deposit Funder IDs. This update also means that the community, including funders, service providers, researchers, and data scientists can retrieve this metadata &lt;a href="https://api.crossref.org/works?filter=has-ror-id:true" target="_blank">via our API&lt;/a>.&lt;/p>
&lt;p>So come and ROR with us and start depositing ROR IDs for both researcher affiliations and funding organisations.&lt;/p>
&lt;h2 id="open-funder-registry-ror-transition">Open Funder Registry-ROR transition&lt;/h2>
&lt;p>This is of course a significant first step in the Open Funder Registry to ROR transition.&lt;/p>
&lt;p>We’ve &lt;a href="https://ror.readme.io/docs/funder-registry" target="_blank">always said&lt;/a> that we would continue supporting Funder IDs in our schema and in our tools and services until the community is ready to transition - and we will. In the last year, Crossref and ROR conducted a series of Open Funder Registry user interviews to help us understand how it was being used and identify practical challenges to this transition in our members’ workflow (thank you to those who took part, it was incredibly useful!).&lt;/p>
&lt;p>One major takeaway from this consultation was around the pivotal role that peer review management systems played in the Open Funder Registry-ROR transition. We look forward to seeing more service providers integrating with ROR in the future. If you are a service provider and are ready to integrate with ROR, drop &lt;a href="mailto:support@ror.org">support@ror.org&lt;/a> an email.&lt;/p>
&lt;h2 id="including-ror-ids-in-crossref-metadata">Including ROR IDs in Crossref metadata&lt;/h2>
&lt;p>If you are ready to begin including ROR IDs in your funding metadata, you only need to include the ROR itself to identify a funder.&lt;/p>
&lt;p>For example:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;fr:program&lt;/span> &lt;span class="na">name=&lt;/span>&lt;span class="s">&amp;#34;fundref&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;lt;fr:assertion&lt;/span> &lt;span class="na">name=&lt;/span>&lt;span class="s">&amp;#34;ror&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>https://ror.org/00fq5cm18&lt;span class="nt">&amp;lt;/fr:assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;lt;fr:assertion&lt;/span> &lt;span class="na">name=&lt;/span>&lt;span class="s">&amp;#34;award_number&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>10.3030/725840&lt;span class="nt">&amp;lt;/fr:assertion&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/fr:program&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Examples of more complex combinations of funding information are available in our &lt;a href="https://www.crossref.org/documentation/funder-registry/funding-data-overview/" target="_blank">documentation&lt;/a>. This update has been made across all schema that support funding metadata.&lt;/p>
&lt;p>Our grants schema has recently been updated to &lt;a href="https://www.crossref.org/documentation/schema-library/grants-schema/" target="_blank">version 0.2.0&lt;/a> to support ROR IDs in place of funder identifiers as well. As with funding metadata, only the ROR ID needs to be supplied within the record:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;funding&lt;/span> &lt;span class="na">amount=&lt;/span>&lt;span class="s">&amp;#34;750&amp;#34;&lt;/span> &lt;span class="na">currency=&lt;/span>&lt;span class="s">&amp;#34;USD&amp;#34;&lt;/span> &lt;span class="na">funding-percentage=&lt;/span>&lt;span class="s">&amp;#34;75&amp;#34;&lt;/span> &lt;span class="na">funding-type=&lt;/span>&lt;span class="s">&amp;#34;APC&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;lt;ROR&amp;gt;&lt;/span>https://ror.org/02twcfp32https://ror.org/02twcfp32&lt;span class="nt">&amp;lt;/ROR&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;lt;funding-scheme&amp;gt;&lt;/span>Sofa Lending Programme&lt;span class="nt">&amp;lt;/funding-scheme&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/funding&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Although previously a funder name was collected with the funder identifier, for both grants records and funding data in an attempt to avoid redundant, incorrect or conflicting metadata, now we’re accepting an identifier only as the ROR ID has an existing metadata record. The organisation name exists within the record in the &lt;a href="https://ror.org/search" target="_blank">ROR registry&lt;/a> and the ROR record is the authoritative source of the name.&lt;/p>
&lt;h2 id="ror-ids-in-json-outputs">ROR IDs in JSON outputs&lt;/h2>
&lt;p>We have an existing legacy practice of representing Open Funder Registry IDs as just a DOI, but ROR IDs are represented in the JSON outputs as a full URL with id-type “ROR”, for example:&lt;/p>
&lt;p>&lt;strong>Funding metadata&lt;/strong>&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="s2">&amp;#34;funder&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;award&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="s2">&amp;#34;10.3030/725840&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">],&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://ror.org/02twcfp32&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id-type&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;ROR&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;asserted-by&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;publisher&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">]&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">]&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>&lt;strong>Grant funder information&lt;/strong>&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;funding&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;type&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;infrastructure&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;award-amount&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;amount&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="mf">750.0&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;currency&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;USD&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;percentage&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="mi">75&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">},&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;funder&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://ror.org/02twcfp32&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id-type&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;ROR&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;asserted-by&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;publisher&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">]&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">]&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">],&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>If you have any questions or feedback, get in touch with us &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> !&lt;/p></description></item><item><title>The GEM program - Year Two 2024</title><link>https://www.crossref.org/blog/the-gem-program-year-two-2024/</link><pubDate>Thu, 27 Feb 2025 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/the-gem-program-year-two-2024/</guid><description>&lt;p>We began our &lt;a href="https://www.crossref.org/gem/">Global Equitable Membership (GEM) Program&lt;/a> to provide greater membership equitability and accessibility to organisations in the world&amp;rsquo;s least economically advantaged countries. Eligibility for the program is based on a member&amp;rsquo;s country; our list of countries is predominantly based on the &lt;a href="https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups" target="_blank">International Development Association (IDA)&lt;/a>. Eligible members pay no membership or content registration fees. The list undergoes periodic reviews, as countries may be added or removed over time as economic situations change.&lt;/p>
&lt;p>The program began in January 2023 with 214 existing members; and 131 more joined throughout the year. In 2024, we saw 127 organisations joining via the GEM program, bringing the total number of participants to 458. We welcomed our first-ever members from Sierra Leone and Honduras, as well as our first Sponsor in Bangladesh (&lt;a href="https://www.crossref.org/membership/about-sponsors/">Sponsors&lt;/a> are organisations that work with us to provide administrative, billing, technical, and local language support to the members they work with).&lt;/p>
&lt;p>Of 458 organisations participating in the GEM program, 380 are independent members, 77 are sponsored, and there is one sponsoring organisation. To date, these members have contributed over 279,000 works to the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>, our concept of a fully connected global scholarly ecosystem.&lt;/p>
&lt;p>Though we have Sponsors based elsewhere, working with members who are in GEM countries (e.g. PKP), we will continue to consult with &lt;a href="https://www.crossref.org/community/our-ambassadors/">our ambassadors&lt;/a> and other partners to identify potential new sponsors that are based in GEM countries.&lt;/p>
&lt;h2 id="number-of-crossref-gem-members-by-country">Number of Crossref GEM members by country:&lt;/h2>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;table>
&lt;thead>
&lt;tr>
&lt;th>GEM Country (Alphabetically)&lt;/th>
&lt;th>Total No. &lt;br> of Members&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Afghanistan&lt;/td>
&lt;td>17&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bangladesh&lt;/td>
&lt;td>120&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Benin&lt;/td>
&lt;td>5&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bhutan&lt;/td>
&lt;td>6&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burkina Faso&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burundi&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Cambodia&lt;/td>
&lt;td>8&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Central African Republic&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Congo, Democratic Republic&lt;/td>
&lt;td>15&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ethiopia&lt;/td>
&lt;td>13&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ghana&lt;/td>
&lt;td>27&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guyana&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Haiti&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Honduras&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kosovo&lt;/td>
&lt;td>8&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kyrgyz Republic&lt;/td>
&lt;td>23&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Lao, People&amp;rsquo;s Democratic Republic&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Madagascar&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Malawi&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;table>
&lt;thead>
&lt;tr>
&lt;th>GEM Country (Alphabetically)&lt;/th>
&lt;th>Total No. &lt;br>of Members&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Maldives&lt;/td>
&lt;td>3&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mali&lt;/td>
&lt;td>3&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mauritania&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mozambique&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Myanmar&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Nepal&lt;/td>
&lt;td>50&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Nicaragua&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Rwanda&lt;/td>
&lt;td>7&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Senegal&lt;/td>
&lt;td>7&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sierra Leone&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Somalia&lt;/td>
&lt;td>9&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sri Lanka&lt;/td>
&lt;td>14&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sudan&lt;/td>
&lt;td>19&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tajikistan&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tanzania, United Republic of&lt;/td>
&lt;td>21&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Togo&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Uganda&lt;/td>
&lt;td>17&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Yemen&lt;/td>
&lt;td>30&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Zambia&lt;/td>
&lt;td>5&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;/div>
&lt;/div>
&lt;h2 id="number-of-crossref-members-in-gem-program-countries">Number of Crossref members in GEM Program Countries&lt;/h2>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2025/V2map-gem-program-countries-2025.png"
alt="screenshot of mapy showing membership density in GEM Program countries." width="75%">
&lt;/figure>
&lt;/div>
&lt;p>We are excited about our in-person event taking place in a few weeks in Accra, Ghana, as a direct result of the increasing participation and interest in Crossref from the region.&lt;/p>
&lt;p>We can see a clear connection between outreach activities conducted by us and our Ambassadors and the increase in awareness and the number of members joining from related countries. These were Bangladesh, Nepal, Uganda, and Tanzania in 2023, and Ghana, Zambia, Sri Lanka, and Tanzania in 2024.&lt;/p>
&lt;p>From our Ambassadors’ activities in the GEM countries, some recurring questions emerged highlighting barriers to joining Crossref. It’s important to recognise that many institutions struggle with funding and technical expertise. It’s no surprise that they are often concerned with the maintenance of their membership over the long term. We emphasize that GEM is a sustained measure to accommodate knowledge-sharing organisations from the regions of financial strain. Whilst the program addresses the costs of membership and content registration, our Ambassadors can assist further, offering technical support with record registration, metadata best practices, and integrating Crossref services with existing systems, including Open Journal Systems (OJS); and discuss how registering metadata improves research visibility.&lt;/p>
&lt;p>We are grateful to our Ambassadors for directly supporting the GEM program within their countries through webinars and presenting in person at conferences: Shaharima Parvin and MD Jahangir in Bangladesh, Richard Bruce Lamptey in Ghana, Niranjan Koirala in Nepal, Oumy Ndiaye in Senegal, Lasith Gunawardena in Sri Lanka, and Baraka Manjale Ngussa in Tanzania.&lt;/p></description></item><item><title>Retraction Watch retractions now in the Crossref API</title><link>https://www.crossref.org/blog/retraction-watch-retractions-now-in-the-crossref-api/</link><pubDate>Wed, 29 Jan 2025 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/retraction-watch-retractions-now-in-the-crossref-api/</guid><description>&lt;p>Retractions and corrections from Retraction Watch are now available in Crossref’s REST API. Back in September 2023, we announced the acquisition of the Retraction Watch database with an ongoing shared service. Since then, they have sent us regular updates, which are publicly available as a &lt;a href="https://gitlab.com/crossref/retraction-watch-data" target="_blank">csv file&lt;/a>. Our aim has always been to better integrate these retractions with our existing metadata, and today we’ve met that goal.&lt;/p>
&lt;p>This is the first time we have supplemented our metadata with a third-party data source. Until now, our APIs have included metadata provided by Crossref members along with outputs from our internal enrichment workflows, such as matches found for bibliographic reference matching and funders. Third party metadata has been gathered in Event Data, but this has been stored and delivered separately.&lt;/p>
&lt;p>Knowing when work has been retracted is critical for assessing the integrity of research, and this enhancement of the data will be a great benefit to the community.&lt;/p>
&lt;h2 id="where-does-the-data-come-from">Where does the data come from?&lt;/h2>
&lt;p>Retraction Watch carefully curates retractions, pulling them from several non-Crossref sources, including PubMed and publisher websites. Each entry is manually checked and annotated before being added to the database. The high level of curation and broad coverage is what made a partnership between Crossref and Retraction Watch attractive, and our shared goal of making changes to metadata more visible.&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;Our goal with the Retraction Watch Database has always been for it to be as useful to as many people as possible, and available from as many sources as possible,” says Ivan Oransky, co-founder of Retraction Watch and executive director of The Center For Scientific Integrity, its parent nonprofit organisation. “Integration with Crossref’s REST API is a huge step in that direction.”&lt;/p>
&lt;/blockquote>
&lt;h2 id="where-can-i-see-the-retractions">Where can I see the retractions?&lt;/h2>
&lt;p>If you use a service that collects Crossref metadata, you will start to see the Retraction Watch retractions as they are picked up. To access the data directly, you can find retractions from both Crossref members and Retraction Watch in our REST API, for example with the following request for all retractions:&lt;/p>
&lt;p>&lt;a href="https://api.crossref.org/v1/works?filter=update-type:retraction" target="_blank">https://api.crossref.org/v1/works?filter=update-type:retraction&lt;/a>&lt;/p>
&lt;p>Or for an individual record:&lt;/p>
&lt;p>&lt;a href="https://api.crossref.org/v1/works/10.1177/17588359231172420" target="_blank">https://api.crossref.org/v1/works/10.1177/17588359231172420&lt;/a>&lt;/p>
&lt;p>In the results here you will see an &lt;code>update-to&lt;/code> field:&lt;/p>
&lt;pre tabindex="0">&lt;code>&amp;#34;update-to&amp;#34;: [
{
&amp;#34;updated&amp;#34;: {
&amp;#34;date-parts&amp;#34;: [
[2023,4,22]
],
&amp;#34;date-time&amp;#34;: &amp;#34;2023-04-22T00:00:00Z&amp;#34;,
&amp;#34;timestamp&amp;#34;: 1682121600000
},
&amp;#34;DOI&amp;#34;: &amp;#34;10.1177/1758835920922055&amp;#34;,
&amp;#34;type&amp;#34;: &amp;#34;retraction&amp;#34;,
&amp;#34;source&amp;#34;: &amp;#34;publisher&amp;#34;,
&amp;#34;label&amp;#34;: &amp;#34;Retraction&amp;#34;
},
{
&amp;#34;updated&amp;#34;: {
&amp;#34;date-parts&amp;#34;: [
[2023,4,22]
],
&amp;#34;date-time&amp;#34;: &amp;#34;2023-04-22T00:00:00Z&amp;#34;,
&amp;#34;timestamp&amp;#34;: 1682121600000
},
&amp;#34;DOI&amp;#34;: 10.1177/17588359231172420&amp;#34;,
&amp;#34;type&amp;#34;: &amp;#34;retraction&amp;#34;,
&amp;#34;source&amp;#34;: &amp;#34;retraction-watch&amp;#34;,
&amp;#34;label&amp;#34;: &amp;#34;Retraction&amp;#34;,
&amp;#34;record-id&amp;#34;: 44124
}
]
&lt;/code>&lt;/pre>&lt;p>The &lt;code>source&lt;/code> field states where the retraction came from. Currently, it can have two values: &lt;code>publisher&lt;/code> or &lt;code>retraction-watch&lt;/code>. Note that the same retraction may be included multiple times from different sources.&lt;/p>
&lt;p>Retraction Watch retractions will remain available &lt;a href="https://gitlab.com/crossref/retraction-watch-data" target="_blank">on Gitlab in csv format&lt;/a> and be updated on working days. The &lt;code>record-id&lt;/code> refers to the entry in the csv file with further details, such as the reason for retraction.&lt;/p>
&lt;p>There is &lt;a href="https://api.crossref.org/swagger-ui/index.html" target="_blank">full documentation available for the Crossref REST API&lt;/a> and if you are new to REST APIs, see our &lt;a href="https://www.crossref.org/learning/" target="_blank">learning hub&lt;/a> to get started which includes &lt;a href="https://crossref.gitlab.io/tutorials/get-rw-metadata/" target="_blank">a tutorial&lt;/a> about accessing retractions.&lt;/p>
&lt;h2 id="what-can-i-do-with-the-retractions">What can I do with the retractions?&lt;/h2>
&lt;p>Like the rest of our metadata, the retractions are freely available. If you use or operate a tool that ingests retractions, the new entries will start to be picked up immediately. The Retraction Watch database includes a larger number of retractions than the Crossref database, so you should see an increase in the total.&lt;/p>
&lt;p>We have heard from organisations that would like to build new research integrity tools based on this data. We look forward to seeing the benefits brought by wider availability of the Retraction Watch retractions, and how they can provide better context to research outputs.&lt;/p>
&lt;p>While Crossref metadata is freely available to reuse without a license, if you make use of the Retraction Watch retraction metadata in a published work, we kindly request that you provide a citation to the source.&lt;/p>
&lt;p>If you have questions or comments, please head over to the &lt;a href="https://community.crossref.org/c/strategy/research-integrity/46" target="_blank">section of our forum&lt;/a> dedicated to integrity of the scholarly record.&lt;/p></description></item><item><title>POSI 2.0 feedback</title><link>https://www.crossref.org/blog/posi-2.0-feedback/</link><pubDate>Tue, 28 Jan 2025 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/posi-2.0-feedback/</guid><description>&lt;p>As a provider of foundational open scholarly infrastructure, Crossref is an adopter of the &lt;a href="https://openscholarlyinfrastructure.org" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a>. In December 2024 we posted our &lt;a href="https://doi.org/10.64000/7ybx5-m7924" target="_blank">updated POSI self-assessment&lt;/a>. POSI provides an invaluable framework for transparency, accountability, susatinability and community alignment. There are 21 other &lt;a href="https://openscholarlyinfrastructure.org/posse/" target="_blank">POSI adopters&lt;/a>.&lt;/p>
&lt;p>Together, we are now undertaking a public consultation on proposed revisions for a version 2.0 release of the principles, which would update the current &lt;a href="https://doi.org/10.24343/C34W2H" target="_blank">version 1.1 of the principles, released in November 2023&lt;/a>.&lt;/p>
&lt;p>This is a crucial step in ensuring that POSI evolves to meet the needs of the community. Whether you are part of an organisation that has adopted POSI, is considering adoption, interacts with POSI-aligned groups, or you have a personal interest in open scholarly infrastructure, your perspective is invaluable.&lt;/p>
&lt;h2 id="some-additional-context-about-posi">Some additional context about POSI&lt;/h2>
&lt;ul>
&lt;li>
&lt;p>POSI is not an organisation; POSI adopters are an informal group of those that have conducted self-assessments.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The POSI principles are not rules or a checklist; organisations or groups can adopt or interpret them to fit many different circumstances.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Our goal is for POSI self-assessments to be made publicly available and for interested communities to assess and monitor updates and progress.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="how-to-participate">How to Participate&lt;/h2>
&lt;p>If your organisation has adopted POSI, is considering adoption, interacts with POSI-aligned groups, or you have a personal interest in open scholarly infrastructure, your perspective is invaluable.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Review the &lt;a href="https://openscholarlyinfrastructure.org/public-comment-v2/" target="_blank">Proposed POSI 2.0 Revisions&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Share your thoughts via our &lt;a href="https://forms.gle/4KkaRJoar6KjrsbW7" target="_blank">short survey&lt;/a>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Deadline: March 5, 2025&lt;/p>
&lt;p>Together, we can shape the future of open scholarly infrastructure. Join the conversation and make your voice heard!&lt;/p></description></item><item><title>Metadata matching: beyond correctness</title><link>https://www.crossref.org/blog/metadata-matching-beyond-correctness/</link><pubDate>Wed, 08 Jan 2025 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/metadata-matching-beyond-correctness/</guid><description>&lt;p>In our &lt;a href="https://doi.org/10.13003/ief7aibi" target="_blank">previous entry&lt;/a>, we explained that thorough evaluation is key to understanding a matching strategy&amp;rsquo;s performance. While evaluation is what allows us to assess the correctness of matching, choosing the best matching strategy is, unfortunately, not as simple as selecting the one that yields the best matches. Instead, these decisions usually depend on weighing multiple factors based on your particular circumstances. This is true not only for metadata matching, but for &lt;a href="https://www.wired.com/2012/04/netflix-prize-costs/" target="_blank">many technical choices&lt;/a> that require navigating trade-offs. In this blog post, the last one in the metadata matching series, we outline a subjective set of criteria we would recommend you consider when making decisions about matching.&lt;/p>
&lt;h2 id="openness">Openness&lt;/h2>
&lt;p>Matching tools come in many different shapes and sizes: web applications, APIs, command-line tools, sometimes even &lt;a href="https://adambuttrick.github.io/mysterious-crystal-ball-matching/" target="_blank">enchanted crystal balls showing matched identifiers emerging from a mysterious mist&lt;/a>! No matter what form they take, an important consideration is whether the source code and all the related resources for the matching are openly available.&lt;/p>
&lt;p>Matching strategies that are either closed-source, or rely on closed-source services for their matching logic, make it difficult to fully understand and explain matching processes. This lack of transparency also makes it impossible to adjust or improve the matching logic, since we cannot understand or improve code we cannot see.&lt;/p>
&lt;p>Users are similarly impeded from identifying flaws or suggesting improvements to processes they are unable to examine. By blocking this community participation, we also lose the proven cycle of real-world testing, refinement, and validation that has strengthened myriad of open source projects. The cumulative impact of both minor and major community-driven refinements over time is incredibly valuable and should not be underestimated.&lt;/p>
&lt;p>Using open source matching will also help build trust in the matching workflows and results. This is one reason why open source is one of the tenets of the &lt;a href="https://openscholarlyinfrastructure.org" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a>, adopted by Crossref, DataCite, ROR, and other organisations who build and maintain open scholarly infrastructure.&lt;/p>
&lt;p>When evaluating matching strategies, we strongly recommend prioritizing those that are fully open source. This not only ensures their transparency and trustworthiness, but also allows for the kind of continuous improvement that results from this visibility and community engagement.&lt;/p>
&lt;h2 id="explainability">Explainability&lt;/h2>
&lt;p>In terms of our ability to understand and improve a matching strategy, using an open source model is only the first step. What typically matters most in the context of building and maintaining matching services is that we are able to understand their underlying code and have a clear model of how matches are derived from their corresponding inputs. Even if the matching code itself and all of the resources used in the matching are open, if they are poorly documented, lack reproducibility or tests, or are otherwise opaque, there is no guarantee that it will be possible to understand or improve the strategy. Striving for a high level of interpretability in our matching plays a determinative role in how well we can understand and modify our strategies in the future.&lt;/p>
&lt;p>Being able to explain the behaviour of the matching will also help you to respond to and incorporate user feedback. When users encounter errors, you will be able to do things like advise them on how to modify or clean their inputs so that the results are better. Conversely, examining the behaviour of the strategy relative to user inputs and feedback can provide you with ideas for improving the matching.&lt;/p>
&lt;p>Typically, heuristic-based strategies, such as those that use forms of search or string similarity measures, like &lt;a href="https://en.wikipedia.org/wiki/Edit_distance" target="_blank">edit distance&lt;/a>, are easier to explain than, say, machine learning models. If a strategy uses machine learning, at least some internal decisions might be made by passing data through a complex network of algebraic equations. Those can be mysterious, non-deterministic, and are famous for being &lt;a href="https://xkcd.com/1838/" target="_blank">hard to interpret&lt;/a>. This doesn&amp;rsquo;t mean they should be avoided entirely - we have built and use many machine-learning based tools ourselves! Instead, it is a good idea to weigh how their inherent lack of explainability could affect your ability to continue work on the strategy and respond to user needs, relative to all the available options.&lt;/p>
&lt;h2 id="complexity">Complexity&lt;/h2>
&lt;p>Complexity is another aspect that can greatly affect how easy it is to maintain the strategy. Complexity is related to how many different components the strategy has and how difficult they are to use and maintain. When a strategy has multiple interconnected parts, each component becomes a potential failure point that requires discrete assessment and maintenance.&lt;/p>
&lt;p>Consider, for example, two different approaches to a matching strategy: one that uses a single machine learning model versus another that uses an ensemble of models. A single model requires maintaining one set of training data, a single training pipeline, and one deployment process. If the model&amp;rsquo;s performance unexpectedly deteriorates, whether because of an issue with the training data, a configuration error, or the need for additional input sanitization, the source of the problem is easier to isolate and fix.&lt;/p>
&lt;p>The ensemble, by contrast, combines multiple, specialized models, each requiring its own training data, tests, updates, and deployments. If one model in the ensemble is found to reduce the performance of the strategy, the interdependence between models can cause this degradation to cascade through the entire system and undermine its overall reliability. Correcting for these errors becomes more challenging. If fixing one model&amp;rsquo;s performance requires retraining or adjusting its outputs, this could require recalibrating the entire ensemble to maintain the balance between models, identify regressions, and prevent new errors from emerging.&lt;/p>
&lt;p>In general, preferring simpler strategies not only reduces operational overhead, but also makes it easier to diagnose issues, test changes, and iterate on user feedback. When problems arise, having fewer moving parts means less places to look for the root cause and fewer components that could be affected by any fixes.&lt;/p>
&lt;h2 id="flexibility">Flexibility&lt;/h2>
&lt;p>The metadata to which we match grows and changes over time. New records are created, existing ones are updated, with schemas changing and evolving alongside. The resources that underlie our matching are also not static. The libraries we depend on may deprecate features between versions or the taxonomies we used to categorize results might undergo significant revisions. We thus rarely have the luxury of deploying a matching strategy once and using it forever without any changes. A good strategy has to be flexible enough to adapt to such changes, with this adaptation also being both technically feasible and practical to implement.&lt;/p>
&lt;p>Much of this flexibility is also determined by a matching strategy&amp;rsquo;s ability to incorporate new data. Strategies that use continuously updated databases or indices can immediately match against new metadata as it appears in the system. By contrast, some machine learning-based approaches require training on target matches and can thus be limited in flexibility and face more constraints. While some models can be incrementally updated to recognize new matches, others require retraining from scratch to incorporate these changes - a process that can be both time-consuming and resource-intensive.&lt;/p>
&lt;p>Paying close attention to a strategy&amp;rsquo;s flexibility and favoring this aspect, when possible, can significantly impact its long-term viability. When comparing different matching strategies, flexibility should thus be a primary concern in your decision-making process.&lt;/p>
&lt;h2 id="resources">Resources&lt;/h2>
&lt;p>Matching strategies can vary significantly in their resource requirements, including things like CPU and GPU utilization, memory consumption, storage capacity, and network bandwidth. These requirements are directly related to infrastructure costs and energy consumption, so when evaluating a matching strategy, it is necessary to assess its resource demands across all phases of the matching lifecycle. This includes things like initial model training, re-training, index construction, updates and management for all aspects of the strategy, as well as the real-world processing of matching requests. It is a good idea to measure and monitor resource usage carefully in considering which strategies to use, as the best performing strategy may also be too resource intensive to run as a service or might grow to this state over time with additional utilization.&lt;/p>
&lt;h2 id="speed">Speed&lt;/h2>
&lt;p>Matching strategies can operate at a wide range of speeds, from milliseconds to minutes per match. Since the overall response time of a strategy can affect both system scalability and user experience, we should always assess the strategy&amp;rsquo;s performance for different usage scenarios and scales of data. While some strategies might perform adequately with small datasets, they can also exhibit exponential slowdowns as data volume and complexity increases or as concurrent requests grow in number. We should therefore consider carefully how requirements for matching speed might evolve with increased usage, data complexity, and total anticipated growth. The fastest matching strategy might not always be the best choice if it comes at the cost of reduced accuracy or requires large amounts of resources, but unacceptable latency can make an otherwise excellent strategy unusable in practice for many use cases.&lt;/p>
&lt;h2 id="putting-it-all-together">Putting it all together&lt;/h2>
&lt;p>The typical life cycle of developing a metadata matching strategy is as follows:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Scoping&lt;/strong>: we define the matching task, along with its inputs and outputs.&lt;/li>
&lt;li>&lt;strong>Research&lt;/strong>: we research what existing strategies are available for our task and/or we develop our own.&lt;/li>
&lt;li>&lt;strong>Evaluation&lt;/strong>: we evaluate all available strategies, internally or externally-developed, exploring all of the aspects described above.&lt;/li>
&lt;li>&lt;strong>Decision&lt;/strong>: we choose which strategy (if any) we want to use in our production system.&lt;/li>
&lt;li>&lt;strong>Production setup&lt;/strong>: we prepare the production models, indexes, and other resources needed for the matching.&lt;/li>
&lt;li>&lt;strong>Maintenance&lt;/strong>: we monitor and adapt the strategy relative to changing data, user feedback, and new resource requirements.&lt;/li>
&lt;/ol>
&lt;p>In practice, these phases do not happen all at once, nor in this strict order. Often we need to proceed through multiple iterations of them to arrive at the best strategy. For example, if initial evaluation of a strategy yields poor results, we might return to the research phase to investigate other strategies or refine our understanding of the task. Often, during the maintenance phase, we receive feedback from users that indicates potential areas of improvement and then pursue them with a new round of research and evaluation.&lt;/p>
&lt;p>As we cycle through these phases, ideally all the aspects described in this entry, along with the results of the evaluation, would be taken into account. Of course, this means that these decisions have to be based on multiple criteria and by making trade-offs between their performance and all other considerations. In making these complex and difficult choices, it is useful to consider two primary questions:&lt;/p>
&lt;ol>
&lt;li>Are any of the considered matching strategies good enough for our use case?&lt;/li>
&lt;li>Out of all the considered strategies that are sufficient for our use case, which would be the best?&lt;/li>
&lt;/ol>
&lt;p>The first question requires us to create clear and quantifiable criteria that allow for eliminating some of the potential strategies. As we have indicated, these could include things like the strategy being open source, minimum performance baselines using measures like precision or recall, and operational thresholds, like the strategy being able to return results quickly, relative to user expectations or the volume of data to be processed. It should be fairly easy to test these requirements and eliminate any strategies that fall short of them. If the strategies are difficult to assess, that is likely a mark against them.&lt;/p>
&lt;p>If no strategies meet these criteria, we have two options: either to abandon matching entirely or to reassess and relax our criteria to align with the available options. While the former is always an option, adopting a more pragmatic lens, framing in terms of potential value (or harm) to the users, might be beneficial. Sometimes we approach matching tasks with too high expectations and a dose of realism helps us to re-center our perspectives. After more consideration, you might decide that your criteria were too stringent or realize that you need to better define and decompose the tasks to fit the available options.&lt;/p>
&lt;p>When multiple strategies appear viable, the selection process becomes more nuanced. When evaluating strategies across these various dimensions, we should try to avoid placing undue weight on minor performance differences. Evaluation metrics are useful estimates of performance, but do not always translate to real-world applications and changing data. In cases where a more complex strategy offers only marginal improvements over a simpler alternative, the maintenance and operational benefits of the simpler solution often outweigh small performance gains.&lt;/p>
&lt;p>This concludes our series on metadata matching, where we described the conceptual, product, and technical aspects of matching and its applications. We hope this overview was instructive and helps you to make better decisions about the use of matching in your own tools and services!&lt;/p></description></item><item><title>A progress update and a renewed commitment to community</title><link>https://www.crossref.org/blog/a-progress-update-and-a-renewed-commitment-to-community/</link><pubDate>Thu, 12 Dec 2024 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/a-progress-update-and-a-renewed-commitment-to-community/</guid><description>&lt;p>Looking back over 2024, we wanted to reflect on where we are in meeting our goals, and report on the progress and plans that affect you - our community of 21,000 organisational members as well as the vast number of research initiatives and scientific bodies that rely on Crossref metadata.&lt;/p>
&lt;p>In this post, we will give an update on our roadmap, including what is completed, underway, and up next, and a bit about what&amp;rsquo;s paused and why. We&amp;rsquo;ll describe how we have been making resourcing and prioritisation decisions, including a revised management structure, and introduce new cross-functional program groups to collectively take the work forward more effectively.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/scale-of-crossref.png"
alt="screenshot from slidedeck titled Scale of Crossref. Contains various stats." width="75%">
&lt;/figure>
&lt;/div>
&lt;p>It’s important to acknowledge that Crossref has evolved significantly from just five years ago - our member count has more than doubled from 10,000 to 21,000 organisations since 2019 and they include all kinds of organisations such as funders, universities, government bodies, NGOs, and of course scholar- and library-led publishers. The smaller organisations now collectively contribute the majority of Crossref funding. We’ve gone from 100 million records to 160 million in five years, and our metadata is retrieved more than 2 billion times monthly, quadrupling what it was five years ago.&lt;/p>
&lt;p>It’s within this context that we’ve spent quite a lot of time thinking about scalability, how we collect and process feedback and contributions from many organisations, how to automate our operations, and refining the plans for the next few years.&lt;/p>
&lt;h2 id="our-strategic-agenda-remains-the-same">Our strategic agenda remains the same&lt;/h2>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/strategic-themes-programs-landscape-slide.png"
alt="screenshot from Strategy page showing Crossref strategic themes." width="75%">
&lt;/figure>
&lt;/div>
&lt;p>A few times a year we update the &lt;a href="https://www.crossref.org/strategy">strategy page&lt;/a> where there is a quadrant of projects showing what’s completed, in progress, up next, and in planning/ideas - for each strategic theme. We also link from there to our live &lt;a href="https://roadmap.productboard.com/e6fdeba8-a5b3-4aef-8104-d48863ba975e" target="_blank">public roadmap&lt;/a> which shows more specifics about individual projects, including projected timelines, and is updated more frequently.&lt;/p>
&lt;p>If you’ve been watching the strategy page, checking in on the public roadmap or this blog, or joining &lt;a href="https://www.crossref.org/events">webinars and annual meetings&lt;/a>, you’ll know that we’ve had some longstanding plans to—among other things—reduce technical debt, rebuild our metadata management system, move to the cloud, modernise our schema, support multiple languages, and partner with multiple data sources to build the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>.&lt;/p>
&lt;p>You’ve heard us talk about these initiatives a lot, but you&amp;rsquo;ve not seen particularly swift action.&lt;/p>
&lt;h2 id="moving-the-work-forward-more-effectively">Moving the work forward more effectively&lt;/h2>
&lt;p>Earlier this year, it became clear that our almost three-year project to build a new relationships API had not worked out. The project, dubbed ‘manifold’, was to initially deliver data citations, and eventually replace our central metadata system, but what was prototyped didn’t scale, even with a subset of our metadata. We weren’t confident enough about the project’s timeline or costs to justifiably continue investing further time and resources.&lt;/p>
&lt;p>Meanwhile, we’d barely scratched the surface of our aim to pay down technical and operational debt, and we’d also been neglecting to keep the live system up to date with the numerous metadata changes that have been queued up, waiting to be implemented.&lt;/p>
&lt;p>We knew the manifold project was ambitious – our system has grown in complexity over the years. We were trying to rebuild the car while driving it (our system needed to continue to operate and be maintained by our team) while trying to design a new approach to manage the many relationships between 160+ million database records. In the years we worked on this project, we learned a lot that will inform future plans for a large system redesign.&lt;/p>
&lt;p>In March this year, we decided to pause the manifold project. We apologised to our community partners for not delivering the promised data&amp;lt;-&amp;gt;literature matches they hoped to use. They were frustrated but thankfully understanding.&lt;/p>
&lt;p>We then resolved to focus on backend infrastructural changes, conduct cross-training so that all of our staff would become familiar with current in-use systems instead of greenfield tech (for now), and start to make a dent in the backlog of bugs and long-promised schema updates in our mainstream services.&lt;/p>
&lt;p>We’re happy to report some movement on these things and some milestones that have been achieved in these areas in recent months.&lt;/p>
&lt;h2 id="fostering-a-happy-and-dedicated-team">Fostering a happy and dedicated team&lt;/h2>
&lt;p>Any kind of work can only happen when our staff are in a good place, feeling supported and comfortable to question things, and well-equipped with information, purpose, and clear priorities. In June, when the whole staff met up in person, we had some really good conversations about culture, communication, and about sharing responsibilities. Some people ran birds-of-a-feather sessions to explore the issues that had been keeping them up at night, such as authentication/security, and rebuilding the Crossref System (CS), and the team also co-created a set of prioritisation drivers that are now in use within our roadmap and planning processes.&lt;/p>
&lt;p>Taking on feedback from the all-staff meeting and then the July board meeting, we thought strategically about the organisational structure Crossref would need over the next few years to reflect the growth in scope and size, and fulfil its longer term goals. We have long had an ambitious agenda but realised we didn’t yet have the capacity to do it all. So we came to the conclusion that we needed an updated team and management structure to take us through the next phase of our development.&lt;/p>
&lt;p>The structural changes were concluded at the end of November. They included:&lt;/p>
&lt;ul>
&lt;li>Moving Technology under Operations, since Technology&amp;mdash;though a vital enabler&amp;mdash;still works in service to our mission and in support of our community, just like other operational things like board governance and finance.&lt;/li>
&lt;li>Reframing product development as Programs and Services, and reducing our workstreams from five product portfolios to three programs. We formed cross-team steering groups around clearly articulated program areas (more on those below).&lt;/li>
&lt;li>Broadening the leadership to include an Executive team and an extended Director team, and forming a Senior Management Team (SMT). These changes ensure that the collective responsibility for Crossref now rests on a wider group of experts who can back each other up and share the risk and the knowledge, rather than on just a few individuals.&lt;/li>
&lt;li>We started recruiting for directors for two new leadership positions. We’ll welcome a new Director of Programs and Services and a new Director of Technology in the new year.&lt;/li>
&lt;li>Evolving the strategic initiatives team into a data science team, integrating research &amp;amp; development functions throughout all teams and with the SMT taking collective responsibility for strategic initiatives.&lt;/li>
&lt;/ul>
&lt;p>Unfortunately, with the shift in approach for product development and by sharing responsibility for strategic initiatives and research among the wider team, we made the difficult decision that four positions would no longer work within the new structure.&lt;/p>
&lt;h2 id="a-new-approach-joined-up-initiatives-and-cross-functional-programs">A new approach: joined-up initiatives and cross-functional programs&lt;/h2>
&lt;p>Research has always been an important role for Crossref, but as this function had been annexed from our regular work, it became hard to coordinate strategic initiatives across the wider organisation. In recent years we inadvertently created more technical debt for ourselves, i.e., built multiple prototype tools without plans for adoption or moving them into production. Strategic initiatives, by their nature, need thorough research and high-level alignment, so we made such initiatives—things like &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">Resourcing Crossref for Future Sustainability (RCFS)&lt;/a> and improving the &lt;a href="https://www.crossref.org/community/special-programs/research-integrity">Integrity of the Scholarly record (ISR)&lt;/a>—the responsibility of the whole senior management team.&lt;/p>
&lt;p>Some useful research had been conducted, but we were never in a position to act on any of it. Particularly promising work has been in the field of &lt;a href="https://www.crossref.org/categories/metadata-matching">metadata matching&lt;/a>, and with the growth in the community reliance on our metadata, and attention on data quality rightly increasing, we decided to create a new data science team to be dedicated to this work, led by Dominika Tkaczyk.&lt;/p>
&lt;p>We had also struggled with a traditional product management approach since all our tools and activities are interconnected, and we found we were trying to do too many things at once but not all of them very effectively. We also acknowledged that product management comes from the commercial e.g. retail world and therefore is designed to help companies sell/upsell, which is not our goal. So we looked to other approaches more suitable to mission-based nonprofits.&lt;/p>
&lt;h3 id="introducing-three-programs">Introducing three programs&lt;/h3>
&lt;p>We have introduced cross-functional program management in order to work towards the following:&lt;/p>
&lt;ul>
&lt;li>better cross-team alignment&lt;/li>
&lt;li>shared responsibility&lt;/li>
&lt;li>improve communication and learning&lt;/li>
&lt;li>make more progress on the things members need.&lt;/li>
&lt;/ul>
&lt;p>Supporting the strategic theme of co-creation, a new program, facilitated by Program Lead Lena Stoll, now manages and oversees all activities around &lt;strong>co-creation and community trends&lt;/strong>. A cross-team steering group just began meeting regularly and will be responsible for interfaces such as reports/dashboards, record registration interfaces, connections and collaborations such as &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a>, &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a>, &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a> auto-update, as well as &lt;a href="https://pkp.sfu.ca/ojs/" target="_blank">OJS&lt;/a> and other partner integrations. This program also includes the Crossref website and any front-end things to support other programs. And it includes ISR (the integrity of the scholarly record) and our tools in this area such as Crossmark and retraction/correction tooling, and Similarity Check for text comparisons.&lt;/p>
&lt;p>Supporting the strategic theme of complete and global metadata and relationships, a new program, facilitated by Program Lead Martyn Rittman, now manages and oversees all activities relating to &lt;strong>contributing to the Research Nexus&lt;/strong>. Working particularly closely with the metadata team, led by Patricia Feeney, this program addresses how metadata is modelled, used, enriched, and extended. Work includes our APIs, incorporating external data sources like &lt;a href="https://retractionwatch.com/" target="_blank">Retraction Watch&lt;/a> and &lt;a href="https://www.crossref.org/services/event-data">Event Data&lt;/a>, building out metadata matching services with the new data science team, supporting the community of metadata users with API sprints and more modern options for retrieving metadata based on usage and need.&lt;/p>
&lt;p>Supporting the strategic theme of open and sustainable operations and keeping to the POSI framework, a new program, facilitated by Program Lead Sara Bowman, now manages and oversees all activities relating to &lt;strong>making our operations more open, transparent, and sustainable&lt;/strong>. This program focuses on supporting and strengthening the core functions our members rely on and enabling future growth. It includes metadata deposit and processing, most apps for e.g. managing titles, authentication, and architectural and infrastructural projects like moving from the data centre to the AWS cloud service. This program also includes modernising our operations in general, which is not just technology but also finance and human resources, so projects like membership process automation, fee modelling and financial analyses, and business system integrations.&lt;/p>
&lt;p>The Programs will start to be reflected across our website and in our communications from next year.&lt;/p>
&lt;h2 id="what-are-crossrefs-new-prioritisation-drivers">What are Crossref&amp;rsquo;s new prioritisation drivers?&lt;/h2>
&lt;p>These are the drivers that our ~40 staff co-created in June that are guiding decisions about the priorities on our roadmap. New ideas will be evaluated in the following areas:&lt;/p>
&lt;ol>
&lt;li>Encourage participation from new or under-represented communities&lt;/li>
&lt;li>Respond to and lead trends in scholarly communications&lt;/li>
&lt;li>Benefit the greatest number of members and users&lt;/li>
&lt;li>Reflect on how the community works with each other and allow members to self-serve&lt;/li>
&lt;li>Expand to support and connect relevant resource types and metadata fields&lt;/li>
&lt;li>Make it easier to create and update metadata&lt;/li>
&lt;li>Enhance metadata for completeness and accuracy&lt;/li>
&lt;li>Make it easier to retrieve and use metadata&lt;/li>
&lt;li>Automate repetitive/manual tasks&lt;/li>
&lt;li>Address technical and operational debt&lt;/li>
&lt;li>Maintain critical systems and operations and ensure their security&lt;/li>
&lt;li>Control or reduce costs - to Crossref, our community, or the environment&lt;/li>
&lt;/ol>
&lt;p>We’re happy to report that the changes made this year have resulted in a productive last few months of the year. As reported in our &lt;a href="https://doi.org/10.64000/zg6c6-pab71" target="_blank">annual meeting&lt;/a>, here is the progress update.&lt;/p>
&lt;h3 id="whats-paused">What’s paused&lt;/h3>
&lt;ul>
&lt;li>A relationships API endpoint and, therefore, a specific data citation feed&lt;/li>
&lt;li>Manifold, the three-year effort to modernise our tech stack&lt;/li>
&lt;li>Most of the strategic initiatives prototypes that can’t yet be scaled, such as Labs API and Labs reports&lt;/li>
&lt;/ul>
&lt;h3 id="whats-recently-completed">What’s recently completed&lt;/h3>
&lt;ul>
&lt;li>We succeeded in moving the entire Crossref corpus to an open-source database, PostgreSQL&lt;/li>
&lt;li>Fixed numerous REST API data quality issues and lots of troublesome bugs&lt;/li>
&lt;li>Schema development - support for ROR as a Funder identifier is live and currently in testing&lt;/li>
&lt;li>We automated some very manual membership and billing processes, saving hundreds of staff hours a year&lt;/li>
&lt;li>Released a new form for journal article record registration, building on the grant registration form&lt;/li>
&lt;li>Upgraded &lt;a href="https://doi.org/10.64000/wxjpp-20570" target="_blank">Participation Reports&lt;/a> to include Affiliations and ROR IDs&lt;/li>
&lt;li>Launched a new &lt;a href="https://www.crossref.org/learning/">API Learning Hub&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Since the rest of the community stops for no Crossref product roadmap issue, we also progressed a number of community and governance initiatives:&lt;/p>
&lt;ul>
&lt;li>The &lt;a href="https://www.crossref.org/services/grant-linking-system/">Grant Linking System (GLS)&lt;/a> reached 5 years with over 40 funders joining Crossref and registering over 130,000 grants and awards, including use of facilities and projects&lt;/li>
&lt;li>Our research for Resourcing Crossref for Future Sustainability (RCFS) with the Membership &amp;amp; Fees Committee is going well, and we’ll have new fee proposals for review in 2025&lt;/li>
&lt;li>The integrity of the Scholarly Record (ISR) conversations have deepened, and we’ve formed strong relationships with editorial experts and research integrity sleuths, who are getting up to speed on our metadata, and we’re working with some sleuthing consultants to change our processes to handle deceptive member behaviour such as paper mills, cloned journals, and citation manipulation. The new data science team plays a role here, along with membership and governance.&lt;/li>
&lt;/ul>
&lt;h3 id="whats-currently-in-focus">What’s currently in focus&lt;/h3>
&lt;p>In our efforts to do less but do it more effectively, we have two current priorities:&lt;/p>
&lt;ol>
&lt;li>Get out of the physical data centre and into the cloud.&lt;/li>
&lt;li>Develop &lt;a href="https://doi.org/10.64000/cmnhc-fy462" target="_blank">Schema 5.4&lt;/a>.&lt;/li>
&lt;/ol>
&lt;p>These two projects are underway, involving lots of communication and learning. Since we haven’t released any schema updates in many years, all our staff are learning for the first time how a metadata schema model is interpreted in a systemic way, learning about the structure of research objects, and honing the process as they go. We’ve high hopes we’ll be in a position to release continuous metadata schema versions and catch up on the backlog over the coming years.&lt;/p>
&lt;h3 id="whats-next">What’s next&lt;/h3>
&lt;ul>
&lt;li>Continuous metadata development, with contributor roles up next&lt;/li>
&lt;li>Retraction Watch data integrated into the REST API so users have a single source of retraction/correction data&lt;/li>
&lt;li>Upgraded preprint matching and notifications&lt;/li>
&lt;li>Modelling more equitable fees through the RCFS projects&lt;/li>
&lt;li>Piloting a non-voting membership category&lt;/li>
&lt;/ul>
&lt;p>Once we’re fully in the cloud and in the groove of metadata updates, and with the support of newly-hired technology and program directors joining in the new year, we’ll turn our attention to rebuilding the central metadata system that we call the Crossref System, or “CS” and report more on this next year.&lt;/p>
&lt;p>So that was our summary of 2024 and an indication of what’s coming in 2025 and beyond; sorry it’s so long, and thanks for reading this far! Next year we’ll get back to more regular updates as the strategic agenda and the programs progress.&lt;/p></description></item><item><title>A summary of our Annual Meeting</title><link>https://www.crossref.org/blog/a-summary-of-our-annual-meeting/</link><pubDate>Mon, 09 Dec 2024 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/a-summary-of-our-annual-meeting/</guid><description>&lt;p>The Crossref2024 annual meeting gathered our community for a packed agenda of updates, demos, and lively discussions on advancing our shared goals. The day was filled with insights and energy, from practical demos of Crossref’s latest API features to community reflections on the Research Nexus initiative and the Board elections.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/2024/2025-board.png" alt="graphic with headshots of panelists" style="float: left; margin: 0 15px 15px 0; width: 50%;">
&lt;p>
Our Board elections are always the focal point of the Annual Meeting. We want to start reflecting on the day by congratulating our newly elected board members: Katharina Rieck from Austrian Science Fund (FWF), Lisa Schiff from California Digital Library, Aaron Wood from American Psychological Association, and Amanda Ward from Taylor and Francis, who will officially join (and re-join) in January 2025. Their diverse expertise and perspectives will undoubtedly bring fresh insights to Crossref’s ongoing mission.&lt;div>
&lt;/div>
&lt;p>The meeting started with a recap of our mission and priorities. Ed Pentz reiterated the Research Nexus vision of increasing transparency of the connections that make up the scholarly record and underpin the research ecosystem.&lt;/p>
&lt;p>Crossref is dedicated to openness, community ownership, and a stable, accessible infrastructure that researchers, publishers, funders, and institutions can rely on for the long term. This is demonstrated by Crossref’s commitment to the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">the Principles of Open Scholarly Infrastructure (POSI)&lt;/a>, which constitute commitments to building a resilient and transparent infrastructure for research—sustainability, community governance, and openness. Ed emphasized how Crossref is aligning with these principles and collaborates with other adopters to reflect and continuously align these with the needs of the scholarly community, with a public consultation on proposed revisions to POSI forthcoming next year.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/VBnfkOxVr6s?si=ebg6NvNDb7hiGdPe&amp;amp;t=80" target="_blank">Ginny Hendricks highlighted key membership and metadata trends&lt;/a>. She noted that as of 2024, half of Crossref members are based in Asia. This year, as always in recent years, we saw many new organisations from Indonesia, Turkey, India, and Brazil join us. Removing those fast-growing countries for the chart’s clarity, we can see that some of the next most active countries are Pakistan, Mexico, Spain, Bangladesh, and Ecuador, among others.&lt;/p>
&lt;p>There are now ~163 million open metadata records with Crossref DOIs, and Ginny pointed out increases in the registration of preprints, peer-review reports, and grants. In terms of metadata elements, it&amp;rsquo;s good to see that more publishers recognize the importance of including abstracts and ROR IDs in their metadata records. Also, in line with the community’s concerns about integrity, our members have been enriching their records with direct assertions of retractions.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/scale-of-crossref.png"
alt="screenshot from slidedeck titled Scale of Crossref. Contains various stats" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Then, Ginny went on to report on the progress towards our &lt;a href="https://www.crossref.org/strategy/">strategic goals&lt;/a>:&lt;/p>
&lt;ul>
&lt;li>Contribute to an environment where the community identifies and co-creates solutions for broad benefit&lt;/li>
&lt;li>A sustainable source of complete, open, and global scholarly metadata and relationships&lt;/li>
&lt;li>Manage Crossref openly and sustainably, modernizing and making transparent all operations so that we are accountable to the communities that govern us.&lt;/li>
&lt;li>Foster a strong team because reliable infrastructure needs committed people who contribute to and realize the vision and thrive in doing it.&lt;/li>
&lt;/ul>
&lt;h3 id="demos">Demos&lt;/h3>
&lt;p>&lt;a href="https://youtu.be/VBnfkOxVr6s?si=yVVxcwPCRYJL5JWd&amp;amp;t=1916" target="_blank">Lena Stoll and Patrick Vale’s session&lt;/a> gave members a practical preview of our latest tools.&lt;/p>
&lt;p>Patrick started by reflecting on the challenge of making our identifiers useful for people using screen readers (and other assistive technologies). He thanked all who responded to our past consultation on the topic and presented the Crossref DOI Accessibility Enhancer – the browser plug-in initially available for Firefox (and soon also for Chrome). He shared the &lt;a href="https://gitlab.com/crossref/doi-accessibility-enhancer" target="_blank">Gitlab repo&lt;/a> for anyone interested in trying it and invited feedback as we’re hoping to iterate on this.&lt;/p>
&lt;p>Patrick then went on to talk about our openness to community contributions to Crossref tools, with an example of the recent contribution from CWTS Leiden to our &lt;a href="https://crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a>. Thanks to their work, our members can now see the proportion of works they’ve registered that include affiliation information and ROR IDs, alongside the previously available key metadata such as references, abstracts, ORCID iDs, funding information, or Crossmark.&lt;/p>
&lt;p>Finally, Lena demonstrated the latest extension of our record management tool that’s just been made available to make manual registration of metadata records for journal articles easier. &lt;a href="https://manage.crossref.org/records" target="_blank">The new form&lt;/a> is flexible and driven by our metadata schema. Importantly for our members, it simplifies the workflow with input validations and automated ISSN matching, and it enables members to register author affiliations with an integrated ROR look-up. We hope this will support our smaller members, who are relying on our helper tools to register their content.&lt;/p>
&lt;p>Throughout the session, members were encouraged to use these tools and explore new resources available through Crossref. We believe that by taking advantage of these resources, you can enhance your research and publishing experience, and contribute to the growth and development of the scholarly community.&lt;/p>
&lt;h3 id="the-discussion-about-open-scholarly-infrastructure">The discussion about open scholarly infrastructure&lt;/h3>
&lt;p>The panel on open scholarly infrastructure brought together experts with a wide range of experience in the field. Moderated by Lucy Ofiesh, Crossref’s Chief Operating Officer, the discussion featured six invited speakers who shared their insights on the opportunities and challenges facing the scholarly ecosystem: Ed Pentz, Crossref; Sarah Lippincott, Dryad; Amélie Church, Sorbonne University; Joanna Ball, DOAJ; Ann Li, Airiti; and Richard Bruce Lamptey, Kwame Nkrumah University of Science and Technology.&lt;/p>
&lt;p>The panel talked about what openness in scholarly infrastructure means, why it’s important, its sustainability, and how to tackle challenges and gaps across the ecosystem. They highlighted frameworks like the Principles of Open Scholarly Infrastructure (POSI), the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration&lt;/a>, and the &lt;a href="https://doi.org/10.5281/zenodo.6557302" target="_blank">FOREST Framework&lt;/a> as key tools for guiding work on governance, sustainability, and equity. The discussion highlighted the need for more collaboration, inclusivity, and practical ways to ensure open infrastructure remains sustainable in the long run.&lt;/p>
&lt;p>They also stressed how openness supports research integrity. How transparent systems allow researchers to question methods, verify findings, and preserve data. Amelie Church expanded on this point, underscoring the important role of open infrastructure in addressing challenges to integrity. She explained that such transparency enables the scholarly community to scrutinize research processes, ensuring the quality of outputs and their impact on society. Without openness, researchers face barriers to maintaining trust in their work, making open infrastructure necessary for research integrity and public confidence in science.&lt;/p>
&lt;blockquote>
&lt;p>“By focusing on accessibility, transparency, and community engagement, open infrastructure can reshape academic and research ecosystems in transformative ways.” ~Richard Bruce Lamptey&lt;/p>
&lt;/blockquote>
&lt;p>Regarding sustainability, Sarah Lippincott stressed the importance of aligning funding models with community needs while addressing governance challenges. She pointed out that while initial funding can launch infrastructure, long-term sustainability requires consistent community investment and robust governance frameworks. This balance, she explained, is essential to ensure equity and transparency.&lt;/p>
&lt;p>Collaboration was another important topic. Joanna Ball and Sarah Lippincott shared examples of how pooling expertise and resources—such as in the global support for ROR—can strengthen systems and make them more sustainable. These initiatives show the power of collective efforts in addressing technical and resource barriers. However, inclusivity remains an ongoing challenge.&lt;/p>
&lt;p>The panel discussed the ways in which language barriers, resource limitations, and reliance on proprietary systems continue to exclude researchers from underrepresented regions. Ann Li highlighted how addressing these disparities is critical to ensuring the global accessibility of open infrastructure. By fostering inclusive practices, the scholarly community can mitigate biases and build tools that reflect a broader range of research contributions.&lt;/p>
&lt;blockquote>
&lt;p>”My hope is that open infrastructure can have the resources that it needs to thrive, not just merely survive, and also that open infrastructure communities and organisations look to the value of frameworks that we&amp;rsquo;ve talked about today to help align themselves and improve their policies and practices, because there&amp;rsquo;s always room for growth, even in the best, most well-intentioned communities.” ~Sarah Lippincott, Dryad&lt;/p>
&lt;/blockquote>
&lt;p>The panel wrapped up the discussion by expressing optimism for the future of open scholarly infrastructure and emphasized the importance of continued investment, collaboration across organisations, and transparency in operations. The discussion reinforced the idea that open infrastructure provides a strong foundation for research that is equitable, sustainable, and accessible to all.&lt;/p>
&lt;h3 id="updates-from-our-community">Updates from our Community&lt;/h3>
&lt;p>We enjoyed talks from our community about increasing their participation in the Research Nexus by adopting, using and enhancing metadata in different ways. Robbykha Rosalien hosted talks from the EuropePMC, Dutch Research Council, eLife, and CSIRO featured in Session I, and Amanda French hosted CLOCKSS, Sciety, and Redalyc in Session II.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/VBnfkOxVr6s?si=1VV79KhplCHsWXNt&amp;amp;t=3701" target="_blank">Michael Parkin talked about preprints in Europe PMC&lt;/a>. Europe PMC is a database for life science literature and a platform for content-based innovation. They started indexing preprints via Crossref REST API in 2018. Michael presented their work on discoverability of preprints in their database, including reflections on early challenges, as well as the latest efforts in surfacing available community reviews.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/VBnfkOxVr6s?si=euFBcIGYp1UEDrHz&amp;amp;t=4169" target="_blank">Hans de Jonge talked about the Dutch Research Council&amp;rsquo;s (NWO)&lt;/a> dedication to open science, with policies ensuring that publications and data funded by NWO are openly available. They embrace open science principles for their own metadata and is a signatory of the Barcelona Declaration on Open Research Information. Hans focused on NWO&amp;rsquo;s recent introduction of Grant IDs through Crossref’s Grant Linking System (GLS). He shared their approach, the motivations behind introducing Grant IDs, and some challenges they faced.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/VBnfkOxVr6s?si=eNYAyjvIlX0OkCBJ&amp;amp;t=5002" target="_blank">Frederick Atherden explained how eLife&lt;/a>, a nonprofit led by scientists, use Crossref’s Grant Linking System to include grant DOIs in their publication metadata. It allows authors to add grant DOIs during submission, and they developed a tool to match grant numbers with DOIs during the proofing process to improve accuracy. Their goal is to follow best practices for metadata, making content easier to find and link to.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/VBnfkOxVr6s?si=1haopH2ahnb-xllw&amp;amp;t=5522" target="_blank">Brietta Pike covered how CSIRO&lt;/a> is working to improve metadata quality for its journals, making research more discoverable and trustworthy. CSIRO faced challenges like inconsistent XML tagging, outdated systems, and data loss. To address these, they formed a project team, created a clear XML stylesheet, and updated their workflows. Recent progress includes better funding data, clearer license information, and more complete affiliation tagging. These efforts aim to support a more transparent and accessible research environment.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/5ZI8idIDL_A?si=5FaVcSbwCfyo_OOX&amp;amp;t=9150" target="_blank">Alicia Wise of CLOCKSS&lt;/a> talked about recent collaborations seeking to safeguard our cultural and scholarly heritage over the long term. CLOCKSS, a community-run archive, is dedicated to preserving scholarly content to remain accessible and unchanged for future generations. True preservation requires securely storing content in trusted archives that are actively maintained. A group of librarians and publishers developed a guide to help publishers preserve content, they also established an archival standard for EPUB formats to ensure ebooks can be stored effectively, and launched a pilot project to track preserved books, helping libraries and scholars identify safely stored titles.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/5ZI8idIDL_A?si=0fLneFHGEaSsnSzC&amp;amp;t=10082" target="_blank">Mark Williams from Sciety&lt;/a> talked about how Sciety uses Crossref metadata to create detailed preprint histories. By partnering with organisations and communities worldwide, Sciety platform gathers public reviews, highlights, and recommendations on preprinted research, helping researchers evaluate the quality and relevance of new studies. Through linking related preprints and journal articles, Sciety builds a connected view of each research work. Although challenges like inconsistent terminology and identifier gaps persist, these efforts enhance the visibility and credibility of preprints.&lt;/p>
&lt;p>&lt;a href="https://youtu.be/5ZI8idIDL_A?si=93KJA-36wgJ3Apg2&amp;amp;t=10708" target="_blank">Arianna Becerril-García of AmeliCA/Redalyc&lt;/a> shared insights on diamond open-access journals in Latin America. Redalyc is an open-access infrastructure that supports journals by providing free services like visibility and production tools. Redalyc has a role in sustaining Latin America’s unique approach to open-access publishing, where most journals are backed by academic institutions and public funds, allowing free access for both readers and authors. Arianna stressed the need to treat these journals as digital public goods and urged the communities they serve to help ensure their long-term sustainability. Despite limited resources and global under-recognition, these journals serve an international research audience, including authors from Europe, Africa, and Asia. Redalyc and other open infrastructures play a key role by offering tools that reduce production co-sts and improve discoverability, all without financial barriers. Noted was how this approach aligns with UNESCO’s open science framework, which promotes inclusivity and addresses long-standing inequalities in scholarly publishing.&lt;/p>
&lt;h3 id="afternoon-of-more-resources-and-updates-from-crossref">Afternoon of more resources and updates from Crossref&lt;/h3>
&lt;p>After a mid-day break (in Europe), &lt;a href="https://www.youtube.com/watch?v=5ZI8idIDL_A&amp;amp;t=98s" target="_blank">Luis Montilla kicked off the second session with a practical tutorial of Crossref’s REST API&lt;/a>. Following his last year’s &lt;a href="https://doi.org/10.13003/h3yygefpyf" target="_blank">intro to the Crossref API&lt;/a>, this time he offered a step-by-step guide to help attendees maximize the API’s capabilities for metadata retrieval with advice on:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Managing large data requests with pagination and iterations&lt;/strong>&lt;/li>
&lt;li>&lt;strong>Incorporating safety mechanisms&lt;/strong> - to avoid hitting rate limits, Luis recommended adding pauses between requests and sharing example scripts to streamline this.&lt;/li>
&lt;/ul>
&lt;p>For those interested in learning more, look at the new Crossref &lt;a href="https://www.crossref.org/learning/">API Learning Hub&lt;/a>— a new resource offering guides, scripts, and training materials to simplify complex queries. Please share questions about things you&amp;rsquo;re not sure about in our &lt;a href="https://community.crossref.org/c/metadata-retrieval/27" target="_blank">community forum&lt;/a>, to help guide development of future demos.&lt;/p>
&lt;p>&lt;a href="https://www.youtube.com/watch?v=5ZI8idIDL_A&amp;amp;t=1508s" target="_blank">Patricia Feeney followed with updates on metadata schema changes&lt;/a>. She introduced our recent shift to integrate the Funder Registry with ROR, which allows members to use a single identifier system, simplifying data management by reducing redundancy. Patricia explained that, for now, the current identifiers remain valid, so members won’t need to make immediate changes. She also outlined planned support for version metadata, typed citations, and future plans to expand support for contributor role vocabularies, and invited community participation in a planned multilingual metadata working group.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/schema-5.4.0-graphic.png"
alt="screenshot of a slide titled - in progress schema 5.4.0" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Next, &lt;a href="https://www.youtube.com/watch?v=5ZI8idIDL_A&amp;amp;t=3370s" target="_blank">Kora Korzec offered an update on the progress in our research on Resourcing Crossref for Future Sustainability&lt;/a> and opened up a discussion about the best ways of assessing our members’ size and ability to pay. In light of our ambition to streamline discounts, we also invited suggestions for discounts to support accessibility and fuller participation in the Research Nexus.&lt;/p>
&lt;p>As part of the discussion, we’ve learned who was in attendance during the session:&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/answers-to-poll-questions.png"
alt="Responses to the poll question: If you are a Crossref member, which fee tier is your organisation? 20 of 45 responses selected the &amp;lt;1mln USD, 4 out of 45 selected 5-10mln USD, &amp;gt;100mln USD and something else - we&amp;#39;re a funder member; 11 selected Not applicable option" width="50%">
&lt;/figure>
&lt;/div>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/answers-to-poll-questions2.png"
alt="Responses to the poll question: Is publishing scholarly content the primary activity of your organisation? 21 out of 53 said Yes, 31 said No, and 1 was not sure" width="50%">
&lt;/figure>
&lt;/div>
&lt;p>We’ve heard a lot of support for our current &lt;a href="https://www.crossref.org/gem/">GEM program&lt;/a>. While it was clear from our poll that publishing revenue is not the most relevant measure of size or capacity for all those present – establishing a good alternative proved challenging. The idea of considering the size of the organisation as its largest entity has been discussed, and important points were raised about budgets in different types of distributed organisations (e.g., on the position of libraries within large universities).&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/member-stats-new-members-per-year.png"
alt="screenshot of a slide titled Memebership Stats: &amp;gt;2000 new members per year - line graph illustrating increases in the number of Crossref mebmres for each year from 2001 until 2024" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>The official Annual Meeting part commenced after the discussion, with a report on the State of Crossref from Lucy Ofiesh, and commenced with our Board election. &lt;a href="https://youtu.be/5ZI8idIDL_A?si=UHj-O3PGG58AyQxF&amp;amp;t=6396" target="_blank">Lucy highlighted some of the key accomplishments of the year so far&lt;/a>, including:&lt;/p>
&lt;ul>
&lt;li>Research for Resourcing Crossref for Future Sustainability (RCFS)&lt;/li>
&lt;li>Integrity of the Scholarly Record (ISR)&lt;/li>
&lt;li>Grant Linking System (GLS) reached 5 years&lt;/li>
&lt;li>Automated some very manual membership processes&lt;/li>
&lt;li>Released new form for journal article record registration&lt;/li>
&lt;li>Upgraded Participation Reports to include Affiliations and ROR IDs&lt;/li>
&lt;li>Launched a new API Learning Hub&lt;/li>
&lt;li>Paused further development of a Relationships API&lt;/li>
&lt;li>Migrated to a new open-source database&lt;/li>
&lt;li>Schema development - ROR as Funder identifiers&lt;/li>
&lt;li>REST API bug fixes and metadata consistency fixes.&lt;/li>
&lt;/ul>
&lt;p>Then she reflected on the membership growth––Crossref is now made up of 21,000 organisations from 160 countries. We reviewed our 2024 year-end financial forecast. As we’re bouncing back from COVID-19, our travel expenses have grown this year, and so have the fees for cloud services hosting. These are all as planned and happen in the context of healthy growth, including that from adoption and increased usage of paid services. We’re in a healthy financial position as membership revenue and usage fees, like content registration and Similarity Check document checking fees, continue to grow from the previous year.&lt;/p>
&lt;p>Thank you to everyone who joined us for Crossref2024. This year&amp;rsquo;s meeting showcased our collective dedication to advancing open, accessible research infrastructure and underscored the power of collaboration in building a stronger scholarly community. As we reflect on the rich discussions and insights shared during the event, it’s clear our community is committed to advancing open and sustainable scholarly infrastructure.&lt;/p>
&lt;p>Looking ahead, we’ll continue collaborating with members and partners to tackle challenges, expand accessibility, and foster collaboration. A key focus will be enhancing tools and metadata standards to serve the community better. Through innovative solutions and strategic initiatives like the Research Nexus, our collective efforts will make research more connected and accessible for all.&lt;/p>
&lt;p>For anyone who couldn’t attend live, &lt;a href="https://www.crossref.org/crossref-annual-meeting" target="_blank">recordings are now available on our website&lt;/a>. We’re excited to see how the ideas exchanged during this meeting spark progress across the scholarly ecosystem in the coming months.&lt;/p></description></item><item><title>2024 POSI audit</title><link>https://www.crossref.org/blog/2024-posi-audit/</link><pubDate>Sat, 07 Dec 2024 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2024-posi-audit/</guid><description>&lt;h2 id="background">Background&lt;/h2>
&lt;p>The &lt;a href="https://doi.org/10.24343/C34W2H" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a> provides a set of guidelines for operating open infrastructure in service to the scholarly community. It sets out 16 points to ensure that the infrastructure on which the scholarly and research communities rely is openly governed, sustainable, and replicable. Each POSI adopter regularly reviews progress, conducts periodic audits, and self-reports how they’re working towards each of the principles.&lt;/p>
&lt;p>In 2020, Crossref’s board &lt;a href="https://doi.org/10.64000/hzemx-j7n79" target="_blank">voted&lt;/a> to adopt the Principles of Open Scholarly Infrastructure, and we completed our first self-audit. We published our next review in &lt;a href="https://doi.org/10.64000/1a8fc-3jq97" target="_blank">2022&lt;/a>.&lt;/p>
&lt;p>The POSI adopters have continued to review the principles, &lt;a href="https://doi.org/10.54900/n6az7-4xb07" target="_blank">reflecting&lt;/a> on the effects of adopting them and providing a &lt;a href="https://openscholarlyinfrastructure.org/posi-v1.1-revisions.pdf" target="_blank">revision to the principles in late 2023&lt;/a>. We use the revised principles for this latest review.&lt;/p>
&lt;h2 id="key">Key&lt;/h2>
&lt;p>We use a traffic light system to indicate where we believe we stand against each of the 16 principles. Now with up/down arrows to show any significant movement, and an &amp;lsquo;i&amp;rsquo; where there is something of note with narrative.&lt;/p>
&lt;p>&lt;i class='fa fas fa-circle font-small font-crossref-red'>&lt;/i>
red indicates we are not fulfilling the principle. &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
yellow indicates we are making progress towards meeting the principle. &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
green indicates we are fulfilling the principle. &lt;br>
&lt;i class='fa fas fa-circle-arrow-up font-small font-crossref-yellow'>&lt;/i>
or &lt;i class='fa fas fa-circle-arrow-up font-small font-crossref-green'>&lt;/i>
means this is a new change, where we&amp;rsquo;ve moved &amp;lsquo;up&amp;rsquo; the traffic lights, in comparison to the previous audit. We would use the same if &amp;lsquo;down&amp;rsquo; ever happens too. &lt;br>
&lt;i class='fa fas fa-circle-info font-small font-crossref-yellow'>&lt;/i>
or &lt;i class='fa fas fa-circle-info font-small font-crossref-green'>&lt;/i>
means that something has changed of note and in comparison to the previous audit.&lt;/p>
&lt;div class="shortcode-divwrap darkgrey-highlight">
&lt;span>&lt;h2 id="governance">GOVERNANCE&lt;/h2>
&lt;p>&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Coverage across the scholarly enterprise &lt;br>
&lt;i class='fa fas fa-circle-info font-small font-crossref-yellow'>&lt;/i>
Stakeholder governed &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Non-discriminatory participation or membership &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Transparent governance &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Cannot lobby &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Living will &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Formal incentives to fulfil mission &amp;amp; wind-down&lt;/p>&lt;/span>
&lt;/div>
&lt;h3 id="whats-changed-with-governance">What’s changed with governance&lt;/h3>
&lt;h4 id="stakeholder-governed">Stakeholder governed&lt;/h4>
&lt;p>We’ve been yellow and we’re still yellow, but it has been improving. In the past, we’ve reported that we are working towards this but we’re not there yet because we didn’t have representation on the board from certain types of members, specifically research funders and research institutions. In the incoming 2025 board class, we have both. Six out of our 16 board seats are held by universities, university presses, or libraries. We also look forward to adding a new research funder, the Austrian Science Fund (FWF), to the board in January.&lt;/p>
&lt;p>None of this, though, is hardcoded into the structure of the board. We extend an open call for board interest; any active member can apply for consideration. The Nominating Committee prepares a slate with a diverse range of candidates and organisations, and it is then up to the membership to elect board members.&lt;/p>
&lt;p>With only 16 board seats and &amp;gt;21,000 members in 160 countries, being fully stakeholder-governed is challenging. Further, there are important contributors to the community that we all rely on who are not eligible for board seats because they are not members, as defined in our &lt;a href="https://www.crossref.org/board-and-governance/bylaws/">by-laws&lt;/a>, such as sponsors, service providers, and metadata users.&lt;/p>
&lt;p>We don’t consider this principle fulfilled, and that’s a good thing to keep note of; we must keep aspiring to have a broader, more comprehensive representation of our evolving community. The board continues to discuss stakeholder representation.&lt;/p>
&lt;div class="shortcode-divwrap darkgrey-highlight">
&lt;span>&lt;h2 id="sustainability">SUSTAINABILITY&lt;/h2>
&lt;p>&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Time-limited funds are used only for time-limited activities &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Goal to generate surplus &lt;br>
&lt;i class='fa fas fa-circle-up font-small font-crossref-green'>&lt;/i>
Goal to create financial reserves &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Mission-consistent revenue generation &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Revenue based on services, not data&lt;/p>&lt;/span>
&lt;/div>
&lt;h3 id="whats-changed-with-sustainability">What’s changed with sustainability&lt;/h3>
&lt;h4 id="goal-to-create-financial-reserves">Goal to create financial reserves&lt;/h4>
&lt;p>This was yellow and is now green. In 2023, we met our goal of maintaining a contingency fund of 12 months of operating costs. We also topped up this fund in 2024 to keep pace with our growing operating expenses. The revisions for POSI 1.1 actually removed the specificity of a 12-month timeline, allowing each adopting organisation to set its own goal; in Crossref’s case, 12 months remains appropriate.&lt;/p>
&lt;div class="shortcode-divwrap darkgrey-highlight">
&lt;span>&lt;h2 id="insurance">INSURANCE&lt;/h2>
&lt;p>&lt;i class='fa fas fa-circle-info font-small font-crossref-yellow'>&lt;/i>
Open source &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Open data (within constraints of privacy laws) &lt;br>
&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
Available data (within constraints of privacy laws) &lt;br>
&lt;i class='fa fas fa-circle-up font-small font-crossref-green'>&lt;/i>
Patent non-assertion&lt;/p>&lt;/span>
&lt;/div>
&lt;h3 id="whats-changed-with-insurance">What’s changed with insurance&lt;/h3>
&lt;h4 id="open-source">Open source&lt;/h4>
&lt;p>This was yellow and still is, but we’re making improvements. In September of this year we migrated our database off of a closed-source solution and onto &lt;a href="https://www.postgresql.org/" target="_blank">PostgreSQL&lt;/a>. This has improved the performance of the system and is an important step towards paying down technical debt and moving the system fully into the cloud.&lt;/p>
&lt;h4 id="patent-non-assertion">Patent non-assertion&lt;/h4>
&lt;p>This was yellow and is now green. We confirm that we do not hold any patents, and we have a published &lt;a href="https://www.crossref.org/operations-and-sustainability/patent-policy/">policy&lt;/a> on it that is available for inspection and reuse by anyone in the community.&lt;/p>
&lt;h2 id="in-summary">In summary&lt;/h2>
&lt;p>These are the main changes of note for our 2024 POSI update. The summary is that we&amp;rsquo;ve maintained all our greens, and of the four principles that were yellow last time, two have moved to green (financial reserves; patent non-assertion) and two have remained yellow but seen some progress of note (stakeholder governed; open source).&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/crossref-posi-2024.png#floatstart"
alt="Crossref POSI self-audit in a nutshell" width="75%">
&lt;/figure>
&lt;p>Please let us have any comments or questions; by commenting here it will add a public record of the discussion on our community forum. Here is an image to share, if needed.&lt;/p>
&lt;p>We continue to learn from the &lt;a href="https://openscholarlyinfrastructure.org/posse/" target="_blank">POSI adopters group&lt;/a>&amp;mdash;now numbering 23 organisations&amp;mdash;and the group will soon share a draft of POSI v2 for community comment. We look forward to the ongoing discussions with this group, and others, to keep improving and holding ourselves to account.&lt;/p></description></item><item><title>Summary of the environmental impact of Crossref</title><link>https://www.crossref.org/blog/summary-of-the-environmental-impact-of-crossref/</link><pubDate>Thu, 05 Dec 2024 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/summary-of-the-environmental-impact-of-crossref/</guid><description>&lt;p>In June 2022, we wrote a blog post “&lt;a href="https://doi.org/10.64000/bnv9r-q8f86" target="_blank">Rethinking staff travel, meetings, and events&lt;/a>” outlining our new approach to staff travel, meetings, and events with the goal of not going back to ‘normal’ after the pandemic. We took into account three key areas:&lt;/p>
&lt;ol>
&lt;li>The environment and climate change&lt;/li>
&lt;li>Inclusion&lt;/li>
&lt;li>Work/life balance&lt;/li>
&lt;/ol>
&lt;p>We are aware that many of our members are also interested in minimizing their impacts on the environment, and we are overdue for an update on meeting our own commitments, so here goes our summary for the year 2023!&lt;/p>
&lt;p>To be honest, the picture is mixed. On the positive side, we are traveling less and differently compared with 2019. Most of our events have been online, with some regional in-person ones, reducing our carbon footprint and increasing inclusivity with more people attending Crossref events. On the negative side, it hasn’t been easy to collect the data and figure out the best tools for calculating emissions, and we certainly haven’t captured all of our carbon emissions. Our approach has been to not let the perfect be the enemy of the good and we’ve focused on our largest source of carbon emissions - air travel.&lt;/p>
&lt;h3 id="some-of-the-positive-things">Some of the positive things:&lt;/h3>
&lt;ul>
&lt;li>We have maintained our strategic approach to consider environmental, inclusion, and work/life balance issues when we plan travel and to make the most of in-person events by focusing on those that involve interaction, such as listening and learning from our members and users, deepening relationships, co-creating, and forming new alliances&lt;/li>
&lt;li>Crossref Annual Meetings and community updates have been online and in different time zones.&lt;/li>
&lt;li>Crossref board meetings have been reduced from three in-person meetings per year to one face-to-face and two online meetings per year.&lt;/li>
&lt;li>We had an optional all-staff in-person meeting in June 2023 (and this year too).&lt;/li>
&lt;li>For the in-person board and staff meetings, we have selected locations that minimize the overall amount of travel and maximize direct flights.&lt;/li>
&lt;li>We have maintained our country focus for in-person local meetings supported by regional Ambassadors.&lt;/li>
&lt;li>We met our goal of keeping total travel and meeting expenses below 60% of 2019 costs even though we have more staff and membership growth has continued. The amount of money spent is a rough proxy for our carbon impact.&lt;/li>
&lt;li>We no longer have an office in Oxford and will not renew the lease on our Lynnfield, MA office, so we will have no physical offices by the end of 2024. This is not a large carbon emission reduction and is more a result of being a “distributed first” organisation with staff in 11 different countries.&lt;/li>
&lt;li>We recorded data on staff travel (flights, trains, cars, hotels) for 2023 to use as a baseline for comparison with future years. In 2023 the carbon emissions from travel and meetings was about 105 tCO2e.&lt;/li>
&lt;li>We used tools provided by Amazon Web Services (AWS) and Zoom to estimate the impact of these services. In 2023 this was 0.266 tCO2e for AWS and .1 tCO2e for Zoom.&lt;/li>
&lt;/ul>
&lt;h3 id="some-challenges">Some challenges&lt;/h3>
&lt;ul>
&lt;li>Compiling data is difficult and time-consuming for a small organisation&lt;/li>
&lt;li>There are many different calculators and metrics to use and it’s difficult to decide which to use and how much detail to go into&lt;/li>
&lt;li>We haven’t yet estimated the carbon footprint of staff home working&lt;/li>
&lt;li>We were able to calculate the emissions from AWS but not our data center&lt;/li>
&lt;li>We didn’t estimate the emissions from our offices. We had a small office in Oxford until November 2023, and we have an office near Boston - we won’t be renewing the lease in 2025 so won’t have any offices.&lt;/li>
&lt;/ul>
&lt;h3 id="total-travel-and-meetings-spending">Total travel and meetings spending&lt;/h3>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Year&lt;/th>
&lt;th>Amount&lt;/th>
&lt;th>Percentage of 2019&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>2019 actuals&lt;/td>
&lt;td>$585,482&lt;/td>
&lt;td>100%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2020 actuals&lt;/td>
&lt;td>$91,700&lt;/td>
&lt;td>16%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2021 actuals&lt;/td>
&lt;td>$19,066&lt;/td>
&lt;td>3%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2022 actuals&lt;/td>
&lt;td>$74,416&lt;/td>
&lt;td>13%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2023 actuals&lt;/td>
&lt;td>$305,737&lt;/td>
&lt;td>52%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2024 budget&lt;/td>
&lt;td>$333,500&lt;/td>
&lt;td>56%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>We have recorded carbon emissions from travel at about 105 tCO2e, so we will compare 2023 with future years. Now that we have started collecting travel data, it will be easier—staff can do it as they travel throughout the year.&lt;/p>
&lt;p>Our Executive Director, Ed Pentz, looked at his personal and work flights and the carbon emissions in 2019 were 18 tCO2e and in 2023 were 2.7 tCO2e so this is a big change in the right direction.&lt;/p>
&lt;h3 id="hosting-services">Hosting services&lt;/h3>
&lt;p>We use AWS for hosting our REST APIs, Crossref Metadata Search, the website, and Labs projects. Our main metadata registry is still in a data center, which is not included in this calculation. For 2023 Amazon reports Crossref’s carbon emissions were 0.216 tCO2e compared with 0.266 tCO2e in 2022. Crossref is planning to move out of the data center and fully to AWS by the end of 2024 so this will increase our AWS usage and therefore our emissions from related activities will increase. Compared to travel, the footprint from AWS is minimal.&lt;/p>
&lt;h3 id="online-meetings">Online meetings&lt;/h3>
&lt;p>As a distributed, remote-first organisation Crossref is a heavy Zoom user –– it’s essential for staff and for engaging with our community. However, Zoom doesn’t provide tools or estimates of the carbon impact of Zoom meetings. We used &lt;a href="https://www.utilitybidder.co.uk/business-electricity/zoom-emissions/" target="_blank">a tool provided by Utility Bidder&lt;/a>, which makes a lot of estimates and assumptions. In 2023 Crossref had almost 800,000 meeting minutes. This translated into an average of 1.92 kg of CO2 emissions per week, or 100 kg per year.&lt;/p>
&lt;p>Some studies have estimated that turning off video reduces the carbon footprint of meetings. However, this can be a false savings since video is often important for creating a connection and having a productive meeting, and a Zoom meeting with video is still much, much better than traveling, particularly if flying is involved.&lt;/p>
&lt;h3 id="tools-we-used">Tools we used&lt;/h3>
&lt;p>In order to calculate emissions for flights and train journeys, we chose to use &lt;a href="https://www.carbonfootprint.com/calculator.aspx" target="_blank">Carbon Calculator&lt;/a>. We didn’t calculate emissions from hotel stays but looked at the &lt;a href="https://www.hotelfootprints.org/" target="_blank">Hotel Footprinting tools&lt;/a> and may add hotels to calculations in the future.&lt;/p>
&lt;h3 id="offsetting">Offsetting&lt;/h3>
&lt;p>We don’t offset our emissions from travel or other operations and don’t have plans to do this. &lt;a href="https://en.wikipedia.org/wiki/Carbon_offsets_and_credits#Assuring_quality_and_determining_value" target="_blank">Offsetting emissions is problematic in a number of different ways&lt;/a> so we don’t feel confident in doing it.&lt;/p>
&lt;p>We did tree-planting as a “thank you” for the time of respondents in our metadata survey. Intended as an alternative to more commercial types of incentives rather than off-setting for our emissions, this resulted in 921 trees planted for the &lt;a href="https://ecologi.com/projects/restoring-degraded-land-in-ethiopia" target="_blank">Gewocha Forest, Ethiopia&lt;/a> via Ecologi.&lt;/p>
&lt;h3 id="wrapping-up">Wrapping up&lt;/h3>
&lt;p>Moving forward, we’ve learned a lot over the last couple of years. Collecting accurate data is challenging and time-consuming, especially for a small organisation. For us, this has been a new lens for viewing our activities, and it remains a true learning journey and we have made permanent changes. In 2024 and beyond we are going to continue to follow our travel, meetings, and events policies that we announced in 2022. We will continue to capture our air travel emissions, and in 2025 we will more accurately capture train journeys and hotel stays. We will also continue calculating our Zoom and AWS emissions as best as we can. What we&amp;rsquo;ve learnt in the process of capturing and calculating our 2023 emissions helped us set things up to enable more prompt reporting on these impacts in the future.&lt;/p>
&lt;p>We expect that many of our members and our community at large assess their environmental impact or are embarking on similar projects, to understand and curb emissions. We’re keen to discuss this and learn together to reduce our environmental impact as an organisation.&lt;/p></description></item><item><title>Metadata beyond discoverability</title><link>https://www.crossref.org/blog/metadata-beyond-discoverability/</link><pubDate>Tue, 03 Dec 2024 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/metadata-beyond-discoverability/</guid><description>&lt;p>Metadata is one of the most important tools needed to communicate with each other about science and scholarship. It tells the story of research that travels throughout systems and subjects and even to future generations. We have metadata for organising and describing content, metadata for provenance and ownership information, and metadata is increasingly used as signals of trust.&lt;/p>
&lt;p>Following our panel discussion on the same subject at the ALPSP University Press Redux conference in May 2024, in this post we explore the idea that metadata, once considered important mostly for discoverability, is now a vital element used for evidence and the integrity of the scholarly record. We share our experiences and views on the metadata significance and workflows from the perspective of academic and university presses – thus we primarily concentrate on the context of books and journal articles.&lt;/p>
&lt;p>The communication of knowledge is facilitated by tiny elements of metadata flitting around between thousands of systems telling minuscule parts of the story about a research work. And it isn’t just titles and authors and abstracts – what we think of as metadata has really evolved as more nuance is needed in the assessment and absorption of information. Who paid for this research and how much, how exactly did everyone contribute, what data was produced and is it available for me to reuse it, as well as, increasingly, things like post-publication comments, assertions from “readers like me”, who has reproduced this research or refuted these conclusions.&lt;/p>
&lt;p>Different types of published works are described by different types of metadata – journal articles, book chapters, preprints, dissertations. And those metadata elements can be of varying importance for different users. In this article, we will talk about metadata from the perspectives of four personas highlighted by the &lt;a href="https://metadata2020.org" target="_blank">Metadata 20/20&lt;/a>:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Metadata Creators&lt;/strong>, who provide descriptive information (metadata) about research and scholarly outputs.&lt;/li>
&lt;li>&lt;strong>Metadata Curators&lt;/strong>, who classify, normalise and standardise this descriptive information to increase its value as a resource.&lt;/li>
&lt;li>&lt;strong>Metadata Custodians&lt;/strong>, who store and maintain this descriptive information and make it available for consumers.&lt;/li>
&lt;li>&lt;strong>Metadata Consumers&lt;/strong>, who knowingly or unknowingly use the descriptive information to find, discover, connect, and cite research and scholarly outputs.&lt;/li>
&lt;/ul>
&lt;p>Our approach delineates the metadata lifecycle, from authorship, through production, discovery and through continuous curation. Though some of the metadata is generated outside of that linear process, and much happens before the authorship step, we see it as a clear and useful breakdown of how metadata contributes to a new piece of content.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/alpspup_redux_%20metadata_roles.png"
alt="illustration of the 4 roles in the metadata lifecycle with text explaining each role- authorship: where the author/editor and publisher collaborate to create basic metadata, production: where the publisher prepares the metadata for external distribution, discovery: where the metadata is integrated into a diverse range of systems, and beyond: where the metadata is used, reviewed and updated over time." width="75%">
&lt;/figure>
&lt;/div>
&lt;h2 id="authorship">Authorship&lt;/h2>
&lt;p>The first stage in the metadata lifecycle, authorship, is just the beginning of a dynamic process with many collaborators. A formative piece of the puzzle, authorship involves the authors or contributors, the editorial team and/or the marketing team and this is when the shape of the project and its metadata takes form. During this stage, the book or journal&amp;rsquo;s metadata exists only between the originators and the publisher, allowing the most opportunity for creativity and enhancement. Once the metadata reaches the next checkpoint along the lifecycle and is sent out externally, it&amp;rsquo;s more difficult and riskier to make major changes to the key metadata elements. In scholarly monograph publishing especially, we have the advantage of longer production lead times during which to amend and manipulate metadata during this stage.&lt;/p>
&lt;p>At this stage, authors may have ideas of titles, subtitles and descriptions and it is up to the editors and other team members at the publisher to think strategically about how this can be optimised. The marketing and sales teams may be thinking about how the abstracts, keywords, and classifications can be best optimised for the web, leading to increased sales. Discoverability and interoperability of metadata for a book or journal, especially the use of persistent identifiers, is beneficial both for the author – in that their book is easily discovered, used, and cited – and for the publisher – increased visibility, sales, and usage.&lt;/p>
&lt;p>Current challenges at the authorship stage include changing goalposts for metadata standards and accessibility requirements, which also have knock-on effects in subsequent stages in the metadata lifecycle. One of the key challenges with these is that they require buy-in from multiple players to keep up with and amend, and publishers must think closely about how these changes may affect metadata workflows for books at different stages of publication.&lt;/p>
&lt;h2 id="production">Production&lt;/h2>
&lt;p>As a book or journal article comes into production, it’s time to update and release the metadata to retailers, libraries, data aggregators and distributors. The metadata should be updated and checked to make sure that it’s still a good reflection of the product or the content that it describes and complete enough to release, including a final cover image in the case of books.
This is still very much a collaborative effort with multiple roles involved. Technical details, such as spine width, page extents, and weight, are added, capturing the final specification. The editorial team may update metadata entered into systems earlier in the process. For example reviewing the prices, updating subject classification codes or amending the chapter order. If any of the content is to be published open access, appropriate licensing and access metadata need to be included, so that users of the content are clear about what they can (and can’t!) do with it.
Metadata that’s not yet captured upstream can be added or enhanced. For example, vendors already involved in the production process can verify that persistent identifiers (PIDs) are present and correct in funding metadata.&lt;/p>
&lt;p>More and more metadata elements are being requested by supply chain partners. For example, new requirements being introduced to provide commodity codes, spine width, carton quantities, gratis copy value and country of manufacture. There may be differences in metadata depending on the methods of production. For example, country of manufacture will be supplied differently when using traditional print methods where the whole print run is carried out at a location, or where a title is manufactured print on demand and the location of printing is determined by the delivery address.&lt;/p>
&lt;p>In an XML-first workflow, metadata can be captured with the content files to aid with discovery. This usually requires multiple systems, both internal and external. These systems need to be able to work together to ensure that only up-to-date metadata is used. Metadata will change throughout the production process, whether it’s the publication of an accepted manuscript through to the final version of record, or pre-order information to the published version, so updates need to feed out regularly.&lt;/p>
&lt;p>The right metadata needs to go to the right recipient. Some is not useful or cannot be processed by certain recipients. For example, a printer, retailer, librarian or data aggregator each have their own needs and use cases and may receive and process metadata in different formats or require different fields.&lt;/p>
&lt;h2 id="discovery">Discovery&lt;/h2>
&lt;p>Discovery is the series of actions taken by an end user to retrieve and access relevant content they do not know about. Discovery can happen everywhere: Google (a search engine), a library catalog, a publisher platform, etc. However, Discovery is associated with using Discovery systems in the academic sector.&lt;/p>
&lt;p>The technological landscape of libraries has developed in the last 15 years. Discovery systems are tools libraries subscribe to in order to allow their end users to have one search experience within their library holdings. It is paramount for librarians that library collections are used; hence, it is very important for them that the discovery system of their choice contains all the relevant metadata. Libraries expect their discovery service to include their content coverage as comprehensively as possible. Content items not represented or misrepresented in a discovery system create challenges to libraries in how they might otherwise ensure that these materials are discovered and accessed.&lt;/p>
&lt;p>Libraries&amp;rsquo; adoption and usage of discovery systems are surrounded by the belief that the great benefits of this technology are the one search box and the configuration flexibility, which are the most important benefits. Libraries invest a significant amount of money in discovery services. The increase in usage is the success indicator of this adoption and a positive return on investment.&lt;/p>
&lt;p>The backbone of discovery systems is formed by three crucial elements: a user interface, a metadata index, and a link resolver or Knowledge Base. These elements, along with a back-end control panel for librarian configuration, are the key components that enable the discovery process.&lt;/p>
&lt;p>The discovery index, a database storing descriptive data from various content providers, data sets, and content types, is a testament to the collaborative efforts of content providers and discovery systems vendors. Their work under the Discovery Metadata Sharing partnership agreements, which establish the &lt;em>format, scope, frequency, and support&lt;/em> of the collaboration, is instrumental in meeting librarians&amp;rsquo; expectations.&lt;/p>
&lt;h3 id="format">Format&lt;/h3>
&lt;p>The discovery metadata integration processes have settled down for most cases in these two metadata delivery workflows.&lt;/p>
&lt;p>&lt;em>Metadata for the index of discovery:&lt;/em> Discovery systems have traditionally made efforts to work with various metadata formats like MARC, proprietary templates, etc., but the preferred format is XML. This metadata could include all the bibliographic information data, including index terms and full text at the article and chapter level.&lt;/p>
&lt;p>&lt;em>Metadata for link resolvers and Knowledge bases:&lt;/em> Knowledge bases are tools that contain information about what is included in a product, packages, and/or databases. KBART is the preferred format in this area. It includes a set of basic bibliographic descriptions at the publication level and linking information for direct and OpenURL syntaxes.&lt;/p>
&lt;h3 id="frequency">Frequency&lt;/h3>
&lt;p>The delivery channels vary, and the frequency could vary daily to yearly, depending on the publication schedule.&lt;/p>
&lt;h3 id="scope">Scope&lt;/h3>
&lt;p>Library collections include various content types, including archival materials, open access, and multimedia alongside the more traditional books and periodicals. Different content types will require different metadata elements to make a comprehensive discovery-friendly description, and the metadata elements will impact the formats in use.&lt;/p>
&lt;p>Discovery services will receive this data and prioritise uploading. They will select and manipulate the required metadata elements according to their system requirements. These metadata tweaks and selections are not always communicated to the content providers and/or libraries.
Ultimately, librarians decide which metadata will be visible on their discovery tool and the linking methods of their choice.&lt;/p>
&lt;p>As described, Discovery is a complex area where the activities of its main stakeholders are interconnected. The success of the end users&amp;rsquo; discovery journey from search to access depends on the successful integration, implementation, and maintenance of the discovery systems. This necessitates a combined effort from the three discovery stakeholders: content providers, discovery system providers, and libraries. Their collaborative work is not just crucial, but integral to supporting discovery and fulfilment in the most efficient manner possible. Your active involvement in this process is what makes it successful.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/metadata-beyond-discoverability-blog-graph.png"
alt="A pie chart divided into three sections, each labeled to represent the key discovery stakeholders: “Content Provider” (in yellow), “Library” (in orange), and “System Provider” (in gray). These sections visually represent the collaborative roles for successful metadata integration and discovery.">
&lt;/figure>
&lt;/div>
&lt;h3 id="how-do-we-ensure-discoverability">How do we ensure discoverability?&lt;/h3>
&lt;p>Electronic resources do not exist in isolation but are assessed and used depending on their level of integration in the discovery landscape where libraries and patrons are active.
From a content provider&amp;rsquo;s perspective, discoverability is about the number and efficiency of entry points to our products created in third-party discovery products.&lt;/p>
&lt;p>The level of discovery integration has a direct impact on sales and upsell opportunities. Products that are not discoverable are difficult to work with, and the opposite is true for products that are considered discoverable. Your role in ensuring discoverability directly influences the user experience and sales, making your work crucial and impactful.
The term &amp;lsquo;Discoverability&amp;rsquo; is critical in discovery library systems. It refers to the extent to which eResources are searchable in a discovery system, and it directly influences the ease with which users can find the information they need, thereby enhancing their overall experience.
In practical terms, the degree of discoverability will be impacted by the quality of the metadata supplied, the transformations the metadata suffers in the integration process to discovery systems, and the configuration&amp;rsquo;s maintenance.&lt;/p>
&lt;p>The general principles of metadata quality also apply in this area: accuracy, completeness, and timely delivery. Your attention to these principles is crucial to contributing to the effectiveness of the discovery process. Metadata enrichment practices like identifiers and standards are also applicable. Your meticulous attention to detail in maintaining metadata quality ensures the effectiveness of the discovery process.&lt;/p>
&lt;p>Discovery as a mindset in the publishing process will increase discoverability, as it will be influenced by product designs (whether the content is linkable) and which metadata outputs are possible. For example, author-generated index terms will be more effective for meeting research search terms, and detailed article titles will probably be more discoverable than general titles.
Finally, all the integration, descriptive metadata, configurations, etc., leave much room for errors. The flow is complex; on occasion, the products and content are more complicated to describe than tools can handle, and there are millions of holdings per library to manage. Constant maintenance and troubleshooting are crucial elements to maintaining and increasing discoverability.&lt;/p>
&lt;h2 id="metadata-beyond-publication">Metadata beyond publication&lt;/h2>
&lt;p>In the lead-up to publication, finalising rich complete metadata can seem like establishing a fixed set of information. Post-publication, however, the metadata workflow should be dynamic, able to evolve to keep pace with new demands and opportunities. Think of metadata as a journey rather than a one-time destination, and look at ways to futureproof your metadata by actively adapting to some of the following types of change.&lt;/p>
&lt;h3 id="changing-publisher-goals-and-product-needs">Changing Publisher Goals and Product Needs&lt;/h3>
&lt;p>Metadata should align with changing priorities for a publisher. Developing new formats, shifts in commissioning focus or building new distribution partnerships may require metadata updates. For instance, re-releasing content in audiobook form or digitising a backlist title warrants a metadata review to ensure current and prospective readers find accurate, relevant information.&lt;/p>
&lt;h3 id="changing-technology-and-metadata-standards">Changing Technology and Metadata Standards&lt;/h3>
&lt;p>Advances in technology, from artificial intelligence to emerging metadata standards, offer enhanced possibilities for capturing and updating metadata. AI, for example, can help enrich metadata with more precise subject tagging, while new metadata formats may offer greater compatibility across platforms and discovery services. Staying current with these tools can help publishers manage metadata more efficiently and enhance discoverability.&lt;/p>
&lt;h3 id="changing-societal-values">Changing Societal Values&lt;/h3>
&lt;p>As society evolves, so do expectations for inclusive and socially responsible metadata. Utilising new categorisation codes, such as those for the &lt;a href="https://ns.editeur.org/thema/en/5YS" target="_blank">United Nations Sustainable Development Goals&lt;/a>, can align metadata with emerging social goals. Similarly, publishers may need to revisit keywords and category codes to reflect language changes, balancing the integrity of historic records with the need for current, appropriate terminology.&lt;/p>
&lt;h3 id="changing-industry-priorities">Changing Industry Priorities&lt;/h3>
&lt;p>Commitments to accessibility and sustainability have prompted developments in metadata. Increasingly, publishers need to be able to use metadata to build a record of sustainable production methods, such as paper sources, printing methods or ink types. New metadata fields for accessibility specifications will also support more inclusive reader experiences going forward. Metadata will play an increasingly vital role in meeting industry standards for accessibility, EUDR and EAA compliance, and environmental transparency.&lt;/p>
&lt;h3 id="changing-customer-and-librarian-expectations">Changing Customer and Librarian Expectations&lt;/h3>
&lt;p>Finally, as the metadata expectations of customers grow and the nature of roles and responsibilities in library and collection management professions develops, teamwork and making good use of available resources are essential. Publishers don’t have to tackle this alone. Working with organisations such as Crossref or Book Industry Communication (BIC), signing up to newsletters and webinars, and forming an in-house discovery group are all great ideas for sharing ideas and best practice, and ensuring your metadata workflow is adaptable and responsive. Be part of the conversation now rather than struggling to keep up down the line!&lt;/p>
&lt;h2 id="what-are-some-challenges-and-opportunities-with-metadata">What are some challenges and opportunities with metadata?&lt;/h2>
&lt;p>&lt;strong>JM&lt;/strong>: Metadata that establishes permanence is a real opportunity in a digital landscape where content can move or be taken down, links can rot, website certificates can expire. Persistent identifiers like ORCiDs for people and DOIs for content are key examples of metadata that establish enduring routes to, and provenance of, published digital content.&lt;/p>
&lt;p>&lt;strong>KM&lt;/strong>: Metadata creation, maintenance and change has long been seen as a manual process. AI tools offer a real opportunity for metadata creation and review, especially for keywords and classification codes, at a scale and speed that has the potential to transform metadata workflows. Especially for backlist transformation, AI could offer real opportunities in this area. A challenge we face for monograph metadata more specifically is that much of the scholarly metadata infrastructure is built around the journal article, and it can be difficult to fit longer form content into these systems of discovery.&lt;/p>
&lt;p>&lt;strong>MT&lt;/strong>: Metadata is crucial. Good metadata (complete, accurate, and timely) is the base for smooth integrations and easy discovery interactions with eResources. Bad metadata (inaccurate, incomplete, late) will be the main reason for undiscovered content. At this point, the eResources industry is still based on different versions of the same metadata, which is the leading cause of problems. It is probably time to start considering a unique record approach. This unique record, which will be complete and accurate, could be used by different systems for different purposes. I know there are many details to define here, but if you think about it, it is not impossible and could solve the many known issues.&lt;/p>
&lt;h2 id="how-do-you-ensure-the-quality-and-completeness-of-your-metadata-do-you-have-ways-of-auditing-it">How do you ensure the quality and completeness of your metadata? Do you have ways of auditing it?&lt;/h2>
&lt;p>&lt;strong>SP&lt;/strong>: Validation of data is really important, so choosing or building a system that’s set up to do this is an important foundation. It’s straightforward to check for completeness of fields and I run daily checks on our book metadata to make sure there’s nothing missing in the files feeding out. Quality can be more challenging to monitor. Feedback from data recipients is key, and accreditation schemes such as the &lt;a href="https://bic.org.uk/resources/accreditations-overview/metadata-excellence-award/" target="_blank">BIC Metadata Excellence Award&lt;/a> are a great way to benchmark progress. Good training and clear documentation help to make sure that everyone involved in creating and updating metadata understands exactly what they need to do and the standards they need to meet.&lt;/p>
&lt;p>&lt;strong>KM&lt;/strong>: Earlier this year we completed a year-long data cleansing project as part of our move to a new title management database. This gave us the time to address gaps in backlist metadata as well as to identify any inconsistencies across records for the same book, and enrich key metadata fields like classification codes, keywords and PIDs. For frontlist titles, each person owns a number of fields to ensure they are complete before a book&amp;rsquo;s metadata is distributed – some of these have validation tools which will prevent a book&amp;rsquo;s metadata from being sent out unless it is complete.&lt;/p>
&lt;p>&lt;strong>MT&lt;/strong>: Strict and consistent internal processes are essential to ensure quality and completeness. Following the different standards and industry recommendations helps to keep the quality at high standards. Random manual checks and system-based checks help to ensure everything is good. We carry out projects where we work with specific aspects of the metadata. This building-blocks approach ensures the different data layers are as good as possible. As with any project, metadata projects should have specific goals, outcomes, resources, and documentation.&lt;/p>
&lt;h2 id="how-do-you-know-if-and-how-much-metadata-helps-achieve-your-goals">How do you know if (and how much) metadata helps achieve your goals?&lt;/h2>
&lt;p>&lt;strong>JM&lt;/strong>: Take any available opportunities to find out what people think of your metadata – via library conferences, institutional customer feedback, and by working with the library team at our home institution, we’ve had some really useful and interesting conversations about MUP’s metadata and where we can improve it to make it as relevant as possible for different stakeholder needs.&lt;/p>
&lt;p>&lt;strong>MT&lt;/strong>: Customers and Discovery partners will inform us if something is incorrect. Usage data is also a good indicator of how healthy our metadata is. Following industry standards is another good reference point for assessing the metadata. Finally, the metadata is only good when we know what we want to use it for. So, always considering what we are trying to achieve helps us understand how effective the metadata is.&lt;/p>
&lt;p>&lt;strong>KM&lt;/strong>: As the others have noted here, and we represent a range of different types and sizes of publishers, measuring the direct impact of metadata is an ongoing challenge. We think about the different end users who might encounter our metadata further down the supply chain – retail customers searching on Amazon, librarians filtering results on purchasing platforms, researchers finding our books and journals through citations on popular online search engines – and consider what elements of our metadata might help reach those people in the right ways.&lt;/p>
&lt;p>&lt;strong>JM&lt;/strong>: Ideally, you’ll see an uplift in sales or usage for every metadata element that you add, review or expand, although it can be challenging to quantify and prove a direct correlation between richer metadata and higher revenue or discoverability, as there are will be other factors involved. For my Operations team, what is certain is that richer, more comprehensive metadata means fewer errors are thrown up by the distribution systems and feeds we use, which means colleagues save time and gain productivity by not having to resolve and rerun failed jobs, chase missing information from other teams, or manually send information to third parties. My job is also made easier because things like size and weight of every printed product are recorded in our bibliographic database as standard, easy to report on and analyse, which helps with forecasting costs for inventory storage or shipping. Metadata can be powerful.&lt;/p></description></item><item><title>Research Integrity Roundtable 2024</title><link>https://www.crossref.org/blog/research-integrity-roundtable-2024/</link><pubDate>Fri, 15 Nov 2024 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/research-integrity-roundtable-2024/</guid><description>&lt;p>For the third year in a row, Crossref hosted a roundtable on research integrity prior to the Frankfurt book fair. This year the event looked at &lt;a href="https://www.crossref.org/documentation/crossmark/" target="_blank">Crossmark&lt;/a>, our tool to display retractions and other post-publication updates to readers.&lt;/p>
&lt;p>Since the start of 2024, we have been carrying out a &lt;a href="https://doi.org/10.64000/2jdxh-7dh12" target="_blank">consultation on Crossmark&lt;/a>, gathering feedback and input from a range of members. The roundtable discussion was a chance to check and refine some of the conclusions we’ve come to, and gather more suggestions on the way forward. As in previous years, we were able to include a range of organisations, which led to lively and interesting discussions. See below for the full participant list.&lt;/p>
&lt;h2 id="crossmark-feedback">Crossmark feedback&lt;/h2>
&lt;p>We started by presenting Crossmark and a summary of the consultation process. There are a number of areas where we have learned more about how the community operates or found that Crossmark needs to adapt. These include:&lt;/p>
&lt;p>&lt;em>Implementation&lt;/em>: Our members have struggled to implement Crossmark and uptake is low. At the same time, in many organisations the workflows for handling retractions are not well-defined because they are rarely used, if ever. The responsibility for updating Crossref metadata can be unclear and this may be a factor in the low uptake.&lt;/p>
&lt;p>&lt;em>Education&lt;/em>: There are different levels of understanding about how to handle retractions. Some members are very defensive when asked about retractions, others state they will never make updates to published works. How can we have a constructive conversation where the value of communicating updates appropriately is recognised?&lt;/p>
&lt;p>&lt;em>Community engagement&lt;/em>: Given the different scales, locations, disciplines, and technologies used by our members, it looks like one size will not fit all when it comes to updates. How can we get continual, representative feedback on new tools and processes?&lt;/p>
&lt;p>&lt;em>Metadata assertions&lt;/em>: Crossmark allows the deposit of metadata using custom field names, however this metadata seems to have low usefulness and is not highly valued by the community. Should we continue to collect it? Can we make some of the most-used field names part of our standard schema?&lt;/p>
&lt;p>&lt;em>Changing the Crossmark UI&lt;/em>: Although we didn’t specifically ask about it during the consultation, the look of the Crossref logo often came up, and concern that it is not recognised and not well-used. Can we change the look and behaviour so that it has more impact?&lt;/p>
&lt;h2 id="niso-recommendations">NISO Recommendations&lt;/h2>
&lt;p>Patrick Hargitt represented the NISO group on &lt;a href="https://www.niso.org/standards-committees/crec" target="_blank">Communication of Retractions, Removals, and Expressions of Concern (CREC)&lt;/a>. The group’s recommendations were published earlier this year and cover how retractions are communicated. CREC arose from an earlier project, &lt;a href="https://doi.org/10.1186/s41073-022-00125-x" target="_blank">IRSRS&lt;/a>. A large part of the motivation is that retracted works continued to be cited, with citing authors apparently unaware of the retraction.
Patrick presented the CREC recommendations, which cover:&lt;/p>
&lt;ul>
&lt;li>Metadata receipt, display, and distribution,&lt;/li>
&lt;li>Which metadata elements to communicate,&lt;/li>
&lt;li>How to implement the recommendations,&lt;/li>
&lt;li>Discussion of some special cases,&lt;/li>
&lt;li>Key stakeholders and their responsibilities.&lt;/li>
&lt;/ul>
&lt;p>The two presentations prompted discussion, which was taken into the first of two workshops.&lt;/p>
&lt;h2 id="first-workshop-improving-collection-of-retractions-and-crossmark">First workshop: Improving collection of retractions and Crossmark&lt;/h2>
&lt;p>The first workshop looked at proposed changes to Crossmark and how to encourage more members to deposit their retractions, corrections, and other post-publication updates. Several important themes emerged.&lt;/p>
&lt;p>First, the question of whose responsibility it should be to provide metadata on retractions and similar updates. Crossref has a responsibility to work with the community to obtain high quality and complete metadata; publishers should take responsibility for handling issues of research integrity and reporting them to relevant downstream services, like Crossref; and platforms need to provide tools that allow easy reporting of retractions.&lt;/p>
&lt;p>The value of Crossmark appearing in PDFs was reiterated. The fact that a PDF can be downloaded, and years later there is a way to tell whether it has been retracted or not is highly valued. There was also the suggestion that the Crossmark logo on web pages can indicate a change before it has been clicked. This is something that we have been considering at Crossref and it was useful to have the idea reinforced. Another suggestion was that a browser plugin would make a good complement to Crossmark.&lt;/p>
&lt;p>Implementation issues with Crossmark were raised, including that it’s difficult to validate whether a specific implementation is complete. There are a number of different changes (to metadata deposit and content, and websites) that need to work together to have Crossmark fully functional. There were several questions and a discussion about Retraction Watch data. Some were about understanding its collection and validation. A number of participants are actively using the data and it was great to see the variety of applications.&lt;/p>
&lt;h2 id="second-workshop-community-use-of-retraction-metadata">Second workshop: Community use of retraction metadata&lt;/h2>
&lt;p>The second workshop focused on a broader set of downstream organisations that might want to make use of retraction metadata. We looked at stakeholders and their needs, and attempted to match them up with existing tools. Several gaps were identified as a result, which may provide opportunities for new services or collaborations to fill them.&lt;/p>
&lt;p>We identified a number of tools available for publishers, editorial systems, metadata researchers, and readers. A good example is reference managers, many of which are now highlighting retracted works to authors. This can help to reduce the number of retracted works being cited. Publishing platforms are also providing support to editors, using tools that include retraction metadata.&lt;/p>
&lt;center>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/frankfurt-roundtable-workshop2-postits.jpg"
alt="A whiteboard showing post-it notes from the second workshop." width="40%">
&lt;/figure>
&lt;/center>
&lt;p>Some of the stakeholders identified have limited tools for identifying retractions that are relevant to them. These include funders, archives and repositories, journalists, and institutions.&lt;/p>
&lt;p>Often, there are pathways for retraction data to be communicated but they are not being sufficiently used. There needs to be a concerted effort to improve the quality of retraction metadata for tools to function better. For example, a second author on a paper might not know that a correction or retraction is planned for their article. If their email or ORCID isn’t included in the metadata, an alerting tool wouldn’t be able to let them know. A similar argument can be made for institutions or funders if they are not well-identified in the metadata.&lt;/p>
&lt;p>The question of standardisation of metadata was raised. It seems too early to implement a full set of standards at the moment. CREC and similar initiatives have documented and accommodated for a range of practices while providing guidance and principles to work towards. More discussion is needed in the community to work out paths that could be applied across the broad spectrum of scholarly communication.&lt;/p>
&lt;h2 id="conclusion">Conclusion&lt;/h2>
&lt;p>The event was very valuable in bringing up a range of topics related to retraction and communication of post-publication changes to scholarly works. We are grateful to all of the participants for their contributions and sharing their diverse experience and opinions with us.&lt;/p>
&lt;p>Research integrity is an area of flux, with significant changes over the past few years. While there has been progress, there remain gaps in metadata and tools to communicate retractions. This is something that Crossref will continue to contribute to, and Crossmark clearly still has a role to play.&lt;/p>
&lt;p>Some of the ideas and suggestions from the discussion can be implemented in the near future. Others need further development, and we will continue to engage the community. Reading this, there may be topics where you feel you have a role to play. We are keen to partner with other organisations in this space as we continue to improve the transparency and communication of metadata for post-publication updates.&lt;/p>
&lt;h2 id="participants">Participants&lt;/h2>
&lt;p>Many thanks to the participants. Here is the full list of those that attended:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Name&lt;/th>
&lt;th>Role&lt;/th>
&lt;th>Organisation&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Aaron Wood&lt;/td>
&lt;td>Head, Product &amp;amp; Content Management&lt;/td>
&lt;td>American Psychological Association&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Adya Misra&lt;/td>
&lt;td>Associate Director, Research Integrity&lt;/td>
&lt;td>Sage&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bianca Kramer&lt;/td>
&lt;td>&lt;/td>
&lt;td>Sesame Open Science&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Constanze Schelhorn&lt;/td>
&lt;td>Head of Indexing&lt;/td>
&lt;td>MDPI&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guillaume Cabanac&lt;/td>
&lt;td>Full Professor&lt;/td>
&lt;td>University of Toulouse&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Hong Zhou&lt;/td>
&lt;td>Director of AI Product&lt;/td>
&lt;td>Wiley&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Jennifer Wright&lt;/td>
&lt;td>Head of Publication Ethics and Research Integrity&lt;/td>
&lt;td>Cambridge University Press&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Johanssen Obanda&lt;/td>
&lt;td>Community Engagement Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Joris van Rossum&lt;/td>
&lt;td>Program Director&lt;/td>
&lt;td>STM Solutions&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kathryn Weber-Boer&lt;/td>
&lt;td>Data &amp;amp; Analytics&lt;/td>
&lt;td>Digital Science&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kornelia Korzec&lt;/td>
&lt;td>Director of Community&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kruna Vukmirovic&lt;/td>
&lt;td>Publisher- Journals&lt;/td>
&lt;td>The Institution of Engineering and Technology&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Lena Stoll&lt;/td>
&lt;td>Product Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Leslie McIntosh&lt;/td>
&lt;td>VP, Research Integrity&lt;/td>
&lt;td>Digital Science&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Liying Yang&lt;/td>
&lt;td>Professor&lt;/td>
&lt;td>CAS Library&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Luis Montilla&lt;/td>
&lt;td>Technical Community Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Madhura Amdekar&lt;/td>
&lt;td>Community Engagement Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Martyn Rittman&lt;/td>
&lt;td>Progam Lead&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Maryna Kovalyova&lt;/td>
&lt;td>Member Experience Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mina Roussenova&lt;/td>
&lt;td>Project Manager, Strategic Projects&lt;/td>
&lt;td>Karger&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Osnat Vilenchik&lt;/td>
&lt;td>VP Content Operations&lt;/td>
&lt;td>Ex Libris, part of Clarivate&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Patrick Hargitt&lt;/td>
&lt;td>Senior Director of Product Management&lt;/td>
&lt;td>Atypon/Wiley&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Paul Davis&lt;/td>
&lt;td>Tech Support &amp;amp; R&amp;amp;D Analyst&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sami Benchekroun&lt;/td>
&lt;td>CEO&lt;/td>
&lt;td>Morressier&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Scott Delman&lt;/td>
&lt;td>Director of Publications&lt;/td>
&lt;td>Association of Computing Machinery (ACM)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Shilpi Mehra&lt;/td>
&lt;td>Head, Research Integrity &amp;amp; Paperpal Preflight&lt;/td>
&lt;td>Cactus Communications&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sichao Tong&lt;/td>
&lt;td>&lt;/td>
&lt;td>Chinese Academy of Sciences, Library&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table></description></item><item><title>How good is your matching?</title><link>https://www.crossref.org/blog/how-good-is-your-matching/</link><pubDate>Wed, 06 Nov 2024 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/how-good-is-your-matching/</guid><description>&lt;p>In our &lt;a href="https://doi.org/10.13003/pied3tho" target="_blank">previous blog post&lt;/a> in this series, we explained why no metadata matching strategy can return perfect results. Thankfully, however, this does not mean that it&amp;rsquo;s impossible to know anything about the quality of matching. Indeed, we can (and should!) measure how close (or far) we are from achieving perfection with our matching. Read on to learn how this can be done!&lt;/p>
&lt;p>How about we start with a quiz? Imagine a database of scholarly metadata that needs to be enriched with identifiers, such as ORCIDs or ROR IDs. Hopefully, by this point in our series this is recognizable as a classic matching problem. In searching for a solution, you identify an externally-developed matching tool that makes one of the below claims. Which of the following would demonstrate satisfactory performance?&lt;/p>
&lt;ol>
&lt;li>It is a cutting-edge, state-of-the-art, intelligent-as-they-come, bullet-proof technology! All the big players are using it. You won&amp;rsquo;t find anything better!&lt;/li>
&lt;li>The tool was tested on the metadata of 10 articles we authored, and many identifiers were matched.&lt;/li>
&lt;li>The quality of our matching is 98%.&lt;/li>
&lt;/ol>
&lt;p>Okay, okay, trick question. The correct answer here is to opt for secret answer #4: &amp;ldquo;I wouldn&amp;rsquo;t be satisfied by any of these claims!&amp;rdquo; Let&amp;rsquo;s dig in a bit more to why this is the correct response.&lt;/p>
&lt;h2 id="the-importance-of-the-evaluation">The importance of the evaluation&lt;/h2>
&lt;p>Before we decide to integrate a matching strategy, it is important to understand as much as possible about how it will perform. Whether it is used in a semi or fully automated fashion, metadata matching will result in the creation of new relationships between things like works, authors, funding sources, and institutions. Those relationships will then, in turn, be used by the consumers of this metadata to guide their understanding and perhaps even to make important decisions about those same entities. As organisations providing scholarly infrastructure, we must therefore take it as our paramount responsibility to understand any caveats or shortcomings of the scholarly metadata we make available, including that resulting from matching.&lt;/p>
&lt;p>Proper evaluation is what allows us to do this, as it is impossible to know how well a given matching strategy will perform in its absence. This is true no matter how simple or complex a matching strategy may seem. Complex methods can be tailored to data with specific characteristics and might fail when faced with something different from this. Simple methods might be only appropriate for clean metadata or a narrow set of use cases.&lt;/p>
&lt;p>Beyond complexity, matching strategies themselves vary widely in character, inheriting biases from their design, training data, or how a problem has been formulated. Some prioritise avoiding false negatives, while others focus on minimising false positives. Even a generally high-performing strategy might not be perfectly aligned with your specific needs or data. In some cases, the task also itself might be too challenging, or the available metadata too noisy, for any matching strategy to perform adequately.&lt;/p>
&lt;p>Evaluation is, again, how we understand these nuances and make informed decisions about whether to implement matching or avoid it altogether. By now, it should also be clear that the notion &amp;ldquo;we don&amp;rsquo;t need to evaluate&amp;rdquo; is far from ideal! Given its importance, let&amp;rsquo;s explore how evaluation is actually done.&lt;/p>
&lt;h2 id="evaluation-process">Evaluation process&lt;/h2>
&lt;p>In general, a proper evaluation procedure should follow the following steps:&lt;/p>
&lt;ol>
&lt;li>Preparation of an evaluation dataset containing many examples of matching inputs and the corresponding expected outputs.&lt;/li>
&lt;li>Applying the strategy to all inputs from the dataset and recording the responses.&lt;/li>
&lt;li>Comparing the expected outputs with the outputs from the strategy.&lt;/li>
&lt;li>Converting the results of the above comparison into evaluation metrics.&lt;/li>
&lt;/ol>
&lt;p>From this accounting, we can see that there are two primary components for the evaluation process: an evaluation dataset and metrics.&lt;/p>
&lt;h3 id="evaluation-dataset">Evaluation dataset&lt;/h3>
&lt;p>It&amp;rsquo;s useful to conceive an evaluation dataset as the specification for an ideal matching strategy, describing what would be returned from our forever-elusive perfect matching. When creating such a dataset, what this means in practice is that it should contain a number of real-world, example inputs, along with the corresponding ideal or expected outputs, and that all data should be in the same format as the strategy is expected to process. The outputs should themselves also confirm the strategy&amp;rsquo;s overall requirements, for example, by being consistent with its cardinality, meaning whether zero, one, or multiple matches should be returned and under what circumstances. In terms of size, it&amp;rsquo;s generally useful to calculate the ideal number of evaluation examples using a sample size calculator or using &lt;a href="https://doi.org/10.1520/E0122-17R22" target="_blank">standardised measures&lt;/a>, but as a quick rule of thumb: less than 100 examples is probably insufficient, more than 1,000 or 2,000 is generally acceptable.&lt;/p>
&lt;p>It is also important that the evaluation dataset be representative of the data to be matched in order to ensure reliable results. Using unrepresentative data, even if convenient, can lead to biassed or misleading evaluations. For example, if matching affiliations from various journals, building an evaluation dataset solely from one journal that already assigns ROR IDs to authors&amp;rsquo; affiliations might be tempting. The data, having been already annotated, allow us to avoid the tedious work of labelling, and we might even know that it is produced by a high-quality source. This is still, unfortunately, a flawed approach. In practice, such datasets are unlikely to represent the entire range of affiliations to be matched, potentially leading to a significant discrepancy between the evaluated quality and the actual performance of the matching strategy, when applied to the full dataset. To assess a matching strategy&amp;rsquo;s effectiveness, we have to resist shortcuts and instead do our best to create truly representative evaluation datasets to be confident that we&amp;rsquo;ve accurately measured their performance.&lt;/p>
&lt;h3 id="evaluation-metrics">Evaluation metrics&lt;/h3>
&lt;p>Evaluation metrics are what allow us to summarise the results of the evaluation into a single number. Metrics give us a quick way to get an estimation of how close the strategy was to achieving perfect results. They are also useful if we want to compare different strategies with each other or decide whether the strategy is sufficient for our use case, removing the need to compare countless evaluation examples from different strategies against one another.&lt;/p>
&lt;p>The simplest metric is &lt;a href="https://en.wikipedia.org/wiki/Accuracy_and_precision" target="_blank">accuracy&lt;/a>, which can be calculated as the fraction of the dataset examples that were matched correctly. While a commonsense benchmark, accuracy can be misleading, and we generally do not recommend using it. To understand why, let&amp;rsquo;s consider the following small dataset and the responses from two strategies:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Input&lt;/th>
&lt;th>Expected output&lt;/th>
&lt;th>Strategy 1&lt;/th>
&lt;th>Strategy 2&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>string 1&lt;/td>
&lt;td>ID 1&lt;/td>
&lt;td>ID 1&lt;/td>
&lt;td>ID 1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>string 2&lt;/td>
&lt;td>ID 2&lt;/td>
&lt;td>ID 3&lt;/td>
&lt;td>Empty output&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>string 3&lt;/td>
&lt;td>Empty output&lt;/td>
&lt;td>Empty output&lt;/td>
&lt;td>Empty output&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Both strategies achieved the same accuracy, 0.67, making one mistake each on the second affiliation string. However, a closer examination reveals that these error types are distinct. The first strategy matched to an incorrect identifier, while the second refused to return any value illustrating the limitation of accuracy as a measure: it generally fails to capture important nuances in strategy behaviour. In our example, the first strategy appears more permissive, returning matches even in unclear circumstances, while the second is more conservative, withholding them when uncertain. Although using such a small dataset would preclude drawing any definitive conclusions, it highlights how relying on accuracy alone can obscure differences in performance.&lt;/p>
&lt;p>For evaluating matching strategies, we instead recommend using two metrics: &lt;a href="https://en.wikipedia.org/wiki/Precision_and_recall" target="_blank">precision and recall&lt;/a>. To recap from our previous blog post:&lt;/p>
&lt;ul>
&lt;li>Precision is calculated as the number of correctly matched relationships resulting from a strategy, divided by the total number of matched relationships. It can also be interpreted as the probability that a match is correct. Low precision indicates a high rate of false positives, which are incorrect relationships created by the strategy.&lt;/li>
&lt;li>Recall is calculated as the number of correctly matched relationships resulting from a strategy, divided by the number of true (expected) relationships. It can also be interpreted as the probability that a true (correct) relationship will be created by the strategy. Low recall means a high rate of false negatives, which are relationships that should have been created by the strategy but were not made.&lt;/li>
&lt;/ul>
&lt;p>Applying these measures to our prior example, the strategies achieved the following results:&lt;/p>
&lt;ul>
&lt;li>Strategy 1: accuracy 0.67, precision 0.5, recall 0.5&lt;/li>
&lt;li>Strategy 2: accuracy 0.67, precision 1.0, recall 0.5&lt;/li>
&lt;/ul>
&lt;p>As we can see, while both strategies have the same accuracy, using precision and recall better describes the difference between the two sets of results. Strategy 1&amp;rsquo;s lower precision indicates it made false positive matches, while Strategy 2&amp;rsquo;s perfect precision shows that it made none. The identical recall scores show both identified half of the possible matches.&lt;/p>
&lt;p>Of course, results calculated using such a small dataset are not very meaningful. If we obtained these scores from a large, representative evaluation dataset, it would indicate to us that Strategy 1 risks introducing many incorrect relationships, while Strategy 2 would be unlikely to do so. In both cases, we would still expect approximately half of the possible relationships to be missing from the strategies&amp;rsquo; outputs.&lt;/p>
&lt;p>Which one is more important to prioritise, precision or recall? It depends on the use case. As a general rule, if you want to use the strategy in a fully automated way, without any form of manual review or correction of the results, we recommend paying more attention to precision. Privileging precision will allow you to better control the number of incorrect relationships added to your data. If you want to use the strategy in a semi-automated fashion, where there is a manual examination of and a chance to correct the results, pay more attention to recall. Doing so will guarantee that enough options are presented during the manual review stage and fewer relationships will be missed as a result.&lt;/p>
&lt;p>To get a more balanced estimation of performance, we can also consider both precision and recall at the same time using a measure called &lt;a href="https://en.wikipedia.org/wiki/F-score" target="_blank">F-score&lt;/a>. F-score combines precision and recall into a single number, with variable weight given to either aspect. There are three commonly used types, each calculated as the weighted &lt;a href="https://en.wikipedia.org/wiki/Harmonic_mean" target="_blank">harmonic mean&lt;/a> of precision and recall:&lt;/p>
&lt;ul>
&lt;li>F0.5: Precision is weighted more heavily. It can be understood as a score that is 50% more sensitive to precision than recall. A high F0.5 score indicates a measure of performance that minimises false positives.&lt;/li>
&lt;li>F1: Equal weight is given to both precision and recall. It can be interpreted as the most balanced score in this set. High F1 indicates good overall performance, with both false positives and false negatives being minimised equally.&lt;/li>
&lt;li>F2: Recall is weighted more heavily. It can be understood as a score that is 50% more sensitive to recall than precision. A high F2 score indicates a measure of performance where false negatives are minimised.&lt;/li>
&lt;/ul>
&lt;p>Each of these variants allows for fine-tuning the evaluation metric to align with your expectations for a specific matching task. Choose whichever reflects the relative importance of precision versus recall for your use case.&lt;/p>
&lt;p>To summarise, to avoid falling prey to misleading sales pitches or silly quizzes, it is important to have a good understanding of the performance of any strategies you are building or integrating. With thorough evaluation, including a representative dataset and carefully considered metrics, we can estimate the quality of matching and, by extension, its resulting relationships.&lt;/p>
&lt;p>Now that we&amp;rsquo;ve covered how to evaluate effectively, we can move on to some other aspects of metadata matching. Our next blog post will take a final, more holistic view of matching, exploring some complementary considerations to all of the preceding. Stay tuned for more!&lt;/p></description></item><item><title>Update on the Resourcing Crossref for Future Sustainability research</title><link>https://www.crossref.org/blog/update-rcfs/</link><pubDate>Mon, 28 Oct 2024 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/update-rcfs/</guid><description>&lt;p>We’re in year two of the &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">Resourcing Crossref for Future Sustainability&lt;/a> (RCFS) research. This report provides an update on progress to date, specifically on research we’ve conducted to better understand the impact of our fees and possible changes.&lt;/p>
&lt;p>Crossref is in a good financial position with our &lt;a href="https://www.crossref.org/fees/">current fees&lt;/a>, which haven’t increased in 20 years. This project is seeking to future-proof our fees by:&lt;/p>
&lt;ul>
&lt;li>Making fees more equitable&lt;/li>
&lt;li>Simplifying our complex fee schedule&lt;/li>
&lt;li>Rebalancing revenue sources&lt;/li>
&lt;/ul>
&lt;p>In order to review all aspects of our fees, we’ve planned five projects to look into specific aspects of our current fees that may need to change to achieve the goals above. This is an update on the research and discussions that have been underway with our &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership &amp;amp; Fees Committee&lt;/a> and our &lt;a href="https://www.crossref.org/board-and-governance/#officers">Board&lt;/a>, and what we’ve learned so far in each of these areas.&lt;/p>
&lt;h2 id="goal-1-more-equitable-fees">Goal 1: More equitable fees.&lt;/h2>
&lt;p>To ensure our fees going into the future are more equitable, we’re carrying out two parallel projects: evaluation of the lowest membership tier, and the review of the basis for deciding the membership tiers and distribution of membership across them.&lt;/p>
&lt;h3 id="project-1-evaluate-the-lowest-membership-tier-and-propose-a-more-equitable-pricing-structure">Project 1: Evaluate the lowest membership tier and propose a more equitable pricing structure.&lt;/h3>
&lt;p>All Crossref members pay an annual membership fee. These fees are tiered, and different members pay a different fee depending on the annual publishing revenue that their organisation receives (or publishing expenses if they don’t receive any publishing revenue).&lt;/p>
&lt;p>We entered into this project recognising that we have too many membership tiers and the definition we use to size members is not consistent and can be confusing (e.g. different basis for funders than other organisations, and both are different still from subscribers to our Metadata Plus service). The idea of the membership tiers was to use publishing revenue as a proxy for “ability to pay”. We really want to develop proposals for a more equitable pricing structure. However we don’t know enough about our members’ capacity to pay to be able to model an alternative approach.&lt;/p>
&lt;p>Our current lowest fee tier is $275 (USD) for any organisation with annual publishing revenue (or publishing expenses where the organisation doesn’t receive publishing revenue) of $0 to $1 million, and this is the tier where we focus our attention in our first project of the RCFS program. The difference between an organisation with revenue or expenses of USD 0, and an organisation with revenue or expenses of USD 1 million, is huge. Hardly any new members have joined in any other tier in the past several years. Of the 21,000 active members, more than 20,000 fall into the USD 275 tier - either directly (as an independent member) or indirectly (through a sponsor, where their fees would be lower). A fee structure that would fit better with the realities of our community might entail breaking our current $275 fee tier down into two or more more granular tiers.&lt;/p>
&lt;p>At the moment, the majority of Crossref’s revenues come from the bottom membership tiers; 65% of membership revenues come from organisations in the USD 275 tier. We also know that many of those members (86%) are paying more in membership dues than in content registration, whereas other members have the inverse relationship between annual dues and content registration. Overall, the members in the USD 275 tier contributed 34% of Crossref’s revenue last year, and the members in the &amp;gt;USD 50 million tier – contributed 29%.&lt;/p>
&lt;h3 id="members-survey">Members’ survey&lt;/h3>
&lt;p>Between April and May this year, we surveyed all independent members in the USD 275 tier. We asked questions about their operating size, how they’re funded, and how Crossref’s fees affect them. At the time of the survey, there were 8,027 members in this category. We received 1,054 responses; with a 13% response rate and broad representation globally, we are confident in the sample size. One-third of respondents said they were part of a larger organisation (such as a department or a library in a research institution).&lt;/p>
&lt;p>Chart 1: Organisation revenue or funding
The majority of respondents in this category (65%) have annual revenue or expense of less than USD 100,000; with 48% operating with less than USD 10,000.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/update-on-rcfs-2024-10-28/income.bik.png"
alt="Responses to the question about the income or funding levels in the members survey" width="50%">
&lt;/figure>
&lt;/div>
&lt;p>Chart 2: Sources of funding
When asked about the sources of funding (as an indicator of how stable these organisations might be and how readily accessible their funding is) the most frequent answer was public or government funding, and then article processing charges. If organisations relied on two sources of funding, the most common combination was public funding and article processing charges, and it was relatively rare for these organisations to have multiple sources of funding.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/update-on-rcfs-2024-10-28/sources.funding.top.20.png"
alt="Responses to the question about the main sources of income for independent members in the $275 tier" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>Chart 3: What percentage of expenses do you spend on Crossref fees?&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/update-on-rcfs-2024-10-28/feestocrossrefwithregions.png"
alt="Responses to the question about the proportion of overall expenses paid in Crossref fees by independent members in $275 tier" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>The majority (61%) of respondents spend less than 5% of their expenses on Crossref fees. However, we have also learnt that for some volunteer-run publications, Crossref fees might be some of the only expenses they incur. Interestingly, the percentage of expenses spent on Crossref is fairly consistently spread across the continents.&lt;/p>
&lt;h3 id="project-2-review-the-basis-and-distribution-of-membership-tiers">Project 2: Review the basis and distribution of membership tiers&lt;/h3>
&lt;p>This project examines options for how we define the capacity to pay, how members are distributed across tiers, and the right levels of member fees.&lt;/p>
&lt;p>There are currently a range of prices for our annual fees, based on an organisation&amp;rsquo;s ability to pay. We have used the metric of annual publishing expense or revenue as an indicator of that ability, but in some cases it doesn’t apply. As per our &lt;a href="https://www.crossref.org/operations-and-sustainability/#fee-principles">fee principles&lt;/a>, we have not differentiated between organisation types. Nonprofit and commercial entities pay the same price (caveat: research funders still have a separate fee schedule, but that was intended to be temporary).&lt;/p>
&lt;p>We conducted a review of other annual fee models to benchmark our approach against six like-minded organisations working in the context of scholarly communications and infrastructure. We looked at whether these organisations based their fees on one more more of the following:&lt;/p>
&lt;ul>
&lt;li>Volume: e.g., research output, # of journals&lt;/li>
&lt;li>Budget: e.g., total annual revenue or expenses&lt;/li>
&lt;li>Relevant budget: e.g. publishing revenue&lt;/li>
&lt;li>Organisation type: e.g. variance in fee based on publisher, institution, or funder&lt;/li>
&lt;li>Country-level economic data: e.g., discounting based on World Bank classification, discounting based on purchasing power calculation.&lt;/li>
&lt;/ul>
&lt;p>Chart 4: Annual fee schedules comparisons between Crossref and CORE, DOAJ, Dryad, OA Switch-board, OpenCitations and ORCID.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/update-on-rcfs-2024-10-28/annual-fee-schedules-comparisons.png"
alt="Annual fee schedules comparisons between Crossref and CORE, DOAJ, Dryad, OA Switch-board, OpenCitations and ORCID" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>There are three consistent themes among our peers: the total annual revenue and volume levels are the most common basis for membership fees among other organisations, and almost all offer discounted fees to accommodate country-based economic circumstances, utilising World Bank’s data (this is currently achieved at Crossref via the &lt;a href="https://www.crossref.org/gem/">GEM program&lt;/a>, which we have full intention of incorporating into our future fees whatever other decisions we might take). Only one other organisation uses publishing revenue or expenses as a basis for annual fees, while the potentially more transparent and less ambiguous data point of the total revenue factors in three other annual fee models.&lt;/p>
&lt;p>For subscribers to our &lt;a href="https://www.crossref.org/fees/#metadata-plus-subscriber-fees">Metadata Plus service&lt;/a>, the fee tier is selected based on whichever is the higher between their total annual revenue (including earned and fundraised, e.g. grants) or annual operating expenses (including staff and non-staff, e.g. occupancy, equipment, licences etc.). At present, we have limited understanding of the budgets of our members and how this may compare to their publishing revenues or expenses. We are looking to learn more about this as part of our annual membership data checking process, where we email all our members to ask them to confirm contact details for their organisation and the staff involved in managing their Crossref account. This year, we’re also asking all members about their organisation’s annual operating budget (or planned annual expenses) to help inform our discussions. In our case, the volume of outputs (in this case the number of items and associated metadata registered with Crossref) is recognised by the registration fees mechanism.&lt;/p>
&lt;h2 id="consulting-with-organisations-outside-crossref-membership">Consulting with organisations outside Crossref membership&lt;/h2>
&lt;p>To help us inform how our fees can be more equitable, it’s important to invite voices of organisations that may currently be unable to join us - due to fees or technical barriers. We hope that learning more about their circumstances will help us make sure that we improve accessibility of Crossref membership to all organisations that publish scholarly and professional works. We commissioned Accucoms to carry out a consultation on our behalf.&lt;/p>
&lt;p>So far, from a handful of interviews with publishers from Nigeria, DRC, Canada and USA, we’ve learnt that while virtually all offer open access to their publications, the majority has no publishing income, and where the income is derived via APCs it’s modest and only applicable in rare circumstances. Through institutional funding and/or grants, these organisations have modest operational budgets, yet our respondents lacked clarity over the particulars. In terms of participation in professional networks and international publishing organisations, only one of the organisations we interviewed participates in DOAJ, and another is a member of OASPA, in both cases their participation is free. Among the interviewees, two organisations were interested in Crossref membership in the past but encountered technical barriers to joining.&lt;/p>
&lt;p>With only five interviews to date, &lt;a href="https://survey.alchemer.com/s3/7879005/673ef7e88ae5" target="_blank">the consultation is still open&lt;/a> and we’re keen to hear from more organisations that are not Crossref members but have considered our membership at some point.&lt;/p>
&lt;h2 id="goal-2-simplify-complex-fees">Goal 2: Simplify complex fees&lt;/h2>
&lt;h3 id="projects-3--4-review-volume-and-back-year-discounts-for-content-registration">Projects 3 &amp;amp; 4: Review volume and back-year discounts for Content Registration&lt;/h3>
&lt;p>Along with our membership fees, our members also pay usage-based registration fees for records (scholarly works and grants) they register with us. Different content types render different costs for our members, and the fees are subject to discounts related to the age of publication and volume of registrations. Records for items older than two years have a lower fee associated with them, to help incentivise registration of such &amp;lsquo;back-year&amp;rsquo; materials with great gains for the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>. There are also discounts related to the volume of transactions – which again depend on the content types.&lt;/p>
&lt;p>These discounts are intended to encourage certain behaviours, specifically encouraging members to register older records in large quantities to better complete the scholarly record. Not all content types have back-year or volume discounts, and the rate of discount varies. This creates quite a complex system of fees. To the extent that the discount is successful in encouraging this behaviour, we want to preserve it, but in many cases these discounts see little to no activity.&lt;/p>
&lt;p>Following the discussions of the &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees Committee&lt;/a>, chaired by Vincas Grigas, Vilnius University, we are preparing to consult with the small number of members who currently receive volume discounts to discuss what the impact would be if we removed them.&lt;/p>
&lt;p>We plan to identify and preserve the well-used back-year discounts, which encourage registration of old content, such as books, journal articles, grants. However, there are types of discounts that are hardly ever used and we are considering removing these to simplify the fees. This work will focus on the technical implications of removing some of the underused back-year discounts from the billing code and consulting with members to understand any impact .&lt;/p>
&lt;h2 id="goal-3-rebalance-revenue-sources">Goal 3: Rebalance revenue sources&lt;/h2>
&lt;h3 id="project-5-reflect-increase-in-metadata-usage-and-perceived-shift-of-value-toward-metadata-distribution">Project 5: Reflect increase in metadata usage and perceived shift of value toward metadata distribution&lt;/h3>
&lt;p>All Crossref metadata is made freely and openly available to everyone. However, some organisations may be looking for a service level agreement in delivery of the metadata, plus more regular snapshots and priority service/rate limits. For those organisations, we have an optional Metadata Plus service.&lt;/p>
&lt;p>The final project is looking at the fees for this service. We are interested in making sure that Crossref metadata is available and used by the community where it can contribute to their objectives – related to discovery, analysis, integrity, and more. The optional paid service we offer aims to support the external tools that facilitate business and scholarly processes for the community. We are heartened to see that the appetite for the use of metadata seems to be growing, and the value of open research information is increasingly and widely recognised. We want to ensure that the users of metadata contribute proportionally to the maintenance of the records created and curated by our members.&lt;/p>
&lt;h2 id="conclusion">Conclusion&lt;/h2>
&lt;p>At this point, most projects generate a lot of questions and the work is underway to deliver answers related to capacity to pay, discounts as well as available metadata usage, and barriers faced by organisations in our community.&lt;/p>
&lt;p>What we have found so far is that two of our goals – simplification and equity – are often at odds with each other, and this is especially true with the $275 tier.&lt;/p>
&lt;p>We welcome comments, suggestions and questions.&lt;/p></description></item><item><title>Meet the candidates and vote in our 2024 Board elections</title><link>https://www.crossref.org/blog/2024-board-election/</link><pubDate>Tue, 24 Sep 2024 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2024-board-election/</guid><description>&lt;p>On behalf of the Nominating Committee, I’m pleased to share the slate of candidates for the 2024 board election.&lt;/p>
&lt;p>Each year we do an open call for board interest. This year, the &lt;a href="https://www.crossref.org/committees/nominating">Nominating Committee&lt;/a> received 53 submissions from members worldwide to fill four open board seats.&lt;/p>
&lt;p>We maintain a balanced board of 8 large member seats and 8 small member seats. Size is determined based on the organisation&amp;rsquo;s membership tier (small members fall in the $0-$1,650 tiers and large members in the $3,900 - $50,000 tiers). We have two large member seats and two small member seats open for election in 2024.&lt;/p>
&lt;p>We were pleased to see the diversity in candidates, with applicants from 24 countries. We also received three applications from research funders, which we specifically identified as a priority in the committee’s remit for this year. The committee was keen to prepare a diverse slate of organisation types, individual skills, and global representation.&lt;/p>
&lt;p>The Nominating Committee presents the following slate.&lt;/p>
&lt;h2 id="the-2024-slate">The 2024 slate&lt;/h2>
&lt;h3 id="tier-1-candidates-electing-two-seats">Tier 1 candidates (electing two seats):&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Katharina Rieck&lt;/strong>, Austrian Science Fund (FWF)&lt;/li>
&lt;li>&lt;strong>Lisa Schiff&lt;/strong>, California Digital Library&lt;/li>
&lt;li>&lt;strong>Ejaz Khan&lt;/strong>, Health Services Academy, Pakistan Journal of Public Health&lt;/li>
&lt;li>&lt;strong>Karthikeyan Ramalingam&lt;/strong>, MM Publishers&lt;/li>
&lt;/ul>
&lt;h3 id="tier-2-candidates-electing-two-seats">Tier 2 candidates (electing two seats):&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Aaron Wood&lt;/strong>, American Psychological Association&lt;/li>
&lt;li>&lt;strong>Dan Shanahan&lt;/strong>, PLOS&lt;/li>
&lt;li>&lt;strong>Amanda Ward&lt;/strong>, Taylor and Francis&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="please-read-the-candidates-statementsboard-and-governanceelections2024-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2024-slate/">Please read the candidates&amp;rsquo; statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="every-member-has-a-vote">Every member has a vote&lt;/h2>
&lt;p>If your organisation is a voting member in good standing as of September 11th, 2024, you are eligible to vote.&lt;/p>
&lt;p>The voting contact for your organisation will receive a ballot from eBallot, a third party election platform. You should receive your ballot by Wednesday, September 25th, and you will have until 15:00 UTC on October 29th to submit your ballot.&lt;/p>
&lt;p>The election results will be announced at &lt;a href="https://www.crossref.org/crossref-annual-meeting/">Crossref2024&lt;/a>, our anual online meeting on October 29th, 2024.&lt;/p>
&lt;p>If you have any questions about our election process, please &lt;a href="mailto:lofiesh@crossref.org">contact me&lt;/a>&lt;/p>
&lt;p>Happy voting!&lt;/p></description></item><item><title>The myth of perfect metadata matching</title><link>https://www.crossref.org/blog/the-myth-of-perfect-metadata-matching/</link><pubDate>Wed, 28 Aug 2024 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/the-myth-of-perfect-metadata-matching/</guid><description>&lt;p>In our previous instalments of the blog series about matching (see &lt;a href="https://doi.org/10.13003/aewi1cai" target="_blank">part 1&lt;/a> and &lt;a href="https://doi.org/10.13003/zie7reeg" target="_blank">part 2&lt;/a>), we explained what metadata matching is, why it is important and described its basic terminology. In this entry, we will discuss a few common beliefs about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders. Spoiler alert: we are calling them myths because these beliefs are not true! Read on to learn why.&lt;/p>
&lt;p>If you have stuck with us this far in our series, hopefully, you are at least a bit excited about the possibility of creating new relationships between the works, authors, institutions, preprints, datasets, and myriad other objects in our existing scholarly metadata. Who would not want all of these to be better connected?&lt;/p>
&lt;p>We have to pause for a moment and be honest with you: metadata matching is a complex problem, and doing it correctly requires significant effort. What is worse, even if we do everything right, our matching won&amp;rsquo;t be perfect. This may be counterintuitive. Perhaps you&amp;rsquo;ve heard that matching is not a hard problem, or have encountered people surprised that a matching strategy returned a wrong or incomplete answer. Sometimes, it is obvious to a person from looking at some specific example that a match should (or should not) have been made, so they naturally assume that a change to account for this has to be simple.&lt;/p>
&lt;p>Misconceptions like these can be problematic. They create confusion around matching, drive users&amp;rsquo; expectations to unreasonable levels, and make people drastically underestimate the effort needed to build and integrate matching strategies. So let&amp;rsquo;s dive right in and debunk a few common myths about metadata matching.&lt;/p>
&lt;h2 id="myth-1-a-metadata-matching-strategy-should-be-100-correct">Myth #1: A metadata matching strategy should be 100% correct&lt;/h2>
&lt;p>Anyone who has built or supported a matching strategy has likely encountered the following belief: it is possible to develop a perfect strategy, meaning one that always returns the correct results, no matter the inputs. The unfortunate truth is that while one&amp;rsquo;s aim should always be to design matching strategies that return correct results, once we move beyond the simplest class of problems or artificially clean data, no strategy can achieve this outcome. In thinking through why this is the case, some inherent constraints become obvious:&lt;/p>
&lt;p>The inputs to matching are often strings in human-readable formats, which can vary wildly in their structure, order and completeness. Since they&amp;rsquo;re intended to be parsed by people, instead of machines, they&amp;rsquo;re inherently lossy and frequently unstructured, anticipating that a person can infer from the source context what is being referenced. Matching strategies, although built to make sense of unstructured data, unfortunately, don&amp;rsquo;t have the luxury of this flexibility. A strategy has to account for translating a messy, partial, or inconsistent input into a correct and structured match.&lt;/p>
&lt;p>Consider, for example, the following inputs to an affiliation matching strategy:&lt;/p>
&lt;ol>
&lt;li>&amp;ldquo;Department of Radiology, St. Mary&amp;rsquo;s Hospital, London W2 1NY, UK&amp;rdquo;&lt;/li>
&lt;li>&amp;ldquo;Saint Mary&amp;rsquo;s Hospital, Manchester University NHS Foundation Trust&amp;rdquo;&lt;/li>
&lt;li>&amp;ldquo;St. Mary&amp;rsquo;s Medical Center, San Francisco, CA&amp;rdquo;&lt;/li>
&lt;li>&amp;ldquo;St Mary&amp;rsquo;s Hosp., Dublin&amp;rdquo;&lt;/li>
&lt;li>&amp;ldquo;St Mary&amp;rsquo;s Hospital Imperial College Healthcare NHS Trust&amp;rdquo;&lt;/li>
&lt;li>&amp;ldquo;聖マリア病院&amp;rdquo;&lt;/li>
&lt;/ol>
&lt;p>In order to correctly identify the organisations mentioned here, the matching strategy must be able to distinguish between different ways of representing the same institution, disambiguate multiple institutions that have similar names, and handle variant forms for the parts of each name (Saint/St./St), identify the same name in different languages (&amp;ldquo;聖マリア病院&amp;rdquo; is Japanese for &amp;ldquo;St. Mary&amp;rsquo;s Hospital&amp;rdquo;), and make assumptions about partial or ambiguous locations translating to more precise references. While a person reviewing each of these strings might be able to accomplish these tasks, even here there are some challenges. Does &amp;ldquo;St Mary&amp;rsquo;s Hosp., Dublin&amp;rdquo; refer to the hospital in Ireland or a separate hospital in one of the many cities that share this name? Should we presume that because &amp;ldquo;聖マリア病院&amp;rdquo; is in Japanese, this refers to a hospital in Japan? Would someone, by default, be aware that St. Mary&amp;rsquo;s Hospital in London is part of the Imperial College Healthcare NHS Trust, such that inputs one and five refer to the same organisation?&lt;/p>
&lt;p>An additional challenge lies in the quality of the data, which in the context of matching, encompasses both the input and the dataset being matched against. In real world circumstances, no dataset is fully accurate, complete, or current and certainly not all three. As a result, there will always be functionally random differences between inputs to the strategy and the entities to be matched. A theoretically perfect matching strategy would thus need to distinguish between inconsequential discrepancies resulting from gaps, errors, and variable forms of reference and actual, meaningful differences indicating an incorrect match. As one might imagine, this would require near total knowledge of the meaning and context for all inputs and outputs, a nigh-on impossible task for any person or system!&lt;/p>
&lt;p>As a consequence, no metadata matching strategy will ever be perfect. It is unreasonable for us to expect them to be. This does not mean, of course, that all strategies are equally flawed or destined to forever return middling results. Some are better than others and we can improve them over time. Which brings us to the next myth:&lt;/p>
&lt;h2 id="myth-2-it-is-always-a-good-idea-to-adapt-the-matching-strategy-to-a-specific-input">Myth #2: It is always a good idea to adapt the matching strategy to a specific input&lt;/h2>
&lt;p>Matching strategies are not static. They can - and should - be improved. There is, however, a deceptive trap that one can fall into when attempting to improve a matching strategy. Whenever we encounter an incorrect or missing result for a specific input, we treat this problem like a software bug and try to adapt the strategy to work better for it, without considering all other cases.&lt;/p>
&lt;p>The more complicated reality is that the quality of matching results is controlled through a complex set of trade-offs between &lt;a href="https://en.wikipedia.org/wiki/Precision_and_recall" target="_blank">precision and recall&lt;/a> that determine the kind and number of relationships created between items:&lt;/p>
&lt;ul>
&lt;li>Precision is calculated as the number of correctly matched relationships resulting from a strategy, divided by the total number of matched relationships. It can also be interpreted as the probability that a match is correct. Low precision indicates a high rate of false positives, which are incorrect relationships created by the strategy.&lt;/li>
&lt;li>Recall is calculated as the number of correctly matched relationships resulting from a strategy, divided by the number of true (expected) relationships. It can also be interpreted as the probability that a true (correct) relationship will be created by the strategy. Low recall means a high rate of false negatives, which are relationships that should have been created by the strategy but were not made.&lt;/li>
&lt;/ul>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/false-positives-negatives.png"
alt="False positives and false negatives" width="75%">&lt;figcaption>
&lt;p>The diagram depicts false negatives and false positives. The ideal outcome would be that the ellipses are identical, matched relationships are exactly the same as true relationships, and there are no false negatives or false positives. In practice, we try to make the intersection as big as possible.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>The tradeoff between precision and recall roughly means that modifying the strategy to improve recall will decrease precision, and vice versa.&lt;/p>
&lt;p>Imagine, for example, we received a report about a relationship that was missed by matching because of a partial, noisy, or ambiguous input. We might be tempted to resolve this issue by relaxing our matching criteria. Unfortunately, this will have a cost of a higher overall rate of false positive matches.&lt;/p>
&lt;p>Conversely, if we encounter a case where the matching has returned an incorrect match, we might attempt to make the matching strategy stricter to avoid this result. We should remember, however, that this may have the consequence of causing the strategy to skip many perfectly valid matches.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/precision-recall-tradeoff.png"
alt="The tradeoff between precision and recall" width="50%">&lt;figcaption>
&lt;p>The tradeoff between precision and recall. (a) A strict strategy prioritises precision over recall resulting in more false negatives. (b) A relaxed strategy prioritises recall over precision resulting in more false positives.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>Striking this balance becomes even more difficult when attempting to address multiple issues at once, or considering constraints like the time and resources consumed by each aspect of the strategy. Each choice can compound the individual effects in unanticipated and expensive ways. The aim of matching ultimately then can&amp;rsquo;t be to achieve perfect results for every single case. Fixing one particular situation might not be desirable, as it can result in breaking multiple other cases. Instead, we have to find a locally optimal balance that optimises the strategy&amp;rsquo;s utility, relative to these inherent limitations. This means accepting some level of imperfection as not just inevitable, but necessary for implementing a workable strategy. When you consider all this, you might conclude that…&lt;/p>
&lt;h2 id="myth-3-we-shouldnt-do-large-scale-unsupervised-matching">Myth #3: We shouldn&amp;rsquo;t do large-scale, unsupervised matching&lt;/h2>
&lt;p>Imperfect matching strategies, when applied automatically to real-world large datasets, might:&lt;/p>
&lt;ul>
&lt;li>Fail to discover some relationships (false negatives), an outcome that may not be terribly problematic. In the worst case scenario, we have wasted a great deal of effort developing matching strategies that do not improve our metadata.&lt;/li>
&lt;li>Create incorrect relationships between items (false positives), what seems like a potentially larger problem, where we have added incorrect relationships to the metadata.&lt;/li>
&lt;/ul>
&lt;p>Many have the instinct to avoid false positives at any cost, even if this means missing many additional correct relationships at the same time. They might come to the conclusion that if we cannot have 100% precision (see our previous myth), we simply should not allow matching strategies to act in an automated, unsupervised way on large datasets. While there might be circumstances where this belief is rational, in the context of the scholarly record, this notion is seriously flawed.&lt;/p>
&lt;p>First, if you are dealing with any medium to large-sized dataset, it almost certainly contains errors, even before you apply any automated processing to it. Even if data is submitted and curated by users, they can still make mistakes, and might themselves be using automated tools for extracting the data from other sources, without your knowledge. It is thus not entirely obvious that applying an (imperfect) matching strategy to create more relationships would actually make the data quality worse.&lt;/p>
&lt;p>Second, while we cannot eliminate all matching errors, we can place a high priority on precision when developing strategies, with the aim of keeping the number of incorrectly matched results as low as possible. We can also make use of additional mechanisms to easily correct for incorrectly matched results, for example doing so manually, in response to error reports.&lt;/p>
&lt;p>Finally, the results of matching should always contain provenance information to distinguish them from those that have been manually curated. This way, the users can make their own decisions about whether to use and trust the matching results, relative to their use case.&lt;/p>
&lt;p>By applying those additional checks, we can minimise the negative effects of incorrect matching, while at the same time reap the benefits of filling gaps in the scholarly record.&lt;/p>
&lt;h2 id="myth-4-we-can-only-ever-guess-at-the-accuracy-of-our-matching-results">Myth #4: We can only ever guess at the accuracy of our matching results&lt;/h2>
&lt;p>In attempting to determine the correctness of our matching, we immediately encounter a number of inherent limitations. The sheer amount of entries in many datasets prevents a thorough, manual validation of the results, but if instead, we use too few or specific items as our benchmarks, these are unlikely to be representative of overall performance. The unpredictable nature of future data adds another wrinkle: will our matching always be as successful as when we first benchmarked it or will its performance degrade relative to some change in the data?&lt;/p>
&lt;p>With so many unknowns, are we then doomed? No! We have rigorous and scientific tools at our disposal that can help us estimate how accurate our matching will be. How do we use them? Well, that is a big and fairly technical topic, so we will leave you with this little cliffhanger. See you in the next post!&lt;/p></description></item><item><title>Re-introducing Participation Reports to encourage best practices in open metadata</title><link>https://www.crossref.org/blog/re-introducing-participation-reports-to-encourage-best-practices-in-open-metadata/</link><pubDate>Thu, 25 Jul 2024 00:00:00 +0000</pubDate><author>Lena Stoll</author><guid>https://www.crossref.org/blog/re-introducing-participation-reports-to-encourage-best-practices-in-open-metadata/</guid><description>&lt;p>We’ve just released an update to our &lt;a href="http://crossref.org/members/prep" target="_blank">participation report&lt;/a>, which provides a view for our members into how they are each working towards best practices in open metadata. Prompted by some of the signatories and organizers of the &lt;a href="https://barcelona-declaration.org/" target="_blank">Barcelona Declaration&lt;/a>, which Crossref supports, and with the help of our friends at &lt;a href="https://www.cwts.nl/" target="_blank">CWTS Leiden&lt;/a>, we have fast-tracked the work to include an updated set of metadata best practices in participation reports for our members. The reports now give a more complete picture of each member’s activity.&lt;/p>
&lt;h2 id="what-do-we-mean-by-participation">What do we mean by ‘participation’?&lt;/h2>
&lt;p>Crossref runs open infrastructure to link research objects, entities, and actions, creating a lasting and reusable scholarly record. As a not-for-profit with over 20,000 members in 160 countries, we drive metadata exchange and support nearly 2 billion monthly API queries, facilitating global research communication.&lt;/p>
&lt;p>To make this system work, members strive to provide as much metadata as possible through Crossref to ensure it is openly distributed throughout the scholarly ecosystem at scale rather than bilaterally, thereby realizing the collective benefit of membership. Together, our membership provides and uses a rich nexus of information— known as &lt;a href="https://www.crossref.org/documentation/research-nexus">the research nexus&lt;/a>—on which the community can build tools to help progress knowledge.&lt;/p>
&lt;p>Each member commits to certain terms, such as keeping metadata current, updating links for their DOIs to redirect to, linking references and other objects, and preserving their content in perpetuity. Beyond this, we also encourage members to register as much rich metadata as is relevant and possible.&lt;/p>
&lt;p>Creating and providing richer metadata is a key part of participation in Crossref; we’ve long encouraged a more complete scholarly record, such as through &lt;a href="https://metadata2020.org" target="_blank">Metadata 20/20&lt;/a>, and through supporting or leading initiatives for specific metadata, like open citations (I4OC), open abstracts (I4OA), open contributors (&lt;a href="https://orcid.org" target="_blank">ORCID&lt;/a>), and open affiliations (&lt;a href="https://ror.org" target="_blank">ROR&lt;/a>).&lt;/p>
&lt;h2 id="which-metadata-elements-are-considered-best-practices">Which metadata elements are considered best practices?&lt;/h2>
&lt;p>Alongside basic bibliographic metadata such as title, authors, and publication date(s), we encourage members to register metadata in the following fields:&lt;/p>
&lt;div class="shortcode-divwrap ">
&lt;span>&lt;figure>&lt;a href="https://crossref.org/members/prep/5401">&lt;img src="https://www.crossref.org/images/blog/prep-2024.png"
alt="screenshot of Crossref participation report for member University of Szeged showing the 11 best practice metadata fields" width="70%">&lt;/a>&lt;figcaption>
&lt;p>Example participation report for Crossref member University of Szeged&lt;/p>
&lt;/figcaption>
&lt;/figure>&lt;/span>
&lt;/div>
&lt;h4 id="references">References&lt;/h4>
&lt;p>A list of all the references used by a work. This is particularly relevant for journal articles but the references can include any type of object, including datasets, versions, preprints, and more. Additionally, we encourage these to be added into &lt;a href="https://www.crossref.org/documentation/principles-practices/best-practices/relationships/">relationships&lt;/a>, where relevant.&lt;/p>
&lt;h4 id="abstracts">Abstracts&lt;/h4>
&lt;p>A description of the work. These are particularly useful for discovery systems that will promote the work, and are often used in downstream analyses such as for detecting integrity issues.&lt;/p>
&lt;h4 id="contributor-ids-orcid">Contributor IDs (ORCID)&lt;/h4>
&lt;p>All authors should be included in a work’s metadata, ideally alongside their verified ORCID identifier.&lt;/p>
&lt;h4 id="affiliations--affiliation-ids-ror">Affiliations / Affiliation IDs (ROR)&lt;/h4>
&lt;p>Members are able to register contributor affiliations as free text, but we are encouraging everyone to add ROR IDs for affiliations as the recommended best practice, as this differentiates and avoids mistyping. These two fields have newly been added to the participation reports interface in the most recent update.&lt;/p>
&lt;h4 id="funder-ids-ofr">Funder IDs (OFR)&lt;/h4>
&lt;p>Acknowledging the organisation(s) that funded the work. We encourage the inclusion of &lt;a href="https://www.crossref.org/documentation/funder-registry/">Open Funder Registry&lt;/a> identifiers to make the funding metadata more usable. This will evolve into &lt;a href="https://doi.org/10.64000/v3429-p7810" target="_blank">an additional use case for ROR&lt;/a> over time.&lt;/p>
&lt;h4 id="funding-award-numbers--grant-ids-crossref">Funding award numbers / Grant IDs (Crossref)&lt;/h4>
&lt;p>A number or identifier assigned by the funding organisation to identify the specific award of funding or other support such as use of equipment or facilities, prizes, tuition, etc. The &lt;a href="https://www.crossref.org/services/grant-linking-system/">Crossref Grant Linking System&lt;/a> includes a unique persistent link that can be connected with outputs, activities, people, and organisations.&lt;/p>
&lt;h4 id="crossmark">Crossmark&lt;/h4>
&lt;p>The &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark service&lt;/a> gives readers quick and easy access to the current status of a record, including any corrections, retractions, or updates, via a button embedded on PDFs or a web article. Openly adding corrections, retractions, and errata is critical part of publishing, and the button provides readers with an easy in-context alert.&lt;/p>
&lt;h4 id="similarity-check-urls">Similarity Check URLs&lt;/h4>
&lt;p>The &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check service&lt;/a> helps editors to identify text-based plagiarism through our collective agreement for the membership to access to Turnitin’s powerful text comparison tool, iThenticate. Specific full-text links are required to participate in this service.&lt;/p>
&lt;h4 id="license-urls">License URLs&lt;/h4>
&lt;p>URLs pointing to a license that explains the terms and conditions under which readers can access content. These links are crucial to denote intended downstream use.&lt;/p>
&lt;h4 id="text-mining-urls">Text mining URLs&lt;/h4>
&lt;p>Full-text URLs that help researchers in meta-science easily locate your content for text and data mining.&lt;/p>
&lt;h2 id="what-is-a-participation-report">What is a participation report?&lt;/h2>
&lt;p>Participation reports are are a visualization of the data representing members’ participation to the scholarly record which is available via our open REST API. There’s a separate participation report for each member, and each report shows what percentage of that member’s metadata records include 11 key metadata elements. These key elements add context and richness, and help to open up members’ work to easier discovery and wider and more varied use. As a member, you can use participation reports to see for yourself where the gaps in your organisation’s metadata are, and perhaps compare your performance to others. Participation reports are free and open to everyone - so you can also check the report for any other members you are interested in.&lt;/p>
&lt;p>We first &lt;a href="https://doi.org/10.64000/h00dz-jw569" target="_blank">introduced&lt;/a> participation reports in 2018. At the time, Anna Tolwinska and Kirsty Meddings wrote:&lt;/p>
&lt;blockquote>
&lt;p>Metadata is at the heart of all our services. With a growing range of members participating in our community—often compiling or depositing metadata on behalf of each other—the need to educate and express obligations and best practice has increased. In addition, we’ve seen more and more researchers and tools making use of our APIs to harvest, analyze and re-purpose the metadata our members register, so we’ve been very aware of the need to be more explicit about what this metadata enables, why, how, and for whom.&lt;/p>
&lt;/blockquote>
&lt;p>All of that still rings true today. But as the research nexus continues to evolve, so should the tools that intend to reflect it. For example, in 2022, we removed the &lt;em>Open references&lt;/em> field from participation reports after &lt;a href="https://doi.org/10.64000/b7a98-vbz07" target="_blank">a board vote to change our policy and update the membership terms&lt;/a> meant that &lt;em>all&lt;/em> references deposited with Crossref would be open by default. And now we’ve expanded the list of fields again, adding coverage data for contributor affiliation text and ROR identifiers.&lt;/p>
&lt;h2 id="putting-it-in-practice">Putting it in practice&lt;/h2>
&lt;p>To find out how you measure up when it comes to participation, type the name of your member organisation into the &lt;a href="https://www.crossref.org/members/prep/" target="_blank">search box&lt;/a>. You may be surprised by what you find—we often speak to members who thought they were registering a certain type of metadata for all their records, only to learn from their participation report that something is getting lost along the way.&lt;/p>
&lt;p>You can only &lt;a href="https://www.crossref.org/documentation/register-maintain-records/maintaining-your-metadata/updating-your-metadata/">address&lt;/a> gaps in your metadata if you know that they exist.&lt;/p>
&lt;p>More information, as well as a breakdown of the now 11 key metadata elements listed in every participation report and tips on improving your scores, is available in our &lt;a href="https://www.crossref.org/documentation/reports/participation-reports/">documentation&lt;/a>.&lt;/p>
&lt;p>And if you have any questions or feedback, come talk to us on the &lt;a href="https://community.crossref.org/" target="_blank">community forum&lt;/a> or request a metadata Health Check by emailing the &lt;a href="mailto:feedback@crossref.org?subject=Participation%20reports%20and%20metadata%20health%20checks">community team&lt;/a>.&lt;/p></description></item><item><title>Metadata schema development plans</title><link>https://www.crossref.org/blog/metadata-schema-development-plans/</link><pubDate>Mon, 22 Jul 2024 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/metadata-schema-development-plans/</guid><description>&lt;h2 id="its-been-a-while-heres-a-metadata-update-and-request-for-feedback">It’s been a while, here’s a metadata update and request for feedback&lt;/h2>
&lt;p>In Spring 2023 we sent out a &lt;a href="https://community.crossref.org/t/take-our-metadata-priorities-survey-by-may-18/3498" target="_blank">survey&lt;/a> to our community with a goal of assessing what our priorities for metadata development should be - what projects are our community ready to support? Where is the greatest need? What are the roadblocks?&lt;/p>
&lt;p>The intention was to help prioritize our metadata development work. There’s a lot we want to do, a lot our community needs from us, but we really want to make sure we’re focusing on the projects that will have the most immediate impact for now.&lt;/p>
&lt;p>Several projects were proposed, based on community demand over time. All are projects we intend to support long-term.&lt;/p>
&lt;h2 id="projects">Projects&lt;/h2>
&lt;p>The projects included in the survey were:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Alternate names&lt;/strong> - We proposed adding a repeatable ‘name’ element to allow for names that aren’t separated by given/family/surname.&lt;/li>
&lt;li>&lt;strong>Updates to funding data&lt;/strong> -&lt;a href="https://doi.org/10.64000/6tzsa-7dj24" target="_blank">this update&lt;/a> will be released in the near future and includes:
&lt;ul>
&lt;li>Expand ROR support - Allow members to supply ROR ID instead of funder ID in funding data and grant records.&lt;/li>
&lt;li>Include Grant DOIs in funding metadata.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;strong>Publication typing in citations&lt;/strong> - Support citation type in citation metadata (for example article, preprint, data, software, etc.).&lt;/li>
&lt;li>&lt;strong>Expand contributor role support&lt;/strong> - Allow multiple contributor roles to be provided per contributor and add support for external vocabularies (like CRediT)&lt;/li>
&lt;li>&lt;strong>Expand abstract support&lt;/strong> - We currently require all abstracts to be formatted using JATS. We will be adding new abstract formats, including BITS and ONIX (which have been requested), as well as a generic abstract format (non-JATS).&lt;/li>
&lt;li>&lt;strong>Statements&lt;/strong> - Add support for free text statements such as data availability, acknowledgments, funding, and conflict of interest.&lt;/li>
&lt;li>&lt;strong>Contributor identifiers&lt;/strong> - Accept contributor identifiers such as ISNI (in addition to ORCID, which is already supported).&lt;/li>
&lt;li>&lt;strong>Conference event IDs&lt;/strong> - Identifiers for &lt;a href="https://doi.org/10.64000/skv7b-cef25" target="_blank">conference events&lt;/a>.&lt;/li>
&lt;/ul>
&lt;h2 id="whats-next">What’s next?&lt;/h2>
&lt;p>There is a clear preference for publication types in citations and abstract markup, expanded support for multilingual metadata, followed by expanding contributor roles to support multiple roles and the CRediT taxonomy. The results have helped us prioritize our work and we’re advancing several projects soon based on our readiness to move forward.&lt;/p>
&lt;p>First up is &lt;strong>publication typing in citations and statements&lt;/strong> - we hope to be able to make this ready for registration in the coming months, but want to confirm a few things first, primarily the list of ‘types’ to apply to citations, so please review and comment: &lt;a href="https://docs.google.com/document/u/0/d/1VPXhTPMZzfvAPmTOlNp-bZf9cTLkw0dPZFTuDtDIPls/edit" target="_blank">Metadata updates in need of feedback July 2024&lt;/a>&lt;/p>
&lt;p>We also have been discussing expansions to our &lt;strong>support for preprints metadata&lt;/strong> with our &lt;a href="https://www.crossref.org/working-groups/preprints/" target="_blank">Preprints Advisory Group&lt;/a> and have a number of preprint-specific updates that will be rolled out in the coming months as well, including support for versions and status. These proposed changes are also available for comment.&lt;/p>
&lt;p>And finally, we will be expanding &lt;strong>support for contributor roles&lt;/strong> to include multiple roles per contributor, as well as adding support for the &lt;a href="https://credit.niso.org/" target="_blank">CRediT&lt;/a> taxonomy. This update is yet to be scheduled but we do have the inputs and output planning done and welcome any comments on this as well.&lt;/p>
&lt;p>We will also be continuing work on other projects highlighted in the survey that aren’t quite ready to go:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>Multilingual metadata&lt;/strong>: Support for multilingual metadata in particular is very important and will require a fairly significant technical effort, so we want to be sure we get this right - at minimum we need to include repeatable fields flagged with language metadata for most items, there may be other considerations as well such as the scope of languages supported.&lt;/p>
&lt;p>As we develop new metadata segments we’re keeping language metadata in mind, but I’d like to form a short-term working group to help shape this update - this group will be focused on the details of supporting multilingual metadata in our inputs and outputs, so conversations will be very XML and JSON heavy. If you are interested and available please contact &lt;a href="mailto:pfeeney@crossref.org">pfeeney@crossref.org&lt;/a>. &lt;p>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Abstract markup:&lt;/strong> we are currently in the research phase of this project but will be proposing updates and asking for input this fall. At the moment support for BITS and ONIX abstracts have been requested, as well as an agnostic format.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Expansion of name and contributor ID support&lt;/strong>: work is under way for this as well, and I should have inputs and outputs for feedback in the coming months.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>We anticipate more developments and requests for feedback in the future as we still have other projects from the list above to get to. I’ve opened up a ‘&lt;a href="https://community.crossref.org/c/metadata-development/15" target="_blank">Metadata Development&lt;/a>’ section in our Community Forum to invite discussion and will be kicking off a renewed Metadata Interest Group in the fall.&lt;/p></description></item><item><title>Crossmark community consultation: What did we learn?</title><link>https://www.crossref.org/blog/crossmark-community-consultation-what-did-we-learn/</link><pubDate>Tue, 02 Jul 2024 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/crossmark-community-consultation-what-did-we-learn/</guid><description>&lt;p>In the first half of this year we’ve been talking to our community about post-publication changes and Crossmark. When a piece of research is published it isn’t the end of the journey—it is read, reused, and sometimes modified. That&amp;rsquo;s why we run Crossmark, as a way to provide notifications of important changes to research made after publication. Readers can see if the research they are looking at has updates by clicking the Crossmark logo. They also see useful information about the editorial process, and links to things like funding and registered clinical trials. All of this contributes to what we call the &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">integrity of the scholarly record&lt;/a>.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/Crossmark-popup-example.png"
alt="The Crossmark popup provides information about whether a work is current and other metadata about integrity of the scholarly record." width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Crossmark has been around a long time and the context around it is constantly changing. It last had a major update in &lt;a href="https://doi.org/10.64000/cfdpk-ke436" target="_blank">2016&lt;/a> and in &lt;a href="https://doi.org/10.64000/h2vh2-35t60" target="_blank">2020&lt;/a> we removed fees for its use.&lt;/p>
&lt;p>The past few years have seen a more intense focus on research integrity among the scholarly communications community, leading to more retractions and calling out large-scale manipulation of editorial processes. At the same time, we haven’t seen an increase in the uptake of Crossmark, which is still used by only a minority of our members. We would like to know why the uptake is low and whether there is more we can do in this area. To dig into this, in the first part of 2024 we reached out to members of our community.&lt;/p>
&lt;h2 id="what-did-we-do">What did we do?&lt;/h2>
&lt;p>We wanted to learn about attitudes towards Crossmark and related aspects of research integrity. This was done in several ways:&lt;/p>
&lt;ul>
&lt;li>Structured interviews with eight of our members.&lt;/li>
&lt;li>Round tables at Crossref LIVE events in Bogota and Nairobi&lt;/li>
&lt;li>Surveying a selection of our members, which led to 94 responses.&lt;/li>
&lt;/ul>
&lt;p>The topics we asked about were related to how post-publication updates are made and communicated, and which metadata demonstrates good practice.&lt;/p>
&lt;p>We are extremely grateful to the members who contributed. They provided valuable feedback and have helped to shape the future of Crossmark and our approach to the integrity of the scholarly record.&lt;/p>
&lt;h2 id="what-did-we-find">What did we find?&lt;/h2>
&lt;p>Across the various groups there were a few common themes, which fell into several areas.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Communication of updates is highly valued, and seen as the most important role that Crossmark can play. Some of those we spoke to would like readers to see if there is an update as soon as a page opens, without having to open a popup. This could be done by having a logo that changes colour, shape, or size.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Conversely, not as much enthusiasm was shown for the metadata assertions. These are additional fields that can be displayed to readers in the Crossmark popup. There wasn’t a strong consensus on which commonly-made assertions are the most important for research integrity.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>There is diversity in attitudes towards making updates to published works, what research integrity means, and approaches to workflows for updates. Even within a single organisation, a number of different workflows and multiple staff members might be called on to update published research. This makes things complex and means that it can be difficult to fit Crossmark in.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>There are technical challenges to getting started with Crossmark. Those responsible for implementing Crossmark are often technical staff who struggle with the documentation we provide in English. There is also no plugin for OJS, a widely-used open source editorial software. It is more difficult to deposit Crossmark metadata for books than journal articles, and many article types don’t permit Crossmark metadata at all. On the other hand, those who successfully installed Crossmark found it easy to use and low-maintenance.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>Overall, it seems that Crossmark still has an important role to play but there are changes and improvements we can make.&lt;/p>
&lt;h2 id="whats-next">What’s next?&lt;/h2>
&lt;p>Here are the main areas we intend to follow up on in the coming months.&lt;/p>
&lt;h3 id="implementation">Implementation&lt;/h3>
&lt;p>We need to look at how to make implementation more straight-forward. Can we provide multilingual documentation, plugins, run workshops or webinars, or make changes to Crossmark to lower the barrier to entry?&lt;/p>
&lt;h3 id="understanding-workflows">Understanding workflows&lt;/h3>
&lt;p>Can we collaborate with our members and other organisations to reach a better understanding of how to update published works? Are there alternative workflows we need to support? Have we made it too difficult to understand and implement the options we currently have?&lt;/p>
&lt;p>While updates are always likely to be rare, we want to help members understand the benefits of making them. We talked to some members who were proud of never having published a retraction or correction, which left us wondering whether they are missing legitimate opportunities to correct the scholarly record. We also know that for some members and many work types (preprints, for example), updates are made without a separate published notification. Can we better understand the role that the published updates play and communicate updates even if there isn’t a published notice?&lt;/p>
&lt;h3 id="ongoing-feedback">Ongoing feedback&lt;/h3>
&lt;p>Clearly one size doesn’t fit all when it comes to implementing and communicating updates. We need to find ways of keeping in touch with the community to test new solutions with as broad a range of members as possible. We want to avoid catering to a minority and leaving others struggling to find ways to implement a solution.&lt;/p>
&lt;h3 id="custom-metadata">Custom metadata?&lt;/h3>
&lt;p>Is there an ongoing need for metadata assertions? Many of the assertions currently made are possible as standard metadata and others could be included in our deposit schema. We want to consider removing the option to add assertions. This needs more feedback from the community, especially those who currently make use of assertions.&lt;/p>
&lt;h3 id="redesign-the-ui">Redesign the UI&lt;/h3>
&lt;p>Crossmark doesn’t have the recognition with readers we would like. Is there a way we can redesign it to make it more associated with Crossref and accurate metadata? We intend to explore different designs, and test them with members and readers.&lt;/p></description></item><item><title>Celebrating five years of Grant IDs: where are we with the Crossref Grant Linking System?</title><link>https://www.crossref.org/blog/celebrating-five-years-of-grant-ids-where-are-we-with-the-crossref-grant-linking-system/</link><pubDate>Mon, 01 Jul 2024 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/celebrating-five-years-of-grant-ids-where-are-we-with-the-crossref-grant-linking-system/</guid><description>&lt;p>We’re happy to note that this month, we are marking five years since Crossref launched its Grant Linking System. The Grant Linking System (GLS) started life as a joint community effort to create ‘grant identifiers’ and support the needs of funders in the scholarly communications infrastructure.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;img src="https://www.crossref.org/images/community-images/gls/gls-logo-stacked.png" alt="Crossref Grant Linking System logo" width="100%" >&lt;/span>
&lt;/div>
The system includes a funder-designed metadata schema and a unique link for each award which enables connections with millions of research outputs, better reporting on the research and outcomes of funding, and a contribution to open science infrastructure. Our first activity to highlight the moment was to host a community call last week where around 30 existing and potential funder members joined to discuss the benefits and the steps to take to participate in the Grant Linking System (GLS).
&lt;p>Some organisations at the forefront of adopting Crossref’s Grant Linking System presented their challenges and how they overcame them, shared the benefits they are reaping from participating, and provided some tips about their processes and workflows.&lt;/p>
&lt;p>The funding organisations whose experiences were shared included &lt;a href="https://wellcome.org/" target="_blank">Wellcome&lt;/a>, &lt;a href="https://www.fct.pt/en/" target="_blank">FCT (Foundation for Science and Technology, Portugal)&lt;/a>, and &lt;a href="https://www.nwo.nl/en" target="_blank">NWO (Dutch Research Council)&lt;/a>. They were joined by a new group of foundations, research councils, and private research funders from around the world&amp;mdash;from Kenya to Singapore to Estonia&amp;mdash;to have a first introduction to the GLS and connect them with colleagues who are further along on their journey.&lt;/p>
&lt;p>We also heard about tools such as a new &lt;a href="https://github.com/oaworks/create-grant-doi-in-fluxx" target="_blank">open source Crossref plugin&lt;/a> for the Fluxx platform, grant management systems with in-built Crossref integrations such as &lt;a href="https://proposalcentral.com/" target="_blank">ProposalCentral&lt;/a>, &lt;a href="https://europepmc.org/grantfinder" target="_blank">Europe PMC GrantFinder&lt;/a> which was first to implement the GLS on Wellcome’s behalf and hosts their grants, and one of the first publishers, &lt;a href="https://elifesciences.org/" target="_blank">eLife&lt;/a> to start referencing Crossref grant links in their publications both online and in the open metadata for others to retrieve.&lt;/p>
&lt;p>Read on for further information or watch &lt;a href="https://www.youtube.com/watch?v=LuM2eMOTmN8" target="_blank">the recording of the event&lt;/a>.&lt;/p>
&lt;iframe width="560" height="315" src="https://www.youtube.com/embed/LuM2eMOTmN8?si=GefNp773GN36XGTp" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen>&lt;/iframe>
&lt;h2 id="what-is-the-crossref-grant-linking-system">What is the Crossref Grant Linking System?&lt;/h2>
&lt;p>The Crossref Grant Linking System, conceptualised in 2017, and launched in 2019, captures and helps clarify funding relationships for scholarly outputs. Thanks to interconnectedness with the 160 million metadata records collected and curated by Crossref members, it enables funders as well as scholars to track and analyse funding patterns and evaluate programmes, and it supports assertions about the integrity of scholarly records.&lt;/p>
&lt;h3 id="features-of-the-gls">Features of the GLS&lt;/h3>
&lt;ul>
&lt;li>Globally unique persistent link and identifier for each grant&lt;/li>
&lt;li>Connected with 160 million published outputs&lt;/li>
&lt;li>Funder-designed metadata schema, including project, investigator, value, and award-type information&lt;/li>
&lt;li>Programmatic or no-code methods to send metadata
&lt;ul>
&lt;li>Thanks to the &lt;a href="https://www.moore.org/" target="_blank">Gordon and Betty Moore Foundation&lt;/a> who funded development of the &lt;a href="https://manage.crossref.org/records" target="_blank">online grant registration form&lt;/a>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Open search and API for all to discover funding outcomes; all metadata is distributed openly to thousands of tools and services&lt;/li>
&lt;li>Crossref-hosted landing pages&lt;/li>
&lt;li>A global community of ~50 funder advisors and 35+ funders already in the Grant Linking System&lt;/li>
&lt;li>Membership of Crossref; influence the foundational infrastructure powering open research&lt;/li>
&lt;/ul>
&lt;p>The last five years has seen the GLS grow through membership, metadata, and community contributions.&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/community-images/gls/gls-5-years.png"
alt="graph showing the effects of specific funders joining that increase matches and relationships in the Crossref Grant Linking System" width="100%">
&lt;/figure>
&lt;br>
The momentum for this programme is building - as illustrated by increasing numbers of metadata records (and related relationships we’re seeing). The 35 funder members represent over 100 funding programmes and have created 125,000 grant records already.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/community-images/gls/gls-growth.png"
alt="timeline of the Crossref Grant Linking System from 2019 to 2024" width="100%">
&lt;/figure>
&lt;p>During last week&amp;rsquo;s call, it was helpful to hear from the community what they see as key benefits of the Crossref Grant Linking System:&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;ul>
&lt;li>Meaningfully delivering on and supporting Open Science policies and mandates, and contributing ‘their bit’ to the transparency of the evidence trail in the scholarly ecosystem.&lt;/li>
&lt;li>Reporting and evaluating the funding programmes, essential for the public funders who need to demonstrate the value for money in allocating their funds and other support.&lt;/li>
&lt;li>Supporting a more holistic assessment of scholarship and scholars, especially as and when metadata becomes included with a full array of outputs, not limited to books and articles.&lt;/li>
&lt;/ul>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;img src="https://www.crossref.org/images/community-images/gls/gls-benefits.png" alt="High-level benefits of the Crossref Grant Linking System (GLS)" width="100%" >
&lt;/div>
&lt;/div>
&lt;h2 id="how-the-crossref-grant-linking-system-supports-open-science-policy">How the Crossref Grant Linking System supports Open Science policy&lt;/h2>
&lt;p>Since 2020, all the grant records are openly available through our REST API which is queried more than 1.8 billion times every month so these metadata records are distributed to thousands of systems across the research enteprise. In a 2022 blog, Ed Pentz and Ginny Hendricks laid out &lt;a href="https://doi.org/10.64000/nfzyk-mfw64" target="_blank">guidelines for research funders to meet open science guidelines&lt;/a> using existing open infrastructure such as Crossref, ORCID, and ROR. Syman Stevens, a grantmaking and private philanthropy consultant, highlighted on the call that the funders he works with are increasingly interested in ways to deliver on their open science policy and that participation in the GLS is a tangible thing they can do to meet this goal.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>As part of its open science policy, NWO will start participating in the Crossref Grant Linking System from July 2025. Research funders are a part of the scholarly communications system; we not only provide the funding to do the actual research but can also be the authoritative source of data about the projects we have funded and the outputs arising from that funding. Increasingly, all these elements – grants, researchers, outputs - are linked with metadata and unique identifiers to ensure that research is findable and accessible.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Hans de Jonge, Director of Open Science NL, part of the Dutch Research Council (NWO)&lt;/cite>&lt;/p>
&lt;/div>
&lt;h2 id="how-funders-leverage-the-grant-linking-system-in-their-reporting-and-assessment">How funders leverage the Grant Linking System in their reporting and assessment&lt;/h2>
&lt;p>Looking back to the origins of the system, it’s important to recognise the work of the initial working groups. Through their contribution, funders helped design the initial metadata schema for grants as well as establish the governance and fees for this service, and our Advisory Group continues to inform further developments. In this way, the Grant Linking System enables the needs and wishes of funders to contribute and see their data as part of the wider ecosystem.&lt;/p>
&lt;p>An excellent example of that synergy in action is the use case presented by Cátia Laranjeira, manager of the PTCRIS programme at the Foundation for Science and Technology, Portugal (FCT). PTCRIS is the Foundation’s integrated national information ecosystem that supports scientific activity management. Cátia reflected on the relative fragmentation of spaces where the scientific outputs are found, and PTCRIS’s ambition for aggregating metadata in one place to be able to trace and evaluate programmes in light of the related outputs. At the start of the programme, they identified lack of a persistent identifier for grants as a major shortcoming of the system. Crossref GLS naturally fits in with their goals.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>The initiative by FCT to assign unique DOIs to national public funding through Crossref is a game-changer for open science, linking funding directly to scientific outcomes and boosting transparency. Join us in this effort—let&amp;rsquo;s make every grant count and ensure open access to research information!&amp;quot;&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Cátia Laranjeira, PTCRIS Program Manager at Fundacao para a Ciencia e a Tecnologis (FCT Portugal)&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>FCT initially piloted a small subset of their grants (approximately 6,000 recent awards) at the end of 2023. Cátia pointed to researchers’ keen participation in this programme as one of its successes – and thanks to the word of mouth, FCT has already been approached by researchers requesting unique Crossref links for their grants! This appetite for grant IDs will soon be more fully satisfied, as FCT is readying to register all of their grants with Crossref, to enable further insights into funding and outcome flows, supporting them in demonstrating the value for money for the public resources they manage. Via interfaces for grant management and standardised online CVs, the system is also enabling researchers to use the system in their own future reporting and career development.&lt;/p>
&lt;p>In the ensuing discussion, Rachel Bruce of UKRI mentioned that she’s hopeful that GLS will help funders ‘close the loop’ on more holistic reward and recognition, allowing for inclusion of evidence for a broader set of outputs in those processes.&lt;/p>
&lt;h2 id="how-the-community-is-working-to-integrate-open-infrastructure">How the community is working to integrate open infrastructure&lt;/h2>
&lt;p>Melissa Harrison, Team Leader at EMBL-EBI, manages Europe PMC and a complementary data science team, who were part of the initial FREYA project – supporting infrastructure delivery for unique identifiers for grants. The team has been adding grant records to Crossref on Wellcome’s behalf since 2019. Melissa highlighted the shortcomings of internal award numbers, which don’t tend to be understood outside of the ecosystem where they are produced (that is the funder’s administrative system), are almost certainly not unique, and don’t resolve to or connect with anything in the wider ecosystem. Therefore internal award numbers can’t signify relationships with other outputs or assets in the wider world. By contrast, Crossref’s Grant IDs are unique, persistent, resolvable, and interrelated with other Crossref metadata, whilst being retrievable for other systems to link to too.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Persistent identifiers for grants was the next logical step after identifiers for funders - open metadata registered with a PID in a central service like Crossref is invaluable to build the full picture of the research enterprise.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Melissa Harrison, Team Leader, Literature Services at EMBL-EBI)&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Ease of execution is important for scaling the Grant Linking System, and enabling its use in a diverse set of circumstances in the open science ecosystem. Altum was the trailblazer, first integrating its grant management platform Proposal Central with GLS. It was good to hear that others are now joining the integration efforts. Syman Stevens talked about the recent work initiated by Joe McArthur at &lt;a href="https://oa.works/" target="_blank">OA Works&lt;/a>, to develop a simple, open-source plug-in for any of the major grant management systems, to enable funders to deposit their grant metadata with Crossref GLS with a click of the button. Syman demonstrated the resulting interface in Fluxx, that allows for creating a record and sending grant metadata to Crossref as part of the regular grant management within the platform. He pointed out that, while this integration was developed for Fluxx, all code and documentation is openly available on &lt;a href="https://github.com/oaworks/create-grant-doi-in-fluxx" target="_blank">GitHub&lt;/a> and this can potentially be forked or adapted as necessary for reuse in other grant management systems.&lt;/p>
&lt;p>It is heartening that others in the community are seeing such a need for this that they&amp;rsquo;re funding and creating their own tools to advance participation and use of the GLS.&lt;/p>
&lt;p>Finally, Fred Atherden, Head of Production Operations at eLife, presented how they include Crossref grant identifiers in publication metadata for the version of record of the works published on their platform. eLife is the first publisher to fully integrate Crossref grant identifiers both within the article display and in the metadata. Fred shared that in addition to collecting the data from the authors, eLife also attempts matching, albeit using very restrictive methodology, to enable more grant metadata in their publication records. They recognise that so far there are very few publishers including persistent links for grants in this way, and talked about plans to start collecting and including this data further upstream, and including them in the future for reviewed preprints.&lt;/p>
&lt;h2 id="acknowledgements-and-how-to-participate-in-the-gls">Acknowledgements and how to participate in the GLS&lt;/h2>
&lt;p>Reflecting on the last five years, thanks must go to the &amp;gt;35 funders who are already participating (see logo mashup below), to our &lt;a href="https://www.crossref.org/working-groups/funders">current volunteers&lt;/a> and to those partners working to promote and make use of the Grant Linking System. We also acknowledge that the GLS would not have been possible without the Crossref board members at the time, our staff including alumni Josh Brown, Jennifer Kemp, Rachael Lammey, and Geoffrey Bilder, or without the early dedicated time and input from the following people and organisations on our working groups for governance and fees, and for metadata modelling:&lt;/p>
&lt;ul>
&lt;li>Yasushi Ogasaka and Ritsuko Nakajima, Japan Science &amp;amp; Technology Agency&lt;/li>
&lt;li>Neil Thakur and Brian Haugen, US National Institutes of Health&lt;/li>
&lt;li>Jo McEntyre and Michael Parkin, Europe PMC&lt;/li>
&lt;li>Robert Kiley and Nina Frentop, Wellcome&lt;/li>
&lt;li>Alexis-Michel Mugabushaka and Diego Chialva, European Research Council&lt;/li>
&lt;li>Lance Vowell and Carly Robinson, OSTI/US Dept of Energy&lt;/li>
&lt;li>Ashley Moore and Kevin Dolby, UKRI (Research Councils UK / Medical Research Council)&lt;/li>
&lt;li>Salvo da Rosa, Children&amp;rsquo;s Tumor Foundation&lt;/li>
&lt;li>Trisha Cruse, DataCite&lt;/li>
&lt;/ul>
&lt;figure>&lt;img src="https://www.crossref.org/images/community-images/gls/gls-members.png"
alt="funding bodies participating in the Crossref Grant Linking System (GLS)" width="100%">
&lt;/figure>
&lt;p>To learn more about the &lt;a href="https://www.crossref.org/services/grant-linking-system">Crossref Grant Linking System&lt;/a>, the best place to start is our service page. And for the next step, please reach out to us for a conversation about any questions specific to your organisation and any questions that may need to be addressed in order to enable your full participation.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Grant DOIs enhance the discovery and accessibility of funded project information and are one of the important links in a connected research ecosystem. I&amp;rsquo;m grateful and proud to contribute to the robustness and interconnectedness of the research infrastructure. Few funders are currently participating in the Crossref Grant Linking System, and I encourage others to consider doing so. This adoption follows the &amp;ldquo;network effect,&amp;rdquo; where the value and utility increase as more people participate, encouraging even wider adoption.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Kristin Eldon Whylly, Senior Grants Manager and Change Management Lead at Templeton World Charity Fund (TWCF)&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>You can email me via &lt;a href="mailto:feedback@crossref.org?subject=Grant%20Linking%20System">feedback@crossref.org&lt;/a> or &lt;a href="https://savvycal.com/kkorzec/68502be2" target="_blank">set up a call with me when it suits you&lt;/a> (you can overlay your own calendar using the toggle at the top right). We look forward to welcoming even more funders and to see those relationships in the open science infrastructure grow even further in the coming years.&lt;/p></description></item><item><title>The anatomy of metadata matching</title><link>https://www.crossref.org/blog/the-anatomy-of-metadata-matching/</link><pubDate>Thu, 27 Jun 2024 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/the-anatomy-of-metadata-matching/</guid><description>&lt;p>In our &lt;a href="https://doi.org/10.13003/aewi1cai" target="_blank">previous blog post&lt;/a> about metadata matching, we discussed what it is and why we need it (tl;dr: to discover more relationships within the scholarly record). Here, we will describe some basic matching-related terminology and the components of a matching process. We will also pose some typical product questions to consider when developing or integrating matching solutions.&lt;/p>
&lt;h2 id="basic-terminology">Basic terminology&lt;/h2>
&lt;p>Metadata matching is a high-level concept, with many different problems falling into this category. Indeed, no matter how much we like to focus on the similarities between different forms of matching, matching affiliation strings to ROR IDs or matching preprints to journal papers are still different in several important ways. At Crossref and ROR, we call these problems matching tasks.&lt;/p>
&lt;p>Simply put, a &lt;strong>matching task&lt;/strong> defines the kind or nature of the matching. Examples of matching tasks are bibliographic reference matching, affiliation matching, grant matching, or preprint matching.&lt;/p>
&lt;p>Every matching task has an input, which is all the data that is needed to perform the matching. Input data can come in many shapes and forms, depending on the matching task. For example, all of the following could be inputs to a matching task:&lt;/p>
&lt;pre tabindex="0">&lt;code>Department of Molecular Medicine, Sapporo Medical University, Sapporo 060-8556, Japan
&lt;/code>&lt;/pre>&lt;pre tabindex="0">&lt;code>&amp;lt;fr:program xmlns:fr=&amp;#34;http://www.crossref.org/fundref.xsd&amp;#34; name=&amp;#34;fundref&amp;#34;&amp;gt;
&amp;lt;fr:assertion name=&amp;#34;fundgroup&amp;#34;&amp;gt;
&amp;lt;fr:assertion name=&amp;#34;funder_name&amp;#34;&amp;gt;
European Union&amp;#39;s Horizon 2020 Research and Innovation Program through Marie Sklodowska Curie
&amp;lt;fr:assertion name=&amp;#34;funder_identifier&amp;#34;&amp;gt;http://dx.doi.org/10.13039/501100000780&amp;lt;/fr:assertion&amp;gt;
&amp;lt;/fr:assertion&amp;gt;
&amp;lt;fr:assertion name=&amp;#34;award_number&amp;#34;&amp;gt;721624&amp;lt;/fr:assertion&amp;gt;
&amp;lt;/fr:assertion&amp;gt;
&amp;lt;/fr:program&amp;gt;
&lt;/code>&lt;/pre>&lt;pre tabindex="0">&lt;code>Everitt, W. N., &amp;amp; Kalf, H. (2007). The Bessel differential equation and the Hankel transform. Journal of Computational and Applied Mathematics, 208(1), 3–19.
&lt;/code>&lt;/pre>&lt;pre tabindex="0">&lt;code>{
&amp;#34;title&amp;#34;: &amp;#34;Functional single-cell genomics of human cytomegalovirus infection&amp;#34;,
&amp;#34;issued&amp;#34;: &amp;#34;2021-10-25&amp;#34;,
&amp;#34;author&amp;#34;: [
{&amp;#34;given&amp;#34;: &amp;#34;Marco Y.&amp;#34;, &amp;#34;family&amp;#34;: &amp;#34;Hein&amp;#34;},
{&amp;#34;given&amp;#34;: &amp;#34;Jonathan S.&amp;#34;, &amp;#34;family&amp;#34;: &amp;#34;Weissman&amp;#34;, &amp;#34;ORCID&amp;#34;: &amp;#34;http://orcid.org/0000-0003-2445-670X&amp;#34;}
]
}
&lt;/code>&lt;/pre>&lt;p>Every matching task also has an &lt;strong>output&lt;/strong>. For our purposes, this is almost exclusively zero or more matched identifiers. In the context of a specific matching task, output identifiers may be of a specific type (e.g. we might match to a ROR ID, and never to an ORCID ID). In some cases, there can be a certain target set as well (i.e. matching only to DataCite DOIs). The output identifiers can have different &lt;a href="https://en.wikipedia.org/wiki/Cardinality" target="_blank">cardinality&lt;/a> depending on the task, meaning that the matching task might allow for zero, one, or more identifiers as a result of matching to a single input.&lt;/p>
&lt;p>A &lt;strong>matching strategy&lt;/strong> defines how the matching is done. Multiple strategies can exist for a specific matching task. Compound strategies can run other strategies and combine their outcomes into a single result.&lt;/p>
&lt;p>In some cases, we may also want the matching strategy to output a confidence score for each matched identifier. A confidence score represents the degree of certainty or likelihood that the matched identifier is correct, typically expressed as a value between 0 and 1. This score may help with post-processing or further interpretation of the results.&lt;/p>
&lt;p>To summarise, the anatomy of the matching task can be diagrammed as follows:&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/matching-task-anatomy.png"
alt="The anatomy of the matching task" width="75%">
&lt;/figure>
&lt;/div>
&lt;br />
&lt;h2 id="how-to-specify-a-matching-task">How to specify a matching task&lt;/h2>
&lt;p>Whenever we plan the development or integration of a matching solution, it is good to begin by answering a few basic questions:&lt;/p>
&lt;ol>
&lt;li>What problem do we plan to solve with our matching task? What would we call our matching task and how would we describe it?&lt;/li>
&lt;li>What do we expect as the input for this matching task? Which input formats do we need to be able to accept? What information do we expect to find in this input?&lt;/li>
&lt;li>What kind of identifiers should be output? Is there a target set of identifiers? Can our matching output zero/one/or multiple identifiers, and under what conditions might that occur?&lt;/li>
&lt;/ol>
&lt;p>These sound fairly simple, but the answers to these questions can be remarkably complex. Once one tries to apply these concepts to real-world problems, they might encounter several non-obvious challenges.&lt;/p>
&lt;p>For example, one common concern is at what level we should define each matching task. Consider the following problems:&lt;/p>
&lt;ol>
&lt;li>Matching bibliographic reference strings to DOIs. Example input:&lt;/li>
&lt;/ol>
&lt;pre tabindex="0">&lt;code>Everitt, W. N., &amp;amp; Kalf, H. (2007). The Bessel differential equation and the Hankel transform. Journal of Computational and Applied Mathematics, 208(1), 3–19.
&lt;/code>&lt;/pre>&lt;ol start="2">
&lt;li>Matching structured bibliographic reference to DOIs. Example input:&lt;/li>
&lt;/ol>
&lt;pre tabindex="0">&lt;code>{
volume: &amp;#34;208&amp;#34;,
author: &amp;#34;Everitt&amp;#34;,
journal-title: &amp;#34;J. Comput. Appl. Math.&amp;#34;,
article-title: &amp;#34;The Bessel differential equation and the Hankel transform&amp;#34;,
first-page: &amp;#34;3&amp;#34;,
year: &amp;#34;2007&amp;#34;,
issue: &amp;#34;1&amp;#34;
}
&lt;/code>&lt;/pre>&lt;p>Are those discrete matching tasks (&lt;em>unstructured reference matching&lt;/em> vs. &lt;em>structured reference matching&lt;/em>), or are they the same task (&lt;em>reference matching&lt;/em>) that can accept different types of inputs (unstructured or structured)?&lt;/p>
&lt;p>Similarly, let&amp;rsquo;s compare the following tasks:&lt;/p>
&lt;ol>
&lt;li>Matching affiliation strings to ROR IDs. Example input:&lt;/li>
&lt;/ol>
&lt;pre tabindex="0">&lt;code>Department of Molecular Medicine, Sapporo Medical University, Sapporo 060-8556, Japan
&lt;/code>&lt;/pre>&lt;ol start="2">
&lt;li>Matching funder names to ROR IDs. Example input:&lt;/li>
&lt;/ol>
&lt;pre tabindex="0">&lt;code>Alexander von Humboldt Foundation
&lt;/code>&lt;/pre>&lt;p>Are these different matching tasks (&lt;em>affiliation matching&lt;/em> vs. &lt;em>funder matching&lt;/em>), or the same task with different inputs (&lt;em>organisation matching&lt;/em>)?&lt;/p>
&lt;p>Defining the boundaries of a matching task can also be difficult. Consider, for example, the need to obtain ROR IDs for organisations mentioned in the acknowledgements section of a full-text academic paper. To begin, one may first extract the acknowledgement section from the full text, then run something like a named entity recognition (NER) tool to isolate the organisation names from the extracted text, and finally match these names to ROR IDs. Is this entire process matching, with the input being the full text of a paper? Or perhaps matching starts with the acknowledgement section as the input? Instead, is it only the last phase, where we try to match the extracted name to the ROR ID, that constitutes the matching task, with the extraction phases being completely separate processes?&lt;/p>
&lt;p>There are also important questions related to the expected behaviour of a matching strategy. Consider, for example, developing an affiliation matching strategy where we define our input as &amp;ldquo;an affiliation string&amp;rdquo;. What should happen when the strategy gets something else on the input, for example, song lyrics? Perhaps the strategy should simply return no matches, or an error, or we could say that in such a situation the behaviour is undefined and it simply doesn&amp;rsquo;t matter what is returned. But what should happen if in this input we have the lyrics of &lt;a href="https://www.azlyrics.com/lyrics/roxymusic/streetlife.html" target="_blank">Street Life by Roxy Music&lt;/a>, a song that mentions the names of a few universities that happen to have ROR IDs?&lt;/p>
&lt;p>It is likewise important to consider what should happen if different parts of the input match to different identifiers, like in the following example:&lt;/p>
&lt;pre tabindex="0">&lt;code>Department of Haematology, Eastern Health and Monash University, Box Hill, Australia
&lt;/code>&lt;/pre>&lt;p>Here, &amp;ldquo;Eastern Health&amp;rdquo; matches to &lt;a href="https://ror.org/00vyyx863" target="_blank">https://ror.org/00vyyx863&lt;/a> and &amp;ldquo;Monash University&amp;rdquo; to &lt;a href="https://ror.org/02bfwt286" target="_blank">https://ror.org/02bfwt286&lt;/a>. Should the matching strategy return all the identifiers, one of them (if so, which one?), or nothing at all?&lt;/p>
&lt;p>Similar questions arise when it is possible to match to multiple versions (or duplicates) in the target identifier set. This can happen, for example, in the context of bibliographic reference matching or preprint matching. Multiple matches may occur when there are different editions, reprints, or variations of the same publication in the target dataset, each with its own unique identifier.&lt;/p>
&lt;p>If you are waiting for an answer to these questions, we unfortunately must disappoint you here. These can only be answered in the context of a specific problem, considering who the users are and what it is they need and expect.&lt;/p>
&lt;p>Did you notice any other subtleties related to metadata matching and its concerns? Are there other non-obvious questions that should be considered when planning to develop or integrate metadata matching strategies? Let us know—we&amp;rsquo;d love to hear from you!&lt;/p></description></item><item><title>Drawing on the Research Nexus with Policy documents: Overton’s use of Crossref API</title><link>https://www.crossref.org/blog/drawing-on-the-research-nexus-with-policy-documents-overtons-use-of-crossref-api/</link><pubDate>Sat, 15 Jun 2024 00:00:00 +0000</pubDate><author>Luis Montilla</author><guid>https://www.crossref.org/blog/drawing-on-the-research-nexus-with-policy-documents-overtons-use-of-crossref-api/</guid><description>&lt;p>&lt;em>Update 2024-07-01: This post is based on an interview with Euan Adie, founder and director of Overton.&lt;/em>_&lt;/p>
&lt;h2 id="what-is-overton">What is Overton?&lt;/h2>
&lt;p>Overton is a big database of government policy documents, also including sources like intergovernmental organisations, think tanks, and big NGOs and in general anyone who&amp;rsquo;s trying to influence a government policy maker. What we&amp;rsquo;re interested in is basically, taking all the good parts of the scholarly record and applying some of that to the policy world. By this we mean finding all the documents, finding what&amp;rsquo;s out there, collecting metadata for them consistently, fitting to our schema, extracting references from all the policy documents we find, adding links between them, and then we also do citation analysis.&lt;/p>
&lt;h2 id="what-do-you-mean-by-the-good-parts-of-the-scholarly-record">What do you mean by the good parts of the scholarly record?&lt;/h2>
&lt;p>What I mean by the good parts of the scholarly record is, from a data perspective, having persistent open metadata for items on different stable, interoperable platforms and being able to build up layers of data to suit specific use cases. That&amp;rsquo;s a better approach than trying to do everything in a silo here and a silo there and trying to do stuff bit by bit or in a hundred different ways.&lt;/p>
&lt;p>There’s also a bad part, which is less to do with metadata and more around citation analysis and responsible metrics. With all this data… as the famous Spiderman quote goes… with great power comes a great responsibility: once you start systematically collecting this data, it’s very easy to fall into the trap of thinking that if we can put numbers on it, and then maybe we could start reading meaning into those numbers, and then it spirals out of control. So the idea for Overton was: can we take the system, some of the infrastructure and apply those ideas? But then come at it already knowing where the later pitfalls are and try to avoid them.&lt;/p>
&lt;h2 id="what-is-your-main-use-of-crossref-resources">What is your main use of Crossref resources?&lt;/h2>
&lt;p>We rely heavily on Crossref to link policy documents to the scholarly record. The question we’re trying to answer is: does this government document cite academic work? We work a lot with universities, think tanks, and IGOs. They’re asking where is the research we produce ending up? Is it being used by the government? In some countries, like the UK, there&amp;rsquo;s a big impact agenda where it&amp;rsquo;s quite important to demonstrate that for government funding. In the US as well, state universities for example aim to impact the local policy environment. Right? Are we producing things that went on to change life for local residents for the better? And that&amp;rsquo;s really what we&amp;rsquo;re trying to support. And so that&amp;rsquo;s one of the main use cases of the database.&lt;/p>
&lt;h2 id="can-you-tell-us-a-little-bit-more-about-the-story-of-overton-how-did-this-idea-start">Can you tell us a little bit more about the story of Overton, how did this idea start?&lt;/h2>
&lt;p>It really came from two things. The first one is that I&amp;rsquo;d always been interested in this area and before Overton, I founded a company called Altmetric.com, which was looking at kind of broader impact metrics for papers. And we looked at Twitter, and news, and blogs, and other things, including policy. But policy wasn&amp;rsquo;t a primary focus.&lt;/p>
&lt;p>When I left Altmetric two things were happening in the UK – not that everything is about Brexit, but Brexit was happening, and then COVID happened as well. And in both cases, I think it just drove home to me that other people seemed to be very interested in the evidence that the government has used to make decisions. Be they good decisions like some of the evidence based initatives in COVID or bad decisions like Brexit. So, how can you find out what it was? And it is actually very difficult to do. You can&amp;rsquo;t really track back how this decision was made. I thought that there is a growing need for that kind of impact analysis. So the second thing was, can we do something that helps make it easy to see what evidence goes into policy? The scholarly evidence but also the other kind of policy influence that goes into any document or discussion.&lt;/p>
&lt;h2 id="what-are-the-main-challenges-that-you-face-when-you-are-trying-to-retrieve-these-policy-documents">What are the main challenges that you face when you are trying to retrieve these policy documents?&lt;/h2>
&lt;p>Well, first is another thing that the scholarly record does well, which is persistence. We have CLOCKSS and all the &lt;a href="https://www.wiley.com/en-us/network/publishing/societies/publishing-strategy/what-is-a-dark-archive" target="_blank">dark archives&lt;/a>&lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>. So the whole idea is that if you have a DOI, if something moves, we can track it and it maintains the ID, and even if the publisher goes bust it&amp;rsquo;ll never disappear. For citing it, then there&amp;rsquo;s always going to be a copy of it somewhere available even if it&amp;rsquo;s in a library or a dark archive.&lt;/p>
&lt;p>One of the biggest challenges with policy documents is that kind of persistence doesn’t exist&amp;hellip; There are a lot of statistics about &lt;a href="https://en.wikipedia.org/wiki/Link_rot" target="_blank">link rot&lt;/a>&lt;sup id="fnref:2">&lt;a href="#fn:2" class="footnote-ref" role="doc-noteref">2&lt;/a>&lt;/sup>, and they hold true for policy documents as much as anywhere else. Every year a percentage of the links everywhere basically break because websites are redesigned or a government changes, it&amp;rsquo;s even worse because it can be by design. If you think about it, a new government comes into power, they change… let’s say the Department of Agriculture and they merge it with the Department of Fisheries. That would refer to a completely new third thing. And the other two departments disappear or they start linking off, like, redirecting or whatever.&lt;/p>
&lt;p>One of the challenges is just keeping track of all the changes in the landscape and constantly trying to stay on top of the data. And that&amp;rsquo;s a big part of what we do. Another challenge for us, and I think about it compared to journals, when you cite something in a scholarly document, you cite it in a given style, but there are no standards for referencing styles in policy documents. So even in the same document, we can see, like, four or five different ways of referring to something, and sometimes they&amp;rsquo;re missing important data and sometimes they&amp;rsquo;re not. And it means when we&amp;rsquo;re using Crossref search, we usually have much more unparsable text.&lt;/p>
&lt;h2 id="how-has-your-experience-been-so-far-using-our-crossref-api-or-our-services-in-general">How has your experience been so far using our Crossref API or our services in general?&lt;/h2>
&lt;p>It&amp;rsquo;s been great. I would happily say this anywhere, I always talk about the Crossref API as being one of the best examples of a well-done scholarly infrastructure API. It&amp;rsquo;s well-documented. It&amp;rsquo;s fast. It&amp;rsquo;s clear. The rate limits are clear. It&amp;rsquo;s up when it should be up. I like that you can trust it. So the technical aspect is great. From an organisational aspect, in contrast with a lot of infrastructure in the scholarly world that you don’t know if it&amp;rsquo;s even going to be there in a given time, Crossref is pretty stable.&lt;/p>
&lt;h2 id="what-would-you-say-are-the-main-challenges-or-things-that-we-can-improve-in-the-future-what-other-expectations-or-suggestions-do-you-have">What would you say are the main challenges or things that we can improve in the future? What other expectations or suggestions do you have?&lt;/h2>
&lt;p>It depends, if we&amp;rsquo;re talking about how the service could be improved versus how the data could be improved. Data-wise, and I appreciate this is a publisher problem, not a Crossref one, but, we still have to pull other data from OpenAlex, for example, for things like affiliations just because it&amp;rsquo;s missing from so many articles. And then equally things like ORCID for authors. And in fact also disambiguation in general. This is a huge problem that either the user doesn’t solve or you end up using a hundred different author disambiguation systems. I don&amp;rsquo;t know if there&amp;rsquo;s necessarily something Crossref wants to get into, but there&amp;rsquo;s definitely not something out there generally accepted already.&lt;/p>
&lt;p>Another kind of improvement I see is to make sure that changes in one API are reflected in the other, and they don&amp;rsquo;t get out of sync. When somebody updates their ORCID record, I’d like it reflected in the Crossref record if we’re using that as the “canonical” metadata record for the DOI. Retrospectively enriching records.&lt;/p>
&lt;p>I think it&amp;rsquo;s harder than I expected to just find preprints because you can&amp;rsquo;t simply use the item type but I understand that this is maybe a bigger issue. So maybe it&amp;rsquo;s not for a short time.&lt;/p>
&lt;p>Finally, this is very specific, but we experienced friction when going from the snapshots to having something useful, either in Elasticsearch or in, like, Postgres. It might be nice to have some open-source scripts to download and process everything, convert it to relational tables, or send it to an Elasticsearch cluster or something.&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>&lt;a href="https://www.wiley.com/en-us/network/publishing/societies/publishing-strategy/what-is-a-dark-archive" target="_blank">Platt, C. (2022). What is a Dark Archive? Wiley. Retrieved 10 January, 2024, from&lt;/a>&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:2">
&lt;p>&lt;a href="https://en.wikipedia.org/wiki/Link_rot" target="_blank">Link rot. (n.d.). In Wikipedia. Retrieved 10 January, 2024, from https://en.wikipedia.org/wiki/Link_rot.&lt;/a>&amp;#160;&lt;a href="#fnref:2" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Rebalancing our REST API traffic</title><link>https://www.crossref.org/blog/rebalancing-our-rest-api-traffic/</link><pubDate>Tue, 04 Jun 2024 00:00:00 +0000</pubDate><author>Stewart Houten</author><guid>https://www.crossref.org/blog/rebalancing-our-rest-api-traffic/</guid><description>&lt;p>Since we first launched our REST API around 2013 as a Labs project, it has evolved well beyond a prototype into arguably Crossref’s most visible and valuable service. It is the result of 20,000 organisations around the world that have worked for many years to curate and share metadata about their various resources, from research grants to research articles and other component inputs and outputs of research.&lt;/p>
&lt;p>The REST API is relied on by a large part of the research information community and beyond, seeing around 1.8 billion requests each month. Just five years ago, that average monthly number was 600 million. Our members are the heaviest users, using it for all kinds of information about their own records or picking up connections like citations and other relationships. Databases, discovery tools, libraries, and governments all use the API. Research groups use it for all sorts of things such as analysing trends in science or recording retractions and corrections.&lt;/p>
&lt;p>So the chances are high that almost any tool you rely on in scientific research has somewhere incorporated metadata through us.&lt;/p>
&lt;h2 id="optimising-performance">Optimising performance&lt;/h2>
&lt;p>For some time, we’ve been noticing reduced performance in a number of ways, and periodically we have a flurry of manually blocking/unblocking IP addresses from requesters that are hammering and degrading the service for everyone else, and this is of course only minimally effective and very short term. You can always watch our status page for alerts. This is the current one about REST API performance: &lt;a href="https://status.crossref.org/incidents/d7k4ml9vvswv" target="_blank">https://status.crossref.org/incidents/d7k4ml9vvswv&lt;/a>.&lt;/p>
&lt;p>As the number of users and requests has grown, our strategies for serving those requests must evolve. This post discusses how we’re approaching balancing the growth in usage for the immediate term and provides some thoughts about things we could try in the future on which we’ll gladly take feedback and advice.&lt;/p>
&lt;h3 id="load-balancing">Load balancing&lt;/h3>
&lt;p>In 2018, we started routing users through three different pools (&lt;em>public&lt;/em>, &lt;em>polite&lt;/em>, and &lt;em>plus&lt;/em>). This coincided with the launch of &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus">Metadata Plus&lt;/a>, a paid-for service with monthly data dumps and very high rate limits. Note that all metadata is exactly the same and real-time across all pools. We also, more recently, introduced an &lt;em>internal&lt;/em> pool. Here&amp;rsquo;s more about them:&lt;/p>
&lt;ul>
&lt;li>&lt;em>Plus&lt;/em>: This is the aforementioned premium option; it’s really for ‘enterprise-wide’ use in production services and is not really relevant here.&lt;/li>
&lt;li>&lt;em>Public&lt;/em>: This is the default and is the one that is struggling at the moment. You don’t have to identify yourself and, in theory, we don’t have to work through the night to support it if it’s struggling (although we often do). &lt;em>Public&lt;/em> currently receives around &lt;strong>30,000&lt;/strong> requests per minute.&lt;/li>
&lt;li>&lt;em>Polite&lt;/em>: Traffic is routed to &lt;em>polite&lt;/em> simply by detecting a mailto in the header. Any system or person including an email is being routed to a currently-quieter pool, this means we can always get in touch for troubleshooting (and only troubleshooting). &lt;em>Polite&lt;/em> currently receives around &lt;strong>5,000&lt;/strong> requests per minute.&lt;/li>
&lt;li>&lt;em>Internal&lt;/em>: In 2021, we introduced a new pool just for our own tools where we can control and predict the traffic. &lt;em>Internal&lt;/em> currently receives around &lt;strong>1,000&lt;/strong> requests per minute.&lt;/li>
&lt;/ul>
&lt;p>The volumes of traffic across &lt;em>public&lt;/em>, &lt;em>polite&lt;/em> and &lt;em>internal&lt;/em> pools are very different and yet each pool has always had similar resources. The purpose of each of these pools has been long-established but our efforts to ask the community to use &lt;em>polite&lt;/em> by default have not been particularly successful and it is clear that we don’t have the right balance.&lt;/p>
&lt;p>The &lt;em>internal&lt;/em> pool has been dedicated to our internal services that have predictable usage and that have requests that are not initiated by external users. The &lt;em>internal&lt;/em> pool has previously included reference matching but not Crossmark, Event Data, or search.crossref.org, which all use the &lt;em>polite&lt;/em> pool instead, along with the community. We have the capacity on the &lt;em>internal&lt;/em> pool to shift all of this “internal” traffic across, and in doing so we will create more capacity for genuine &lt;em>polite&lt;/em> users and redefine what we consider to be “internal”.&lt;/p>
&lt;p>Creating more capacity on &lt;em>polite&lt;/em> will also give us the opportunity to load-balance requests to both &lt;em>polite&lt;/em> and &lt;em>public&lt;/em> across the two pools. We are at a point where we cannot eke more performance out of the API without architectural changes. In order to buy ourselves time to address this properly, we will modify the routing of &lt;em>polite&lt;/em> and &lt;em>public&lt;/em> and evenly distribute requests to the two pools 50/50.&lt;/p>
&lt;p>The &lt;em>public&lt;/em> and &lt;em>polite&lt;/em> pools have equal resources at the moment yet handle very different volumes of traffic (30,000 req/min vs 5,000 req/min), and with the proposed changes to internal traffic the &lt;em>polite&lt;/em> pool would handle a fraction of this. The result would look something like 31,000 req/min evenly distributed across &lt;em>public&lt;/em> and &lt;em>polite&lt;/em>.&lt;/p>
&lt;h3 id="rate-limiting">Rate limiting&lt;/h3>
&lt;p>Our rate-limiting also needs review. We track a number of metrics in our web proxy but only deny requests on one of them - the number of requests per second. On &lt;em>public&lt;/em> and &lt;em>polite&lt;/em> we limit each IP address to sending 50 req/sec and if this rate is exceeded users are denied access for 10 seconds. These limits are generous and we cannot realistically support this volume of request for all users of the &lt;em>public&lt;/em> or &lt;em>polite&lt;/em> API.&lt;/p>
&lt;p>However, when requests are taking a long time to return, we potentially have a separate problem of high concurrency as hundreds of requests could be sent before the first one has returned. We intend to identify and impose an appropriate rate limit on concurrent requests from each IP to prevent a small number of users from disproportionately affecting all users with long-running queries.&lt;/p>
&lt;h3 id="longer-term">Longer-term&lt;/h3>
&lt;p>So, in the short-term we will revise our pool traffic as described above. We’ll do that this week. Then we will review the current rate limits and reduce them to something more reasonable for the majority of users. And we’ll identify and introduce a rate limit for concurrent requests from each user.&lt;/p>
&lt;p>Longer-term, we need to rearchitect our Elasticsearch pools so that we can:&lt;/p>
&lt;ul>
&lt;li>Reduce shard sizes to improve performance of queries&lt;/li>
&lt;li>Balance data shards and replicas more evenly&lt;/li>
&lt;li>Optimise our instance types for our workload&lt;/li>
&lt;/ul>
&lt;h2 id="want-to-help">Want to help?&lt;/h2>
&lt;p>Thanks for asking!&lt;/p>
&lt;p>Firstly, please, everyone, do always put an email in your API request headers - while the short term plan will help stabilise performance, this habit will always help us troubleshoot e.g. we can always contact you instead of blocking you!&lt;/p>
&lt;p>Secondly, we know many of you incorporate Crossref metadata, add lots of value to it in order to deliver important services, and also develop APIs of your own. We’d love any comments or recommendations from those of you handling similar situations on scaling and optimising API performance. You can comment on this post which is managed via our Discourse forum. We’ll also be adding updates to this thread as well as on status.crossref.org. If you’d like to be in touch with any of us directly, all our emails are &lt;a href="mailto:firstinitiallastname@crossref.org">firstinitiallastname@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Metadata matching 101: what is it and why do we need it?</title><link>https://www.crossref.org/blog/metadata-matching-101-what-is-it-and-why-do-we-need-it/</link><pubDate>Thu, 16 May 2024 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/metadata-matching-101-what-is-it-and-why-do-we-need-it/</guid><description>&lt;p>At Crossref and ROR, we develop and run processes that match metadata at scale, creating relationships between millions of entities in the scholarly record. Over the last few years, we&amp;rsquo;ve spent a lot of time diving into details about metadata matching strategies, evaluation, and integration. It is quite possibly &lt;a href="https://doi.org/10.64000/pdm9z-20m09" target="_blank">our&lt;/a> &lt;a href="https://doi.org/10.64000/e6ey2-wce96" target="_blank">favourite&lt;/a> &lt;a href="https://doi.org/10.64000/txft6-s1481" target="_blank">thing&lt;/a> to &lt;a href="https://www.youtube.com/watch?v=Tx5y7lX030U" target="_blank">talk&lt;/a> and &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">write&lt;/a> &lt;a href="https://doi.org/10.64000/dpcc9-k4564" target="_blank">about&lt;/a>! But sometimes it is good to step back and look at the problem from a wider perspective. In this blog, the first one in a series about metadata matching, we will cover the very basics of matching: what it is, how we do it, and why we devote so much effort to this problem.&lt;/p>
&lt;h2 id="what-is-metadata-matching">What is metadata matching?&lt;/h2>
&lt;p>Would you be able to find the DOI for the work referenced in this citation?&lt;/p>
&lt;pre tabindex="0">&lt;code>Everitt, W. N., &amp;amp; Kalf, H. (2007). The Bessel differential equation and the Hankel transform. Journal of Computational and Applied Mathematics, 208(1), 3–19.
&lt;/code>&lt;/pre>&lt;p>We bet you could! You might begin, for example, by pasting the whole citation, or only the title, into a search engine of your choice. This would probably return multiple results, which you would quickly skim. Then you might click on the links for a few of the top results, those that look promising. Some of the websites you visit might contain a DOI. Perhaps you would briefly compare the metadata provided on the website against what you see in the citation. If most of this information matches (see what we did there?), you would conclude that the DOI from that website is, in fact, the DOI for the cited paper.&lt;/p>
&lt;p>Well done! You just performed metadata matching, specifically, bibliographic reference matching. Matching in general can be defined as the task or process of finding an identifier for an item based on its structured or unstructured &amp;ldquo;description&amp;rdquo; (in this case: finding a DOI of a cited article based on a citation string).&lt;/p>
&lt;p>But matching doesn&amp;rsquo;t have to just be about citations and DOIs. There are many other instances of matching we can think of, for example:&lt;/p>
&lt;ul>
&lt;li>finding the ROR ID for an organisation based on an affiliation string,&lt;/li>
&lt;li>finding the ORCID ID for a researcher based on the person&amp;rsquo;s name and affiliation,&lt;/li>
&lt;li>finding the ROR ID for a funder based on the acknowledgements section of a research paper,&lt;/li>
&lt;li>finding the grant DOI based on an award number and a funder name.&lt;/li>
&lt;/ul>
&lt;p>Matching doesn&amp;rsquo;t have to be done manually. It is possible to develop fully automated strategies for metadata matching and employ them at scale. It is also possible to use a hybrid approach, where automated strategies assist users by providing suggestions.&lt;/p>
&lt;p>Developing automated matching strategies is not a trivial task, and if we want to do it right, it takes a great deal of time and effort. This brings us to our next question: is it worth it?&lt;/p>
&lt;h2 id="why-do-we-need-matching">Why do we need matching?&lt;/h2>
&lt;p>In short, metadata matching gives us a more complete picture of &lt;a href="https://www.crossref.org/documentation/research-nexus/">the research nexus&lt;/a> by discovering missing relationships between various entities within and throughout the scholarly record:&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/matching-101-relationships.png"
alt="Example relationships in the scholarly record" width="75%">
&lt;/figure>
&lt;/div>
&lt;br />
&lt;p>These relationships are very powerful. They provide important context for any entity, whether it is a research output, a funder, a research institution, or an author. Imagine for a moment the scholarly record without any such relationships, where all bibliographic references, affiliations (institution names and addresses), and funding information (funder names and grant titles) are provided as unstructured strings only. In such a world, how would you calculate the number of times a particular research paper was cited? How would you get a list of research outputs supported by a specific funder? It would be incredibly challenging to navigate, summarise, and describe research activities, especially considering the scale. Thankfully, these and many other questions can be answered thanks to metadata matching that discovers relationships between entities in the scholarly record.&lt;/p>
&lt;p>There are two primary ways we can use metadata matching in our workflows: as semi-automated tools that help users look up the appropriate identifiers or as fully automated processes that enrich the metadata in various scholarly databases.&lt;/p>
&lt;p>The first approach is quite similar to the example we described at the beginning. If you are submitting scholarly metadata, for example of a new article to be published, you can use metadata matching to look up identifiers for the various entities and include these identifiers in the submission. For example, with the help of metadata matching, instead of submitting citation strings, you could provide the DOIs for works cited in the paper and instead of the name and address of your organisation, you could provide its ROR ID. To make this easier for people, metadata submission systems and applications sometimes integrate metadata matching tools into user interfaces.&lt;/p>
&lt;p>The second approach allows large, existing sources of scholarly metadata to be enriched with identifiers in a fully automated way. For example, we can match affiliation strings to ROR IDs using a combination of machine learning models and ROR&amp;rsquo;s default matching service, effectively adding more relationships between people and organisations. We can also &lt;a href="https://doi.org/10.64000/dpcc9-k4564" target="_blank">compare journal articles and preprints metadata&lt;/a> in the Crossref database by calculating similarity scores for titles, authors, and years of publication to match them with each other and provide more relationships between preprints and journal articles. This automated enrichment can be done at any point in time, even after research outputs have been formally published.&lt;/p>
&lt;p>There are fundamental differences between these two approaches. The first is done under the supervision of a user, and for the second, the matching strategy makes all the decisions autonomously. As a result, the first approach will typically (although not always) result in better quality matches. By contrast, the second approach is much faster, generally less expensive, and scales to even very large data sources.&lt;/p>
&lt;p>In the end, no matter what approach is used, the goal is to achieve a more complete accounting of the relationships between entities in the scholarly record.&lt;/p>
&lt;p>This blog is the first one in a series about metadata matching. In the coming weeks, we will cover more detail about the product features related to metadata matching, explain why metadata matching is not a trivial problem, and share how we can develop, assess, compare, and choose matching strategies. Stay tuned!&lt;/p></description></item><item><title>2024 public data file now available, featuring new experimental formats</title><link>https://www.crossref.org/blog/2024-public-data-file-now-available-featuring-new-experimental-formats/</link><pubDate>Tue, 14 May 2024 00:00:00 +0000</pubDate><author>Patrick Polischuk</author><guid>https://www.crossref.org/blog/2024-public-data-file-now-available-featuring-new-experimental-formats/</guid><description>&lt;p>This year’s public data file is now available, featuring over 156 million metadata records deposited with Crossref through the end of April 2024 from over 19,000 members. A full breakdown of Crossref metadata statistics is available &lt;a href="https://www.crossref.org/06members/53status.html">here&lt;/a>.&lt;/p>
&lt;p>Like last year, you can download all of these records in one go via Academic Torrents or directly from Amazon S3 via the “requester pays” method.&lt;/p>
&lt;p>&lt;strong>Download the file:&lt;/strong> The torrent download can be initiated &lt;a href="https://academictorrents.com/details/4426fa56a4f3d376ece9ac37ed088095a30de568" target="_blank">here&lt;/a>. Instructions for downloading via the “requester pays” method, along with other tips for using these files, can be found on the &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-public-data-files-and-plus-snapshots/">“Tips for working with Crossref public data files and Plus snapshots”&lt;/a> page.&lt;/p>
&lt;p>In January, Martin Eve &lt;a href="https://doi.org/10.64000/5b0re-zpg76" target="_blank">announced&lt;/a> that we had been experimenting with alternative file formats meant to make our public data files easier to use by broader audiences. This year’s file will be published alongside the tools that can be used on the public data file to produce two experimental formats: &lt;a href="https://gitlab.com/crossref/labs/packer" target="_blank">JSON-lines&lt;/a> and &lt;a href="https://gitlab.com/crossref/labs/dois2sqlite" target="_blank">SQLite&lt;/a> (and a bonus &lt;a href="https://gitlab.com/crossref/labs/rustsqlitepacker" target="_blank">Rust version&lt;/a>). You can read more about our thinking behind this work in &lt;a href="https://doi.org/10.64000/5b0re-zpg76" target="_blank">Martin’s blog post&lt;/a>, and we are keen to hear your thoughts on these alternatives.&lt;/p>
&lt;p>Our annual public data file is meant to facilitate individuals and organisations interested in working with the entirety of our metadata corpus. Starting with the majority of our metadata records in one file should be much easier than starting from scratch with our API, but because Crossref metadata is always openly available, you can use &lt;a href="https://api.crossref.org/" target="_blank">the API&lt;/a> to keep your local copy up to date with new and updated records.&lt;/p>
&lt;p>If you’re curious about what you’ll get with the public data file, we’ve also published a sample version so that you can take a peek before committing to downloading the ~212 gb file. This file includes a random sample of JSON files and is available exclusively via torrent &lt;a href="https://academictorrents.com/details/d47fbe29e5ef93a6695421f79a6efa4b801acff1" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>We hope you find this public data file useful. Should you have any questions about how to access or use the file, please see the tips below, or share your questions below (you will be redirected to our community forum).&lt;/p>
&lt;h3 id="tips-for-using-the-torrent-and-retrieving-incremental-updates">Tips for using the torrent and retrieving incremental updates&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Use the public data file if you want all Crossref metadata records. Everyone is welcome to the metadata, but it will be much faster for you and much easier on our APIs to get so many records in one file. Here are some &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-public-data-files-and-plus-snapshots/">tips on how to work with the file&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Use the REST API to incrementally add new and updated records once you have the initial file. Here is &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-the-crossref-rest-api/">how to get started&lt;/a> (and avoid getting blocked in your enthusiasm to use all this great metadata!).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>While bibliographic metadata is generally required, because lots of metadata is optional, records will vary in quality and completeness.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Questions, comments, and feedback are welcome at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Integrity of the Scholarly Record (ISR): what do research institutions think?</title><link>https://www.crossref.org/blog/isr-online-event-with-research-institutions-2024/</link><pubDate>Thu, 09 May 2024 00:00:00 +0000</pubDate><author>Madhura Amdekar</author><guid>https://www.crossref.org/blog/isr-online-event-with-research-institutions-2024/</guid><description>&lt;p>Earlier this year, &lt;a href="https://doi.org/10.64000/x0431-5mz02" target="_blank">we reported on the roundtable discussion event&lt;/a> that we had organised in Frankfurt on the heels of the Frankfurt Book Fair 2023. This event was the second in the series of roundtable events that we are holding with our community to hear from you how we can all work together to preserve the integrity of the scholarly record - you can read more about insights from these events and about ISR in &lt;a href="https://www.crossref.org/categories/research-integrity/" target="_blank">this series of blogs&lt;/a>.&lt;/p>
&lt;p>Research institutions are one of the most important stakeholders in the endeavour of research integrity, and any conversation around ISR is incomplete without the views of this key community. This fact was acknowledged at the second ISR roundtable event, and one of the main takeaways from the discussions was to make more focused efforts to hear the viewpoints of researchers and academics.&lt;/p>
&lt;p>As the first step in this direction, we organised an online discussion on the integrity of the scholarly record, to which we invited: researchers and academics, research integrity experts based at academic institutions, Crossref members, as well as other organisations working on this topic such as COPE and Digital Science. The primary objective of this event was to hear from this community their perspectives on preserving and leveraging the integrity of the scholarly record and to identify opportunities for collaboration in this area. To ensure common ground, we also wanted to share information about Crossref metadata, the Research Nexus vision, and our position and role in the integrity of the scholarly record.&lt;/p>
&lt;p>To facilitate this, the event started with an introduction by Kora Korzec, Head of Community Engagement and Communication at Crossref, to our &lt;a href="https://www.crossref.org/about/" target="_blank">mission and vision&lt;/a> and the importance of capturing the relationships between the objects, people and places involved in research through the &lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">Research Nexus&lt;/a>. Amanda Bartell, Head of Member Experience, was next and she spoke about &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">the scholarly record and the role that Crossref plays in preserving the record’s integrity&lt;/a>. In her presentation, Amanda emphasised that Crossref’s role is not to assess the quality of content deposited by the members but rather to provide infrastructure that enables the community to provide and use metadata about the scholarly content produced by members. It’s important not to put up barriers to entry, but to work with all publishers to encourage best practices.&lt;/p>
&lt;p>Dominika Tkaczyk, Head of Strategic Initiatives, shared details of a few Crossref projects that focus on monitoring and improving metadata completeness, thereby supporting ISR. These projects include improving the Participation Reports, using metadata matching to discover new relationships (e.g., &lt;a href="https://doi.org/10.64000/dpcc9-k4564" target="_blank">preprint &lt;em>published&lt;/em> as work&lt;/a>, &lt;a href="https://doi.org/10.64000/he02b-neb96" target="_blank">work &lt;em>supported&lt;/em> by funder&lt;/a>, etc), and importing more retractions and other updates from the Retraction Watch database that was &lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">acquired and made openly available by Crossref&lt;/a>. Dominika used these examples to highlight the ways in which open and complete metadata can help in uncovering large scale trends and systemic concerns. The final speaker was Amanda French, &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a> Technical Community Manager, who introduced the audience to the Research Organisation Registry, or ROR.&lt;/p>
&lt;p>To accomplish the primary aim of the event, which was to hear the community’s viewpoints, the participants were divided into breakout groups for discussions and given three prompts to answer. The rest of the blog is a summary of what we heard from the participants.&lt;/p>
&lt;h3 id="1-is-crossrefs-role-what-you-expected-what-surprised-you-what-are-we-missing">1. Is Crossref’s role what you expected? What surprised you? What are we missing?&lt;/h3>
&lt;p>An overarching sentiment from the academics in the audience was that Crossref does so much more than is known to researchers! They were surprised by the range of activities underway at Crossref. At the same time, there were calls for Crossref to play a bigger role. Suggestions included playing a leadership role in deciding which metadata elements are a priority, providing guidance on the main metadata components important for signalling trust, playing a greater role in connecting various identifiers to ensure that relationships between different content types are preserved well, and to coordinate the efforts being taken by institutions, publishers and service providers around research integrity, by virtue of Crossref’s unique position in the community. There was a broad agreement that by providing the essential infrastructure, Crossref acts as the base upon which other actors in the scholarly community can build.&lt;/p>
&lt;h3 id="2-what-metadata-elements-do-you-consider-important-for-signalling-trust">2. What metadata elements do you consider important for signalling trust?&lt;/h3>
&lt;p>Many participants spoke about the various ways in which author identity and affiliation are important as trust signals. Being able to identify when an author has changed institutions, or being able to make a distinction between authors who have the same name is important. Author affiliations that are authentic and verified would go a long way in establishing trust.&lt;/p>
&lt;p>Multiple assertions, e.g. for affiliations, would be welcome. The use cases for this could be when research starts at one institution and is carried over to another, or when researchers affiliated with an institution may perform part of the research overseas. Some of the participants, who actively investigate research data, shared that abstracts are valuable because they can be used for large scale analyses related to research integrity.&lt;/p>
&lt;p>Other metadata elements that came up during this discussion were data on peer review, ethics approval, patient and donor consent in medical research, editorial boards (especially of special issues), pre-registration, funding metadata, datasets and programming scripts.&lt;/p>
&lt;h3 id="3-what-value-do-you-see-in-the-integrity-and-completeness-of-the-scholarly-record-in-the-way-you-operate-how-do-you-contribute-to-it-how-can-it-support-you-to-achieve-your-own-goals">3. What value do you see in the integrity and completeness of the scholarly record in the way you operate? How do you contribute to it? How can it support you to achieve your own goals?&lt;/h3>
&lt;p>Participants acknowledged that integrity of the metadata and the scholarly record is essential. Ensuring this integrity is a dynamic process, much akin to the concept of organised scepticism which is the notion that all scientific work should be trusted subject to its verification. Several ideas were shared on how to progress the integrity and completeness of the scholarly record. One recommendation was to use multiple metadata trust markers as that can make it harder for bad actors to game the system, but this may run the risk of making things complicated. Another suggestion was to make metadata part of the onboarding procedure- by gathering staff ORCID iDs during the onboarding process and sharing the institutional ROR ID with staff to promote its use, institutions can ensure that this information is routinely made available. The metadata deposited with Crossref should be integrated with downstream workflows to better facilitate the use of this rich metadata. An example of this is to integrate Crossmark with other research tools such as reference management software.&lt;/p>
&lt;p>The participants acknowledged that this discussion underlined for them the fact that having identifiers in itself is not an indicator of quality and that the underlying metadata records and wider context is key to understanding trustworthiness of the content.&lt;/p>
&lt;p>This event was a good first step towards engaging researchers and academics in the conversation about ISR. It connected folks working in different parts of the world who are united by their interest in research integrity. There was good engagement among all and commitment to continue these conversations in the future, with many participants planning to connect at the &lt;a href="https://wcri2024.org/" target="_blank">World Conference on Research Integrity&lt;/a> in June (I’ll be attending as well, for anyone who wants to continue the conversation - along with my colleagues Fabienne and Evans).&lt;/p>
&lt;p>At Crossref, we plan on continuing these conversations with all segments of the community to understand their needs and perceptions around metadata. The greater the awareness about the importance of metadata and its applications, including for research integrity, the richer the metadata that we are able to collect together. This will lead to building a comprehensive Research Nexus and emergence of more relationships therein. Please write in response to this post on our Community Forum if you have any thoughts on this as we’d love to hear from you.&lt;/p>
&lt;h2 id="list-of-participants">List of participants&lt;/h2>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>&lt;/th>
&lt;th>&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Manu Goyal&lt;/td>
&lt;td>International Journal of Cancer&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Panagiotis Kavouras&lt;/td>
&lt;td>University of Oslo&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Dorothy Bishop&lt;/td>
&lt;td>University of Oxford&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Zhesi (Phil) Shen&lt;/td>
&lt;td>Centre of Scientometrics, National Science Library, Chinese Academy of Sciences&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Wouter Vandevelde&lt;/td>
&lt;td>KU Leuven&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Leslie McIntosh&lt;/td>
&lt;td>Digital Science&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Elizabeth Noonan&lt;/td>
&lt;td>University College Cork&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Radek Gomola*&lt;/td>
&lt;td>Masaryk University Press&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Queensland University of Technology&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>London School of Hygiene &amp;amp; Tropical Medicine&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Vilnius Gediminas Technical University Library&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Committee on Publication Ethics (COPE)&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ginny Hendricks&lt;/td>
&lt;td>Chif Program Officer, Crossref&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kornelia Korzec&lt;/td>
&lt;td>Director of Community, Crossref&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Amanda Bartell&lt;/td>
&lt;td>Director of Membership, Crossref&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Dominika Tkaczyk&lt;/td>
&lt;td>Director of Data Science, Crossref&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Amanda French&lt;/td>
&lt;td>Technical Community Manager, Crossref&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Madhura Amdekar&lt;/td>
&lt;td>Community Engagement Manager, Crossref&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;em>*Note: name added 21-May-2024&lt;/em>&lt;/p></description></item><item><title>Seeking consultancy: understanding joining obstacles for non-member journals</title><link>https://www.crossref.org/blog/seeking-consultancy-understanding-joining-obstacles-for-non-member-journals/</link><pubDate>Wed, 01 May 2024 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/seeking-consultancy-understanding-joining-obstacles-for-non-member-journals/</guid><description>&lt;p>Crossref is undertaking a large program, dubbed &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">'RCFS' (Resourcing Crossref for Future Sustainability)&lt;/a> that will initially tackle five specific issues with our fees. We haven’t increased any of our fees in nearly two decades, and while we’re still okay financially and do not have a revenue growth goal, we do have inclusion and simplification goals. &lt;a href="https://www.crossref.org/pdfs/research-consulting-rcfs-report-public.pdf">This report from Research Consulting&lt;/a> helped to narrow down the five priority projects for 2024-2025 around these three core goals:&lt;/p>
&lt;h2 id="scope-of-the-rcfs-program-2024-2025">Scope of the RCFS Program 2024-2025&lt;/h2>
&lt;div class="shortcode-divwrap green-highlight">
&lt;span>&lt;h4 id="goal-more-equitable-fees">GOAL: MORE EQUITABLE FEES&lt;/h4>
&lt;ol>
&lt;li>&lt;strong>Project 1&lt;/strong>: Evaluate the USD $275 annual membership fee tier and propose a more equitable pricing structure, which might entail breaking this down into two or more different tiers.&lt;/li>
&lt;li>&lt;strong>Project 2&lt;/strong>: Define a new basis for sizing and tiering members for their capacity to pay&lt;/li>
&lt;/ol>
&lt;h4 id="goal-simplify-complex-fees">GOAL: SIMPLIFY COMPLEX FEES&lt;/h4>
&lt;ol start="3">
&lt;li>&lt;strong>Project 3&lt;/strong>: Address and adjust &lt;em>volume&lt;/em> discounts for Content Registration&lt;/li>
&lt;li>&lt;strong>Project 4&lt;/strong>: Address and adjust &lt;em>back-year&lt;/em> discounts for Content Registration&lt;/li>
&lt;/ol>
&lt;h4 id="goal-rebalance-revenue-sources">GOAL: REBALANCE REVENUE SOURCES&lt;/h4>
&lt;ol start="5">
&lt;li>&lt;strong>Project 5&lt;/strong>: Reflect the increasing value of Crossref as a metadata source, likely increasing Metadata Plus fees&lt;/li>
&lt;/ol>
&lt;/span>
&lt;/div>
&lt;h2 id="work-to-date">Work to date&lt;/h2>
&lt;p>As part of the RCFS program, we are working closely with our Membership &amp;amp; Fees Committee to discuss insights, gather feedback, and make recommendations to the Board. As a first step, we have surveyed and received responses from around 1000 of the current 8000 Crossref members in our lowest membership fee tier (USD $275). We are now starting to distill that data and will discuss it on our &lt;a href="https://www.crossref.org/events/the-shape-of-things-to-come">community call on May 8th&lt;/a> and subsequently with the M&amp;amp;F Committee to inform recommendations for fee changes that may going into effect in 2025 or 2026.&lt;/p>
&lt;h2 id="request-for-information-rfi-about-community-consultation-project">Request For Information (RFI) about community consultation project&lt;/h2>
&lt;p>While we have useful data from existing Crossref members, we know that there are many thousands of journals that are not (yet) members, and we need to understand this group better, in particular, to document and address the financial obstacles as well as the technical or social challenges.&lt;/p>
&lt;blockquote>
&lt;p>We are looking for community facilitation expertise, with multiple language skills, to conduct a series of focus groups with non-member journals, with a summary and insights report (in English) provided by the end of June 2024.&lt;/p>
&lt;/blockquote>
&lt;p>All the data and documentation will be available publicly on the dedicated &lt;a href="https://www.crossref.org/community/special-programs/resourcing-crossref/">RCFS Program website&lt;/a>&lt;/p>
&lt;p>As well as designing, conducting, and summarising the results of some focus groups (participants for which will be gathered via our own contacts and those of partners such as DOAJ, EIFL, and the Free Journal Network) we would like the consultant to review work such as the &lt;a href="https://diamasproject.eu/diamas-results-institutional-landscape-survey/" target="_blank">DIAMAS institutional publishing report&lt;/a>, and identify data relevant to Crossref’s fee model.&lt;/p>
&lt;p>If you would like to respond, please provide the following information and send it to Kora Korzec at &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a> by &lt;strong>15th May&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>Your consultancy organisation and your role within it&lt;/li>
&lt;li>Examples of similar market research undertaken&lt;/li>
&lt;li>Languages spoken within your team&lt;/li>
&lt;li>Confirmation that the timeline is workable&lt;/li>
&lt;li>Approximate fee, likely range, or structure/basis for your fee&lt;/li>
&lt;/ul>
&lt;p>Equally, if you represent a journal or group of journals, such as Diamond Open Access journals, and are not yet using Crossref, please get in touch and we can include your group in the research.&lt;/p>
&lt;p>Thank you!&lt;/p></description></item><item><title>This year's call for expressions of interest to join our board</title><link>https://www.crossref.org/blog/this-years-call-for-expressions-of-interest-to-join-our-board/</link><pubDate>Fri, 26 Apr 2024 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/this-years-call-for-expressions-of-interest-to-join-our-board/</guid><description>&lt;p>The Crossref Nominating Committee is inviting expressions of interest to join the Board of Directors of Crossref for the term starting in January 2025. The committee will gather responses from those interested and create the slate of candidates that our membership will vote on in an election in September.&lt;/p>
&lt;p>&lt;strong>Expressions of interest will be due Monday, May 27th, 2024&lt;/strong>&lt;/p>
&lt;p>This is an exciting time to join the board, as we have a number of active projects underway: We are considering resourcing Crossref for a sustainable future and board members will be part of deciding any changes to our fees scheme and overseeing its implementation. We&amp;rsquo;re focusing on how our community and metadata can contribute to ensuring the integrity of the scholarly record. We’re broadening our metadata record to capture richer funding and institutional affiliations. We&amp;rsquo;re working towards a future where the scholarly record prioritizes relationships between research outputs to build a holistic research nexus. The board helps guide this work.&lt;/p>
&lt;h3 id="about-the-board-elections">About the board elections&lt;/h3>
&lt;p>The board is elected through the “one member, one vote” policy wherein every member organisation of Crossref has a single vote to elect representatives to the Crossref board. Board terms are for three years, and this year, there are four seats open for election.&lt;/p>
&lt;p>The board maintains a balance of seats, with eight seats for smaller members and eight seats for larger members (based on total revenue to Crossref). This is an effort to ensure that the scholarly community&amp;rsquo;s diversity of experiences and perspectives is represented in decisions made at Crossref.&lt;/p>
&lt;p>This year, we will elect two of the larger member seats (membership tiers $3,900 and above) and two of the smaller member seats (membership tiers $1,650 and below). You don’t need to specify which seat you are applying for; we will provide that information to the nominating committee.&lt;/p>
&lt;p>The online election will open in September, with results announced at the annual meeting on October 29th, 2024. New members will begin their term in January 2025.&lt;/p>
&lt;h3 id="about-the-nominating-committee">About the Nominating Committee&lt;/h3>
&lt;p>The Nominating Committee reviews the expressions of interest and selects a slate of candidates for election. The slate put forward will exceed the total number of open seats. The committee considers the statements of interest, organisational size, geography, and experience.&lt;/p>
&lt;p>2024 Nominating Committee&lt;/p>
&lt;ul>
&lt;li>James Phillpotts*, Director of Content Transformation and Standards, Oxford University Press, committee chair&lt;/li>
&lt;li>Oscar Donde*, Editor in Chief, Pan Africa Science Journal&lt;/li>
&lt;li>Rose L’Huillier*, Senior Vice President Researcher Products, Elsevier&lt;/li>
&lt;li>Ivy Mutambanengwe-Matanga, Chief Operating Officer, African Journals Online&lt;/li>
&lt;li>Adam Sewell, Chief Technology Officer, IOP Publishing&lt;/li>
&lt;/ul>
&lt;p>(*) indicates Crossref board member&lt;/p>
&lt;h3 id="what-is-the-committee-looking-for-this-year">What is the committee looking for this year&lt;/h3>
&lt;p>The committee looks for skills and experience that will complement the rest of the board. Candidates from countries and regions not currently reflected on the board are strongly encouraged to apply. Successful candidates often have some or all of these characteristics:&lt;/p>
&lt;ul>
&lt;li>Demonstrate a commitment to or understanding of our &lt;a href="https://www.crossref.org/strategy/" target="_blank">strategic agenda&lt;/a> or the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a>;&lt;/li>
&lt;li>Have expertise that may be underrepresented on the board currently;&lt;/li>
&lt;li>Hold senior/director-level positions in their organisations;&lt;/li>
&lt;li>Have experience with governance or community involvement;&lt;/li>
&lt;li>Represent member organisations that are active in the scholarly communications ecosystem;&lt;/li>
&lt;li>Demonstrate metadata best practices as shown in the member’s &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>The board is also encouraging Crossref members who are research funders to apply.&lt;/p>
&lt;h3 id="board-roles-and-responsibilities">Board roles and responsibilities&lt;/h3>
&lt;p>Crossref’s services provide a central infrastructure for scholarly communications. Crossref’s board helps shape the future of our services and by extension, impacts the broader scholarly ecosystem. We are looking for board members to contribute their experience and perspective.&lt;/p>
&lt;p>The role of the board at Crossref is to provide strategic and financial oversight of the organisation, as well as guidance to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ul>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ul>
&lt;p>The board is representative of our membership base and guides the staff leadership team on trends affecting scholarly communications. The board sets strategic directions for the organisation while also providing oversight into policy changes and implementation. Board members have a fiduciary responsibility to ensure sound operations. They do this by attending board meetings as well as joining more specific board committees.&lt;/p>
&lt;h3 id="who-can-apply-to-join-the-board">Who can apply to join the board?&lt;/h3>
&lt;p>Any active member of Crossref can apply to join the board. Crossref membership is open to organisations that produce content, such as academic presses, commercial publishers, standards organisations, and research funders.&lt;/p>
&lt;h3 id="what-is-expected-of-board-members">What is expected of board members?&lt;/h3>
&lt;p>Board members attend four meetings each year that typically take place in January, March, July, and November. Meetings have taken place in a variety of international locations and travel support is provided when needed. January, March, and November board meetings are held virtually, and all committee meetings take place virtually. Each board member should sit on at least one Crossref committee. Care is taken to accommodate the wide range of time zones in which our board members live.&lt;/p>
&lt;p>While the expressions of interest are specific to an individual, the seat that is elected to the board belongs to the member organisation. The primary board member also names an alternate who may attend meetings in the event that the primary board member is unable to. There is no personal financial obligation to sit on the board. The member organisation must remain in good standing.&lt;/p>
&lt;p>Board members are expected to be comfortable assuming the responsibilities listed above and to prepare and participate in board meeting discussions.&lt;/p>
&lt;h3 id="how-to-apply">How to apply&lt;/h3>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSf5m9mXCYRGQgu_6qlo7xaIz0LyFgmzIXTeOC-UW8_2C20pmw/viewform" target="_blank">click here to submit your expression of interest&lt;/a>. We ask for a brief statement about how your organisation could enhance the our board and a brief personal statement about your interest and experience with Crossref.&lt;/p>
&lt;p>Please contact me with any questions at &lt;a href="mailto:lofiesh@crossref.org">lofiesh@crossref.org&lt;/a>&lt;/p></description></item><item><title>Common views and questions about metadata across Africa</title><link>https://www.crossref.org/blog/common-views-and-questions-about-metadata-across-africa/</link><pubDate>Wed, 24 Apr 2024 00:00:00 +0000</pubDate><author>Johanssen Obanda</author><guid>https://www.crossref.org/blog/common-views-and-questions-about-metadata-across-africa/</guid><description>&lt;p>This past year has been a captivating journey of immersion within the Crossref community, a mix of online interactions and meaningful in-person experiences. From the engaging Sustainability Research and Innovation Conference in Port Elizabeth, South Africa, to the impactful webinars conducted globally, this has been more than just a professional endeavour; it has been a personal exploration of collaboration, insights, and a shared commitment to pushing the boundaries of scholarly communication.&lt;/p>
&lt;h2 id="working-collaboratively-with-research-funders-and-research-organisations">Working collaboratively with research funders and research organisations&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/cocreation-activity-SRI.JPG"
alt="Cocreation activity in smaller groups at the SRI conference." width="400px">&lt;figcaption>
&lt;p>Cocreation activity in smaller groups at the SRI conference.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>The adventure began with a significant in-person event, the Sustainability Research and Innovation Conference. In the coastal city of Port Elizabeth, South Africa, I had the honour of hosting a parallel co-creation session titled &amp;ldquo;Connecting Science to Society: A Network Approach to Improving Science Communication in the Global South.&amp;rdquo; The co-creation session addressed research discoverability and accessibility among early-career researchers. Apart from some immediate feedback from the researchers in the room about how they might use co-creation beyond the conference to improve their research experience and outcome, I also had conversations with research funders from the Belmont Forum, Future Earth, and National Research Foundation - South Africa and the National Research Foundation - Mozambique about connecting their grants and grantees with their published outputs referencing Crossref’s &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">Open Funder Registry&lt;/a> and &lt;a href="https://www.crossref.org/community/grants/" target="_blank">research grants registration&lt;/a>. A different side conversation was about a community organisation in Botswana that is interested in registering patents with Crossref for proper referencing and protecting the intellectual property of their research on the indigenous communities’ innovations and the associated published work. These conversations are ongoing, unveiling a new understanding of unique needs and opportunities to pursue with research funders and research organisations working on indigenous knowledge and innovations.&lt;/p>
&lt;h2 id="learning-from-organisations-in-gem-eligible-countries">Learning from organisations in GEM-eligible countries&lt;/h2>
&lt;p>The journey extended globally through a series of webinars conducted in Bangladesh, Tanzania, Nepal, and Ghana. Collaborating with dedicated Ambassadors and my colleagues leading the &lt;a href="https://www.crossref.org/gem/" target="_blank">Global Equitable Membership (GEM) program&lt;/a>, we witnessed an increase in Crossref membership from the GEM countries and initial metadata registration. The GEM Program offers relief from both Crossref membership and Crossref content registration fees for organisations in the least economically advantaged countries in the world, based on the World Bank&amp;rsquo;s IDA list. Susan, in her blog post, &amp;ldquo;&lt;a href="https://doi.org/10.64000/vnvbt-64862" target="_blank">The GEM Program: Year One&lt;/a>&amp;rdquo;, elaborated on the significance of these efforts and their impact on fostering equitable access to scholarly resources and communication through the expansion of Crossref&amp;rsquo;s membership base in underrepresented regions, such as Bangladesh, Tanzania, Nepal, and Ghana. Specific concerns encountered while presenting the GEM program included feedback expressing reservations about the program&amp;rsquo;s approach, particularly in deciding on eligible countries, and advocating eligibility for the program to be extended to all the non-GEM countries in Africa. Additionally, a conversation with some organisations brought up concerns regarding the program&amp;rsquo;s sustainability, with inquiries about whether GEM was merely a free trial or freemium service, and seeking assurances against future fees. The audience found these sessions helpful, acknowledging that joining fees were no longer going to be a barrier, yet questions about the program&amp;rsquo;s longevity brought out the need for sustained support.&lt;/p>
&lt;h2 id="discussing-how-the-research-nexus-can-support-the-community">Discussing how The Research Nexus can support the community&lt;/h2>
&lt;p>My journey then led me to Makerere University in Uganda for the Consortium of Uganda University Libraries (CUUL 2023) conference and the Forum for Open Research in MENA (FORM 2023) in Abu Dhabi. In Uganda, I noticed the synergy between university libraries, institutional repositories, and the research and education network service provider formed a consortium that played a crucial role in bridging the digital gap and supporting the adoption of open infrastructure. The event was mainly attended by librarians from different universities in Uganda. Most of those I connected with needed more information about Crossref and had questions about how Crossref DOIs are different from ARKs, which they commonly use in their publishing workflows. At FORM 2023, in my presentation titled, &amp;ldquo;The Research Nexus: A Rich and Reusable Open Network of Relationships in the Scholarly Record,&amp;rdquo; I shared Crossref&amp;rsquo;s vision for a connected research ecosystem with the audience that comprised of researchers, research administrators, and funders, and a good number of big publishers like IEEE and Taylor &amp;amp; Francis. The &lt;a href="https://doi.org/10.64000/vnvbt-64862" target="_blank">Research Nexus&lt;/a> seeks to reveal relationships beyond persistent identifiers, utilising rich metadata to connect various scholarly components. I also took the opportunity at both events to share about &lt;a href="https://theplace.discourse.group/" target="_blank">The Publishers Learning And Community Exchange (PLACE)&lt;/a>, an online forum promoting best practices in scholarly publishing. The goal was to show attendees how they can actively contribute to and benefit from this vision, fostering a robust and interconnected research community through Crossref&amp;rsquo;s open infrastructure.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/Dr-SAlwan-and-Obanda.jpg"
alt="Photo with Dr. Salwan Abdulateef, Crossref Ambassador - Iraq" width="50%">&lt;figcaption>
&lt;p>Photo with Dr. Salwan Abdulateef, rossref Ambassador - Iraq&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>I enjoyed the opportunity to join the National Open Science Dialogue by TCC Africa, which provided crucial insights, emphasising the need for assessing awareness, implementing comprehensive policies, and fostering collaboration around Open Science. Higher education institutions were recognized as influencers in the global Open Science movement, while a call for an inclusive research environment was underscored through open access and data sharing. The dialogue emphasized a collective effort involving policymakers, educators, researchers, and institutions, focusing on inclusivity and collaboration to advance Open Science in East Africa.&lt;/p>
&lt;h2 id="exploring-how-rich-metadata-can-provide-trust-signals-with-members-in-kenya">Exploring how rich metadata can provide trust signals with members in Kenya&lt;/h2>
&lt;p>Reflecting on the Crossref Nairobi event that happened in February 2024, it was an enriching experience exploring key issues shaping scholarly publishing in Kenya. The discussions also touched on the role of metadata as a trust signal and a tool for the persistence of the scholarly record, particularly in regions where data protection challenges persist. This is exemplified by concerns raised during the event about the fear of data theft, misuse, or loss, especially in places with comparatively weaker data protection laws. The presence of robust metadata, particularly with detailed provenance information, becomes crucial in such contexts, as it enables better identification and handling of potential misuse. Thus, through effective metadata implementation and the persistence facilitated by identifiers, the management of data risks can be significantly improved.&lt;/p>
&lt;p>The insights from existing Crossref members pointed out contextual challenges, regional differences, and the importance of effective post-publication processes. The conference served as a valuable platform for dialogue, emphasising the collective commitment to continuous improvement of scholarly communication in the country, and the need for continuous awareness and training on making the most of Crossref services. The roundtable discussions during the Crossmark service consultation brought to light various reflections and considerations regarding post-publication changes in publishing workflows. The Crossmark service was a new discovery for most participants, with potential value recognized in facilitating current updates on articles. However, there are existing barriers such as a lack of awareness and technical expertise, suggesting the need for further education to facilitate adoption. Overall, the consultation provided a platform for introspection and exploration of avenues for improving post-publication practices in scholarly publishing.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2024/Crossref-Nairobi-Group-Photo.jpg"
alt="Crossref Nairobi group photo" width="75%">&lt;figcaption>
&lt;p>Crossref Nairobi group photo&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>We organised the Crossref Nairobi event with the help of colleagues from the outreach team and local Ambassadors, Mercury Shitindo of Kenya, Baraka Ngussa of Tanzania and our Board Members in Kenya, Oscar Donde. It was the first time I saw both my colleagues and Ambassadors in action and working closely together - making presentations and accommodating last-minute facilitation changes to the program. Compared to attending or speaking at an event, organising one was a unique experience requiring a lot of planning in advance for logistics and the event program, identifying and keeping in touch with important stakeholders, ushering guests and being on standby for any matters that come up about the event. All of that went very well thanks to the team on the ground and cooperative participants.&lt;/p>
&lt;h2 id="exploring-the-role-of-open-infrastructure-for-african-universities">Exploring the role of open infrastructure for African universities&lt;/h2>
&lt;p>Attending the recent WACREN 2024 conference was an eye-opening experience, unfolding the role of open infrastructure in addressing challenges faced by African universities. A focus on open access systems and advocacy for decolonizing knowledge were voiced too, including challenges of affordability of DOIs and questions of local ownership amidst global initiatives. Global persistent identifier providers, including ORCID and DataCite too, had a presence at the conference, alongside passionate advocates for more locally managed, decentralised infrastructure. These are concerns that Crossref needs to understand better, as we seek to find effective ways of supporting equitable participation in the Research Nexus. The conference resonated with a call for continued work in fostering accessibility, sharing, and leveraging resources to accelerate research and innovation in Africa.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/WACREN-Ambassadors2.jpeg"
alt="Photo with our Ambassadors from West Africa at WACREN 2024 event: Blessing Abumere - Nigeria, Audrey Kenni Nganmeni - Cameroon, Richard Lamptey - Ghana and Oumy Ndiaye - Senegal." width="75%">&lt;figcaption>
&lt;p>Photo with our Ambassadors from West Africa at WACREN 2024 event: Blessing Abumere - Nigeria, Audrey Kenni Nganmeni - Cameroon, Richard Lamptey - Ghana and Oumy Ndiaye - Senegal.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>Conversations with Crossref Ambassadors brought about a shared narrative across universities in some African countries. These institutions are actively embracing digital shifts, setting up institutional repositories using platforms like DSpace and OJS. However, challenges persist, particularly in funding and technical capacity. It&amp;rsquo;s heartening to see how national and regional research and education networks step in to help in internet connectivity, opening up collaboration opportunities with other interoperable infrastructure, setting up repositories, providing hosting services and event managing content identifiers.&lt;/p>
&lt;p>Deceptive publishing practices remain a shared concern, and we’ve had requests at these meetings for stricter inclusion criteria for membership of Crossref to ensure quality and trustworthiness of articles accessible through Crossref metadata.&lt;/p>
&lt;p>We’ve explained to those we’ve met that Crossref doesn’t (and can’t) assess the quality of content or the integrity of the research process. We don’t have the people or the skills, and it isn’t our mission to be the gatekeepers of research quality. A DOI record is just an indication that something was published, it isn’t an indication of quality.&lt;/p>
&lt;p>However, we do still have a vital role in preserving the integrity of the scholarly record. We provide the infrastructure which enables those who produce scholarly outputs to provide metadata (effectively evidence) about how they ensure the quality of content and how the outputs fit into the scholarly record. The scholarly record - that network of published outputs, inputs, relationships and contexts - is captured through the metadata records that our members register with us, and that we then distribute freely and openly through our API. The richer and more comprehensive Crossref records are, the more context there is for our members and for the whole scholarly research ecosystem to make their own decisions around trustworthiness. Blocking access to the infrastructure creates gaps in the scholarly record, but also potentially blocks legitimate newcomers.&lt;/p>
&lt;p>“Crossref is focused on enriching metadata to provide more and better trust signals while keeping barriers to membership and participation as low as possible to enable an inclusive scholarly record.”
Read more about &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">Crossref’s role in preserving the integrity of the Scholarly record&lt;/a> in the blog post by Amanda Bartell.&lt;/p>
&lt;p>While the landscape of digital scholarly publication witnesses significant strides, a crucial need persists, the importance of preserving and interconnecting metadata to the global scholarly record. It&amp;rsquo;s not just about discoverability, a theme resonating strongly within the community, but about enabling reproducibility, upholding research and editorial integrity, and facilitating reporting and assessment.&lt;/p>
&lt;h2 id="the-path-forward">The path forward&lt;/h2>
&lt;p>As I reflect on this year of immersing myself within the Crossref community, building awareness in new communities, and learning more about the different perceptions across the region, it feels like a personal progression of growth and discovery. From the captivating in-person moments to the global webinars and collaborative efforts to address challenges in scholarly communication, this journey is not just a professional pursuit; it&amp;rsquo;s a personal exploration. The path forward involves continued support, intensified awareness-building, and sustained dialogue, ensuring that the scholarly ecosystem continues to thrive, evolve, and leave a lasting impact.&lt;/p></description></item><item><title>Testing times</title><link>https://www.crossref.org/blog/testing-times/</link><pubDate>Wed, 03 Apr 2024 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/testing-times/</guid><description>&lt;p>One of the challenges that we face in Labs and Research at Crossref is that, as we prototype various tools, we need the community to be able to test them. Often, this involves asking for deposit to a different endpoint or changing the way that a platform works to incorporate a prototype.&lt;/p>
&lt;p>The problem is that our community is hugely varied in its technical capacity and level of ability when it comes to modifying their platform. Some mega-publishers, for instance, outsource their platforms and so are dependent on third party developers/organisations when they want to make a change. Many smaller publishers, by contrast, use systems such as OJS, which come with Crossref plugins that make life very easy… but that require hard code changes to accommodate prototypes. Such changes are way beyond the technical capacity of most journal editors.&lt;/p>
&lt;p>So how can we prototype new ideas and test them? One way is by creating new interstitial interfaces that allow people to manually supplement metadata or register for prototype services. Of course, this requires additional work on behalf of the user. Every time they wish to participate they have to visit an extra web page and re-input details that, surely, were included in the original deposit.&lt;/p>
&lt;p>Another way would be for plugin developers to have an advanced option field that allowed end-users to change their deposit endpoint. It would be excellent to see this feature in OJS, Janeway, and also proprietary systems. This would allow us to work with the community to test new prototype mechanisms, without forcing anyone to edit code. Many systems already include the ability to switch between Crossref’s “test” system and our live deposit API. All I am really suggesting here is the logical next step: allow advanced users to specify a deposit endpoint of their own choosing so that we can give them access to prototype systems.&lt;/p>
&lt;p>Of course, it’s not always that simple. Sometimes, prototype systems will require new data fields on submission, for example. In those cases, there is nothing for it except to modify the plugin or to provide a separate interface. But sometimes, as in the case of the Op Cit project (more on which soon), all the data is already in place; we just need to direct users to a different endpoint. Such changes would definitely make testing times less trying.&lt;/p></description></item><item><title>Mending Chesterton's Fence: Open Source Decision-making</title><link>https://www.crossref.org/blog/mending-chestertons-fence-open-source-decision-making/</link><pubDate>Mon, 18 Mar 2024 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/mending-chestertons-fence-open-source-decision-making/</guid><description>&lt;p>When each line of code is written it is surrounded by a sea of context: who in the community this is for, what problem we&amp;rsquo;re trying to solve, what technical assumptions we&amp;rsquo;re making, what we already tried but didn&amp;rsquo;t work, how much coffee we&amp;rsquo;ve had today. All of these have an effect on the software we write.&lt;/p>
&lt;p>By the time the next person looks at that code, some of that context will have evaporated. There may be helpful code comments, tests, and specifications to explain how it should behave. But they don&amp;rsquo;t explain the path not taken, and why we didn&amp;rsquo;t take it. Or those occasions where the facts changed, so we changed our mind.&lt;/p>
&lt;p>Some parts of our system are as old as Crossref itself. Whilst our process still involves coffee, it&amp;rsquo;s safe to say that most of our working assumptions have changed, and for good reasons! We have to be very careful when working with our oldest code. We always consider why it was written that way, and what might have changed since. We&amp;rsquo;re always on the look out for &lt;a href="https://thoughtbot.com/blog/chestertons-fence" target="_blank">Chesterton&amp;rsquo;s Fence&lt;/a>!&lt;/p>
&lt;h2 id="leaving-a-trail">Leaving a Trail&lt;/h2>
&lt;p>We&amp;rsquo;re building a new generation of systems at Crossref, and as we go we&amp;rsquo;re being deliberate about supporting the people who will maintain it.&lt;/p>
&lt;p>When our oldest code was written, the software development team all worked in an office with a whiteboard or three, and the code was proprietary. Twenty years later, things are very different. The software development team is spread over 8 timezones. Thanks to &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">POSI&lt;/a>, all the new code we write is open source, so the next people to read that code might not even be Crossref staff.&lt;/p>
&lt;p>Working increasingly asynchronously, without that whiteboard, we need to record the options, collect evidence, and peer-review them within the team.&lt;/p>
&lt;p>So for the past couple of years the software team has maintained a &lt;a href="https://crossref.gitlab.io/engineering/decision-records/" target="_blank">decision register&lt;/a>. The first decision we recorded was that we should record decisions! Since then we have recorded the significant decisions as they arise. Plus some &lt;a href="https://crossref.gitlab.io/engineering/decision-records/dr-0003/" target="_blank">historical ones&lt;/a>.&lt;/p>
&lt;p>These aren&amp;rsquo;t functional specifications, which describe what the system should do. It&amp;rsquo;s the decisions and trade-offs we made along the way to get to the how. Look out for another blog post about specifications.&lt;/p>
&lt;p>By leaving a trail of explanations as we go, we make it easier for people to understand why code was written, and what has changed. We&amp;rsquo;re writing the story of our new systems. This makes it easier to alter the system in future in response to changes in our community, and the metadata they use.&lt;/p>
&lt;h2 id="difficult-decisions">Difficult Decisions&lt;/h2>
&lt;p>There are some fun challenges to building systems at Crossref. We have a lot of data. Our schema is very diverse, and has a vast amount of domain knowledge embedded in it. It&amp;rsquo;s changed over time to accommodate 20 years of scholarly publishing innovations. Our community is diverse too, from small one-person publishers with a handful of articles, through to large ones that publish millions.&lt;/p>
&lt;p>What might be an obvious decision for a database table with a thousand rows doesn&amp;rsquo;t always translate to a million. When you get to a billion, things change again. An initially sensible choice might not scale. And a scalable solution might look over-engineered if we had millions of DOIs, rather than hundreds of millions.&lt;/p>
&lt;p>The diversity of the data also poses challenges. A very simple feature might get complicated or expensive when it meets the heterogeneity of our metadata and membership. What might scale for journal article or grant metadata might not work for book chapters.&lt;/p>
&lt;p>The big decisions need careful discussion, experimentation, and justification.&lt;/p>
&lt;h2 id="2nf-or-not-2nf">2NF or not 2NF&lt;/h2>
&lt;p>One such recent decision was how we structure our SQL schema for the database that powers our new &amp;lsquo;relationships&amp;rsquo; REST API endpoint, currently in development.&lt;/p>
&lt;p>The data model is simple: we have a table of Relationships which connect pairs of Items. And each Item can have properties (such as a type). The way to model this is straightforward, following conventional &lt;a href="https://en.wikipedia.org/wiki/Database_normalization" target="_blank">normalization rules&lt;/a>:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/open-source-decision-making/1.svg"
alt="SQL Tables, normalised" width="100%">
&lt;/figure>
&lt;p>We built the API around it, and all was well.&lt;/p>
&lt;p>We then added a feature which lets you look up relationships based on the properties of the subject or object. For example &amp;ldquo;find citations where the subject is an article and the object is a dataset&amp;rdquo;. This design worked well in our initial testing. We loaded more data into it, and it continued to work well.&lt;/p>
&lt;p>And then, the context changed. Once we tested loading a billion relationships in the database, the performance dropped. The characteristics of the data: size, shape and distribution, reached a point where the database was unable to run queries in a timely way. The PostgreSQL query planner became unpredictable and occasionally produced some quite exciting query plans (to non-technical readers: databases are neither the time nor the place for excitement).&lt;/p>
&lt;p>This is a normal experience in scaling up a system. We expected that something like this would happen at some point, but you don&amp;rsquo;t know when it will happen until you try. We bounced around some ideas and came up with a couple of alternatives. Each made trade-offs around processing time, data storage and query flexibility. The best way to evaluate them was to use real data at a representative scale.&lt;/p>
&lt;p>One of the options was denormalisation. This is a conventional solution to this kind of problem, but was not our first choice as it involves extra machinery to keep the data up-to-date, and more storage. It would not have been the correct solution for a smaller dataset. But we had the evidence that the other two approaches would not scale predictably.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/open-source-decision-making/2.svg"
alt="SQL Tables, normalised" width="100%">
&lt;/figure>
&lt;br>
&lt;p>By combining the data into one table, we can serve up API requests much more predictably, and with much better performance. This code is now running with the right performance. Technical readers note that this diagram is simplified. The &lt;a href="https://gitlab.com/crossref/manifold/-/blob/62fc458b280e71c6b6221908fb7824bd3573726f/src/main/resources/db/migration/V1_0_0__initial_schema_creation.sql#L356" target="_blank">real SQL schema&lt;/a> is a little different.&lt;/p>
&lt;p>Without writing this history down, and explaining what we tried, someone might misunderstand the reason for the code and try to simplify it. Decision record &lt;a href="https://crossref.gitlab.io/engineering/decision-records/dr-0500/" target="_blank">DR-0500&lt;/a> guards against that.&lt;/p>
&lt;p>But one day, when the context changes, future developers will be able to come back and modify the code, because they understand why it was like that in the first place.&lt;/p></description></item><item><title>Credential Checking at Crossref</title><link>https://www.crossref.org/blog/credential-checking-at-crossref/</link><pubDate>Fri, 15 Mar 2024 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/credential-checking-at-crossref/</guid><description>&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/credential-checking.png" width="75%">
&lt;/figure>
&lt;/span>
&lt;/div>
It turns out that one of the things that is really difficult at Crossref is checking whether a set of Crossref credentials has permission to act on a specific DOI prefix. This is the result of many legacy systems storing various mappings in various different software components, from our Content System through to our CRM.
&lt;p>To this end, I wrote a basic application, &lt;a href="https://gitlab.com/crossref/labs/credcheck" target="_blank">credcheck&lt;/a>, that will allow you to test a Crossref credential against an API.&lt;/p>
&lt;p>There are two modes of usage. First, a command-line interface that allows you to run a basic command and get feedback:&lt;/p>
&lt;p>          Usage: cli.py [OPTIONS] USERNAME PASSWORD DOI&lt;/p>
&lt;p>Second, you can use it as a programmatic library in Python:&lt;/p>
&lt;p>          import cred&lt;br>
          credential = cred.Credential(username=username, password=password, doi=doi)&lt;/p>
&lt;p>          if not credential.is_authenticated():&lt;br>
          &amp;hellip;&lt;/p>
&lt;p>          if credential.is_authorised():&lt;br>
          &amp;hellip;&lt;/p>
&lt;p>The tool splits down authentication (whether the given username and password are valid) and authorisation (whether the valid credentials are usable against a specific DOI/prefix).&lt;/p>
&lt;p>For technical information, the way this works is by attempting to run a report on the specific DOI in question and then scraping the response page. We hope, at some future point, that there will be a real API for this, but for now this solves the problem as a bridge.&lt;/p></description></item><item><title>Subject codes, incomplete and unreliable, have got to go</title><link>https://www.crossref.org/blog/subject-codes-incomplete-and-unreliable-have-got-to-go/</link><pubDate>Wed, 13 Mar 2024 00:00:00 +0000</pubDate><author>Patrick Polischuk</author><guid>https://www.crossref.org/blog/subject-codes-incomplete-and-unreliable-have-got-to-go/</guid><description>&lt;p>Subject classifications have been available via the REST API for many years but have not been complete or reliable from the start and will soon be deprecated. dfdfd&lt;/p>
&lt;p>The &lt;code>subject&lt;/code> metadata element was born out of a Labs experiment intended to enrich the metadata returned via Crossref Metadata Search with All Subject Journal Classification codes from Scopus. This feature was developed when the REST API was still fairly new, and we now recognize that the initial implementation worked its way into the service prematurely.&lt;/p>
&lt;p>While subject classifications in Crossref metadata could be very useful, the current implementation in the REST API is problematic for three primary reasons:&lt;/p>
&lt;p>&lt;strong>They are misleadingly exposed in the API as a property of the work,&lt;/strong> when in fact they are a property of the container (e.g. a journal or conference proceeding). Just because a journal’s broad topic category is “X” doesn’t mean that a particular article in the journal is about “X.”&lt;/p>
&lt;p>&lt;strong>Existing works may have outdated subjects.&lt;/strong> Originally, subject codes were not updated periodically. However, subjects exposed in the /journals route are now updated once a day. Those exposed via the /works endpoint are indexed along with works, and so when a new subject list is ingested, new DOIs start getting new subjects, but existing works may have outdated subjects. We don’t have a mechanism for forcing updates when incorrect subject values are returned via the REST API, so this data can be stale and incorrect.&lt;/p>
&lt;p>&lt;strong>They are not applied to everything.&lt;/strong> This is because the Scopus list does not cover all the journals that Crossref has (conversely, the Scopus list contains some journals Crossref does not have), and does not contain other container types.&lt;/p>
&lt;p>The Labs team investigated options for improving subject classification coverage but ultimately concluded that there are insufficient solutions to the coverage problem. For more, please see Esha Datta’s findings published at Force11’s Upstream: &lt;a href="https://doi.org/10.54900/n6dnt-xpq48" target="_blank">https://doi.org/10.54900/n6dnt-xpq48&lt;/a>&lt;/p>
&lt;p>Where does that leave us? Rather than continuing to supply unreliable and misleading subject category metadata, we will be deprecating this feature in the coming weeks. To minimize disruption and avoid breaking changes at this time, we will be removing this data from our index, so the subject element will simply be empty. We may remove the &lt;code>subject&lt;/code> element in the future.&lt;/p>
&lt;p>We know that the community’s desire for subject-based analysis of metadata is very strong, and we have supported efforts to establish a multidisciplinary taxonomy. Inaccurate codes in the meantime do not help but actually hinder these efforts, giving the false impression that they are correct.&lt;/p>
&lt;p>We aim to deprecate the subject codes in April of this year.&lt;/p>
&lt;p>Please let us know if you have any questions or concerns by leaving a comment below, which will start a thread in our community forum.&lt;/p>
&lt;p>&lt;strong>Frequently asked questions&lt;/strong>&lt;/p>
&lt;p>Q. Will the subject field continue to be available and functional?&lt;br>
A. The subject metadata element will continue to be included in the JSON response but will not return any values.&lt;/p>
&lt;p>Q. Will new subject codes be added in the future?&lt;br>
A. We do not have any current plans to add new subject codes in the future.&lt;/p>
&lt;p>Q. I received a notification about this, but we don’t use subject codes. Do I need to do anything?&lt;br>
A. No, if you do not currently use the &lt;code>subject&lt;/code> element, you do not need to do anything about this change.&lt;/p>
&lt;p>Q. I noticed that wrong or inaccurate subject codes were assigned to my works. Is this a solution?&lt;br>
A. Yes. Until we can identify an accurate and sustainable system for assigning subject codes to Crossref metadata records, we want to stop assigning inaccurate subject codes and remove all existing assignments.&lt;/p></description></item><item><title>DOAJ and Crossref renew their partnership to support the least-resourced journals</title><link>https://www.crossref.org/blog/doaj-and-crossref-renew-their-partnership-to-support-the-least-resourced-journals/</link><pubDate>Wed, 06 Mar 2024 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/doaj-and-crossref-renew-their-partnership-to-support-the-least-resourced-journals/</guid><description>&lt;p>Crossref and DOAJ share the aim to encourage the dissemination and use of scholarly research using online technologies and to work with and through regional and international networks, partners, and user communities for the achievement of their aims to build local institutional capacity and sustainability.
Both organisations agreed to work together in 2021 in a variety of ways, but primarily to ‘encourage the dissemination and use of scholarly research using online technologies, and regional and international networks, partners and communities, helping to build local institutional capacity and sustainability around the world.’ Some of the fruits of this labour are:&lt;/p>
&lt;ul>
&lt;li>DOAJ added support for Crossref XML to make it easier for publishers to upload metadata&lt;/li>
&lt;li>Closer collaboration between customer/member support at both organisations, making it easier for publishers and journal editors to navigate both service’s technologies&lt;/li>
&lt;li>the launch of PLACE: ‘a ‘one-stop shop’ for information to support publishers in adopting best practices the industry developed’ (together with other partners)&lt;/li>
&lt;li>a pilot gap analysis of the journals in DOAJ with the possibility of helping them start to use and resolve DOIs.&lt;/li>
&lt;/ul>
&lt;p>The new agreement, signed earlier this month, will slightly shift focus to build upon existing collaborations, particularly around metadata. One of the primary sections of the MOU is enhancing support for the least-resourced journals by:&lt;/p>
&lt;ul>
&lt;li>Assigning DOIs and depositing the metadata with Crossref&lt;/li>
&lt;li>Finding ways to improve their DOAJ application experience to help them become indexed&lt;/li>
&lt;li>Collect and ingest their Crossref metadata into DOAJ&lt;/li>
&lt;li>Help them to get preserved via JASPER or similar initiatives&lt;/li>
&lt;li>Help identify other local partners, such as Crossref Sponsoring Organisations, to support their use of Crossref services&lt;/li>
&lt;/ul>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>It’s great that we can further underpin what is already a good working relationship. Both Crossref and DOAJ are central to discovery so it’s a natural partnership. Helping journals meet better standards and become indexed to make them more discoverable on a global scale is at the heart of our strategy. This agreement opens up a new avenue that allows the community to really focus on supporting those journals and the research they publish.’&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Joanna Ball, Managing Director of DOAJ&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>‘The collaborations with DOAJ so far only reconfirmed our shared goal to help make the global scholarly communications system more equitable wherever we can. Our joint projects aim to seek out and devise support for resource-constrained journals in multiple ways. DOAJ’s work is essential in helping journals to develop good practice, while Crossref offers an open infrastructure to ensure all journals can be included and discoverable in the global scholarly record.’&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Ginny Hendricks, Director of Member and Community Outreach at Crossref&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>&amp;mdash;&amp;mdash;&amp;ndash; END &amp;mdash;&amp;mdash;&lt;/p>
&lt;h3 id="about-doaj">About DOAJ&lt;/h3>
&lt;p>DOAJ is a community-curated online directory that indexes and provides access to high quality, open access, peer reviewed journals. DOAJ deploys around one hundred carefully selected volunteers from the community of library and other academic disciplines to assist in curating open access journals. This independent database contains over 20,400 peer-reviewed open access journals covering all areas of science, technology, medicine, social sciences, arts and humanities. DOAJ is financially supported worldwide by libraries, publishers and other like-minded organisations. DOAJ services (including the evaluation of journals) are free for all, and all data provided by DOAJ are harvestable via OAI/PMH and the API. See &lt;a href="https://doaj.org/" target="_blank">https://doaj.org/&lt;/a> for more information.&lt;/p>
&lt;h3 id="about-crossref">About Crossref&lt;/h3>
&lt;p>Crossref is a global community-governed open scholarly infrastructure that makes all kinds of research objects easy to find, assess, and reuse through a number of services critical to research communications, including an open metadata API that sees over 1.5 billion queries every month. Crossref’s ~20,000 members come from 155 countries and are made up of universities, publishers, funders, government bodies, libraries, and research groups. Their ~155 million DOI records contribute to the collective vision of a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.&lt;/p>
&lt;p>For more information please contact:
&lt;a href="mailto:dominic@doaj.org">dominic@doaj.org&lt;/a> and &lt;a href="mailto:rclark@crossref.org">rclark@crossref.org&lt;/a>&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/doaj-crossref-twitter-post-new-2024-1.png" width="80%">
&lt;/figure></description></item><item><title>What do we know about DOIs</title><link>https://www.crossref.org/blog/what-do-we-know-about-dois/</link><pubDate>Thu, 29 Feb 2024 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/what-do-we-know-about-dois/</guid><description>&lt;p>Crossref holds metadata for approximately 150 million scholarly artifacts. These range from peer reviewed journal articles through to scholarly books through to scientific blog posts. In fact, amid such heterogeneity, the only singular factor that unites such items is that they have been assigned a document object identifier (DOI); a unique identification string that can be used to resolve to a resource pertaining to said metadata (often, but not always, a copy of the work identified by the metadata).&lt;/p>
&lt;p>What, though, do we actually know about the state of persistence of these links? How many DOIs resolve correctly? How many landing pages, at the other end of the DOI resolution, contain the information that is supposed to be there, including the title and the DOI itself? How can we find out?&lt;/p>
&lt;p>The first and seemingly most obvious way that we can obtain some of these data is by working through the most recent sample of DOIs and attempting to fetch metadata from each of them using a standard python script. This involves using the httpx library to attempt to resolve each of the DOIs to a resource, visiting that resource and seeing what the landing page yields.&lt;/p>
&lt;p>Even this is not straightforward. Landing pages can be HTML resources or they can be PDF files, among other things. In the case of PDF files, to detect a run of text is not simple as a single line break can be enough to foil our search. Nonetheless, when using this strategy we find the following statistics:&lt;/p>
&lt;p>Total DOI count in sample: 5000&lt;br>
Number of HTTP 200 response: 3301*&lt;br>
Percentage of HTTP 200 responses: 66.02%&lt;br>
Number of titles found on landing page: 1580&lt;br>
Percentage of titles found on landing page: 31.60%&lt;br>
Number of DOIs in recommended format found on landing page: 1410&lt;br>
Percentage of DOIs in recommended format found on landing page: 28.20%&lt;br>
Number of titles and DOIs found on landing page: 929&lt;br>
Percentage of titles and DOIs found on landing page: 18.58%&lt;br>
Number of PDFs found on landing page: 1469&lt;br>
Percentage of PDFs found on landing page: 29.38%&lt;br>
Percent of PDFs found on landing pages that loaded: 44.50%&lt;/p>
&lt;p>* an HTTP 200 response means that the web page loaded correctly&lt;/p>
&lt;p>While these numbers look quite low, the problem here is that a large number of scholarly publishers use Digital Rights Management techniques on their sites that block a crawl of this type. We can use systems like Playwright to remote control browsers to do the crawling, so that the request looks as much like a genuine user as possible and to evade such detection systems. However, lots of these sites detect headless browsers (where the browser is invisible and running on a server) and block them with a 403 Permission Denied error.&lt;/p>
&lt;p>There&amp;rsquo;s &lt;a href="https://github.com/infosimples/detect-headless" target="_blank">a great Github javascript suite&lt;/a> that aims to help evade headless detection. The tests it uses are:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>User Agent: in a browser running with puppeteer in headless mode, user agent includes Headless.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>App Version: same as User Agent above.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Plugins: headless browsers don&amp;rsquo;t have any plugins. So we can say that if it has plugin it&amp;rsquo;s headful, but not otherwise since some browsers, like Firefox, don&amp;rsquo;t have default plugins.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Plugins Prototype: check if the Plugin and PluginsArray prototype are correct.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Mime Type: similar to Plugins test, where headless browsers don&amp;rsquo;t have any mime type&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Mime Type Prototype: check if the MimeType and MimeTypeArrayprototype are correct.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Languages: all headful browser has at least one language. So we can say that if it has no language it&amp;rsquo;s headless.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Webdriver: this property is true when running in a headless browser.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Time elapse: it pops an alert() on page and if it&amp;rsquo;s closed too fast, means that it&amp;rsquo;s headless.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Chrome element: it&amp;rsquo;s specific for chrome browser that has an element window.chrome.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Permission: in headless mode Notification.permission and navigator.permissions.query report contradictory values.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Devtool: puppeteer works on devtools protocol, this test checks if devtool is present or not.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Broken Image: all browser has a default nonzero broken image size, and this may not happen on a headless browser.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Outer Dimension: the attributes outerHeight and outerWidth have value 0 on headless browser.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Connection Rtt: The attribute navigator.connection.rtt,if present, has value 0 on headless browser.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Mouse Move: The attributes movementX and movementY on every MouseEvent have value 0 on headless browser.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Using the stealth plugin for Playwright also allows us to evade most of these checks. This just leaves Mouse Move and Broken Image detection, which I thought would not outweigh all the other factors. We can also jitter the connection with arbitrary delays so that it should appear to be coming at random intervals, rather than a robotic crawl.&lt;/p>
&lt;p>Yet the basic fact is that we are still blocked from crawling many sites. This does not happen when we put the browser into headful mode, so current detection techniques have clearly evolved in the past half decade (since Detect Headless) was designed.&lt;/p>
&lt;p>If, however, we run the browser in a headful mode, the results are somewhat stunningly different:&lt;/p>
&lt;p>Total DOI count in sample: 5000&lt;br>
Number of HTTP 200 response: 4852&lt;br>
Percent of HTTP 200 responses: 97.04%&lt;br>
Number of titles found on landing page: 2547&lt;br>
Percentage of titles found on landing page: 50.94%&lt;br>
Number of DOIs in recommended format found on landing page: 2424&lt;br>
Percentage of DOIs in recommended format found on landing page: 48.48%&lt;br>
Number of titles and DOIs found on landing page: 1574&lt;br>
Percentage of titles and DOIs found on landing page: 31.48%&lt;br>
Number of PDFs found on landing page: 2085&lt;br>
Percentage of PDFs found on landing page: 41.70%&lt;br>
Percentage of PDFs found on landing pages that loaded: 42.97%&lt;/p>
&lt;p>Let&amp;rsquo;s talk about the resolution statistics. Other studies, looking at general links on the web, have found a link-rot rate of about 60%-70% over a ten-year period &lt;a href="https://www.zotero.org/google-docs/?jyT6HY" target="_blank">(Lessig, Zittrain, and Albert 2014; Stox 2022)&lt;/a>. The DOI resolution rate that we have, with 97% of links resolving (or a 3% link-rot rate), is far better and more robust than a web link in general.&lt;/p>
&lt;p>Is 3% a good or a bad number? It&amp;rsquo;s more robust than the web in general, but it still means that for every 100 DOIs, just under 3 will fail to resolve. We also cannot tell whether these DOIs are resolving to the correct target, except by using the metadata detection metrics (are the title and DOI on the landing page, which we could only detect at a far lower rate). It is entirely possible for a website to resolve with an HTTP 200 (OK) response, but for the page in question to be something very different to what the user expected, a phenomenon dubbed content drift. A good example is domain hijacking, where a domain name expires and spam companies buy them up. These still resolve to a web page, but instead of an article on RNA, for a hypothetical example, the user gets adverts for rubber welding hose. That said, other studies are also prone to this and there is no guarantee that content drift doesn&amp;rsquo;t affect a huge proportion of supposedly good links in the other studies, too.&lt;/p>
&lt;p>Of course, one of the most frustrating elements of this exercise is having to work around publisher blocks on content when visiting using a server-only robot script. It&amp;rsquo;s important for us periodically to monitor the uptime rate of the DOI system. We also recognise, though, that publishers want to block malicious traffic. However, we can&amp;rsquo;t perform our monitoring in an easy, automatic way if headless scripts are blocked from resolving DOIs and visiting their respective landing pages. This is not even a call for open access; it&amp;rsquo;s just saying that current anti-bot techniques, sometimes implemented for legitimate reasons, stifle our ability to know the landscape. Even if the bot resolved a DOI to just a paywall, it would be easier for us to monitor this than it is now. Similarly, CAPTCHA systems such as Cloudflare that would seem to offer an easy way to distinguish between humans (good) and robots (bad) can make life very difficult at the monitoring end. We would certainly be grateful for any proposed solution that could help us to work around these mechanisms.&lt;/p>
&lt;h2 id="conclusion">Conclusion&lt;/h2>
&lt;p>The context in which I wanted to know this information was so that we can take a snapshot of a page and then, at a later stage, determine whether it is down or has changed substantially. To do this, we are developing &lt;a href="https://gitlab.com/crossref/labs/shelob/" target="_blank">Shelob&lt;/a>, an experimental content drift spider system; that&amp;rsquo;s what we&amp;rsquo;ve used so far to conduct this analysis. Over time, Shelob will evolve, we hope, to give us a way to detect when content has drifted or gone offline. If, however, we can&amp;rsquo;t detect whether an endpoint is good in the first place, then we likewise cannot detect when things have gone wrong. On the other hand, if, when we first visit, we find the DOI and title on the landing page, but at some future point this degrades, we might be able to say with some confidence that the original has died. I, personally, would encourage publishers not to block automated crawlers, because it&amp;rsquo;s good when we can determine these types of figures.&lt;/p>
&lt;h2 id="works-cited">Works Cited&lt;/h2>
&lt;p>Lessig, Lawrence, Jonathan Zittrain, and Kendra Albert. 2014. &amp;lsquo;Perma: Scoping and Addressing the Problem of Link and Reference Rot in Legal Citations&amp;rsquo;. &lt;em>Harvard Law Review&lt;/em> 127 (4). &lt;a href="https://harvardlawreview.org/forum/vol-127/perma-scoping-and-addressing-the-problem-of-link-and-reference-rot-in-legal-citations/.%28https://www.zotero.org/google-docs/?970bfS" target="_blank">https://harvardlawreview.org/forum/vol-127/perma-scoping-and-addressing-the-problem-of-link-and-reference-rot-in-legal-citations/.(https://www.zotero.org/google-docs/?970bfS&lt;/a>&lt;/p>
&lt;p>Stox, Patrick. 2022. &amp;lsquo;Ahrefs Study on Link Rot&amp;rsquo;. SEO Blog by Ahrefs. 29 April 2022. &lt;a href="https://ahrefs.com/blog/link-rot-study/" target="_blank">https://ahrefs.com/blog/link-rot-study/&lt;/a>.&lt;/p></description></item><item><title>The Lammey Effect</title><link>https://www.crossref.org/blog/the-lammey-effect/</link><pubDate>Fri, 16 Feb 2024 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/the-lammey-effect/</guid><description>&lt;p>We’re equally sad and proud to report that &lt;a href="https://www.crossref.org/authors/rachael-lammey">Rachael Lammey&lt;/a> is moving on in her career to the very lucky team at 67Bricks. Her last day at Crossref is today, Friday 16th February. Which is too soon for us, but very exciting for her!&lt;/p>
&lt;p>It’s hard to overstate Rachael&amp;rsquo;s impact on Crossref’s growth and success in her 12 years here. She started as a Product Manager where she developed that role into a broad and central function, and soon moved into the newly-formed community team as International Outreach Manager where she grew important programs such as Sponsors, Ambassadors, a series of ‘LIVE’ events around the world, and she went on to manage her own team and establish some of the most important strategic relationships that Crossref now feels fortunate to have.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-b&amp;amp;w.jpeg" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
Rachael was a significant part of the growth and adoption of new initiatives such as Crossmark, Similarity Check, the REST API, preprints, grants, data citation, and ROR. She's contributed to numerous organisations such as EASE, ALPSP, SSP, ISMTE, STM, and most recently co-Chaired the NISO working group on retractions and corrections.
&lt;p>As Head of Strategic Initiatives, and most recently, Director of Product, Rachael has shown dedication and leadership, supporting and strengthening not just her own teams but all of us across the organisation, encouraging us to do better while being one of the easiest people to work with.&lt;/p>
&lt;p>The &amp;lsquo;&lt;a href="https://en.wikipedia.org/wiki/Butterfly_effect" target="_blank">butterfly effect&lt;/a>&amp;rsquo; is the notion that the world is deeply interconnected and that one small occurrence can influence a much larger complex system. Rachael embodies that notion, having created positive ripples and waves&amp;mdash;and certainly many connections&amp;mdash;in the scholarly record, in our organisation, and across the community.&lt;/p>
&lt;h2 id="messages-from-colleagues">Messages from colleagues&lt;/h2>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael, I was saddened when I first heard the news that you were moving on to another opportunity. Your professionalism, work ethic, and positive attitude have been inspirational to work around. I have enjoyed the opportunities we have had to collaborate. As you move on to a new experience I wish you success and happiness in your future endeavors. Your presence will be missed at Crossref! Best Wishes.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Ryan&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I will miss you, Rachael. It has been great working with you for the few months that I have been at Crossref. I also cannot forget kayaking together with you and capsizing on the return to the shore, but almost professionally recovering. We would have made the best team this time around. I wish you all the best and many wins in your new role.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Obanda&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I feel like the luckiest human to have worked with Rachael over the last 4 years. She’s the perfect mix of smart and funny and knowing how to get things done. Rachael is a big part of what makes Crossref culture so special — I’ve never felt so supported in a role as when Rachael was my manager and for that I am very grateful. I will miss her wit and humor and her pragmatic approach to work and life!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Sara&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-live-brazil.jpg"
alt="Team at Crossref LIVE event in Brazil, 2016" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
&lt;p>One of my first ‘Crossref LIVE’ events was with Rachael in Brazil in 2016. At the time, my role mostly focused on membership, and we had just started working more closely with ABEC, a large organisation in Brazil that sponsored quite a few members. Rachael managed the sponsor program then and thought this would be a good opportunity to collaborate with a sponsor on an event, and she asked me to join her. There is so much planning for these - venues, local partners, presentations, meetings - and she had all the details in order and made the event such a success. Rachael was supportive, encouraging, and I learned so much from working with her. The Brazil trip was such a positive experience that I realised I wanted to focus more closely on community engagement. Rachael encouraged me to do so.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-indonesia.jpg" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
&lt;p>She and I went on to partner on more LIVE events together. Our time in Indonesia was perhaps one of the most memorable for me - as well as our LIVE event, we had an unexpected tour of Yogyakarta with our Indonesian hosts, involving a tour of Prambanan Temple (see photo below), batik fabric shopping, visiting a few universities, and a stop at our hosts’ home. All the while trying not to let the winding car ride and traffic get the better of us. Our event the next day went perfectly, and I told her, half-jokingly, that the whole experience renewed my faith in humanity. Of note, we also drank the only bottle of wine available in the hotel bar.&lt;/p>
&lt;p>Rachael was also my Crossref running buddy, and we spent quite a few miles together - in Brazil, NH, Maine, Oxford, and Spain. During our runs, topics ranged from Game of Thrones to Idris Elba to sportsing, but not so much about work. The next time I find myself in England, we will run a few more miles together, followed by a pint. Thank you for everything!&lt;/p>
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Susan&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Many have pointed out how talented, wise, or skilled you are and I certainly will not contradict a single word of it but that&amp;rsquo;s not what comes to mind first for me. Those traits, while true, pale in comparison to the person you are. Your positive, bright demeanor and the way everyone always feels better just being around you. I have dreaded some meetings from time to time. But whenever I&amp;rsquo;ve been involved in something with you, I&amp;rsquo;ve always left feeling better than when I started (no matter how grumpy I may have entered). You have been a consistent bright light in the Crossref constellation and you will truly be missed.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Jon&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael! You are the best at cutting through all the bulls**t to get at what really needs to happen and why! Your knowledge is broad and deep, as is your institutional memory for all things Crossref and scholarly publishing. And your unflappability in pretty much any context is admirable and inspiring. We’ll miss you big time! Wishing you all the success at 67Bricks and otherwise.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Shayn&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Hey Rachael - I’m happy to be writing this note of Congratulations!! to you, particularly because it would be awkward to explain this bit of verklempt I’m feeling. Our interactions have been limited, but my impressions of you are of confidence, calm, capability, and collegiality. Thanks so much for your work with the Billing team. I’m sorry we are losing you, and am also so glad to know that you are out there at the forefront of inspiring others elsewhere, not only in the work you do, but also how you go about it.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Laura&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-uksg.jpg" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
Hey Rachael, Just a big THANK YOU for helping me out all this time. I've had so many questions, and you've always been there to answer them. I always knew I could count on you. Thanks for those heartening chats when I needed a boost, and for including me in webinars and recordings - it really helped me improve. Remember that funny mistake I made on a recording when I called us 'Rochael'? We sure had a good laugh! I'm gonna miss those times and working with you. Can't wait to catch up with you over a drink the next time I'm in town. Wishing you all the very best and once again, thanks for everything!
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Rosa&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I am happy we got to enjoy some delicious vegetarian/vegan meals and wine together. I guess I should also mention that I enjoyed recruiting, HR and business fun with you too. Thank you for being such a big part of Crossref for 12 years! Have fun conquering your new chapter. Congratulations!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Michelle&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael! You will be missed. I have really enjoyed our chats and work together. I will miss our wide ranging talks about food, books, and your descriptions of all the sportsing, which I would admire because I can barely manage a short run. :-) Thanks so much for being you and let’s stay in touch! Congratulations on your new endeavor, you’re going to be great.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Esha&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>When Rachel joined Crossref she brought a lot of enthusiasm and interest in learning about all that we were doing and also about what we could do. Her ideas and engaging leadership are wonderful for creating interest and drive to make projects happen. It has been wonderful to work with her over the 12 years here. I always look forward to seeing her and hearing what she has been doing outside of Crossref as well as inside. I will miss her but I know she will be doing great things wherever she may be.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Tim&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>We’ve had a number of opportunities to reminisce, gassing each other up about how great it has been to work together, so I won’t do too much more of that here. But we will continue building on all of your contributions at Crossref and will carry forward your truth-telling and problem-solving approach to the work we do here. Best of luck with all the future has to offer, and we will certainly miss having you on the team.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Patrick P&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-zoom.png" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
Rachael - I will miss you. I’ve really enjoyed working with you, hanging out while traveling, and getting recommendations on good books to read. Crossref won’t be the same without you. I think you have worked in the most different areas of Crossref and on the most projects of anybody, ever. Your commitment, professionalism and humour helped make Crossref what it is today. Your sportsing is also very impressive. All the best.
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Ed&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;p>Not all heroes wear capes! Rachael defines that saying so much with her ethic of getting things DONE! I know she loves to get things done but the speed and quality in which that happens is second-to-none. Rachael will be massively missed at Crossref and 67Bricks don’t yet know what they have found. I enjoyed working with Rachael throughout my tenure at Crossref, she has helped me a huge amount in developing my programming skills and has always been encouraging throughout, especially with the &amp;rsquo;toil-bashing’ which is substantial and overwhelming at times.&lt;/p>
&lt;p>On a more personal note, she is a great drinking buddy and always motivates me to be more active… by making me feel lazy. The number of hours Rachael would work was crazy, but then I always thought that anyone who gets up that early to go for runs must be a little crazy! AIl the best in your future endeavors and don’t be a stranger.&lt;/p>
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Paul&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-park.jpeg" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
When I started at Crossref in March 2015—at the UKSG conference in Glasgow—Rachael was leading a workshop on text mining, showing off in full glory her ‘unicorn’ mix of skills from her technical knowledge of metadata and APIs to her facilitation techniques with a large group of people, clearly a community whose needs she knew inside out. Later that evening, Rachael took it upon herself to induct me in the ways of Crossref. One of the most important things she thought I should know was that we were all trusted and treated like adults - there was no micromanagement and I was to feel completely free to challenge the status quo.
&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-xmas.jpeg" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
After one of the first ‘LIVE’ events, in Vilnius, I realised that it was Rachael who had created and embodied that trusted vibe through her own approach. She has been entrusted with so many programs, projects, teams, and tricky situations. Almost every launch, release, announcement, or achievement at Crossref very likely had Rachael’s eye on it at some stage, certainly the ‘actually-getting-it-done’ stage. Our close working relationship over the last nine years grew into a great friendship and I’m not quite sure how I’ll feel when the reality sets in and she’s not here for a quick chat, always a reality check. Working with Rachael has been inspiring, exciting, reassuring, and hilarious (that dry 'Norn Iron' humour!). 67Bricks is so fortunate and I can’t wait to watch her help them go from strength to strength, just like she has done for Crossref. See you soon, Ranty Rachael, no doubt putting the world to rights over a bottle of Malbec and many eyerolls 🙄.
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Ginny&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Although our time working together only overlapped the short span of two years, I appreciate how much of a champion you were for ROR and everything else you did at Crossref! I’m sure you’ll continue to do the same, among many other great things, in your new journey at 67 Bricks. You will be missed!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Adam&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael, It has been wonderful working with you!! You are truly a special person. I always looked forward to when we chatted over slack, had a call together, or got to spend time together in person. You are sure to do amazing things on your next adventure. You will truly be missed!! I hope we can stay in touch! Good luck, Rachael!!! Fondly, Amy.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Amy&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I am happy I got to meet Rachael when I joined Crossref in December 2023. We spoke generally about the Products team at Crossref, the differences and similarities between the African and British culture and upcoming projects on automation. You were really patient towards explaining and providing great information on metadata and research. Thank you so much for always responding swiftly to my requests pertaining to Finance issues. I have no doubt that you would be missed at Crossref and would keep doing great things into the future!!! Congratulations Rachael.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Patience&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I will greatly miss working with you, Rachael; you have been a stalwart of reliability and enthusiasm during my time at Crossref and the organisation will not be the same without you. That said, of course, I wish you all the best of luck and success in your future endeavours!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Martin&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael- Congratulations on this new opportunity, I am thrilled for you! I am also very sorry that our time at Crossref did not overlap much and I am grateful for all the chances I had of interacting with you (including being able to meet you in London recently)- you were always very helpful and kind to me. I am hopeful that our paths will cross again in the future. We will definitely miss you here, and I wish you all the best for all the exciting things ahead.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Madhura&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-dog.jpg" width="75%">
&lt;/figure>
&lt;/span>
&lt;/div>
My third week at Crossref back in 2017 was at the annual meeting in Singapore, and not getting into the timezone and not sleeping for 4 days was eased by our visit to a rooftop nightclub on the penultimate night - just before you headed off to Indonesia for a series of meetings with members and sponsors. I still don’t know where you get all your energy!
&lt;p>I’m so sorry you’re leaving - I’ll really miss your honesty, your approach to getting things done, and of course seeing Rosie on our zoom calls. Looking forward to seeing what’s next for 67 Bricks - exciting times!&lt;/p>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Amanda B&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael, it’s been a pleasure to work with you. You’re always ready to help and ever full of information. We’ve only just got coordinated on the perennial challenge of timelines! You took things on and got them done, as you said. The world of schol comms won’t even know how much it has to thank you for, probably chiefly for seeing the Retraction Watch data acquisition through and opening it up for all. I will miss your honesty and energy, and the opportunity to challenge you again on the amount of food consumed in one sitting… I don’t think you’ll need luck in your next place, but I wish you that it is all you want it to be.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Kora&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I’m so glad to have met you in person over these couple of days in London shortly after I joined Crossref and it’s such a shame we didn’t have much time to work together more and spend more time (not working) together. Thanks for the introduction to the Scampi Fries - you’ve changed my life forever (for the best obviously)!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Maryna&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-bonfire.jpg" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
Thank you for your collaboration and friendship over the past decade! You will be missed. We've worked on a long series of abbreviations, acronyms, and portmanteaus! Thanks for organizing countless things, from conference satellites to conference rooms. Your long record as fire warden was unblemished. 67bricks will benefit from your singular drive and attention to detail. All the best!
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Joe&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael! One thing I admire most in a person is a facility with metaphor accompanied by the ability to see to the heart of a matter, and hoo boy do you have those qualities in spades. I remember so clearly your talk at the Crossref team meeting in Spain in 2023 in which you clarified the Big Picture for us all in an extremely enlightening way, and then, in a smaller but equally impressive achievement, casually mentioning in a Funder Registry meeting that funders should start &amp;ldquo;stretching and warming up&amp;rdquo; for the transition to ROR – boy did I latch on to that terrific image. I wish you all the best at 67 Bricks.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Amanda F&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-force11-2019.jpeg" width="75%">
&lt;/figure>
&lt;/span>
&lt;/div>
Rachael, thank you so much for all the support, patience, honesty, and determination. I will certainly miss our chats, work-related and non-work related. I wish you all the best in your new ventures!
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Dominika&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael - thank you for your boundless patience, generosity, and sense of humour. I’m very grateful I got to learn the Crossref ropes (cropes?) from you. Looking forward to randomly running into you on the Bristol karaoke circuit in 10 years’ time and performing an epic duet of Dancing in the Dark together. There’s a joke in there somewhere about you being the boss.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Lena&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Rachael, Congrats on your new opportunity. You will be greatly missed here. Through the years we have only been at the same events in person a handful of times but I will always remember your amazing personality and sense of humor. I am thankful to have spent some time with you at 2020 PIDapalooza.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Maria&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Thank you, Rachael. Thank you. I know everyone is telling you that they’re sad to see you go (I am too; we all are). I keep thinking if I delay telling you that, maybe the day won’t come when you walk out the Crossref doors. But here it is. Just wanted to you to know that I appreciate you. I appreciate you pushing us forward. I appreciate you being an advocate for all things Crossref. We’ll all miss you. Best of luck at 67bricks!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Isaac&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-bar.jpeg" width="75%">
&lt;/figure>
&lt;/span>
&lt;/div>
On one of our first meet ups together, I drove us from the Lynnfield office to Logan airport in rush hour, and we managed to survive the Bostonian road rage in one piece. We spent the ride talking through the intricacies of a sponsoring organisation’s agreement. Rachael has been a safe set of hands and an encyclopedia of institutional memory for Crossref for 12 years.
&lt;p>Rachael is one of those people who’s as equally competent as she is a pleasure to work with. She’s an innate leader because people want to get behind her. She shows her depth of understanding while also inviting input from everyone in the room. I’ll miss our Zoom calls, our marathon Friday sessions, and our post-meeting pub visits.&lt;/p>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Lucy&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Hello! Here&amp;rsquo;s to hoping your new workplace appreciates you as much as you were here – they&amp;rsquo;re lucky to have you. I only wish we had the chance to interact more. Many hugs!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Luis&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-pub.jpeg" width="75%">
&lt;/figure>
&lt;/span>
&lt;/div>
Rachael, I will really really miss you, professionally and personally (but you know this already !). I'll miss all our work, dog, book and putting the world to rights chats. You'll be brilliant whatever you do and wherever you go (67Bricks have no idea how lucky they are !). Just keep 'getting stuff done' and have fun 😀
&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Fabienne&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>You will be sorely missed but can be very proud of what you’ve done during your time at Crossref, I’m sure you’ll continue to have a big impact. You’ve always been a pleasure to work with: efficient, supportive, and always with a sense of fun and enjoyment. That’s probably one of the things that drew me to Crossref even before we worked together as colleagues. Thanks for the support and positivity you’ve brought on many, many occasions and best wishes for the future!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Martyn&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Hey Rachael! I might have not had the chance to meet with you much while still around but I’ll definitely miss your jokes and the good vibes you were bringing to each call! Looking forward to taking over your place for board games when around Bristol ;) Wishing you a great start in the new place!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Panos&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>ThunderCats are on the move. ThunderCats are loose. Says it all, really. Best of luck in your new endeavours.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Mike&lt;/cite>&lt;/p>
&lt;/div>
&lt;h2 id="crossref-wont-be-the-same-without-rachael-and-we-wish-her-well-on-her-way-to-even-greater-things">Crossref won&amp;rsquo;t be the same without Rachael and we wish her well on her way to even greater things.&lt;/h2>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/rachael/rachael-2024.jpeg" width="100%">
&lt;/figure>
&lt;h2 id="good-luck-lammey">Good luck, Lammey!&lt;/h2>
&lt;br>
&lt;br>
&lt;br>
&lt;br></description></item><item><title>Ed Pentz accepts the 2024 NISO Miles Conrad Award</title><link>https://www.crossref.org/blog/ed-pentz-accepts-the-2024-niso-miles-conrad-award/</link><pubDate>Tue, 13 Feb 2024 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/ed-pentz-accepts-the-2024-niso-miles-conrad-award/</guid><description>&lt;p>Great news to share: our Executive Director, Ed Pentz, has been selected as the 2024 recipient of the Miles Conrad Award from the USA&amp;rsquo;s National Information Standards organisation (NISO). The award is testament to an individual&amp;rsquo;s lifetime contribution to the information community, and we couldn&amp;rsquo;t be more delighted that Ed was voted to be this year&amp;rsquo;s well-deserved recipient.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/staff/ed17-720px.png" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
&lt;p>During the &lt;a href="https://niso.plus" target="_blank">NISO Plus conference&lt;/a> this week in Baltimore, USA, Ed accepted his award and delivered the 2024 Miles Conrad lecture, reflecting on how far open scholarly infrastructure has come, and the part he has played in this at Crossref and through numerous other collaborative initiatives.&lt;/p>
&lt;p>Established in 1965, the Miles Conrad Award gives recognition to those who&amp;rsquo;ve made substantial contributions to the information community over a lifetime. Named after the founder of the National Federation of Abstracting and Indexing Services (NFAIS)—an association that since merged with NISO—the award encourages innovation in content management and dissemination. Over the years, leaders and innovators who have significantly influenced the field of information exchange have been honored with the award. Ed has joined &lt;a href="https://www.niso.org/awards/MCA" target="_blank">an illustrious group&lt;/a>!&lt;/p>
&lt;p>Ed’s leadership in collaboration and diplomacy has led to Crossref&amp;rsquo;s success in making research objects more accessible and useful to a wide global audience, including publishers, researchers, funders, societies, libraries, and more. Crossref&amp;rsquo;s founding purpose is stated as:&lt;/p>
&lt;blockquote>
&lt;p>“To promote the development and cooperative use of new and innovative technologies to speed and facilitate scientific and other scholarly research”.&lt;/p>
&lt;/blockquote>
&lt;p>Acknowledging his privilege as a Western, university-educated, white man, which he comments has helped his career, Ed prioritises collaboration, open communication, teamwork, and equity in creating a positive, trusted environment that has brought together a diverse team of 49 colleagues from 11 countries. The organisation’s culture allows everyone to grow and contribute to the mission of a connected &lt;a href="https://www.crossref.org/documentation/research-nexus">research nexus&lt;/a> by including and developing solutions for community members across the globe.&lt;/p>
&lt;p>Before his journey with Crossref, Ed held a number of roles at Harcourt Brace, including launching Academic Press&amp;rsquo;s first online journal. This experience led to his involvement with the DOI-X pilot project, which became the foundation for Crossref. Since its launch in 2000, under his leadership, Crossref has become an important component of the research ecosystem, an open scholarly infrastructure with nearly 20,000 members across more than 150 countries. Crossref is now the main source of &amp;gt;155 million records about all kinds of research objects and this open metadata registry is relied upon by thousands of tools and services across the whole research system.&lt;/p>
&lt;p>Ed’s influence is also evident throughout the wider world of open scholarly infrastructure; aside from establishing Crossref, he co-founded &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a> and was a founding member of &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a>, where he also served as board Chair. Further, he has engaged with the community by holding various advisory positions, including the &lt;a href="https://www.doi.org/" target="_blank">DOI Foundation&lt;/a>, the Digital Object Naming Authority (DONA), and the Coalition for Diversity in Scholarly Publishing (C4DISC).&lt;/p>
&lt;p>Ed also emphasised that the long-term success of community initiatives lies in patience and the ability to agree on high-level principles of purpose and governance, which oil the wheels of collaboration, encourage participation, and enable more progressive change that builds and lasts over time. He says, &amp;ldquo;to solve collective problems it takes collaboration and diplomacy, bringing together a group of stakeholders, balancing their different concerns, building trust, and reaching consensus.&amp;rdquo;&lt;/p>
&lt;p>The adoption of the &lt;a href="https://openscholarlyinfrastructure.org" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a>, along with (so far) 14 other organisations, was a key &lt;a href="https://www.crossref.org/categories/posi/">turning point for Crossref&lt;/a>, Ed said, and one which has already paved the way for more openness of key metadata for the community, including references and retractions, as well as closer partnerships with many of the other POSI adoptees, given their shared understanding and experience.&lt;/p>
&lt;p>Referencing the current &amp;ldquo;peak hype&amp;rdquo; around artificial intelligence (AI), Ed points to the challenge of research integrity and the &amp;ldquo;growing field of science sleuthing&amp;rdquo; as a forthcoming area that Crossref and open metadata may help tackle at scale, including through Crossref&amp;rsquo;s &lt;a href="https://www.crossref.org/categories/research-integrity">Integrity of the Scholarly Record (ISR) Program&lt;/a> and&amp;mdash;of course&amp;mdash;community-wide collaboration.&lt;/p>
&lt;p>In concluding his talk, Ed describes his hopes and dreams for scholarly communications in the future. He would like to see more balance in diversity in the leadership of open scholarly infrastructure, extended integrations among the various foundational infrastructures, and a fully connected system where the scholarly record is inclusive globally.&lt;/p>
&lt;h4 id="ed-on-behalf-of-all-your-proud-colleagues-at-crossref-thank-you-and-congratulations">Ed, on behalf of all your proud colleagues at Crossref, thank you and congratulations!&lt;/h4></description></item><item><title>ISR Roundtable 2023: The future of preserving the integrity of the scholarly record together</title><link>https://www.crossref.org/blog/frankfurt-isr-roundtable-event-2023/</link><pubDate>Tue, 06 Feb 2024 00:00:00 +0000</pubDate><author>Madhura Amdekar</author><guid>https://www.crossref.org/blog/frankfurt-isr-roundtable-event-2023/</guid><description>&lt;p>Metadata about research objects and the relationships between them form the basis of the scholarly record: rich metadata has the potential to provide a richer context for scholarly output, and in particular, can provide trust signals to indicate integrity. Information on who authored a research work, who funded it, which other research works it cites, and whether it was updated, can act as signals of trustworthiness. Crossref provides foundational infrastructure to connect and preserve these records, but the creation of these records is an ongoing and complex community effort. Crossref has always shown a deep commitment to preserving the integrity of the scholarly record in an open and scalable manner.&lt;/p>
&lt;p>Given the increasing concerns in the community about matters of research integrity and integrity of the scholarly record (ISR), we at Crossref have been engaging with community members to understand what developments are needed. In 2022, we organised &lt;a href="https://doi.org/10.64000/3b445-2zr32" target="_blank">a roundtable discussion to talk about our role&lt;/a> and the applicability of Crossref’s services in preserving and assessing the integrity of the scholarly record. We’ve acted on much of that feedback since, and so in October 2023, we organised a follow-up event, once more gathering representatives of publishers, research integrity experts, policy-makers, academic institutions, funders, and researchers (the full list of participants can be found in the appendix). This post aims to offer insight into the discussions at this event and the next steps. The objective of this event was to take the conversation forward by:&lt;/p>
&lt;ul>
&lt;li>Sharing the progress made by Crossref on matters related to ISR since the last roundtable event.&lt;/li>
&lt;li>Sharing information about how metadata contributes to the Research Nexus, and can act as trust markers for research outputs.&lt;/li>
&lt;li>Apprising the community about the latest membership trends and examples of activities that we see, such as title transfer disputes, unregistered DOIs, requests for deleting records, and &lt;a href="https://arxiv.org/abs/2310.02192" target="_blank">sneaked references&lt;/a> .&lt;/li>
&lt;li>Building upon the ideas discussed during the 2022 roundtable event to progress the conversation about issues related to ISR.&lt;/li>
&lt;li>Learning from the participants about their experiences of pursuing research integrity initiatives.&lt;/li>
&lt;li>Last but no less importantly, hearing from the participants their perspectives on strategies for preserving the integrity of the scholarly record, and opportunities for collaborating to leverage metadata to assess the integrity of the scholarly record.&lt;/li>
&lt;/ul>
&lt;p>The event was kicked off by Ed Pentz, who spoke to the participants about how integrity is key to Crossref’s mission, and Crossref’s vision of the &lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">Research Nexus&lt;/a>. Next, Amanda Bartell, the Head of Member Experience at Crossref, shared the recent developments and trends in community behaviour. She expanded upon the actions taken by Crossref as part of its ISR program since the last roundtable event, which include:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">Acquisition and opening of the Retraction Watch database&lt;/a>, which makes it easier to access information on retractions and corrections.&lt;/li>
&lt;li>Increased participation in the Global Equitable Membership (GEM) program, enabling a wider section of the community to provide and access trust signals.&lt;/li>
&lt;li>Newer developments around metadata that act as trust signals: e.g. 120K grants or awards now have a Crossref DOI, and the &lt;a href="https://doi.org/10.64000/v3429-p7810" target="_blank">planned transition of the Open Funder Registry into ROR&lt;/a>.&lt;/li>
&lt;li>Recruitment of a Community Manager to focus on working with publishers and editors, including on ISR (&lt;a href="https://doi.org/10.64000/j64jw-09931" target="_blank">that’s me!&lt;/a>), and recruitment of &lt;a href="https://doi.org/10.64000/rfdpw-qe476" target="_blank">a Technical Community Manager&lt;/a> to enable greater use of our APIs.&lt;/li>
&lt;/ul>
&lt;p>Amanda highlighted that all Crossref members should be using ROR IDs to provide affiliations for authors (along with ORCID iDs) in their Crossref metadata. She also shared some latest examples of community behaviours that we have seen, such as requests from authors to delete records of works that were published without their permission, title ownership disputes between publishers, and the recent instance of &lt;a href="https://arxiv.org/abs/2310.02192" target="_blank">sneaked references&lt;/a>.&lt;/p>
&lt;p>Ivan Oransky, co-founder of Retraction Watch, and Lena Stoll, Product Manager at Crossref, were next, and they spoke about the future of the Retraction Watch database, and about the &lt;a href="https://www.crossref.org/services/crossmark/" target="_blank">Crossmark service&lt;/a>. After this, some of the other roundtable participants shared initiatives that they have undertaken that support ISR:&lt;/p>
&lt;ul>
&lt;li>Jodi Schneider from the University of Illinois Urbana-Champaign spoke about &lt;a href="https://www.niso.org/standards-committees/crec" target="_blank">NISO’s CREC Working Group&lt;/a> that has created a Recommended Practice that should be followed by relevant stakeholders for communicating retracted research (Crossref’s Director of Product Rachael Lammey was the co-chair of that group).&lt;/li>
&lt;li>Kihong Kim from the Korean Council of Science Editors shared information about the workshops that the Council has organised for researchers on publishing in journals.&lt;/li>
&lt;li>Alberto Martín-Martín from Universidad de Granada presented his thoughts on how to reconcile the publishing system and the institutional view of tracking research outputs.&lt;/li>
&lt;li>Bianca Kramer from Sesame Science spoke about her analysis of and the implications of sneaked references, duplicate references, and missing references for citation integrity.&lt;/li>
&lt;li>Joris van Rossum from STM Solutions spoke about the STM Integrity Hub and the integrity tools that are being developed in collaboration with some publishers.&lt;/li>
&lt;/ul>
&lt;p>Some of the most valuable reflections stemmed from discussions in small groups on these three key questions:&lt;/p>
&lt;ol>
&lt;li>What value do you see in the integrity and completeness of the scholarly record in the way you operate? How do you contribute to it? How can it support you to achieve your own goals?&lt;/li>
&lt;li>Are you aware of Crossref services? What are the barriers to more uptake? What are the challenges and opportunities?&lt;/li>
&lt;li>What information is essential and nice to have for you in the scholarly records to support trust signalling and ascertaining trustworthiness?&lt;/li>
&lt;/ol>
&lt;p>As groups shared their discussions, a few themes became apparent that I would like to elaborate on further.&lt;/p>
&lt;h2 id="what-is-complete">What is “complete”?&lt;/h2>
&lt;p>Given the prompt to talk about the value of completeness of the scholarly record, an immediate reaction at most tables was: how much metadata qualifies as “complete” metadata? Can the scholarly record be considered complete if some publishers or journals do not use Crossref? What is the optimum level of metadata that should be deposited by members - should a minimum data standard be defined by disciplines, or should there be standard data requirements for all? The composition of metadata appears to change over time, too, as the processes change and our ability to record their facets increases. While there were spirited discussions about what constitutes a complete scholarly record, everyone agreed that “completeness” of metadata, as much as is possible, should be the aim. Unambiguous and consistent standards may help with this, for example, the Metadata 20/20 community creation of &lt;a href="https://metadata2020.org/resources/metadata-principles/" target="_blank">principles&lt;/a> and &lt;a href="https://metadata2020.org/resources/metadata-practices/" target="_blank">best practices&lt;/a>, and potentially also using &lt;a href="https://www.niso.org/press-releases/2021/01/nisos-recommended-practice-reproducibility-badging-and-definitions-now" target="_blank">a set of recognition standards and reproducibility badges&lt;/a>.&lt;/p>
&lt;p>Global participation is equally important for a truly “complete” scholarly record. In order to enable as many in the scholarly community as possible to participate in Crossref services and metadata, Crossref launched the &lt;a href="https://www.crossref.org/gem/" target="_blank">Global Equitable Membership (GEM)&lt;/a> program in 2023. Under this initiative, membership and content registration fees are waived off for members from the least economically advantaged countries. We are seeing first signs that &lt;a href="https://doi.org/10.64000/vnvbt-64862" target="_blank">this initiative meaningfully lowers the barriers to participation&lt;/a> for organisations based in those countries, and allows the global community to contribute towards the building of a comprehensive research ecosystem.&lt;/p>
&lt;p>At the end of the day, it is important to recognize that rich metadata is crucial because it can be used for all kinds of analysis, which in turn can drive decision-making. Even if some of the metadata components are sporadically missing, that could be acceptable, because every piece of data counts!&lt;/p>
&lt;h2 id="corrections-and-retractions">Corrections and Retractions&lt;/h2>
&lt;p>Similar to last year, retractions and corrections continued to be a topic of great interest in this year’s roundtable. This was not surprising given their relevance as trust indicators as well as the recent development with the acquisition of the Retraction Watch database by Crossref. Having heard from Ivan about the Retraction Watch &lt;a href="https://retractionwatch.com/retraction-watch-database-user-guide/retraction-watch-database-user-guide-appendix-b-reasons/" target="_blank">taxonomy of reasons for retractions&lt;/a> and the metadata included in the database, participants expressed the need to investigate this taxonomy as a community standard. While the Retraction Watch taxonomy is not widely known, we at Crossref are working to map the Crossmark taxonomy with the Retraction Watch taxonomy, which will enable complete integration of the Retraction Watch database with the Crossref database.&lt;/p>
&lt;p>It would also be useful to add more information to retraction notices. Having more information about the reasons for retraction will not only destigmatize retractions, but certain additional information, such as submission dates for those outputs, might help with ethical investigations to determine whether manuscripts were being submitted to multiple publishers simultaneously.&lt;/p>
&lt;p>On the topic of retractions, another aspect that came up in the room was about incentives for researchers to publish as much and as quickly as possible. If researchers indulge in unethical publishing practices due to this pressure to publish, that is hugely detrimental to the cause of research integrity and to the progress of scientific research in general. However, there is a distinction to be made between the integrity of the research and the integrity of the scholarly record - unethical research and publishing practices, including but not limited to data falsification, fabrication, and plagiarism, affect research integrity while integrity of the scholarly record is affected by unavailability of metadata, outdated metadata, incomplete metadata records, and incorrect metadata (e.g. as seen in the case of &lt;a href="https://arxiv.org/abs/2310.02192" target="_blank">sneaked references&lt;/a>).&lt;/p>
&lt;p>There was a lot of discussion about &lt;a href="https://www.crossref.org/services/crossmark/" target="_blank">Crossmark&lt;/a>, a cross-platform service provided by Crossref that allows readers to discover whether an item has been updated, corrected, or retracted just by clicking a button that is standardised across publication platforms. While most participants acknowledged its importance, they also pointed out that its uptake has been limited and publishers do not use it as much, perhaps because it is difficult to implement and there’s a matter of providing more clarity about it to the readers. There were suggestions to add a notification system to Crossmark such that every time a published output is retracted, a notification goes out. This seemed of particular interest to funders, whose grievance was that they are usually the last to find out when research that they have funded is retracted. They would welcome notifications that would alert them to such events.&lt;/p>
&lt;p>We already have plans to consult with the community more specifically about what changes they’d like to see to Crossmark that will enable them to implement it easily and use it more frequently. Take a look &lt;a href="https://community.crossref.org/t/communicating-post-publication-updates-inviting-feedback-on-the-next-steps-for-crossmark/4744" target="_blank">at this thread on our community forum and add your thoughts for our next steps on Crossmark&lt;/a>.&lt;/p>
&lt;h2 id="the-importance-of-education">The importance of education&lt;/h2>
&lt;p>There was an overwhelming sentiment that there was a need for collective arbitration of research integrity issues. However, everyone recognized that this is not a role for Crossref. We can act as a “trust broker” by bridging different metadata and identifiers that otherwise might not interact, creating a network of research outputs whose credibility can be verified by others. Many participants called for Crossref to increase its efforts in educating community members about the importance of metadata and how different pieces can be linked together to make meaningful connections.&lt;/p>
&lt;p>Research practices vary between countries, and between institutions. Correspondingly, the metadata being provided by diverse Crossref members may also vary. There is an opportunity here for the global research community to work together to increase awareness about ethical standards, so that a lack of specific metadata or its variances (e.g. unusually formatted metadata, or non-standard metadata fields) may not be construed as “lower quality” metadata. Many felt that the greatest need for education about metadata is for the academic community – although individual researchers contribute a wealth of metadata associated with any published research output, they do not necessarily understand how metadata contributes to the completeness of the scholarly record. There is a further opportunity to talk to the academic community about how different metadata components link together to form a rich network, supporting visibility and confidence in their work. A greater awareness about these topics is likely to encourage researchers to provide more metadata and identifiers.&lt;/p>
&lt;p>While most participants at the roundtable event agreed about the need for this conversation and the educational opportunities here, if Crossref were to lead these efforts, it would represent, in some eyes, a diversion from its mission. We do have several initiatives already to support our communities. As part of the &lt;a href="https://www.crossref.org/community/ambassadors/" target="_blank">Crossref Ambassadors program&lt;/a>, volunteers from the international scholarly community who believe in Crossref’s mission liaise with our team to conduct training in their communities about using Crossref services and, generally, about the importance of metadata. In 2023, we also launched a new online public forum, &lt;a href="https://theplace.discourse.group/" target="_blank">the PLACE&lt;/a>, in collaboration with the Committee on Publication Ethics (COPE), the Directory of Open Access Journals (DOAJ), and the Open Access Scholarly Publishers Association (OASPA). This forum is a place where new publishers can connect with these organisations and learn about best practices in scholarly publishing via discussion posts and by asking questions, as they get started. Another initiative that is designed to help new Crossref members is the “Managed Member Journey”: as members join and move through the various stages of membership, key information is shared with them during each of these stages in the form of triggered automated emails, web pages, and webinars.&lt;/p>
&lt;p>While Crossref&amp;rsquo;s direct interactions with researchers are limited, we welcome the community&amp;rsquo;s recognition of the need to raise awareness about these matters. We have started engaging more closely with the reporters of metadata issues, in many cases investigators and ‘sleuths’ in the area of research integrity, and plan some closer collaborations with this group in 2024. We are open to supporting community efforts to inform other stakeholders about the importance and uses of metadata.&lt;/p>
&lt;h2 id="incentives-for-the-community">Incentives for the community&lt;/h2>
&lt;p>Another theme that was heard repeatedly was “incentives”: incentives for researchers to contribute to a “complete” scholarly record, incentives for publishers to improve metadata, and incentives for everyone to report on and register retractions.&lt;/p>
&lt;p>As I mentioned before, a shared sentiment is that researchers may not be aware of the value of rich metadata. While more publications, increased citations, and greater grant funding are some examples of incentives that are part of the current academic settings, the right incentives probably do not exist for researchers to provide complete metadata. With the diverse set of participants present at this meeting, some groups also discussed how the current research assessment system can change to incorporate other metrics, perhaps those based on open science and open data.&lt;/p>
&lt;p>What could be the incentives for publishers to improve the metadata collected and deposited by them? One suggestion was that clearly defined benefits of rich metadata can incentivise publishers. Being aware of what funders are mandating, can be another incentive. On the same note, funders will benefit from knowing what metadata is being provided by publishers. This metadata is available through our open API, and nine key checks on members’ activity are available through our public &lt;a href="https://crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a>.&lt;/p>
&lt;p>Retractions featured again in the discussion on the topic of incentives. As shared by Ivan, retractions are on the rise every year, with about 43k retractions currently in the Retraction Watch database. On the other hand, retractions registered in Crossmark at the time of the meeting numbered just 14k and have recently jumped up to 25k thanks to Hindawi/Wiley’s dedication to good open metadata. Besides the fact that the uptake of Crossmark by Crossref members is limited, another reason for the low number of retractions being registered is the associated stigma. Corrections and errata are usually conflated with retractions, and all these terms, which represent different kinds of updates that may happen to a published item, have a stigma associated with them in the academic community. There is a need to destigmatize retractions, and perhaps incentivize them by noting that these updates are essential to uphold the integrity of the scholarly record and to highlight the publishers that are showing leadership in addressing the issues openly through up-to-date Crossref metadata.&lt;/p>
&lt;h2 id="what-metadata-is-nice-to-have-in-the-scholarly-record">What metadata is nice to have in the scholarly record?&lt;/h2>
&lt;p>We asked everyone what information they think is essential as well as “nice to have” in the scholarly record to support trust signalling, and we heard a range of answers. Peer review information was recognized to be important. This would include data on who the peer reviewers were and &lt;a href="https://www.niso.org/standards-committees/peer-review-terminology" target="_blank">standard peer review terminology that has been published by NISO&lt;/a>. More generally, as much metadata as possible about the main actors of the peer review process was considered important - such as designating who the corresponding author is, and who the handling editor or the decision-making editor was.&lt;/p>
&lt;p>As special issues led by guest editors in journals have been brought to the attention of late due to the uncovering of irregularities in some of them, one of the first suggestions in this context was more metadata about special issues. Participants thought that it would be useful to collect and distribute information on handling/guest editors of special issues, peer reviewers, as well as submission and acceptance dates. Recently, COPE has released guidance on &lt;a href="https://publicationethics.org/node/56239" target="_blank">“best practices for guest-edited collections”&lt;/a> , highlighting that this topic looms at the forefront for the scholarly information industry.&lt;/p>
&lt;p>Adding information on ethical approvals provided by institutional review boards would add more nuance to the research outputs. Metadata about clinical trials helps to add transparency to research in a field, where reproducibility is of primary importance. Conflicts of interest are another factor that could be a cause of concern if not reported accurately; these declarations were mentioned by the participants as important for signalling trust.&lt;/p>
&lt;p>Recognizing that it is the relationships in the metadata that add context to research output, participants echoed that better interlinking between preprints and their published versions is required. To aid with all of this, it has been suggested that a complete list of all metadata that can be deposited with Crossref be made available in a simple format, so that members have more visibility about all the possibilities that exist for providing metadata.&lt;/p>
&lt;h2 id="next-steps">Next steps&lt;/h2>
&lt;p>We asked all participants if the discussions prompted them to plan to take any actions in the near future. Several attendees reflected that the discussion encouraged them to go back and review the metadata that they are depositing with Crossref, and how they can make more use of the data openly available from Crossref. We also heard how some found training opportunities therein - discussion points from the event could be included in workshops for affiliated researchers, and in COPE guidance for members. As encouraged by members of the &lt;a href="https://www.niso.org/standards-committees/crec" target="_blank">NISO’s CREC Working Group&lt;/a>, some participants were looking to respond in the (then open) consultations of the &lt;a href="https://groups.niso.org/higherlogic/ws/public/download/29165/RP-45-202X_NISO_Communication_of_Retractions_Removals_and_Expressions_of_Concern_CREC_draft_for_public_comment.pdf" target="_blank">draft Recommended Practice, NISO RP-45-202X, Communication of Retractions, Removals, and Expressions of Concern (CREC)&lt;/a>. One message resonated loud and clear: preserving the integrity of the scholarly record cannot be a lone endeavour and has to be a community effort. Attendees expressed their commitment to continue these conversations, with the next most opportune time being at the &lt;a href="https://www.stm-assoc.org/events/stm-week-2023/" target="_blank">STM week&lt;/a>. Everyone recognised that collaboration in this space is the need of the hour: facilitating information and data sharing across all the players in the ecosystem would be crucial to progressing this topic. As Bianca Kramer declared during her presentation, “I am committed to using only open data in my research, as access to data is important for the community to detect problems at scale”.&lt;/p>
&lt;p>At our end, we are looking to act on suggestions that are specific to Crossref:&lt;/p>
&lt;h3 id="consultation-with-the-community-about-crossmark">Consultation with the community about Crossmark&lt;/h3>
&lt;p>One of the first things that we are doing in early 2024 is to consult with our community about the developments needed in the Crossmark service. Our key aim with this exercise would be to understand how we can enable a more effective uptake of this service so that Crossref members can easily fulfil their obligation of keeping their records updated. We are keen to understand what we can do to help our members to send us metadata about updates to an output, and how we can help downstream services that use this data. Insights from this consultation will also help inform how the Retraction Watch data can be most effectively integrated into Crossmark and communicated to users. Please visit the discussion and add your thoughts here: &lt;a href="https://community.crossref.org/t/communicating-post-publication-updates-inviting-feedback-on-the-next-steps-for-crossmark/" target="_blank">https://community.crossref.org/t/communicating-post-publication-updates-inviting-feedback-on-the-next-steps-for-crossmark/&lt;/a>.&lt;/p>
&lt;h3 id="development-of-resources-for-using-our-api">Development of resources for using our API&lt;/h3>
&lt;p>As there is clearly no dearth of metadata components that the community thinks would be “nice to have” for signalling trust, it is equally important to equip users and downstream service providers to be able to access the rich metadata that is available with Crossref. This rich metadata opens up new avenues for the development of services and resources that can benefit the scholarly community. On account of this, we plan to prioritise development of resources for using Crossref APIs. These efforts would include making available workbooks with a variety of API use cases - ranging from how to use basic API queries, to how to use APIs for obtaining grant information or for obtaining citation data and so on, as well as retrieving corrections, retractions, and update information, especially when the Retraction Watch dataset merges in with the rest of the Crossref metadata.&lt;/p>
&lt;h3 id="working-group-to-facilitate-community-efforts-for-preserving-isr">Working group to facilitate community efforts for preserving ISR&lt;/h3>
&lt;p>We are looking to set up a working group that will facilitate the various stakeholders in the scholarly ecosystem to work together towards preserving the integrity of the scholarly record. One direction for the group could be to consider the role and impact of Crossref metadata in ISR. Another area of focus will be to enrich information about retractions, corrections, and expressions of concern. Raising industry-wide awareness about the current concerns in upholding the integrity of the scholarly record, and how comprehensive metadata can act as markers of trust about research output, would be another focal point.&lt;/p>
&lt;h3 id="continued-community-outreach">Continued community outreach&lt;/h3>
&lt;p>We will continue our efforts to engage with the community on the very important issues surrounding ISR. We are particularly keen to redouble our efforts to include more funders and institutions in these conversations. Preserving the integrity of the scholarly record needs to be a truly inclusive effort and will benefit from diverse voices in the community. With that in mind, consulting with the community in Asia is next on our radar.&lt;/p>
&lt;p>We look forward to working with the community further on this important topic - if you are keen to participate in these discussions and want to contribute towards preserving the integrity of the scholarly record, we would love to hear from you. Please write to us at &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a> if you have any suggestions on this topic.&lt;/p>
&lt;h2 id="appendix-participant-list">Appendix: Participant list&lt;/h2>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Name&lt;/th>
&lt;th>Role&lt;/th>
&lt;th>organisation&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Ed Pentz&lt;/td>
&lt;td>Executive Director&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Amanda Bartell&lt;/td>
&lt;td>Head of Member Experience&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Madhura Amdekar&lt;/td>
&lt;td>Community Engagement Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Luis Montilla&lt;/td>
&lt;td>Technical Community Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Lena Stoll&lt;/td>
&lt;td>Product Manager&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kora Korzec&lt;/td>
&lt;td>Head of Community Engagement and Communications&lt;/td>
&lt;td>Crossref&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ivan Oransky&lt;/td>
&lt;td>Co-Founder&lt;/td>
&lt;td>Retraction Watch&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Jennifer Wright&lt;/td>
&lt;td>Research Integrity Manager&lt;/td>
&lt;td>Cambridge University Press&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guntram Bauer&lt;/td>
&lt;td>Director of Science Policy &amp;amp; Communications&lt;/td>
&lt;td>Human Frontier Science Program&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Wendy Patterson&lt;/td>
&lt;td>Scientific Director&lt;/td>
&lt;td>Beilstein-Institut&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sarah Jenkins&lt;/td>
&lt;td>Director, Research Integrity &amp;amp; Publishing Ethics&lt;/td>
&lt;td>Elsevier&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Helene Stewart&lt;/td>
&lt;td>Director, Editorial Relations Web of Science&lt;/td>
&lt;td>Clarivate&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bianca Kramer&lt;/td>
&lt;td>Advisor, Research Analyst, Facilitator&lt;/td>
&lt;td>Sesame Open Science&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Adya Misra&lt;/td>
&lt;td>Research Integrity and Inclusion Manager&lt;/td>
&lt;td>Sage&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Andrew Joseph&lt;/td>
&lt;td>&lt;/td>
&lt;td>Wits University Press&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Theodora Bloom&lt;/td>
&lt;td>Executive Editor&lt;/td>
&lt;td>BMJ&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Alberto Martín-Martín&lt;/td>
&lt;td>Assistant Professor&lt;/td>
&lt;td>Universidad de Granada&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Aaron Wood&lt;/td>
&lt;td>Head, Product &amp;amp; Content Management&lt;/td>
&lt;td>American Psychological Association&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Fred Atherden&lt;/td>
&lt;td>Head of Production Operations&lt;/td>
&lt;td>eLife&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kihong Kim&lt;/td>
&lt;td>&lt;/td>
&lt;td>Korean Council of Science Editors&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>David Flanagan&lt;/td>
&lt;td>Senior Director, Data Science&lt;/td>
&lt;td>Wiley&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Chiara Di Giambattista&lt;/td>
&lt;td>Communications Director&lt;/td>
&lt;td>OpenCitations&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Scott Delman&lt;/td>
&lt;td>Director of Publications&lt;/td>
&lt;td>ACM&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Chi Wai (Rick) Lee&lt;/td>
&lt;td>General Manager&lt;/td>
&lt;td>World Scientific Publishing Co (WSPC)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Leslie McIntosh&lt;/td>
&lt;td>VP, Research Integrity&lt;/td>
&lt;td>Digital Science&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Adam Day&lt;/td>
&lt;td>Director&lt;/td>
&lt;td>Clear Skies&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Damaris Critchlow&lt;/td>
&lt;td>Project Manager&lt;/td>
&lt;td>Karger&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tamara Welschot&lt;/td>
&lt;td>Head of Research Integrity, Prevention&lt;/td>
&lt;td>Springer Nature&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kathryn Dally&lt;/td>
&lt;td>Research Integrity and Policy Lead&lt;/td>
&lt;td>Research Services, University of Oxford&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Masahiko Hayashi&lt;/td>
&lt;td>Director, JSPS Bonn Office&lt;/td>
&lt;td>Japan Society for the Promotion of Science&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Simone Taylor&lt;/td>
&lt;td>Chief, Publishing&lt;/td>
&lt;td>American Psychiatric Association&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Christna Chap&lt;/td>
&lt;td>Head of Editorial Development&lt;/td>
&lt;td>Karger Publishers&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Coromoto Power Febres&lt;/td>
&lt;td>Research Integrity Manager&lt;/td>
&lt;td>Emerald Publishing&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Carole Chapin&lt;/td>
&lt;td>Project Manager&lt;/td>
&lt;td>French Office for Research Integrity&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Jodi Schneider&lt;/td>
&lt;td>Associate Professor of Information Sciences&lt;/td>
&lt;td>University of Illinois Urbana Champaign&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Oliver Koepler&lt;/td>
&lt;td>Head of Lab Linked Scientific Knowledge&lt;/td>
&lt;td>TIB - Leibniz Information Centre for Science and Technology&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Heather Staines&lt;/td>
&lt;td>&lt;/td>
&lt;td>Delta Think&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Eri Anno&lt;/td>
&lt;td>&lt;/td>
&lt;td>JSPS Bonn office&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Joris van Rossum&lt;/td>
&lt;td>&lt;/td>
&lt;td>STM Solutions&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Anita de Waard&lt;/td>
&lt;td>VP Research Collaborations&lt;/td>
&lt;td>Elsevier&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table></description></item><item><title>RORing ahead: using ROR in place of the Open Funder Registry</title><link>https://www.crossref.org/blog/roring-ahead-using-ror-in-place-of-the-open-funder-registry/</link><pubDate>Tue, 30 Jan 2024 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/roring-ahead-using-ror-in-place-of-the-open-funder-registry/</guid><description>&lt;p>A few months ago we announced our plan to &lt;a href="https://doi.org/10.64000/v3429-p7810" target="_blank">deprecate our support for the Open Funder Registry&lt;/a> in favour of using the ROR Registry to support both affiliation and funder use cases. The feedback we’ve had from the community has been positive and supports our members, service providers and metadata users who are already starting to move in this direction.&lt;/p>
&lt;p>We wanted to provide an update on work that’s underway to make this transition happen, and how you can get involved in working together with us on this.&lt;/p>
&lt;p>Overall, we are building more comprehensive support for ROR into Crossref’s services. Some of this work is specifically to support using ROR to identify funding organisations in place of funder registry IDs. We have a number of parallel, complementary projects underway to support different elements of this work:&lt;/p>
&lt;ol>
&lt;li>We are evolving our metadata schema so that we can collect ROR IDs in places where we currently support the collection of Funder IDs.&lt;/li>
&lt;li>We are analysing the coverage of Funder ID to ROR ID mappings and testing the way we expose them in our APIs.&lt;/li>
&lt;li>We are developing new matching strategies to match text strings to ROR IDs.&lt;/li>
&lt;/ol>
&lt;h2 id="1-schema-updates">1. Schema updates&lt;/h2>
&lt;p>Everything flows from being able to get ROR IDs into the Crossref metadata!&lt;/p>
&lt;p>We are evolving our metadata schema so that we can collect ROR IDs in places where we already support the collection of Funder IDs – for instance, in &lt;a href="https://www.crossref.org/documentation/funder-registry/funding-data-overview/">the funding section of the metadata for works&lt;/a> and in the funder section for grants.&lt;/p>
&lt;p>We’re working with members and service providers so that they can try sending us this data via a pipeline our Labs team has built to test schema updates before they go live. We are actively recruiting members to help us test our new pipeline by providing sample XML for registration. Planned metadata inputs and outputs are detailed in &lt;a href="https://docs.google.com/document/u/0/d/164h3UtBQ2mHf5lH5ZS6c_Oh8OuraoaQPvXhNNO3-Ko8/edit" target="_blank">Including ROR as a funder identifier in your metadata (metadata prototyping instructions)&lt;/a>, we’d encourage you to provide feedback on these in the document, ideally in the next two weeks.
We’re aiming to release an updated schema that supports these changes in Q1 2024.&lt;/p>
&lt;h2 id="2-modelling-ror-idfunder-id-mappings-in-our-metadata-model">2. Modelling ROR ID/Funder ID mappings in our metadata model&lt;/h2>
&lt;p>We have integrated the ROR registry into our evolving metadata model, and we have started work to integrate the Funder Registry. The aim is to create more flexibility in how Crossref’s metadata can be supplemented and queried, and give more clarity as to which party asserted or created a metadata element.&lt;/p>
&lt;p>We’re working on an early iteration of how the model handles ROR IDs, funder IDs and their equivalencies. Once we have something to share, we’ll welcome community feedback on this approach and on the metadata model in general.&lt;/p>
&lt;h2 id="3-developing-new-matching-strategies-to-match-text-strings-to-ror-ids">3. Developing new matching strategies to match text strings to ROR IDs&lt;/h2>
&lt;p>Ideally, everyone would always use persistent identifiers to exchange information about contributor and awardee affiliations, organisations related to works, as well as funders supporting the research. In practice, this information is often exchanged as data without identifiers, such as affiliation strings (e.g. “University of Virginia, Charlottesville, VA”), funder names, or even funding acknowledgements (e.g. “Funding and support generously provided by the Ford Foundation”). In such situations, a good metadata matching strategy can help map these to persistent identifiers.&lt;/p>
&lt;p>Currently, we are focused on developing reliable strategies for matching affiliation strings to ROR IDs. In the future, we will adapt the strategies to support funder names and funding acknowledgements as well. All the strategies will be rigorously evaluated using real-life data. We will make the strategies, as well as the evaluation datasets and evaluation results, publicly available for anyone to use. If you are interested in collaborating on the development or the evaluation of the matching strategies, &lt;a href="mailto:labs@crossref.org">please get in touch&lt;/a>!&lt;/p>
&lt;p>In the future, we might also apply some of the new matching strategies at Crossref, to the metadata our members send us. This would allow us to insert matched identifiers to the metadata to better connect organisations with other items in the scholarly record. We already have a process that matches the names of funders supporting research against the Funder Registry and enriches the metadata with matched Funder Registry IDs. Developing and evaluating reliable matching strategies will allow us to modify this process to use ROR IDs instead, and extend it to support other use cases, such as contributor affiliations.&lt;/p>
&lt;h2 id="what-will-the-transition-mean-for-you">What will the transition mean for you?&lt;/h2>
&lt;p>We do recommend that you begin looking at what it will take to integrate ROR into your systems and workflows for identifying funders. Talk to your service providers about this to ready them for this change.
To reiterate the point from the earlier post, in the short term, and even in the medium term, Funder IDs aren’t going away and the Funder IDs will continue to resolve – they are persistent, after all. Eventually, however, the Funder Registry will cease to be updated, so any new funders will only be registrable in Crossref metadata with ROR IDs. Legacy Funder IDs and their mapping to ROR IDs will be maintained, so if Crossref members submit a legacy Funder ID, it will get mapped to a ROR ID automatically. Note, too, that Crossref is committed to maintaining the current funder API endpoints until ROR IDs become the predominant identifier for newly registered content. We also know that there are questions that we’ll want to tackle with the community as we all make progress, some we know and some we don’t know. With that in mind:&lt;/p>
&lt;h2 id="tell-us-what-you-need">Tell us what you need!&lt;/h2>
&lt;p>We want to hear from you! We have set up several channels of communication meant to ensure that you can tell both ROR and Crossref what will make this transition easier for you and that you can get answers to your questions.&lt;/p>
&lt;p>First, we are conducting a series of Open Funder Registry user interviews designed to deepen our understanding of where Funder IDs are being used in workflows and systems. Write &lt;a href="mailto:community@ror.org">community@ror.org&lt;/a> if you&amp;rsquo;d like to participate in these interviews to show and tell us how you&amp;rsquo;re using Funder IDs.&lt;/p>
&lt;p>Second, in 2024, we will be running a follow-up to the funding data workshop we ran in June 2023. Please get in touch if your organisation would be interested in participating in the discussion.&lt;/p></description></item><item><title>Solving your technical support questions in a snap!</title><link>https://www.crossref.org/blog/solving-your-technical-support-questions-in-a-snap/</link><pubDate>Thu, 25 Jan 2024 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/solving-your-technical-support-questions-in-a-snap/</guid><description>&lt;p>My name is Isaac Farley, Crossref Technical Support Manager. We’ve got a collective post here from our technical support team - staff members and contractors - since we all have what I think will be a helpful perspective to the question: &lt;strong>‘What’s that one thing that you wish you could snap your fingers and make clearer and easier for our members?’&lt;/strong> Within, you’ll find us referencing our &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>, the open support platform where you can get answers from all of us and other Crossref members and users. We invite you to join us there; how about asking your next question of us there? Or, simply let us know how we did with this post. We’d love to hear from you!&lt;/p>
&lt;h2 id="a-little-about-us-and-what-drives-the-team">A little about us and what drives the team&lt;/h2>
&lt;p>I’m fortunate to manage a great team - Evans, Kathleen, Paul, Poppy, and Shayn - who enjoy and are hardwired to guide. We have different strengths and interests, but the thing that unites us is that we are energized when we can unpick tricky problems for all of you, our members and users. In 2023, the technical support team answered around 11,000 questions from all of you. We do that with one-to-one requests sent to us via email and within our support center (using a closed-source software called Zendesk). And, we’ve been providing more and more support in our Community Forum, where we’re aiming for open interactions, so we can all learn from the rich exchanges with all of you (the Forum has an integration with Zendesk, so posts made in the Forum are delivered to us there, so our team won’t miss any of your questions).&lt;/p>
&lt;p>We established in the previous paragraph that we have a great technical support team who all pride themselves on helping you. But we’re also human; the reality is that many of those ~11,000 technical support questions asked of us in 2023 were repetitive, and there are always trends in the questions asked. That’s another important reason why we’re hoping to have more and more of these questions asked and answered within our Community Forum; again, so we can all learn from one another. We know certain parts of content registration, metadata retrieval, and everything in between are, well, complicated. The Crossref learning curve can be steep for all of us. Collectively, our technical support team has more than 25 years of Crossref experience, and we’re continuously learning new things about the &lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">Research Nexus&lt;/a> and the scholarly ecosystem from one another and all of you.&lt;/p>
&lt;p>Learning through this complexity is one of the most enriching parts of our days. Our daily &lt;a href="https://en.wikipedia.org/wiki/Stand-up_meeting" target="_blank">stand-up&lt;/a>, modeled off of different software development methodologies, where together we troubleshoot tangly questions from all of you, share ideas, and just keep up-to-date on the latest from across the organisation leads to a lot of knowledge exchange. So, years ago, we decided to transform the issues we discuss in those stand-ups into public-facing posts in our Community Forum. It gave us the opportunity to share much-needed examples in a new community space; and, we knew, since these were the issues we were all discussing and learning from ourselves, that many of you would also benefit from us surfacing the topics openly. We call these posts &lt;a href="https://community.crossref.org/tag/ticket_of_month" target="_blank">tickets of the month&lt;/a>, since the majority of topics we discuss have originated from tickets in our support center.&lt;/p>
&lt;p>Examples of some of the most popular topics in the last two-plus years have been:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://community.crossref.org/t/ticket-of-the-month-march-2022-getting-started-with-rest-api-queries/2587/29" target="_blank">Getting started with REST API queries&lt;/a> and the follow-up post &lt;a href="https://community.crossref.org/t/ticket-of-the-month-august-2023-using-postman-for-api-queries/4036/2" target="_blank">Using Postman for API Queries&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://community.crossref.org/t/ticket-of-the-month-june-2023-content-registration-did-it-work/3783" target="_blank">Content Registration: Did it work?&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://community.crossref.org/t/ticket-of-the-month-april-2023-the-new-labs-reports-are-here/3528" target="_blank">The new Labs Reports are here&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://community.crossref.org/t/ticket-of-the-month-february-2023-are-you-an-ojs-user-are-the-below-questions-familiar-we-d-like-to-help/3376" target="_blank">Are you an OJS user? Are the below questions familiar?&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://community.crossref.org/t/ticket-of-the-month-sept-2022-get-citation-counts-for-all-articles-in-a-particular-journal/3008" target="_blank">Get Citation Counts for all Articles in a Particular Journal&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="snapping-our-fingers">Snapping our fingers&lt;/h2>
&lt;p>Like I said, these posts originated from real-life questions of us from our community members. In most cases, we’ve been asked these questions by &lt;em>many&lt;/em> of you. These Community Forum posts are our attempts to unlock understanding of our services, rich metadata, or the larger Research Nexus. Said another way: we all see value in putting in the effort to post one more example or answer that nuanced question. Perhaps one of our posts will include an example that really resonates with you and/or your work.&lt;/p>
&lt;p>In that spirit, I asked Evans, Kathleen, Paul, Poppy, and Shayn to answer this question below (yes, I’m going to weigh in, too):&lt;/p>
&lt;blockquote>
&lt;p>&lt;strong>What’s that one thing that you wish you could snap your fingers and make clearer and easier for our members?&lt;/strong>&lt;/p>
&lt;/blockquote>
&lt;h2 id="evans-technical-support-specialist">Evans, Technical Support Specialist&lt;/h2>
&lt;p>As a publisher and a Crossref member, at one point or another, you might have made a mistake in the metadata deposited for a given DOI. I’m sure after the slight ‘shock’, the next question you had in mind was, &lt;em>‘How can I correct this mistake?’&lt;/em> Well, here is a simplified guide on how to do that correction/update!&lt;/p>
&lt;p>&lt;strong>Can I modify/ update the metadata of a registered DOI?&lt;/strong>
As indicated by my colleague Shayn below in this blog post, Crossref DOIs are designed to be persistent (and cannot be changed/deleted once registered). And YES, you can update the metadata associated with any of your registered DOIs whenever necessary, at no additional fee.&lt;/p>
&lt;p>&lt;strong>How can I perform a standard metadata update?&lt;/strong>
To add, change, or remove any metadata element from your existing records, you generally just need to resubmit your complete metadata record with the correct/new changes included. How you choose to update a DOI metadata record is highly dependent on the content registration tool/platform you are using/comfortable working with, as described below:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>OJS&lt;/strong>: Navigate to the article record you wish to update, add in your new metadata/delete relevant metadata fields, and deposit it again using the &lt;a href="https://docs.pkp.sfu.ca/crossref-ojs-manual/" target="_blank">Crossref import/export plugin&lt;/a>. You must be running at least OJS 3.1.2 and have the Crossref import/export plugin enabled.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Web deposit form&lt;/strong>: Open the &lt;a href="https://www.crossref.org/documentation/register-maintain-records/web-deposit-form/" target="_blank">web deposit form&lt;/a>, and re-enter all the metadata, including the new changes - leave the relevant field blank to delete it, or add in your new metadata to update it - and resubmit the form (note: there are a &lt;a href="https://www.crossref.org/documentation/register-maintain-records/maintaining-your-metadata/updating-your-metadata/#00627" target="_blank">handful of exceptions&lt;/a> to this for the web deposit form).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Depositing XML files with Crossref&lt;/strong>: Make changes to the relevant XML file and resubmit it to Crossref via the &lt;a href="https://doi.crossref.org/servlet/home" target="_blank">admin tool&lt;/a>. When making an update, you must supply all the bibliographic metadata for the record being updated, not just the fields that need to be changed. During the update process, we overwrite the existing metadata with the new information you submit, and insert null values for any fields not supplied in the update. This means, for example, that if you’ve supplied an online publication date in your initial deposit, you’ll need to include that date in subsequent deposits if you wish to retain it. Note that the value included in the &lt;timestamp> element must be incremented each time a DOI is updated.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>If you’re looking for &lt;a href="https://community.crossref.org/tag/update-doi" target="_blank">real-life examples&lt;/a> of other members who have updated their metadata, the Community Forum is a great starting point. If you have follow-up questions on any of the existing threads, I invite you to post a message today.&lt;/p>
&lt;h2 id="kathleen-technical-support-specialist">Kathleen, Technical Support Specialist&lt;/h2>
&lt;p>One of my favorite types of queries to tackle are those regarding content registration problems. I love a good mystery and getting to the bottom of why that pesky submission just didn&amp;rsquo;t succeed. Sometimes members come to us with an error message and specific questions about what has gone awry. But, in fact, two of the most common questions we receive are: 1) I deposited something; did it work? and 2) I deposited something; why isn&amp;rsquo;t it showing up?!&lt;/p>
&lt;p>To address the first question of whether your submission went through or not, I wrote a &lt;a href="https://community.crossref.org/t/ticket-of-the-month-june-2023-content-registration-did-it-work/3783" target="_blank">forum post back last June&lt;/a> talking about how to use the admin tool to see whether your registration was successful or not. We know there are also email alerts and perhaps status messages within your own registration platform, but using the admin tool is a great way to concretely check where your submission has ended up. If it&amp;rsquo;s not there, we didn&amp;rsquo;t get it!&lt;/p>
&lt;p>Using the admin tool is also a great way to get &lt;a href="https://www.crossref.org/documentation/register-maintain-records/verify-your-registration/submission-queue-and-log/#00143" target="_blank">more details about the submission&lt;/a> and more information in case the submission happened to fail. You may have had the experience in which you contacted us with a question about a failed deposit, and we asked you for the submission ID. You can find that info in the admin tool! And we ask for that, because that helps us get to the bottom of those error message mysteries.&lt;/p>
&lt;p>And, as for the second question of when will your DOI be active, my colleague, Paul, &lt;a href="https://community.crossref.org/t/ticket-of-the-month-september-2023-a-doi-namic-timeline/4143" target="_blank">wrote a fantastic post on the forum&lt;/a> (with an excellent flowchart and all!), explaining when you can expect to see your DOI up and running. Often members will submit a deposit and expect the DOI to resolve immediately. When that doesn&amp;rsquo;t happen, many think that something has gone wrong or perhaps there is an error, but, in fact, our systems may still be updating and processing the metadata.&lt;/p>
&lt;p>I recommend giving these two posts a read if you&amp;rsquo;re at all concerned about whether you&amp;rsquo;re depositing your content correctly or not. Hopefully, they&amp;rsquo;ll help ease your content-registration worries.&lt;/p>
&lt;h2 id="isaac-technical-support-manager">Isaac, Technical Support Manager&lt;/h2>
&lt;p>Oh, thanks for asking! Many of our members, after receiving one of &lt;a href="https://www.crossref.org/documentation/reports/" target="_blank">our reports&lt;/a>, will respond to us in support with a message similar to: ‘What did I do wrong? Please help me fix this. I don’t want to be out of compliance!’&lt;/p>
&lt;p>The receipt of one of our reports does not necessarily mean that you’ve done anything wrong. In truth, the reports we send to our official member contacts are produced using very simple logic. It’s true that they may signal larger, more complicated problems, but we really need your help to determine next steps (and, in some cases, no action is needed because there is no issue for members to fix (e.g., many failed resolutions within the resolution reports)).&lt;/p>
&lt;p>Let’s look at the conflict and resolution reports since those are the reports we get the most questions about:&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://www.crossref.org/documentation/reports/conflict-report/" target="_blank">Conflict reports&lt;/a>&lt;/strong> are the most complicated of our reports to navigate. But, the reports are generated using simple logic: if you register two or more DOIs with matching bibliographic metadata, we’ll flag those DOIs as being in conflict, which will generate a warning message at the time of registration and a subsequent conflict report. When members receive this report, we often get the sense that members simply want us, the technical support team, to tell them how to fix it. The problem is we don’t know your content, so we don’t know if the two DOIs do represent a duplicate, or if both DOIs, while having very similar bibliographic metadata, are legitimate and will be maintained going forward (e.g., for errata). Paul wrote a great post in our community forum about what conflicts are and how to &lt;a href="https://community.crossref.org/t/ticket-of-the-month-october-2022-conflicts-and-how-to-resolve-them/3092" target="_blank">resolve them&lt;/a>.&lt;/p>
&lt;p>&lt;strong>&lt;a href="https://www.crossref.org/documentation/reports/resolution-report/" target="_blank">Resolution reports&lt;/a>&lt;/strong>, like conflict reports, are generated using simple logic: a resolution is the result of a click on that DOI. If a DOI has been registered, that click results in a successful resolution. If that DOI has not been registered, that click results in a failed resolution. Our monthly report is a count of those resolutions - successful and failed. Failures can represent content registration errors in a member’s workflow. Or, they can signal that an end user has made a mistake when attempting to click the DOI in question. So, for example, an end user perhaps added an extra period onto their DOI link. Instead of trying to resolve &lt;a href="https://doi.org/10.5555/cupnfcm2wj" target="_blank">https://doi.org/10.5555/cupnfcm2wj&lt;/a>, a legitimate DOI, they added a period to the end and tried to resolve &lt;a href="https://doi.org/10.5555/cupnfcm2wj" target="_blank">https://doi.org/10.5555/cupnfcm2wj&lt;/a>. instead. That extra period at the end of the DOI has made it a completely different DOI that is not registered with us, thus they get a failed resolution. This is pretty common. For members with content being regularly clicked, there will be user errors in the logs appearing as failed resolutions. The first question members should ask themselves when reviewing the failed .csv report within the resolution report is: ‘are any of these DOIs legitimate DOIs that I thought we had registered?’ We have more on the &lt;a href="https://community.crossref.org/t/ticket-of-the-month-october-2022-conflicts-and-how-to-resolve-them/3092" target="_blank">basics of resolution reports&lt;/a> also over in our Community Forum.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/DOI_NOT_FOUND.png"
alt="Preprint matching" width="70%">
&lt;/figure>
&lt;h2 id="paul-technical-support-specialist--rd-support-analyst">Paul, Technical Support Specialist &amp;amp; R&amp;amp;D Support Analyst&lt;/h2>
&lt;p>I know we were asked to name “one thing” but I have two that are closely related. May I snap my fingers twice and fix two issues? [Of course, Paul! Take it away!]&lt;/p>
&lt;p>&lt;strong>Paul’s first snap&lt;/strong>&lt;/p>
&lt;p>One of the most asked questions we get in support is “why is my DOI not working?” 90% of the time it is down to a failed submission. A good proportion of those failures are a result of title mismatches between the deposited container title and the one we have stored on the system here. There are other error messages that occur, too, which &lt;a href="https://doi.org/10.64000/9ftf4-evr94" target="_blank">I wrote about back in 2020&lt;/a>.&lt;/p>
&lt;p>So, “why do we fail submissions because of title differences?” You might ask.&lt;/p>
&lt;p>Well, the title and ISSN/ISBN and/or the title level DOIs act like locks to the title record, which need the right keys to unlock the title so that you can add or update the records against it. So if you don’t match what was in the original submission, you get a failure. Without that stringent check, we would have way too many iterations of titles and matching to those would be a nightmare. Not to mention sorting those DOIs into one container in the REST API.&lt;/p>
&lt;p>Isaac wrote a great forum post about these &lt;a href="https://community.crossref.org/t/ticket-of-the-month-october-2023-dispelling-pesky-journal-title-level-registration-errors/4282" target="_blank">title-level issues&lt;/a> as well.&lt;/p>
&lt;p>If a title update is required due to an error with an original title deposit, then these need to be made by the support team, so get in touch with us on the &lt;a href="https://community.crossref.org/tag/title_update" target="_blank">Community Forum&lt;/a>.&lt;/p>
&lt;p>&lt;strong>And, a second&lt;/strong>&lt;/p>
&lt;p>Permissions against titles and DOIs: Lots of our members don’t realise that each DOI has its own permissions against the prefix that currently ‘owns’ or is associated with that DOI in the background.&lt;/p>
&lt;p>It would be fair to assume you can tell just by looking at a DOI who the current publisher is, based on the prefix at the start —but that’s not always the case. Things can (and often do) change. Individual journals get purchased by other publishers, and whole organisations get bought and sold.&lt;/p>
&lt;p>What you can tell from looking at a DOI prefix is who originally registered it, but not necessarily who it currently belongs to. That’s because if a journal (or whole organisation) is acquired, DOIs don’t get deleted and re-registered to the new owner. The update will of course be reflected in the relevant metadata, but the prefix on the DOI will stay the same. It never changes—and that’s the whole point, that’s what makes the DOI persistent.
Isaac also wrote this in much more detail and explains the internal Crossref processes in his blog &lt;a href="https://doi.org/10.64000/91cyc-vka68" target="_blank">“What can often change, but always stays the same?“&lt;/a>&lt;/p>
&lt;p>These permissions are very important to understand when it comes to title transfers and working with updating your metadata against transferred DOIs to prevent duplicate DOIs for the same work.&lt;/p>
&lt;h2 id="poppy-technical-support-contractor">Poppy, Technical Support Contractor&lt;/h2>
&lt;p>As a researcher myself, I’d like to talk about references in a journal article, book, conference paper, etc. (I’ll just use ‘article’ going forward for simplicity). These are the references included in an article by the author. References in one article result in citations for another article. It&amp;rsquo;s the thing every author dreams of and accruing citations can be a big deal for authors, journals, and publishers.&lt;/p>
&lt;p>For readers, articles with no references can be less discoverable using systems that use citation links for relevance, and that discoverability is of critical importance for our members who decide to register references with us. We all want your content to be shared, cited, linked, and used far and wide.&lt;/p>
&lt;p>We receive many questions from authors asking why citations don’t show up; it&amp;rsquo;s usually due to metadata deposits with no references included. There may be an assumption that our process is like Google Scholar, which crawls full text and websites. This misunderstanding has a big impact on references and citation counts. However, as we do not store a copy of the paper, our intake system does not extract references from the article, regardless if they have a DOI. This is one of the main reasons that Crossref citation counts are lower than services that use extraction methods. We only store the data that a publisher registers and maintains with us. On deposit of a metadata record that includes references, our system performs a &lt;a href="https://doi.org/10.64000/pdm9z-20m09" target="_blank">matching process&lt;/a> - if there is a match, a cited-by connection is applied to the metadata. With deposits with no references, however, there is no data to match to other articles (and, therefore limitations on the discoverability and no cited-by count increase).&lt;/p>
&lt;p>An article with no references has big impacts for the authors, the journal, the publisher, researchers, and ultimately, the readers. This can mean decreased distribution of the content itself, reduced citation counts for cited articles, lower impact metrics for journals, and can ultimately affect value for publishers. For example, researchers just don’t include articles without references for scientometric analysis.&lt;/p>
&lt;p>Our &lt;a href="https://www.crossref.org/documentation/principles-practices/best-practices/references/" target="_blank">documentation on references&lt;/a> includes the elements for both &lt;a href="https://doi.org/10.64000/txft6-s1481" target="_blank">structured and unstructured data&lt;/a>. Including the DOI in the structured data is best practice as it provides a precise location with rich data for matching. If the matcher does not see a link between the deposited DOI and the cited DOI at the time of deposit, then the references are stored to be crawled with other matching algorithms later. So, we&amp;rsquo;re always working to create those rich cited-by linkages between works (raising the content’s profile and overall discoverability), no matter when you register reference metadata. You can also see how your publisher is doing on depositing references by viewing their &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Report&lt;/a>. If you are an author, you can &lt;a href="https://community.crossref.org/t/ticket-of-the-month-april-2022-reference-coverage-which-dois-have-i-registered-references-for/2670" target="_blank">check if your DOIs that were registered contained any references&lt;/a> by using our &lt;a href="https://api.crossref.org/swagger-ui/index.html" target="_blank">REST API&lt;/a>. &lt;em>Don’t see them?&lt;/em> You can always contact the editor of the journal or the publisher that published your paper and ask them to add them. &lt;em>Didn’t hear back?&lt;/em> Just drop us a line in the &lt;a href="https://community.crossref.org/tag/references" target="_blank">Community Forum&lt;/a>, we’re happy to help.&lt;/p>
&lt;h2 id="shayn-technical-support-specialist">Shayn, Technical Support Specialist&lt;/h2>
&lt;p>Let&amp;rsquo;s &amp;lsquo;zoom out&amp;rsquo; to the big picture. What are DOIs for? What makes them useful? What are we all doing here anyway?!&lt;/p>
&lt;p>There are a lot of different answers to those questions. It&amp;rsquo;s a complex picture. But, way back in the late ‘90s, the DOI system was designed in order to allow for the creation of unique and persistent identifiers. Crossref members use these identifiers to represent their research outputs and publications. This allows for reliable linking to those items, and the ability to identify and communicate the relationships between them, notably (but not exclusively!) citation relationships.&lt;/p>
&lt;p>So, what do we mean when we say that Crossref DOIs should be unique and persistent? In basic terms, &lt;strong>unique&lt;/strong> means that there is only a single Crossref DOI registered for a given citable research output. And, &lt;strong>persistent&lt;/strong> meaning that the DOI associated with a given research output today will continue to be associated with, and link to, that same research output indefinitely into the future.&lt;/p>
&lt;p>Yes, there are some &lt;a href="https://doi.org/10.64000/3gjb5-tkm69" target="_blank">grey areas&lt;/a>, and we know that everything doesn&amp;rsquo;t always work 100% perfectly all the time. But, the more &lt;a href="https://community.crossref.org/t/the-problem-with-duplicate-dois-and-how-you-can-help/2634" target="_blank">deviations from persistence and uniqueness&lt;/a>, the harder it becomes for end-users, publishers, Crossref, and other services which make use of our metadata to reliably find research outputs and reliably relate them to one another. It weakens the value and utility of DOIs for everyone.&lt;/p>
&lt;p>So, what does this mean in practice?&lt;/p>
&lt;ul>
&lt;li>Be certain that every item you register with Crossref is something you can maintain in the long-term.
&lt;ul>
&lt;li>Have an &lt;a href="https://community.crossref.org/t/tis-the-season-for-title-transfers/2328/3" target="_blank">arrangement with an archive&lt;/a> that can take responsibility for your content if your organisation stops hosting it or ceases to exist.&lt;/li>
&lt;li>Don’t register things that you know will only exist for a short time.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>When you&amp;rsquo;re about to register new content, be absolutely sure that it hasn’t been registered already, either by your organisation or any other organisation.
&lt;ul>
&lt;li>If you &lt;a href="https://community.crossref.org/t/top-tips-for-pain-free-title-transfer/2408" target="_blank">acquire a new journal&lt;/a> from another publisher, have a process in place to check what content has already been registered and adopt the use of the DOIs registered by the prior publisher for that content. We can always provide a list of the existing DOIs for a journal.&lt;/li>
&lt;li>If you publish books, and have a co-publishing agreement with another publisher, distributor, or hosting platform, be aware that one of those other parties may have already registered DOIs for your books. Adopt the use of those DOIs rather than assigning and registering new ones. And, if you don’t want them to do that going forward, communicate that to your co-publishing partners.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>When mistakes happen, inadvertently resulting in duplicate DOIs for a single item, identify them quickly. &lt;a href="https://community.crossref.org/t/ticket-of-the-month-october-2022-conflicts-and-how-to-resolve-them/3092" target="_blank">Alias&lt;/a> the new duplicate DOI to the long-standing original DOI, and remove all instances of the new DOI from your website or platform.&lt;/li>
&lt;li>Ensure that your publishing software, platform, or journal management system can accommodate DOIs with various prefixes for the same publication. You should be able to use (display, link, update metadata and URLs for) the DOIs registered for older content by any prior publishers as easily as you use the DOIs that you registered yourself for more recent content.&lt;/li>
&lt;/ul>
&lt;p>Things like &lt;em>persistence&lt;/em> and &lt;em>uniqueness&lt;/em> can sound like theoretical abstractions, but they actually play an important role in the day-to-day grind of your publishing operations. Their impact on linking, citing, discovery, and analysis of your content is concrete and important. Thus, it’s not surprising that we often hear from members and others in the research community who share this commitment to persistence, uniqueness, and overall rich, accurate metadata. You’ll see that play out in the Community Forum where &lt;a href="https://community.crossref.org/t/doi-registration-server-returning-an-error-no-response-from-serve/3219" target="_blank">members and users get involved&lt;/a> to troubleshoot issues, compare notes, and share ideas with us and one another. We appreciate the commitment to the Research Nexus and the overall spirit to serve in this growing community. Like we said at the top, we’re all wired to contribute in this way, so building an open, welcoming space that moves us forward excites us.&lt;/p>
&lt;p>Again, we invite you to join the discussion on this and many other Crossref-related topics over in our &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>.&lt;/p></description></item><item><title>The GEM program - year one</title><link>https://www.crossref.org/blog/the-gem-program-year-one/</link><pubDate>Wed, 24 Jan 2024 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/the-gem-program-year-one/</guid><description>&lt;p>In January 2023, we began our&lt;a href="https://www.crossref.org/gem/"> Global Equitable Membership (GEM) Program&lt;/a> to provide greater membership equitability and accessibility to organisations located in the least economically advantaged countries in the world. Eligibility for the program is based on a member&amp;rsquo;s country; our list of countries is predominantly based on the &lt;a href="https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups" target="_blank">International Development Association (IDA)&lt;/a>. Eligible members pay no membership or content registration fees.&lt;/p>
&lt;p>The list undergoes periodic reviews, as countries may be added or removed over time as economic situations change. Sri Lanka was added to the GEM program in March 2023 as they were recategorised to the IDA classification by the World Bank.&lt;/p>
&lt;p>When the program launched, we had 214 existing members eligible for the program who then were no longer charged for membership or content registration. Since the program began, we have welcomed an additional 131 new members into the program, including our first members from Cambodia and Togo.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Country&lt;/th>
&lt;th>As of 1/1/2023&lt;br> (start of GEM)&lt;/th>
&lt;th>Additions in 2023 &lt;br>(end of first year of GEM)&lt;/th>
&lt;th>Total&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Afghanistan&lt;/td>
&lt;td>6&lt;/td>
&lt;td>4&lt;/td>
&lt;td>10&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bangladesh&lt;/td>
&lt;td>56&lt;/td>
&lt;td>33&lt;/td>
&lt;td>89&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Benin&lt;/td>
&lt;td>1&lt;/td>
&lt;td>1&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bhutan&lt;/td>
&lt;td>4&lt;/td>
&lt;td>2&lt;/td>
&lt;td>6&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burkina Faso&lt;/td>
&lt;td>2&lt;/td>
&lt;td>0&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burundi&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Cambodia&lt;/td>
&lt;td>0&lt;/td>
&lt;td>2&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Central African Republic&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Congo, Democratic Republic&lt;/td>
&lt;td>1&lt;/td>
&lt;td>11&lt;/td>
&lt;td>12&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ethiopia&lt;/td>
&lt;td>4&lt;/td>
&lt;td>6&lt;/td>
&lt;td>10&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ghana&lt;/td>
&lt;td>14&lt;/td>
&lt;td>7&lt;/td>
&lt;td>21&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Guyana&lt;/td>
&lt;td>1&lt;/td>
&lt;td>1&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Haiti&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kosovo&lt;/td>
&lt;td>2&lt;/td>
&lt;td>2&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kyrgyz Republic&lt;/td>
&lt;td>22&lt;/td>
&lt;td>3&lt;/td>
&lt;td>25&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Laos&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Madagascar&lt;/td>
&lt;td>1&lt;/td>
&lt;td>1&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Malawi&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Maldives&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mali&lt;/td>
&lt;td>2&lt;/td>
&lt;td>0&lt;/td>
&lt;td>2&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mauritania&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Myanmar&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Nepal&lt;/td>
&lt;td>20&lt;/td>
&lt;td>18&lt;/td>
&lt;td>38&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Nicaragua&lt;/td>
&lt;td>1&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Rwanda&lt;/td>
&lt;td>4&lt;/td>
&lt;td>1&lt;/td>
&lt;td>5&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Senegal&lt;/td>
&lt;td>3&lt;/td>
&lt;td>3&lt;/td>
&lt;td>6&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Somalia&lt;/td>
&lt;td>2&lt;/td>
&lt;td>2&lt;/td>
&lt;td>4&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sri Lanka&lt;/td>
&lt;td>13&lt;/td>
&lt;td>5&lt;/td>
&lt;td>18&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sudan&lt;/td>
&lt;td>9&lt;/td>
&lt;td>2&lt;/td>
&lt;td>11&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tajikistan&lt;/td>
&lt;td>5&lt;/td>
&lt;td>1&lt;/td>
&lt;td>6&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tanzania&lt;/td>
&lt;td>9&lt;/td>
&lt;td>7&lt;/td>
&lt;td>16&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Togo&lt;/td>
&lt;td>0&lt;/td>
&lt;td>1&lt;/td>
&lt;td>1&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Uganda&lt;/td>
&lt;td>3&lt;/td>
&lt;td>6&lt;/td>
&lt;td>9&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Yemen&lt;/td>
&lt;td>16&lt;/td>
&lt;td>12&lt;/td>
&lt;td>28&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Zambia&lt;/td>
&lt;td>5&lt;/td>
&lt;td>0&lt;/td>
&lt;td>5&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>With help from our ambassadors based in GEM countries, we organised and co-hosted several webinars to introduce the program, along with an introduction to Crossref, and the benefits of including all kinds of research objects in the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>In April, our team, together with ambassador Binayak Raj Pandey, provided an overview of Crossref for members and organisations in Nepal. &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Our team and ambassadors, Dr Md Jahangir Alam and Shaharima Parvin hosted two webinars in May for members and organisations in Bangladesh. The first webinar provided an introduction to Crossref, our services, and the GEM Program. The second webinar focused on the methods to register content and how to add and update metadata. &lt;/p>
&lt;/li>
&lt;li>
&lt;p>In September, ambassador Baraka Manjale Ngussa joined us for an introductory webinar aimed at organisations in Tanzania&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In November, CARLIGH (the Consortium of Academic and Research Libraries in Ghana), Crossref, and EIFL co-hosted a webinar for librarians and journal editors in Ghana with a discussion on the GEM program and Crossref services.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>In 2024, we will continue to collaborate with our ambassadors and other members of the community to offer more opportunities for organisations in GEM-eligible countries to learn about the program and the benefits of membership for content discovery.&lt;/p>
&lt;p>The program was initially met with scepticism by some organisations in GEM-eligible countries, who wanted to be certain that it wasn&amp;rsquo;t a free trial, that there are no hidden fees, or that they would be required to pay later for other services. Others expressed concern that Crossref would introduce fees after a year or two. Though we were able to clarify these aspects of the program, we understand the concerns and are working to ensure we provide clarity and transparency about the program. Additionally, we will be conducting a complete review of our fees in 2024, and we will ensure that GEM-eligible members will have input.&lt;/p>
&lt;p>Although the program offers relief from fees, many organisations require technical assistance and language support. The GEM program would benefit from an increase in local &lt;a href="https://www.crossref.org/community/sponsors/">Sponsors&lt;/a> to facilitate membership and provide support, particularly In countries with the highest growth, such as Bangladesh, Nepal, Yemen, Kyrgyz Republic, and Ghana. Though we have Sponsors working with members who are in GEM countries (e.g. PKP), we do not yet have any Sponsors who are based in a GEM country.&lt;/p>
&lt;p>We will be working with relevant like-minded organisations, such as PKP, DOAJ, INASP, OASPA, EIFL, and others, to help identify suitable candidates for new Sponsors in underserved regions and engage them proactively. Additionally, we will consult with our ambassadors in GEM countries to help identify potential Sponsors. We are beginning the year by making the most of the momentum created in African countries (Uganda, Ghana, Tanzania) and looking to develop new networks in other parts of the world in Q2-Q4 of this year.&lt;/p></description></item><item><title>Increasing Crossref Data Reusability With Format Experiments</title><link>https://www.crossref.org/blog/increasing-crossref-data-reusability-with-format-experiments/</link><pubDate>Fri, 19 Jan 2024 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/increasing-crossref-data-reusability-with-format-experiments/</guid><description>&lt;p>Every year, Crossref releases a full public data file of all of our metadata. This is partly a &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">commitment to POSI&lt;/a> and partly just what we do. We want the community to re-use our metadata and to find interesting ends to which they can be put!&lt;/p>
&lt;p>However, we have also recognized, for some time, that 170GB of compressed .tar.gz files, spread over 27,000 items, is not the easiest of formats with which to work. For instance, there&amp;rsquo;s no indexing capacity on these files, meaning that it is virtually impossible simply to pull out the record for a DOI. Decompressing the .tar.gz files takes a good three hours or more even on high-end hardware, without any additional processing.&lt;/p>
&lt;p>To that end, the Crossref Labs team has been experimenting with different formats for trial release that might allow us to reach broader audiences, including those who have not previously worked with our metadata files. The two new formats, alongside the existing data file format, with which we have been experimenting, are JSON lines and SQLite.&lt;/p>
&lt;p>&lt;strong>JSON-L&lt;/strong>&lt;/p>
&lt;p>The first format with which we&amp;rsquo;ve been experimenting is JSON-L (JSON lines). With one JSON entry per line, as opposed to one giant JSON file/block, JSON-L lends itself to better parallelisation in systems such as SPARC, because the data can easily be partitioned.&lt;/p>
&lt;p>This data format also has the benefit of being appendable, one line at a time. Unlike conventional JSON, which requires the entire structure to be parsed in-memory before an append is possible, JSON-L can simply be written to and updated. It&amp;rsquo;s also possible to do multi-threaded write operations on the file, without each thread having to parse the entire JSON structure and then sync with other threads.&lt;/p>
&lt;p>In our experiments, JSON-L came with substantial parallelisation benefits. Our routines to calculate citation counts can be completed in ~20-25 minutes. Calculating the number of resolutions per container title takes less than half an hour.&lt;/p>
&lt;p>&lt;strong>SQLite&lt;/strong>&lt;/p>
&lt;p>SQLite is a library written in C with client bindings for Python, Java, C#, and many other languages that produces an on-disk, portable, single-file SQL database. You can produce the SQLite file using our &lt;a href="https://gitlab.com/crossref/labs/rustsqlitepacker" target="_blank">openly available Rust program, rustsqlitepacker&lt;/a>. We also have a Python script that can produce the final SQLite file, for those happier working in this language.&lt;/p>
&lt;p>The resultant SQLite file is approximately 900GB in size, so it requires quite a lot of free disk space to create in the first place (alongside storage of the data file that is needed to build it). However, queries are snappy when looking up by DOI and other indexes can be constructed (the indexing part of the procedure takes about 1.5 hours per field).&lt;/p>
&lt;p>The database structure, at present, is the bare minimum that will work. It contains a list of fields for searching/indexing &amp;ndash; DOI, URL, member, prefix, type, created, and deposited &amp;ndash; and a metadata field that contains the JSON response that would be returned by the API for this value.&lt;/p>
&lt;p>This allows for the processing and extraction of individual JSON elements using SQLite&amp;rsquo;s built-in json_extract method. For example, to get just the title of an item, you can use:&lt;/p>
&lt;p>SELECT json_extract(metadata, &amp;lsquo;$.title&amp;rsquo;) from works WHERE doi=&amp;ldquo;10.1080/10436928.2020.1709713&amp;rdquo;;&lt;/p>
&lt;p>The balance that we have had to strike here is between flattening the JSON so that more fields are indexable and searchable, as against the trade-off in time and processing that this takes to create the database in the first place. The first draft version of our experiment was wildly ambitious in flattening all the records and using an Object Relation Mapper (ORM) to present Python models of the database. Like &lt;a href="https://en.wiktionary.org/wiki/like_painting_the_Forth_Bridge" target="_blank">painting the Forth Bridge&lt;/a>, this initial attempt would not finish in any sane length of time. Indeed, by the time we&amp;rsquo;d created this year&amp;rsquo;s data file, we&amp;rsquo;d need to begin work on the next.&lt;/p>
&lt;p>What are the anticipated use cases here? When people need to do an offline metadata search on an embedded device, for instance, the portability and indexed lookup of the SQLite database can be very appealing. One of our team has even got the database running on a &lt;a href="https://www.raspberrypi.com/products/raspberry-pi-5/" target="_blank">Raspberry Pi 5&lt;/a>. You can also load the database into &lt;a href="https://datasette.io/" target="_blank">Datasette&lt;/a> if you want to explore it visually.&lt;/p>
&lt;p>Where do we go from here with this? It would be good to flatten a few more fields, but we would welcome feedback on use cases that we haven&amp;rsquo;t anticipated for SQLite and we&amp;rsquo;d love to hear whether this is already too unwieldy (at 900GB).&lt;/p>
&lt;p>&lt;strong>Data Files&lt;/strong>&lt;/p>
&lt;p>As usual, we will be releasing the annual data file in the next few months. As an experiment this year, we will also be releasing the tools that can be used on that file to produce these alternative file formats. We will consider releasing the final data files for each of these formats, too.&lt;/p>
&lt;p>What we would like to hear from the community is whether there are other data file formats that you might wish to use. Are there use cases that we haven&amp;rsquo;t anticipated? What would you ideally like in terms of file formats?&lt;/p></description></item><item><title>I4OA Hall of Fame - 2023 edition</title><link>https://www.crossref.org/blog/i4oa-hall-of-fame-2023-edition/</link><pubDate>Tue, 09 Jan 2024 00:00:00 +0000</pubDate><author>Bianca Kramer</author><guid>https://www.crossref.org/blog/i4oa-hall-of-fame-2023-edition/</guid><description>&lt;p>The &lt;a href="https://i4oa.org/" target="_blank">Initiative for Open Abstracts (I4OA)&lt;/a> was launched in September 2020 to advocate and promote the unrestricted availability of the abstracts of the world&amp;rsquo;s scholarly publications, particularly journal articles and book chapters, in trusted repositories where they are open and machine-accessible. I4OA calls on all scholarly publishers to open the abstracts of their published works and, where possible, to submit them to Crossref.&lt;/p>
&lt;p>Since the launch of I4OA, we have been tracking the openness of abstracts for all Crossref members over time (for data and code, see this &lt;a href="https://github.com/bmkramer/I4OA" target="_blank">GitHub repository&lt;/a>). For a subset of 40+, mostly larger, publishers, the proportion of current journal articles (published in the current year and preceding two years) that have abstracts deposited in Crossref is shown in a &lt;a href="https://i4oa.org/#:~:text=are%20shown%20in-,orange,-." target="_blank">chart on the I4OA website&lt;/a>, which is updated quarterly (Figure 1).&lt;/p>
&lt;center>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/I4OA_chart_current_2024_01_01.png"
alt="An image of a dot graph titled &amp;#39;selected publishers - abstracts in Crossref. Journal articles (2021-2023) per 2024-01-01&amp;#39;" width="75%">
&lt;/figure>
&lt;/center>
&lt;p>&lt;em>Figure 1: Proportion of current journal articles from selected publishers that have open abstracts in Crossref. Data collected on January 1, 2024 for publication years 2021-2023. Publishers already supporting I4OA are shown in orange.&lt;/em>&lt;/p>
&lt;p>These longitudinal data and accompanying visualisations allow us to identify and highlight good examples from 2023: publishers (both large and small) who newly started to make abstracts openly available last year and/or who managed to get the proportion of their articles with open abstracts close to 100%&lt;sup>1&lt;/sup>.&lt;/p>
&lt;p>While we highlight some of these examples below in our &amp;lsquo;Hall of Fame&amp;rsquo;, it&amp;rsquo;s important to also acknowledge all the publishers that already were depositing abstracts to Crossref for most or all of their journal articles prior to 2023, thereby contributing to the availability of abstracts as part of a rich ecosystem of open metadata, &lt;a href="https://www.leidenmadtrics.nl/articles/why-openly-available-abstracts-are-important-overview-of-the-current-state-of-affairs" target="_blank">for others to use and build upon&lt;/a>. &lt;/p>
&lt;h2 id="hall-of-fame---part-1-publishers-included-in-i4oa-visualisation">Hall of Fame - Part 1: publishers included in I4OA visualisation&lt;/h2>
&lt;p>For the set of (mostly larger) publishers included in the visualisation on the I4OA website, Figure 2 shows the difference in the proportion of abstracts available in Crossref between January and December 2023 for journal articles published in 2021-2023.&lt;/p>
&lt;p>A number of publishers stand out from this figure:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>Wiley&lt;/strong> announced in &lt;a href="https://www.wiley.com/en-us/network/publishing/research-publishing/open-access/wiley-expands-commitment-to-open-research" target="_blank">October 2022&lt;/a> that it was joining I4OA and would be making abstracts available through Crossref. In August 2023, Wiley started to deposit abstracts to Crossref, and at the end of 2023, the proportion of current journal articles with open abstracts was 77%.&lt;/p>
&lt;p>This makes Wiley the first of the four largest traditional commercial publishers to deposit abstracts for the majority of journal articles they publish. Springer Nature does this only for their current open access articles, while Elsevier and Taylor &amp;amp; Francis&lt;sup>2&lt;/sup> do not yet provide abstracts to Crossref at all. SAGE, the fifth largest traditional commercial publisher, was a founding member of I4OA and has open abstracts for 85% of current journal articles. &lt;br>&lt;br>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Among society publishers, the &lt;strong>American Geophysical Union (AGU)&lt;/strong> went from 7% to 99% open abstracts for current journal articles last year, which is a great achievement. The publishing arm of the &lt;strong>American Institute of Physics (AIP Publishing)&lt;/strong> joins them in reaching close to 100% open abstracts, going from 41% to 95% in 2023.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;hr>
&lt;p>&lt;sup>1&lt;/sup>Depending on the type of journal(s) of a given publisher, the maximal coverage of open abstracts will often be somewhat below 100%, as in Crossref, all journal content is assigned the type ‘journal article’. This includes e.g. editorials, letters to the editor and other publication types that are not always expected to have abstracts.&lt;/p>
&lt;p>&lt;sup>2&lt;/sup>The numbers for Wiley and Taylor &amp;amp; Francis do not include Hindawi and F1000 Research, respectively, as these have separate Crossref member IDs. As most full open access publishers, both Hindawi and F1000 Research have high proportions of open abstracts (81% and 98%, respectively).&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>CAIRN&lt;/strong> and &lt;strong>Project Muse&lt;/strong>, two publishing platforms in the humanities and social sciences representing a number of individual publishers, both started including abstracts in the metadata they provide to Crossref in 2023. At the end of 2023, CAIRN had abstracts available for 41% of current journal articles, while Project Muse was just starting out at 5%. Both will hopefully increase further this coming year.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Returning to traditional commercial publishers, &lt;strong>Wolters Kluwer Health&lt;/strong>, part of Wolters Kluwer, had seen a slow growth in the proportion of journal articles with open abstracts in the years prior to 2023, going from 2% to 10%. However, they showed a rapid increase in 2023, ending the year with 52% open abstracts.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>While it is good to see publishers who have publicly committed their &lt;a href="https://i4oa.org/#:~:text=Publishers%20supporting%20I4OA" target="_blank">support for I4OA&lt;/a> follow through with opening their abstracts (like Wiley and AIP), it is also very encouraging to see publishers who are not (yet) listed as I4OA supporters do so. This shows a growing awareness and action on this issue beyond advocacy through I4OA alone. And of course, we would love to list these publishers on our website as official supporters of I4OA!&lt;/p>
&lt;p>Figure 2 also shows some cases where the proportion of open abstracts has gone down during the year. This can be due to temporary technical issues in depositing abstracts (as was the case for Hindawi). Theoretically, the proportion of open abstracts can also go down when publishers stop providing abstracts altogether during the year, but we have not observed that to be the case.&lt;/p>
&lt;center>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2024/I4OA_chart_diff.png"
alt="An image of a dot graph titled &amp;#39;selected publishers - abstracts in Crossref. January - December 2023 (journal articles 2021-2023&amp;#39;" width="75%">
&lt;/figure>
&lt;/center>
&lt;p>&lt;em>Figure 2: Development in the proportion of open abstracts in 2023 for current journal articles (publication years 2021-2023) from selected publishers. Publishers already supporting I4OA are shown in orange. Light orange/blue dots show the proportion of open abstracts in January 2023, and dark orange/blue dots in December 2023.&lt;/em>&lt;/p>
&lt;h2 id="hall-of-fame---part-2-other-publishers">Hall of Fame - Part 2: other publishers&lt;/h2>
&lt;p>Among the many publishers not included in the limited selection shown in the I4OA visualisation, there are also some interesting highlights of publishers either starting out to deposit abstracts (and reaching a sizeable proportion) or having deposited open abstracts for almost all their current journal articles in 2023. The examples below drew our attention in 2023; they include a number of medium-sized publishers as well as a group of smaller publishers that deserve special attention.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>The &lt;strong>European Molecular Biology organisation (EMBO)&lt;/strong> went from 0% to 42% open abstracts in 2023. However, from January 2024 onwards, several EMBO journals were transferred to Springer Nature, so EMBO can no longer be tracked at publisher level in Crossref. It will still be possible to look at the development of open abstracts for individual EMBO journals.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;strong>Institution of Engineering and Technology (IET)&lt;/strong>, a medium-sized publisher, started to deposit abstracts in 2023, reaching 33% open abstracts for current journal articles at the end of the year.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;strong>Acoustical Society of America (ASA)&lt;/strong> had open abstracts for almost all their current journal articles at the end of 2023, increasing from 50% to 97%.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Finally, in the second quarter of 2023, a group of &lt;strong>over 200 smaller Turkish publishers&lt;/strong> saw large increases in their coverage of open abstracts, resulting in open abstracts for 95%-100% of their current journal articles. Consultation with Crossref pointed to the potential supporting role of &lt;a href="https://dergipark.org.tr/en/" target="_blank">DergiPark&lt;/a>, one of the largest Crossref sponsors in Turkey. This is a great example of developments in open metadata at smaller publishers.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="looking-forward">Looking forward &lt;/h2>
&lt;p>At the beginning of 2024, the proportion of current journal articles published by Crossref members with open abstracts has reached 49.7%, up from 20.7% when I4OA was launched in September 2020. This is thanks to a growing number of publishers who are depositing abstracts to Crossref, often depositing open abstracts for close to 100% of their journal articles.&lt;/p>
&lt;p>This blog post has highlighted a number of publishers who contributed to this growth in the availability of open abstracts in 2023. We hope these examples will inspire other publishers to start doing the same and are looking forward to following the growth in the availability of open abstracts in 2024.&lt;/p>
&lt;p>For publishers that started to deposit abstracts in recent years and are doing so for newly published articles only, our data on open abstracts for current journal articles will look better in 2024 than in 2023, as only articles published in the current year and two preceding years are taken into account.&lt;/p>
&lt;p>However, the benefits of having abstracts openly available from a central location such as Crossref (both for direct usage and for integration in other open scholarly infrastructures) are not limited to recent publications only. Hopefully, publishers currently depositing abstracts to Crossref will continue to do so both for newly published articles as well as for the backfiles of journal articles already published.&lt;/p>
&lt;p>Publishers who would like to be added to the list of I4OA supporters, or who would like more information on how to deposit abstracts for both new and existing journal articles, are very welcome to &lt;a href="mailto:openabstracts@gmail.com">reach out to I4OA&lt;/a>. More information about open abstracts in general, and I4OA in particular, can also be found in the &lt;a href="https://i4oa.org/faqs.html" target="_blank">FAQ&lt;/a> on the I4OA website.&lt;/p>
&lt;p>&lt;em>The author would like to thank Ludo Waltman (CWTS) and Ginny Hendricks (Crossref) for useful feedback on an earlier draft of this post.&lt;/em>&lt;/p>
&lt;p>&lt;em>This blog post is published under a CC BY 4.0 license. The header image is an adaptation of an image by Adam Jones available from Wikimedia Commons (&lt;a href="https://commons.wikimedia.org/wiki/File:Interior_02_of_Rock_%26_Roll_Hall_of_Fame_and_Museum,_Cleveland_%28by_Adam_Jones%29.jpg" target="_blank">https://commons.wikimedia.org/wiki/File:Interior_02_of_Rock_%26_Roll_Hall_of_Fame_and_Museum,_Cleveland_%28by_Adam_Jones%29.jpg&lt;/a>) and is shared under a CC BY-SA license.&lt;/em>&lt;/p></description></item><item><title>Discovering relationships between preprints and journal articles</title><link>https://www.crossref.org/blog/discovering-relationships-between-preprints-and-journal-articles/</link><pubDate>Thu, 07 Dec 2023 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/discovering-relationships-between-preprints-and-journal-articles/</guid><description>&lt;p>In the scholarly communications environment, the evolution of a journal article can be traced by the relationships it has with its preprints. Those preprint–journal article relationships are an important component of &lt;a href="https://www.crossref.org/documentation/research-nexus/">the research nexus&lt;/a>. Some of those relationships are provided by Crossref members (including publishers, universities, research groups, funders, etc.) when they deposit metadata with Crossref, but we know that a significant number of them are missing. To fill this gap, we developed a new automated strategy for discovering relationships between preprints and journal articles and applied it to all the preprints in the Crossref database. We made the resulting dataset, containing both publisher-asserted and automatically discovered relationships, &lt;a href="https://doi.org/10.5281/zenodo.10144856" target="_blank">publicly available&lt;/a> for anyone to analyse.&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>
&lt;p>We have developed a new, heuristic-based strategy for matching journal articles to their preprints. It achieved the following results on the evaluation dataset: precision 0.99, recall 0.95, F0.5 0.98. The code is available &lt;a href="https://gitlab.com/crossref/labs/marple/-/blob/main/strategies_available/preprint_sbmv/strategy.py" target="_blank">here&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We applied the strategy to all the preprints in the Crossref database. It discovered 627K preprint–journal article relationships.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We gathered all preprint–journal article relationships deposited by Crossref members, merged them with those discovered by the new strategy, and made everything available as &lt;a href="https://doi.org/10.5281/zenodo.10144856" target="_blank">a dataset&lt;/a>. There are 642K relationships in the dataset, including:&lt;/p>
&lt;ul>
&lt;li>296K provided by the publisher and discovered by the strategy,&lt;/li>
&lt;li>331K new relationships discovered by the strategy only,&lt;/li>
&lt;li>15K provided by the publisher only.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>In the future, we plan to replace our current matching strategy with the new one and make all discovered relationships available through the Crossref REST API.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>Relationships between preprints and journal articles link different versions of research outputs and allow one to follow the evolution of a publication over time. The Crossref deposit schema allows Crossref members to provide these relationships for new publications, either as a &lt;em>has-preprint&lt;/em> relationship deposited with a journal article, or an &lt;em>is-preprint-of&lt;/em> relationship deposited with a preprint.&lt;/p>
&lt;p>To assist members who deposit preprints, we also try to connect deposited journal articles with preprints. The current method looks for an exact match between the title and first authors. We send possible matches as suggestions to the preprint server, which decides whether to update the metadata with the relationship.&lt;/p>
&lt;p>At the time of writing, 137,837 journal articles in the Crossref database have a &lt;em>has-preprint&lt;/em> relationship&lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>, and 562,225 works of type posted-content (preprints belong to this type) have an &lt;em>is-preprint-of&lt;/em> relationship&lt;sup id="fnref:2">&lt;a href="#fn:2" class="footnote-ref" role="doc-noteref">2&lt;/a>&lt;/sup>.&lt;/p>
&lt;p>We suspected that many preprint–journal article relationships are missing, as some members inevitably fail to deposit them, even after suggestions from the current matching strategy. Another factor is that the current strategy is fairly conservative, and probably misses a significant number of relationships. For these reasons, we decided to investigate whether we could improve on the current process. Doing so would allow us to infer missing relationships on a large scale, similar to how we automatically match bibliographic references to DOIs.&lt;/p>
&lt;p>This preprint matching task can be defined in two directions:&lt;/p>
&lt;ul>
&lt;li>We start with a journal article and we want to find all its preprints.&lt;/li>
&lt;li>We start with a preprint and we want to find a subsequently published journal article.&lt;/li>
&lt;/ul>
&lt;p>On the one hand, matching from journal articles to preprints would allow us to enrich the database continually with new relationships, either periodically or every time new content is added. Since journal articles tend to appear in the database later than their preprints, it makes sense for a new journal article to trigger the matching and not the other way round. This way we can expect the potential matches to be already in the database at the time of matching.&lt;/p>
&lt;p>On the other hand, matching from preprints to journal articles can be useful in a situation where we want to add relationships in an existing database retrospectively. In our case, the database contains many more journal articles than preprints, so for performance reasons it is better to start with preprints.&lt;/p>
&lt;p>In both cases we are dealing with structured matching, meaning that we match a metadata record of a work (preprint or journal article), rather than unstructured text.&lt;/p>
&lt;p>As a result of matching a single preprint or a single journal article, we should expect zero or more matched journal articles/preprints. Multiple matches occur when:&lt;/p>
&lt;ul>
&lt;li>there are multiple versions of the matched preprint and/or&lt;/li>
&lt;li>matched works have duplicates.&lt;/li>
&lt;/ul>
&lt;p>The image shows the result of matching a journal article to two versions of a preprint:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/preprint-matching.png"
alt="Preprint matching" width="70%">
&lt;/figure>
&lt;br/>
&lt;h2 id="matching-strategy">Matching strategy&lt;/h2>
&lt;p>Our matching strategy uses the following workflow:&lt;/p>
&lt;ol>
&lt;li>Gathering a short list of candidates using the Crossref REST API.&lt;/li>
&lt;li>Scoring the similarity between the input item and each candidate.&lt;/li>
&lt;li>A final decision about which candidates, if any, should be returned as matches.&lt;/li>
&lt;/ol>
&lt;p>Gathering candidates is done using the Crossref REST API&amp;rsquo;s &lt;em>query.bibliographic&lt;/em> parameter. The query is a concatenation of the title and authors&amp;rsquo; last names of the input item. We filter the candidates based on their type, to leave only preprints or only journal articles, depending on the direction of the matching. In the future, instead of getting the candidates from the REST API, we will be using a dedicated search engine, optimised for preprint matching.&lt;/p>
&lt;p>Scoring candidates is heuristic-based. Similarities between titles, authors, and years are scored independently, and the final score is their average. Titles are compared in a fuzzy way using the &lt;a href="https://pypi.org/project/rapidfuzz/" target="_blank">rapidfuzz library&lt;/a>. Authors are compared pairwise using the ORCID ID, or first/last names if ORCID ID is not available. The similarity score between issued years is 1 if the article was published no earlier than one year before the preprint and no later than three years after the preprint, or 0 otherwise.&lt;/p>
&lt;p>The final decision is made based on two parameters: minimum score and maximum score difference, both chosen based on a validation dataset. The following diagram depicts the results of applying these two parameters in all possible scenarios. First, any candidate scoring below the minimum score is rejected (grey area in the diagram). Second, the scores of the remaining candidates are compared with the score of the top candidate. If the score of a candidate is close enough to the score of the top candidate, it is returned as a match (blue area).&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/preprint-matching-scenarios.png"
alt="Preprint matching scenarios" width="70%">
&lt;/figure>
&lt;br/>
&lt;p>This process can result in the following scenarios:&lt;/p>
&lt;ul>
&lt;li>Scenario A: there is no candidate above the minimum score. This means nothing matches sufficiently, so nothing is returned.&lt;/li>
&lt;li>Scenario B: there is only one candidate above the minimum score. This means it is the best match and we don&amp;rsquo;t have much of a choice, so it is returned.&lt;/li>
&lt;li>Scenario C: there are multiple candidates above the minimum score, and they all have similar scores. This means they all are similarly good matches, so all are returned.&lt;/li>
&lt;li>Scenario D: there are multiple candidates above the minimum score, but their scores differ a lot. In this case, we don&amp;rsquo;t want to return all of them, but only those that are close to the top match. Intuitively, we don&amp;rsquo;t want to return less-than-great matches if we have really great ones. This is when the maximum score difference comes into play: we return the candidates with the “score distance” to the top candidate lower than the maximum score difference.&lt;/li>
&lt;/ul>
&lt;p>We evaluated this strategy on a test set sampled from the Crossref metadata records. The test set contains 3,000 pairs (journal article, set of corresponding preprints). Half of the journal articles have known preprints and the other half don&amp;rsquo;t. The test set can be accessed &lt;a href="https://gitlab.com/crossref/labs/marple/-/blob/main/data/datasets/preprints-rest-api-2023-06-23.json" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>We used precision, recall, and F0.5 as evaluation metrics:&lt;/p>
&lt;ul>
&lt;li>Precision measures the fraction of the matched relationships that are correct.&lt;/li>
&lt;li>Recall measures the fraction of the true relationships that were matched.&lt;/li>
&lt;li>F0.5 combines precision and recall in a way that favours precision.&lt;/li>
&lt;/ul>
&lt;p>The strategy achieved the following results: precision 0.9921, recall 0.9474, F0.5 0.9828. The average processing time was 0.96s.&lt;/p>
&lt;p>We have made this strategy (journal article -&amp;gt; preprints) available through the (experimental) API: &lt;a href="https://marple.research.crossref.org/match?task=preprint-matching&amp;amp;strategy=preprint-sbmv&amp;amp;input=10.1109/access.2022.3213707" target="_blank">https://marple.research.crossref.org/match?task=preprint-matching&amp;strategy=preprint-sbmv&amp;input=10.1109/access.2022.3213707&lt;/a>. The input is the DOI of a journal article we want to match to preprints, and the output is a list of matches found, along with the score for each.&lt;/p>
&lt;p>We have investigated other approaches to making decisions about which candidates to return as matches (step 3 above), including using machine learning. At present none have outperformed the heuristic approach described above. The heuristic method is also preferred because of its fast performance.&lt;/p>
&lt;h2 id="preprintjournal-article-relationship-dataset">Preprint–journal article relationship dataset&lt;/h2>
&lt;p>We applied the strategy to the entire Crossref database:&lt;/p>
&lt;ol>
&lt;li>We selected all preprints published until the end of August 2023. This included only works with type &lt;em>posted-content&lt;/em> and subtype &lt;em>preprint&lt;/em>, as reported by the REST API. There were 1,050,247 of them.&lt;/li>
&lt;li>We ran the matching strategy (preprint -&amp;gt; journal article) on them. This resulted in 627,011 preprint–journal article relationships.&lt;/li>
&lt;li>The resulting relationships were combined with the relationships deposited by the Crossref members. We included relationships of types &lt;em>has-preprint&lt;/em> or &lt;em>is-preprint-of&lt;/em>, where both sides of the relationship exist in our database, were published until the end of August 2023, and are of proper types and subtypes (type=&lt;em>journal-article&lt;/em> for the journal article and type=&lt;em>posted-content&lt;/em>, subtype=&lt;em>preprint&lt;/em> for the preprint).&lt;/li>
&lt;/ol>
&lt;p>The resulting dataset is a single CSV file with the following fields:&lt;/p>
&lt;ul>
&lt;li>preprint DOI (string)&lt;/li>
&lt;li>journal article DOI (string)&lt;/li>
&lt;li>whether the publisher of the journal article deposited this relationship (boolean)&lt;/li>
&lt;li>whether the publisher of the preprint deposited this relationship (boolean)&lt;/li>
&lt;li>the confidence score returned by the strategy (float, empty if the strategy did not discover this relationship)&lt;/li>
&lt;/ul>
&lt;p>The dataset contains:&lt;/p>
&lt;ul>
&lt;li>641,950 relationships in total, including 580,532 preprints and 565,129 journal articles,&lt;/li>
&lt;li>14,939 of them were deposited by the Crossref members, but not discovered by the strategy,&lt;/li>
&lt;li>330,826 of them were discovered by the strategy, but not provided by any Crossref member,&lt;/li>
&lt;li>296,185 of them were both deposited by a Crossref member and discovered by the strategy.&lt;/li>
&lt;/ul>
&lt;p>The dataset can be downloaded &lt;a href="https://doi.org/10.5281/zenodo.10144856" target="_blank">here&lt;/a>.&lt;/p>
&lt;h2 id="conclusions-and-whats-next">Conclusions and what&amp;rsquo;s next&lt;/h2>
&lt;p>Overall, based on the number of existing and newly discovered preprint–journal article relationships, it seems that employing automated matching strategies would approximately double the number of these relationships in the Crossref database. In the future, we would like to match new journal articles on an ongoing basis. We also plan to make all discovered relationships available through the REST API.&lt;/p>
&lt;p>In the meantime, we will be publishing the discovered relationships in the form of datasets, and we invite anyone interested to further analyse this data. And if you find out something interesting about preprints and their relationships, do let us know!&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>&lt;a href="https://api.crossref.org/types/journal-article/works?filter=relation.type:has-preprint" target="_blank">https://api.crossref.org/types/journal-article/works?filter=relation.type:has-preprint&lt;/a>&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:2">
&lt;p>&lt;a href="https://api.crossref.org/types/posted-content/works?filter=relation.type:is-preprint-of" target="_blank">https://api.crossref.org/types/posted-content/works?filter=relation.type:is-preprint-of&lt;/a>&amp;#160;&lt;a href="#fnref:2" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Perspectives: Madhura Amdekar on meeting the community and pursuing passion for research integrity</title><link>https://www.crossref.org/blog/perspectives-madhura-amdekar-meeting-community-pursuing-research-integrity/</link><pubDate>Tue, 05 Dec 2023 00:00:00 +0000</pubDate><author>Madhura Amdekar</author><guid>https://www.crossref.org/blog/perspectives-madhura-amdekar-meeting-community-pursuing-research-integrity/</guid><description>&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2022/perspectives.png" alt="sound bar logo" width="150px" class="img-responsive" />
&lt;/div>
&lt;p>The second half of 2023 brought with itself a couple of big life changes for me: not only did I move to the Netherlands from India, I also started a new and exciting job at Crossref as the newest Community Engagement Manager. In this role, I am a part of the Community Engagement and Communications team, and my key responsibility is to engage with the global community of scholarly editors, publishers, and editorial organisations to develop sustained programs that help editors to leverage rich metadata.&lt;/p>
&lt;p>This represents an exciting phase in my professional journey, as I now have the chance to learn and develop new skills, broaden my understanding of the publishing landscape, and at the same time be able to leverage the experience I gained so far. I originally trained as an ecologist, obtaining a PhD studying colour change in a tropical agamid lizard in India at the &lt;a href="https://iisc.ac.in/" target="_blank">Indian Institute of Science&lt;/a> (Bengaluru, India). Having immensely enjoyed the process of writing manuscripts based on the data that resulted from my PhD thesis, I was drawn to working in the scholarly publishing industry. I worked for 3.5 years as a Senior Associate at Wiley, overseeing an editor support service by devising strategic scale-up planning and process improvement initiatives.&lt;/p>
&lt;p>I then moved countries as well as jobs and joined Crossref. The world of scholarly communications is a rapidly changing ecosystem, that is ably supported by scholarly infrastructure - the sets of tools and services that support this industry. Being a part of Crossref, a global organisation that provides open scholarly infrastructure, allows me to work with and make an impact on the broad scholarly community that ranges from publishers of all shapes and sizes, funders, to academic institutions, and researchers.&lt;/p>
&lt;p>So far, the &lt;a href="https://www.crossref.org/categories/research-integrity/">integrity of the scholarly record (ISR)&lt;/a> has been the focus of my work. Now more than ever, the community is cognizant of the need to uphold the integrity of the scholarly output. Metadata and relationships between research outputs can support this endeavour in a substantial manner because information such as who contributed to a research output, who funded it, who cites it, whether it was updated after publication, aids provenance and provides signals about whether the output is trustworthy.&lt;/p>
&lt;p>Most of Crossref’s tools and services play a key role here: be it reference linking to allow researchers to increase discoverability of their work, tracking post-publication updates to research outputs via Crossmark, or detecting text plagiarism via Similarity Check. We noticed that not all editors and editorial teams associate metadata as signals of integrity, and might be unaware of the benefits of rich metadata. Therefore, my priority is to utilise opportunities to engage with editors about how metadata can provide trust indicators about a research output. I aim to empower editors to collect and leverage rich metadata.&lt;/p>
&lt;p>While I am no stranger to the world of scholarly communications, engaging with the broader Crossref community has been a new experience for me. In my day to day work, I employ a range of different skills such as program design and management, content planning and outreach, networking, and meeting facilitation. I have also been participating in trainings to enhance my skill set – I recently completed a training course on &lt;a href="https://www.cscce.org/trainings/cef/" target="_blank">Community Engagement Fundamentals&lt;/a>, which has equipped me with a better understanding of the concepts and strategies that I will need as a community manager. Additionally, I also underwent the Group Facilitation Methods training course led by the Institute of Cultural Affairs (ICA) where I learnt a couple of effective methods for group facilitation and leading workshops.&lt;/p>
&lt;p>Equipped with these skills, I have moderated a few community events already – most prominently &lt;a href="https://www.youtube.com/watch?v=b5VBUWz8KZY" target="_blank">the community call about Crossref and Retraction Watch&lt;/a> to discuss Crossref’s acquisition and opening up of the Retraction Watch database. It was a valuable experience to contribute to the planning of an online event and host a panel of distinguished guests.&lt;/p>
&lt;p>I was also fortunate to be able to meet our community members in-person: I supported the organisation of the Frankfurt roundtable event that was held as part of Crossref’s Integrity of the Scholarly Record (ISR) program, where we engaged with community members to get their perspectives on how to work together towards preserving the integrity of the scholarly record (keep watching this space for a forthcoming blog summarising the outcomes from this event!). Additionally, I attended the Frankfurt Book Fair – the experience of getting to meet our members and to hear from them first-hand about all things Crossref, was unparalleled! I used this opportunity to meet several of our publisher members and discuss their view points about engaging with editors on ISR. The idea was received positively: we heard specific suggestions of metadata that would be of interest to readers of scientific manuscripts, and our members also expressed interest in finding out more about how metadata can act as markers of trust for a research output. I plan to use the insights from these meetings for the development of the ISR editor engagement program.&lt;/p>
&lt;p>As I reflect on the past three months, there are a few things that have stood out to me. In terms of work, no two days are the same. My work plan for the day can range from making presentations for outreach activities, creating content such as this blogpost, working on an engagement strategy, to planning events, attending online or offline community meetings, facilitating or moderating some of those events, and networking with community members. This variety in work keeps me motivated to give my best each day. I am also grateful that I have the ability to make an impact with my work in an area that I am passionate about. In my previous job, I had developed a good understanding of research integrity and publication ethics. As a community manager now, I’m looking to work with editorial teams on the integrity of the scholarly record. This role gives me an opportunity to further nurture this interest of mine.&lt;/p>
&lt;p>At times, working from home remotely has been a challenge. However, I have enjoyed attending in-person events as they are not just a chance to meet our community members, but also a chance to meet my colleagues and connect with them.&lt;/p>
&lt;p>I feel privileged to be able to connect with research communities all over the world and make a meaningful contribution towards supporting the discoverability and impact of their work. I am particularly excited to work at the forefront of shaping the future of preserving the integrity of the scholarly record, in tandem with our community. If this is a topic that excites you as well, I am keen to hear from you. It has been a wonderful three months at Crossref so far and I look forward to future collaborations with our community to develop effective ways of supporting and empowering editors to make the most of metadata for their publications.&lt;/p></description></item><item><title>Joint statement on research data</title><link>https://www.crossref.org/blog/joint-statement-on-research-data/</link><pubDate>Tue, 28 Nov 2023 00:00:00 +0000</pubDate><author>Hylke Koers</author><guid>https://www.crossref.org/blog/joint-statement-on-research-data/</guid><description>&lt;p>
&lt;div style="float:right;margin:10px">
&lt;a href="https://www.crossref.org/pdfs/stm-research-data-infographic-FINAL-4.pdf" title="Download PDF">
&lt;img src="https://www.crossref.org/images/blog/2023/stm-research-data-infographic-FINAL-4.jpg" width="400"/>
&lt;/a>
&lt;/div>
&lt;/p>
&lt;p>STM, DataCite, and Crossref are pleased to announce an updated joint statement on research data.&lt;/p>
&lt;p>In 2012, DataCite and STM drafted an &lt;a href="https://www.stm-assoc.org/2012_06_14_STM_DataCite_Joint_Statement.pdf" target="_blank">initial joint statement&lt;/a> on the linkability and citability of research data. With nearly 10 million data citations tracked, thousands of repositories adopting data citation best practices, thousands of journals adopting data policies, data availability statements and establishing persistent links between articles and datasets, and the introduction of data policies by an increasing number of funders, there has been significant progress since. It now seems appropriate to focus on providing updated recommendations for the various stakeholders involved in research data sharing.&lt;/p>
&lt;p>The premise of the original joint statement still stands: most stakeholders across the spectrum of researchers, funders, librarians and publishers agree about the benefits of making research data available and findable for reuse by others. This improves utility and rigor of the scholarly record. Still, research data sharing is not yet a self-evident step in the research lifecycle. We now have sufficient scholarly communication infrastructure in place to bring about widespread change and believe momentum is building for collective action.&lt;/p>
&lt;p>It is in this context that DataCite, a global membership community working with over 2800 repositories around the world, and STM, whose membership consists of over 140 scientific, technical, and medical publishing organisations, are issuing this joint statement. Crossref, a nonprofit open infrastructure with over 18,000 institutional members from 150 countries, joins this call, recognising the need for an amplified focus on data citation. The aim of this statement is to accelerate adoption of best practices and policies, and encourage further development of critical policies in collaboration with a wide group of stakeholders.&lt;/p>
&lt;p>Signatories of this statement recommend the following as best practice in research data sharing:&lt;/p>
&lt;ol>
&lt;li>When publishing their results, researchers deposit related research data and outputs in a trustworthy data repository that assigns persistent identifiers (DOIs where available). Researchers link to research data using persistent identifiers.&lt;/li>
&lt;li>When using research data created by others, researchers provide attribution by citing the datasets in the reference section using persistent identifiers.&lt;/li>
&lt;li>Data repositories enable sharing of research outputs in a FAIR way, including support for metadata quality and completeness.&lt;/li>
&lt;li>Publishers set appropriate journal data policies, describing the way in which data is to be shared alongside the published article.&lt;/li>
&lt;li>Publishers set instructions for authors to include Data Citations with persistent identifiers in the references section of articles.&lt;/li>
&lt;li>Publishers include Data Citations and links to data in Data Availability Statements with persistent identifiers (DOIs where available) in the article metadata registered with Crossref.&lt;/li>
&lt;li>In addition to Data Citations, Data Availability Statements (human- and machine-readable) are included in published articles where appropriate.&lt;/li>
&lt;li>Repositories and publishers connect articles and datasets through persistent identifier connections in the metadata and reference lists.&lt;/li>
&lt;li>Funders and research organisations provide researchers with guidance on open science practices, track compliance with open science policies where possible, and promote and incentivize researchers to openly share, cite and link research data.&lt;/li>
&lt;li>Funders, policymaking institutions, publishers and research organisations collaborate towards aligning FAIR research data policies and guidelines.&lt;/li>
&lt;li>All stakeholders collaborate in the development of tools, processes, and incentives throughout the research cycle to enable sharing of high-quality research data, making all steps in the process clear, easy and efficient for researchers by providing support and guidance.&lt;/li>
&lt;li>Stakeholders responsible for research assessment take into account data sharing and data citation in their reward and recognition system structures.&lt;/li>
&lt;/ol>
&lt;p>We, the following signatories shall adopt and promote the relevant best practices laid out above. We hope that our action inspires the community, including researchers, research funders, research institutions, data repositories and publishers, to join us in making it easy for researchers to share, link and cite research data.&lt;/p>
&lt;p>&lt;a href="https://docs.google.com/forms/d/1Anc6EAs02YTveU3PqELJU2jnTK1R84hMWgB9kNX01p0/edit" target="_blank">Endorse the statement here.&lt;/a>&lt;/p></description></item><item><title>What was the talk of #Crossref2023?</title><link>https://www.crossref.org/blog/what-was-the-talk-of-crossref2023/</link><pubDate>Tue, 21 Nov 2023 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/what-was-the-talk-of-crossref2023/</guid><description>&lt;p>Have you attended any of our annual meeting sessions this year? Ah, yes – there were many in this conference-style event. I, as many of my colleagues, attended them all because it is so great to connect with our global community, and hear your thoughts on the developments at Crossref, and the stories you share.&lt;/p>
&lt;p>Let me offer some highlights from the event and a reflection on some emergent themes of the day. You can browse the recordings and slides archived on our &lt;a href="https://www.crossref.org/crossref-annual-meeting/">Annual Meeting page&lt;/a>.&lt;/p>
&lt;p>Ginny Hendricks opened the meeting by reminding everyone about the research nexus vision, and the work that’s underway to bring us closer to it. Ginny went on to highlight progress in metadata and relationships being registered by our members, and mentioned members that have particularly rich metadata records – with the special joint recognition for learned societies of South Korea. Participation statistics can be reviewed in our &lt;a href="https://member-metrics.fly.dev" target="_blank">Labs Member Metadata Metrics Tables&lt;/a>.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/scale-of-crossref-infrastructure2023.jpg"
alt="A slide showing The scale of Crossref infrastructure including the following information: &amp;gt;19,000 organisational members from 152 countries; &amp;gt;40% self identify as institution- or university-based; &amp;gt;150 million open metadata records with a DOI; 1.1 billion DOI resolutions every month; 000s (?) systems reusing metadata through search/API and 1.2 billion queries every month (up from 607mil in 2018); 150 Sponsor orgs; 50 Ambassadors; $1,150,000 on data storage and processing alone in 2024; 48 staff across 8 time zones and 11 countries" width="700">
&lt;/figure>
&lt;p>Since 2018 we’ve seen a 512% increase in the number of abstracts included in the metadata; with Wiley’s recent addition of millions of abstracts to their records largely contributing to this change. On the relationships side, in the same period, we’ve noted a staggering 3004% growth in preprint-to-article links, and we’re pleased to report a growing number of funding relationships being made available thanks to more and more funders registering Crossref DOIs for grants.&lt;/p>
&lt;p>For those who couldn’t join us at such an early hour, Ed Penz included some of these highlights in his own strategic update later in the day. However, he focused on our activity and plans towards fulfilling our four strategic goals:&lt;/p>
&lt;ul>
&lt;li>To contribute to an environment where the community identifies and co-creates solutions for broad benefit&lt;/li>
&lt;li>To be a sustainable source of complete, open, and global scholarly metadata and relationships&lt;/li>
&lt;li>To be publicly accountable to the Principles of Open Scholarly Infrastructure (POSI) practices of sustainability, insurance, and governance&lt;/li>
&lt;li>To foster a strong team—because reliable infrastructure needs committed people who contribute to and realise the vision, and thrive doing it&lt;/li>
&lt;/ul>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/crossref2023-strategic-update.jpg"
alt="A slide showing actions by Crossref split into Recently completed, In forcus, Up next, Under consideration – an excerpt from the crossref.org/strategy page" width="700">
&lt;/figure>
&lt;br>
Speakers from across our global community shared their initiatives too. Most of these talks have been accompanied by &lt;a href="https://community.crossref.org/c/crossref-events/crossref2023-presentations/44" target="_blank">posters or abstracts shared on our Community Forum&lt;/a> and still available for preview and discussion:&lt;/p>
&lt;ul>
&lt;li>Making data citations available at scale: The Global Open Data Citation Corpus by Iratxe Puebla;&lt;/li>
&lt;li>“Who Cares?” Defining Citation Style in Scholarly Journals by Vincas Grigas and Pavla Vizváry;&lt;/li>
&lt;li>DOI registration for scholarly blogs by Martin Fenner;&lt;/li>
&lt;li>Enhancing Research Connections through Metadata: A Case Study with AGU and CHORUS by Tara Packer, Kristina Vrouwenvelder, Shelley Stall;&lt;/li>
&lt;li>Index Crossref, Integrity, Professional And Institutional Development by Engjellushe Zenelaj;&lt;/li>
&lt;li>Brazilian retractions in the Retraction Watch Database - RWDB by Edilson Damasio; and&lt;/li>
&lt;li>Now that you’ve published, what do you do with Metadata? - by Joann Fogleson.&lt;/li>
&lt;/ul>
&lt;p>In addition to these updates, we’ve heard from:&lt;/p>
&lt;ul>
&lt;li>Izabela Szyprowska (OP, European Commission), Nikolaos Mitrakis (RTD, European Commission), and Paola Mazzucchi (mEDRA) talked about the process and rationale of implementing Crossref DOIs for grants at the European Commission; and&lt;/li>
&lt;li>Amanda French from ROR/Crossref about the new ‘ROR / Open Funder Registry overlap’ tool.&lt;/li>
&lt;/ul>
&lt;p>We also assembled a diverse panel and invited the community to discuss &lt;strong>“What we still need to build a robust Research Nexus?”&lt;/strong> The discussion ranged from how different parts of our community currently use existing metadata, to how we can come together to make improvements, especially in the area of standards and equitability, and touched on metadata priorities. I’ll highlight some of the threads below, but it’s certainly worth engaging with the &lt;a href="https://www.youtube.com/watch?si=7Nzk9wUQMJMdxffY&amp;amp;v=d_u-Ad9-H64&amp;amp;feature=youtu.be" target="_blank">full recording of the discussion&lt;/a>, and offering your own perspective on the Community Forum, commenting below.&lt;/p>
&lt;p>Having participated in the whole day of talks, I found that a few themes emerged as popular in the community: data citations, making it easier to register metadata, making better use of metadata, retractions, and equity of participation in the research nexus.&lt;/p>
&lt;h2 id="data-citations">Data citations&lt;/h2>
&lt;p>With the advances in the Crossref API relationships endpoint, Martyn Rittman demonstrated how we’re now providing more comprehensive support for data citations. You can follow his demonstration in the &lt;a href="https://community.crossref.org/t/relationships-endpoint-update-and-event-data-api-sunsetting/4214" target="_blank">Collab Notebook&lt;/a> he used for the demo and shared for your perusal. He also mentioned that the developments in this feature of our API will soon replace the current service provided via the Events API. Feel free to connect with Martin on the community forum and comment with questions and suggestions.&lt;/p>
&lt;p>As mentioned above, DataCite’s Iratxe Puebla mentioned the Make Data Count initiative and the leaky pipeline of data citations we’ve got at the moment in the scholarly literature, obscuring the true picture of data reuse. This prevents the community from recognising and incentivising data creation and reuse appropriately. One way of addressing this is the Global Open Data Citation Corpus. Crossref and DataCite collaborate closely in connecting and making that data available.&lt;/p>
&lt;p>Linking datasets, as well as software, was reported as part of the AGU and CHORUS initiative in Enhancing Research Connections through Metadata.&lt;/p>
&lt;p>Data sharing and citing is as much a culture as a technology problem. As Iratxe Puebla admitted, there are many norms and processes for capturing and sharing that information,and DataCite is interested to hear about different use cases. As highlighting data’s relationship with works is a growing interest for our community, hopefully more understanding and perhaps even commonality can be built soon.&lt;/p>
&lt;h2 id="making-it-easier-to-register-metadata">Making it easier to register metadata&lt;/h2>
&lt;p>As part of the Demonstrations session, we’ve seen two developments to support members with registering their metadata more easily.&lt;/p>
&lt;p>Crossref’s Lena Stoll shared plans for the new version of the Crossref Registration Form, the helper tool for manual registration of metadata, which translates the submission into XML, for inclusion in the Crossref database. At the moment, the form only accepts grant registrations, but it will be bolstered before the end of the year to include journal articles then other record types in time.&lt;/p>
&lt;p>Erik Hanson from PKP demonstrated the latest OJS version, commenting on specific changes made in the new version in response to the key pain points reported by users of the previous release.&lt;/p>
&lt;p>In addition, we’ve heard of two independent projects by Martin Fenner and Esha Data to enable metadata registration and Crossref DOIs for scholarly blogs.&lt;/p>
&lt;h2 id="making-better-use-of-metadata">Making better use of metadata&lt;/h2>
&lt;p>Supported by the beginner’s demo of our REST API by Luis Montilla, there were many voices about opportunities for making good use of Crossref’s open metadata.
Nikolaos Mitrakis of the European Commission talked about the implementation of Crossref IDs for grants as a step towards tracing and connecting the grants with not just academic but also societal outcomes of the awards, and the plans for using those in the evaluation and steering of their funding programmes.&lt;/p>
&lt;p>Joann Fogleson of the American Society of Civil Engineers gave a buzzy metaphor of publishers’ role in their work with metadata being comparable with that of a pollinator – collecting the metadata at one end, then registering, displaying and making it available to different services, in order to enable a reacher scholarly environment for discovery.&lt;/p>
&lt;p>Many of the major themes have found their way to the discussion of what is still needed to build a robust network of connections between scholarly objects, institutions and individuals. One of the ways Ludo Waltman of CWTS, Leiden University, intends to use our open metadata is as part of the upcoming open-source version of the Laiden rankings and he invited the community to contribute and help optimise this project to provide an alternative to closed and selective databases.&lt;/p>
&lt;p>Panellists also spoke of new opportunities in the light of data mining and machine learning. Ran Dang, Atlantis Press, as a publisher shared a concern about the standard of metadata across cultures and disciplines, and the need to digitise past publications – which can then help better leverage multi-lingual scholarship. Matt Buys of DataCite, pointed out to the Global Data Citation Corpus they are developing, which leverages a SciBERT model to pull out data citations, which is brought together with Crossref/DataCite citation metadata.&lt;/p>
&lt;p>Opening the data is essential to enabling its wider use, and here Ludo gave the example of the fantastic outcome for references metadata, which has been made open by default for the entire corpus of Crossref-registred works. He hopes that this can inspire us to make similar progress in other areas.&lt;/p>
&lt;p>A little on a tangent with regards to metadata use, yet speaking of excellent examples of the community making progress together, Ginny pointed out &lt;a href="http://ror.org" target="_blank">ROR&lt;/a>, how this is becoming a new standard for solving a longstanding problem of standardising affiliations metadata.&lt;/p>
&lt;h2 id="retractions">Retractions&lt;/h2>
&lt;p>Perhaps not entirely surprising, given the recent &lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">acquisition of the Retraction Watch database by Crossref&lt;/a> and making the data openly available, retractions featured in a few different talks at the meeting. First, Lena Stoll and Martin Eve from Crossref, shared how that data can be accessed – that is as the csv file from &lt;a href="https://api.labs.crossref.org/data/retractionwatch?ginny@crossref.org" target="_blank">https://api.labs.crossref.org/data/retractionwatch?[your-email@here]&lt;/a>(add your email as indicated), and the Crossref Labs API also displays information about retractions in the /works/ route when metadata is available. There are plans for incorporating this information with our REST API in the future.&lt;/p>
&lt;p>Ed and Ginny have shown stats for increases in retraction metadata registered in Crossmark but commented on limited participation in Crossmark overall. Recording retraction information in this way is still important, alongside the Retraction Watch data, this allows for multiple assertions of that information, and increases confidence in its accuracy. We’re preparing to consult with the community at large about the future direction of the Crossmark service, to make it easier to implement and more useful for the readers.&lt;/p>
&lt;p>Finally, Edilson Damasio from State University of Maringá-UEM, Brazil, and a long-time Crossref Ambassador, presented the analysis of Brazilian records in the Retraction Watch data, and he promises further analysis to come, comparing the situation across geographies.&lt;/p>
&lt;h2 id="equity-of-participation-in-the-research-nexus">Equity of participation in the research nexus&lt;/h2>
&lt;p>Amanda Bartell opened the research nexus discussion with a reminder of what that vision entails and pointing out commonality of goals in the community – “Like others, Crossref has a vision of a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society. We call this interconnected network the Research Nexus, but others in the community have different names for it, such as knowledge graph or PID graph.”&lt;/p>
&lt;p>The richness of this network depends upon the participation of all those who produce and publish scholarship, so naturally the topic of equality emerged in that discussion. In addition to Ran Dang’s concern for multilingualism and digitisation of past publications from all parts of the world, Mercury Shitindo of St Paul&amp;rsquo;s University, Kenya talked of the need for more education, training and accessible resources for her community, to be able to participate more effectively in this ecosystem. She can see that affiliations and citations are of priority there, as these enable transparency and facilitate collaborations. Matt Buys of DataCite echoed her point, talking about the importance of the role of contributors “It&amp;rsquo;s important not to lose sight of people and places – to recognise the importance of contributor roles in the PID-graph”.&lt;/p>
&lt;p>Earlier in the day, we mentioned the launch of our Global Equitable Membership, or GEM programme. Since January, 110 new organisations from eligible countries have joined Crossref fee-free. Ginny was quick to admit that the need for a fee-waiver programme like this stems from the regular fees schedule not being in tune with our global membership, and she mentioned the upcoming fees review.&lt;/p>
&lt;p>Financial barriers are often what get attention, yet reducing barriers to participation with technology is equally important for building a robust research nexus. With the planned changes to our registration form, we’ll make it easier to register works for those who don’t regularly use XML.&lt;/p>
&lt;p>Johanssen Obanda took time to show the examples of community activity and events organised by our global network of Ambassadors, and to thank all our advocates and partners for their tireless work. They are also helping tackle barriers, supporting our members to actively participate in the research nexus with their metadata, and help enable the community to make good use of the network of relationships that data denotes.&lt;/p>
&lt;p>Showcasing our “One member one vote” truth, the Board election was the focal point of the annual meeting, as always. We closed the ballot and announced the results, with seven members selected to join the Board in 2024.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/crossref2023-board-elections-result.jpg"
alt="A slide showing the members elected to the Board and their representatives: In Tier 1: Beilstein-Institut, Wendy Patterson; Korean Council of Science Editors, Kihong Kim; OpenEdition, Marin Dacos; Universidad Autónoma de Chile, Dr. Ivan Suazo; Vilnius University, Vincas Grigas; Tier 2: Oxford University Press, James Phillpotts; University of Chicago Press, Ashley Towne" width="700">
&lt;/figure>
&lt;p>The event went very smoothly overall. Talks were delivered efficiently, the panellists shared diverse perspectives and we elected our new Board members. Huge thanks to Rosa Clark, our Communications and Events Manager, who orchestrated the event and has been a constant behind-the-scenes presence supervising the entire show. I’m grateful to all colleagues at Crossref, who helped make it an enjoyable experience and an informative event for our community. Finally – it wouldn’t be a real meeting without the active participation of the speakers and panellists, who shared their metadata stories, and even joined us for some relaxed unplugged chats.&lt;/p></description></item><item><title>Perspectives: Luis Montilla on making science fiction concepts a reality in the scholarly ecosystem</title><link>https://www.crossref.org/blog/perspectives-luis-montilla-sci-fi-concepts-reality-scholarly-ecosystem/</link><pubDate>Mon, 20 Nov 2023 00:00:00 +0000</pubDate><author>Luis Montilla</author><guid>https://www.crossref.org/blog/perspectives-luis-montilla-sci-fi-concepts-reality-scholarly-ecosystem/</guid><description>&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2022/perspectives.png" alt="sound bar logo" width="150px" class="img-responsive" />
&lt;/div>
&lt;br>
&lt;p>Hello, readers! My name is Luis, and I&amp;rsquo;ve recently started a new role as the Technical Community Manager at Crossref, where I aim to bridge the gap between some of our services and our community awareness to enhance the Research Nexus. I&amp;rsquo;m excited to share my thoughts with you.&lt;/p>
&lt;p>My journey from research to science communications infrastructure has been a gradual transition. As a Masters student in Biological Sciences, I often felt curious about the behind-the-scenes after a paper is submitted and published. For example, the fate of data being stored in the drawer or copied and forgotten in the hard drive after the paper is online. I come from a university that shares its name with at least &lt;a href="https://en.wikipedia.org/wiki/Sim%C3%B3n_Bol%C3%ADvar_University" target="_blank">three completely different universities&lt;/a> in Latin America, and that also is pretty similar to another one with multiple offices across the &lt;a href="https://es.wikipedia.org/wiki/Universidad_Andina_Sim%C3%B3n_Bol%C3%ADvar" target="_blank">region&lt;/a>, which made me wonder if there was a standard way of identifying our affiliations. And then we have the topic of our names in hispanoamerica. We use two family names, and more often than not, we have a middle name (and then I could tell you stories about multiple-word middle names), which inevitably leads to authors having many combinations of full names and hyphenations.&lt;/p>
&lt;p>This curiosity led me to volunteer in the Journal of the Venezuelan Society of Ecology. This role has been a transformative experience because my goal was to learn more about the publishing aspect of science. Still, today I realize that this is a fraction of what the scholarly ecosystem represents. The experience allowed me to grasp the importance of having a community with a sense of belonging, the relevance of multilingualism, and the importance of having access to an open infrastructure that allows smaller communities to be participants in the global dynamics. Moreover, it seemed to me that a research paper is more than the capstone of a building that we place and then move on to the next project or the next experiment; instead, it is a node in the vast network of human knowledge, connected to other papers through references, but also to all the other elements that are produced as part of the research, namely datasets, protocols, code, presentations, posters, preprints, peer-review reports and more. In short, the research &lt;em>metadata&lt;/em> extends the life of the research output and makes it visible to the rest of the community.&lt;/p>
&lt;p>This brings us to my onboarding to the Crossref team. At Crossref, I became part of a team and a driving force whose idea of the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a> &lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup> aligns perfectly with my aspirations. And to explain myself better, I&amp;rsquo;ll draw an analogy using one of my favorite authors. In Isaac Asimov&amp;rsquo;s Second Foundation, a character shows to another a wall covered to the last millimeter with equations and writings. He describes his contribution to &amp;ldquo;The Plan&amp;rdquo; as follows: &amp;ldquo;&amp;hellip;Every red mark you see on the wall is the contribution of a man among us who lived since Seldon&amp;rdquo;.&lt;sup id="fnref:2">&lt;a href="#fn:2" class="footnote-ref" role="doc-noteref">2&lt;/a>&lt;/sup> This idea sounded fascinating to me and only possible in a sci-fi book; a massive integrated research ecosystem where scientists focused more on how their contributions fit in the big picture. Today I have come to think that metadata helps materialize this idea by interconnecting all knowledge, and more importantly, in stark contrast to Asimov&amp;rsquo;s plan developed and guarded by a secret society, Crossref&amp;rsquo;s research nexus is a &amp;ldquo;reusable open network,&amp;rdquo; &amp;ldquo;a scholarly record that the global community can build on forever.&amp;rdquo; In a world with undeniably unequal access to resources, providing open access and fostering community efforts to contribute to this growing collective effort is a fundamental condition to empower and visualize underrepresented voices.&lt;/p>
&lt;p>We make available a series of tools to access and probe this data, including our REST API, but we know its potential is far from being realized. As Technical Community Manager at Crossref, my primary responsibility is to understand the needs of our community members who interact with our REST API. I aim to build and maintain relationships with new and existing metadata users to promote the effective usage of our API. I will also be working closely with organisations such as hosting platforms, manuscript submission systems, and general publisher services. In essence, I want to ensure that our community across the globe is aware of the vast possibilities that imply using and contributing to the Research Nexus.&lt;/p>
&lt;p>I am committed to fostering an engaged and collaborative technical community. As we move forward, I look forward to sharing insights, experiences, and knowledge with all of you. Stay tuned for more updates, and let&amp;rsquo;s explore the world of APIs, metadata, and scholarly communities together!&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>&lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">Crossref (2021) The research nexus. Accessed on 20 October 2023.&lt;/a>&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:2">
&lt;p>Asimov, I. (1953) Second Foundation. Gnome Press.&amp;#160;&lt;a href="#fnref:2" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Similarity check update: A new similarity report and AI writing detection tool soon to be available to iThenticate v2 users</title><link>https://www.crossref.org/blog/similarity-check-update-a-new-similarity-report-and-ai-writing-detection-tool-soon-to-be-available-to-ithenticate-v2-users/</link><pubDate>Wed, 01 Nov 2023 00:00:00 +0000</pubDate><author>Fabienne Michaud</author><guid>https://www.crossref.org/blog/similarity-check-update-a-new-similarity-report-and-ai-writing-detection-tool-soon-to-be-available-to-ithenticate-v2-users/</guid><description>&lt;p>In May, we updated you on the latest &lt;a href="https://doi.org/10.64000/76zr9-ewc27" target="_blank">changes and improvements&lt;/a> to the new version of iThenticate and let you know that a new similarity report and AI writing detection tool were on the horizon.&lt;/p>
&lt;p>&lt;strong>On Wednesday 1 November 2023&lt;/strong>, Turnitin (who produce iThenticate) will be releasing a brand new similarity report and a free preview to their AI writing detection tool in iThenticate v2. The AI writing detection tool will be enabled by default and account administrators will be able to switch it off/on.&lt;/p>
&lt;p>Turnitin will be running a webinar on their new similarity report and AI writing detection tool on &lt;del>Tuesday 28 November&lt;/del> (EDIT 23/11/16: Monday 11 December 2023). More information on the webinar and how to register will be communicated by Turnitin in the coming weeks.&lt;/p>
&lt;h2 id="new-similarity-report">New similarity report&lt;/h2>
&lt;p>On Wednesday, all iThenticate v2 users will have access to the new version of the similarity report which will include:&lt;/p>
&lt;ul>
&lt;li>a word count and the number of text blocks for each matched source&lt;/li>
&lt;li>the ability to include or exclude overlapping sources from the overall similarity score&lt;/li>
&lt;li>a clearer colour differentiation between the different sources&lt;/li>
&lt;li>improved accessibility features&lt;/li>
&lt;/ul>
&lt;h3 id="enabling-the-new-similarity-report">Enabling the new similarity report&lt;/h3>
&lt;p>The new similarity report will be enabled as a default for all your journals. Account administrators wishing to switch off the new similarity report can do so by going to &lt;strong>Settings&lt;/strong> and selecting from the &lt;strong>General&lt;/strong> tab, under the &lt;strong>New Similarity Report Experience&lt;/strong> heading, the &lt;strong>Disable&lt;/strong> option.&lt;/p>
&lt;h3 id="classic-view--new-view">Classic view / new view&lt;/h3>
&lt;p>As this will be a significant change to your current experience, Turnitin have provided access for a period of time to the ‘classic view’ and you will be able to toggle between the original interface and the new one by clicking on ‘Switch to the classic view’ or ‘Switch to the new view’ buttons at the top of your report.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/switch-view.png"
alt="image with text" width="500">
&lt;/figure>
&lt;p>The similarity score will continue to be available at the top right-hand corner of the similarity report.&lt;/p>
&lt;h3 id="exclusions">Exclusions&lt;/h3>
&lt;p>By clicking on the &lt;strong>Filters&lt;/strong> button you’ll be able to check and/or adjust your report’s section and repository exclusions.&lt;/p>
&lt;p>Please note that the exclusions previously set up by account administrators should be unchanged by this release.&lt;/p>
&lt;h3 id="sources--match-groups-view">Sources / Match Groups view&lt;/h3>
&lt;p>The &lt;strong>Sources&lt;/strong> view will be the default view and will list all sources. By using the on/off button next to ‘Show overlapping sources’, you’ll be able to include or exclude overlapping sources. This will be ‘off’ as a default.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/overall-similarity.png"
alt="image with text 53% overal similarity" width="350">
&lt;/figure>
&lt;p>The &lt;a href="https://tiibetahelpcenter.zendesk.com/hc/en-us/articles/17717056347277-How-do-the-Match-Groups-work-" target="_blank">Match Groups&lt;/a> view is completely new and may not suit everyone’s needs. It is divided into four categories ‘Not Cited or Quoted’, ‘Missing Quotations’, ‘Missing Citation’ and ‘Cited and Quoted’ and will highlight matches found in your text.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/overall-similarity2.png"
alt="image with text 33% overal similarity" width="350">
&lt;/figure>
&lt;h4 id="pdf-report">PDF report&lt;/h4>
&lt;p>You’ll also now find the PDF report in the top right-hand corner of the similarity report, by clicking on the ‘download’ icon.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/classic-download.png"
alt="image with text switch to the classic view and dropdown options" width="350">
&lt;/figure>
&lt;h3 id="submission-details">Submission details&lt;/h3>
&lt;p>‘Submission Details’ is located now under the ‘i’ icon in the top right-hand corner of your report. This is where you will find the oid (or unique number) for your manuscript which Turnitin will ask you to provide &lt;a href="https://www.crossref.org/documentation/similarity-check/ithenticate-account-use/help/" target="_blank">when you are reporting a technical issue&lt;/a>.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/submission-detail.png"
alt="image with text submission details" width="350">
&lt;/figure>
&lt;p>&lt;a href="https://help.turnitin.com/originality-check/new-similarity-report/new-report-experience.htm" target="_blank">Turnitin’s documentation for the new similarity report&lt;/a>&lt;/p>
&lt;h2 id="ai-writing-detection-tool">AI writing detection tool&lt;/h2>
&lt;p>Many of you have been concerned about the use of AI writing in the research papers you’ve received since the launch of ChatGPT last November and have been in touch to enquire about the availability of an AI writing detection tool for Crossref members.&lt;/p>
&lt;p>You will also have read that Turnitin have developed an AI writing detector tool and have made it available to their education sector customers since April. Turnitin have published &lt;a href="https://www.turnitin.com/blog/ai-writing-detection-update-from-turnitins-chief-product-officer" target="_blank">an update&lt;/a> in May, a &lt;a href="https://www.youtube.com/watch?v=zy-aH1pcbKk" target="_blank">helpful video&lt;/a> and &lt;a href="https://www.turnitin.com/blog/understanding-the-false-positive-rate-for-sentences-of-our-ai-writing-detection-capability" target="_blank">further information on the false positive rates&lt;/a> in June based on the feedback they’ve received from the education community.&lt;/p>
&lt;p>I am pleased to announce that Turnitin’s AI writing detection tool will be available &lt;strong>as a free preview&lt;/strong> to iThenticate v2 users, via the new version of the similarity report, &lt;strong>from Wednesday 1 November until the end of December 2023.&lt;/strong>&lt;/p>
&lt;h3 id="enabling-ai-writing-detection">Enabling AI writing detection&lt;/h3>
&lt;p>Our preference was to have the new AI writing detection tool turned ‘off’ as a default, however this hasn’t been possible. Account administrators can turn this feature off by going to &lt;strong>Settings&lt;/strong> and selecting the &lt;strong>Crossref Web&lt;/strong> tab and scrolling down to the &lt;strong>AI Writing&lt;/strong> section at the very bottom of the page. The feature is applied to all submissions when it is enabled.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/ai-writing.png"
alt="image with text AI writing with an X overlay" width="350">
&lt;/figure>
&lt;p>Please note that AI Writing detection is only available in the new similarity report.&lt;/p>
&lt;h3 id="integrations">Integrations&lt;/h3>
&lt;p>There is currently no integration between manuscript tracking systems and the AI writing detection tool. However the AI score will be available via the similarity report. If the AI writing detection tool has been set as ‘off’ by the account administrator, there will be no score and the ‘AI Writing’ heading will not be visible on the similarity report:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/ai-writing-small.png"
alt="image with text" width="350">
&lt;/figure>
&lt;h3 id="file-requirements">File requirements&lt;/h3>
&lt;p>Turnitin have made some important file requirements available for the tool to run a report:&lt;/p>
&lt;ul>
&lt;li>Must be written &lt;strong>in English&lt;/strong>&lt;/li>
&lt;li>A minimum 300 words&lt;/li>
&lt;li>A maximum of 15,000 words&lt;/li>
&lt;li>The file size must be less than 100 MB&lt;/li>
&lt;li>Accepted file types are .docx, .pdf, .rtf and .txt&lt;/li>
&lt;/ul>
&lt;p>If your file does not meet the above requirements, iThenticate v2 will display the following message:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/ai-writing-big.png"
alt="image with text AI writing" width="350">
&lt;/figure>
&lt;p>Turnitin’s AI writing detection tool has been developed to detect GPT 3, 3.5, 4 and other variants. More information on this is available on their &lt;a href="https://www.turnitin.com/products/features/ai-writing-detection/faq" target="_blank">FAQs page&lt;/a>.&lt;/p>
&lt;p>Turnitin have provided the following guidance regarding the AI scores:&lt;/p>
&lt;p>&lt;em>&amp;ldquo;Blue with a percentage between 0 and 100: The submission has processed successfully. The displayed percentage indicates the amount of qualifying text within the submission that Turnitin’s AI writing detection model determines was generated by AI. As noted previously, this percentage is not necessarily the percentage of the entire submission. If text within the submission was not considered long-form prose text, it will not be included.&lt;/em>&lt;/p>
&lt;p>&lt;em>Our testing has found that there is a higher incidence of false positives when the percentage is between 1 and 20. In order to reduce the likelihood of misinterpretation, the AI indicator will display an asterisk (*) for percentages between 1 and 20 to call attention to the fact that the score is less reliable.&lt;/em>&lt;/p>
&lt;p>&lt;em>To explore the results of the AI writing detection capabilities, select the indicator to open the AI writing report. The AI writing report opens in a new tab of the window used to launch the Similarity Report. If you have a pop-up blocker installed, ensure it allows Turnitin pop-ups.&amp;rdquo;&lt;/em>&lt;/p>
&lt;p>Please note that unlike the similarity report, the AI writing report will only provide a score and highlight the blocks of texts likely to have been written by an AI tool and will not list source matches.&lt;/p>
&lt;p>We encourage you to test the writing detection tool as much as possible during the free preview period (1 November-31 December 2023).&lt;/p>
&lt;h2 id="next">Next&lt;/h2>
&lt;h3 id="paraphrase-detection">Paraphrase detection&lt;/h3>
&lt;p>&lt;del>Turnitin are planning to release a beta version of their new paraphrase detection tool at the end of this year/Q1, 2024. It will be initially available as a free preview for a short period of time.&lt;/del> (EDIT 23/11/16: There is currently no timeline available for Turnitin&amp;rsquo;s paraphrase detection tool which is having a knock-on effect on the availiblity of the AI writing and paraphrase detection bundle and associated fees previously mentioned in this post)&lt;/p>
&lt;h3 id="ai-and-paraphrase-detection-bundle-edit-231116-ai-writing-detection-tool">&lt;del>AI and paraphrase detection bundle&lt;/del> (EDIT 23/11/16: AI writing detection tool)&lt;/h3>
&lt;p>Once the free preview period ends, Turnitin &lt;del>would like to offer Crossref members an AI and paraphrase detection bundle&lt;/del> (EDIT 23/11/16: are planning to make their AI writing detection tool available) from 2024 - this means that if you choose to subscribe to this new service, you will be charged an additional fee each time you upload a manuscript.&lt;/p>
&lt;h3 id="fixes">Fixes&lt;/h3>
&lt;p>Many of you have been waiting for fixes to the aggregation of URLs issues in the matched sources of the similarity report and to the doc-to-doc PDF report in iThenticate v2. Turnitin are planning to release fixes for these before the end of 2023.&lt;/p>
&lt;p>✏️ Do get in touch via &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> if you have any questions about iThenticate v1 or v2 or start a discussion by commenting on this post below or in &lt;a href="https://community.crossref.org/" target="_blank">our Community Forum&lt;/a>.&lt;/p></description></item><item><title>Perspectives: Audrey Kenni-Nemaleu on scholarly communications in Cameroon</title><link>https://www.crossref.org/blog/perspectives-audrey-kenni-nemaleu-on-scholarly-communications-in-cameroon/</link><pubDate>Thu, 05 Oct 2023 00:00:00 +0000</pubDate><author>Audrey Kenni-Nemaleu</author><guid>https://www.crossref.org/blog/perspectives-audrey-kenni-nemaleu-on-scholarly-communications-in-cameroon/</guid><description>&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2022/perspectives.png" alt="sound bar logo" width="150px" class="img-responsive" />
&lt;/div>
&lt;p>Our Perspectives blog series highlights different members of our diverse, global community at Crossref. We learn more about their lives and how they came to know and work with us, and we hear insights about the scholarly research landscape in their country, the challenges they face, and their plans for the future.&lt;/p>
&lt;p>Notre série de blogs Perspectives met en lumière différents membres de la communauté internationale de Crossref. Nous en apprenons davantage sur leur vie et sur la manière dont ils ont appris à nous connaître et à travailler avec nous, et nous entendons parler du paysage de la recherche universitaire dans leur pays, des défis auxquels ils sont confrontés et de leurs projets pour l&amp;rsquo;avenir.&lt;/p>
&lt;p>Today, we meet Audrey Kenni-Nemaleu, Crossref Ambassador in Cameroon and Assistant Editor of the Pan-African Medical Journal (PAMJ). Audrey is excited about engaging Crossref&amp;rsquo;s community in French West Africa. Please take a moment to read and listen to Audrey&amp;rsquo;s perspective.&lt;/p>
&lt;p>Aujourd&amp;rsquo;hui, nous rencontrons Audrey Kenni-Nemaleu, ambassadrice Crossref au Cameroun et rédactrice adjointe du Pan-African Medical Journal (PAMJ). Audrey est enthousiaste à l&amp;rsquo;idée d&amp;rsquo;impliquer la communauté Crossref en Afrique occidentale française. Veuillez prendre un moment pour lire et écouter le point de vue d&amp;rsquo;Audrey.&lt;/p>
&lt;div class='shortcode-row '> &lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>&lt;strong>&lt;p align="center">English&lt;/p>&lt;/strong>&lt;/p>
&lt;div style="padding:133.33% 0 0 0;position:relative;">&lt;iframe src="https://player.vimeo.com/video/895601167?badge=0&amp;amp;autopause=0&amp;amp;player_id=0&amp;amp;app_id=58479" frameborder="0" allow="autoplay; fullscreen; picture-in-picture" style="position:absolute;top:0;left:0;width:100%;height:100%;" title="Audrey's Perspective - Crossref Ambassador - English">&lt;/iframe>&lt;/div>&lt;script src="https://player.vimeo.com/api/player.js">&lt;/script>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>&lt;strong>&lt;p align="center">Français&lt;/p>&lt;/strong>&lt;/p>
&lt;div style="padding:133.33% 0 0 0;position:relative;">&lt;iframe src="https://player.vimeo.com/video/895601175?badge=0&amp;amp;autopause=0&amp;amp;player_id=0&amp;amp;app_id=58479" frameborder="0" allow="autoplay; fullscreen; picture-in-picture" style="position:absolute;top:0;left:0;width:100%;height:100%;" title="Audrey's Perspective - Crossref Ambassador - French">&lt;/iframe>&lt;/div>&lt;script src="https://player.vimeo.com/api/player.js">&lt;/script>
&lt;/div>&lt;/div>
&lt;p>&lt;br>&lt;br>&lt;/p>
&lt;blockquote>
&lt;p>Tell us a bit about your organisation, your objectives, and your role&lt;br>
Pouvez-vous nous parler de votre organisation, vos objectifs et votre rôle ?&lt;/p>
&lt;/blockquote>
&lt;p>My name is Audrey Kenni Nganmeni-Nemaleu, assistant editor for &lt;a href="https://www.panafrican-med-journal.com/" target="_blank">the Pan-African Medical Journal&lt;/a>. I am specifically responsible for editing the articles in terms of form, ensuring that they meet the journal&amp;rsquo;s standards. Furthermore, I am the focal point of my journal for Crossref, that is to say I am responsible for managing all the problems that all publishers may encounter with DOIs and the various Crossref services to which our journal has subscribed. My role is also to manage all the conflicts that we may encounter with the DOIs submitted to Crossref. I train our journal staff in using Crossref services. I am also the focal point of my journal for COPE (Committee of Publications Ethics) which is an organisation that helps to regulate ethical publishing practices. It is in this capacity that I participate COPE&amp;rsquo;s webinars on behalf of our journal.&lt;/p>
&lt;p>Je m’appelle Audrey Kenni Nganmeni Nemaleu, éditrice assistante pour le Pan African Medical Journal. Je m’occupe précisément de traiter les articles sur le plan de la forme en m’assurant qu’ils respectent les normes du journal. Par ailleurs je suis point focal de mon journal pour Crossref c’est-à-dire je suis chargée de gérer tous les problèmes que l’ensemble des éditeurs peuvent rencontrer avec les DOIs et les différents services de Crossref auxquels notre journal a souscrit. Mon rôle également c’est de gérer tous les conflits qu’on peut rencontrer avec les DOIs soumis à Crossref. Je forme également le personnel de notre journal à l’utilisation des services de Crossref. Je suis aussi point focal de mon journal pour COPE (Committee of Publications ethics) qui est un organisme qui aide dans la régulation des pratiques éthiques en matière de publication. C’est dans ce cadre que je participe à tous les webinaires de cette organisation afin qu’il y ait toujours au moins une personne qui participe à ces webinaires pour le compte de notre journal.&lt;/p>
&lt;blockquote>
&lt;p>What is one thing that others should know about your country and its research activity?&lt;br>
Que doivent savoir les autres sur les activités de recherche dans votre pays ?&lt;/p>
&lt;/blockquote>
&lt;p>In my country, Cameroon, the research activity is still young. There are few scientific journals and we are actually the most influential journal in our country and subregion. There are also few schools or institutions that focus especially on research. For the time being, research activities in my country mainly revolve around congresses and conferences where researchers can exhibit their works. There is very little support for scientific research in my country.&lt;/p>
&lt;p>Dans mon pays, le Cameroun, la recherche scientifique est encore jeune. Il existe peu de revues scientifiques et nous sommes en fait le journal le plus influent de notre pays et de notre sous-région. Il existe également peu d&amp;rsquo;écoles ou d&amp;rsquo;nstitutions qui spécialisées sur la recherche. Pour l&amp;rsquo;instant, les activités de recherche dans mon pays s&amp;rsquo;articulent principalement autour de congrès
et de conférences où les chercheurs peuvent exposer leurs travaux. Il y a très peu de soutien à la recherche scientifique dans mon pays.&lt;/p>
&lt;blockquote>
&lt;p>Are there trends in scholarly communications that are unique to your part of the world?&lt;br>
Existe-t-il des tendances particulières en matière de recherche scientifique dans votre région ?&lt;/p>
&lt;/blockquote>
&lt;p>In this part of the world, we do our best to follow the code of ethics of the various organisations in which we are a member: Committee of publication ethics (COPE), World Association of Medical Editors (WAME), Open Access Scholarly Publishing Association (OASPA). What we have seen emerging recently is the organisation, by professional scientific societies, of small conferences, workshops and meetings to exchange information. These small events are less costly to organize, hence their gain in popularity. We support these activities through sponsorship, and use them as opportunities to strengthen young researchers&amp;rsquo; capacities in areas such as scientific writing, publication ethics. We also use those opportunities to introduce to young researchers concepts such as Open Access, Open Science, DOIs and other modern publishing services.&lt;/p>
&lt;p>Dans notre pays, nous nous efforçons de suivre le code de déontologie des différentes organisations dont nous sommes membre : Committee of publication ethics (COPE), World Association of Medical Editors (WAME), Open Access Scholarly Publishing Association (OASPA). Ce que l&amp;rsquo;on a vu émerger récemment, c&amp;rsquo;est l&amp;rsquo;organisation, par des sociétés scientifiques professionnelles, de petits colloques, ateliers et réunions d&amp;rsquo;échange d&amp;rsquo;informations. Ces petits événements sont moins coûteux à organiser, d&amp;rsquo;où leur gain en popularité. Nous soutenons ces activités par le sponsoring et les utilisons comme des opportunités pour renforcer les capacités des jeunes chercheurs dans des domaines tels que l&amp;rsquo;écriture scientifique, l&amp;rsquo;éthique de la publication. Nous utilisons également ces opportunités pour leur présenter des concepts tels que le libre accès, la science ouverte, les DOIs et d&amp;rsquo;autres services d&amp;rsquo;édition modernes.&lt;/p>
&lt;blockquote>
&lt;p>What about any political policies, challenges, or mandates that you have to consider in your work?&lt;br>
Quels sont les politiques, défis ou mandats auxquels vous faites face dans votre travail ?&lt;/p>
&lt;/blockquote>
&lt;p>Operating a journal in our context is challenging. The critical challenges are as basic as constant availability of electricity or stable and fast internet connectivity. How to maintain a stable stream revenue to support the journal is also a critical challenge. Most of our authors are young, self-funded and with limited resources. Most cannot afford the amount we charge for article publishing fees, which in comparison, is very limited. So we have to be extremely creative to operate.&lt;/p>
&lt;p>Faire fonctionner une revue dans notre contexte est difficile. Les défis critiques sont aussi fondamentaux que la disponibilité constante de l&amp;rsquo;électricité ou une connexion Internet stable et rapide. Comment maintenir un flux stable des revenus pour soutenir la revue constitue également un défi crucial. La plupart de nos auteurs sont jeunes, autofinancés, avec des ressources limitées et par conséquent n’arrivent pas à payer les frais de publication d&amp;rsquo;articles pourtant très bas. Nous devons donc être extrêmement créatifs pour gérer nos charges.&lt;/p>
&lt;blockquote>
&lt;p>How would you describe the value of being part of the Crossref community; what impact has your participation had on your goals?&lt;br>
Comment décririez-vous la valeur de faire partie de la communauté Crossref ? Quel est l’impact de votre participation sur vos objectifs ?&lt;/p>
&lt;/blockquote>
&lt;p>As a Crossref ambassador, I talk about Crossref around me, among my colleagues whether they are in Kenya or Cameroon. I shared the links to participate in Crossref webinars with my colleagues. I invited them to become ambassadors by sharing with them the links to join the community. I participated in several ambassador training webinars on different themes including: how to submit DOI to Crossref, ORCID. I participated in a Crossref event in Nairobi, Kenya. It was a memorable moment where I was able to meet other ambassadors. We were able to have a small meeting on the difficulties we encountered in growing the Crossref community in Africa. We produced a document to this effect which we submitted to Crossref in 2022. For the moment, I have not yet been able to organize an event as an ambassador, but I would like to with the help of Crossref. But being an ambassador is not the easiest thing because sometimes in our context people do not understand the use of Crossref&amp;rsquo;s services because we are in an environment where the DOI is not yet very well known, and where even publishers know nothing about this. A question I am often asked is whether this work is paid and are discouraged when they learn that it is voluntary work.&lt;/p>
&lt;p>Comme ambassadrice de Crossref, je parle autour de moi de Crossref, parmi mes collègues qu’ils soient au Kenya ou au Cameroun. J’ai partagé les liens pour participer à des webinaires de Crossref à mes collègues. Je les ai invités à devenir des ambassadeurs en partageant avec eux les liens pour rejoindre la communauté. J’ai participé à plusieurs webinaires de formation des ambassadeurs sur différents thèmes notamment ORCID. J’ai également participe à un évènement de Crossref à Nairobi au Kenya. C’était un moment mémorable ou j’ai pu rencontrer d’autres ambassadeurs. Nous avons pu faire une petite réunion sur les difficultés que nous rencontrons pour faire grandir la communauté Crossref en Afrique. Nous avons d’ailleurs produit un document à cet effet que nous avons soumis à Crossref en 2022. Pour l’instant, je n’ai pas encore pu organiser d’évènement dans le cadre d’ambassadeur, mais j’aimerais avec l’aide de Crossref voir comment le faire. Etre ambassadrice n’est pas la chose la plus facile car parfois dans notre contexte les gens ne comprennent pas le bien-fondé des services de Crossref car on est dans un environnement ou le DOI n’est pas encore très connu, et où beaucoup de journaux et même d’editeurs ne savent rien de cela. Une question qu’on me pose souvent est celle savoir si ce travail est remunere et se découragent quand ils apprennent que c’est du bénévolat.&lt;/p>
&lt;blockquote>
&lt;p>For you, what would be the most important thing Crossref could change (do more of/do better in)?&lt;br>
Pour vous, quelle serait la chose la plus importante que Crossref pourrait changer (faire plus/faire mieux) ?&lt;/p>
&lt;/blockquote>
&lt;p>Crossref could invest in more capacity building, events, and communications in this part of the world. Why not localize Crossref in the francophone part of Africa? Crossref could offer continuing educational activities to professionals in order to improve their skills or acquire new knowledge in metadata and correlative disciplines. Crossref could also sponsor/support journal publishing and scholarship in Africa.&lt;/p>
&lt;p>Crossref pourrait investir dans davantage de renforcement des capacités, d&amp;rsquo;événements et de communications dans cette partie du monde. Pourquoi ne pas localiser Crossref dans la partie francophone de l’Afrique ? Crossref pourrait proposer des activités de formation continue aux professionnels afin d&amp;rsquo;améliorer leurs compétences ou d&amp;rsquo;acquérir de nouvelles connaissances dans les métadonnées et les disciplines corrélatives. Crossref pourrait également sponsoriser/soutenir la publication de revues et les bourses d’études en Afrique.&lt;/p>
&lt;blockquote>
&lt;p>Which other organisations do you collaborate with or are pivotal to your work in open science?&lt;br>
Avec quelles autres organisations collaborez-vous ou alors quelles sont les organismes pivot au cœur de votre travail en science ouverte ?&lt;/p>
&lt;/blockquote>
&lt;p>I collaborate with various institutions such as &lt;a href="https://publicationethics.org/" target="_blank">COPE (Committee on Publication Ethics)&lt;/a>, &lt;a href="https://www.ajol.info/" target="_blank">AJOL African Journals Online&lt;/a>, and &lt;a href="https://oaspa.org/" target="_blank">OASPA (Open Access Scholarly Publishing Association)&lt;/a>. I attend webinars of these organisations on behalf of my journal.&lt;/p>
&lt;p>Je collabore avec diverses institutions telles que &lt;a href="https://publicationethics.org/" target="_blank">COPE (Committee on Publication Ethics)&lt;/a>, &lt;a href="https://www.ajol.info/" target="_blank">AJOL African Journals Online&lt;/a>, et &lt;a href="https://oaspa.org/" target="_blank">OASPA (Open Access Scholarly Publishing Association)&lt;/a>. J&amp;rsquo;assiste à des webinaires de ceux-ci organisations au nom de ma revue.&lt;/p>
&lt;blockquote>
&lt;p>What are your plans for the future?&lt;br>
Quels sont vos plans pour l&amp;rsquo;avenir ?&lt;/p>
&lt;/blockquote>
&lt;p>My plan for the future is to continue working in science communication with different other organisations, and more within my community.&lt;/p>
&lt;p>Mon plan pour l&amp;rsquo;avenir est de continuer à travailler dans le domaine de la communication scientifique avec différentes autres organisations, et davantage au sein de ma communauté.&lt;/p>
&lt;p>Thank you, Audrey!&lt;/p>
&lt;p>Merci, Audrey !&lt;/p></description></item><item><title>Feedback on automatic digital preservation and self-healing DOIs</title><link>https://www.crossref.org/blog/feedback-on-automatic-digital-preservation-and-self-healing-dois/</link><pubDate>Thu, 28 Sep 2023 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/feedback-on-automatic-digital-preservation-and-self-healing-dois/</guid><description>&lt;p>Thank you to everyone who responded with feedback on the Op Cit proposal. This post clarifies, defends, and amends the original proposal in light of the responses that have been sent. We have endeavoured to respond to every point that was raised, either here or in the document comments themselves.&lt;/p>
&lt;p>&lt;b>We strongly prefer for this to be developed in collaboration with CLOCKSS, LOCKSS, and/or Portico, i.e. through established preservation services that already have existing arrangements in place, are properly funded, and understand the problem space. There is low level of trust in the Internet Archive, also given a number of ongoing court cases and erratic behavior in the past. People are questioning the sustainability and stability of IA, and given it is not funded by publishers or other major STM stakeholders there is low confidence in IA setting their priorities in a way that is aligned with that of the publishing industry.&lt;/b>&lt;/p>
&lt;p>We acknowledge that some of our members have a low level of trust in The Internet Archive, but many of our (primarily open access members) work very closely with the IA and our research has shown that, without the IA, the majority our smaller open access members would have almost no preservation at all. We have already had conversations with CLOCKSS and Portico about involvement in the pilot and thinking through what a scale-to-production would look like. That said, for a proof-of-concept, the Internet Archive presents a very easy way to get off the ground, with a stable system that has been running for almost 30 years.&lt;/p>
&lt;p>&lt;b>This seems to be a service for OA content only, but people wonder for how long. Someone already spotted an internal CrossRef comment on the working doc that suggested “why not just make it default for everything &amp;amp; everyone”, and that raises concern.&lt;/b>&lt;/p>
&lt;p>The primary audience for this service is small OA publishers that are, at present, poorly preserved. These publishers present a problem for the whole scholarly environment because linking to their works can prove non-persistent if preservation is not well handled. Enhancing preservation for this sector therefore benefits the entire publishing industry by creating a persistent linking environment. We have no plans to make this the “default for everything and everyone” because the licensing challenges alone are massive, but also because it isn’t necessary. Large publishers like Elsevier are doing a good job of digitally preserving their content. We want this service to target the areas that are currently weaker.&lt;/p>
&lt;p>Crossref will always respect the content rights of our members. We never force our members to release their content through Crossref that they don&amp;rsquo;t ask us to release.&lt;/p>
&lt;p>The purpose of the Op Cit project is to make it easier for our members to fulfil commitments they already made when they joined Crossref.&lt;/p>
&lt;p>Crossref is fundamentally an infrastructure for preserving citations and links in the scholarly record. We cannot do that if the content being cited or linked to disappears.&lt;/p>
&lt;p>When signing the Crossref membership agreement, members agree to employ their best efforts to preserve their content with archiving services so that Crossref can continue to link citations to it even in extremis. For example- if they have ceased operations.&lt;/p>
&lt;p>Some of our members already do this well. They have already made arrangements with the major archiving providers. They do not need the Op Cit service to help them with archiving. However, the Op Cit service will still help them ensure that the DOIs that they cite continue work. So it will still benefit them even if they don&amp;rsquo;t use it directly.&lt;/p>
&lt;p>However, our research shows that many of our members are not fulfilling the commitments they made when joining Crossref. Over the next few years, we will be trying to fix this. Primarily through outreach- encouraging members to set up and record with Crossref archiving arrangements with the archives of their choice.&lt;/p>
&lt;p>But we know some members will find this too technically challenging and/or costly. [And frankly, given what we&amp;rsquo;ve learned of the archiving landscape, we can see their point.] The proposed Op Cit service is for these members. The vast majority of these members are Open Access publishers, so the &amp;ldquo;rights&amp;rdquo; questions are far more straightforward- making the implementation of such a service much more tractable.&lt;/p>
&lt;p>&lt;b>Someone asked what this means for the publisher-specific DOI prefix for this content? Will this be lost?&lt;/b>&lt;/p>
&lt;p>No.&lt;/p>
&lt;p>&lt;b>There is concern about the interstitial page that Crossref would build that gives the user access options. The value of Crossref to publishers is adding services that are invisible and beneficial to users, not adding a visible step that requires user action.&lt;/b>&lt;/p>
&lt;p>There is nothing in Crossref’s terms that says that we have to be invisible. The basic truth is that detecting content drift is really hard and several efforts to do so before have failed. Without a reliable way of knowing whether we should display the interstitial page, which may become possible in future, we have to display something for now, or the preservation function will not work.&lt;/p>
&lt;p>Crossref has, also, supported user-facing interstitial services for over a decade, including:&lt;/p>
&lt;ul>
&lt;li>Multiple Resolution&lt;/li>
&lt;li>Coaccess&lt;/li>
&lt;li>CrossMark&lt;/li>
&lt;li>Crossref Metdata Search&lt;/li>
&lt;li>REST API&lt;/li>
&lt;/ul>
&lt;p>So we have a long track record of non-B2B service provision.&lt;/p>
&lt;p>&lt;b>There is confusion about why Crossref seems to want to build the capacity to “lock” records in absence of flexibility. People feel no need for Crossref to get involved here.&lt;/b>&lt;/p>
&lt;p>This is a misunderstanding of the terminology. The Internet Archive allows the domain owner to request content to be removed. This would mean that, in future, if a new domain owner wanted, they could remove previously preserved material from the archive, thereby breaking the preservation function. When we say we want to “lock” a record, we mean that a future domain owner cannot remove content from the preservation archive. This also prevents domain hijackers from compromising the digital preservation.&lt;/p>
&lt;p>&lt;b>There is concern about the possibility to hack this system to give uncontrolled access to all full-text content by attacking publishing systems and making them unavailable. This is an unhappy path scenario but something on people’s minds.&lt;/b>&lt;/p>
&lt;p>The system only works on content that is provided with an explicitly stated open license (see response above).&lt;/p>
&lt;p>&lt;b>I think this project would be improved by better addressing the people doing the preservation maintenance work that this requires. Digital preservation is primarily a labor problem, as the technical challenges are usually easier than the challenge of consistently paying people to keep everything maintained over time. Through that lens, this is primarily a technical solution to offload labor resources from small repositories to (for now) the Internet Archive, where you can get benefits from the economies of scale. There are definitely cases where that could be useful! But I think making this more explicit will further a shared understanding of advantages and disadvantages and help you all see future roadblocks and opportunities for this approach.&lt;/b>&lt;/p>
&lt;p>This consultation phase was designed, precisely, to ensure that those working in the space could have their say. While this is a technical project, we recognize that any solution must value and understand labor. That means that any scaling to production must and will also include a funding solution to address the social labor challenge.&lt;/p>
&lt;p>&lt;b>Is there any sense in polling either the IA Wayback Machine or the LANL Memento Aggregator first to determine if snapshot(s) already exist?&lt;/b>&lt;/p>
&lt;p>We could do this, but it would add an additional hop/lookup on deposit. Plus, we want to store the specific version deposited at the specific time it is done, including re-deposits.&lt;/p>
&lt;p>&lt;b>I would encourage looking at a distributed file system like IPFS (&lt;a href="https://en.wikipedia.org/wiki/InterPlanetary_File_System%29" target="_blank">https://en.wikipedia.org/wiki/InterPlanetary_File_System)&lt;/a>. This would allow easy duplication, switching and peering of preservation providers. Correctly leveraged with IPNS; resolution, version tracking and version immutability also become benefits. Later after beta the IPNS metadata could be included as DOI metadata.&lt;/b>&lt;/p>
&lt;p>We had considered IPFS for other projects, but really, for this, we want to go with recognised archives, not end up running our own infrastructure for preservation.&lt;/p>
&lt;p>&lt;b>It might be useful to look into the 10320/loc option for the Handle server: the &lt;a href="https://www.handle.net/overviews/handle_type_10320_loc.html" target="_blank">https://www.handle.net/overviews/handle_type_10320_loc.html&lt;/a>. I can imagine a use case where a machine agent might want to access an archive directly without needing to go to an interstitial page.&lt;/b>&lt;/p>
&lt;p>It is good to see reference to the HANDLE system and alternative ways that we might use it. We will consult internally on the technical viability of this.&lt;/p>
&lt;p>In general, though, we prefer to use web-native mechanisms when they are available. We already support direct machine access via HTTP redirects and by exposing resource URLs in the metadata that can be retreivd via content negotiation. In this case, we would be looking at supporting the 300 (multiple choice) semantics.&lt;/p>
&lt;p>&lt;b>I&amp;rsquo;m curious to see how this will work for DOI versioning mechanisms like in Zenodo, where you have one DOI to reference all versions as well as version specific DOIs. If your record contains metadata + many files and a new version just versions one of the several files my assumption is that within the proposed system an entire new set (so all files) is archived. In theory this could also be a logical package, where simply the delta is stored, but I guess in a distributed preservation framework like the one proposed here, this would be hard to achieve.&lt;/b>&lt;/p>
&lt;p>This is a good point and it could lead to many more, frustrating, hops before the user reaches the content. We will conduct further research into this scenario, but we also note that Zenodo&amp;rsquo;s DOIs do not come from Crossref, but from DataCite.&lt;/p>
&lt;p>&lt;b>There&amp;rsquo;s a decent body of research at this point on automated content drift detection. This recent paper: &lt;a href="https://ceur-ws.org/Vol-3246/10_Paper3.pdf" target="_blank">https://ceur-ws.org/Vol-3246/10_Paper3.pdf&lt;/a> likely has links to other relevant articles.&lt;/b>&lt;/p>
&lt;p>We have no illusions about the difficulty of detecting semantic drift but this is helpful and interesting. We will read this material and related articles to appraise the current state of content drift detection.&lt;/p>
&lt;p>&lt;b>Out of curiosity, will we be using one type of archive (i.e., IA or CLOCKSS or LOCKSS or whatever) or will it possibly be a combination of a few archives? Reading the comments, it looks like some of them charge a fee, so I see why we&amp;rsquo;d use open source solutions first. Also, eventually could it be something that the member chooses? i.e. which archive they might want to use. Again, the latter question isn&amp;rsquo;t something for the prototype, but I&amp;rsquo;m curious about this use case. Also, I wonder about the implementation details if it is more than one archive. The question is totally moot of course, if we&amp;rsquo;re sticking with one archive for now.&lt;/b>&lt;/p>
&lt;p>The design will allow for deposit in multiple archives – and we will have to design a sustainability model that will cover those archives that need funding. As above, this is an important part of the move to production.&lt;/p>
&lt;p>&lt;b>Will be good for future interoperability to make sure at least one of the hashes is a SoftWare Hash IDentifier (see swhid.org). The ID is not really software specific and will interoperate with the Software Heritage Archive and git repositories.&lt;/b>&lt;/p>
&lt;p>We will certainly ensure best practices for checksums.&lt;/p>
&lt;p>&lt;b>Comments on the Interstitial Page&lt;/b>&lt;/p>
&lt;p>&lt;b>I&amp;rsquo;d keep the interstitial page without planning its eradication. (See why in the last paragraph)
I&amp;rsquo;d even advocate for it to be a beautiful and useful reminder to users that &amp;ldquo;This content is preserved&amp;rdquo;.
I&amp;rsquo;d go further and recommend that publishers deposit alternate urls of other preservation agents like PMC etc, that would also be displayed. This page could even be merged with multi-resolution system.&lt;/p>
&lt;p>The why: I&amp;rsquo;m concerned of hackers and of predatory publishers exploiting the spider heuristics by highjacking small journals and keeping just enough metadata as in them as to fool the resolver and then adding links to whatever products, scams and whatnots&amp;hellip;&lt;/b>&lt;/p>
&lt;ul>
&lt;li>&lt;/li>
&lt;/ul>
&lt;p>&lt;b>Technical. Scraping landing pages is hard. We&amp;rsquo;ve had a lot of projects to do this over the years. You can mitigate the risk by tiering / heuristics. Maybe even feedback loop to publishers to encourage them to put the right metadata on the landing page.&lt;/b>&lt;/p>
&lt;ul>
&lt;li>&lt;/li>
&lt;/ul>
&lt;p>&lt;b>This is the only part of this proposal that I don&amp;rsquo;t like. People are used to DOIs resolving directly to content, and I don&amp;rsquo;t think that should be changed unless absolutely necessary. I would prefer that the DOI resolves to the publisher&amp;rsquo;s copy if it exists, and the IA copy otherwise.&lt;/b>&lt;/p>
&lt;p>We will continue the discussion about the interstitial page. The basic technical fact, as above, is that detecting content drift is hard and so we may need, at least, to start with the page. However, some commentators presented reasons for keeping it.&lt;/p>
&lt;p>We also have already supported interstitial pages for multiple resolution and co-access for over a decade.&lt;/p>
&lt;p>It is member&amp;rsquo;s choice whether they wish to deposit alternative URLs and we already have a mechanism for this.&lt;/p></description></item><item><title>2023 board election slate</title><link>https://www.crossref.org/blog/2023-board-election-slate/</link><pubDate>Wed, 27 Sep 2023 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2023-board-election-slate/</guid><description>&lt;p>I’m pleased to share the 2023 board election slate. Crossref’s &lt;a href="https://www.crossref.org/committees/nominating">Nominating Committee&lt;/a> received 87 submissions from members worldwide to fill seven open board seats.&lt;/p>
&lt;p>We maintain a balance of eight large member seats and eight small member seats. A member’s size is determined based on the membership fee tier they pay. We look at how our total revenue is generated across the membership tiers and split it down the middle. Like last year, about half of our revenue came from members in the tiers $0 - $1,650, and the other half came from members in tiers $3,900 - $50,000. We have two large member seats and five small member seats open for election in 2023.&lt;/p>
&lt;p>The Nominating Committee presents the following slate.&lt;/p>
&lt;h2 id="the-2023-slate">The 2023 slate&lt;/h2>
&lt;h3 id="tier-1-candidates-electing-five-seats">Tier 1 candidates (electing five seats):&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Beilstein-Institut&lt;/strong>, Wendy Patterson&lt;/li>
&lt;li>&lt;strong>Korean Council of Science Editors&lt;/strong>, Kihong Kim&lt;/li>
&lt;li>&lt;strong>Lujosh Ventures Limited&lt;/strong>, Olu Joshua&lt;/li>
&lt;li>&lt;strong>NISC Ltd&lt;/strong>, Mike Schramm&lt;/li>
&lt;li>&lt;strong>OpenEdition&lt;/strong>, Marin Dacos&lt;/li>
&lt;li>&lt;strong>Universidad Autónoma de Chile&lt;/strong>, Dr. Ivan Suazo&lt;/li>
&lt;li>&lt;strong>Vilnius University&lt;/strong>, Vincas Grigas&lt;/li>
&lt;/ul>
&lt;h3 id="tier-2-candidates-electing-two-seats">Tier 2 candidates (electing two seats):&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Association for Computing Machinery (ACM)&lt;/strong>, Scott Delman&lt;/li>
&lt;li>&lt;strong>Oxford University Press&lt;/strong>, James Phillpotts&lt;/li>
&lt;li>&lt;strong>Public Library of Science (PLOS)&lt;/strong>, Dan Shanahan&lt;/li>
&lt;li>&lt;strong>University of Chicago Press&lt;/strong>, Ashley Towne&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="here-are-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2023-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2023-slate/">Here are the candidates&amp;rsquo; organisational and personal statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process by voting in the election&lt;/h2>
&lt;p>If your organisation is a voting member in good standing of Crossref as of September 10th, 2023, you are eligible to vote when voting opens on September 27th, 2023.&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>Your organisation’s designated voting contact will receive an email from eBallot the week of September 25th with the Formal Notice of Meeting and Proxy Form with concise instructions on how to vote. The email will include a username and password with a link to our voting platform.&lt;/p>
&lt;p>The election results will be announced at the &lt;a href="https://www.crossref.org/crossref-live-annual/">LIVE23 online meeting&lt;/a> on October 31st, 2023. Save the date! Incoming members will take their seats at the March 2024 board meeting.&lt;/p></description></item><item><title>News: Crossref and Retraction Watch</title><link>https://www.crossref.org/blog/news-crossref-and-retraction-watch/</link><pubDate>Tue, 12 Sep 2023 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/news-crossref-and-retraction-watch/</guid><description>&lt;p>&lt;img src="https://www.crossref.org/img/crossref-logo-icon-only.svg" alt="Crossref logo icon" height="15" style="display: inline;" /> &lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">https://doi.org/10.13003/c23rw1d9&lt;/a>&lt;/p>
&lt;h2 id="crossref-acquires-retraction-watch-data-and-opens-it-for-the-scientific-community">Crossref acquires Retraction Watch data and opens it for the scientific community&lt;/h2>
&lt;p>&lt;em>&lt;strong>Agreement to combine and publicly distribute data about tens of thousands of retracted research papers, and grow the service together&lt;/strong>&lt;/em>&lt;/p>
&lt;p>&lt;em>12th September 2023&lt;/em> —&amp;ndash; The Center for Scientific Integrity, the organisation behind the Retraction Watch blog and database, and Crossref, the global infrastructure underpinning research communications, both not-for-profits, announced today that the Retraction Watch database has been acquired by Crossref and made a public resource. An agreement between the two organisations will allow Retraction Watch to keep the data populated on an ongoing basis and always open, alongside publishers registering their retraction notices directly with Crossref.&lt;/p>
&lt;div class="shortcode-divwrap align-left">
&lt;span>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/rw-cr-announcement.png"
alt="crossref-acquires-retraction-watch-data" width="100%">
&lt;/figure>
&lt;/span>
&lt;/div>
&lt;p>Both organisations have a shared mission to make it easier to assess the trustworthiness of scholarly outputs. Retractions are an important part of science and scholarship regulating themselves and are a sign that academic publishing is doing its job. But there are more journals and papers than ever, so identifying and tracking retracted papers has become much harder for publishers and readers. That, in turn, makes it difficult for readers and authors to know whether they are reading or citing work that has been retracted. Combining efforts to create the largest single open-source database of retractions reduces duplication, making it more efficient, transparent, and accessible for all.&lt;/p>
&lt;p>Product Director Rachael Lammey says, “Crossref is focused on documenting and clarifying the scholarly record in an open and scalable form. For a decade, our members have been recording corrections and retractions through our infrastructure, and incorporating the Crossmark button to alert readers. Collaborating with Retraction Watch augments publisher efforts by filling in critical gaps in our coverage, helps the downstream services that rely on high-quality, open data about retractions, and ultimately directly benefits the research community.”&lt;/p>
&lt;p>The Center for Scientific Integrity and the Retraction Watch blog will remain separate from Crossref and will continue their journalistic work investigating retractions and related issues; the agreement with Crossref is confined to the database only and Crossref itself remains a neutral facilitator in efforts to assess the quality of scientific works. Both organisations consider publishers to be the primary stewards of the scholarly record and they are encouraged to continue to add retractions to their Crossref metadata as a priority.&lt;/p>
&lt;p>“Retraction Watch has always worked to make our highly comprehensive and accurate retraction data available to as many people as possible. We are deeply grateful to the foundations, individuals, and members of the publishing services industry who have supported our efforts and laid the groundwork for this development,” said Ivan Oransky, executive director of the Center for Scientific Integrity and co-founder of Retraction Watch. “This agreement means that the Retraction Watch Database has sustainable funding to allow its work to continue and improve.”&lt;/p>
&lt;p>Please join Crossref and Retraction Watch leadership, among other special guests, for a community call on 27th September at 1 p.m. UTC to discuss this new development in the pursuit of research integrity.&lt;/p>
&lt;hr>
&lt;h3 id="supporting-details">Supporting details&lt;/h3>
&lt;ul>
&lt;li>Crossref retractions number 14k, and the Retraction Watch database currently numbers 43k. There is some overlap, making a total of around 50k retractions.&lt;/li>
&lt;li>&lt;del>The full dataset has been released through Crossref’s Labs API, initially as a .csv file to download directly: &lt;a href="https://api.labs.crossref.org/data/retractionwatch?ginny@crossref.org" target="_blank">https://api.labs.crossref.org/data/retractionwatch?name@email.org&lt;/a> (add your ‘mailto’).&lt;/del> &lt;em>Edit: 2024-10-10:&lt;/em> The full dataset is available in a git repository at &lt;a href="https://gitlab.com/crossref/retraction-watch-data" target="_blank">https://gitlab.com/crossref/retraction-watch-data&lt;/a>.&lt;/li>
&lt;li>The Crossref Labs API also displays information about retractions in the &lt;code>/works/&lt;/code> route when metadata is available, such as &lt;a href="https://api.labs.crossref.org/works/10.2147/CMAR.S324920?mailto=ginny@crossref.org" target="_blank">https://api.labs.crossref.org/works/10.2147/CMAR.S324920?name@email.org&lt;/a> (add your ‘mailto’). If you don&amp;rsquo;t have a .json viewer, please see below for screenshot.&lt;/li>
&lt;li>Crossref is paying an initial acquisition fee of USD $175,000 and will pay Retraction Watch USD $120,000 each year, increasing by 5% each year.&lt;/li>
&lt;li>The initial term of the contract is five years. &lt;del>The full text of the contract will be made public in the coming fortnight.&lt;/del> &lt;em>EDIT 2023-09-26:&lt;/em> &lt;a href="https://www.crossref.org/pdfs/retraction-watch-crossref-fully-executed-23-08-2023.pdf">Here is the signed agreement&lt;/a>.&lt;/li>
&lt;li>There will be a community call on 27th September at 1 p.m. UTC (your time zone &lt;a href="https://dateful.com/eventlink/3093150191" target="_blank">here&lt;/a>). Please &lt;a href="https://crossref.zoom.us/webinar/register/WN_U0naDJTCQIS_sQECv8Aa4Q" target="_blank">register&lt;/a>.&lt;/li>
&lt;li>An open &lt;a href="https://docs.google.com/document/d/1GabgCP_sUwvW2XEtOfWmFwNIpizagWvlreALWvbZY8Y/edit" target="_blank">FAQ document&lt;/a> is available to collect questions to be answered at the webinar.&lt;/li>
&lt;li>This announcement will always be accessible via Crossref DOI &lt;a href="https://doi.org/10.13003/c23rw1d9" target="_blank">https://doi.org/10.13003/c23rw1d9&lt;/a>; please use this persistent link for sharing.&lt;/li>
&lt;/ul>
&lt;h5 id="about-retraction-watch-and-the-center-for-scientific-integrity">About Retraction Watch and The Center for Scientific Integrity&lt;/h5>
&lt;p>The Center for Scientific Integrity is a U.S. 501(c)3 non-profit whose mission is to promote transparency and integrity in science and scientific publishing, and to disseminate best practices and increase efficiency in science. In addition to maintaining and curating the Retraction Watch Database, the Center is the home of &lt;a href="http://retractionwatch.com" target="_blank">Retraction Watch&lt;/a>, a blog founded in 2010 that reports on scholarly retractions and related issues in research integrity.&lt;/p>
&lt;h5 id="about-crossref">About Crossref&lt;/h5>
&lt;p>&lt;a href="https://www.crossref.org" target="_blank">Crossref&lt;/a> is a global community infrastructure that makes all kinds of research objects easy to find, assess, and reuse through a number of services critical to research communications, including an open metadata API that sees over 2 billion queries every month. Crossref’s &amp;gt;19,000 members come from 151 countries and are predominantly university-based. Their ~150 million DOI records contribute to the collective vision of a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.&lt;/p>
&lt;h3 id="enquiries">Enquiries&lt;/h3>
&lt;ul>
&lt;li>For Retraction Watch/Center for Scientific Integrity: Ivan Oransky, &lt;a href="mailto:ivan@retractionwatch.com?subject=Crossref%20and%20Retraction%20Watch">ivan@retractionwatch.com&lt;/a>&lt;/li>
&lt;li>For Crossref: Ginny Hendricks, &lt;a href="mailto:ginny@crossref.org?subject=Retraction%20Watch%20and%20Crossref">ginny@crossref.org&lt;/a>&lt;/li>
&lt;/ul>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/sample-record-retraction-watch-border.png"
alt="A screenshot of an example Labs API metadata record with a Retraction Watch-asserted retraction" width="100%">&lt;figcaption>
&lt;p>A screenshot of an example Labs API metadata record with a Retraction Watch-asserted retraction&lt;/p>
&lt;/figcaption>
&lt;/figure></description></item><item><title>Open Funder Registry to transition into Research Organisation Registry (ROR)</title><link>https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organisation-registry-ror/</link><pubDate>Thu, 07 Sep 2023 00:00:00 +0000</pubDate><author>Amanda French</author><guid>https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organisation-registry-ror/</guid><description>&lt;p>Today, we are announcing a long-term plan to deprecate the &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a>. For some time, we have understood that there is significant overlap between the Funder Registry and the &lt;a href="https://ror.org" target="_blank">Research Organisation Registry (ROR)&lt;/a>, and funders and publishers have been asking us whether they should use Funder IDs or ROR IDs to identify funders. It has therefore become clear that &lt;strong>merging the two registries will make workflows more efficient and less confusing for all concerned.&lt;/strong> Crossref and ROR are therefore working together to ensure that Crossref members and funders can use ROR to simplify persistent identifier integrations, to register better metadata, and to help connect research outputs to research funders.&lt;/p>
&lt;p>Just yesterday, we published &lt;a href="https://doi.org/10.64000/3f63f-yt393" target="_blank">a summary of a recent workshop between funders and publishers on funding metadata workflows&lt;/a> that we convened with the Dutch Research Council (NWO) and Sesame Open Science. As the report notes, &amp;ldquo;open funding metadata is arguably the next big thing&amp;rdquo; [in Open Science]. That being the case, we think this is the ideal time to strengthen our support of open funding metadata by beginning this transition to ROR.&lt;/p>
&lt;h2 id="comparing-the-features-of-ror-and-the-funder-registry">Comparing the features of ROR and the Funder Registry&lt;/h2>
&lt;p>Let&amp;rsquo;s look at some of the major similarities and differences between the two registries, including their history, features, scope, and usage, since there are important nuances and distinctions that are helpful to understand.&lt;/p>
&lt;h3 id="overview">Overview&lt;/h3>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>ROR&lt;/th>
&lt;th>Funder Registry&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Launched in 2019&lt;/td>
&lt;td>Launched in 2013&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Primary use case is contributor affiliation&lt;/td>
&lt;td>Primary use case is funding acknowledgement&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>105k+ records&lt;/td>
&lt;td>35k+ records&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>CC0 data&lt;/td>
&lt;td>CC0 data&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>REST API&lt;/td>
&lt;td>REST API&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Free to use&lt;/td>
&lt;td>Free to use&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Entire registry downloadable as JSON and CSV&lt;/td>
&lt;td>Entire registry downloadable as RDF; funder names and IDs downloadable as CSV&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Records contain mappings to other IDs&lt;/td>
&lt;td>Records do not contain mappings to other IDs&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>organisation relationships and hierarchy&lt;/td>
&lt;td>organisation relationships and hierarchy&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>8 organisation types&lt;/td>
&lt;td>2 funder types, 8 funder subtypes&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Open source code and multiple open-source tools available&lt;/td>
&lt;td>Open source code&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Web-based registry search&lt;/td>
&lt;td>Web-based search for works in Crossref associated with each Funder ID&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Web-based landing pages for each ROR record&lt;/td>
&lt;td>JSON landing pages for each Funder Registry record&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Updated monthly&lt;/td>
&lt;td>Updated monthly&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Public curation process&lt;/td>
&lt;td>Private curation process&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Anyone can request changes and additions&lt;/td>
&lt;td>Anyone can request changes and additions&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Stable financial support&lt;/td>
&lt;td>Stable financial support&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Beginning to be supported in funding and publishing workflows&lt;/td>
&lt;td>Somewhat well supported in most funding and publishing workflows&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Currently used by 260+ Crossref members &lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>&lt;/td>
&lt;td>Currently used by 2100+ Crossref members &lt;sup id="fnref:2">&lt;a href="#fn:2" class="footnote-ref" role="doc-noteref">2&lt;/a>&lt;/sup>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;h3 id="history">History&lt;/h3>
&lt;p>The &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a> was &lt;a href="https://www.crossref.org/news/2013-05-28-crossrefs-fundref-launches-publishers-and-funders-track-scholarly-output/">launched as FundRef over a decade ago&lt;/a> to enable the community to &lt;strong>cite research financing&lt;/strong> and assert it within the scholarly record, acknowledging the organisations granting their support. Elsevier generously donated the seed data for the Funder Registry and has managed its curation for the last ten years, while we have maintained the technical operations and promoted community adoption of the Funder Registry.&lt;/p>
&lt;p>The &lt;a href="https://ror.org/" target="_blank">Research Organisation Registry (ROR)&lt;/a> was &lt;a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/" target="_blank">introduced in 2019&lt;/a> by the California Digital Library, DataCite, and Crossref to enable the community to &lt;strong>cite contributor affiliations&lt;/strong> and assert them within the scholarly record, acknowledging the organisations that housed or performed the research. Digital Science generously donated the seed data for the Research Organisation Registry from its Global Research Identifier Database (GRID) initiative, and Crossref, DataCite, and the California Digital Library have contributed labor and resources to turn ROR into a mature, independent, freely available offering.&lt;/p>
&lt;h3 id="scope">Scope&lt;/h3>
&lt;p>One key difference between the registries is that &lt;strong>ROR has always included funding organisations, and ROR records have always included mappings to Funder IDs where available,&lt;/strong> while the reverse is not true: the Funder Registry includes only funding organisations, not other kinds of organisations, and Funder Registry records do not currently include mappings to ROR IDs or other identifiers. It therefore makes sense to expand ROR&amp;rsquo;s initial contributor affiliation use case to include the function of identifying research financing.&lt;/p>
&lt;h3 id="usage">Usage&lt;/h3>
&lt;p>More Crossref members use Funder IDs than use ROR IDs, to be sure. You can see from the table above that the number of Crossref members using Funder IDs in Crossref records is higher by almost a factor of 10 than the number of Crossref members using ROR IDs in Crossref records. But note too that &lt;strong>the current &lt;em>rate&lt;/em> of adoption is far higher for ROR than it is for the Funder Registry.&lt;/strong> Since &lt;a href="https://doi.org/10.64000/1nkjy-15275" target="_blank">January of 2022&lt;/a>, we&amp;rsquo;ve seen a gratifying number of publishers and service providers beginning to use ROR identifiers for contributor affiliations in Crossref. In the last year, the number of Crossref members depositing ROR IDs has increased by 356%, while the number depositing Funder IDs has increased only by 12%. As evidenced by its ballooning API traffic, too, with more than 20 million requests last month,&lt;sup id="fnref:3">&lt;a href="#fn:3" class="footnote-ref" role="doc-noteref">3&lt;/a>&lt;/sup> ROR is clearly being used by many scholarly research systems for many purposes. &lt;strong>The more systems that use an identifier, the more valuable that identifier becomes as a vehicle for exchanging information.&lt;/strong>&lt;/p>
&lt;p>Even though ROR&amp;rsquo;s primary use case has been to identify contributor affiliations, ROR is in fact already being used by funders. Nineteen funding organisations are depositing ROR IDs in their grant records with Crossref to denote principal investigator affiliations,&lt;sup id="fnref:4">&lt;a href="#fn:4" class="footnote-ref" role="doc-noteref">4&lt;/a>&lt;/sup> and, following a meeting of the &lt;a href="https://www.crossref.org/working-groups/funders/">Crossref Funder Advisory Group&lt;/a> last month, all eighty funder members are primed to start using ROR IDs to identify themselves in grant records. DataCite has allowed ROR IDs as a funding identifier since 2019&lt;sup id="fnref:5">&lt;a href="#fn:5" class="footnote-ref" role="doc-noteref">5&lt;/a>&lt;/sup>, and while there are currently over 877,000 DataCite records that use Funder IDs to identify funders,&lt;sup id="fnref:6">&lt;a href="#fn:6" class="footnote-ref" role="doc-noteref">6&lt;/a>&lt;/sup> there are also over 161,000 DataCite records that use ROR IDs to identify funders.&lt;sup id="fnref:7">&lt;a href="#fn:7" class="footnote-ref" role="doc-noteref">7&lt;/a>&lt;/sup>&lt;/p>
&lt;h3 id="tools-and-services">Tools and services&lt;/h3>
&lt;p>Both the Funder Registry and ROR offer open data and open source code, but we think that ROR&amp;rsquo;s suite of free and open source utilities (some of which were developed by Crossref staff) gives it a competitive advantage. We know that publishers and their service providers have ongoing challenges in collecting and matching funding information from authors and in validating Funder IDs. With ROR’s extensive toolkit, &lt;strong>publishers and their technology providers who adopt ROR will be in a much better position to improve the accuracy of funding acknowledgements in metadata, which can in turn enable the development of reliable analytics, tools, and services for funders, regulators, research facilities, and the public&lt;/strong>.&lt;/p>
&lt;p>Crossref has built tools based on OpenRefine for both the Funder Registry and ROR: the &lt;a href="https://www.crossref.org/labs/fundref-reconciliation-service/">Open Funder Registry Reconciliation Service&lt;/a> and the &lt;a href="https://ror.readme.io/docs/openrefine-reconciler" target="_blank">ROR Reconciler&lt;/a> are both useful ways to clean messy data. ROR, however, also offers a much-used &lt;a href="https://ror.readme.io/docs/affiliation-parameter" target="_blank">API endpoint that helps match organisation names to ROR IDs&lt;/a>, and several third parties have also developed and shared &lt;a href="https://ror.readme.io/docs/match-organisation-names-to-ror-ids#match-organisation-names-to-ror-ids-using-third-party-tools" target="_blank">open source matching tools and services for ROR&lt;/a>. Crossref and ROR are also collaborating on new strategies for affiliation matching that will be able to match funding references.&lt;/p>
&lt;h3 id="community-engagement-models">Community engagement models&lt;/h3>
&lt;p>The Funder Registry has been curated for over a decade through time and expertise generously donated by Elsevier. ROR offers more transparency and community involvement; it is &lt;a href="https://ror.org/about/#governance-model" target="_blank">openly governed&lt;/a> by Crossref, DataCite, and the California Digital Library and is advised by a global network of community stakeholders through its &lt;a href="https://ror.org/community/#steering-group" target="_blank">Steering Group and&lt;/a> Community Advisory Group. ROR is &lt;a href="https://github.com/ror-community/ror-updates/issues" target="_blank">openly curated&lt;/a> and is aided by a global &lt;a href="https://ror.org/registry/#curation-advisory-board" target="_blank">Curation Advisory Board&lt;/a> of volunteers.&lt;/p>
&lt;h3 id="summary">Summary&lt;/h3>
&lt;p>For all of the above reasons, then, we believe that in the long term ROR will serve the community better as an identifier for funders. In a future post, we&amp;rsquo;ll do an even deeper dive into comparing the Funder Registry and ROR, comparing the metadata and data in each registry and giving statistics on funder assertions in our metadata.&lt;/p>
&lt;h2 id="what-will-this-mean-for-you">What will this mean for you?&lt;/h2>
&lt;p>The many organisations whose tools, services, and workflows have been architected to use Funder Registry IDs will find this transition a challenge, and we don&amp;rsquo;t want to make light of that issue. Over the last ten years, we have encouraged the community to adopt Funder IDs, and the community has demonstrably recognized the benefits of doing so. Publishers have put a great deal of time, thought, and effort into collecting funder data and including it in Crossref metadata, and they have built internal reports and workflows around the Funder Registry. &lt;strong>Both Crossref and ROR are committed to making the transition from the Funder Registry to the Research Organisation Registry as simple as possible for those who have adopted the Funder Registry.&lt;/strong>&lt;/p>
&lt;p>If you are not already using the Funder Registry and are planning to begin standardizing funding data, we recommend that you use ROR to identify funders. If you are currently using the Funder Registry in your systems and workflows, don&amp;rsquo;t worry! &lt;strong>In the short term, and even in the medium term, Funder IDs aren&amp;rsquo;t going away.&lt;/strong> Eventually, however, the Funder Registry will cease to be updated, so any new funders will only be registrable in Crossref metadata with ROR IDs. Legacy Funder IDs and their mapping to ROR IDs will be maintained, so if Crossref members submit a legacy Funder ID, it will get mapped to a ROR ID automatically. Note, too, that Crossref is committed to maintaining the current funder API endpoints until ROR IDs become the predominant identifier for newly registered content.&lt;/p>
&lt;p>In short, if you are already using Funder IDs, you can and should continue to do so. However, we do recommend that you begin looking at what it will take to integrate ROR into your systems and workflows for identifying funders. Think of it as warming up before a workout: it&amp;rsquo;s time to start swinging your arms and stretching your hamstrings.&lt;/p>
&lt;p>We face challenges in this transition, too. Of these, we think the largest will be (1) completing the reconciliation work involved in mapping Funder IDs to ROR IDs, and (2) overhauling Crossref&amp;rsquo;s schemas, APIs, and deposit tools to support ROR IDs in all the ways we currently support Funder IDs. We&amp;rsquo;ll discuss both of these challenges in future blog posts, but it&amp;rsquo;s worth saying that &lt;strong>any challenges pale in comparison to the benefit of enabling the whole community to use a single open identifier in multiple places in the scholarly record.&lt;/strong>&lt;/p>
&lt;h2 id="tell-us-what-you-need">Tell us what you need!&lt;/h2>
&lt;p>We want to hear from you. You can use our &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a> talk to us about the Crossref Funder Registry, and you can &lt;a href="https://tinyurl.com/ror-slack" target="_blank">join the ROR Slack&lt;/a> to talk to the ROR team and community. You can also contact Crossref via our &lt;a href="https://support.crossref.org/hc/en-us/requests/new?ticket_form_id=360001642691" target="_blank">request form&lt;/a> or email ROR at &lt;a href="mailto:info@ror.org">info@ror.org&lt;/a>, and you can attend online &lt;a href="https://www.crossref.org/events/">Crossref events&lt;/a> and &lt;a href="https://ror.org/events" target="_blank">ROR events&lt;/a> to get updates from us and ask us your questions.&lt;/p>
&lt;p>One of the major messages we&amp;rsquo;re already hearing from funders and publishers is expressed in &lt;a href="https://doi.org/10.64000/3f63f-yt393/" target="_blank">yesterday&amp;rsquo;s post on open funding metadata&lt;/a>: &amp;ldquo;While many concluded that there was still a long way to go to solve the many technical challenges related to funding metadata, attendees were unanimous on its importance.&amp;rdquo; We look forward to beginning this important work together.&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>&lt;a href="https://api.crossref.org/works?filter=has-ror-id:t&amp;amp;facet=publisher-name:*" target="_blank">Crossref API works with ROR IDs faceted by publisher name&lt;/a>&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:2">
&lt;p>&lt;a href="https://api.crossref.org/works?filter=has-funder-doi:t&amp;amp;facet=publisher-name:*" target="_blank">Crossref API works with Funder IDs faceted by publisher name&lt;/a>&amp;#160;&lt;a href="#fnref:2" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:3">
&lt;p>&lt;a href="https://p.datadoghq.eu/sb/db1aec04-0c1a-11ec-860a-da7ad0900005-7d7c572812608235cca3359ee5ec591a?from_ts=1690924139911&amp;amp;to_ts=1693516139911&amp;amp;live=true" target="_blank">ROR API Public API Usage Insights&lt;/a>&amp;#160;&lt;a href="#fnref:3" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:4">
&lt;p>&lt;a href="http://api.crossref.org/works?filter=has-ror-id:t,type-name:Grant&amp;amp;facet=publisher-name:*" target="_blank">Crossref API works of type &amp;ldquo;Grant&amp;rdquo; with ROR IDs faceted by publisher name&lt;/a>&amp;#160;&lt;a href="#fnref:4" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:5">
&lt;p>&lt;a href="https://schema.datacite.org/meta/kernel-4.3/" target="_blank">DataCite Metadata Schema 4.3 release notes, August 2019&lt;/a>&amp;#160;&lt;a href="#fnref:5" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:6">
&lt;p>&lt;a href="https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:%22Crossref%20Funder%20ID%22&amp;amp;page[size]=0" target="_blank">DataCite API Funder ID in funding reference&lt;/a>&amp;#160;&lt;a href="#fnref:6" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:7">
&lt;p>&lt;a href="https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:ROR&amp;amp;page[size]=0" target="_blank">DataCite API ROR ID in funding reference&lt;/a>&amp;#160;&lt;a href="#fnref:7" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Open funding metadata through Crossref; a workshop to discuss challenges and improving workflows</title><link>https://www.crossref.org/blog/open-funding-metadata-community-workshop-report/</link><pubDate>Wed, 06 Sep 2023 00:00:00 +0000</pubDate><author>Hans de Jonge</author><guid>https://www.crossref.org/blog/open-funding-metadata-community-workshop-report/</guid><description>&lt;p>Ten years on from the launch of the &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">Open Funder Registry&lt;/a> (OFR, formerly FundRef), there is renewed interest in the potential of openly available funding metadata through Crossref. And with that: calls to improve the quality and completeness of that data. Currently, about 25% of Crossref records contain some kind of funding information. Over the years, this figure has grown steadily. A number of &lt;a href="https://doi.org/10.31222/osf.io/smxe5" target="_blank">recent publications&lt;/a> have shown, however, that there is considerable variation in the extent to which publishers deposit these data to Crossref. Technical but also business issues seem to lie at the root of this. Crossref - in close collaboration with the Dutch Research Council NWO and Sesame Open Science - brought together a group of 26 organisations from across the ecosystem to discuss the barriers and possible solutions. This blog presents some anonymized lessons learned.&lt;/p>
&lt;h2 id="there-is-no-open-science-without-open-metadata">There is no Open Science without open metadata&lt;/h2>
&lt;p>The interest in the potential of this open-source funding metadata seems to be entering a new stage. When registering (or updating) a DOI record for a publication, publishers can include information about the funding of the research. The Open Funder Registry grew out of recommendations in the &lt;a href="https://doi.org/10.1629/2048-7754.98" target="_blank">report from the US Scholarly Publishing Roundtable in 2010&lt;/a>. During the Annual Meeting of Crossref that year, Frederick Dylla, CEO of the American Institute of Physics, argued that in order to make research funding information in publications accessible, it needed to be presented in a standard way and stored in a central location.&lt;/p>
&lt;p>The benefits of having open funding metadata available, listed by Dylla in &lt;a href="https://www.slideshare.net/CrossRef/dylla-cross-refannual-general-mtg-nov2010" target="_blank">his presentation&lt;/a> 13 years ago, are still very valid:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Researchers&lt;/strong> benefit because it increases transparency of their funding sources and supports the requirements they already have from their funders.&lt;/li>
&lt;li>For &lt;strong>funders&lt;/strong>, having this data available is essential because it allows them to identify the published outcomes of publicly funded research. Essential to monitor compliance with open access policies, but also important given the pressures funders face to account for their spending of public money.&lt;/li>
&lt;li>For &lt;strong>publishers&lt;/strong>, funding metadata provides a valuable service, as it provides insight into how the research they publish is funded.&lt;/li>
&lt;/ul>
&lt;p>Although Crossref has been collating funding metadata for many years, there seems to be a renewed interest in this service. Publishers have long expressed a desire to solve the challenges, meta-researchers need this information in order to analyze research on research, editors are concerned with research integrity, including funding trends, and funders themselves need to track the reach and return of their support.&lt;/p>
&lt;p>Open Science seems to be an important driver: As we move to an ecosystem built on Open Science principles, not only publications, data, and software need to be openly available, but also the metadata associated with those scholarly outputs. Indeed, in an Open Science world, all meta information should be open, and academia should not be dependent anymore on data from proprietary bibliographic databases. Indicators for research assessment and policy development should be open indicators, derived from open metadata. Much has been done in this area already, in the context of &lt;a href="https://i4oc.org/" target="_blank">Open Citations&lt;/a> and &lt;a href="https://i4oa.org/" target="_blank">Open Abstracts&lt;/a>. While many in the community have focused on the bigger picture of advocating for all open metadata, e.g. &lt;a href="https://metadata2020.org/" target="_blank">Metadata 20/20&lt;/a>, open funding metadata is arguably the next big thing. Open Research Information, including open metadata, must be a strategic priority for science and society.&lt;/p>
&lt;h2 id="room-for-improvement">Room for improvement&lt;/h2>
&lt;p>After ten years of collecting funding metadata, 25% of records in Crossref contain some kind of funding information, and this figure was reached by a steady growth over that time. A number of recent studies have shown, however, that there is room for improvement. A case study published by two of the present authors has shown that the extent to which publishers deposit funding information to Crossref varies considerably. Some larger society presses - American Chemical Society (ACS), American Physical Society (APS), and Royal Society of Chemistry (RSC) - perform exceptionally well, with almost 100% of publications containing funding information. But there is still a large number of publishers - among them large legacy publishers - that attain substantially lower figures or do not seem to deposit funding metadata at all. Our case study has shown that often this cannot be explained by the fact that authors have not provided any funding information, as often this information is available in the acknowledgement sections of the papers. Somehow, however, this data does not find its way to Crossref.&lt;/p>
&lt;h2 id="workflows-and-challenges-collect-retain-validate-deposit">Workflows and challenges: collect, retain, validate, deposit&lt;/h2>
&lt;p>In order to chart the challenges that publishers face when collecting this information, we organized a roundtable session. 26 organisations were invited from across the ecosystem. These included: major publishers (American Chemical Society, British Medical Journal, Elsevier, IOP Publishing, PLOS, Royal Society of Chemistry, Sage, Springer Nature, Taylor &amp;amp; Francis, and Wiley), funders (European Research Council, Austrian Research Council, Dutch Research Council, OSTI-DOE, UKRI, and Michael J Fox Foundation) as well as service providers (Aries Editorial Manager, PKP / OJS, Scholastica, and eJournal Press).&lt;/p>
&lt;p>In order to map the potential barriers and challenges publishers face, participants were presented with a workflow scheme representing a hypothetical production process.&lt;/p>
&lt;p>This workflow outlined the steps in the production process at which funder information would potentially be handled, as well as some of the considerations that might be at play at each step.&lt;/p>
&lt;ul>
&lt;li>collecting funder information (upon submission or acceptance)&lt;/li>
&lt;li>extracting funder information from full text&lt;/li>
&lt;li>retaining funder information through the production workflow&lt;/li>
&lt;li>including funder information in article metadata&lt;/li>
&lt;li>making metadata and/or full text available for indexing&lt;/li>
&lt;/ul>
&lt;p>Participants were invited to comment on this workflow and place digital dots in the scheme to identify challenges in the collection, retention, and deposit of funding information. These pain points were afterwards fleshed out in break-out groups.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/funding-roundtable-scheme.png"
alt="publishing-workflow-funding-metadata" width="100%">
&lt;/figure>
&lt;h2 id="lessons-learned">Lessons learned&lt;/h2>
&lt;h3 id="1-still-a-lack-of-awareness-among-editors-and-authors">1. Still a lack of awareness among editors and authors&lt;/h3>
&lt;p>For many journals and publishers, collecting funding information starts when papers are submitted through submission systems. Many publishers use the same systems: ScholarOne and Editorial Manager, though many have multiple systems in place for different portfolios of journals. Around 25,000 journals use PKP’s Open Journal System, and Scholastica and eJournal Press are growing in popularity and importance. All of them provide the possibility for authors to enter funder information but this does not by all means mean that all journals make use of it. Submission systems are highly customizable, and publishers tend to tailor systems to the needs and wishes of their journals. Editors who do not see much value in collecting funding metadata therefore present a first ‘weak link’. Publishers and tech providers agreed that more outreach is needed about the importance of funding metadata among editors and authors.&lt;/p>
&lt;h3 id="2-improvements-are-needed-in-submission-systems">2. Improvements are needed in submission systems&lt;/h3>
&lt;p>Where journals and publishers agree on asking authors to register funding information through the submission systems, many express a tension between collecting structured metadata and making it as easy as possible for authors. Many are hesitant to use mandatory input fields. Instead, funding metadata is often collected as free text, giving rise to a plethora of ambiguities. Most systems provide suggestions based on the input of the author based on the Open Funder Registry. A lot seems to go wrong at this stage. Authors often persist in the wrong spelling of their funder and do not choose predefined suggestions, making it very difficult to match input to Funder IDs. Publishers estimated the number of non-matches up to 50%. Trivial issues like “Bill &amp;amp; Melinda” versus “Bill and Melinda” or “Netherlands organisation” versus “Netherlands Organisation” result in errors. Here, autocomplete techniques seem to be in dire need of improvement. Based on a preliminary analysis of funder name variants used in Crossref, adding up to 3 of the most frequently used name variants to the list of ‘alternative funder names’ in the Funder registry could solve around 60% of missed matches.&lt;/p>
&lt;h3 id="3-a-lot-can-be-learned-from-how-some-publishers-have-changed-and-organized-their-workflows">3. A lot can be learned from how some publishers have changed and organized their workflows&lt;/h3>
&lt;p>Faced with these issues, the Royal Society of Chemistry has invested in innovative workflows to enhance the availability of funding metadata. Instead of relying solely on the free text input of the authors, RSC presented to the group the details of how they have tackled the issue. In addition to author-provided acknowledgements, they work with third-party production vendors to programmatically extract information from the acknowledgement section of papers. Data from the two sources are compared, and when differences or conflicts are being noted, the data is fixed, completed, and reformatted. The next step is crucial - the newly-cleansed funding data is fed back to the author for validation, and retained during the production phase of the paper. Implementation of this validation stage has increased the availability of funding metadata by 30%. In 2023 80% of papers published by RSC have some kind of structured funding metadata. An additional benefit of this feedback loop was its educational effect by alerting authors to the importance of correct funding information. But even RSC continues to struggle with issues of funder name ambiguity, use of acronyms, authors reporting grant or award names instead of funder names, issues with phraseology of funding acknowledgements, and frustrations with the user experience of the service provider integrations with the OFR.&lt;/p>
&lt;p>Many publishers agreed that collecting funding information from full-text papers is the preferred option. Not only because it lowers the burden for authors, but also because this potentially renders better data as this is where authors are expected to include this information as part of their funder’s commitments.&lt;/p>
&lt;h3 id="4-retaining-information-and-submitting-no-big-deal">4. Retaining information and submitting: no big deal&lt;/h3>
&lt;p>At the beginning of the workshop, it was expected that maybe the retention of funding information and the propagation through various interlinked systems might pose problems for publishers. However, this was not identified as a problem by participants. Nor was there mention of any challenges in depositing information to Crossref, nor of downstream databases having difficulties retrieving the metadata.&lt;/p>
&lt;h3 id="5-there-is-a-genuine-interest-across-the-ecosystem-to-improve-funding-information-in-crossref">5. There is a genuine interest across the ecosystem to improve funding information in Crossref&lt;/h3>
&lt;p>While many concluded that there was still a long way to go to solve the many technical challenges related to funding metadata, attendees were unanimous on its importance. Participants agreed that these improvements would require investments from publishers. A willingness to do those was expressed, but also a sense that publishers who do should be incentivised for it, maybe as part of the agreements they have with library consortia. &lt;a href="https://repository.jisc.ac.uk/id/eprint/8904" target="_blank">JISC’s recent contract with Taylor &amp;amp; Francis&lt;/a> (page 164, Section 7a (iii)) is a good example of how consortia can successfully negotiate the supply of high quality metadata, including funding metadata. It was agreed that another solution could be to allow the additional deposit of the free-text acknowledgement section as a metadata field in Crossref. Instead of educating authors to enter their data correctly or relying on publishers and tech providers to improve their systems to turn free text funder acknowledgement text to structured data, text mining and machine learning could facilitate the improvement of this data.&lt;/p>
&lt;h2 id="next-steps">Next steps&lt;/h2>
&lt;p>For this workshop, we concentrated on the collection and registration of funding metadata by publishers and did not go into the important, related, issue of the Crossref Grant Linking System (Grant IDs) nor of the plans to further align funder IDs with ROR IDs, both projects that help the community to better record funding information.&lt;/p>
&lt;p>Next steps resulting from this community workshop, as&lt;/p>
&lt;ul>
&lt;li>Funders are encouraged to join and register their grants with Crossref DOIs so that &lt;a href="https://www.crossref.org/community/grants/" target="_blank">registered grants&lt;/a> can in future be &lt;a href="https://doi.org/10.64000/he02b-neb96" target="_blank">linked directly to publications&lt;/a> and other outputs. About 50 funders have already created around 90,000 grant records. The more grant DOIs that are created by funders, the more likely publishers will be able to prioritize collecting them in their own publication metadata.&lt;/li>
&lt;li>Publishers are encouraged to work with their service providers to prioritize the quality of the open funding metadata through Crossref, which is a source for downstream analyses and inclusion by many thousands of tools and services.&lt;/li>
&lt;li>Other stakeholders are also offering opportunities to focus on funding metadata, showing a growing interest in the completeness of funder metadata. For example, OA Switchboard’s &lt;a href="https://www.oaswitchboard.org/blog-post-18july2023-funder-pilot" target="_blank">funder pilot&lt;/a>, which also looks at the potential to feed enriched metadata back to Crossref to make them publicly available, and the Open Research Funder Group’s work to &lt;a href="https://www.orfg.org/news/2022/9/19/community-responds-to-orfgs-call-to-improve-research-output-tracking" target="_blank">promote the improvement of tracking research output, including funding metadata&lt;/a>, which includes an active working group in this area.&lt;/li>
&lt;li>Crossref will continue to work with publishers and service providers to encourage and make it easier to include funder information in article metadata, including the use of grant identifiers and funder identifiers. Work is underway to bring the &lt;a href="https://www.crossref.org/services/funder-registry" target="_blank">Open Funder Registry&lt;/a> closer to &lt;a href="https://ror.org" target="_blank">ROR (Research Organisation Registry)&lt;/a>, and is planning, at some point in the future, to merge the OFR into ROR, as ROR has a much wider scope and is more broadly community-governed. Crossref has also begun some work on collecting ROR IDs where we currently collect Funder IDs. More technical information is available in &lt;a href="https://crossref.atlassian.net/browse/CR-1208" target="_blank">this ticket&lt;/a>).&lt;/li>
&lt;/ul>
&lt;p>We would like to thank all the participants of the workshop for their openness and commitment to working through these issues together. It was a rare opportunity to share insights from publishers, service providers, funders, and researchers - and a useful first step in co-creating a shared understanding of the challenges and charting a path forward.&lt;/p></description></item><item><title>Perspectives: My thoughts on starting my new role at Crossref</title><link>https://www.crossref.org/blog/perspectives-my-thoughts-on-starting-my-new-role-at-crossref/</link><pubDate>Thu, 06 Jul 2023 00:00:00 +0000</pubDate><author>Johanssen Obanda</author><guid>https://www.crossref.org/blog/perspectives-my-thoughts-on-starting-my-new-role-at-crossref/</guid><description>&lt;p>My name is Johanssen Obanda. I joined Crossref in February 2023 as a Community Engagement Manager to look after the Ambassadors program and help with other outreach activities. I work remotely from Kenya, where there is an increasing interest in improving the exposure of scholarship by Kenyan researchers and ultimately by the wider community of African researchers. In this blog, I’m sharing the experience and insights of my first 4 months in this role.&lt;/p>
&lt;p>Right before joining Crossref, I was working as Stakeholder Manager with &lt;a href="http://africarxiv.org/" target="_blank">AfricArXiv&lt;/a>, a community-led digital archive for African research communication. I transitioned to working with Crossref to take up a more challenging role, so I can apply the community-building and social innovation skills I gained over the last five years in my profession.&lt;/p>
&lt;p>What surprised me the most here is realising that such a robust infrastructure is being administered by a relatively small team. I wondered how the team keeps the services running and builds new solutions for the community. However, I am impressed by the collaborative culture, positive and healthy work environment, and great systems.&lt;/p>
&lt;p>I work within the Community Engagement and Communications team, where we collaboratively address members’ questions and challenges, plan events, create helpful content for our community and keep in touch with them. We help grow our community and create a better experience using our products and services.&lt;/p>
&lt;p>My main focus has been the Ambassador programme, which started in 2018 and currently comprises 48 Ambassadors globally. The Ambassadors are our trusted contacts who support and engage our communities locally to make scholarly communications better. Through one-on-one virtual interaction with most of them, I noted that there was little interaction among the Ambassadors. Most of our Ambassadors want to connect more, both face-to-face and online. In the coming months, we aim to design our meetings together with the Ambassadors to encourage better exchange and relationships.&lt;/p>
&lt;p>I value Crossref’s insistence on diversity, equity and inclusion, and I enjoy contributing to those activities. Working with my colleagues in the outreach team to organise webinars and activities for the &lt;a href="https://www.crossref.org/gem/" target="_blank">Global Equitable Membership (GEM) programme&lt;/a> has been an exciting experience. I particularly enjoyed engaging with our Ambassadors Shaharima Parvin and Jahangir Alam from Bangladesh, and Binayak Pandey from Nepal, in organising the initial webinars for the GEM program in their countries. I feel it is one of the ways of creating more in-depth connections between our communities and our Ambassadors while making it possible for more institutions to be part of Crossref and contribute to scholarly communication.&lt;/p>
&lt;p>I have made a few webinar presentations online and recently did one in-person &lt;a href="https://www.crossref.org/pdfs/SRI2023-conference-Crossref-Poster.pdf">poster&lt;/a> presentation in South Africa at the Sustainability, Research and Innovation conference. I gained more confidence interacting with the wider Crossref community and a deeper understanding of Crossref’s services. I look forward to more opportunities to discuss Crossref’s mission with the community and to collaborate with like-minded organisations, contributing to joint initiatives, such as the upcoming Better Together webinar series with ORCID and DataCite, and the &lt;a href="https://forumforopenresearch.com/" target="_blank">Forum for Open Research in MENA&lt;/a> events.&lt;/p>
&lt;p>I experienced the challenges of working remotely in many ways. A couple of days, there was no power, other days the internet connection was painfully slow, and hopping from one restaurant to another was something I had to deal with from time to time, with the hopes of finding quiet most times to have a good meeting with my colleagues, until I had more dependable work station. On the positive side, coordinating meeting times with colleagues, taking on tasks asynchronously and collaborating in real-time across different tools are making me more agile, patient and empathetic with myself and my colleagues.&lt;/p>
&lt;p>I am driven by the impact I want to contribute in my career working with Crossref, which is to build an inclusive research ecosystem where researchers across the globe can easily access scientific knowledge and make meaningful connections. And I feel confident about my colleagues, our systems and infrastructure and my capabilities to be part of a thriving community and organisation.&lt;/p></description></item><item><title>A Request for Comment - Automatic Digital Preservation and Self-Healing DOIs</title><link>https://www.crossref.org/blog/a-request-for-comment-automatic-digital-preservation-and-self-healing-dois/</link><pubDate>Thu, 29 Jun 2023 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/a-request-for-comment-automatic-digital-preservation-and-self-healing-dois/</guid><description>&lt;p>Digital preservation is crucial to the &amp;ldquo;persistence&amp;rdquo; of persistent identifiers. Without a reliable archival solution, if a Crossref member ceases operations or there is a technical disaster, the identifier will no longer resolve. This is why the Crossref member terms insist that publishers make best efforts to ensure deposit in a reputable archive service. This means that, if there is a system failure, the DOI will continue to resolve and the content will remain accessible. This is how we protect the integrity of the scholarly record.&lt;/p>
&lt;p>I will write another post, soon, on the reality of preservation of items with a Crossref DOI, but recent work in the Labs team has determined that we have a situation of drastic under-preservation of much scholarly material that has been assigned a persistent identifier. In particular, content from our smaller Crossref members, with limited financial resources, is often precariously preserved. Further, DOI URLs are not always updated, even when, for instance, the underlying domain has been registered by a different third party. This results in DOIs pointing to new, hijacked, and elapsed content that does not reflect the metadata that we hold.&lt;/p>
&lt;p>We (&lt;a href="https://www.crossref.org/people/geoffrey-bilder/" target="_blank">Geoffrey&lt;/a>) have (has) long-harboured ambitions to build a system that would allow for automatic deposit into an archive and then to present access options to the resolving user. This would ensure that all Crossref content had at least one archival solution backing it and greatly contribute to the improved persistent resolvability of our DOIs. We refer to this, internally, as &amp;ldquo;Project Op Cit&amp;rdquo;. And we&amp;rsquo;re now in a position to begin building it.&lt;/p>
&lt;p>However, we need to get this right from the design phase out. We need input from librarians working in the digital preservation space. We need input from members on whether they would use such a service. We are not digital preservation experts and we are acutely aware that we need the expertise of those who are, particularly where we&amp;rsquo;ve had to take some shortcuts. For instance: we are aware that the Internet Archive is perhaps not the first choice of many digital preservation librarians and specialists, who opt for specific scholarly-communications solutions. However, it is easy, open, and free. Hence, we propose for the prototype to use IA, on the assumption that this will be a proof-of-concept only, which we will expand to other archives if there is demand and once it works.&lt;/p>
&lt;p>So: please do read the below and add your comments and questions to this thread in the community forum (link below), or &lt;a href="mailto:meve@crossref.org">send me queries/concerns by email&lt;/a>. It would be excellent if we could receive comments by mid-August 2023. If you would rather comment on a Google doc, &lt;a href="https://docs.google.com/document/d/1UHW8n_ohJhETc4aLK6ZHB3OK0A0270mgM1l4IsNudZ0/edit?usp=sharing" target="_blank">that&amp;rsquo;s also possible&lt;/a>.&lt;/p>
&lt;p>If enough people are interested, we could also host a community call to discuss this design and its prototyping. Do please, when emailing, let me know if this is of interest.&lt;/p>
&lt;hr>
&lt;h2 id="project-op-cit-self-healing-dois">Project Op Cit (Self-Healing DOIs)&lt;/h2>
&lt;h2 id="request-for-comment">Request for Comment&lt;/h2>
&lt;p>This document sets out the problem statement, a proposed prototype solution, and a transition path to production if successful.&lt;/p>
&lt;h2 id="proposed-prototype-solution">Proposed Prototype Solution&lt;/h2>
&lt;p>For members who opt-in to the service, We have a special class of DOI (only for open-access content) where, when the DOI is registered:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>We immediately make an archive of the item with any archiving services that care to participate in the project (minimally, the Internet Archive, which is the easiest for us to begin with, but a modular/pluggable archival system). The &lt;a href="https://archive.org/developers/internetarchive/" target="_blank">Internet Archive Python Librar&lt;/a>y should let us submit to them. We could pursue other arrangements with CLOCKSS, LOCKSS, and Portico.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We update the XML to reflect the archives to which it has been submitted.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The DOI landing page is redirected to an interstitial page that we control. This page gives the user access options.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We develop processes to determine whether the original URL &amp;ldquo;works&amp;rdquo;. The heuristics that define whether a resource has changed substantially or works need long-term consideration and real-world testing. Using the interstitial page approach will allow us to refine this, with a long-term goal of eradicating it.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;center>&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2023/deposit-process-blog.png"
alt="Image showing the process flow from user to OpCit Deposit Endpoint to Preservation System (archive) to Crossref Deposit System (Live API)" width="75%">&lt;figcaption>
&lt;p>Figure 1: The Deposit Process&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2023/resolution-process-blog.png"
alt="Image showing the process flow from user to DOI to OpCit Deposit Endpoint to Preservation System (preserved copy) to Crossref Deposit System (original publications)" width="75%">&lt;figcaption>
&lt;p>Figure 2: The Resolution Process&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/center>&lt;/p>
&lt;h3 id="potential-challenges">Potential Challenges&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Content drift. It would be extremely difficult to detect content change vs. (eg) page structure change, except in the case of binary fulltext. However, we can poll for the DOI at an HTML endpoint and detect when binary fulltext items, such as a PDF, change.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Latency on resolver if lookup is real-time. For this reason, we need a periodic crawler so that resolvers do not wait for real-time detection on access.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>If using Internet Archive, the domain owner (at the present moment) can request the removal of content. We would need the capacity to &amp;ldquo;lock&amp;rdquo; records that are being used as Op Cit redirection archival copies. This requires a further conversation with the Internet Archive.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="prototype-componentsarchitecture">Prototype Components/Architecture&lt;/h2>
&lt;h3 id="registration-proxy-and-database-fleming">Registration Proxy and Database (&amp;ldquo;Fleming&amp;rdquo;)&lt;/h3>
&lt;p>The registration proxy implements a pass-through to the deposit API and hosts a relational database of self-healing DOIs (Postgres). It will be hosted at api.labs.crossref.org/deposit/opcit and clients will have to use this endpoint to deposit. Simultaneously, the proxy will:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Determine the license status of the incoming item.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>If the license is open and fulltext is provided, deposit a copy in selected digital preservation archives. Store proof of licensing attestation.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>In the case of binary files (fulltext PDF), store a hash of the content.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Store the DOI, binary hash, and all URLs in a relational database under &amp;ldquo;pending&amp;rdquo; state.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Pass through the request to Crossref&amp;rsquo;s content registration system.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Monitor the result of this request and remove stored data if registration fails.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Re-registration through Fleming will update existing entries and re-fix their data against content drift at this time.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="spider-shelob">Spider (&amp;ldquo;Shelob&amp;rdquo;)&lt;/h3>
&lt;p>A series of components that:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Check that &amp;ldquo;pending&amp;rdquo; DOIs have been successfully registered. Remove those that have not and move those that have to &amp;ldquo;active&amp;rdquo; state.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Dereference &amp;ldquo;active&amp;rdquo; DOIs and ensure that we have the most current URL in case updates have gone directly to the live resolver.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Periodically crawl URLs in the self-healing database.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>On HTTP 301 code, update database entry to point to new permanent URL.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>On HTTP 302 code, follow the temporary redirect expecting the original content.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>On HTTP 4xx codes, mark the entry as dead.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>On HTTP 200 code of HTML landing page, parse the page for the presence of the DOI. If the DOI is not present, mark the entry as dead.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="resolver-proxy-hippocrates">Resolver Proxy (&amp;ldquo;Hippocrates&amp;rdquo;)&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Display an interstitial landing page with archival versions and an explanation.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>At some future point, for active entries, resolve to the stored URL (faster but could be de-synced) or pass the request to the live resolver (requires an extra hop but will always be in-sync with deposit).&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="observability-and-statistics">Observability and statistics&lt;/h3>
&lt;p>Metrics we will collect:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Count of DOIs using Op Cit&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Count of visitors arriving on Op Cit landing pages&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Usage count of each outgoing link/access option&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>A daily report will present:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Newly &amp;ldquo;failed&amp;rdquo; entries that we believe have died&lt;/p>
&lt;/li>
&lt;li>
&lt;p>These will be checked extensively, particularly at first, to ascertain whether our failure heuristics are valid&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Entries that have recovered&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Errors will be logged and monitored via Grafana.&lt;/p>
&lt;h3 id="documentation-and-automated-tests">Documentation and Automated Tests&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Core assumptions and new behaviours of the platform will be documented as part of the prototype.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Automated tests will be written, especially for the spider (&amp;ldquo;Shelob&amp;rdquo;), which must handle a diverse variety of real-world situations.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="prototype-architecture-requirements">Prototype Architecture Requirements&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Postgres RDS for resolution/self-healing DOI data (AWS).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>FastAPI hosting for passthrough proxy (fly.io).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>EC2 hosting for the spider (AWS).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>FastAPI hosting for resolver proxy (fly.io).&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="transition-to-production">Transition to Production&lt;/h2>
&lt;p>If this prototype garners popular appeal, a transition to production would need to keep some prototype components and rewrite others.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&amp;ldquo;Fleming&amp;rdquo; would need to be rewritten as a deposit module / integrated with Manifold&amp;rsquo;s (the next-generation system at Crossref) deposit. If this would create too much overhead, it need not be a blocking process in the deposit.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&amp;ldquo;Shelob&amp;rdquo; would continue to need to run continuously and to scale with the adoption of self-healing DOIs unless one of the other options were used.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Prototype architecture will be written so that spidering can be distributed between several servers, if required.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&amp;ldquo;Hippocrates&amp;rdquo; would need to be integrated into the live link resolver. Depending on how a field for a self-healing DOI is embedded in Manifold, this may not need any additional database hits.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="back-content">Back Content&lt;/h3>
&lt;p>We also have a database of back content stored by the Internet Archive, mapped to DOIs where they have been able to do so. This data source could be used to enable self-healing DOIs on all content in this archive.&lt;/p></description></item><item><title>Crossref Research and Development: Releasing our Tools from the Ground Up</title><link>https://www.crossref.org/blog/crossref-research-and-development-releasing-our-tools-from-the-ground-up/</link><pubDate>Wed, 21 Jun 2023 00:00:00 +0000</pubDate><author>Martin Eve</author><guid>https://www.crossref.org/blog/crossref-research-and-development-releasing-our-tools-from-the-ground-up/</guid><description>&lt;p>This is the first post in a series designed to showcase what we do in the Crossref R&amp;amp;D group, also known as &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a>, which over the last few years has been strengthened, first with Dominika Tkaczyk and Esha Datta, last year with part of Paul Davis’s time, and more recently, yours truly. Research and development are, obviously, crucial for any organisation that doesn’t want to stand still. The R&amp;amp;D group builds prototypes, experimental solutions, and data-mining applications that can help us to understand our member base, in the service of future evolution of the organisation. One of the strategic pillars of Crossref is that we want to contribute to an environment in which the scholarly research community identifies shared problems and co-creates solutions for broad benefit. We do this in all teams through research and engagement with our expanding community.&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/img/labs/creature3.svg" alt="The Crossref Labs Creature"> &lt;img src="https://www.crossref.org/img/labs/creature2.svg" alt="The Crossref Labs Creature"> &lt;img src="https://www.crossref.org/img/labs/creature1.svg" alt="The Crossref Labs Creature">&lt;/p>
&lt;p>For example, if the metadata team wants to implement a new field in our schema, it helps to have a prototype to show to members. The Labs team would implement such a prototype. If we want to know the answer to a question about the 150m or so metadata records we have – e.g. how many DOIs are duplicates? – it’s the Labs team that will work on this.&lt;/p>
&lt;p>When building such prototypes, which can often seem esoteric and one-off, though, it can be easy to believe that there is no way anybody else would re-use our components. At the same time, we find ourselves consistently working with the same infrastructures, re-using different code blocks across many applications. One of the tasks I have been working on is to extract these duplicated functions and to get them into external code libraries.&lt;/p>
&lt;p>Why is this important? As many readers doubtless know, Crossref is committed to &lt;a href="https://doi.org/10.24343/C34W2H" target="_blank">The Principles of Open Scholarly Infrastructure&lt;/a>. For reasons of insurance, everything we do and newly develop is open source and we want our members to be able to re-use the software that we create. It’s also important because, if we centralize these low-level building blocks, we make it much easier to fix bugs when they occur, which would otherwise be distributed across all of our projects.&lt;/p>
&lt;p>As a result, Crossref Labs has a series of small code libraries that we have released for various service interactions. We often find ourselves needing to interact with AWS services. Indeed, Crossref’s live systems are in the process of transitioning to running in the cloud, rather than our own data centre. It makes sense, therefore, for prototype Labs systems to run on this infrastructure, too. However, the boto3 library is not terribly Pythonic. As a result, many of our low-level tools interact with AWS. These include:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://gitlab.com/crossref/labs/claws" target="_blank">CLAWS: the Crossref Labs Amazon Web Services toolkit&lt;/a>. The CLAWS library gives speedy and Pythonic access to functions that we use again and again. This includes downloading files from and pushing data to S3 buckets (often in parallel/asynchronously), fetching secrets from AWS Secrets Manager, generating pre-signed URLs, and more.&lt;/li>
&lt;li>&lt;a href="https://gitlab.com/crossref/labs/longsight" target="_blank">Longsight: A range of common logging functions for the observability of Python AWS cloud applications&lt;/a>. Less mature than CLAWS, this is the starting point for observability across Labs applications. It supports running in AWS Lambda function contexts or pushing your logs to AWS Cloudwatch from anywhere else. It also supports logging metrics in structured forms. Crucially, the logs are all converted into machine-readable JSON format. This allows us to export the metrics into Grafana dashboards to visualize failure and performance.&lt;/li>
&lt;li>&lt;a href="https://gitlab.com/crossref/labs/distrunner" target="_blank">Distrunner: decentralized data processing on AWS services&lt;/a>. Easily the least mature and experimental of these libraries, distrunner is one of the ways that we distribute the workloads of our recurrent data processing. A number of the Labs projects require us to run recurrent data-processing tasks. For instance, my colleague Dominika Tkaczyk has developed the &lt;a href="https://gitlab.com/crossref/labs/sampling-framework" target="_blank">sampling framework&lt;/a> that is regenerated once per week. We use Apache Airflow (and, specifically, Amazon Managed Workflows for Apache Airflow) to host these periodic tasks. This is useful because it gives us quick, visual oversight if tasks fail. However, the Airflow worker instances on AWS are quite severely underpowered and unsuitable for large in-memory activities. Hence, the sampling framework fires up a Spark instance for its processing. Often, though, we do not need the parallelization of Spark and just want to be able to run a generic Python script in a more powerful environment. That’s what distrunner is designed to do. The current version uses &lt;a href="https://www.coiled.io/" target="_blank">Coiled&lt;/a> but this may change in the future.&lt;/li>
&lt;/ul>
&lt;p>While these tools will be useful to nobody except programmers – and this has been quite a technical post – there is a broader philosophical point to be made about this approach, in which everything is available for re-use, “from the ground up”. The point is: we also try, in Labs and in the process of “R&amp;amp;Ding”, to work without privileged access. That is: I don’t get “inside” access to a database that isn’t accessible to external users. I have to work with the same APIs and systems as would an end-user of our services. This means that, when we develop internal libraries, it’s worth releasing them. Because they use systems that are accessible to any of our users.&lt;/p>
&lt;p>I should also say that our openness is more than unidirectional. While we are putting a lot of effort into ensuring that everything new we put out is openly accessible, we are also open to contributions coming in. If we’ve built something and you make changes or improve it, please do get in touch or submit a pull request. Openness has to work both ways if projects are truly to be used by the community.&lt;/p>
&lt;p>Future posts – coming soon! – will introduce some of the technologies and projects that we have been building atop this infrastructure. This includes a Labs API system; new functionality to retrieve unpaginated datasets of whole API routes; a study of the preservation status of DOI-assigned content; and a mechanism for modeling new metadata fields.&lt;/p></description></item><item><title>Our annual call for board nominations</title><link>https://www.crossref.org/blog/our-annual-call-for-board-nominations/</link><pubDate>Tue, 30 May 2023 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/our-annual-call-for-board-nominations/</guid><description>&lt;p>The Crossref Nominating Committee invites expressions of interest to join the Board of Directors of Crossref for the term starting in March 2024. The committee will gather responses from those interested and create the slate of candidates that our members will vote on in an election in September.&lt;/p>
&lt;p>Expressions of interest will be due Monday, June 26th, 2023.&lt;/p>
&lt;h3 id="about-the-board-elections">About the board elections&lt;/h3>
&lt;p>The board is elected through the “one member, one vote” policy wherein every member organisation of Crossref has a single vote to elect representatives to the Crossref board. Board terms are for three years; this year, seven seats are open for election.&lt;/p>
&lt;p>The board maintains a balance of seats, with eight seats for smaller members and eight seats for larger members (based on total revenue to Crossref). This is to ensure that the diversity of experiences and perspectives of the scholarly community are represented in decisions made at Crossref.&lt;/p>
&lt;p>This year we will elect two of the larger member seats (membership tiers $3,900 and above) and five of the smaller member seats (membership tiers $1,650 and below). You don’t need to specify which seat you are applying for. We will provide that information to the nominating committee.&lt;/p>
&lt;p>The election takes place online, and voting will open in September. Election results will be shared at the annual meeting on October 31st. New members will commence their term in March 2024.&lt;/p>
&lt;h3 id="about-the-nominating-committee">About the Nominating Committee&lt;/h3>
&lt;p>The Nominating Committee reviews the expressions of interest and selects a slate of candidates for election. The slate put forward will exceed the total number of open seats. The committee considers the statements of interest, organisational size, geography, and experience.&lt;/p>
&lt;p>2023 Nominating Committee:&lt;/p>
&lt;ul>
&lt;li>Aaron Wood, American Psychological Association, chair*&lt;/li>
&lt;li>Oscar Donde, Pan Africa Science Journal*&lt;/li>
&lt;li>David Haber, American Society for Microbiology&lt;/li>
&lt;li>Rose L’Huillier, Elsevier*&lt;/li>
&lt;li>Marie Souliere, Frontiers&lt;/li>
&lt;/ul>
&lt;p>(*) indicates Crossref board member&lt;/p>
&lt;h3 id="what-does-the-committee-look-for">What does the committee look for&lt;/h3>
&lt;p>The committee looks for skills and experience that will complement the rest of the board. Candidates from countries and regions that are not currently reflected on the board are strongly encouraged to apply. Successful candidates often have some or all of these characteristics:&lt;/p>
&lt;ul>
&lt;li>demonstrate a commitment to or understanding of our &lt;a href="https://www.crossref.org/strategy/" target="_blank">strategic agenda&lt;/a> or the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a>;&lt;/li>
&lt;li>have expertise that may be underrepresented on the board currently;&lt;/li>
&lt;li>hold senior/director-level positions in their organisations;&lt;/li>
&lt;li>have experience with governance or community involvement;&lt;/li>
&lt;li>represent member organisations that are active in the scholarly communications ecosystem;&lt;/li>
&lt;li>demonstrate metadata best practices as shown in the member’s &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a>&lt;/li>
&lt;/ul>
&lt;h3 id="board-roles-and-responsibilities">Board roles and responsibilities&lt;/h3>
&lt;p>Crossref’s services provide a central infrastructure to scholarly communications. Crossref’s board helps shape the future of our services and, by extension, impacts the broader scholarly ecosystem. We are looking for board members to contribute their experience and perspective.&lt;/p>
&lt;p>The role of the board at Crossref is to provide strategic and financial oversight of the organisation, as well as guidance to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ul>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ul>
&lt;p>The board is representative of our membership base and guides the staff leadership team on trends affecting scholarly communications. The board sets strategic directions for the organisation while also providing oversight into policy changes and implementation. Board members have a fiduciary responsibility to ensure sound operations. Board members do this by attending board meetings, as well as joining more specific board committees.&lt;/p>
&lt;h3 id="who-can-apply-to-join-the-board">Who can apply to join the board?&lt;/h3>
&lt;p>Any active member of Crossref can apply to join the board. Crossref membership is open to organisations that produce content, such as academic presses, commercial publishers, standards organisations, and research funders.&lt;/p>
&lt;h3 id="what-is-expected-of-board-members">What is expected of board members?&lt;/h3>
&lt;p>Board members attend three meetings each year that typically take place in March, July, and November. Meetings have taken place in various international locations, and travel support is provided when needed. March and November board meetings are held virtually, and all committee meetings take place virtually. Each board member should sit on at least one Crossref committee. Care is taken to accommodate the wide range of timezones in which our board members live.&lt;/p>
&lt;p>While the expressions of interest are specific to an individual, the seat that is elected to the board belongs to the member organisation. The primary board member also names an alternate who may attend meetings if the primary board member cannot. There is no personal financial obligation to sit on the board. The member organisation must remain in good standing.&lt;/p>
&lt;p>Board members are expected to be comfortable assuming the responsibilities listed above and to prepare and participate in board meeting discussions.&lt;/p>
&lt;h3 id="how-to-apply">How to apply&lt;/h3>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/1XMsDlKx7-ZoyB0uAWmKt6QBl2z2QennvgiG4pprxW94/edit" target="_blank">click here to submit your expression of interest&lt;/a>. We ask for a brief statement about how your organisation could enhance the Crossref board and a brief personal statement about your interest and experience with Crossref.&lt;/p>
&lt;p>Please contact me with any questions at &lt;a href="mailto:lofiesh@crossref.org">lofiesh@crossref.org&lt;/a>&lt;/p></description></item><item><title>Metadata connects the global community – summary of our Community update 2023</title><link>https://www.crossref.org/blog/metadata-connects-the-global-community-summary-of-our-community-update-2023/</link><pubDate>Fri, 12 May 2023 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/metadata-connects-the-global-community-summary-of-our-community-update-2023/</guid><description>&lt;p>We were delighted to engage with over 200 community members in our latest Community update calls. We aimed to present a diverse selection of highlights on our progress and discuss your questions about participating in the Research Nexus. For those who didn’t get a chance to join us, I’ll briefly summarise the content of the sessions here and I invite you to join the conversations on the Community Forum.&lt;/p>
&lt;p>You can take a look at the slides here and the recordings of the calls are available &lt;a href="https://zenodo.org/record/7921925#.ZFzh3OzMKrc" target="_blank">here&lt;/a>.&lt;/p>
&lt;h3 id="tldr">TL;DR&lt;/h3>
&lt;ul>
&lt;li>The membership is growing, including that in the GEM programme countries, and we focus on adding new Sponsors in areas where we have insufficient coverage to support prospective members&lt;/li>
&lt;li>The grant registration form is available for funders who don’t use XML, and we’re working to expand to other record types&lt;/li>
&lt;li>The preview of the Relationship API endpoint is available – start exploring relationships between different records and record types, from citations to funding, and more&lt;/li>
&lt;li>Usefulness of metadata records for inferring integrity of the content or publisher relies on all members of the community contributing to this effort. Crossref will continue to enrich our schema to capture new types of relevant information and to promote the best metadata practices.&lt;/li>
&lt;li>Cited-by is now open for everyone to use 🎉 – no need for additional authorisation steps – &lt;strong>Registering your references will have even greater impact now!&lt;/strong>&lt;/li>
&lt;li>The Labs participation report is available and it’s been a hit. Please note that this tool is still underdevelopment – new functionalities can be added but there might also be bugs that we are yet to resolve, so don’t hold off with feedback.&lt;/li>
&lt;li>We’ve received close to 1,000 responses in our first ever Metadata Priorities Survey. It’s still open until 18th of May and we encourage all members to take it. So far we’ve learnt that majority of our respondents are keen to deposit as much metadata as possible – and some would like to register more than we currently enable.&lt;/li>
&lt;/ul>
&lt;h3 id="metadata-completeness-and-integrity">Metadata completeness and integrity&lt;/h3>
&lt;p>A key theme of the call was encouraging greater participation in the Research Nexus and the importance of complete metadata. One particular benefit of a rich and transparent metadata network is the opportunity to infer judgments on the integrity of the scholarly record (ISR). Amanda Bartell, Head of Member Experience, highlighted that the community agrees that availability of information about relationships between research outputs, institutions and other elements of the scholarly ecosystem together provide essential context for deciding about trustworthiness of organisations and their published content. Conversely, it can make it harder for parties to pass off information as trustworthy when that context is missing. Amanda summarised community feedback related to Crossref’s role in the integrity of the scholarly record in &lt;a href="https://doi.org/10.64000/3b445-2zr32" target="_blank">her recent blog post&lt;/a>.&lt;/p>
&lt;p>Our members can contribute to that rich network of relationships by curating their metadata and providing contextual information – especially the highly sought for elements highlighted in the presentation.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/help-context.png"
alt="Screenshot of slide how can you help start adding text" width="75%">
&lt;/figure>
&lt;h3 id="our-community">Our community&lt;/h3>
&lt;p>Since LIVE22, we have had 1,130 new members join us. That includes 51 organisations from countries included in our Global Equitable Membership (GEM) programme. You can find out more in &lt;a href="https://community.crossref.org/t/global-equitable-membership-gem-program-update/3518" target="_blank">the latest news about the programme on our Community Forum&lt;/a> from Susan Collins, Community Engagement Manager.&lt;/p>
&lt;p>We see great opportunities with enriching our metadata corpus with works carried out in some of the least economically-advantaged regions of the world. Registering their content with us will increase its discoverability for the global scholarship, while adding important relationships into the Research Nexus. We’re glad at the new members joining us under the auspices of the Global Equitable Membership (GEM) programme and we’re reaching out to existing and new communities with our Ambassadors, to encourage more metadata registrations.&lt;/p>
&lt;p>Our Sponsors and Ambassadors, alongside our Outreach and Membership Team, support members to participate as effectively as possible in the Research Nexus. We’re delighted to see both programmes growing, with eight new Sponsors and seven new Ambassadors having joined us since October.&lt;/p>
&lt;p>Simultaneously, we’re working with like-minded organisations to provide useful resources for the growing and changing scholarly communications community. The recent launch of the online forum for new publishers seeking to learn about best practices in the industry, &lt;a href="https://theplace.discourse.group/" target="_blank">The PLACE&lt;/a>, is another way in which we hope to support wider participation in the Research Nexus, and promote open and sustainable practices.&lt;/p>
&lt;p>With our growing community, there’s always interest in We have planned &lt;a href="https://www.crossref.org/events/">a webinar&lt;/a> later this month to provide an overview of Crossref – including the members benefits and obligations, and how to use our services.&lt;/p>
&lt;h3 id="service-news">Service news&lt;/h3>
&lt;p>References metadata is essential for connecting works with one another. It enables provision of citation information, aids discoverability for researchers, as well as assessment and evaluation for institutions and funders. It’s almost a year since all the references metadata deposited with Crossref has been made openly available. At the moment, 52.0% of journal articles, and 44.5% of all works have references. Martyn Rittman, Product Manager for the Cited-by service says “It’s not bad, but we can do better!”&lt;/p>
&lt;p>With three different mechanisms for doing it available to our members, we hope that all have a suitable tool to fit with their needs. You can register references with XML via HTTPS POST (structured or unstructured), with the dedicated OJS Plugin if you’re an OJS user, or with our Simple Text Query (unstructured text) – this is especially relevant to the Web Deposit Form users. We find that journal articles with deposited references seem to be cited more than those without, and by a lot: 21.8 vs. 6.1 incoming citations on average!&lt;/p>
&lt;p>We have now made our Cited-by service open to all. To realise its full benefit, it is essential to register your references.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/citedby-blog.png"
alt="Screenshot of slide Cited by" width="75%">
&lt;/figure>
&lt;p>&lt;br>There were concerns in the community about references ‘lost’ as part of supplementary material that may not be registered in its own right. Colleagues advised that if the data has an identifier, such as a DataCite DOI, you can add a relationship to say that it&amp;rsquo;s supplementary material (see &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/relationships/">https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/relationships/&lt;/a>) or add them as a reference. Martyn is curious to hear from others in the community on this topic. There is an increasing focus on data citations and we&amp;rsquo;d like to see how we can better support them.&lt;/p>
&lt;p>Many members have questions related to plans for &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">replacing Metadata Manager&lt;/a>. Rachael Lammey, Director of Product, explained that we’re working on broadening our new &lt;a href="https://www.crossref.org/documentation/register-maintain-records/grant-registration-form/">Grant Registration Form&lt;/a> to include more record types over the course of 2023. It has a few advantages over the current Web Deposit Form. It allows you to save a local copy once you first register a piece of content. It makes updating your records easier, as you can drop that file onto the form to add the metadata so that you can update it and redeposit rather than having to fill out the information all over again, and we have started adding automatic lookup fields to help users populate information on affiliations using ROR IDs more accurately. We will keep you posted on the progress with new developments and ask for beta testers for new record types as they are added.&lt;/p>
&lt;p>Metadata information about individual work is not as useful as the opportunity to interrogate the relationships between works and within the global scholarly output. [The preview of the Relationship API endpoint](&lt;a href="https://community.crossref.org/t/relationships-are-here/3523" target="_blank">https://community.crossref.org/t/relationships-are-here/3523&lt;/a>, modest as it is at this stage – with only 1% of our relationship metadata included (or 10 mln relationships) – offers a powerful demonstration of the way in which metadata contextualises research outputs within the entangled network of ever-progressing scholarship.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/code-for-blog.png"
alt="Screenshot of code" width="50%">
&lt;/figure>
&lt;p>&lt;br>We’ve also mentioned the recent &lt;a href="https://community.crossref.org/t/get-in-on-the-action-help-shape-our-website-with-your-feedback/3431" target="_blank">transition of our website to GitLab&lt;/a>, which allows everyone to contribute by creating merge requests and issues. Through this open collaboration, which supports our commitment to meet the Principles of Open Scholarly Infrastructure, we aim to cultivate a sense of ownership among contributors and make our information and documentation more useful and efficient for everyone.&lt;/p>
&lt;h3 id="labs-participation-report">Labs participation report&lt;/h3>
&lt;p>For organisations who wish to keep a close eye on their metadata – to understand what they deposit, how that compares with other members, and what could be improved, can start using our Lab participation reports. We encourage you to test this not-yet-finished tool and let us know your feedback. Participants at our updates found it very informative, with the opportunity to preview contents of recent deposits, see the participation breakdowns by a prefix, and improved data visualisation.
We had questions about how data citation counts are generated in the report. Martyn Rittman explained that: “This is a prototype and that&amp;rsquo;s one of the issues we need to tidy up! We know via Event Data and our Scholix endpoint what is a dataset, but that hasn&amp;rsquo;t yet been incorporated to the Labs Reports”. There was also a suggestion of enabling export of simple lists of all member’s DOIs with respective URLs from the report and the team might look into that. Yet, lists of DOIs missing specific metadata types are already downloadable.&lt;/p>
&lt;p>To learn more about the reports, try them out, and to provide feedback, please take a look at &lt;a href="https://community.crossref.org/t/ticket-of-the-month-april-2023-the-new-labs-reports-are-here/3528" target="_blank">the information shared recently by Paul Davis&lt;/a>, Tech Support Specialist &amp;amp; R&amp;amp;D Support Analyst.&lt;/p>
&lt;h3 id="metadata-priorities">Metadata priorities&lt;/h3>
&lt;p>Patricia Feeney, Head of Metadata, shared some updates about the current metadata corpus registered with Crossref, and some recent trends.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/metadata-trends-blog.png"
alt="Screenshot of side titled metadata trends" width="75%">
&lt;/figure>
&lt;p>&lt;br>She then went on to summarise some preliminary results of our ongoing &lt;a href="https://community.crossref.org/t/take-our-metadata-priorities-survey-by-may-18/3498" target="_blank">metadata priorities survey&lt;/a>, which all members are encouraged to take part in by &lt;strong>18th of May&lt;/strong>. So far, we’ve received close to 1,000 responses. We’ve learnt that majority of our respondents are keen to deposit as much metadata as possible – and some would like to register more than we currently enable. Close to a half of the respondents who did not express an interest in sharing all metadata are still interested to learn more about the value of their metadata.&lt;/p>
&lt;p>She then went on to summarise some preliminary results of our ongoing &lt;a href="https://community.crossref.org/t/take-our-metadata-priorities-survey-by-may-18/3498" target="_blank">metadata priorities survey&lt;/a>, which all members are encouraged to take part in by &lt;strong>18th of May&lt;/strong>. So far, We’ve received close to 1,000 responses. We’ve learnt that majority of our respondents are keen to deposit as much metadata as possible – and some would like to register more than we currently enable. However, close to a half of the respondents are interested to learn more about the value of their metadata.&lt;/p>
&lt;p>The survey consults our members about their preferences for developing any of the potential projects under consideration:&lt;/p>
&lt;ul>
&lt;li>Contributor IDs&lt;/li>
&lt;li>Contributor roles/ CRediT&lt;/li>
&lt;li>Alternate names&lt;/li>
&lt;li>Multilingual metadata&lt;/li>
&lt;li>Expand abstract support&lt;/li>
&lt;li>Citation types (content)&lt;/li>
&lt;li>Conference event IDs&lt;/li>
&lt;/ul>
&lt;p>It appears that support for citation types is the strongest among our respondents, while very polarised views have been shared about multilingual metadata and expanding support for abstracts. Among other suggestions, we received a lot of comments related to keywords. Overall, support for all projects was strong.&lt;/p>
&lt;p>The verdicts are not in yet – still time to respond to the survey and make your metadata priorities known!&lt;/p>
&lt;h3 id="thank-you-and-keep-in-touch">Thank you and keep in touch&lt;/h3>
&lt;p>With much of the content shared ahead of the time through our Community Forum, the sessions were bubbling with questions and valuable comments from the community. We look forward to continuing the conversations asynchronously on the Community Forum. Please don’t hesitate to share your thoughts and ask further questions. We’d also love to hear suggestions for topics of the most interest for our future updates.&lt;/p>
&lt;p>The more complete the metadata we collect together, the more connections in the ecosystem become transparent. This creates opportunities for discovery and collaborations, and greater insights about the scholarly process. Our community is growing in numbers, diversity, and technical capacity for building the Research Nexus together. We welcome your questions and suggestions of initiatives that support the fullest participation possible.&lt;/p></description></item><item><title>2023 public data file now available with new and improved retrieval options</title><link>https://www.crossref.org/blog/2023-public-data-file-now-available-with-new-and-improved-retrieval-options/</link><pubDate>Tue, 02 May 2023 00:00:00 +0000</pubDate><author>Patrick Polischuk</author><guid>https://www.crossref.org/blog/2023-public-data-file-now-available-with-new-and-improved-retrieval-options/</guid><description>&lt;p>We have some exciting news for fans of big batches of metadata: this year’s public data file is now available. Like &lt;a href="https://doi.org/10.64000/wsnyw-yap64" target="_blank">in&lt;/a> &lt;a href="https://doi.org/10.64000/96h9h-b8437" target="_blank">years&lt;/a> &lt;a href="https://doi.org/10.64000/9m04d-1an91" target="_blank">past&lt;/a>, we’ve wrapped up all of our metadata records into a single download for those who want to get started using all Crossref metadata records.&lt;/p>
&lt;p>We’ve once again made &lt;a href="https://academictorrents.com/details/d9e554f4f0c3047d9f49e448a7004f7aa1701b69" target="_blank">this year’s public data file available via Academic Torrents&lt;/a>, and in response to some feedback we’ve received from public data file users, we’ve taken a few additional steps to make accessing this 185 gb file a little easier.&lt;/p>
&lt;p>First, we’re proactively hosting seeds in a few locations around the world to improve torrent download performance in terms of both speed and reliability.&lt;/p>
&lt;p>And second, we’ve added an option to download this year’s public data file directly from Amazon S3 for a small transaction fee paid by the recipient, bypassing the need to use the torrent altogether. The fee just covers the AWS cost of the download. Instructions for downloading the public data file via the &amp;ldquo;Requester Pays&amp;rdquo; method are available on the &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-public-data-files-and-plus-snapshots/" target="_blank">&amp;ldquo;Tips for working with Crossref public data files and Plus snapshots&amp;rdquo;&lt;/a> page.&lt;/p>
&lt;p>The 2023 public data file features over 140 million metadata records deposited with Crossref through the end of March 2023, including over 76,000 grant records. Because Crossref metadata is always openly available, you can use our API to keep your local copy of our metadata corpus up to date with new and updated records.&lt;/p>
&lt;p>In previous years, closed and limited references were removed from the public data file. Since we &lt;a href="https://doi.org/10.64000/b7a98-vbz07" target="_blank">updated our membership terms&lt;/a> to make all deposited references open in 2022, the 2023 public data file for the first time includes all references deposited with us.&lt;/p>
&lt;p>We hope you find this public data file useful. Should you have any questions about how to access or use the file, please see the tips below, or bring your questions to our community forum.&lt;/p>
&lt;h3 id="tips-for-using-the-torrent-and-retrieving-incremental-updates">Tips for using the torrent and retrieving incremental updates&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Use the public data file if you want all Crossref metadata records. Everyone is welcome to the metadata, but it will be much faster for you and much easier on our APIs to get so many records in one file. Here are some &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-public-data-files-and-plus-snapshots/" target="_blank">tips on how to work with the file&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Use the REST API to incrementally add new and updated records once you have the initial file. Here is &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-the-crossref-rest-api/" target="_blank">how to get started&lt;/a> (and avoid getting blocked in your enthusiasm to use all this great metadata!).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>While bibliographic metadata is generally required, because lots of metadata is optional, records will vary in quality and completeness.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Questions, comments, and feedback are welcome at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Similarity Check: look out for a refreshed interface and improvements for iThenticate v2 account administrators</title><link>https://www.crossref.org/blog/similarity-check-look-out-for-a-refreshed-interface-and-improvements-for-ithenticate-v2-account-administrators/</link><pubDate>Mon, 01 May 2023 00:00:00 +0000</pubDate><author>Fabienne Michaud</author><guid>https://www.crossref.org/blog/similarity-check-look-out-for-a-refreshed-interface-and-improvements-for-ithenticate-v2-account-administrators/</guid><description>&lt;p>In 2022, we flagged up &lt;a href="https://doi.org/10.64000/3rqq0-xqq76" target="_blank">some changes to&lt;/a> Similarity Check, which were taking place in v2 of Turnitin&amp;rsquo;s iThenticate tool used by members participating in the service. We noted that further enhancements were planned, and want to highlight some changes that are coming very soon. These changes will affect functionality that is used by account administrators, and doesn&amp;rsquo;t affect the Similarity Reports themselves.&lt;/p>
&lt;p>&lt;em>From Wednesday 3 May 2023&lt;/em>, administrators of iThenticate v2 accounts will notice some changes to the interface and improvements to the Users, Groups, Integrations, Statistics and Paper Lookup sections.&lt;/p>
&lt;h3 id="logging-in">Logging in&lt;/h3>
&lt;p>iThenticate v2 account administrators and browser users will see a new login page when logging in to iThenticate v2:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/simcheck-login-blog.png"
alt="Screenshot of sign in page" width="75%">
&lt;/figure>
&lt;h3 id="a-refreshed-interface">A refreshed interface&lt;/h3>
&lt;p>Once logged in to iThenticate v2, account administrators will see an updated design, with improved notifications to let them know whether a task/action has been successfully completed or not.&lt;/p>
&lt;h3 id="users">Users&lt;/h3>
&lt;p>There will be improvements to the user management system for account administrators, including a much clearer navigation menu for managing active, pending and deactivated users.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/simcheck-dropdown-blog.png"
alt="Screenshot of dropdown options" width="25%">
&lt;/figure>
&lt;p>There will also be a filtering option on the Users page to search for active, pending and deactivated users by first name, last name, email address, group and date added. In addition coloured labels will be introduced to easily identify the level of access (or &amp;lsquo;Role&amp;rsquo;) for each user.&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/pending-userscreen-blog.png"
alt="Screenshot of pending users page" width="100%">
&lt;/figure>
&lt;P>&lt;/p>
&lt;p>An improved bulk user import process will be available, with clearer guidance on any issues that may arise during the upload. This new development will also include new screens for adding and editing users with more notifications to help prevent mistakes.&lt;/p>
&lt;h3 id="integrations">Integrations&lt;/h3>
&lt;p>For account administrators managing peer review management system integrations and needing to generate API keys, the Integrations page will be improved to make copying API keys simpler.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/simcheck-intergrations-blog.png"
alt="Screenshot of Turnitin API Integration page" width="100%">
&lt;/figure>
&lt;h3 id="statistics">Statistics&lt;/h3>
&lt;p>iThenticate v2 administrators will also notice some improvements to the Statistics page. Usage data should load faster and will be sortable by user group. They will also be able to generate large usage reports of over 100k submissions.&lt;/p>
&lt;h3 id="paper-lookup">Paper lookup&lt;/h3>
&lt;p>The Paper lookup will allow iThenticate v2 account administrators to find submissions that have been made from any integration connected to their iThenticate v2 account. They can be found by searching the paper ID (or oid number) of the submission.&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/simcheck-paperlookup-blog.png"
alt="Screenshot of search for a paper page" width="100%">
&lt;/figure>
&lt;P>&lt;/p>
&lt;p>Please note: the ability to search for submissions by the user&amp;rsquo;s name is available for manuscripts submitted via the iThenticate v2 website only and not for papers submitted via an integration.&lt;/p>
&lt;h3 id="new-password-requirements">New password requirements&lt;/h3>
&lt;p>To improve the security of users&amp;rsquo; accounts, new password requirements will be introduced, including a minimum of 8 symbols, 1 special symbol, 1 upper case letter, and 1 number.&lt;/p>
&lt;h3 id="next-in-ithenticate-v2">Next in iThenticate v2&lt;/h3>
&lt;p>Turnitin, who produce iThenticate, are currently working on a number of new features and developments including an improved similarity report, paraphrase and AI writing detection. A detailed timeline is not yet available but we&amp;rsquo;ll be updating you on these new developments in the coming months.&lt;/p>
&lt;p>✏️ Do get in touch via &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> if you have any questions about iThenticate v1 or v2 or start a discussion by commenting on this post below.&lt;/p></description></item><item><title>ISR part four: Working together as a community to preserve the integrity of the scholarly record</title><link>https://www.crossref.org/blog/isr-part-four-working-together-as-a-community-to-preserve-the-integrity-of-the-scholarly-record/</link><pubDate>Wed, 26 Apr 2023 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/isr-part-four-working-together-as-a-community-to-preserve-the-integrity-of-the-scholarly-record/</guid><description>&lt;p>We&amp;rsquo;ve been spending some time speaking to the community about our role in research integrity, and particularly the integrity of the scholarly record. In this blog, we&amp;rsquo;ll be sharing what we&amp;rsquo;ve discovered, and what we&amp;rsquo;ve been up to in this area.&lt;/p>
&lt;p>We’ve discussed in our &lt;a href="https://www.crossref.org/categories/research-integrity/">previous posts&lt;/a> in the “Integrity of the Scholarly Record (ISR)” series that the infrastructure Crossref builds and operates (together with our partners and integrators) captures and preserves the scholarly record, making it openly available for humans and machines through metadata and relationships about all research activity. This &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a> makes it easier and faster for everyone involved in research performance, management, and communications to understand information in context and make decisions about the trustworthiness of organisations and their published research outputs. Conversely, it can make it harder for parties to pass off information as trustworthy when the information doesn&amp;rsquo;t include that context.&lt;/p>
&lt;p>The community needs open scholarly infrastructure that can adapt to the changes in scholarly research and communications, and we’ve been changing and adapting already by building on the concept of the scholarly record with our vision:&lt;/p>
&lt;blockquote>
&lt;p>Like others, we envision a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.&lt;/p>
&lt;/blockquote>
&lt;p>We don’t assess the quality of the work that our members register, and we keep the barriers to membership deliberately low to ensure that we are capturing as much of the scholarly record as possible and encouraging best practice. We are careful to talk about Crossref’s specific role being with the Integrity of the Scholarly Record (ISR), and not the broader area of ‘research integrity’ (i.e. the integrity of the research process or content itself).&lt;/p>
&lt;p>But there are many challenges and threats to research integrity and the integrity of the scholarly record, and there are tradeoffs with keeping the barriers to membership low. With that in mind, we have been dedicating more time to speaking with the community to explore what part we are and should in future play to help the community assess and improve trustworthiness in the scholarly record. We also want to work out where we can make use of our neutral, central role to convene different groups in scholarly communications to work together on these challenges.&lt;/p>
&lt;h2 id="a-revealing-afternoon-in-frankfurt">A revealing afternoon in Frankfurt&lt;/h2>
&lt;p>Our starting point was a roundtable discussion in Frankfurt in October 2022. We organized it to coincide with the Frankfurt Book Fair, but the invited participants were from a wider spectrum than just publishers. The 40 invited participants represented editors, funders, research integrity professionals at publishers, representatives of ministries of science, and other partner organisations such as OASPA, COPE, STEM and DOAJ.&lt;/p>
&lt;p>This half-day session enabled us to sense-check our thinking with the community and get input into whether our position is the best one for their needs.&lt;/p>
&lt;p>Ed Pentz introduced the session by reminding participants that integrity is key to Crossref’s mission and is the basis of the shared Research Nexus vision. Amanda (that’s me) talked through our current membership processes, recent membership trends, and why wider participation is key and also the sort of questions the community comes to Crossref to solve (eg title ownership disputes). And finally, Ginny Hendricks talked through the specific services and metadata that Crossref has already developed to support the community as signals of trustworthiness, and introduced some new activities and ideas.&lt;/p>
&lt;figure class="img-responsive">&lt;a href="https://docs.google.com/presentation/d/1chyWGpa1Ap3X9yC3H7xBebpfgCcINvs1Z89wGOXqUkI/edit#slide=id.g16b249b2c40_1_11">&lt;img src="https://www.crossref.org/images/blog/2023/frankfurt-slide-image.png"
alt="Slide deck cover image Crossref&amp;#39;s role in the Integrity of the Scholarly Record (ISR)" width="75%">&lt;/a>
&lt;/figure>
&lt;p>You can &lt;a href="https://docs.google.com/presentation/d/1chyWGpa1Ap3X9yC3H7xBebpfgCcINvs1Z89wGOXqUkI/edit#slide=id.g16b7e602dde_2_340" target="_blank">check out the slide deck&lt;/a> and for more background, read our &lt;a href="https://www.crossref.org/categories/research-integrity/">previous posts&lt;/a> in the ISR series.&lt;/p>
&lt;p>Participants then split into small groups representing a mix of communities, and we asked them to discuss three key questions:&lt;/p>
&lt;ul>
&lt;li>Is Crossref’s role what you expected? What surprised you? What are we missing?&lt;/li>
&lt;li>Are you aware of Crossref services? What are the barriers to more uptake? What are the challenges and opportunities?&lt;/li>
&lt;li>What more could Crossref or its members do?&lt;/li>
&lt;/ul>
&lt;p>After discussion, each small group fed back to the room, and we followed up with a whole group discussion, before ending the day with a post-it note exercise for what Crossref should start doing, stop doing, and continue doing.&lt;/p>
&lt;p>Here&amp;rsquo;s what we learned.&lt;/p>
&lt;h3 id="the-importance-of-whole-community-involvement-in-research-integrity-and-isr">The importance of whole community involvement in research integrity and ISR&lt;/h3>
&lt;p>The need for all parts of the community to come together to solve the problems of research integrity came through loud and clear - there is no single group that can solve this problem on its own.&lt;/p>
&lt;p>Publishers expressed frustration that responsibility for research integrity has been placed seemingly solely in their hands when institutions and funders can “unwittingly incentivise bad behaviour”. But it was clear that funders are just as concerned with research integrity issues, with many having made a dedicated trip for the roundtable. There were comments that bringing publishers and funders together around these issues was a rare but important opportunity, and there were calls for this to be an annual event. Both funders and publishers called for more involvement from and inclusion of research institutions in the discussion.&lt;/p>
&lt;p>The group agreed that Crossref’s main focus should continue to be capturing and sharing the scholarly record, and that metadata and relationships are key for attribution, evidence, and provenance. One participant commented that “you can’t make open science work unless the metadata is complete” and that this would only happen with efforts throughout the community. Accurate and complete metadata needs to be:&lt;/p>
&lt;ul>
&lt;li>pushed for by funders and institutions (through advocacy and policy)&lt;/li>
&lt;li>provided by the authors and other contributors&lt;/li>
&lt;li>collated, curated, and registered by the publishers and repositories&lt;/li>
&lt;li>collected, matched, (sometimes cleansed), and distributed by Crossref.&lt;/li>
&lt;li>(and we would add “prioritised by all who want to support open infrastructure over commercial alternatives”)&lt;/li>
&lt;/ul>
&lt;p>Interestingly, this echoes the &lt;a href="https://metadata2020.org/resources/metadata-personas/" target="_blank">‘metadata personas’&lt;/a> output of the Metadata 20/20 initiative which defined roles in the community’s collective metadata effort:&lt;/p>
&lt;ul>
&lt;li>Metadata Creators: providing descriptive information (metadata) about research and scholarly objects.&lt;/li>
&lt;li>Metadata Curators: classifying, normalising, and standardising this descriptive information to increase its value as a resource.&lt;/li>
&lt;li>Metadata Custodians: storing and maintaining this descriptive information and making it available for consumers.&lt;/li>
&lt;li>Metadata Consumers: knowingly or unknowingly using the descriptive information to find, discover, connect, cite, and assess research objects.&lt;/li>
&lt;/ul>
&lt;h3 id="importance-of-whole-publisher-involvement">Importance of whole-publisher involvement&lt;/h3>
&lt;p>A few participants, particularly those in editorial or integrity roles at publishing organisations, had not previously made the connection that metadata could be important signals of integrity. This highlighted a key problem - working with Crossref is seen by publishers as a technical/production workflow issue, and so knowledge of the benefits of metadata can be siloed within those teams. Crossref needs to reach out to editorial and research integrity teams to explain that good metadata isn’t just an end in itself and reinforce the impact it has on research integrity. This buy-in from across publisher organisations is vital.&lt;/p>
&lt;blockquote>
&lt;p>We’re currently &lt;a href="https://www.crossref.org/jobs/2023-04-19-community-engagement-manager/">recruiting a Community Engagement Manager&lt;/a> with editorial or research integrity experience to dedicate time to this area, to advocate for richer metadata within the editorial community, and progress this important conversation.&lt;/p>
&lt;/blockquote>
&lt;h3 id="agreement-of-the-importance-of-metadata-but-an-acknowledgment-that-this-brings-extra-cost">Agreement of the importance of metadata but an acknowledgment that this brings extra cost&lt;/h3>
&lt;p>Most participants agreed that rich metadata and relationships provide a core tool in establishing and protecting integrity. But they also acknowledged that collecting and registering more metadata often comes with an extra cost - whether that’s from system changes or just extra staff time. This is particularly true where publishers are working with third-party platforms and suppliers where there may be additional costs for adding fields and functionality to collect more metadata and register it with Crossref. Where knowledge of metadata is siloed in technical and production teams, and the wider benefits aren’t acknowledged, it can be hard to get internal buy-in for these extra costs and efforts.&lt;/p>
&lt;p>The Frankfurt group also pointed out that the benefits of more comprehensive metadata (and what this means for ISR) are spread across the research ecosystem, but it is the publisher that usually bears the costs.&lt;/p>
&lt;h3 id="need-to-define-which-metadata-elements-are-trust-signals-and-make-it-easier-for-the-community-to-provide-and-access-them">Need to define which metadata elements are trust signals and make it easier for the community to provide and access them&lt;/h3>
&lt;p>Through the course of the discussion, various elements were determined to be important to capture as “trust signals” and to identify relationships such as for retractions, conferences, reviewers, data, and when Crossref membership has been revoked for cause. We need to spend time identifying and prioritising these so that our members can do the same.&lt;/p>
&lt;p>We need to make it easier for smaller, less technically-resourced members to provide this metadata, both through our tools and our documentation, as “doing this work can be very geeky and the documentation isn’t easy to understand as a layperson”.&lt;/p>
&lt;p>There was also a discussion about where the metadata comes from - should community members be able to contribute metadata and assertions to other members’ records? If the provenance is captured then yes.&lt;/p>
&lt;p>Once the metadata is captured, there remain challenges for users in where to start with the 145 million Crossref records. The groups asked Crossref to make it easier for community members to understand and use these records to make informed decisions, including by creating and sharing sample queries, libraries, and case studies.&lt;/p>
&lt;blockquote>
&lt;p>We’re currently &lt;a href="https://www.crossref.org/jobs/2023-04-20-technical-community-manager/">recruiting a Technical Community Manager&lt;/a> to help improve the support we provide in this area to API users, service providers, and other metadata integrators .&lt;/p>
&lt;/blockquote>
&lt;h3 id="the-importance-of-retractionscorrections-information">The importance of retractions/corrections information&lt;/h3>
&lt;p>There was a lot of discussion about retractions and their importance as trust indicators. The group was surprised by how few retractions are currently registered with Crossref through Crossmark (12k). There was a lot of discussion around why Crossmark isn’t currently being adopted, and interest in taking this forward.&lt;/p>
&lt;p>This needs to be a focus for Crossref, to encourage members to register retractions, corrections, and updates, and to make it easier for smaller publishers. There are new and emerging publishers who really want tools to help them demonstrate the legitimacy of their research, and an easy way for them to record corrections and retractions is key.&lt;/p>
&lt;p>In their paper &lt;a href="https://osf.io/6z7s3/" target="_blank">Towards a connected and dynamic scholarly record of updates, corrections, and retractions&lt;/a> (September 17th, 2022), Ginny Hendricks, Rachael Lammey, and Martyn Rittman discuss how retraction information could be more effectively used - for example, letting a preprint reader know that the resulting article has been retracted, or letting the author of an article know the data that they’ve based their work on has been withdrawn.&lt;/p>
&lt;p>Collecting the information is just the start - cascading retraction information throughout the research ecosystem is the main goal, and Crossref plays a central role here. As noted in the Information Quality Lab’s project &lt;a href="https://infoqualitylab.org/projects/risrs2020/" target="_blank">Reducing the inadvertent spread of retracted science: Shaping a research and implementation agenda&lt;/a>, “Many retracted papers are not marked as retracted on publisher and aggregator sites, and retracted articles may still be found in readers’ PDF libraries, including in reference management systems such as Zotero, EndNote, and Mendeley”.&lt;/p>
&lt;p>It’s particularly important that this information is fed back to funders and institutions, and the group discussed having push notifications to these audiences for retractions. Some funders even employ staff members whose main purpose is to identify retractions.&lt;/p>
&lt;p>It was pointed out that there may be good sources of retraction information (such as Retraction Watch) that Crossref could incorporate and match in our metadata.&lt;/p>
&lt;h3 id="gaps-in-ownership-and-crossrefs-role">Gaps in ‘ownership’, and Crossref’s role&lt;/h3>
&lt;p>The group discussed the many gaps in ownership for elements of research integrity, and some groups wondered if Crossref should actually change our approach and take on more responsibility for vetting content. However, after discussion, the group mostly agreed that this would mean a change of mission (and more staff) for Crossref and potentially limit global participation, thus making the metadata corpus less useful. Crossref should provide the widest possible metadata in an easy-to-consume format, and “other organisations can provide the verification layer”.&lt;/p>
&lt;p>It was acknowledged that it would be easy for Crossref to get overwhelmed, so we ended the day by discussing not only what we should start doing, but also what we should stop doing. Unsurprisingly, there was a lot more to continue or start doing than stop doing!&lt;/p>
&lt;p>However, the fact remains that there are gaps in ownership - for example, there is no central arbiter of who ‘owns’ a journal. Also, where do you go if you have a problem with a journal? Often the &lt;a href="https://publicationethics.org/" target="_blank">Committee on Publication Ethics (COPE)&lt;/a> is seen as a solution, but they can’t solve this problem alone - it needs a coordinated effort from funders, institutions, publishers, and other partner organisations such as the &lt;a href="https://oaspa.org/" target="_blank">Open Access Scholarly Publishing Association (OASPA)&lt;/a>, the &lt;a href="https://doaj.org/" target="_blank">Directory of Opena Access Journals (DOAJ)&lt;/a>, and like-minded organisations.&lt;/p>
&lt;p>Many noted that Crossref is well-positioned to convene horizontal multi-stakeholder discussions to start to find solutions.&lt;/p>
&lt;p>We also know that there are other industry initiatives aimed at supporting this work. The STM Association’s work on an &lt;a href="https://www.stm-assoc.org/stm-integrity-hub/" target="_blank">Integrity Hub&lt;/a> is gathering pace and aims to provide, among other things ‘a cloud-based environment for publishers to check submitted articles for research integrity issues’.&lt;/p>
&lt;h2 id="what-happened-next-turns-out-it-really-is-all-about-relationships">What happened next? Turns out, it really is all about relationships…&lt;/h2>
&lt;p>Since this meeting in Frankfurt last October, we’ve been focusing on relationships - thinking about how we capture them in our metadata, and working in partnership with other organisations to bolster our support for ISR.&lt;/p>
&lt;p>The rest of this blog post highlights some of the activities underway:&lt;/p>
&lt;h3 id="increasing-participation-in-crossref">Increasing participation in Crossref&lt;/h3>
&lt;p>In January 2023, we launched our &lt;a href="https://doi.org/10.64000/7rqaz-q4616" target="_blank">new GEM Program&lt;/a>, which offers relief from fees for members in the least economically-advantaged countries in the world. By opening up participation even further, we aim to extend the corpus of open metadata, giving opportunities for more connections, more context, and more relationships.&lt;/p>
&lt;h3 id="supporting-members-in-meeting-best-practices">Supporting members in meeting best practices&lt;/h3>
&lt;p>&lt;a href="https://doi.org/10.64000/ctyr5-j0r91" target="_blank">ISR blog 2&lt;/a> explained more about how we help new members become “good Crossref citizens” with automated onboarding emails, extensive documentation, events and webinars, and help from our support team, Ambassadors, and other members in our Community Forum.&lt;/p>
&lt;div style="margin:10px;">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/PLACE-master-logo-red-ot.png"
alt="Publishers Learning &amp;amp; Community Exchange logo" width="50%">
&lt;/figure>
&lt;/div>
&lt;p>We’ve recently joined forces with COPE, DOAJ, and OASPA to create a new online public forum for organisations interested in adopting best practices in scholarly publishing. At the Publishers Learning And Community Exchange or &lt;a href="https://theplace.discourse.group/" target="_blank">The PLACE&lt;/a>, new scholarly publishers can access information from multiple agencies in one place, ask questions of the experts, and join conversations with each other. Do take a look!&lt;/p>
&lt;h3 id="being-clearer-on-the-impact-of-better-metadata">Being clearer on the impact of better metadata&lt;/h3>
&lt;p>As discussed earlier, better metadata can sometimes bring extra costs, and it’s helpful to understand the impact of this investment. We know from our ongoing outreach work that it’s difficult for our members to keep hearing that Crossref needs more and better metadata. They ask us for resources and increasingly want to see hard evidence of benefits to them. We recently &lt;a href="https://doi.org/10.64000/nhmg5-3ra76" target="_blank">showcased the journey of the American Society for Microbiology&lt;/a> which went from ‘zero to hero’ in terms of metadata participation and completeness in Crossref. They describe their efforts to increase their registered metadata over the last few years, and note a significant increase in their average monthly successful DOI resolutions from ~390,000 in 2015 to an average of ~3.7 million in 2022. They found that “the more metadata we push out into the ecosystem, the more it appears to be used… Remembering that your publishing program benefits as much as everyone else’s when you deposit more metadata can help refine your short-term and long-term priorities.”&lt;/p>
&lt;p>We know we sound like a broken record sometimes, but now other members can take it from ASM!&lt;/p>
&lt;h3 id="encouraging-better-metadata-and-more-relationships-and-identifying-trust-signals">Encouraging better metadata and more relationships and identifying &amp;rsquo;trust signals'&lt;/h3>
&lt;p>We’re trying to make it easier for members to accurately register key metadata fields, with the launch of our new &lt;a href="https://www.crossref.org/documentation/register-maintain-records/grant-registration-form/">grants registration form&lt;/a> which will be extended to journals and other record types soon. This includes a &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a> lookup - adding this unique identifier for research organisations gives even better context for the metadata.&lt;/p>
&lt;figure class="img-responsive">&lt;img src="https://www.crossref.org/images/blog/2023/grants-form-ROR-integration.png"
alt="Screenshot from grant registration tool showing a search for a research institution and suggestions from the ROR database" width="75%">
&lt;/figure>
&lt;p>We are also working to make it possible for &lt;a href="https://crossref.atlassian.net/browse/RD-19" target="_blank">anyone to contribute to metadata records&lt;/a>, and have the provenance of these contributions clearly asserted.&lt;/p>
&lt;p>Metadata adoption is still a key goal for our staff; indeed our new &lt;a href="https://www.crossref.org/strategy/#we-want-to-be-a-sustainable-source-of-complete-open-and-global-scholarly-metadata-and-relationships">2023-2025 strategic roadmap&lt;/a> specifies…&lt;/p>
&lt;blockquote>
&lt;p>“We want to be a sustainable source of complete, open, and global scholarly metadata and relationships. We are working towards this vision of a ‘Research Nexus’ by demonstrating the value of richer and connected open metadata, incentivising people to meet best practices, while making it easier to do so.”&lt;/p>
&lt;/blockquote>
&lt;p>… with item number one under projects ‘in focus’, being: “Adoption activities to focus on top metadata adoption priorities, which are:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://i4oc.org/" target="_blank">references&lt;/a>;&lt;/li>
&lt;li>&lt;a href="https://i4oa.org/" target="_blank">abstracts&lt;/a>;&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/community/grants/">grants&lt;/a>; and&lt;/li>
&lt;li>&lt;a href="https://ror.org/" target="_blank">ROR&lt;/a>”.&lt;/li>
&lt;/ul>
&lt;p>We’re continuing to talk with the community to work out which metadata elements are most useful as trust signals, and we’re trying to prioritise some of the schema changes required to capture new elements. If you haven’t already, please respond to Patricia Feeney’s &lt;a href="https://community.crossref.org/t/take-our-metadata-priorities-survey-by-may-18/3498" target="_blank">metadata priorities survey&lt;/a>.&lt;/p>
&lt;h3 id="thinking-about-retractions-and-corrections">Thinking about retractions and corrections&lt;/h3>
&lt;p>We’ve been closely involved with the &lt;a href="https://www.niso.org/standards-committees/crec" target="_blank">NISO CREC working group&lt;/a>, and they should be making the initial draft recommendations public soon - watch this space!&lt;/p>
&lt;h3 id="making-it-easier-to-view-and-compare-metadata-and-expand-the-relationships">Making it easier to view and compare metadata and expand the relationships&lt;/h3>
&lt;p>Our &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> provide a visualisation of the metadata that’s available via our free REST API. There’s a separate Participation Report for each member, and it shows what percentage of that member&amp;rsquo;s content includes nine key metadata elements. It’s an important tool to help those in the community understand our metadata more easily.&lt;/p>
&lt;p>We have been working on a &lt;a href="https://community.crossref.org/t/ticket-of-the-month-april-2023-the-new-labs-reports-are-here/3528" target="_blank">new version of Participation Reports&lt;/a>, allowing more comparison between members, and extra metadata elements to communicate trustworthiness, including whether each member has thought about the long-term preservation of their content, and whether it has been added to a repository. There is a test version to look at in our Labs sandbox. Do take a look and &lt;a href="https://gitlab.com/crossref/labs/crossref-labs-reports/-/issues" target="_blank">provide feedback&lt;/a>.&lt;/p>
&lt;p>We’ve also made public our &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/revocation/#process-for-revoking-membership-due-to-contravention-of-the-membership-terms">list of members whose membership was revoked for contravention of the membership terms&lt;/a>.&lt;/p>
&lt;h3 id="continuing-to-work-with-funders">Continuing to work with funders&lt;/h3>
&lt;p>We’re continuing to work with funders through our growing funder membership, the Funder Advisory Group and other groups, including the Open Research Funders Group, the HRA, Altum, Europe PMC, and the ORCID Funder Interest Group. And we’re continuing to build the important relationships between funding and outputs (see &lt;a href="https://doi.org/10.64000/he02b-neb96" target="_blank">Dominika Tkaczyk’s recent report&lt;/a>) and engage with this key audience for research integrity.&lt;/p>
&lt;h3 id="discussions-with-the-community">Discussions with the community&lt;/h3>
&lt;p>We’ll be talking about ISR at our next community update on May 3rd - there are two versions of the meeting depending on your timezone - do &lt;a href="https://community.crossref.org/t/save-the-date-crossref-community-update-metadata-connects-the-global-community/3373" target="_blank">sign up&lt;/a> if you haven’t already. And if you’re attending the SSP conference in June, do come along to our panel &lt;a href="https://customer.sspnet.org/SSP/ssp/AM23/Program.aspx?hkey=2b8aa5b0-5fc3-4b7a-9fa7-c212e5f1b9ab" target="_blank">“Working together to preserve the integrity of the scholarly record in a transparent and trustworthy way”&lt;/a>.&lt;/p></description></item><item><title>We’re hiring! New technical, community, and membership roles at Crossref</title><link>https://www.crossref.org/blog/were-hiring-new-technical-community-and-membership-roles-at-crossref/</link><pubDate>Fri, 21 Apr 2023 00:00:00 +0000</pubDate><author>Michelle Cancel</author><guid>https://www.crossref.org/blog/were-hiring-new-technical-community-and-membership-roles-at-crossref/</guid><description>&lt;p>Do you want to help make research communications better in all corners of the globe? Come and join the world of nonprofit open infrastructure and be part of improving the creation and sharing of knowledge.&lt;/p>
&lt;p>We are recruiting for three new staff positions, all new roles and all fully remote and flexible. See below for more about our ethos and what it&amp;rsquo;s like working at Crossref.&lt;/p>
&lt;p>🚀 &lt;strong>&lt;a href="https://www.crossref.org/jobs/2023-04-20-technical-community-manager">Technical Community Manager&lt;/a>&lt;/strong>, working with our &amp;lsquo;integrators&amp;rsquo; so all repository/publishing platforms and plugins, all API users incl. managing contracts with subscribers, and generally helping a very nice bunch of RESTful API dabblers, both novice and intermediate. The goal is to offer more interactive engagement such as sprints, and more technical consultation to help the community with things like query efficiency, public data dump ingestion, etc. Thousands of users exist, from individual researchers and small academic tools to giant technology companies. Researching and analysing usage and building tools to meet their needs is key, so this role works closely with Product and R&amp;amp;D colleagues and likely needs a developer or developer-advocacy background.&lt;/p>
&lt;p>🎯 &lt;strong>&lt;a href="https://www.crossref.org/jobs/2023-04-13-member-experience-manager/">Member Experience Manager&lt;/a>&lt;/strong>, ramping up to handle the mammoth operation that is&amp;hellip; membership, currently 18,000 members from 150 countries, and onboarding the ~180 new joiners we welcome monthly, mostly from Africa and Asia. This role involves lots of education and relationship management, but because of the scale, we also need someone with a real business process/analysis approach, improving how our systems function so that the operation flows seamlessly and isn&amp;rsquo;t a pain for people (both members and staff). This role manages two full-time Member Support Specialists (UK and Indonesia) and three part-time contractors (USA, France, and one other as yet unknown).&lt;/p>
&lt;p>🎈 &lt;strong>&lt;a href="https://www.crossref.org/jobs/2023-04-19-community-engagement-manager/">Community Engagement Manager&lt;/a>&lt;/strong>, working with the global community of scholarly editors at a time when research integrity is top of mind for our entire ecosystem. This is a classic community role for someone keen to cross over from managing or editing journals or books and perhaps make your volunteer work official. Activities will include program and project management, event and working group facilitation, communications and content creation. You&amp;rsquo;d be interacting with groups like the Asian Council of Science Editors, the European Association of Science Editors, and the Council of Science Editors, plus many more that you&amp;rsquo;d identify. It&amp;rsquo;s all about helping editors, who work hand-in-hand with authors, to think about metadata as signals of trust and better use available services, such as those for retraction management or plagiarism checking, and helping to define needs for emerging activity too, such as machine-generated content.&lt;/p>
&lt;h3 id="working-at-crossref">Working at Crossref&lt;/h3>
&lt;p>We’re a not-for-profit membership organisation that exists to make scholarly communications better. We rally the community; tag and share metadata; run an open infrastructure; play with technology; and make tools and services—all to help put research in context.&lt;/p>
&lt;p>Crossref sits at the heart of the global exchange of research information, and our job is to make it possible—and easier—to find, cite, link, assess, and reuse research, from journals and books, to preprints, data, and grants. Through partnerships and collaborations we engage with members in 150 countries (and counting) and it’s very important to us to nurture that community.&lt;/p>
&lt;p>We’re about &lt;a href="https://www.crossref.org/people/">45 staff&lt;/a> and remote-first. This means that we support our teams working asynchronously and with flexible hours. We are dedicated to an open and fair research ecosystem and that’s reflected in our ethos and staff culture. We like to work hard but we have fun too! We take a creative, iterative approach to our projects, and believe that all team members can enrich the culture and performance of our whole organisation. Check out the &lt;a href="https://www.crossref.org/people/org-chart/">organisation chart&lt;/a>.&lt;/p>
&lt;p>We are active supporters of ongoing professional development opportunities and promote self-learning at every opportunity. Crossref has a healthy financial situation and we only continue to grow. While we won’t have a clear hierarchical path for staff to follow, there are always evolving opportunities to progress and be challenged.&lt;/p>
&lt;p>We especially encourage applications from people with backgrounds historically under-represented in research and scholarly communications.&lt;/p>
&lt;p>Bookmark our &lt;a href="https://www.crossref.org/jobs">jobs page&lt;/a> to watch for future opportunities!&lt;/p></description></item><item><title>The PLACE for new publishers – a one-stop-shop for information and a friendly community</title><link>https://www.crossref.org/blog/the-place-for-new-publishers-a-one-stop-shop-for-information-and-a-friendly-community/</link><pubDate>Mon, 17 Apr 2023 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/the-place-for-new-publishers-a-one-stop-shop-for-information-and-a-friendly-community/</guid><description>&lt;p>The Publishers Learning And Community Exchange (PLACE) at &lt;a href="https://theplace.discourse.group" target="_blank">theplace.discourse.group&lt;/a> is a new online public forum created for organisations interested in adopting best practices in scholarly publishing. New scholarly publishers can access information from multiple agencies in one place, ask questions of the experts and join conversations with each other.&lt;/p>
&lt;p>Scholarly publishing is an interesting niche of an industry – it appears at the same time ancillary and necessary to the practice and development of scholarship itself. The sooner and more easily a piece of academic work is shared, the greater the chance that others will find and build upon it. Many practices of the publishing industry have been developed to support discovery and integrity of the scholarship that produces shareable works, and as the landscape of scholarly communications constantly evolves, a number of agencies arose to promote and continuously update the standards and best practices within it.&lt;/p>
&lt;p>We realise that the sheer number of agencies involved in regulating and preserving scholarly content is in itself a challenge and can be confusing. Newer publishers may find it difficult to know where to go to find the right information, what policies they need to follow or international criteria they need to meet and how to go about doing so. When time or finances are tight, it’s not easy to try to reinvent the wheel.&lt;/p>
&lt;p>Following the long-established practice of signposting organisations between us, we’ve worked together with the &lt;a href="https://publicationethics.org" target="_blank">Committee on Publication Ethics (COPE)&lt;/a>, &lt;a href="https://doaj.org" target="_blank">the Directory of Open Access Journals (DOAJ)&lt;/a>, and the &lt;a href="https://oaspa.org" target="_blank">Open Access Scholarly Publishers Association (OASPA)&lt;/a> to establish the PLACE. We share values and goals to work more effectively to better support the needs of our communities. Each organisation is taking actions to lower barriers to participation and provide greater support for the organisations that publish scholarly and professional content that we work with.&lt;/p>
&lt;p>Hence, we envisaged the PLACE as a ‘one stop shop’ for access to more consolidated and plainly put information, to support publishers in adopting best practices the industry developed. We also hope that by setting the information service as a forum, we will encourage open exchange with publishers who aspire to do things right, as well as between them.&lt;/p></description></item><item><title>Renewed Persistence</title><link>https://www.crossref.org/blog/renewed-persistence/</link><pubDate>Sat, 01 Apr 2023 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/renewed-persistence/</guid><description>&lt;p>&lt;strong>We believe in Persistent Identifiers. We believe in defence in depth. Today we&amp;rsquo;re excited to announce an upgrade to our data resilience strategy.&lt;/strong>&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/2023-04-01-renewed-persistence/single-floppy-small.jpg"
alt="5¼ inch floppy disk with Crossref logo" width="200px">
&lt;/figure>
&lt;/div>
&lt;p>&lt;a href="https://en.wikipedia.org/wiki/Defense_in_depth_%28computing%29" target="_blank">Defence in depth&lt;/a> means layers of security and resilience, and that means layers of backups. For some years now, our last line of defence has been a reliable, tried-and-tested technology. One that&amp;rsquo;s been around for a while. Yes, I&amp;rsquo;m talking about the humble &lt;a href="https://en.wikipedia.org/wiki/Floppy_disk#8-inch_and_5%C2%BC-inch_disks" target="_blank">5¼ inch floppy disk&lt;/a>.&lt;/p>
&lt;p>This may come as surprise to some. When things go well, you&amp;rsquo;re probably never aware of them. In day to day use, the only time a typical Crossref user sees a floppy disk is when they click &amp;lsquo;save&amp;rsquo; (yes, some journals still require submissions in Microsoft Word).&lt;/p>
&lt;h2 id="history">History&lt;/h2>
&lt;p>But why?&lt;/p>
&lt;p>Let me take you back to the early days of Crossref. The technology scene was different. This data was too important to trust to new and unproven technologies like Zip disks, CD-Rs or USB Thumb Drives. So we started with punched cards.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/2023-04-01-renewed-persistence/punched-card.jpg"
alt="IBM 5081-style punched card." width="100%">&lt;figcaption>
&lt;p>IBM 5081-style punched card.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Punched cards are reliable and durable as long as you don&amp;rsquo;t &lt;a href="https://en.wikipedia.org/wiki/Punched_card#Do_Not_Fold,_Spindle_or_Mutilate" target="_blank">fold, spindle or mutilate&lt;/a> them. But even in 2001 we knew that punched cards&amp;rsquo; days were numbered. The capacity of 80 characters kept DOIs short. Translating DOIs into &lt;a href="https://en.wikipedia.org/wiki/EBCDIC" target="_blank">EBCDIC&lt;/a> made &lt;a href="https://en.wikipedia.org/wiki/ASCII" target="_blank">ASCII&lt;/a> a challenge, let alone &lt;a href="https://en.wikipedia.org/wiki/Serial_Item_and_Contribution_Identifier" target="_blank">SICI&lt;/a>s. We kept a close eye on the nascent Unicode.&lt;/p>
&lt;h2 id="breathing-room">Breathing Room&lt;/h2>
&lt;p>In 2017 the change of DOI display guidelines from &lt;code>http://dx.doi.org&lt;/code> to &lt;code>https://doi.org&lt;/code> shortened each DOI by 2 characters, buying us some time. But eventually we knew we had to upgrade to something more modern.&lt;/p>
&lt;p>So we migrated to 5¼ inch floppy disks.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/2023-04-01-renewed-persistence/floppy-in-drive.jpg"
alt="5¼ Floppy disk in drive" width="100%">&lt;figcaption>
&lt;p>5¼ Floppy disk in drive&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>At 640 KB per disk these were a huge improvement. We could fit around 20,000 DOIs on one floppy. Today we only need around 10,000 floppy disks to store all of our DOIs (not the metadata, just the DOIs). Surprisingly this only takes about 20 metres of shelf space to store.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/2023-04-01-renewed-persistence/desktop.jpg"
alt="laptop computer connected to floppy disk drive, pile of disks next to it" width="100%">&lt;figcaption>
&lt;p>Typical work from home setup. Getting ready to backup some DOIs!&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>The move to working-from-home brought an unexpected benefit. Staff mail floppy disks to each other and keep them in constant rotation, which produces a distributed fault tolerant system.&lt;/p>
&lt;h2 id="persistence-means-change">Persistence Means Change&lt;/h2>
&lt;p>But it can&amp;rsquo;t last forever. DOIs registration shows no sign of slowing down. It&amp;rsquo;s clear we need a new, compact storage medium. So, after months of research, we&amp;rsquo;ve invested in new equipment.&lt;/p>
&lt;p>&lt;strong>Today we announce our migration to 3½ inch floppies.&lt;/strong>&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/2023-04-01-renewed-persistence/new-equipment.jpg"
alt="dual format floppy disk drive, with 5¼ inch and 3½ inch floppy disks" width="100%">
&lt;/figure>
&lt;br>
&lt;p>If it goes to plan you won&amp;rsquo;t even notice the change.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/2023-04-01-renewed-persistence/old-new.jpg"
alt="two stacks of disks" width="100%">
&lt;/figure>
&lt;h3 id="image-credits">Image credits&lt;/h3>
&lt;p>Punched card: IBM 5081-style punched card. Derived from &lt;a href="https://en.wikipedia.org/wiki/File:Blue-punch-card-front-horiz_top-char-contrast-stretched.png" target="_blank">public domain by Gwern&lt;/a>.&lt;/p></description></item><item><title>Start citing data now. Not later</title><link>https://www.crossref.org/blog/start-citing-data-now.-not-later/</link><pubDate>Thu, 23 Mar 2023 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/start-citing-data-now.-not-later/</guid><description>&lt;p>Recording data citations supports data reuse and aids research integrity and reproducibility. Crossref makes it easy for our members to submit data citations to support the scholarly record.&lt;/p>
&lt;h3 id="tldr">TL;DR&lt;/h3>
&lt;p>Citations are essential/core metadata that all members should submit for all articles, conference proceedings, preprints, and books. Submitting data citations to Crossref has long been possible. And it’s easy, you just need to:&lt;/p>
&lt;ul>
&lt;li>Include data citations in the references section &lt;strong>as you would for any other citation&lt;/strong>&lt;/li>
&lt;li>Include a DOI or other persistent identifier for the data if it is available - just &lt;strong>as you would for any other citation&lt;/strong>&lt;/li>
&lt;li>Submit the references to Crossref through the content registration process &lt;strong>as you would for any other record&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>And your data citations will flow through all the normal processes that Crossref applies to citations. And it will be distributed openly to the community (including DataCite!) via Crossref’s services and APIs. All data citations deposited with Crossref will be exposed in the (soon-to-be launched) &lt;a href="https://doi.org/10.5438/vjz9-kx84" target="_blank">Data Citation Corpus&lt;/a>.&lt;/p>
&lt;p>And then, you can sit back and congratulate yourself for making your publication more useful to researchers who want to be able to reuse the data underlying your publications.&lt;/p>
&lt;h3 id="background">Background&lt;/h3>
&lt;p>You might ask, “So if submitting Data Citations to Crossref has long been possible, why do you have to write this?”&lt;/p>
&lt;p>Historically, authors did not cite data in the way they cited publications. Instead, they would often refer to the data in the main text of the article. This has made it hard to determine what data lay behind the research and/or access the data.&lt;/p>
&lt;p>But the research community has increasingly recognized that data is a first-class research output and that we should treat it as such. In short, we should formally cite data.&lt;/p>
&lt;p>But because citing data is a comparatively new practice, it has been subject to a lot of new analysis. And unsurprisingly, people analyzing data citation have discovered that there is a lot of nuance to citation &lt;em>of any kind&lt;/em>.&lt;/p>
&lt;p>There are lots of reasons for citing something. There are lots of internalized conventions for citing things. And there are different conventions for citation for different research objects. And SSH citation practice differs from STEM. And legal citation practices are different from scholarly citation practices. And citation practices even vary by subdiscipline and by journal.&lt;/p>
&lt;p>Those who have been looking at what it means to “cite data” have naturally stumbled into a thicket of divergent practices - some of which are historical holdovers, some of which are stylistic preferences, and some of which are clearly adaptations to deal with the specific needs of certain research objects/containers or different disciplines.&lt;/p>
&lt;p>The temptation has been to try and rationalize this &lt;em>before&lt;/em> extending the practice of citation to data.&lt;/p>
&lt;p>“Maybe because data is a distinct record type, we should include the fact that it is a data citation in the citation itself?”&lt;/p>
&lt;p>“Maybe because people cite data for different reasons, we should include a typology of citation types in all data citations?”&lt;/p>
&lt;p>And so you may hear some people say, “hold off on data citation - we don’t have an optimal way to do it yet, and it can be very complicated.”&lt;/p>
&lt;p>But guess what?&lt;/p>
&lt;p>We currently don’t label citations to monographs as “citation to monograph.”&lt;/p>
&lt;p>And we don’t currently include the reason for citation when we are citing a journal article.&lt;/p>
&lt;p>&lt;a href="https://sparontologies.github.io/cito/current/cito.html" target="_blank">It would be very cool if we did.&lt;/a> And it would likely make citations even more useful if we did.&lt;/p>
&lt;p>But citations are already useful even without these features. And so, to delay citing data indefinitely because we have an opportunity to improve the act of citation is just perverse. Our community has always opted for progress over perfection.&lt;/p>
&lt;p>For one thing - the efforts are not mutually exclusive. We can start citing data with the current limitations of citation practices and simultaneously propose mechanisms for making citation more useful in the future, including new guidelines to deal with the unique issues that citing data poses.&lt;/p>
&lt;p>But in the meantime, we will be doing researchers a giant favour if we at least include our imperfect and ambiguous, and unconventional references to data in the references section of an article so that they can be accessed and processed along with all the other imperfect, ambiguous and variant citations that we find so useful.&lt;/p>
&lt;p>Some of our members are already doing this. They have been for a long time. And they haven’t found it any more complicated than managing non-data references in the past.&lt;/p>
&lt;p>Join them and make your metadata more useful.&lt;/p>
&lt;p>Cite data now. Don’t put it off.&lt;/p>
&lt;p>And Crossref will continue to work with DataCite and the rest of the community to make the distribution even easier and more useful.&lt;/p>
&lt;h3 id="so-who-is-already-citing-data">So who is already citing data?&lt;/h3>
&lt;h4 id="top-10-members-depositing-data-citations-from-november-may-2022">Top 10 members depositing data citations from November-May 2022&lt;/h4>
&lt;p>(broken down by DOI prefix, which is why you see some publishers listed twice):&lt;/p>
&lt;table>
&lt;tr>
&lt;td>&lt;strong>Prefix&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Member name&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Data citations deposited&lt;/strong>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1038
&lt;/td>
&lt;td>Springer Science and Business Media LLC
&lt;/td>
&lt;td>7174
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1016
&lt;/td>
&lt;td>Elsevier BV
&lt;/td>
&lt;td>6527
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1007
&lt;/td>
&lt;td>Springer Science and Business Media LLC
&lt;/td>
&lt;td>4748
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.5194
&lt;/td>
&lt;td>Copernicus GmbH
&lt;/td>
&lt;td>3017
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1080
&lt;/td>
&lt;td>Informa UK Limited
&lt;/td>
&lt;td>2346
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1177
&lt;/td>
&lt;td>SAGE Publications
&lt;/td>
&lt;td>2082
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1002
&lt;/td>
&lt;td>Wiley
&lt;/td>
&lt;td>2048
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1111
&lt;/td>
&lt;td>Wiley
&lt;/td>
&lt;td>1888
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.1108
&lt;/td>
&lt;td>Emerald
&lt;/td>
&lt;td>1876
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.3390
&lt;/td>
&lt;td>MDPI AG
&lt;/td>
&lt;td>1827
&lt;/td>
&lt;/tr>
&lt;/table>
&lt;h4 id="top-10-data-citations-per-deposited-work">Top 10 data citations per deposited work&lt;/h4>
&lt;p>(again, broken down by prefix)&lt;/p>
&lt;table>
&lt;tr>
&lt;td>&lt;strong>Member name&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Prefix&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Data citations deposited&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Data citations per work&lt;/strong>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Consortium Erudit
&lt;/td>
&lt;td>10.7202
&lt;/td>
&lt;td>580
&lt;/td>
&lt;td>1.149
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SLACK, Inc.
&lt;/td>
&lt;td>10.3928
&lt;/td>
&lt;td>462
&lt;/td>
&lt;td>0.646
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>S. Karger AG
&lt;/td>
&lt;td>10.1159
&lt;/td>
&lt;td>1653
&lt;/td>
&lt;td>0.532
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Proceedings of the National Academy of Sciences
&lt;/td>
&lt;td>10.1073
&lt;/td>
&lt;td>973
&lt;/td>
&lt;td>0.502
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Academy of Pediatrics (AAP)
&lt;/td>
&lt;td>10.1542
&lt;/td>
&lt;td>486
&lt;/td>
&lt;td>0.397
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>F1000 Research Ltd
&lt;/td>
&lt;td>10.12688
&lt;/td>
&lt;td>552
&lt;/td>
&lt;td>0.341
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Association for the Advancement of Science (AAAS)
&lt;/td>
&lt;td>10.1126
&lt;/td>
&lt;td>952
&lt;/td>
&lt;td>0.317
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Springer Science and Business Media LLC
&lt;/td>
&lt;td>10.1038
&lt;/td>
&lt;td>7174
&lt;/td>
&lt;td>0.231
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>JMIR Publications Inc.
&lt;/td>
&lt;td>10.2196
&lt;/td>
&lt;td>864
&lt;/td>
&lt;td>0.187
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Geophysical Union (AGU)
&lt;/td>
&lt;td>10.1029
&lt;/td>
&lt;td>692
&lt;/td>
&lt;td>0.166
&lt;/td>
&lt;/tr>
&lt;/table>
&lt;p>These are for the prefixes with the most data citations deposited (&amp;gt;500 in 6 months) so there might be smaller members doing better than this.&lt;/p>
&lt;h3 id="summaries-are-great-but-i-want-to-see-some-actual-examples">Summaries are great, but I want to see some actual examples!&lt;/h3>
&lt;p>Here are some examples showing how data is cited by our members:&lt;/p>
&lt;ul>
&lt;li>This eLife article: &lt;a href="https://doi.org/10.7554/eLife.26410" target="_blank">https://doi.org/10.7554/eLife.26410&lt;/a> cites this dataset in Dryad &lt;a href="https://doi.org/10.5061/dryad.854j2" target="_blank">https://doi.org/10.5061/dryad.854j2&lt;/a>.&lt;/li>
&lt;li>This Copernicus article: &lt;a href="https://doi.org/10.5194/acp-22-7105-2022" target="_blank">https://doi.org/10.5194/acp-22-7105-2022&lt;/a> cite to this dataset &lt;a href="https://doi.org/10.24381/cds.bd0915c6" target="_blank">https://doi.org/10.24381/cds.bd0915c6&lt;/a>&lt;/li>
&lt;li>This Sciendo article: &lt;a href="https://doi.org/10.2478/plc-2021-0008" target="_blank">https://doi.org/10.2478/plc-2021-0008&lt;/a> cites this APA-hosted language competence test &lt;a href="https://doi.org/10.1037/t15159-000" target="_blank">https://doi.org/10.1037/t15159-000&lt;/a>&lt;/li>
&lt;li>This De Gruyter article: &lt;a href="https://doi.org/10.1515/opth-2020-0160" target="_blank">https://doi.org/10.1515/opth-2020-0160&lt;/a> cites this bibliography at Oxford Bibliographies: &lt;a href="https://doi.org/10.1093/OBO/9780195396584-0012" target="_blank">https://doi.org/10.1093/OBO/9780195396584-0012&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>And here are some example API requests for discovering more metadata citations. You can use these API requests as examples and adapt to your own needs.&lt;/p>
&lt;h4 id="find-all-the-dois-that-cite-dataset-x-identified-by-doi">Find all the DOIs that cite Dataset X (identified by DOI)&lt;/h4>
&lt;p>&lt;a href="https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;obj-id=10.5061/dryad.854j2" target="_blank">https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;obj-id=10.5061/dryad.854j2&lt;/a>&lt;/p>
&lt;h4 id="find-all-data-citations-from-crossref-member-x-identified-by-member-prefix">Find all data citations from Crossref member X (identified by member prefix)&lt;/h4>
&lt;p>&lt;a href="https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;subj-id.prefix=10.7202" target="_blank">https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;subj-id.prefix=10.7202&lt;/a>&lt;/p>
&lt;h4 id="find-papers-with-supplementary-data">Find papers with supplementary data&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/v1/works?filter=prefix:10.3390,relation.type:is-supplemented-by" target="_blank">https://api.crossref.org/v1/works?filter=prefix:10.3390,relation.type:is-supplemented-by&lt;/a>&lt;/p>
&lt;h4 id="find-all-data-citations-to-crossref-member-x">Find all data citations &lt;em>to&lt;/em> Crossref member X&lt;/h4>
&lt;p>&lt;a href="https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;obj-id.prefix=10.7202" target="_blank">https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;obj-id.prefix=10.7202&lt;/a>&lt;/p>
&lt;h4 id="find-all-data-citations-to-datacite-member-x">Find all data citations to DataCite member X&lt;/h4>
&lt;p>&lt;a href="https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;obj-id.prefix=10.5061" target="_blank">https://api.eventdata.crossref.org/v1/events?rows=20&amp;amp;scholix=true&amp;amp;obj-id.prefix=10.5061&lt;/a>&lt;/p></description></item><item><title>Shooting for the stars – ASM’s journey towards complete metadata</title><link>https://www.crossref.org/blog/shooting-for-the-stars-asms-journey-towards-complete-metadata/</link><pubDate>Tue, 14 Mar 2023 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/shooting-for-the-stars-asms-journey-towards-complete-metadata/</guid><description>&lt;p>At Crossref, we care a lot about the completeness and quality of metadata. Gathering robust metadata from across the global network of scholarly communication is essential for effective co-creation of the research nexus and making the inner workings of academia traceable and transparent. We invest time in community initiatives such as &lt;a href="https://metadata2020.org" target="_blank">Metadata 20/20&lt;/a> and &lt;a href="https://zenodo.org/communities/better-together/?page=1&amp;amp;size=20" target="_blank">Better Together webinars&lt;/a>. We encourage members to take time to look up their &lt;a href="https://www.crossref.org/documentation/reports/participation-reports/">participation reports&lt;/a>, and &lt;a href="https://community.crossref.org/c/reports/30" target="_blank">our team can support you&lt;/a> if you’re looking to understand and improve any aspects of metadata coverage of your content.&lt;/p>
&lt;p>In 2022, we have observed with delight the growth of one of our members from basic coverage of their publications to over 90% in most areas, and no less than 70% of the corpus is covered by all key types of metadata Crossref enables (see &lt;a href="https://www.crossref.org/members/prep/235" target="_blank">their own participation report&lt;/a> for details). Here, Deborah Plavin and David Haber share the story of ASM’s success and lessons learnt along the way.&lt;/p>
&lt;h3 id="could-you-introduce-your-organisation">Could you introduce your organisation?&lt;/h3>
&lt;p>The American Society for Microbiology publishes 16 peer-reviewed journals advancing the microbial sciences, from food microbiology, to genomics and the microbiome, comprising 14% of all microbiology articles. Six of those are open-access journals, and 56% of ASM’s published papers are open access. Together, our journals contribute 25% of all microbiology citations.&lt;/p>
&lt;h3 id="would-you-tell-us-a-little-more-about-yourselves">Would you tell us a little more about yourselves?&lt;/h3>
&lt;p>DH: David Haber, Publishing Operations Director at the American Society for Microbiology. I live in a century-old house that is in a perpetual state of renovation due to my inability to stop starting new projects before I complete old ones.&lt;/p>
&lt;p>DP: Deborah Plavin, Digital Publishing Manager at the American Society for Microbiology. Following David’s example, my apartment in Washington D.C. is just up the block from one of the homes Duke Ellington lived in &lt;a href="https://www.hmdb.org/m.asp?m=142334" target="_blank">https://www.hmdb.org/m.asp?m=142334&lt;/a>.&lt;/p>
&lt;h3 id="what-value-do-society-publishers-in-general-see-in-metadata-in-your-view">What value do society publishers in general see in metadata in your view?&lt;/h3>
&lt;p>DP: In my view, robust metadata allows publishers to look at changes over time, do comparative analysis within and across research areas, more easily identify trends, and plan for future analysis (e.g., if we deposit data citation information and we change our processes to make it more straightforward, do we see any change in the percentage of articles that include that information, etc.).&lt;/p>
&lt;p>DH: To echo Deborah&amp;rsquo;s point, to be able to name something distinctly and clearly identify its specific attributes is vital to understanding past research and planning for future possibilities. One of our fundamental roles as a publisher for a non-profit society is to properly lay this metadata foundation so that we can provide services and new venues for our members, authors, and readers that match their needs and track with the trends in research. Without good and robust metadata, it is impossible to truly understand the direction in which our community is pointing us.&lt;/p>
&lt;h3 id="metadata-for-your-own-research-outputs-in-the-last-year-has-grown-rapidly-why-such-focus-on-metadata-in-2022">Metadata for your own research outputs in the last year has grown rapidly. Why such focus on metadata in 2022?&lt;/h3>
&lt;p>DP: This is something that ASM has been chipping away at over time. Years ago we found that it wasn’t always easy to take advantage of deposits that included new kinds of metadata. That was either because we needed to work out how and where to capture it in the process or because platform providers weren’t always ready — coming up with ways to process the XML that publishers supply in many different ways takes time. These back-end processes that feed the infrastructure aren’t usually of great interest to stakeholders, and so it allowed us to play around, flounder, fail, refine, and try again.&lt;/p>
&lt;p>We looked at having 3rd parties deposit metadata for us, and while that helped expand the kind of metadata we were delivering, it created workflow challenges of its own. What turned out to be most effective was budgeting for content cleanup projects and depositing updated and more robust metadata to Crossref.&lt;/p>
&lt;p>We also benefited from a platform migration, which allowed us to take advantage of additional resources during that process.&lt;/p>
&lt;p>DH: Coming from a production background, I have always been fascinated with the when and how of capturing key metadata during the publishing process. When are those data good and valuable, and when should they be tossed or cleaned up for downstream deliveries? Because Deborah and ASM directors saw a more complete Crossref metadata set for our corpus as a truly valuable target, we were able to really think hard about what kind of data we were capturing and when, how those requirements may have influenced our various policies and copyediting requirements over the years, and how best to re-engineer our processes with the goal of good metadata capture throughout our publishing workflows. From our perspective, Crossref gave us a target, a “this-is-cool-bit-of-info&amp;quot; that Crossref can collect in a deposit; therefore, how can we capture that during our processes while driving further efficiencies? ASM journals had been so driven by legacy print workflows that such a change in perspective (toward metadata as a publishing object) really allowed us to re-imagine almost everything we do as a publisher.&lt;/p>
&lt;h3 id="has-the-ostp-memo-influenced-your-effort">Has the OSTP memo influenced your effort?&lt;/h3>
&lt;p>DP: I think that the Nelson memo hasn’t changed our focus; instead, I think it’s been another data point supporting our efforts and work in this area.&lt;/p>
&lt;p>DH: Deborah is exactly right. The release of this memo only re-affirmed our commitment to creating complete and rich metadata. The Nelson memo points to many possible paths forward, in terms of both Open Access and Open Science, but we feel our work on improving our metadata outputs positions us well to pick a path that best suits our goals as a non-profit society publisher.&lt;/p>
&lt;h3 id="how-big-was-this-effort-could-you-draw-us-a-picture-of-how-many-colleagues-or-parts-of-the-organisation-were-involved-did-you-involve-any-external-stakeholders-such-as-authors-editors-or-others">How big was this effort? Could you draw us a picture of how many colleagues or parts of the organisation were involved? Did you involve any external stakeholders, such as authors, editors, or others?&lt;/h3>
&lt;p>DH: It was simple. Took five minutes…
In all seriousness, the key is having the support of the organisation as a whole. To do this properly, it is vitally important to know the end from the beginning, so to speak. It is one thing to say let’s start capturing ORCID IDs and deliver them to Crossref, but it is completely another to create a cohesive process in which those IDs are authenticated and validated throughout the workflow. So something as simple as a statement “ORCID IDs seem cool, let’s try to capture them” could affect how researchers submit files, how reviewers log into various systems (i.e., ORCID as SSO), how data are passed to production vendors, what copyeditors and XML QC people need to be focused on, and what integrations authors may expect at the time of publication. Being part of an organisation that embraced such change allowed us to proceed with care with each improvement to the metadata we made.&lt;/p>
&lt;p>But that is more about incremental improvement. The beginning of this process started when we were making upgrades to our online publishing platform, and we were trying to figure out how best to get DOIs registered for our older content. When we started looking at this, we soon realized that, sure, we could do the bare minimum and just assign DOIs to this older content outside the source XML/SGML, but did that make sense? Wouldn’t it make more sense, especially since we were updating the corpus to a new DTD, to populate the source content with these newly assigned DOIs? Once we decided that we were going to revise the older content with DOIs, it made sense for us to create a custom XSL transform routine to generate Crossref deposits that would capture as much metadata as possible. So, working with a vendor to clean and update our content for one project (an online platform update) allowed us also to make massive improvements to our Crossref metadata as a side benefit.&lt;/p>
&lt;p>Of course, I do have to apologize to the STM community for the Crossref outages in late 2019. That was just me depositing thousands of records in batches one sleepless night.&lt;/p>
&lt;h3 id="what-were-the-key-challenges-you-encountered-in-this-project-and-how-did-you-overcome-them">What were the key challenges you encountered in this project, and how did you overcome them?&lt;/h3>
&lt;p>DH: Resources and time are always an issue. Much of the work was done in-house in spare moments captured here and there. But there are great resources in github and at Crossref to help focus on defining what is important and what is possible in such a project. And, honestly, defining what was important and weighing that against the effort to find said important bit in the corpus of articles we have was the most challenging part of this process. In other words, limiting the focus. Once one decides to start looking at the inconsistencies in older content, it is hard not to say: “Oh, look. That semi-important footnote was treated as a generic author note rather than a conflict-of-interest statement; let’s fix that.” Once you start down that path, you can spend years fiddling with stuff. For me, a key mantra was: “We now have access to the content. We can always do another Crossref metadata update if things change or shift over time.”&lt;/p>
&lt;h3 id="have-there-been-any-important-milestones-along-the-way-you-were-able-to-celebrate-or-any-set-backs-you-had-to-resolve-in-the-process">Have there been any important milestones along the way you were able to celebrate? Or any set-backs you had to resolve in the process?&lt;/h3>
&lt;p>DP: For as long as I can remember, the importance of good metadata has been among the loudest messages of best practice in the industry. I don’t think that I have been able to really quantify/ demonstrate the value of that work. Looking at the consistent increases in the Crossref monthly resolution reports that we saw between 2015 and 2022 and looking at our participation reports has helped provide some measure of progress. For example, the number of average monthly successful resolutions in that Crossref report in 2015 was ~390,000. The last time I checked, the 2022 numbers were ~ 3.7 million. In 2023, I hope that we will be able to leverage Event Data for this as well.&lt;/p>
&lt;p>The setbacks have fallen into two categories: timing and process. Our internal resourcing to get this done within our preferred time frame, to have the content loaded and delivered, and triage problems—it’s a battle between the calendar and competing priorities.&lt;/p>
&lt;p>DH: When Deborah first shared those stats with me, I was floored. I don’t think either of us suspected such an increase was possible. For me, the biggest setback was mistakenly sending about ~50,000 DOI records to queue and watching them all fail because I grabbed the wrong batch. Ooops. I never made that mistake again, though.&lt;/p>
&lt;h3 id="was-any-specific-type-of-metadata-or-any-part-of-the-schema-particularly-easy-or-particularly-difficult-to-get-right-in-asms-production-process">Was any specific type of metadata or any part of the schema particularly easy or particularly difficult to get right in ASM’s production process?&lt;/h3>
&lt;p>DH: For us, the most difficult piece of metadata revolves around data availability and how we capture linked data resources (outside of data citation resources). Because of our current editorial style (which had been print-centric for years), we did not do a good job of identifying whether there are data associated with published content in a consistent machine-readable way. We did some experiments with one of our journals to capture this outside of our normal Crossref deposit routine, but that was not as accurate or sustainable as we would have liked. But, in that experiment, we learned a few things about how we treat these data throughout our publishing process and we have plans to create a sustainable integrated workflow for this to capture resource/data linkages in our Crossref deposits.&lt;/p>
&lt;h3 id="what-were-your-thoughts-on-last-years-move-to-open-references-metadata-has-that-impacted-on-your-project-in-any-way">What were your thoughts on last year’s move to open references metadata? Has that impacted on your project in any way?&lt;/h3>
&lt;p>DP: We were really excited about this; based on the rather limited approach to sorting out impact at the moment, the more metadata we push out into the ecosystem, the more it appears to be used. In my view, that is at the core of what society publishers want to do—ensure that research is accessible and discoverable wherever our users expect to find it.&lt;/p>
&lt;p>DH: 100% agree.&lt;/p>
&lt;h3 id="how-did-you-keep-motivated-and-on-course-throughout">How did you keep motivated and on-course throughout?&lt;/h3>
&lt;p>DP: These kinds of things are never done; for example, we have placeholders for CRediT roles, and getting ready for that work as part of a DTD migration will be the next big thing. The motivation for that is really meeting our commitment to the community, seeing the impact of the author metadata versus article metadata, and seeing what we can learn.&lt;/p>
&lt;p>DH: Metadata at its core is one of the pillars of our service as a publisher. To provide the best service, we need to provide the best metadata possible. Just remembering that this can be incremental, allows us to celebrate the large moments and the small. And whether one is partying with a massive 7 layer cake or a smaller cake pop, both are sweet and motivating.&lt;/p>
&lt;h3 id="now-that-the-project-is-completed-are-you-seeing-the-benefits-you-were-hoping-to-achieve">Now that the project is completed, are you seeing the benefits you were hoping to achieve?&lt;/h3>
&lt;p>DP: This is a hard one to answer as we are using limited measurements at this time. At a high level, I am pleased. While I am eager to leverage event data in the coming year, it would be really helpful to get feedback from the community on how we can improve as well as other ways to evaluate impact.&lt;/p>
&lt;p>DH: I want to take up this idea of metadata as a service once more. I don’t mean in terms of discoverability or searchability, either. Let’s take ORCID deposited into Crossref as an example. When done properly (with the proper authentication and validation occurring in the background), we are able to integrate citation data directly to an author&amp;rsquo;s ORCID profile. We have found that this small service is really appreciated.&lt;/p>
&lt;h3 id="is-there-any-metadata-that-youd-like-to-be-able-to-include-with-your-publishing-records-in-the-future-that-isnt-possible-currently-what-would-it-be-and-why">Is there any metadata that you’d like to be able to include with your publishing records in the future that isn’t possible currently? What would it be and why?&lt;/h3>
&lt;p>DP: CRediT roles would be great because it could give greater insight into collaboration within and across disciplines, it could allow for some automation and integration opportunities in the peer review process, and maybe it would visualize aspects of authors’ careers.&lt;/p>
&lt;p>DH: I second capturing CRediT roles. What would be really interesting is also creating a standard that quantifies the accessibility conformance/rating of content and passing that into Crossref.&lt;/p>
&lt;h3 id="what-was-the-key-lesson-you-learned-from-this-project">What was the key lesson you learned from this project?&lt;/h3>
&lt;p>DP: Incremental change can be just as challenging as a massive overhaul, and so it’s important to reevaluate your goals along the way—things always change. There have been cases where we were able to do things that we hadn’t initially thought were feasible.&lt;/p>
&lt;p>DH: Always keep the larger goal in mind and remember that any project can birth a new project. Everything does not happen at once.&lt;/p>
&lt;h3 id="whats-your-next-big-challenge-for-2023">What’s your next big challenge for 2023?&lt;/h3>
&lt;p>DP: There is a lot to contend with in the industry right now, and in addition to that we are going through some serious infrastructure changes in our program. With all that madness comes many opportunities. For that reason, when I take a step back from the tactical implications of all that and what we are interested in doing, I think our biggest challenge in 2023 will be identifying what has made an impact and why.&lt;/p>
&lt;p>DH: In the short-term, it is making sure that none of our production process changes has negatively affected the past metadata work we spent so much time honing. Once that settles down, it will be determining the best way forward from a publishing perspective in handling true versioning and capturing accurate event data.&lt;/p>
&lt;h3 id="based-on-your-experience-what-would-be-your-advice-for-colleagues-from-other-scholarly-publishing-organisations">Based on your experience, what would be your advice for colleagues from other scholarly publishing organisations?&lt;/h3>
&lt;p>DP: It can seem daunting, but the small wins can create momentum and do not have to be expensive. Remembering that your publishing program benefits as much as everyone else’s when you deposit more metadata can help refine your short-term and long-term priorities.&lt;/p>
&lt;p>DH: Don’t be afraid of making a mess of things. Messes are okay. They aren’t risky. They just reveal the clutter. And clutter gives one reason to clean things up.&lt;/p>
&lt;p>THANK YOU for the interview!&lt;/p>
&lt;hr>
&lt;h3 id="about-the-american-society-for-microbiology">About the American Society for Microbiology&lt;/h3>
&lt;p>The American Society for Microbiology is one of the largest professional societies dedicated to the life sciences and is composed of 30,000 scientists and health practitioners. ASM&amp;rsquo;s mission is to promote and advance the microbial sciences.&lt;/p>
&lt;p>ASM advances the microbial sciences through conferences, publications, certifications and educational opportunities. It enhances laboratory capacity around the globe through training and resources. It provides a network for scientists in academia, industry and clinical settings. Additionally, ASM promotes a deeper understanding of the microbial sciences to diverse audiences.
For more information about ASM visit &lt;a href="https://asm.org/" target="_blank">asm.org&lt;/a>.&lt;/p></description></item><item><title>In the know on workflows: The metadata user working group</title><link>https://www.crossref.org/blog/in-the-know-on-workflows-the-metadata-user-working-group/</link><pubDate>Tue, 28 Feb 2023 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/in-the-know-on-workflows-the-metadata-user-working-group/</guid><description>&lt;p>What’s in the metadata matters because it is So.Heavily.Used.&lt;/p>
&lt;p>You might be tired of hearing me say it but that doesn’t make it any less true. Our open APIs now see over 1 &lt;em>billion&lt;/em> queries per month. The metadata is ingested, displayed and redistributed by a vast, global array of systems and services that in whole or in part are often designed to point users to relevant content. It’s also heavily used by researchers, who author the content that is described in the metadata they analyze. It’s an interconnected supply chain of users large and small, occasional and entirely reliant on regular querying.&lt;/p>
&lt;h2 id="tldr">Tl;dr&lt;/h2>
&lt;p>Crossref recently wrapped up our first &lt;a href="https://www.crossref.org/working-groups/metadata-user/" target="_blank">Working Group&lt;/a> for users of the metadata, a group that plays a key role in discoverability and the metadata supply chain. You can jump directly to the &lt;a href="#what-are-the-recommendations">stakeholder-specific recommendations&lt;/a> or take a moment to share your &lt;a href="https://docs.google.com/forms/d/1bHO7gGYC-HqObkXgD5xrSUIjE-m93cTZ8Bp1RBJXgwo/edit" target="_blank">use case&lt;/a> or &lt;a href="mailto:feedback@crossref.org">feedback&lt;/a>.&lt;/p>
&lt;h2 id="why-a-metadata-user-group-why-now">Why a metadata user group? Why now?&lt;/h2>
&lt;p>A majority of Crossref metadata users rely on our &lt;a href="https://www.crossref.org/services/metadata-retrieval/" target="_blank">free, open APIs&lt;/a> and many are anonymous. A small but growing group of users pay for a &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/" target="_blank">guaranteed service level option&lt;/a> and while their individual needs and feedback have long been integrated into Crossref’s work, as a group they provide a window into the workflows and use cases for the metadata of the scholarly record. As this use grows in &lt;a href="https://www.crossref.org/strategy/" target="_blank">strategic importance&lt;/a>, to both Crossref and the wider community, it was clear that we might be overdue for a deeper dive into user workflows.&lt;/p>
&lt;p>In 2021, we surveyed these subscribers for their feedback and brought together a few volunteers over a series of 5 calls to dig into a number of topics specific to regular users of metadata. This group, the first primarily non-member &lt;a href="https://www.crossref.org/working-groups/metadata-user/" target="_blank">working group&lt;/a> at Crossref, wrapped up in December 2022, and we are grateful for their time:&lt;/p>
&lt;ul>
&lt;li>Achraf Azhar, Centre pour la Communication Scientifique Directe (CCSD)&lt;/li>
&lt;li>Satam Choudhury, HighWire Press&lt;/li>
&lt;li>Nees Jan van Eck, CWTS-Leiden University&lt;/li>
&lt;li>Bethany Harris, Jisc&lt;/li>
&lt;li>Ajay Kumar, Nova Techset&lt;/li>
&lt;li>David Levy, Pubmill&lt;/li>
&lt;li>Bruno Ohana, biologit&lt;/li>
&lt;li>Michael Parkin, European Bioinformatics Institute (EMBL-EBI)&lt;/li>
&lt;li>Axton Pitt, Litmaps&lt;/li>
&lt;li>Dave Schott, Copyright Clearance Center (CCC)&lt;/li>
&lt;li>Stephan Stahlschmidt, German Centre for Higher Education Research and Science Studies (DZHW)&lt;/li>
&lt;/ul>
&lt;p>This post is intended to summarize the work we did, to highlight the role of metadata users in research communications, to provide a few ideas for future efforts and, crucially, to get your feedback on the findings and recommendations. Though this particular group set out to meet for a limited time, we hope this report helps facilitate ongoing conversations with the user community.&lt;/p>
&lt;h2 id="survey-highlights">Survey Highlights&lt;/h2>
&lt;p>If you’re looking for an easy overview of users and use cases, here’s a great starting point.&lt;/p>
&lt;figure>&lt;img src='https://www.crossref.org/images/documentation/metadata-users-uses.png' alt='Metadata users and uses: metadata from Crossref APIs is used for a variety of purposes by many tools and services' title='' width='75%'>&lt;/figure>
&lt;button id="show-img" type="button" class="btn btn-default" data-toggle="modal" data-target="#image32">Show image&lt;/button>
&lt;div id="image32" class="modal fade" aria-labelledby="my-modalLabel" aria-hidden="true" tabindex="-1" role="dialog">
&lt;div class="modal-dialog" data-dismiss="modal">
&lt;div class="modal-content" >
&lt;div class="modal-body">
&lt;button type="button" class="close" data-dismiss="modal" aria-hidden="true">×&lt;/button>
&lt;img src="https://www.crossref.org/images/documentation/metadata-users-uses.png" class="img-responsive" style="width: 100%;">
&lt;/div>
&lt;/div>
&lt;/div>
&lt;/div>
&lt;p>If you interpret this graphic to mean that there is a lot of variety centered on a few high level use cases, the survey and our experiences with users certainly supports that. A few key takeaways from the 2021 survey may be useful context:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Frequency of use&lt;/strong>: At least 60% of respondents query metadata on a daily basis&lt;/li>
&lt;li>&lt;strong>Use cases&lt;/strong>
&lt;ul>
&lt;li>Finding and enhancing metadata as well as using it for general discovery are all common use cases&lt;/li>
&lt;li>For most users, matching DOIs and citations is a common need but for a significant group, it is their primary use case&lt;/li>
&lt;li>Analyzing the corpus for research was a consistent use case for 13% of respondents&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;strong>Metadata of particular interest&lt;/strong>
&lt;ul>
&lt;li>Abstracts are the most desirable non-bibliographic metadata, followed by affiliation information, including RORs
&lt;ul>
&lt;li>Some other elements (beyond citation information) that respondents find useful are:
&lt;ul>
&lt;li>Corrections and retractions&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/documentation/principles-practices/best-practices/relationships/">Relationship metadata&lt;/a>&lt;/li>
&lt;li>Book chapters&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.64000/tynar-j7a72" target="_blank">Grant information&lt;/a>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>NB: The survey did not ask about references but we are frequently asked why they’re not included more often.&lt;/p>
&lt;p>It’s also worth noting that about a third of respondents said that correct metadata is more important to them than any particular element.&lt;/p>
&lt;p>There is more to this survey that isn’t covered here but it was kept fairly short to help with the response rate. Knowing we would have some focused time to discuss issues too numerous or nuanced to reasonably address in a survey, we compiled a long list of questions and topics for the Working Group then followed up with a second, more detailed survey to kick off the meeting series.&lt;/p>
&lt;h2 id="what-we-set-out-to-address">What we set out to address&lt;/h2>
&lt;p>We had three primary goals for this Working Group:&lt;/p>
&lt;ul>
&lt;li>Highlight the efforts of metadata users in enabling discovery and discoverability&lt;/li>
&lt;li>Determine direction(s) for improved engagement&lt;/li>
&lt;li>Inform the Crossref product development &lt;a href="https://trello.com/b/02zsQaeA/crossref-roadmap" target="_blank">roadmap&lt;/a> for &lt;a href="https://www.crossref.org/services/metadata-retrieval/">metadata retrieval services&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Of course, everyone involved had some questions and topics of interest to cover, including (but not limited to):&lt;/p>
&lt;ul>
&lt;li>Understanding publisher workflows&lt;/li>
&lt;li>How best to introduce changes, e.g. for a high volume of updated records&lt;/li>
&lt;li>Understanding the Crossref schema&lt;/li>
&lt;li>Query efficiencies, i.e. ‘tips and tricks’ (here for the REST API)&lt;/li>
&lt;li>Which scripts, tools and/or programs are used in workflows&lt;/li>
&lt;li>What other metadata sources are used&lt;/li>
&lt;li>What kind of normalization or processing is done on ingest&lt;/li>
&lt;li>How metadata errors are handled&lt;/li>
&lt;/ul>
&lt;h2 id="what-did-we-learn">What did we learn?&lt;/h2>
&lt;p>&lt;strong>Workflows&lt;/strong>&lt;br>
I started with the admittedly ambitious goal of collecting a library of workflows. After a few years of working with users, I learned never to assume what a user was doing with the metadata, why or how. For example, some subscribers use Plus snapshots (a monthly set of all records), regularly or occasionally and some don’t use them at all. Understanding why users make the choices they do is always helpful.&lt;/p>
&lt;p>In my experience, workflows are frequently characterized as “set it and forget it.” It’s hard to know how often and how easily they might be adapted when, for example, a new record type like peer review reports becomes available. So, it’s worth exploring when and how to highlight to users changes that might be of interest.&lt;/p>
&lt;p>As it turned out, half the group had their workflows mostly or fully documented. The rest are partially documented, not documented at all or the availability of documentation was unknown. Helping users document their workflows, to the extent possible, should be a mutually beneficial effort to explore going forward. We&amp;rsquo;re doing similar work with the aim of making ours more transparent and replicable.&lt;/p>
&lt;p>&lt;strong>Feedback on subscriber services&lt;/strong>&lt;br>
User feedback might be the most obvious and directly consequential work of this group, at least for Crossref - understanding how well the services used meet their needs and what might be improved.&lt;/p>
&lt;p>One frequent suggestion for improvement is faster response time on queries. This is an area we’ve focused on for some time, because refining queries to be &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-the-crossref-rest-api/">more efficient&lt;/a> is often the most straightforward way to improve response times and one reason for the emphasis on workflows.&lt;/p>
&lt;p>We also discussed the possibility of whether or how to notify users of changes of interest. Just defining “change” is complex since they are so frequent and may often be considered very minor. We’ve been experimenting a bit over the past few years with notifying these users in cases where we’re aware of upcoming large volumes of changes, which is sometimes the case when landing page URLs are updated due to a platform change, for example. It was incredibly useful to discuss with the group what volume of records would be a useful threshold to trigger a notification (100K if you’re curious).&lt;/p>
&lt;p>But perhaps the most common feedback we get from all users is on the metadata itself and the myriad &lt;a href="https://doi.org/10.64000/6jvdm-fmy29" target="_blank">quality issues&lt;/a> involved. The group spent a fair amount of time discussing how this affects their work and shared a few examples of notable concerns:&lt;/p>
&lt;ul>
&lt;li>Author name issues, e.g. ‘Anonymous’ is an option for authors but that or things like ‘n/a’ are sometimes used in surname fields&lt;/li>
&lt;li>Invalid DOIs are sometimes found in reference lists&lt;/li>
&lt;li>Garbled characters from text not rendering properly&lt;/li>
&lt;li>Affiliation information is often not included or incomplete (e.g. doesn’t include RORs)&lt;/li>
&lt;li>Inconsistencies in commonly included information, e.g. ISSNs&lt;/li>
&lt;/ul>
&lt;p>It’s worth noting that a common misunderstanding - not just among users - is &lt;a href="https://www.crossref.org/documentation/schema-library/required-recommended-elements/">what is required&lt;/a> in the metadata. Users nearly always expect more metadata and more consistency than is actually available. The introduction of &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> a few years ago was a very useful start to what is an ongoing discussion about the variable nature of metadata quality and completeness.&lt;/p>
&lt;p>&lt;strong>Users in the metadata supply chain&lt;/strong>&lt;br>
A few years ago, our colleague Joe Wass used &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> to put together this chart of &lt;a href="https://doi.org/10.64000/r69t9-bcr78" target="_blank">referrals from non-publisher sources&lt;/a> in 2015.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/05/month-top-10-filtered-domains-1.png" alt="month-top-10-filtered-domains" class="img-responsive" />
&lt;p>The role of metadata users in discoverability of content is key in my view and one that often doesn’t get enough attention, especially given that the systems and services that use this information often use it to point their own users to relevant resources. And because they work so closely with the metadata, users frequently report errors and so serve as a sort of de facto quality control. So, unfortunately, the effects of incomplete or incorrect metadata on these users might be the most powerful way to highlight the need for more and better metadata.&lt;/p>
&lt;h2 id="what-are-the-recommendations">What are the recommendations?&lt;/h2>
&lt;p>In discussions with the Working Group, a few themes emerged, largely around best practices, which, by their nature, tend to be aspirational.&lt;/p>
&lt;p>If you’re not already familiar with the &lt;a href="https://metadata2020.org/resources/metadata-personas/" target="_blank">personas&lt;/a> and &lt;a href="https://metadata2020.org/resources/metadata-practices/" target="_blank">Best Practices&lt;/a> and &lt;a href="https://metadata2020.org/resources/metadata-principles/" target="_blank">Principles&lt;/a> of Metadata 2020, that is a useful starting point (I am admittedly biased here!) and many are echoed in the following recommendations:&lt;/p>
&lt;p>&lt;strong>For users:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Document and periodically review workflows&lt;/li>
&lt;li>Report errors to members or to &lt;a href="mailto:support@crossref.org">Crossref support&lt;/a> and reflect corrections when they’re made (metadata and content)&lt;/li>
&lt;li>Understand what &lt;a href="https://www.crossref.org/documentation/schema-library/required-recommended-elements/">is and isn’t&lt;/a> in the metadata&lt;/li>
&lt;li>Follow &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-the-crossref-rest-api/">best practices&lt;/a> for using APIs&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>For Crossref:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Define a set of metadata changes, e.g. to affiliations, to further the discussion around thresholds for notifying users of ‘high volumes’ of changes&lt;/li>
&lt;li>Provide an output schema.&lt;/li>
&lt;li>Continue refining the input schema to include information like preprint server name, journal article sub types (research article, review article, letter, editorial, etc.), corresponding author flags, raw funding statement texts, provenance information, etc.&lt;/li>
&lt;li>Collaborate on improving processes for reporting &lt;a href="https://doi.org/10.64000/6jvdm-fmy29" target="_blank">metadata errors&lt;/a> and making corrections and enhancements&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>For metadata providers (publishers, funders and their service providers):&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Follow &lt;a href="https://metadata2020.org/learn-more/outcomes/" target="_blank">Metadata 2020 Metadata Principles and Practices&lt;/a>&lt;/li>
&lt;li>Consistency is important, e.g. using the same, correct relationship for preprint to VoR links for all records
&lt;ul>
&lt;li>Workarounds such as putting information into a field that is ‘close’ but not meant for it can be considered a kind of error&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Understand the roles and needs of users in amplifying your outputs&lt;/li>
&lt;li>Respond promptly to reports of metadata errors&lt;/li>
&lt;li>Whenever possible, provide PIDs (ORCID IDs, ROR IDs, etc.) in addition to (not as a substitute for) textual metadata&lt;/li>
&lt;/ul>
&lt;h2 id="what-is-still-unclear-or-unfinished">What is still unclear or unfinished?&lt;/h2>
&lt;p>Honestly, a lot. We knew from the outset that the group would conclude with much more work to be done, in part because there is so much variety under the umbrella of metadata users and many answers lead to more questions and in part because the metadata and the user community will continue to evolve. Even without a standing group that meets regularly, it’s very much an ongoing conversation and we invite you to join it.&lt;/p>
&lt;h2 id="now-its-your-turncan-you-help-fill-in-the-blanks">Now it’s your turn–can you help fill in the blanks?&lt;/h2>
&lt;p>Does any or all of this resonate with you? Do you take exception to any of it? Do you have suggestions for continuing the conversation?&lt;/p>
&lt;p>Specifically, can you help fill in any of the literal blanks? We&amp;rsquo;ve prepared a &lt;a href="https://docs.google.com/forms/d/1bHO7gGYC-HqObkXgD5xrSUIjE-m93cTZ8Bp1RBJXgwo/edit" target="_blank">short survey&lt;/a> that we hope can serve as a template for collecting (anonymous) workflows. Please take just a few minutes to answer a few short questions such as how often you query for metadata.&lt;/p>
&lt;p>If you are willing to share examples of your queries or have questions or further comments, please &lt;a href="mailto:feedback@crossref.org">get in touch&lt;/a>.&lt;/p></description></item><item><title>Perspectives: Mohamad Mostafa on scholarly communications in UAE</title><link>https://www.crossref.org/blog/perspectives-mohamad-mostafa-on-scholarly-communications-in-uae/</link><pubDate>Mon, 27 Feb 2023 00:00:00 +0000</pubDate><author>Mohamad Mostafa</author><guid>https://www.crossref.org/blog/perspectives-mohamad-mostafa-on-scholarly-communications-in-uae/</guid><description>&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2022/perspectives.png" alt="sound bar logo" width="150px" class="img-responsive" />
&lt;/div>
&lt;p>Our Perspectives blog series highlights different members of our diverse, global community at Crossref. We learn more about their lives and how they came to know and work with us, and we hear insights about the scholarly research landscape in their country, the challenges they face, and their plans for the future.&lt;/p>
&lt;div style="text-align: right">
تسلط سلسلة مدونة توقعات - وجهات نظر الخاصة بنا الضوء على أعضاء مختلفين من مجتمعنا العالمي المتنوع في كروس رف .نتعلم المزيد عن حياتهم وكيف تعرفوا وعملوا معنا، ونسمع رؤى حول مشهد البحث العلمي في بلدهم، والتحديات التي يواجهونها، وخططهم للمستقبل.
&lt;/div> &lt;br>
&lt;p>As we continue with our Perspectives blog series, today, we meet Mohamad Mostafa, Crossref Ambassador in the UAE and Production Manager at Knowledge E. Mohamad is passionate about helping improve the discoverability of research through rich metadata. We invite you to read and listen to what Mohamad has to say!&lt;/p>
&lt;div style="text-align: right">
بينما نواصل سلسلة مدونة توقعات - وجهات نظر الخاصة بنا، نلتقي اليوم مع محمد مصطفى، سفير كروس رف في الإمارات العربية المتحدة ومدير الإنتاج في نوليدج اي . محمد متحمس للمساعدة في تحسين إمكانية اكتشاف البحث من خلال البيانات الوصفية الغنية. ندعوكم لقراءة ما يقوله محمد والاستماع إليه!
&lt;/div> &lt;br>
&lt;div
id="video-player-blog-perspectives-mohamad-mostafa"
class="vimeo-player-container"
data-player-id="blog-perspectives-mohamad-mostafa"
data-default-video="895597454"
>
&lt;div class="video-languages">
&lt;a type="button" style="cursor:pointer;" class="video-language-button" data-videoid="895597454" data-playerid="video-player-blog-perspectives-mohamad-mostafa">English&lt;/a>
&lt;a type="button" style="cursor:pointer;" class="video-language-button" data-videoid="895597913" data-playerid="video-player-blog-perspectives-mohamad-mostafa">عربي&lt;/a>
&lt;/div>
&lt;div id="vimeo-blog-perspectives-mohamad-mostafa">&lt;/div>
&lt;/div>
&lt;p>&lt;br>&lt;br>&lt;/p>
&lt;blockquote>
&lt;p>Tell us a bit about your organisation, your objectives, and your role&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>أخبرنا قليلاً عن مؤسستك وأهدافك ودورك&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
My name is Mohamad Mostafa, and I am the Production Manager at Knowledge E. Within our publishing program, we publish around 2000 articles across 13 titles that are fully Open Access, which is something that I really value.
&lt;div style="text-align: right">
اسمي محمد مصطفى، وأنا مدير الإنتاج في نولدج إي. ضمن برنامج النشر الخاص بنا، ننشر حوالي 2000 مقالة عبر 13 عنوانًا مفتوح الوصول بالكامل، وهو أمر أقدره حقًا.
&lt;/div>&lt;br>
In a world that’s moving faster than ever, the availability, quality, and pursuit of knowledge are fundamental for advancement. Knowledge E, in line with its vision of developing a more knowledgeable world, helps institutions advance the quality of their research; move towards teaching excellence; upgrade library technology, services, and practices; and advance scholarship through journal publication, management, and training. In other words, it works with higher education institutions, research centres, ministries, publishers, and scholars to solve our society’s most significant challenges.
&lt;div style="text-align: right">
في عالم يتحرك بشكل أسرع من أي وقت مضى، يعد توافر المعرفة وجودتها والسعي وراءها أمورًا أساسية للتقدم. إن نوليدج إي، تماشياً مع رؤيتها لتطوير عالم أكثر معرفة ودراية، تساعد المؤسسات على تحسين جودة أبحاثها؛ التحرك نحو التميز في التدريس؛ ترقية مكتباتها الرقمية والخدمات والممارسات المتعلقة بها؛ ودعم المنح الدراسية المتقدمة من خلال نشر المجلات وإدارتها والتدريب. بمعنى آخر، تعمل شركة نولدج إي مع مؤسسات التعليم العالي ومراكز البحث والوزارات والناشرين والعلماء لحل أهم التحديات التي تواجه مجتمعنا.&lt;/div>&lt;br>
&lt;p>I am also a Crossref Ambassador. As part of the ambassador program, we aim to raise awareness about Crossref services among librarians, publishers, editors, and authors in the Middle East and North Africa region. As part of this, we run workshops in English and Arabic, emphasizing the importance of comprehensive metadata and persistent identifiers. We also help research communities improve their understanding of how to use Crossref services. The importance of making regional research objects easy to find, cite and reuse encouraged me to join the ambassador program.&lt;/p>
&lt;div style="text-align: right">
أنا أيضًا سفير كروس رف. كجزء من برنامج السفراء، نهدف إلى زيادة الوعي حول خدمات Crossref بين أمناء المكتبات والناشرين والمحررين والمؤلفين في منطقة الشرق الأوسط وشمال إفريقيا. وكجزء من هذا، فإننا ندير ورش عمل باللغتين الإنجليزية والعربية، للتأكيد على أهمية البيانات الوصفية الشاملة والمعرفات المستمرة. نحن أيضًا نساعد مجتمعات البحث على تحسين فهمهم لكيفية استخدام خدمات .Crossref شجعتني أهمية تسهيل العثور على عناصر البحث الإقليمية والاستشهاد بها وإعادة استخدامها على الانضمام إلى برنامج سفراء كروس رف.&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>What is one thing that others should know about your country and its research activity?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>ما هو الشيء الذي يجب أن يعرفه الآخرون عن بلدك ونشاطه البحثي؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
A lot of regional research is being produced (in Arabic) and even without proper infrastructure (the lack of language support within the international publishing ecosystems such as peer review systems, indexes, citations databases, submissions systems, etc.) and the inadequate awareness about the various services (such as Crossref solutions) that can help with the discoverability and visibility of this research, the Arab region is increasingly recognised as a global leader in research outputs. Generally, these are some of the challenges and frustrations associated with the MENA (Middle East/North Africa) region.
&lt;div style="text-align: right">
يتم إنتاج الكثير من الأبحاث الإقليمية (باللغة العربية) وحتى بدون بنية تحتية مناسبة (نقص الدعم اللغوي داخل أنظمة النشر الدولية مثل أنظمة مراجعة الأقران، والفهارس، وقواعد بيانات الاستشهادات، وأنظمة التقديم، وما إلى ذلك) وعدم كفاية الوعي حول الخدمات المختلفة (مثل حلول(Crossref التي يمكن أن تساعد في اكتشاف هذه البحوث وإبرازها، يتم الاعتراف بالمنطقة العربية بشكل متزايد كرائد عالمي في مخرجات البحث. بشكل عام، هذه بعض التحديات والإحباطات المرتبطة بمنطقة الشرق الأوسط وشمال إفريقيا.
&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>Are there trends in scholarly communications that are unique to your part of the world?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>هل توجد اتجاهات في الاتصالات العلمية فريدة من نوعها في الجزء الذي تعيش فيه من العالم؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
In general, Open Access and Open Research are getting more and more attention in our region currently. We have recently launched the Forum for Open Research in MENA to raise awareness about all the new scholarly communications trends and support the Middle East and North Africa movement towards Open Science.
&lt;div style="text-align: right">
بشكل عام، يحظى الوصول الحر والبحث المفتوح باهتمام متزايد في منطقتنا حاليًا. لقد أطلقنا مؤخرًا منتدى الأبحاث المفتوحة في منطقة الشرق الأوسط وشمال إفريقيا لزيادة الوعي حول الاتصالات العلمية الجديدة ودعم حركة الشرق الأوسط وشمال إفريقيا نحو العلوم المفتوحة.
&lt;/div>&lt;br>
The Forum for Open Research in MENA (FORM) is a non-profit membership organisation supporting the advancement of open science policies and practices in research communities and institutions across the Arab world.
&lt;div style="text-align: right">
منتدى البحوث المفتوحة في الشرق الأوسط وشمال إفريقيا (FORM) هو منظمة غير ربحية ذات عضوية تدعم النهوض بسياسات وممارسات العلوم المفتوحة في المجتمعات والمؤسسات البحثية في جميع أنحاء العالم العربي.
&lt;/div>&lt;br>
We believe the Arab world has the resources and capability to play a pivotal role in the global transition towards more accessible, sustainable, and inclusive research and education models. And we want to support all our research communities and stakeholder groups in the journey towards a more ‘open’ world. Our vision is to help unlock research for and in the Arab world. Our mission is to support the advancement of open science practices in research libraries and universities across the Arab world by facilitating the exchange of actionable insights and developing practical policies.
&lt;div style="text-align: right">
نعتقد أن العالم العربي لديه الموارد والقدرة على لعب دور محوري في التحول العالمي نحو نماذج بحث وتعليم أكثر سهولة واستدامة وشمولية. ونريد دعم جميع مجتمعاتنا البحثية ومجموعات أصحاب المصلحة في رحلتنا نحو عالم أكثر "انفتاحًا". رؤيتنا هي دعم الوصول الحر والبحوث المفتوحة في العالم العربي. ومهمتنا هي دعم تقدم ممارسات العلوم المفتوحة في مكتبات البحث والجامعات في جميع أنحاء العالم العربي من خلال تسهيل تبادل الأفكار القابلة للتنفيذ وتطوير السياسات العملية.
&lt;/div>&lt;br>
Our first Annual Forum was held in Cairo in October 2022 (as part of the global Open Access Week initiative). The event was a huge success, with over 1,100 delegates from over 48 countries across the globe. The next Annual Forum will be hosted in the UAE in October 2023, and details will be available shortly on our website.
&lt;div style="text-align: right">
عقد المنتدى السنوي الأول في القاهرة في أكتوبر 2022 (كجزء من مبادرة أسبوع الوصول الحر العالمي). حقق الحدث نجاحًا كبيرًا، حيث حضره أكثر من 1100 مندوب من أكثر من 48 دولة حول العالم. سيتم استضافة المنتدى السنوي القادم في دولة الإمارات العربية المتحدة في أكتوبر 2023، وستتوفر التفاصيل قريبًا على موقعنا.
&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>How would you describe the value of being part of the Crossref community; what impact has your participation had on your goals?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>كيف تصف قيمة أن تكون جزءًا من مجتمعCrossref ؟ ما هو تأثير مشاركتك على أهدافك؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
I have been a Crossref ambassador for more than 5 years now, and I can really say that it has been a great experience being part of such an amazing and collaborative community. We got the chance to interact with different publishers and service providers and participate in different Crossref annual events. It’s also perfectly aligned with our vision of supporting Open Research.
&lt;div style="text-align: right">
لقد كنت سفيرًا لـ Crossref لأكثر من 5 سنوات حتى الآن، ويمكنني حقًا أن أقول إنها كانت تجربة رائعة أن أكون جزءًا من هذا المجتمع المذهل والتعاوني. لقد أتيحت لنا الفرصة للتفاعل مع مختلف الناشرين ومقدمي الخدمات والمشاركة في الأحداث السنوية المختلفة لـ .Crossref كما أنه يتماشى تمامًا مع رؤيتنا لدعم البحث المفتوح.
&lt;/div>&lt;br>
Recently, we have delivered a series of three Arabic webinars that offered basic metadata information and advanced insights about the role of metadata and how Crossref services can help an institution. These webinars have been well received by the community of regional publishers, university presses, and librarians. Dozens of questions have been answered, and technical enquires have been resolved. It was a great experience, and it was good to see that kind of interest in our community. Also, more educational webinars are yet to come!
&lt;div style="text-align: right">
قدمنا مؤخرًا سلسلة من ثلاث ندوات عربية عبر الإنترنت تمحورت حول معلومات البيانات الوصفية الأساسية ورؤى متقدمة حول دور البيانات الوصفية وكيف يمكن لخدمات Crossref أن تساعد المؤسسات البحثية. لقيت هذه الندوات عبر الإنترنت استحسان مجتمع الناشرين الإقليميين دور النشر الجامعية وأمناء المكتبات. تمت الإجابة على عشرات الأسئلة، وتم الرد على الاستفسارات الفنية. لقد كانت تجربة رائعة، وكان من المفرح أن نرى هذا النوع من الاهتمام في مجتمعنا. بالإضافة إلى ذلك، سيتم تقديم المزيد من الندوات التعليمية على الإنترنت في المستقبل.
&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>For you, what would be the most important thing Crossref could change (do more of/do better in)?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>بالنسبة لك، ما هو الشيء الأكثر أهمية الذي يمكن لـ Crossref تغييره (القيام بالمزيد / القيام بعمل أفضل في)؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
&lt;p>Language is still a barrier in some parts of the Arab region, so producing more educational content in different formats (webinars, flyers, videos with subtitles, etc.) would be highly appreciated here. &lt;/p>
&lt;div style="text-align: right">
لا تزال اللغة تشكل حاجزًا في بعض المناطق العربية، لذا سيكون إنتاج المزيد من المحتوى التعليمي بتنسيقات مختلفة (ندوات عبر الإنترنت، ونشرات، ومقاطع فيديو مع ترجمة، وما إلى ذلك) موضع تقدير كبير هنا.
&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>Which other organisations do you collaborate with or are pivotal to your work in open scholarship?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>ما هي المنظمات الأخرى التي تتعاون معها أو التي تلعب دورًا محوريًا في عملك في مجال الابحاث المفتوحة؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
We work closely with ORCiD and invite them to our events, support DOAJ via our
charitable Foundation, and rely heavily on PKP products mainly the Open Journal
Systems (OJS) with plans to expand and start using Open Monograph Press (OMP).
&lt;div style="text-align: right">
إننا نعمل عن كثب مع ORCiD ونقدر دعمهم لفاعلياتنا، كما ندعم DOAJ عبر موقعنا ومؤسستنا الخيرية، ونعتمد بشكل كبير على منتجات مشروع المعرفة العامة وخاصة المجلة المفتوحة أنظمة (OJS) كما أننا نود التوسع والبدء في استخدام Open Monograph Press (OMP).
&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>What are the post-pandemic challenges/hopes you are facing and how are you adapting to them/what you’re looking forward to?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>ما هي التحديات / الآمال التي تواجهها في فترة ما بعد الجائحة وكيف تتكيف معها / ما الذي تتطلع إليه؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
We aim for more face-to-face meetings and onsite workshops/conferences as
the world opens up again. In addition, we have launched the Forum for Open Research
in MENA (FORM) (a non-profit membership organisation supporting the advancement of Open Science policies and practices in research communities and institutions across the Arab region.)
&lt;div style="text-align: right">
نحن نهدف إلى المزيد من الاجتماعات وجهًا لوجه وورش العمل / المؤتمرات. بالإضافة إلى ذلك، أطلقنا منتدى البحث المفتوحMENA (FORM) ، وهي منظمة غير ربحية ذات عضوية تدعم النهوض بسياسات وممارسات العلوم المفتوحة في مجتمعات ومؤسسات البحث في جميع أنحاء المنطقة العربية.
&lt;/div>&lt;br>
A catalyst for positive action, we work with key stakeholders to develop and implement a pragmatic programme to facilitate the transition toward more accessible, inclusive, and sustainable research and education models in the Arab region. Our driving focus is on building the resources, the membership, the organisational structures, and the broader community to support the advancement of Open Science in research communities and research institutions across the Arab world.
&lt;div style="text-align: right">
كمحفز للعمل الإيجابي، نحن نعمل مع أصحاب المصلحة الرئيسيين للتطوير وتنفيذ برنامج عملي لتسهيل الانتقال نحو المزيد من نماذج البحث والتعليم الشاملة والمستدامة والتي يسهل الوصول إليها في المنطقة العربية. ينصب تركيزنا الدافع على بناء الموارد، والعضوية، والهياكل التنظيمية، والمجتمع الأوسع لدعم تقدم العلوم المفتوحة في المجتمعات البحثية والمؤسسات البحثية عبر العالم العربي.
&lt;/div>&lt;br>
&lt;p>Following the huge success of our 2022 Annual Forum (held in Cairo with the support and endorsement of UNESCO and the Egyptian Knowledge Bank), which attracted over 1100 delegates from 48 countries, our 2023 Annual Forum will be held in Abu Dhabi in the UAE. For more details about the event and the call for papers, see our website: &lt;a href="https://forumforopenresearch.com" target="_blank">https://forumforopenresearch.com&lt;/a>&lt;/p>
&lt;div style="text-align: right">
&lt;p>بعد النجاح الكبير لمنتدى 2022 السنوي (الذي عقد في القاهرة مع دعم وتأييد اليونسكو وبنك المعرفة المصري)، التي اجتذبت أكثر من 1100 مندوب من 48 دولة، المنتدى السنوي لعام 2023 سيعقد في أبو ظبي في دولة الإمارات العربية المتحدة. لمزيد من التفاصيل حول الحدث والدعوة للمشاركة، راجع موقعنا على الإنترنت:&lt;a href="https://forumforopenresearch.com" target="_blank">https://forumforopenresearch.com&lt;/a>&lt;/p>
&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>What are your plans for the future?&lt;/p>
&lt;/blockquote>
&lt;div style="text-align: right">
&lt;blockquote>
&lt;p>ما هي خططك المستقبلية؟&lt;/p>
&lt;/blockquote>
&lt;/div>&lt;br>
Keep working with different global and regional stakeholders to help the transition of our region towards Open Science.
&lt;div style="text-align: right">
استمر في العمل مع مختلف الشركاء العالميين والإقليميين للمساعدة في انتقال منطقتنا العربية نحو العلوم المفتوحة.
&lt;/div>&lt;br>
&lt;br>
Thank you, Mohamad!
&lt;div style="text-align: right">
شكرا لك يا محمد!
&lt;/div>&lt;br></description></item><item><title>The more the merrier, or how more registered grants means more relationships with outputs</title><link>https://www.crossref.org/blog/the-more-the-merrier-or-how-more-registered-grants-means-more-relationships-with-outputs/</link><pubDate>Wed, 22 Feb 2023 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/the-more-the-merrier-or-how-more-registered-grants-means-more-relationships-with-outputs/</guid><description>&lt;p>One of the main motivators for funders registering grants with Crossref is to simplify the process of research reporting with more automatic matching of research outputs to specific awards. In March 2022, we developed a simple approach for linking grants to research outputs and &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">analysed how many such relationships could be established&lt;/a>. In January 2023, we repeated this analysis to see how the situation changed within ten months. Interested? Read on!&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>
&lt;p>The overall numbers changed a lot between March 2022 and January 2023:&lt;/p>
&lt;ul>
&lt;li>the total number of registered grants doubled (from ~38k to ~76k)&lt;/li>
&lt;li>the total numbers of relationships established between grants and research outputs quadrupled (from 21k to 92k)&lt;/li>
&lt;li>the percentage of linked grants increased substantially (from 10% to 23%)&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>Most of this growth can be attributed to one funder, the European Union. They started registering grants with us in December 2022, and:&lt;/p>
&lt;ul>
&lt;li>their grants constitute 47% of all grants registered by January 2023 and 95% of grants registered between March 2022 and January 2023&lt;/li>
&lt;li>72% of all established relationships involve their grants&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>We have further work planned both internally and with the community to consolidate and build out important relationships between funding and research outputs.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>When we started to develop, think and talk about grant registration at Crossref back in 2017, one of the key things we expected this to support was easier, more efficient, accurate analysis of research outputs funded by specific awards.&lt;/p>
&lt;p>This is backed up by conversations with funders who are keen to fill in gaps in the map of the research landscape with new data points and better quality information, search for grants, investigators, projects or organisations associated with awards and simplify the process of research reporting and with automatic matching of outputs to grants.&lt;/p>
&lt;p>This is in keeping with and informed our recent recommendations about &lt;a href="https://doi.org/10.64000/nfzyk-mfw64" target="_blank">how funding agencies can meet open science guidance using existing open infrastructure&lt;/a>, which included input from ORCID and DataCite. It&amp;rsquo;s also in keeping with &lt;a href="https://doi.org/10.64000/drsct-whk54" target="_blank">recent studies&lt;/a> on how important funding and grant metadata is to help the community use this information in their own research.&lt;/p>
&lt;p>To meet these expectations, we need not only identifiers and metadata of grants, but also relationships between them and research outputs supported by them. Unfortunately, our schema does not make it easy to directly deposit such relationships, and so there are only a handful of them available. But we wouldn&amp;rsquo;t let such a minor obstacle stop us! In March 2022 &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">we analysed the metadata of registered grants&lt;/a> and developed a simple matching approach to automatically link grants to research outputs supported by them. Back then, we were able to find 20,834 relationships, involving 17,082 research outputs and 3,858 grants (which was 10% of all registered grants).&lt;/p>
&lt;p>Now that we are seeing the accumulation of grant metadata being registered with Crossref, we have a bigger dataset to test these expectations against than we did a year ago. So we decided to do the analysis again. And the results are in, they&amp;rsquo;re open, and they&amp;rsquo;re positive. We&amp;rsquo;ll explain below. &lt;/p>
&lt;h2 id="the-methodology">The methodology&lt;/h2>
&lt;p>To spare you from having to read the old analysis in detail, here is a very brief summary of the matching methodology. To find relationships between grants and research outputs, we iterated over all registered grants, and for each grant we searched for research outputs that looked like they might have been supported by this grant. We established a relationship between a grant and a research output if one of the following three scenarios was true:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>The research output contained the DOI of the grant (deposited as the award number).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The award number in the grant was the same as the award number in the research output, the research output contained the funder ID, and one of the following was true: &lt;br>
a. Funder ID in the grant was the same as the funder ID in the research output  &lt;br>
b. Funder ID in the grant replaced or was replaced by the funder ID in the research output &lt;br>
c. Funder ID in the grant was an ancestor or the descendant of the funder ID in the research output&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The award number in the grant was the same as the award number in the research output, the research output did not contain the funder ID, and one of the following was true:&lt;br>
a. Funder name in the research output was the same as the funder name in the grant&lt;br>
b. Funder name in the research output was the same as the name of a funder that replaced or was replaced by the funder in the grant&lt;br>
c. Funder name in the research output was the same as the name of an ancestor or a descendant of the funder in the grant&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>Note that the replaced/replaced-by relationships and ancestor/descendant hierarchy are taken from the &lt;a href="https://www.crossref.org/services/funder-registry/">Funder Registry&lt;/a>.&lt;/p>
&lt;h2 id="current-results">Current results&lt;/h2>
&lt;p>Since March 2022, six additional funders have started registering grants with us. As a result, the total number of grants doubled, and the total number of established relationships between grants and research outputs, linked grants, and linked research outputs quadrupled. Here is the comparison of the total numbers of grants, established relationships, linked grants, and linked research outputs in March 2022 and in January 2023:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/overall-statistics-blog-png"
alt="Graph titled overall statistics showing the comparison of the total numbers of grants, established relationships, linked grants, and linked research outputs in March 2022 and in January 2023" width="100%">
&lt;/figure>
&lt;p>&lt;br>95% of grants registered within ten months between March 2022 and January 2023 were registered by one funder: the European Union. This suggests that this funder contributed a lot to this rapid increase in the number of established relationships. It looks like this funder&amp;rsquo;s grant metadata is of high quality and matches well the funding information given in the research outputs supported by this funder&amp;rsquo;s grants.&lt;/p>
&lt;p>Let&amp;rsquo;s also compare the breakdowns of all established relationships by the matching method:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2023/percentage-relationships-matching-method.png"
alt="Graph titled percentage of relationships by the matching method comparing the breakdowns of all established relationships by the matching method." width="100%">
&lt;/figure>
&lt;p>&lt;br>The distributions are a bit different. Currently, the percentage of relationships established based on the replaced/replaced-by relationship is much smaller than before, suggesting that newer data uses correct funder IDs instead of deprecated ones. Also, the percentage of the relationships matched by the funder ID increased from 40% to 48%, which is great, because this is the most reliable way of matching.&lt;/p>
&lt;p>And here we have the statistics broken down by grant registrants. Only funders with at least 100 registered grants are included. The table shows the number of relationships, grants, linked grants, and linked research outputs, and is sorted by the percentage of linked grants.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>funder&lt;/th>
&lt;th>relationships&lt;/th>
&lt;th>linked research outputs&lt;/th>
&lt;th>grants&lt;/th>
&lt;th>linked grants&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>European Union&lt;/td>
&lt;td>66,562&lt;/td>
&lt;td>60,630&lt;/td>
&lt;td>35,530&lt;/td>
&lt;td>12,688 (36%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Gordon and Betty Moore Foundation&lt;/td>
&lt;td>93&lt;/td>
&lt;td>92&lt;/td>
&lt;td>113&lt;/td>
&lt;td>33 (29%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Japan Science and Technology Agency (JST)&lt;/td>
&lt;td>15,584&lt;/td>
&lt;td>13,464&lt;/td>
&lt;td>9,923&lt;/td>
&lt;td>2,323 (23%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>James S. McDonnell Foundation&lt;/td>
&lt;td>519&lt;/td>
&lt;td>513&lt;/td>
&lt;td>577&lt;/td>
&lt;td>121 (21%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Melanoma Research Alliance&lt;/td>
&lt;td>188&lt;/td>
&lt;td>185&lt;/td>
&lt;td>425&lt;/td>
&lt;td>82 (19%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Muscular Dystrophy Association&lt;/td>
&lt;td>50&lt;/td>
&lt;td>50&lt;/td>
&lt;td>178&lt;/td>
&lt;td>25 (14%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Parkinson&amp;rsquo;s Foundation&lt;/td>
&lt;td>30&lt;/td>
&lt;td>29&lt;/td>
&lt;td>107&lt;/td>
&lt;td>15 (14%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Asia-Pacific Network for Global Change Research&lt;/td>
&lt;td>127&lt;/td>
&lt;td>127&lt;/td>
&lt;td>560&lt;/td>
&lt;td>70 (13%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>The ALS Association&lt;/td>
&lt;td>96&lt;/td>
&lt;td>90&lt;/td>
&lt;td>477&lt;/td>
&lt;td>58 (12%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Wellcome&lt;/td>
&lt;td>8,868&lt;/td>
&lt;td>6,436&lt;/td>
&lt;td>17,537&lt;/td>
&lt;td>1,735 (10%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Cancer Society&lt;/td>
&lt;td>19&lt;/td>
&lt;td>19&lt;/td>
&lt;td>266&lt;/td>
&lt;td>15 (6%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Templeton World Charity organisation&lt;/td>
&lt;td>2&lt;/td>
&lt;td>2&lt;/td>
&lt;td>281&lt;/td>
&lt;td>2 (0.7%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Office of Scientific and Technical Information (OSTI)&lt;/td>
&lt;td>73&lt;/td>
&lt;td>69&lt;/td>
&lt;td>8,723&lt;/td>
&lt;td>62 (0.7%)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Children&amp;rsquo;s Tumor Foundation&lt;/td>
&lt;td>1&lt;/td>
&lt;td>1&lt;/td>
&lt;td>662&lt;/td>
&lt;td>1 (0.1%)&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;br>
There are substantial differences between the percentages of linked grants from different funders. One of the newest registrants, the European Union, is at the top of the table with 36% of their grants linked to research outputs. This further confirms the high quality of the metadata registered by this member. It is worth noticing that this member is responsible for the majority of the growth reported here as they cover Horizon Europe, the European Research Council, and many other funding bodies and schemes. &lt;/br>&lt;br>
&lt;p>Why are these percentages so low for some funders? It could be caused by systematic discrepancies between the award numbers attached to the grants and those reported in research outputs. It could also be the case that most grants registered by a given funder are new grants, and the research outputs supported by them simply have not been published yet. Time will tell!  &lt;/br>&lt;/p>
&lt;h2 id="whats-next">What&amp;rsquo;s next&lt;/h2>
&lt;p>We&amp;rsquo;re dedicating lots of time in 2023 to examine, evolve, and expose the matching we do and can do at Crossref across different metadata fields. We then plan to incorporate matching improvements into our services so that everyone can benefit.&lt;/p>
&lt;p>This isn&amp;rsquo;t a standalone piece of work. As you can see, the more award metadata we have connected to grants by funders and connected to outputs by those who post or publish research, the better we&amp;rsquo;ll be able to do this. To make it easier for more funders to participate, and based on funder feedback, we&amp;rsquo;ve built &lt;a href="https://www.crossref.org/documentation/register-maintain-records/grant-registration-form/">a simple tool for members to register their grants&lt;/a>. We will also work to help incorporate grant identifiers into publishing and funder workflows, and further our discussions with the funders in our &lt;a href="https://www.crossref.org/working-groups/funders/">Funder Advisory Group&lt;/a> and the wider community, including working together with the Open Research Funders Group, the HRA, Altum, Europe PMC, the OSTP, and the ORCID Funder Interest Group. And there will be more to come as we work together to consolidate and build out important relationships between funding and outputs - for everyone.&lt;/p>
&lt;h2 id="follow-up">Follow-up&lt;/h2>
&lt;p>Every new thing takes time to get off the ground and to show evidence of its value. We&amp;rsquo;ve seen a significant step forward recently with funders joining and contributing to the &lt;a href="https://www.crossref.org/documentation/research-nexus">research nexus&lt;/a>. Publishers have been contributing funding data for years, and it&amp;rsquo;s now becoming much clearer to see how these two communities and these two sets of metadata are coming together to make research smoother and easier to manage and evaluate. If you are ready to register grants, talk about linking up your outputs, or just want to learn more about this work, we&amp;rsquo;d love to hear from you.&lt;/p></description></item><item><title>Don't take it from us: Funder metadata matters</title><link>https://www.crossref.org/blog/dont-take-it-from-us-funder-metadata-matters/</link><pubDate>Thu, 16 Feb 2023 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/dont-take-it-from-us-funder-metadata-matters/</guid><description>&lt;h2 id="why-the-focus-on-funding-information">Why the focus on funding information?&lt;/h2>
&lt;p>We are often asked who uses Crossref metadata and for what. One common use case is researchers in bibliometrics and scientometrics (among other fields) doing meta analyses on the entire corpus of records. As we pass the 10 year mark for the Funder Registry and 5 years of funders joining Crossref as members to register their grants, it’s worth a look at some recent research that focuses specifically on funding information. After all, there is funding behind so much scholarly work it seems obvious that it would be routinely documented in the scholarly record. But it often isn’t and that’s a problem. These sources make clear the need for accurate funding information and the problems that the lack of it creates.&lt;/p>
&lt;p>First, a few notes for context on these sources and the issues they discuss :&lt;/p>
&lt;ul>
&lt;li>The percent of records with funding information reached about 25% as of 2021. Not all items registered are the result of funding but surely it is much higher than 25% so there is considerable room for improvement. The authors cite publishers that omit funding information as well as those that include it routinely. Overall, society publishers are at the top of the list of those that do it well.&lt;/li>
&lt;li>Three of the four sources found problems in some cases confirming funding information from the metadata in the original sources. This initially surprised me though less so once I thought about the strange nature of metadata workflows.&lt;/li>
&lt;li>The complexity of fully and correctly acknowledging multiple sources of funding in any given publication is a recurring theme.&lt;/li>
&lt;li>All of the sources mention the need for manual work in analyzing funding and publication information.&lt;/li>
&lt;/ul>
&lt;p>The first two papers are from the same 2022 issue of &lt;em>Quantitative Science Studies&lt;/em> and are complementary.&lt;/p>
&lt;p>Alexis-Michel Mugabushaka, Nees Jan van Eck, Ludo Waltman; &lt;strong>Funding COVID-19 research: Insights from an exploratory analysis using open data infrastructures.&lt;/strong>
&lt;em>Quantitative Science Studies&lt;/em> 2022; 3 (3): 560–582. doi: &lt;a href="https://doi.org/10.1162/qss_a_00212" target="_blank">https://doi.org/10.1162/qss_a_00212&lt;/a>&lt;/p>
&lt;p>This first paper tackles the timely question of determining which funders have supported publications of COVID-19 research and compares coverage of funding data in Crossref to that in Scopus and Web of Science. Even with so much urgent attention focused on the pandemic, the authors found that only 17% of publications in the COVID-focused CORD-19 database have funding identified in their Crossref records.
We’re often asked about differences in the metadata (and &lt;a href="https://www.crossref.org/documentation/cited-by/" target="_blank">citation counts&lt;/a>) between Crossref and other sources such as Scopus. In this case, both proprietary sources studied have more funder coverage.
If you are disappointed in these results or want to learn more, I encourage you to read the authors’ recommendations for improving funding data in Crossref or &lt;a href="mailto:feedback@crossref.org">get in touch with us&lt;/a>.&lt;/p>
&lt;p>Bianca Kramer, Hans de Jonge; &lt;strong>The availability and completeness of open funder metadata: Case study for publications funded by the Dutch Research Council.&lt;/strong> &lt;em>Quantitative Science Studies&lt;/em> 2022; 3 (3): 583–599. doi: &lt;a href="https://doi.org/10.1162/qss_a_00210" target="_blank">https://doi.org/10.1162/qss_a_00210&lt;/a>&lt;/p>
&lt;p>This next paper focuses on a set of outputs funded by the &lt;a href="https://www.nwo.nl/en" target="_blank">NWO&lt;/a> (the Dutch Research Council). Since the funder is already known, the authors could look at multiple sources (Crossref and others) to see whether or where the NWO is correctly identified as the funder. This study also found better coverage than Crossref in proprietary sources like Web of Science. Knowing that not all outputs are the result of funded research, this paper provides a new and useful baseline for comparing percentages of coverage.
Discussions of research funding so often focus on the physical and life sciences so it’s very good to see that 37% of works in this study are in the humanities and social sciences.&lt;/p>
&lt;p>Borst, T., Mielck, J., Nannt, M., Riese, W. (2022). &lt;strong>Extracting Funder Information from Scientific Papers - Experiences with Question Answering.&lt;/strong> In: , et al. Linking Theory and Practice of Digital Libraries. TPDL 2022. Lecture Notes in Computer Science, vol 13541. Springer, Cham. &lt;a href="https://doi.org/10.1007/978-3-031-16802-4_24" target="_blank">https://doi.org/10.1007/978-3-031-16802-4_24&lt;/a>&lt;/p>
&lt;p>Given the considerable effort required to conduct these analyses, it’s only logical to consider automating as much of the work as possible. This next paper focuses on automatic recognition of funders in economics papers in digital libraries.
An interesting complication described here is the inclusion of funding for open access fees in acknowledgments and while the authors conclude that automated text mining of funder information performs better than manual curation, they also state that manual indexing is still necessary “for a gold standard of reliable metadata.”&lt;/p>
&lt;p>Habermann, T. (2022). &lt;strong>Funder Metadata: Identifiers and Award Numbers.&lt;/strong> &lt;a href="https://metadatagamechangers.com/blog/2022/2/2/funder-metadata-identifiers-and-award-numbers" target="_blank">https://metadatagamechangers.com/blog/2022/2/2/funder-metadata-identifiers-and-award-numbers&lt;/a>&lt;/p>
&lt;p>Finally, this concise blog post looks at &lt;a href="https://ror.org/registry/" target="_blank">RORs&lt;/a> as well as funder names and acronyms. The author shows how acronyms contribute to the need for manual analysis. He also spends some time on award numbers, which is one of the three &lt;a href="https://www.crossref.org/documentation/funder-registry/#00283">funding elements&lt;/a> publishers can (and, as we’ve seen, should) include in their metadata. Award numbers are also a focus of this work and, unfortunately, another frequent reason for additional manual work.&lt;/p>
&lt;h2 id="a-common-theme-more-metadata-needed">A common theme: More metadata needed&lt;/h2>
&lt;p>Though collectively, this research paints a fairly dim picture of the current availability, completeness and accuracy of existing funding information in publication metadata, all is not lost. This is a good opportunity to point out the value and availability of grant records since unique, persistent identifiers for grants (yes, &lt;a href="https://doi.org/10.64000/tynar-j7a72" target="_blank">DOIs for grants&lt;/a>) paired with more and better funding metadata from publishers go a very long way to realizing the vision of the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>. And it certainly would make things a whole lot easier for the researchers who use this open metadata to analyze the scholarly record for the rest of us.&lt;/p></description></item><item><title>Refocusing our Sponsors Program; a call for new Sponsors in specific countries</title><link>https://www.crossref.org/blog/refocusing-our-sponsors-program-a-call-for-new-sponsors-in-specific-countries/</link><pubDate>Mon, 06 Feb 2023 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/refocusing-our-sponsors-program-a-call-for-new-sponsors-in-specific-countries/</guid><description>&lt;p>Some small organisations who want to register metadata for their research and participate in Crossref are not able to do so due to financial, technical, or language barriers. To attempt to reduce these barriers we have developed several programs to help facilitate membership. One of the most significant&amp;mdash;and successful&amp;mdash;has been our &lt;a href="https://www.crossref.org/community/sponsors/">Sponsor program&lt;/a>.&lt;/p>
&lt;p>Sponsors are organisations that are generally not producing scholarly content themselves but work with or publish on behalf of groups of smaller organisations that wish to join Crossref but face barriers to do so independently.  Sponsors work directly with Crossref in order to provide billing, technical, and, if applicable, language support to Members.&lt;/p>
&lt;p>Because Sponsors are important partners in facilitating membership there is a high bar to meet to be accepted as a Sponsor. To ensure that an organisation can accurately represent Crossref and has the resources to be successful we created &lt;a href="https://www.crossref.org/community/sponsors/#sponsor-criteria">a set of criteria&lt;/a> that must be met to be considered.&lt;/p>
&lt;p>Our Sponsors program has grown considerably over the last decade and has now become the primary route to membership for emerging markets and small or academic-adjacent publishing operations.&lt;/p>
&lt;p>The program began in 2012 with four Sponsors, based primarily in South Korea and Turkey, representing fewer than 100 members. In the next stage of development, the program covered Brazil, India, and Ukraine, and nearly 1300 members. At the end of 2022, the program had grown to over 100 sponsors from &lt;a href="https://www.crossref.org/membership/about-sponsors/">45 countries&lt;/a> representing over 11,000 of our members.&lt;/p>
&lt;p>Though the program continues to expand, there are still regions where we lack Sponsors, while having an abundance in others. We are working with members, ambassadors, and the community to help identify organisations that may be a fit with the Sponsor program and based in those regions where coverage is lacking.&lt;/p>
&lt;p>This January we announced our &lt;a href="https://www.crossref.org/gem/">Global Equitable Membership (GEM) Program&lt;/a> which offers relief from membership and content registration fees for members in the least economically-advantaged countries in the world. Eligibility for the program is based on a member&amp;rsquo;s country on our curated list.&lt;/p>
&lt;p>Though the GEM program reduces financial barriers to becoming a member, many organisations still require technical assistance and local language support. Working with a Sponsor would help organisations overcome these burdens. However, there is little or no Sponsor coverage for organisations located in most GEM-eligible countries. That means that in places like Bangladesh, Nepal, and Senegal, where we&amp;rsquo;ve seen a lot of growth, more organisations could join us if a suitable local Sponsor could support them.&lt;/p>
&lt;p>We have made the decision to pause accepting new Sponsors from regions where Sponsor numbers are already very high or not based in a GEM region. By doing so we can focus on growing the program in areas where there is the greatest need.&lt;/p>
&lt;p>We are also going to focus on how best to support our current 100+ Sponsors and work with them to evaluate ways to improve the program. We will bolster the training and resources, outreach activities, and solicit feedback on additional ways we can help.&lt;/p>
&lt;p>We would love to hear from organisations based in GEM countries who might consider becoming a Sponsor. But our invitation for Sponsors is not limited to the support for the GEM program. There are countries where the GEM program won&amp;rsquo;t apply, but where growth is high and no Sponsor is present. In particular, we seek support in the following countries where member numbers are growing but could be better supported.&lt;/p>
&lt;center>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Country/state&lt;/th>
&lt;th>Region&lt;/th>
&lt;th>No. Crossref members&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Nigeria&lt;/td>
&lt;td>Sub-Saharan Africa (Western)&lt;/td>
&lt;td>99&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Philippines&lt;/td>
&lt;td>South-eastern Asia&lt;/td>
&lt;td>81&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kenya&lt;/td>
&lt;td>Sub-Saharan Africa (Eastern)&lt;/td>
&lt;td>40&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Egypt&lt;/td>
&lt;td>Northern Africa&lt;/td>
&lt;td>26&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sri Lanka&lt;/td>
&lt;td>Southern Asia&lt;/td>
&lt;td>13&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;/center>
&lt;p>If your organisation is based in one of these regions and supports or provides services to scholarly publishers in one of the above countries &amp;mdash;please take a look &lt;a href="https://www.crossref.org/community/sponsors/">at the criteria&lt;/a> set out on our website and do get in touch to start the conversation if you think you can meet them. We&amp;rsquo;re excited to hear from you!&lt;/p></description></item><item><title>Measuring Metadata Impacts: Books Discoverability in Google Scholar</title><link>https://www.crossref.org/blog/measuring-metadata-impacts-books-discoverability-in-google-scholar/</link><pubDate>Wed, 25 Jan 2023 00:00:00 +0000</pubDate><author>Lettie Conrad</author><guid>https://www.crossref.org/blog/measuring-metadata-impacts-books-discoverability-in-google-scholar/</guid><description>&lt;p>&lt;em>This blog post is from Lettie Conrad and Michelle Urberg, cross-posted from the The Scholarly Kitchen.&lt;br>
As sponsors of this project, we at Crossref are excited to see this work shared out.&lt;/em>&lt;/p>
&lt;p>The scholarly publishing community talks a LOT about metadata and the need for high-quality, interoperable, and machine-readable descriptors of the content we disseminate. However, as &lt;a href="https://scholarlykitchen.sspnet.org/2021/09/30/the-experience-of-good-metadata-linking-metadata-to-research-impacts/" target="_blank">we’ve reflected on previously in the &lt;em>Kitchen&lt;/em>&lt;/a>, despite well-established information standards (e.g., persistent identifiers), our industry lacks a shared framework to measure the value and impact of the metadata we produce.&lt;/p>
&lt;p>In 2021, we embarked on a &lt;a href="https://doi.org/10.64000/h3w86-2z708" target="_blank">Crossref-sponsored study&lt;/a> designed to measure how metadata impacts end-user experiences and contributes to the successful discovery of academic and research literature via the mainstream web. Specifically, we set out to learn if scholarly books with DOIs (and associated metadata) were more easily found in Google Scholar than those without DOIs.&lt;/p>
&lt;p>Initial results indicated that &lt;strong>DOIs have an indirect influence on the discoverability of scholarly books in Google Scholar&lt;/strong> &amp;ndash; however, we found no direct linkage between book DOIs and the quality of Google Scholar indexing or users’ ability to access the full text via search-result links. Although Google Scholar claims to not use DOI metadata in its search index, the results of our mixed-methods study of 100+ books (from 20 publishers) demonstrate that books with DOIs are generally more discoverable than those without DOIs.&lt;/p>
&lt;p>As we finalize our analysis, we are &lt;a href="https://community.crossref.org/t/with-or-without-measuring-impacts-of-books-metadata/3058" target="_blank">sharing some early results&lt;/a> and inviting input from our community. What relevant lessons can we glean from this exercise? What changes might book publishers consider based on the outcomes of this study?&lt;/p>
&lt;h2 id="background-on-the-study">Background on the study&lt;/h2>
&lt;p>This study was designed to evaluate metadata impacts &amp;amp; benefits to users. Given its popularity with a range of stakeholders in our industry, we set out to measure metadata impacts on discoverability in the mainstream web – namely, Google Scholar.&lt;/p>
&lt;p>Our test method and analysis rubric was developed based on our own information-user research, in particular how readers search and retrieve scholarly ebooks, as well as published studies about academic information experiences and research practices. We rated the search performance of more than 100 scholarly books using preset test queries (two for each title). The books tested in this study came from publishers of all sorts and sizes, and represent both monographs and edited volumes from a range of fields; some were open access and others were published under traditional licensing models.&lt;/p>
&lt;p>We developed and executed known-item test searches that were designed to simulate common researcher practices. Heuristic analysis of the search results was used to rate the search performance on a 5-point scoring rubric, which was designed to measure the degree of friction in locating the book in question. This method allowed us to assess specific book and metadata attributes by their search performance scores to assess the impact of book metadata on content discoverability in Google Scholar.&lt;/p>
&lt;h2 id="results-and-findings">Results and findings&lt;/h2>
&lt;p>In this study, we learned that high-value fields include the primary title paired with subtitles, author/editor surnames and/or field of study. Queries using full book titles performed the best across the board. Those using publication dates and/or author/editor surnames and/or publisher names, but without the book title, were the lowest performers.&lt;/p>
&lt;p>Surprisingly, our discoverability scores show no significant variation in performance by the type of book, whether edited or authored. Open-access titles performed somewhat better than traditional ones. Books covering humanities and social science fields performed a bit better than STM books, but only by a slim difference (that is not statistically significant).&lt;/p>
&lt;p>We primarily tested the discoverability of book titles, from equal numbers of books with and without chapter-level DOIs. We ran similar tests for chapter-title discoverability but found the majority of test queries for chapters lead users to the full book itself. While books without title-level DOIs were found to be less discoverable, we did not find a measurable difference between books with or without chapter-level DOIs. (Note: All books in this study with chapter-level DOIs assigned also carried a title-level DOI, which was found to be fairly common.)&lt;/p>
&lt;p>Based on these results, we are developing a theory that &lt;strong>books with DOIs perform better in Google Scholar because they benefit from the structured, open metadata associated&lt;/strong> with those DOIs – which are used by hundreds of platforms and services, and therefore are “seeded” throughout the mainstream web, which Scholar may draw on for indexing, linking, etc. That said, however, these results also suggest that publishers are best served by a metadata strategy that is well attuned to the protocols expected of each channel for book search and discovery. In a recent conversation about our findings, Anurag Acharya himself noted that these results underscore the need for publishers to invest in the robust construction and broad distribution of book metadata.&lt;/p>
&lt;p>In this study, we have observed that the metadata protocols surrounding Google Scholar are not fully integrated into our industry’s established scholarly information standards bodies, like NISO, or infrastructure organisations, like Crossref. While some mainstream data standards prevail in the Scholar index, like the use of schema.org and HTTP, some key metadata attributes seem to be lacking. For example, an indicator of the type of scholarly book (monograph, handbook, etc.) would improve Google Scholar’s search index and could be used to filter search results, thereby improving users’ experiences discovering scholarly books. One clear challenge for book publishers today is the fact that Google Scholar operates outside of our community-governed scholarly information infrastructure.&lt;/p>
&lt;h2 id="what-comes-next">What comes next&lt;/h2>
&lt;p>While this study focused on Google Scholar, the results and lessons learned are applicable to other mainstream channels of information seeking/discovery. Our report, due out spring 2023, will contribute to the literature intended to support user-centric information systems design and content architecture by scholarly publishers and service providers.&lt;/p>
&lt;p>As we write up our findings, we intend to develop a framework that can help publishers and others measure the impact of their work to enrich and distribute scholarly metadata. We hope this first systematic review of the impacts of metadata on the discoverability of books in Google Scholar will provide valuable insights for this community. In the meantime, please share your thoughts and questions in the comments below &amp;ndash; or reach out to us directly (&lt;a href="https://www.linkedin.com/in/lettieyconrad/" target="_blank">see Lettie’s profile here&lt;/a> and &lt;a href="https://www.linkedin.com/in/michelleurberg/" target="_blank">Michelle’s profile here&lt;/a>).&lt;/p>
&lt;p>Acknowledgments: &lt;em>The authors would like to thank Jennifer Kemp at Crossref for the inspiration to take this dive into the metadata literature and reflect on its impact on research information experiences. Special thanks to Anurag Acharya at Google Scholar for his consultation during this study.&lt;/em>&lt;/p></description></item><item><title>Introducing our new Global Equitable Membership (GEM) program</title><link>https://www.crossref.org/blog/introducing-our-new-global-equitable-membership-gem-program/</link><pubDate>Wed, 07 Dec 2022 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/introducing-our-new-global-equitable-membership-gem-program/</guid><description>&lt;p>When Crossref began over 20 years ago, our members were primarily from the United States and Western Europe, but for several years our membership has been more global and diverse, growing to almost 18,000 organisations around the world, representing 148 countries.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/gem-blog-v4.jpg"
alt="image of GEM logo and country list" width="80%">
&lt;/figure>
&lt;p>As we continue to grow, finding ways to help organisations participate in Crossref is an important part of our mission and approach. Our goal of creating the &lt;a href="https://www.crossref.org/documentation/research-nexus">Research Nexus&lt;/a>&amp;mdash;a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society&amp;mdash;can only be achieved by ensuring that participation in Crossref is accessible to all. Building a network for the global community must include input from all of the global community. &lt;/p>
&lt;p>Although Crossref membership is open to all organisations that produce scholarly and professional materials, cost and technical challenges can be barriers to joining for many organisations. To address some of these challenges, we created our &lt;a href="https://www.crossref.org/membership/about-sponsors/">Sponsors Program&lt;/a>, which provides technical, financial and local language support. We also collaborate with the Public Knowledge Project on the &lt;a href="https://docs.pkp.sfu.ca/crossref-ojs-manual/" target="_blank">Open Journals Platform&lt;/a> to develop plugins for OJS users.&lt;/p>
&lt;p>Additionally, we had a limited &amp;lsquo;fee assistance&amp;rsquo; program to waive the content registration fees for members working under specific Sponsor arrangements, including INASP, and African Journals Online (AJOL). Learning from the experiences of such successful partnerships, starting in January 2023, we are expanding this program to provide greater membership equitability and accessibility to organisations located in the least economically-advantaged countries in the world through our &lt;a href="https://www.crossref.org/gem/">Global Equitable Membership&lt;/a> (GEM) Program. This new scheme now encompasses the annual fee as well as the content registration fees.&lt;/p>
&lt;p>Eligibility for the program is based on a member&amp;rsquo;s country. We have curated the list, predominantly based on the&lt;a href="https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups" target="_blank"> International Development Association&lt;/a> (IDA) list and excluding anywhere we are bound by international sanctions. From January 2023, organisations based in countries listed in our GEM program will be eligible to join Crossref and contribute with their metadata to a robust scholarly record at no cost. This also applies to 187 existing members in eligible countries who will no longer be charged for Crossref membership or content registration.&lt;/p>
&lt;h3 id="existing-crossref-members-in-gem-eligible-countries">Existing Crossref members in GEM-eligible countries&lt;/h3>
&lt;table>
&lt;tr>
&lt;td>Bangladesh (54)
&lt;/td>
&lt;td>Burundi (1)
&lt;/td>
&lt;td>Kiribati (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kyrgyz Republic (20)
&lt;/td>
&lt;td>Central African Republic (1)
&lt;/td>
&lt;td>Lesotho (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Nepal (19)
&lt;/td>
&lt;td>Democratic Republic of the Congo (1)
&lt;/td>
&lt;td>Liberia (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ghana (15)
&lt;/td>
&lt;td>Guyana (1)
&lt;/td>
&lt;td>Marshall Islands (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Yemen (10)
&lt;/td>
&lt;td>Haiti (1)
&lt;/td>
&lt;td>Mauritania (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Sudan (7)
&lt;/td>
&lt;td>Honduras (1)
&lt;/td>
&lt;td>Micronesia (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tanzania (7)
&lt;/td>
&lt;td>Laos (1)
&lt;/td>
&lt;td>Mozambique (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Afghanistan (6)
&lt;/td>
&lt;td>Madagascar (1)
&lt;/td>
&lt;td>Nicaragua (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ethiopia (5)
&lt;/td>
&lt;td>Malawi (1)
&lt;/td>
&lt;td>Niger (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Zambia (5)
&lt;/td>
&lt;td>Maldives (1)
&lt;/td>
&lt;td>Samoa (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Bhutan (4)
&lt;/td>
&lt;td>Myanmar (1)
&lt;/td>
&lt;td>Sao Tome and Principe (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Rwanda (4)
&lt;/td>
&lt;td>Cambodia (1)
&lt;/td>
&lt;td>Sierra Leone (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Tajikistan (4)
&lt;/td>
&lt;td>Chad (1)
&lt;/td>
&lt;td>Solomon Islands (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Kosovo (3)
&lt;/td>
&lt;td>Comoros (1)
&lt;/td>
&lt;td>South Sudan (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Senegal (3)
&lt;/td>
&lt;td>Cote d’Ivoire (1)
&lt;/td>
&lt;td>Togo (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Uganda (3)
&lt;/td>
&lt;td>Djibouti (1)
&lt;/td>
&lt;td>Tonga (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Burkina Faso (2)
&lt;/td>
&lt;td>Eritrea (1)
&lt;/td>
&lt;td>Tuvalu (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Mali (2)
&lt;/td>
&lt;td>Gambia (1)
&lt;/td>
&lt;td>Vanuatu (0)
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Somalia (2)
&lt;/td>
&lt;td>Guinea (1)
&lt;/td>
&lt;td>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Benin (1)
&lt;/td>
&lt;td>Guinea-Bissau (1)
&lt;/td>
&lt;td>
&lt;/td>
&lt;/tr>
&lt;/table>
&lt;p>The list of countries will undergo an annual review, to follow the latest guidance from IDA, which uses the somewhat simplistic World Bank income classifications but applies a more granular blend of criteria for economic health, thereby allowing for greater nuance, such as indicating countries where the gap between rich and poor is very wide.&lt;/p>
&lt;p>The program results from our experience working with and knowing the communities through Sponsors and working with past members who have struggled to pay. It aims to bring us closer to our vision of building an inclusive, rich and open network of relationships underpinning the scholarly record. With the support of the &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees Committee&lt;/a>, the launch of the program was confirmed with the recent unanimous vote of our Board to evolve our fee assistance program into a more expansive scheme. GEM presents a more comprehensive and equitable solution than our former arrangements. It involves an opportunity to join Crossref and contribute scholarly metadata to our global community on a zero-fee basis for membership and content registration. This offering will be applied by default to organisations based in all eligible countries, irrespective of joining through any specific Sponsor, or independently.&lt;/p>
&lt;p>While the GEM Program will alleviate financial barriers, and we hope to see the numbers above grow significantly, the GEM program will not necessarily help ease technical or administrative burdens. We still need our valued Sponsors for that and we seek new Sponsors in the above locations. We would love to hear from organisations based in GEM countries who might consider becoming a Sponsor or otherwise support local colleagues in building experience of metadata and working with global open scholarly infrastructure systems like Crossref. Please &lt;a href="mailto:feedback@crossref.org">reach out to me&lt;/a> to discuss ideas or with any other questions or comments.&lt;/p></description></item><item><title>How funding agencies can meet OSTP (and Open Science) guidance using existing open infrastructure</title><link>https://www.crossref.org/blog/how-funding-agencies-can-meet-ostp-and-open-science-guidance-using-existing-open-infrastructure/</link><pubDate>Thu, 17 Nov 2022 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/how-funding-agencies-can-meet-ostp-and-open-science-guidance-using-existing-open-infrastructure/</guid><description>&lt;p>In August 2022, the United States Office of Science and Technology Policy (OSTP) issued a &lt;a href="https://web.archive.org/web/20221124074730/https://www.whitehouse.gov/wp-content/uploads/2022/08/08-2022-OSTP-Public-Access-Memo.pdf" target="_blank">memo (PDF)&lt;/a> on ensuring free, immediate, and equitable access to federally funded research (a.k.a. the “Nelson memo”). Crossref is particularly interested in and relevant for the areas of this guidance that cover metadata and persistent identifiers—and the infrastructure and services that make them useful.&lt;/p>
&lt;p>Funding bodies worldwide are increasingly involved in research infrastructure for dissemination and discovery. While this post does respond to the OSTP guidelines point-by-point, the information here applies to all funding bodies in all countries. It will be equally useful for publishers and other systems that operate in the scholarly research ecosystem.&lt;/p>
&lt;p>In response to calls from our community for more specifics, this post:&lt;/p>
&lt;ol>
&lt;li>Provides an overview of the specific ways that Crossref (along with organisations and initiatives like &lt;a href="https://datacite.org/" target="_blank">DataCite&lt;/a>, &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a>, and &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a>) helps U.S. federal agencies&amp;mdash;and indeed any other funder&amp;mdash;meet critical aspects of the recommendations.&lt;/li>
&lt;li>Restates our intent to collaborate with all stakeholders in the scholarly research ecosystem, including the OSTP, the US federal agencies, our existing funder, publisher, and university members, to support the recommendation as plans develop.&lt;/li>
&lt;li>References the &lt;a href="https://www.crossref.org/categories/grants">work and adoption of Crossref Grant DOIs&lt;/a>, including analyses of existing metadata matching funding to outputs.&lt;/li>
&lt;li>Highlights that what’s outlined in the memo aligns with our longstanding mission to capture and maintain the scholarly record and our vision of the Research Nexus, as we describe in our current blog series, regarding our &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">role in preserving the integrity of the scholarly record (ISR)&lt;/a>.&lt;/li>
&lt;/ol>
&lt;h2 id="infrastructure-already-exists-to-support-funder-goals-it-just-needs-more-adoption">Infrastructure already exists to support funder goals; it just needs more adoption&lt;/h2>
&lt;p>Ensuring free, immediate, and equitable access to metadata that captures the scholarly record is an essential part of meeting the aims of the memo but also supporting Open Science globally.&lt;/p>
&lt;p>In September, Crossref ORCID, DataCite, and ROR participated in the &lt;a href="https://altum.com/forum-on-grants-management/" target="_blank">2022 Forum on Global Grants Management&lt;/a> run by Altum and the summary provides a good example of the importance of open infrastructure and open metadata to the goals of Open Science:&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>Open Science begins with open infrastructure: Attendees agreed that Open Science relies on many other &amp;lsquo;opens’ – most notably, open metadata, open infrastructure, and open governance. Metadata and DOIs (digital object identifiers) for publications, grants, and research outputs, are essential to illuminate the connections that exist between funding and outcomes. That metadata runs on infrastructure powered by organisations such as Crossref, ORCID, ROR, and DataCite.&lt;/span>
&lt;/div>
&lt;p>As a foundational scholarly infrastructure committed to meeting the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a> of governance, insurance, and sustainability, Crossref plays an essential role in implementing and supporting key aspects of the guidance. For many years, we have been focused on the integrity of the scholarly record (ISR), and the shared vision to collectively achieve what we call the &lt;a href="https://www.crossref.org/documentation/research-nexus/">Research Nexus&lt;/a>, which is described as&lt;/p>
&lt;blockquote>
&lt;p>A rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.&lt;/p>
&lt;/blockquote>
&lt;p>Metadata&amp;mdash;including persistent identifiers and relationships between different research objects&amp;mdash;is the foundation of the Research Nexus and is critical to openly and sustainably fulfilling the OSTP memo&amp;rsquo;s recommendations.&lt;/p>
&lt;p>This topic of open metadata and identifiers isn’t just an issue for research resulting from US federal funding. We are working to implement open scholarly infrastructure globally, bringing significant benefits to the whole scholarly research ecosystem.&lt;/p>
&lt;p>The current situation brings to mind the William Gibson quote, “&lt;a href="https://quoteinvestigator.com/2012/01/24/future-has-arrived/" target="_blank">The future is already here - it’s just not evenly distributed yet&lt;/a>”. Much of the open infrastructure to support the identifier, metadata and reporting requirements of the OSTP memo already exists, but it is unevenly implemented. Increased collaboration and effort will be needed to bring this all to fruition.&lt;/p>
&lt;p>We set out below some steps that all stakeholders can take to meet not just the OSTP guidelines, but Open Science goals more broadly, and globally.&lt;/p>
&lt;h2 id="what-does-adoption-look-like-how-exactly-do-funders-and-other-stakeholders-work-with-this-infrastructure">What does ‘adoption’ look like? How exactly do funders and other stakeholders work with this infrastructure?&lt;/h2>
&lt;p>The OSTP memo calls for specific actions concerning metadata and identifiers where, fortunately, open and global solutions already exist.&lt;/p>
&lt;p>For example, item 4 a) says, “&lt;em>Collect and make publicly available appropriate metadata associated with scholarly publications and data resulting from federally funded research.&lt;/em>” Crossref and DataCite make metadata, including persistent identifiers (DOIs to be specific), openly available for a broad range of research objects from &lt;a href="https://search.crossref.org/" target="_blank">publications&lt;/a> to &lt;a href="https://search.datacite.org/" target="_blank">data&lt;/a>. Item 4 b) reads, “&lt;em>Assign unique digital persistent identifiers to all scientific research and development awards and intramural research protocols&lt;/em>”. Again, federal agencies and other funders are already &lt;a href="https://www.crossref.org/community/grants/">joining&lt;/a> to register awards and grants and &lt;a href="https://doi.org/10.64000/tynar-j7a72" target="_blank">distribute these records openly&lt;/a> through Crossref. However, this is an example of uneven adoption as registering awards and grants with DOIs is only being done by a few funders so far, which needs to increase.&lt;/p>
&lt;h4 id="here-is-an-ideal-workflow-that-funders-and-publishers-can-already-follow">Here is an ideal workflow that funders and publishers can already follow&lt;/h4>
&lt;ol>
&lt;li>Funders join Crossref to register grants and awards (or indeed any other object such as reports). They apply on our website, accept our terms, and provide key information such as contact details. An annual membership fee ranges from $200-$1200 USD.&lt;/li>
&lt;li>Funders and publishers collect ROR IDs and authenticated ORCID iDs for all authors/awardees and their affiliations.&lt;/li>
&lt;li>Funders register a Crossref DOI for the award/grant, including awardees’ ORCID iDs and ROR IDs. They send us XML information about the grant (note that we will imminently release an online form to make it easier for the less technical funders). Many funder members register the metadata through a third party, such as Altum (if they use ProposalCentral) or Europe PMC.&lt;/li>
&lt;li>At the same time, funders update the awardees’ ORCID record directly with the Crossref Grant DOI and metadata.&lt;/li>
&lt;li>Grantees produce research objects and outputs such as data, protocols, code, preprints, articles, conference papers, book chapters, etc.&lt;/li>
&lt;li>These objects are registered with Crossref or DataCite, and DOIs are created by the publisher or repository members who include ORCID iDs, Crossref Grant DOIs (gathered from the author), ROR IDs for affiliations for all contributors, and other key metadata such as licensing information, and in the case of publications - references and abstracts. Note that the publisher works its magic (actually, publishers do a lot of editorial and production work, such as including data citations in the references using DataCite DOIs for the data in data repositories).&lt;/li>
&lt;li>On the Crossref side, we do a bunch of processing and matching and are planning to refine this and do more. Sometimes relationships are notified and added, such as data citation, preprints related to articles or funding acknowledgements converted from free text to &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry IDs&lt;/a> and names.&lt;/li>
&lt;li>Grant records with Crossref DOIs are now part of the scholarly record. All stakeholders may retrieve the open metadata and relationships through our public APIs. Crossref and DataCite will always provide open metadata, as safeguarded by our respective commitments to POSI.&lt;/li>
&lt;/ol>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;p>Anyone can use the open metadata registered with Crossref, DataCite and ORCID as connections have been established between (ideally all) research objects and entities through open metadata and identifiers. This means that:&lt;/p>
&lt;ul>
&lt;li>Funding agencies can monitor compliance with their policies&lt;/li>
&lt;li>Publishers can identify the funder and meet their requirements&lt;/li>
&lt;li>Funding agencies can assess and report on the reach and return of their funding programs&lt;/li>
&lt;li>The provenance and integrity of the scholarly record is preserved and discoverable, benefitting all stakeholders.&lt;/li>
&lt;/ul>&lt;/span>
&lt;/div>
&lt;h2 id="suggestions-for-meeting-ostp-and-open-science-guidance-point-by-point">Suggestions for meeting OSTP and Open Science guidance, point by point&lt;/h2>
&lt;table>
&lt;tr>
&lt;td>&lt;strong>OSTP Recommendation&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Publishers should…&lt;/strong>
&lt;/td>
&lt;td>&lt;strong>Funding agencies should…&lt;/strong>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>4 a) Collect and make publicly available appropriate metadata associated with scholarly publications and data resulting from federally funded research
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>For scholarly publications: register comprehensive metadata &amp; DOIs with Crossref.
&lt;li>For scholarly data: register comprehensive metadata and DOIs with DataCite.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Use Crossref’s API to retrieve publication and other metadata.
&lt;li>Use DataCite’s API to retrieve data/repository metadata.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>
i) all author and co-author names, affiliations, and sources of funding, referencing digital persistent identifiers, as appropriate;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Collect and validate the following from authors at manuscript submission: ROR &amp; ORCiD IDs, Crossref Grant DOIs.
&lt;li>Include data citations in reference lists, preferably with DataCite DOIs.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Register awards and grants with Crossref and create DOI records for them.
&lt;li>Use ORCID’s API to retrieve validated contributor metadata.
&lt;li>Update contributors’ ORCID records with Crossref Grant DOIs and metadata.
&lt;li>Use ROR API to retrieve and verify affiliation metadata.
&lt;li>Recommend data citations be included in published outputs.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>ii) the date of publication; and,
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Include acceptance and publication dates in Crossref metadata.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Use Crossref’s API to retrieve publication dates.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>
iii) a unique digital persistent identifier for the research output;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>For scholarly publications and research outputs: register full metadata &amp; DOIs with Crossref.
&lt;li>For scholarly data: register full metadata and DOIs with DataCite.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Use Crossref and DataCite APIs to retrieve DOIs for research outputs.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>4 b) Instruct federally funded researchers to obtain a digital persistent identifier that meets the common/core standards of a digital persistent identifier service defined in the NSPM-33 Implementation Guidance, include it in published research outputs when available, and provide federal agencies with the metadata associated with all published research outputs they produce, consistent with the law, privacy, and security considerations.
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Collect ORCID iDs on manuscript submission for all authors.
&lt;li>Register Crossref and DataCite DOIs and metadata for research outputs, including data.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Recommend that researchers applying for funding obtain an ORCID iD and collect them upon grant application for all applicants.
&lt;li>Prepopulate grant applications with CV and publication information from applicants’ ORCID records.
&lt;li>ORCID iDs should be included in the grants registered by the agencies with Crossref.
&lt;li>Agencies can use our open APIs to retrieve the metadata on publications and data rather than ask researchers to do it, saving time and effort.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>4 c) Assign unique digital persistent identifiers to all scientific research and development awards and intramural research protocols that have appropriate metadata linking the funding agency and their awardees through their digital persistent identifiers.
&lt;/td>
&lt;td>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Join Crossref to register Crossref Grant DOIs, including ROR IDs and ORCID iDs
&lt;li>Ensure grant proposal and assessment systems integrate with Crossref, ROR for affiliations and with ORCID for applicants/awardees.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 a) coordinate between federal science agencies to enhance efficiency and reduce redundancy in public access plans and policies, including as it relates to digital repository access;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Work with agencies to ensure a smooth, automated workflow.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Using and supporting existing open scholarly infrastructure and using open identifiers will avoid duplication of effort and make the overall ecosystem more efficient .
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 b) improve awareness of federally funded research results by all potential users and communities;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Collect Crossref Grant DOIs from authors and use them to link from publications to grant information.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Communicate your Crossref Grant DOIs and open grant metadata widely via human and machine interfaces. Inclusion in the Crossref API will enhance dissemination and discoverability
&lt;li>Update contributors’ ORCID records with Crossref Grant DOIs and metadata
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 c) consider measures to reduce inequities in the publishing of, and access to, federally funded research and data, especially among individuals from underserved backgrounds and those who are early in their careers;
&lt;/td>
&lt;td>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Registering grants and sharing metadata through Crossref means it’s part of the world’s largest open community-governed metadata exchange and makes it available to the entire world without restriction.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 d) develop procedures and practices to reduce the burden on federally funded researchers in complying with public access requirements;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Ensure your systems and those you work with make it as easy as possible for authors to provide the necessary metadata and persistent identifiers - work towards as much automation as possible and pulling from other systems rather than asking for data to be re-keyed.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Ensure the platforms you work with, such as grant proposal or assessment systems, retrieve and prepopulate ROR IDs, ORCID iDs, and Crossref and DataCite DOIs and associated metadata whenever possible so that the researchers don’t have to manually rekey or reformat data.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 e) recommend standard consistent benchmarks and metrics to monitor and assess implementation and iterative improvement of public access policies over time;
&lt;/td>
&lt;td>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Ensure that platforms and systems integrate with ROR, ORCID, Crossref, and DataCite so that this open metadata can lead to the creation of benchmarks and metrics.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 f) improve monitoring and encourage compliance with public access policies and plans;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Use open infrastructure to help authors easily comply with public access and funder/institution policies. Automate systems as much as possible.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Using the open infrastructure, metadata, and identifiers outlined in this post will make monitoring more straightforward and compliance easier for all stakeholders. The community can build services on open infrastructure and metadata.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 g) coordinate engagement with stakeholders, including but not limited to publishers, libraries, museums, professional societies, researchers, and other interested non-governmental parties on federal agency public access efforts;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Work with the global open infrastructure organisations (Crossref, DataCite and ORCID) whose members include funding agencies, societies, publishers, universities, libraries, repositories, museums, NGOs, and many other stakeholders - all looking to improve the efficiency of the research ecosystem.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Work with the global open infrastructure organisations (Crossref, DataCite and ORCID) whose members include funding agencies, societies, publishers, universities, libraries, repositories, museums, NGOs, and many other stakeholders - all looking to improve the efficiency of the research ecosystem.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 h) develop guidance on desirable characteristics of—and best practices for sharing in—online digital publication repositories;
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Support automated systems that use metadata and identifiers to populate repositories automatically.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Collaborate with publishers, Crossref and others to develop automated systems to populate repositories.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5 j) develop strategies to make federally funded publications, data, and other such research outputs and their metadata are findable, accessible, interoperable, and re-useable, to the American public and the scientific community in an equitable and secure manner.
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Provide and support a range of discovery services based on open infrastructure.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;td>
&lt;ul>
&lt;li>Encourage discovery services - and develop services - that use the open infrastructure, metadata and persistent identifiers to enable.
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;/tr>
&lt;/table>
&lt;h2 id="everybody-needs-to-play-their-part">Everybody needs to play their part&lt;/h2>
&lt;p>A lot of the work on making the above happen is already underway, and there is widespread adoption of open identifiers and metadata, but as noted above, funders are still early in the adoption journey, and implementation among all stakeholders is patchy.&lt;/p>
&lt;p>Critical parts of the infrastructure rely on third-party platforms that supply tools and systems to authors, funders, and publishers - so coordinating the support for the appropriate metadata and identifiers in these systems and tools is very important.&lt;/p>
&lt;p>We are emphasising how our existing open scholarly infrastructure systems are helping. But we also know that it’s not all perfect yet. Infrastructure is always evolving, metadata is never complete, refactoring workflows and systems can be costly, and integration can always be smoother. But our existing open infrastructure has already delivered significant benefits, and broader adoption will bring additional benefits to the whole scholarly research and communications ecosystem and help achieve the promise of Open Science in advancing human knowledge.&lt;/p>
&lt;p>While working on this coordination and integration, we all try to remember that it should minimise work for researchers, and processes should be as automated as possible.&lt;/p>
&lt;p>Collaboration is key to making this all work.&lt;/p>
&lt;p>We already work with many funders through our &lt;a href="https://www.crossref.org/working-groups/funders">Advisory Group&lt;/a>, our 30 funder members, &lt;a href="https://api.crossref.org/types/grant/works?rows=0&amp;amp;facet=funder-name:*" target="_blank">25 of whom&lt;/a> have so far collectively registered around &lt;a href="https://api.crossref.org/works?filter=type:grant" target="_blank">40,000 Crossref Grant DOIs, retrievable from our open API&lt;/a>. Some grants are even &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">matched&lt;/a> to resulting outputs already, and some funders have recently dug into Crossref metadata to analyse outcomes from their investments, such as the &lt;a href="https://doi.org/10.31222/osf.io/gj4hq" target="_blank">Dutch Research Council (NWO) which presents findings and makes a case for greater emphasis on Crossref funding metadata&lt;/a>.&lt;/p>
&lt;p>We also work closely with partners &lt;a href="http://blog.europepmc.org/2020/06/global-grant-ids-in-europe-pmc.html" target="_blank">Europe PMC&lt;/a> and &lt;a href="https://altum.com/" target="_blank">Altum&lt;/a>, and we engage in community research and discussion, for example, through the &lt;a href="https://www.orfg.org/" target="_blank">Open Research Funders Group&lt;/a>.&lt;/p>
&lt;p>Alongside our fellow infrastructures and open identifier registries ORCID, DataCite, and ROR, we integrate with and support each other operationally and out in the community.&lt;/p>
&lt;p>We will continue focusing our resources and efforts on engaging with funders, including US federal agencies responding by the OSTP guidelines, and all stakeholders to support the entire global scholarly research ecosystem.&lt;/p>
&lt;h4 id="everyone-has-a-part-to-play-and-we-must-all-pull-together-to-prioritize-this-work">Everyone has a part to play, and we must all pull together to prioritize this work.&lt;/h4>
&lt;blockquote>
&lt;p>Who’s in?&lt;/p>
&lt;/blockquote>
&lt;p>Please &lt;a href="mailto:feedback@crossref.org">get in touch&lt;/a> with Ed, Ginny, or Jennifer (or indeed DataCite or ORCID or ROR) if you’d like to have a discussion about the workflows described here, or just to make sure you’re up to date on the latest developments and opportunities we describe. We look forward to working with all funding agencies to support them as they develop their plans.&lt;/p></description></item><item><title>Better preprint metadata through community participation</title><link>https://www.crossref.org/blog/better-preprint-metadata-through-community-participation/</link><pubDate>Wed, 09 Nov 2022 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/better-preprint-metadata-through-community-participation/</guid><description>&lt;p>Preprints have become an important tool for rapidly communicating and iterating on research outputs. There is now a range of preprint servers, some subject-specific, some based on a particular geographical area, and others linked to publishers or individual journals in addition to generalist platforms. In 2016 the Crossref schema started to support preprints and since then the number of metadata records has grown to around 16,000 new preprint DOIs per month.&lt;/p>
&lt;p>Preprints aren’t the same as journal articles, books, or conference papers. They have unique features, and how they are viewed and integrated into the publishing process has evolved over the past six years. For this reason, we have been revisiting the preprint metadata schema and decided that the best approach would be to form an advisory group (AG) of preprint practitioners and experts to help us.&lt;/p>
&lt;p>The AG has identified a number of areas in which preprint metadata could be improved. Four of these were considered to have the highest priority:&lt;/p>
&lt;ol>
&lt;li>Withdrawal and removal of preprints.&lt;/li>
&lt;li>Preprints as an article type (not a subtype of &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-record-types/posted-content-includes-preprints/" target="_blank">posted content&lt;/a>) in the schema.&lt;/li>
&lt;li>Relationships between preprints and other outputs.&lt;/li>
&lt;li>Versioning of preprints.&lt;/li>
&lt;/ol>
&lt;p>The members of the AG set to work with great enthusiasm, sharing perspectives and expertise. This led to &lt;a href="https://doi.org/10.13003/psk3h6qey4" target="_blank">a first tranche of recommendations&lt;/a> shared for feedback earlier this year, and we’re grateful for &lt;a href="https://community.crossref.org/t/share-your-thoughts-on-preprint-metadata/2800" target="_blank">engagement and feedback from the community&lt;/a> over the last few months.&lt;/p>
&lt;h2 id="what-did-the-community-say">What did the community say?&lt;/h2>
&lt;p>Some of the points raised in the feedback were:&lt;/p>
&lt;ul>
&lt;li>Could the origin of a withdrawal be included in the metadata, in particular whether it was requested by an author or another party?&lt;/li>
&lt;li>Can the metadata represent when a preprint has been submitted to a journal and what stage it is in the editorial process?&lt;/li>
&lt;li>Crossref is not alone in looking at preprint metadata, and several NISO groups are also engaged in related work.&lt;/li>
&lt;li>Interoperability and the ability to create relationships with identifiers beyond DOIs is important to maintain an accurate and comprehensive record of research outputs.&lt;/li>
&lt;/ul>
&lt;p>These will form the basis for ongoing discussions.&lt;/p>
&lt;h2 id="what-happens-next">What happens next?&lt;/h2>
&lt;p>There are three next steps that we will be taking.&lt;/p>
&lt;ol>
&lt;li>The recommendations outline only the outcomes of discussions in a relatively brief format. We have been working on a more detailed paper to communicate more about what was discussed and provide some extra justification and alternatives.&lt;/li>
&lt;li>The AG will continue to meet and discuss the points raised during consultation on the recommendations, along with topics that were considered a lower priority at an earlier stage.&lt;/li>
&lt;li>We will draw up a set of proposals for specific changes to the metadata schema that will reflect the outcomes of the recommendations and discussions.&lt;/li>
&lt;/ol>
&lt;p>Although the initial period for feedback on preprint metadata has ended, we welcome feedback at any time. If you would like to get in touch, please contact me or any member of the &lt;a href="https://www.crossref.org/working-groups/preprints/" target="_blank">advisory group&lt;/a>.&lt;/p></description></item><item><title>Forming new relationships: Contributing to Open source</title><link>https://www.crossref.org/blog/forming-new-relationships-contributing-to-open-source/</link><pubDate>Wed, 19 Oct 2022 00:00:00 +0000</pubDate><author>Patrick Vale</author><guid>https://www.crossref.org/blog/forming-new-relationships-contributing-to-open-source/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>One of the things that makes me glad to work at Crossref is the principles to which we hold ourselves, and the most public and measurable of those must be the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a>, or POSI, for short. These ambitions lay out how we want to operate - to be open in our governance, in our membership and also in our source code and data. And it&amp;rsquo;s that openness of source code that&amp;rsquo;s the reason for my post today - on 26th September 2022, our first collaboration with the &lt;a href="https://jsonforms.io/" target="_blank">JSON Forms&lt;/a> open-source project was &lt;a href="https://github.com/eclipsesource/jsonforms/releases/tag/v3.0.0" target="_blank">released into the wild&lt;/a>.&lt;/p>
&lt;p>Like most organisations, we depend heavily on open-source software for our operations - the software is universally available, generally high quality and &amp;lsquo;free&amp;rsquo;. And it&amp;rsquo;s easy to take that dependency, and the associated dependency on free time and effort on the part of the maintainers, for granted - but that&amp;rsquo;s not very sustainable. In fact, we believe relying on open-source software without helping to sustain it is an anti-pattern, and this project marks the start of our efforts to make funding open-source software a standard part of our technology budget.&lt;/p>
&lt;p>This isn&amp;rsquo;t the first time we&amp;rsquo;ve &lt;a href="https://github.com/sckott/habanero" target="_blank">supported&lt;/a> or &lt;a href="https://gitlab.com/crossref/rest_api" target="_blank">released&lt;/a> open-source software. Indeed for the past few years, all our new software is open source, and we&amp;rsquo;re in the process of replacing old closed code with new, so that eventually all our code will be open source. But this is the first time we&amp;rsquo;ve contributed extensively to something that isn&amp;rsquo;t focussed primarily on us, and our services. This is a project that we will find very useful, but it is a general purpose tool, and it&amp;rsquo;s already gaining traction in the community.&lt;/p>
&lt;h2 id="background-and-motivations">Background and motivations&lt;/h2>
&lt;p>A while back, I was tasked to do a quick &lt;a href="http://agiledictionary.com/209/spike/" target="_blank">spike&lt;/a> of work on testing the theory that we could use automated form generation tools to bring new interfaces to our users more quickly, and make them easier for &amp;ldquo;people who aren&amp;rsquo;t devs&amp;rdquo; to adapt and manage. We wanted to build a new user interface for registering content, and especially we wanted to make it easier for funders to register the grants they were awarding. As well as being more approachable by a less-technical audience, we also wanted these forms to be accessible (in terms of &lt;a href="https://www.a11yproject.com/" target="_blank">a11y&lt;/a> and users of assistive technology) and localisable - we wanted a solution that would cater to the needs of our rapidly diversifying membership.&lt;/p>
&lt;h2 id="enter-json-schema">Enter JSON Schema&lt;/h2>
&lt;p>We were clear about one side of the puzzle - we knew that we had to look beyond the XML ecosystem upon which much of our existing system is built - and landed on &lt;a href="https://json-schema.org/" target="_blank">JSON Schema&lt;/a>. JSON Schema is a &amp;lsquo;vocabulary that allows you to annotate and validate JSON documents&amp;rsquo;. This means you can describe the shape you expect your data to take, and apply constraints-based validation to that. Which means, in terms of a form library, that you can infer the structure of the form and test that the data entered into it matches what you expect. More than that, you can use that built-in validation to provide error messages to help people get the data right, first time.&lt;/p>
&lt;p>Working backwards from the outcome, the argument for adopting JSON Schema is compelling. It provides a mechanism for checking that data you are handling (for example, receiving input from a form) conforms to the constraints that you declare, but also allows you to tell people up-front, in a human and machine-readable way, what structure and format you will accept. This closed-loop of data annotation and validation gets more appealing when you look at the wide adoption of JSON Schema across languages and libraries. You can pretty much guarantee that for whatever client or server -side technology you are using, there will be a JSON Schema validator for it. Being able to share schemas across your systems (and equally importantly, with third parties) moves JSON schema from &amp;lsquo;just&amp;rsquo; being about data validation, to a key supportive technology.&lt;/p>
&lt;p>Building a form derived from a JSON Schema is an equally attractive prospect. JSON Schema &lt;a href="https://www.jviotti.com/dissertation.pdf" target="_blank">was conceived&lt;/a> during the AjaxWorld conference in 2007 as a &amp;lsquo;JSON-based format for defining the structure of JSON data&amp;rsquo;, and its use as a form-generation tool is relatively new, but there is growing community interest. There is even a &lt;a href="https://github.com/json-schema-org/community/discussions/70" target="_blank">discussion&lt;/a> about how to best create a JSON Schema vocabulary, specifically geared towards addressing some of the needs of form generation users. However, even in its current form, a JSON Schema can be passed to a library, and a very serviceable user interface appears. The devil is always in the detail, and the client-side libraries differ in their abilities to customise areas such as layout (you may not always want your form fields to appear in &lt;strong>exactly&lt;/strong> the same order as they do in your JSON Schema), custom elements (you might want something that wasn&amp;rsquo;t a form input, or that changes based on user input) and localisation. The ability to flexibly customise the appearance and behaviour of the interface was a key factor in our selection of a client-side form generation library.&lt;/p>
&lt;h2 id="choosing-a-library">Choosing a library&lt;/h2>
&lt;p>The other side of the puzzle was less clear - choosing a UI library that would take this JSON Schema, and turn it into a useful, and usable, form. I made the prototype using the venerable &lt;a href="https://github.com/rjsf-team/react-jsonschema-form" target="_blank">React JSON Schema form&lt;/a>. This worked well as a proof of concept, but veered dramatically off our chosen Frontend stack of &lt;a href="https://vuejs.org/" target="_blank">VueJS&lt;/a> and &lt;a href="https://vuetifyjs.com/" target="_blank">Vuetify&lt;/a>, and had some architectural constraints that would limit the scope of customisations we could make to our forms. So I went off looking for libraries that would work with our stack and came up with &lt;a href="https://koumoul-dev.github.io/vuetify-jsonschema-form/latest/" target="_blank">Vuetify JSON Schema Form&lt;/a>, and &lt;a href="https://jsonforms.io/" target="_blank">JSON Forms&lt;/a>.&lt;/p>
&lt;p>Vuetify JSON Schema Form matched our stack perfectly, but made some interesting decisions about the layout of data within the form, and that wouldn&amp;rsquo;t suit our purposes without dramatic modification.&lt;/p>
&lt;p>JSON Forms was an abstracted library, with a core handling the JSON Schema transformation and validation, and separate rendering libraries to handle the form generation. This was great - they had renderers for Angular, React, and even some support for VueJS. But not Vuetify.&lt;/p>
&lt;p>Clearly, we were going to have to make something.&lt;/p>
&lt;p>We made contact with the maintainers of both short-listed libraries to see how we could collaborate in creating a tool that would meet all of our (and hopefully, much of the wider community&amp;rsquo;s) requirements. Both maintainers were very helpful, and we had constructive discussions in both cases. In the end, we decided that the abstracted nature of the JSON Forms project was a better fit for our needs, providing a flexible platform on which we - and others - could extend. We were fortunate to receive funding from the Gordon and Betty Moore Foundation (Grant Agreement #10485) in order to accelerate this work, so we could provide a Grant Registration UI more quickly. We paid a large portion of that funding to the library maintainers, and Crossref contributed a portion of my time on the project. This allowed us to enter into an agreement with &lt;a href="https://eclipsesource.com/" target="_blank">EclipseSource&lt;/a>, the maintainers of JSON Forms, to collaboratively develop the new VueJS and Vuetify renderer library. Stefan Dirix, the lead maintainer, worked with me to build it.&lt;/p>
&lt;p>We didn&amp;rsquo;t forget about Vuetify JSON Schema Form though, and by way of appreciation for their help in the early stages, Crossref made a contribution towards the continued development of that library.&lt;/p>
&lt;h2 id="json-forms---now-with-vuetify">JSON Forms - now with Vuetify&lt;/h2>
&lt;p>Work started on the &lt;a href="https://github.com/eclipsesource/jsonforms-vuetify-renderers" target="_blank">JSON Forms Vuetify renderer set&lt;/a> in September 2021 - Stefan quickly created the first early prototypes of the new form renderers - but then we had a stroke of luck. Our repository received more input from the community. The one that made us sit up and take real notice was the news that someone else had already ported the JSON Forms React renderer set to Vue/Vuetify - and was &lt;a href="https://jsonforms.discourse.group/t/unclear-on-how-to-implement-basic-styling-in-vue2-according-to-github-page/347/5" target="_blank">offering this&lt;/a> as a contribution. &lt;a href="https://github.com/kchobantonov" target="_blank">Krasimir Chobantonov&amp;rsquo;s&lt;/a> fantastic first contribution got &lt;a href="https://github.com/eclipsesource/jsonforms-vuetify-renderers/pull/5" target="_blank">merged in&lt;/a> at the end of the month. This propelled the project forward massively, and was an early validation of the value of working in the open. Needless to say, we were very grateful. Another example of the open source value chain was that Stefan - as the maintainer - could take the time to carefully review and tidy up the incoming code, so what was merged was the product of two great developers.&lt;/p>
&lt;p>Having this great head start meant we could turn our attention to one of the other big areas we wanted to get right - localisation. Traditionally, JSON Schema -generated forms have handled localisation (translation of text and adjustment of date and numerical formats) by wholesale duplication and translation of the schema. This is cumbersome, and doesn&amp;rsquo;t integrate very well with custom error messages, nor external sources of interface messages (think form labels, descriptions, placeholders). So Stefan came up with a proposal, which we accepted, to add complete &lt;a href="https://github.com/eclipsesource/jsonforms/pull/1825" target="_blank">i18n support&lt;/a> to the library. We now have a mechanism by which you can hook up a translation engine of your choice, and JSON forms will use that to lookup messages, before falling back to the validator (also localised!) and finally, the JSON Schema&amp;rsquo;s defaults. This gives much stronger integration and allows the community to plug in their existing localisation methods - no wasted effort.&lt;/p>
&lt;p>Since the localisation addition, we&amp;rsquo;ve been working on fine-tuning the layout engine, making bug fixes, and integrating more closely with the underlying Vuetify library. This allows developers to more easily use the existing Vuetify parameters to change the style and behaviour of their form widgets. Again, no wasted effort. &lt;/p>
&lt;p>We&amp;rsquo;re lucky to have an active community - &lt;a href="https://github.com/kchobantonov" target="_blank">@kchobantonov&lt;/a> continues to make great contributions and push the library forward in unexpected ways - and the library is gaining popularity, with an average of a few hundred downloads per day. &lt;/p>
&lt;p>Some of our funder members have already seen this work in action, and given their feedback on early iterations of the user interface that supports registering grant records. We&amp;rsquo;ll be releasing this publicly very soon to get feedback from members - and then using that feedback to iterate on the grants registration form, and look towards extending it to other record types. &lt;/p>
&lt;h2 id="open-source-positivity">Open source POSItivity&lt;/h2>
&lt;p>A continuous theme throughout this project has been the willingness of people working on these open source projects to be generous with their time and experience. Whether it has been form generation libraries, the &lt;a href="https://json-schema.org/" target="_blank">JSON Schema project&lt;/a> or maintainers of &lt;a href="https://fluent-vue.demivan.me/" target="_blank">localisation plug-ins&lt;/a> - help, advice and encouragement have never been far away. And that&amp;rsquo;s appreciated. But it&amp;rsquo;s not something that we, or any other organisation who relies on the software they produce, should take for granted. Open source software helps everyone who uses it, and there&amp;rsquo;s a real opportunity within our community to make meaningful steps towards supporting its sustainability. Ironically, it&amp;rsquo;s often the most-used general purpose tools that get the least attention. We can change that.&lt;/p>
&lt;h2 id="look-out-for-more">Look out for more&lt;/h2>
&lt;p>Look out for more posts from the &lt;a href="https://www.crossref.org/categories/engineering/">engineering&lt;/a> team, coming soon!&lt;/p>
&lt;h3 id="references">References&lt;/h3>
&lt;p>&lt;a href="https://www.jviotti.com/dissertation.pdf" target="_blank">JSON Binpack: A space-efficient schema-driven and schema-less binary serialization specification based on JSON Schema&lt;/a> (Chapter 3.2.1 History and Relevance)&lt;/p>
&lt;p>&lt;a href="https://web.archive.org/web/20071026190426/http://www.json.com/2007/09/27/json-schema-proposal-collaboration/" target="_blank">https://web.archive.org/web/20071026190426/http://www.json.com/2007/09/27/json-schema-proposal-collaboration/&lt;/a>&lt;/p></description></item><item><title>ISR part three: Where does Crossref have the most impact on helping the community to assess the trustworthiness of the scholarly record?</title><link>https://www.crossref.org/blog/isr-part-three-where-does-crossref-have-the-most-impact-on-helping-the-community-to-assess-the-trustworthiness-of-the-scholarly-record/</link><pubDate>Mon, 17 Oct 2022 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/isr-part-three-where-does-crossref-have-the-most-impact-on-helping-the-community-to-assess-the-trustworthiness-of-the-scholarly-record/</guid><description>&lt;p>Ans: metadata and services are all underpinned by &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">POSI&lt;/a>.&lt;/p>
&lt;p>Leading into a blog post with a question always makes my brain jump ahead to answer that question with the simplest answer possible. I was a nightmare English Literature student. &amp;lsquo;Was Macbeth purely a villain?&amp;rsquo; &amp;lsquo;No&amp;rsquo;. *leaves exam*&lt;/p>
&lt;p>Just like not giving one-word answers to exam questions, &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">playing our role in the integrity of the scholarly record&lt;/a> and helping our members enhance theirs takes thought, explanation, transparency, and work.&lt;/p>
&lt;p>Some of the elements Amanda outlines in the previous posts in this series (&lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">Part 1&lt;/a>, &lt;a href="https://doi.org/10.64000/ctyr5-j0r91" target="_blank">Part 2)&lt;/a> really resonated from a product perspective:&lt;/p>
&lt;blockquote>
&lt;p>We must be cautious that our best practices for demonstrating legitimacy and identifying deceptive behaviour do not raise already-high barriers for emerging publications or organisations that present themselves in ways that some may not recognize as professional standards. Disruption is different from deception. Crossref has an opportunity to think about how to identify deceptive actions and pair that with our efforts to bring more people on board and support their full participation in our ecosystem.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>We don&amp;rsquo;t have the means or desire to be the arbiter of research quality (whatever that means). However, we operate neutrally, at the center of scholarly communications, and we can help develop a shared consensus or framework. Our metadata elements and tools can be positioned to signal or detect trustworthiness. An important distinction is that we can play a role in assessing legitimacy (activities of the actors) but not in quality (calibre of the content itself).&lt;/p>
&lt;/blockquote>
&lt;h2 id="crossref-has-lots-of-plans-and-lots-to-do-to-improve-our-role-in-isr">Crossref has lots of plans (and lots to do) to improve our role in ISR &lt;/h2>
&lt;p>Rather than a long list of things we want to do in terms of tools, services, and functionality, it feels more manageable to break this work into three key areas.&lt;/p>
&lt;h4 id="1--collecting-better-information-in-better-ways">1. Collecting better information in better ways&lt;/h4>
&lt;p>We think many elements of the metadata our members record with us help expose important information about the research, e.g., authors, publication dates, and abstracts. We also help our members assess submissions for originality via our Similarity Check service, and the ongoing migration to &lt;a href="https://doi.org/10.64000/bg7rk-dae91" target="_blank">iThenticate V2&lt;/a> aims to better support this aspect of the publication process.&lt;/p>
&lt;p>Beyond this, as Amanda points out, &amp;lsquo;once members start registering their content, their metadata speaks about their practices&amp;rsquo;. Seeing who published a work along with the metadata they provide; validated ORCID IDs to identify the authors, reference lists and links to related research and data, and important updates to the work via Crossmark, all contribute to showing not just the &amp;lsquo;what&amp;rsquo; but the &amp;lsquo;how&amp;rsquo; so that the community can use that information to support their decision-making.&lt;/p>
&lt;p>I always want to stress that this work is not just an &amp;lsquo;ask&amp;rsquo; for our members. We are moving in the same direction as we improve the things we do to support organisations in registering their records with us, answering their questions, working with partner organisations like &lt;a href="https://publicknowledge.org/" target="_blank">PKP&lt;/a>, consulting with our community on pain points, and thinking about how we can better enhance and facilitate their work. We&amp;rsquo;ve been fortunate that our community has taken the time to engage in discussions with Turnitin on iThenticate improvements, do user testing sessions as we build simple user interfaces to record grants, lead calls and conversations on improving grant metadata and supporting the uptake of ROR and data citation, and provide thoughtful feedback on &lt;a href="https://doi.org/10.31222/osf.io/6z7s3" target="_blank">our recent preprint on CRE metadata&lt;/a>. This all helps us to explain, structure, and prioritize our product work.&lt;/p>
&lt;p>There are also some closely related R&amp;amp;D-led projects that are already informing our thinking:&lt;/p>
&lt;ul>
&lt;li>A more responsive version of &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation reports&lt;/a> so that it&amp;rsquo;s easier for members to identify gaps in their metadata and compare against others.&lt;/li>
&lt;li>Making it easier to get metadata back in a format where members can easily redeposit it.&lt;/li>
&lt;li>Better matching to help us and our members augment the metadata they send us to add value to the work we all do.&lt;/li>
&lt;/ul>
&lt;p>We said in the previous blog posts that we&amp;rsquo;ll pose questions about what kinds of metadata give what kind of levels of trustworthiness, and have previously highlighted the following activities:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Reporting corrections and retractions through &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> metadata. We know that our members are collecting this information, but often it isn&amp;rsquo;t making it through metadata workflows to us. We&amp;rsquo;re part of the &lt;a href="https://www.niso.org/standards-committees/crec" target="_blank">NISO CREC (Communication of Retractions, Removals, and Expressions of Concern) working group&lt;/a> with many of our members and metadata users, as this feels like something critical to address.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Assessing originality using &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a>. On average, we&amp;rsquo;re seeing 320 new Similarity Check subscribers each year, with over 10 million checks being done each year by our members. &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Establishing provenance and stakeholders through ORCID and ROR. At the time of writing, we have over 30,000 ROR IDs in Crossref, and this is growing steadily across different record types. ROR is keen to support adoption and so are we. &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Acknowledging funding and other support through the use of the &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a> and &lt;a href="https://www.crossref.org/community/grants/">registering grants metadata&lt;/a>. This has improved in quality and completeness since we launched the Funder Registry in 2014 and with more comprehensive support for grants in more recent years. But we still have work to do, as this paper by Kramer and de Jonge points out: &lt;a href="https://doi.org/10.1162/qss_a_00210" target="_blank">The availability and completeness of open funder metadata&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/community/data-citation/">Citing data&lt;/a> for transparency and reproducibility, including &lt;a href="https://www.crossref.org/community/data-citation/">linking to related research data&lt;/a>. Scholix, MDC and STM Research Data groups. &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Demonstrating open peer review by &lt;a href="https://www.crossref.org/documentation/research-nexus/peer-reviews/">registering peer review reports&lt;/a>. Members have already recorded over 300,000 peer reviews with Crossref, opening up this information on their processes.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>In your organisation, what weight do you give these? We know that some of our members register some of these things in more volume than others - is that due to their perceived value, technical limitations, or &amp;lsquo;we&amp;rsquo;re working on it, give us time?&amp;rsquo; Do you think of them in the context of the integrity of the record or are we off the mark? Are there other things we haven&amp;rsquo;t mentioned in this blog that we could capture, report on and highlight? &lt;/p>
&lt;h4 id="2--disseminating-this-information-and-supporting-its-downstream-use">2. Disseminating this information and supporting its downstream use&lt;/h4>
&lt;p>We want to make it as easy as possible for everyone to access and use the metadata our members register with us. Especially as some of the biggest metadata users are our members and, more selfishly, us! But there&amp;rsquo;s no point collecting metadata to support ISR if it&amp;rsquo;s unwieldy and difficult to access and use.&lt;/p>
&lt;p>We&amp;rsquo;re working on a project, described in the &lt;a href="https://doi.org/10.64000/35qx3-8z834" target="_blank">mid-year community update&lt;/a> by a number of my colleagues to break down internal metadata silos and model it in a more flexible way. This will lend itself to better information collection and exchange, and support of the Research Nexus by building a relationships API to let anyone see all of the relationships Crossref can see between a given work and well, anything else related to it (citations, links to preprints, links to data to name but a few).&lt;/p>
&lt;p>Part of that work will involve supplementing the metadata our members register with high-quality, curated data from selected sources, making it clear where those assertions have come from.&lt;/p>
&lt;p>We want our API to perform consistently and well, to contain all the metadata our members register, handle it appropriately, and be able to keep the information in it up-to-date.&lt;/p>
&lt;p>Our API will underpin the reports we provide our members (among other things) so that we can provide simple interfaces for organisations to check how they&amp;rsquo;re doing along with more functional requests. Do their DOIs resolve? Are they submitting metadata updates when they publish a correction? How much will they be billed in a given quarter? We have a lot of internal reporting and need to build more, and if we want to use these, chances are many others do too, so we should open those up.&lt;/p>
&lt;h4 id="3--trying-to-live-up-to-posi-to-underpin-this-work">3. Trying to live up to POSI to underpin this work&lt;/h4>
&lt;p>When I see a new project, initiative, tool or service in the research ecosystem the first thing I want to do is find out about the organisation itself so that I can base some decisions on that. &lt;a href="https://barbarafister.net/libraries/lateral-reading-and-information-systems-in-the-age-of-distrust/" target="_blank">Lateral reading&lt;/a> in action.&lt;/p>
&lt;p>At Crossref, we want to show who we are beyond just our tools, services, and products and be transparent about our values. That&amp;rsquo;s why we have &lt;a href="https://doi.org/10.64000/hzemx-j7n79" target="_blank">adopted the Principles of Open Scholarly Infrastructure&lt;/a> or POSI for short. Now we need to meet these principles and we&amp;rsquo;re &lt;a href="https://doi.org/10.64000/1a8fc-3jq97" target="_blank">working towards that&lt;/a>. POSI proposes three areas that an Open Infrastructure organisation like Crossref can address to garner the trust of the broader scholarly community: accountability (governance), funding (sustainability), and protection of community interests (insurance). POSI also proposes a set of concrete commitments that an organisation can make to build community trust in each area.&lt;/p>
&lt;p>So POSI isn&amp;rsquo;t just opening code and metadata, it&amp;rsquo;s telling our community how we handle membership, governance, product development, technical and financial stability and security, holding our hands up when we&amp;rsquo;ve got something wrong, and actively looking to improve upon the things we do.&lt;/p>
&lt;p>Are you still reading? If so, you&amp;rsquo;ve done better than many of my examiners, I&amp;rsquo;m sure. So stay with us as we work together to ensure we bring quality, transparency, and integrity to the work we all do.&lt;/p>
&lt;p>The next part in this series will report back on the feedback and discussions and potentially propose some new or adjusted priorities. Join us at the Frankfurt bookfair this week (hall 4.2, booth M5) or comment on this post below.&lt;/p></description></item><item><title>ISR part two: How our membership approach helps to preserve the integrity of the scholarly record</title><link>https://www.crossref.org/blog/isr-part-two-how-our-membership-approach-helps-to-preserve-the-integrity-of-the-scholarly-record/</link><pubDate>Mon, 10 Oct 2022 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/isr-part-two-how-our-membership-approach-helps-to-preserve-the-integrity-of-the-scholarly-record/</guid><description>&lt;p>In &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">part one&lt;/a> of our series on the Integrity of the Scholarly Record (ISR), we talked about how the metadata that our members register with us helps to preserve the integrity of the record, and in particular how &amp;rsquo;trust signals&amp;rsquo; in the metadata, combined with relationships and context, can help the community assess the work. &lt;/p>
&lt;p>In this second blog, we describe membership eligibility and what you can and cannot tell simply from the fact that an organisation is a Crossref member; why increasing participation and reducing barriers actually helps to enhance the integrity of the scholarly record; and how we handle the very small number of cases where there may be a question mark.&lt;/p>
&lt;h2 id="who-can-become-a-crossref-member-and-do-we-check-new-applicants">Who can become a Crossref member and do we check new applicants?&lt;/h2>
&lt;p>Membership is open to organisations that &amp;ldquo;produce professional and scholarly materials and content&amp;rdquo;, and this is deliberately defined broadly. We’re a global community of members with content in all disciplines, in many formats, with all kinds of business models - research institutions, publishers, government agencies, research funders, banks, museums and many more. &lt;/p>
&lt;blockquote>
&lt;p>Essentially, if your content is likely to be cited in the research ecosystem and you consider it part of the evidence trail, then you’re eligible to join.&lt;/p>
&lt;/blockquote>
&lt;p>We ask organisations to complete an online application form and accept our member terms. On receipt of the application, we run a few very basic checks to ensure that:&lt;/p>
&lt;ol>
&lt;li>The applicant can meet the membership criteria and seems to have the capacity to fulfill the &lt;a href="https://www.crossref.org/membership/terms/">obligations&lt;/a> (and follow our &lt;a href="https://www.crossref.org/code-of-conduct/">code of conduct&lt;/a>).&lt;/li>
&lt;li>We are legally permitted to accept them as a member (for example, we can’t accept applications from some countries due to &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/sanctions/">sanctions&lt;/a>.&lt;/li>
&lt;li>They haven&amp;rsquo;t previously been a member of Crossref whose membership was &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/revocation/">revoked&lt;/a>.&lt;/li>
&lt;li>They haven&amp;rsquo;t misrepresented themselves in the application (such as their location).&lt;/li>
&lt;li>The applicant or an affiliate is not already a member of Crossref (so that we can advise they join under a single membership fee).&lt;/li>
&lt;/ol>
&lt;p>As long as the applicant can meet these requirements, and as long as they are able to pay any membership fees upfront for their first year of membership, they are able to become a Crossref member, get a DOI prefix, and start registering their metadata to share it with the global scholarly community. &lt;/p>
&lt;p>We are aware that some organisations in some regions may not be able to join Crossref independently. There may be barriers for them - the cost of membership fees, the fact that we only accept payment in US dollars, language barriers or technical barriers. &lt;/p>
&lt;p>To help increase participation globally, we work with &lt;a href="https://www.crossref.org/membership/about-sponsors/">sponsors&lt;/a> in some regions. All sponsors facilitate membership for organisations who wish to participate in Crossref. They pay one central membership fee on behalf of all the members they work with, and they also pay content registration fees on behalf of their members. Many sponsors register content on behalf of their members, and even if they don’t, most provide local language and technical support. Sponsors are able to charge for their services, but it can be a very economical route for a member to join. In the last year, out of the 2,322 new members that we’ve welcomed, almost 58% joined via a sponsor.&lt;/p>
&lt;p>We also &lt;a href="https://www.crossref.org/gem/">waive registration fees&lt;/a> for members in certain lower income countries who join via three of our sponsors, and we are planning to expand this program soon (pending board approval in November). [&lt;em>EDIT 2022-November-23: The new &lt;a href="https://www.crossref.org/gem">Global Equitable Membership (GEM) Program&lt;/a> was approved and takes effect 1st January 2023&lt;/em>]&lt;/p>
&lt;h2 id="the-importance-of-keeping-barriers-to-entry-low">The importance of keeping barriers to entry low &lt;/h2>
&lt;p>As you can see, the checks that we run on new applicants are fairly limited in scope. In the last year, we’ve welcomed 2,322 new members and we only declined 39 applications. And 34 of these declined applications were effectively from one organisation whose membership was revoked in 2019.&lt;/p>
&lt;p>Even this minimal set of checks takes a lot of research and keeps our member support specialists very busy - thank you &lt;a href="https://www.crossref.org/people/sally-jennings/">Sally Jennings&lt;/a> and &lt;a href="https://www.crossref.org/people/robbykha-rosalien/">Robbykha Rosalien&lt;/a> (as well as contractors Kim and Collin). &lt;/p>
&lt;p>So why shouldn&amp;rsquo;t we run more extensive checks on new member applicants? Why don’t we check the quality of their content, or that they are following best practices? Why don’t we decline membership for organisations that can’t demonstrate editorial integrity or that aren’t meeting 100% of the membership obligations from the start?&lt;/p>
&lt;p>Nevermind the additional capacity that more extensive checks on the over 200 applicants we receive per month would entail, it&amp;rsquo;s more fitting with our mission to:&lt;/p>
&lt;ol>
&lt;li>enable equitable participation; and&lt;/li>
&lt;li>focus on evidence:&lt;/li>
&lt;/ol>
&lt;h3 id="equitable-participation">Equitable participation&lt;/h3>
&lt;p>Inclusivity is very important to us - after all, one of our &lt;a href="https://www.crossref.org/truths/">organisational truths&lt;/a> (the guiding principles for everything we do) is “come one, come all”, and this is mirrored in the POSI principles that &lt;a href="https://doi.org/10.64000/1a8fc-3jq97" target="_blank">commit us to broad stakeholder representation&lt;/a>. We know that for new organisations, it may take them a while to be able to completely fulfil the membership obligations. We support them with information to help them understand what being a participant in the Crossref community entails. These organisations would have less of a chance of developing better practices if we were to limit membership in Crossref to &amp;lsquo;proven&amp;rsquo; candidates. Besides, it would introduce a &lt;a href="https://en.wikipedia.org/wiki/Race_condition" target="_blank">race condition&lt;/a>; if joining and sharing metadata through Crossref is widely considered best practice, new entrants &lt;em>need&lt;/em> to join Crossref in order to show that they are adopting best practices.&lt;/p>
&lt;h3 id="trust-signals-and-the-research-nexus">Trust signals and the Research Nexus&lt;/h3>
&lt;p>Secondly, it&amp;rsquo;s not our role to make such a call; we don’t have the expertise to decide if an organisation would be considered “good” at what they are producing; there are other organisations guiding in this area, such as with the &lt;a href="https://publicationethics.org/resources/guidelines-new/principles-transparency-and-best-practice-scholarly-publishing" target="_blank">Principles of Transparency and Best Practice in Scholarly Publishing&lt;/a>. Instead, we focus on the decision-making tools, metadata, and relationships that can help provide trust signals for the community.&lt;/p>
&lt;p>Once members start registering their content, their activity and metadata speak about their practices – others in the community can process that metadata, combined with its wider context, and identify trust signals to make their own decisions. That metadata can only be shared in an open and machine-readable way if an organisation joins Crossref and starts registering their records and underpinning data with us. &lt;/p>
&lt;p>To paint a more detailed picture of the scholarly record, our priority is to get more and varied organisations contributing to the research nexus, rather than putting up barriers and blockers until they are performing perfectly. If they aren’t acting in the best interests of the scholarly community, then having the metadata available to assess will quickly make that obvious and hopefully encourage changes - sunlight being the best disinfectant, as the saying goes.&lt;/p>
&lt;p>As we said in the &lt;a href="https://doi.org/10.64000/edg3w-7t592" target="_blank">first ISR blog&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>“Crossref itself doesn’t assess the quality of content or the integrity of the research process but rather enables those who produce scholarly outputs to provide metadata (effectively evidence) about how they ensure the quality of content and how the outputs fit into the scholarly record.”&lt;/p>
&lt;/blockquote>
&lt;p>In our next post in the series, we&amp;rsquo;ll talk more about the workflow and decision-making tools we have in place and are planning to develop. We&amp;rsquo;ll pose questions about what kinds of metadata give what kind of levels of trustworthiness.&lt;/p>
&lt;h2 id="helping-new-members-become-good-crossref-citizens">Helping new members become “good Crossref citizens”&lt;/h2>
&lt;p>Once an applicant becomes a member, we help them to completely fulfil the membership terms - ensuring that, for example, they register and display DOIs, keep their metadata up to date, and implement &lt;a href="https://www.crossref.org/services/reference-linking/">reference linking&lt;/a> properly. &lt;/p>
&lt;p>We have &lt;a href="https://www.crossref.org/documentation/">a lot of documentation&lt;/a> on our website, we run regular &lt;a href="https://www.crossref.org/events/">events and webinars&lt;/a>, and we have a series of automated onboarding emails for new members to help them move through the key stages of the member journey from set up and onboarding to levelling up and using additional services like Crossmark and Similarity Check. Our staff are also on hand alongside Ambassadors and other members in our &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>. Speaking of POSI (and transparent operations) we receive around 3,000 emails per month with support requests so we are gradually moving support from closed 1:1 email to the more public and efficient community support forum.&lt;/p>
&lt;p>We work with members who aren’t fulfilling the obligations to understand challenges and help explain what they need to do. This is currently reactive, but we have plans to automate checks on whether members are meeting the membership terms in future.&lt;/p>
&lt;p>Outside of confirming that our members are behaving as “good Crossref citizens”, there aren’t many other areas where the membership team typically gets involved. Our mission is to help preserve the integrity of the scholarly record by making the metadata provided by our members openly available in a machine-readable format. We don’t investigate our members’ business practices or take a deep dive into their editorial processes (such as peer review), and there are many areas where we aren’t able to get involved. For example, we cannot arbitrate title ownership disputes.  &lt;/p>
&lt;h2 id="its-all-about-preserving-the-integrity-of-the-scholarly-record">It’s all about preserving the integrity of the scholarly record&lt;/h2>
&lt;p>We do sometimes revoke membership, but this is for limited reasons: &lt;/p>
&lt;ol>
&lt;li>unpaid invoices;&lt;/li>
&lt;li>legal sanctions or judgments against the member or its home country; or&lt;/li>
&lt;li>contravention of the membership terms.&lt;/li>
&lt;/ol>
&lt;h3 id="membership-revocation-due-to-unpaid-invoices">Membership revocation due to unpaid invoices&lt;/h3>
&lt;p>We spend a lot of time communicating with members who haven’t paid their invoices and ensuring they have the information they need to solve the problem. Revoking membership due to unpaid fees is an absolute last step for us, but financial sustainability means we can keep the organisation afloat and keep our infrastructure running.&lt;/p>
&lt;p>Where members have unpaid fees, we eventually suspend their access to register new records and then ultimately revoke their membership if the fees remain unpaid. Once an organisation’s membership has been revoked, they would need to re-apply if they wanted to become a member again in the future. If accepted, the applicant would need to pay all outstanding invoices before re-joining. &lt;/p>
&lt;p>In March 2022, we revoked membership for around 140 members due to unpaid invoices (out of a total of over 17,000 active members).
 &lt;/p>
&lt;h3 id="membership-revocation-due-to-sanctions">Membership revocation due to sanctions&lt;/h3>
&lt;p>Occasionally, we are informed of sanctions that we need to comply with, such as the recent case of Russia invading Ukraine where each Russian member needed to be checked for individual sanctions and some were revoked. Such revocations have to be voted on by the Executive Committee and then ratified by the board. Read more &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/sanctions/">information on our sanctions process&lt;/a>. &lt;/p>
&lt;h3 id="membership-revocation-for-cause">Membership revocation for cause&lt;/h3>
&lt;p>Very occasionally there may be evidence that a member is in contravention of the membership terms. This may include:&lt;/p>
&lt;ul>
&lt;li>Misrepresentation in the original membership application&lt;/li>
&lt;li>Fraudulent use of identifiers or metadata&lt;/li>
&lt;li>Contravening the code of conduct&lt;/li>
&lt;li>Any other basis set forth in our governing documents.&lt;/li>
&lt;/ul>
&lt;p>We always try to work together with the member to solve problems, and again, revoking membership is an absolute last step. The revocation has to be voted on by the Executive Committee and then ratified by the board. &lt;/p>
&lt;p>Our first ever revocation for cause was in July 2019 for OMICS, after the board voted that the &lt;a href="https://www.ftc.gov/legal-library/browse/cases-proceedings/152-3113-omics-group-inc" target="_blank">US Federal Trade Commission&lt;/a>&amp;rsquo;s ruling against them amounted to a cause for revocation. There have been a handful of cases since. For example, most recently in September this year we revoked membership for a member who was registering DOIs for journals with the ISSNs of similarly-named publications.&lt;/p>
&lt;p>There’s more information about our &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/revocation/">processes to revoke membership&lt;/a> on our website.&lt;/p>
&lt;h2 id="more-participation-for-the-win">More participation for the win&lt;/h2>
&lt;p>In conclusion, we believe that the more parties able to participate in Crossref and provide metadata and context for the research nexus, the more robust this makes the scholarly record.&lt;/p>
&lt;p>But do you agree? Are these measures enough? What other information about our membership operations would help us be more transparent? As we said in our first blog, we need your help to establish whether our approach is still the right one, if we are missing anything and what else we might be able to do.&lt;/p>
&lt;p>Here’s how you can help:&lt;/p>
&lt;ul>
&lt;li>Join the discussion about the integrity of the scholarly record on our community forum.&lt;/li>
&lt;li>Keep an eye out for future blog posts and meetings. We are having a small, in-person discussion prior to the Frankfurt Book Fair and will report on this in a future blog post.&lt;/li>
&lt;li>Sign up to attend &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2022">Crossref LIVE22&lt;/a> for updates on these topics and all things Crossref.&lt;/li>
&lt;li>Join and support initiatives and organisations that we partner with or who use our metadata to look at ethical practices, for example, &lt;a href="https://publicationethics.org/" target="_blank">COPE&lt;/a>, &lt;a href="https://doaj.org/" target="_blank">DOAJ&lt;/a>, and &lt;a href="https://oaspa.org/" target="_blank">OASPA&lt;/a>, and review the &lt;a href="https://www.oaspa.org/resources/principles-of-transparency-and-best-practice-in-scholarly-publishing/" target="_blank">Principles of Transparency in Scholarly Publishing&lt;/a>, which these organisations worked on with &lt;a href="https://www.wame.org/" target="_blank">WAME&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>ISR part one: What is our role in preserving the integrity of the scholarly record?</title><link>https://www.crossref.org/blog/isr-part-one-what-is-our-role-in-preserving-the-integrity-of-the-scholarly-record/</link><pubDate>Thu, 22 Sep 2022 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/isr-part-one-what-is-our-role-in-preserving-the-integrity-of-the-scholarly-record/</guid><description>&lt;p>The integrity of the scholarly record is an essential aspect of research integrity. Every initiative and service that we have launched since our founding has been focused on documenting and clarifying the scholarly record in an open, machine-actionable and scalable form. All of this has been done to make it easier for the community to assess the trustworthiness of scholarly outputs. Now that the scholarly record itself has evolved beyond the published outputs at the end of the research process – to include both the elements of that process and its aftermath – preserving its integrity poses new challenges that we strive to meet&amp;hellip; we are reaching out to the community to help inform these efforts.&lt;/p>
&lt;p>Scholarly research, and therefore scholarly communications, are rapidly changing with the development of new approaches, technologies, and models. We need &lt;a href="https://doi.org/10.24343/C34W2H" target="_blank">open scholarly infrastructure&lt;/a> that can adapt to these changes and provide trust signals that enable assessment of the integrity of the research and reflect the ways that research is changing. Crossref has been changing and adapting by building on the concept of the scholarly record with our vision of the &lt;a href="https://doi.org/10.64000/35qx3-8z834" target="_blank">Research Nexus&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society&amp;rdquo;.&lt;/p>
&lt;/blockquote>
&lt;p>The foundation of the scholarly record and Research Nexus is metadata and relationships - the richer and more comprehensive the metadata and relationships in Crossref records, the more context there is for our members and for the whole scholarly research ecosystem. This will lead to a range of benefits from better discovery and saving researchers time to the assessment of research impact and research integrity. This is why Crossref is focused on enriching metadata to provide more and better trust signals while keeping barriers to membership and participation as low as possible to enable an inclusive scholarly record.&lt;/p>
&lt;p>We want to engage with the community to emphasise this role, share our plans for the future, and get feedback to establish if we are heading in the right direction.&lt;/p>
&lt;p>This blog explains our current position and will be followed by subsequent posts exploring all our services and plans in this area, as well as more details on our membership operations and policies.&lt;/p>
&lt;h2 id="what-is-integrity-of-the-scholarly-record-isr-and-how-does-it-feed-into-research-integrity">What is “Integrity of the Scholarly Record” (ISR), and how does it feed into Research Integrity?&lt;/h2>
&lt;p>The US National Institutes of Health (NIH) &lt;a href="https://grants.nih.gov/policy/research_integrity/what-is.htm" target="_blank">defines&lt;/a> research integrity as a set of values in scientific research: honesty; accuracy; efficiency; and objectivity. It’s concerned with the &lt;em>soundness of the process&lt;/em> of science. As a subset of that, the &lt;em>outputs of the scholarly publishing process&lt;/em> create a “scholarly record” which allows those in the community to find evidence and context to help confirm whether these values have been adhered to. The scholarly record is Crossref’s focus. This means that Crossref itself doesn’t assess the quality of content or the integrity of the research process but rather enables those who produce scholarly outputs to provide metadata (effectively evidence) about how they ensure the quality of content and how the outputs fit into the scholarly record (through reference links, ORCID iDs for authors, ROR IDs for affiliations, funding and licensing information, etc.).&lt;/p>
&lt;p>Crossref members include any organisation that produces research objects and materials (publishers, societies, universities, funders, research institutions, scholars) so they can establish a persistent record—tied to a persistent and unique identifier—for these outputs and supply metadata about this content in an open, machine-readable way. Maintaining this record for the long term, and adding in an important layer of context, establishes the integrity of the scholarly record as well as ensuring it is something that can be used by the whole community to improve scholarly research for generations to come.&lt;/p>
&lt;h2 id="the-scholarly-record-is-about-more-than-just-published-outputs---its-also-a-network-of-inputs-relationships-and-contexts">The scholarly record is about more than just published outputs - it’s also a network of inputs, relationships, and contexts&lt;/h2>
&lt;p>In the past, the Scholarly Record was seen as just the published outputs at the end of the research process - for example, journal articles or book chapters. But as the OCLC Research Group notes in their 2014 report on &lt;a href="https://doi.org/10.25333/C3763V" target="_blank">The Evolving Scholarly Record&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>“The boundaries of the scholarly record are in flux, as they stretch to extend over an ever-expanding range of materials.”&lt;/p>
&lt;/blockquote>
&lt;p>OCLC describes how outputs at the “process” and “aftermath” stages of the research process are becoming increasingly important alongside the outputs at the traditional “outcomes” stage.&lt;/p>
&lt;p>We like to take this even further. We think the evolving Scholarly Record is about more than just recording different types of &lt;em>works&lt;/em>. As the above report notes “&lt;em>The scholarly record is evolving to have greater emphasis on collecting and curating context of scholarly inquiry […] One can imagine an article in quantitative biology published in a Wiley journal, the data for which resides in Dryad; the e-print in arXiv; and the conference poster in F1000. All of these materials may be considered part of the scholarly record, but no single institution will collect them all. Instead, access is achieved through a coordination of stewardship roles in which the scholarly record is decomposed into discrete, interrelated units that organisations specialize in collecting, preserving, and making available&lt;/em>.”&lt;/p>
&lt;p>It’s this &lt;em>interrelatedness&lt;/em> that we think is important, and Crossref plays an important role in collecting, matching, and sharing those relationships. We now focus on this ‘nexus’ - so no longer primarily the different types of objects, but increasingly the interplay and relationships between them. The context, rather than the individual metadata elements, is what’s key.&lt;/p>
&lt;p>Martin Eve explores this idea further in his blog &lt;a href="https://eve.gd/2022/07/26/what-is-the-scholarly-record/" target="_blank">What is the Scholarly Record&lt;/a>, suggesting “the scholarly record is a decentralized network of evolving truth assertions” and “Whether a truth assertion is part of the scholarly record is determined by another set of distributed assertions and their power configurations (say, through institutional affiliation) of the individuals who make such assertions.”&lt;/p>
&lt;p>Barbara Fister&amp;rsquo;s excellent &lt;a href="https://barbarafister.net/libraries/lateral-reading-and-information-systems-in-the-age-of-distrust/" target="_blank">talk about the importance of lateral reading as a way to understand information systems&lt;/a> discusses how professional fact checkers “engaged in “lateral reading,” check other sources for context before spending time reading and analyzing a source.”&lt;/p>
&lt;p>Fister highlights the “SIFT” approach from &lt;a href="https://barbarafister.net/libraries/lateral-reading-and-information-systems-in-the-age-of-distrust/" target="_blank">A Curriculum for Civic Online Reasoning&lt;/a>, created by a group of educators at Stanford University for students to evaluate online content. And she argues that this approach is also useful for assessing scholarly materials noting&lt;/p>
&lt;blockquote>
&lt;p>“&lt;em>The networked, social nature of scholarship is worth making explicit&lt;/em>”.&lt;/p>
&lt;/blockquote>
&lt;h2 id="where-does-crossref-fit-in-where-do-we-have-the-most-impact-and-opportunity">Where does Crossref fit in? Where do we have the most impact and opportunity?&lt;/h2>
&lt;p>To address the question of our role in the integrity of the scholarly record, we need to understand several aspects that Crossref has to balance in this capacity, such as&lt;/p>
&lt;ul>
&lt;li>We don’t have the means or desire to be the arbiter of research quality. However, we operate neutrally, at the centre of scholarly communications, and we can help develop a shared consensus or framework. Our metadata elements and tools can be positioned to signal or detect trustworthiness. An important distinction is that we can play a role in assessing &lt;em>legitimacy&lt;/em> but not in assessing &lt;em>quality&lt;/em>.&lt;/li>
&lt;li>We must be cautious that our best practices for demonstrating legitimacy and handling less-than-legitimate behaviour do not raise already-high barriers for emerging publications or organisations that present in ways that some may not recognise as professional standards. Disruption is different from deception. In discussions with our board this point has come out strongly: that Crossref has an opportunity to think about how to help the community identify deceptive actions and pair that with our efforts to bring more people on board.&lt;/li>
&lt;li>Addressing this issue may involve changes to our membership eligibility and processes, bylaws, policies, staff resources, and technical and metadata solutions; actually, a combination of all these aspects. Many of these are projects that are already planned and we have ideas for extending these.&lt;/li>
&lt;li>We regularly review the process we use for evaluating when and why to revoke membership for reasons other than non-payment. The volume of cases that we believe justify membership revocation&amp;mdash;while a tiny fraction of members&amp;mdash;is growing and does take staff and legal resources to address.&lt;/li>
&lt;/ul>
&lt;h2 id="crossref-and-our-members-aleady-help-preserve-the-integrity-of-the-scholarly-record-in-significant-ways">Crossref and our members aleady help preserve the integrity of the scholarly record in significant ways&lt;/h2>
&lt;p>Almost all of our services in some way touch on enabling people to express and evaluate trustworthiness; our mission statement commits us to “&lt;em>making research objects easy to find, cite, link, &lt;strong>assess&lt;/strong>, and reuse [&amp;hellip;] all to help put research in context.&lt;/em>”&lt;/p>
&lt;p>We have, of course, specific tools and services that augment this activity too. Many members are active in:&lt;/p>
&lt;ul>
&lt;li>Reporting corrections and retractions through &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> metadata.&lt;/li>
&lt;li>Assessing originality using &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a>.&lt;/li>
&lt;li>Conveying their stewardship via the public &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a>.&lt;/li>
&lt;li>Establishing provenance and stakeholders through funding metadata, ORCID, and ROR.&lt;/li>
&lt;li>Acknowledging funding through the use of the &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a> and &lt;a href="https://www.crossref.org/community/grants/">registering grants metadata&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/community/data-citation/">Citing data&lt;/a> for transparency and reproducibility, including linking to related research data via &lt;a href="https://doi.org/10.64000/a0db5-dgq68" target="_blank">Event Data&lt;/a>.&lt;/li>
&lt;li>Demonstrating open peer review by &lt;a href="https://www.crossref.org/documentation/research-nexus/peer-reviews/">registering peer review reports&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>As recently concluded in this &lt;a href="https://doi.org/10.1038/d41586-022-02915-1" target="_blank">Nature editorial&lt;/a> calling for us to think beyond open references,&lt;/p>
&lt;blockquote>
&lt;p>“Depositing all relevant metadata in Crossref should become the norm in scholarly publishing.”&lt;/p>
&lt;/blockquote>
&lt;p>For those members just starting out on their journey, there are some immediate specific things that all members are able to do. Check your &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a> and start registering more metadata to add that contextual layer:&lt;/p>
&lt;ul>
&lt;li>References&lt;/li>
&lt;li>Abstracts&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.31222/osf.io/6z7s3" target="_blank">Corrections and retractions via Crossmark&lt;/a>&lt;/li>
&lt;li>License links&lt;/li>
&lt;li>ORCID IDs for authors&lt;/li>
&lt;li>ROR IDs for affiliations&lt;/li>
&lt;li>Grant IDs for funding acknowledgements&lt;/li>
&lt;li>Cite data (preferably using DataCite DOIs in reference lists)&lt;/li>
&lt;li>Register all related objects such as versions and translations via relationships&lt;/li>
&lt;li>Register grants with Crossref (funder members).&lt;/li>
&lt;/ul>
&lt;blockquote>
&lt;p>By enabling our members to register their research objects and create metadata records about them that are freely and openly shared with the scholarly community, we facilitate them in being able to communicate the context and trustworthiness of that object.&lt;/p>
&lt;/blockquote>
&lt;p>And within that metadata, they can create relationships not just between research objects and also between research stakeholders - the individuals, affiliations, funders, and other players involved. That’s why we work so closely with other parts of foundational scholarly infrastructure (ORCID, DataCite, ROR) and why we now have more than 30 funders registering grants with us. We want to help to capture, identify, and link together all these important elements and more to deliver context for the scholarly record.&lt;/p>
&lt;p>We started this blog by talking about the changes that are taking place in the world of research and how the infrastructure needs to adapt and change. Although we have extensive plans in place to improve our contribution to ISR, we need your help to establish whether our role is still the right one, whether we are missing anything and what else we might be able to do.&lt;/p>
&lt;ul>
&lt;li>Join the discussion about the integrity of the scholarly record, and the Research Nexus on our Community Forum.&lt;/li>
&lt;li>Keep an eye out for future blog posts and meetings. We are having a small, in-person discussion prior to the Frankfurt Book Fair and will report on this in a future blog post.&lt;/li>
&lt;li>Sign up to attend &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2022">Crossref LIVE22&lt;/a> for updates on these topics and all things Crossref.&lt;/li>
&lt;li>Join and support initiatives and organisations that we partner with or who use our metadata to look at ethical practices in publishing, for example, &lt;a href="https://publicationethics.org/" target="_blank">COPE&lt;/a>, &lt;a href="https://doaj.org/" target="_blank">DOAJ&lt;/a>, and &lt;a href="https://oaspa.org/" target="_blank">OASPA&lt;/a>, and review the &lt;a href="https://www.oaspa.org/resources/principles-of-transparency-and-best-practice-in-scholarly-publishing/" target="_blank">Principles of Transparency in Scholarly Publishing&lt;/a>, which these organisations worked on with &lt;a href="https://www.wame.org/" target="_blank">WAME&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>In the coming weeks, we will post more about our product and metadata plans and also about the specifics of membership operations and cases we see and how we’re currently addressing them.&lt;/p>
&lt;h3 id="please-share-your-thoughts">Please share your thoughts!&lt;/h3></description></item><item><title>2022 Board Election</title><link>https://www.crossref.org/blog/2022-board-election/</link><pubDate>Fri, 16 Sep 2022 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2022-board-election/</guid><description>&lt;p>I’m pleased to share the 2022 board election slate. Crossref’s Nominating Committee received 40 submissions from members worldwide to fill five open board seats.&lt;/p>
&lt;p>We maintain a balance of eight large member seats and eight small member seats. A member’s size is determined based on the membership fee tier they pay. We look at how our total revenue is generated across the membership tiers and split it down the middle. Like last year, about half of our revenue came from members in the tiers $0 - $1,650, and the other half came from members in tiers $3,900 - $50,000. We have four large member seats and one small member seat open for election in 2022.&lt;/p>
&lt;p>The &lt;a href="https://www.crossref.org/committees/nominating">Nominating Committee&lt;/a> presents the following slate.&lt;/p>
&lt;h2 id="the-2022-slate">The 2022 slate&lt;/h2>
&lt;h3 id="tier-1-candidates-electing-one-seat">Tier 1 candidates (electing one seat):&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>eLife&lt;/strong>, Damian Pattinson, Executive Director&lt;/li>
&lt;li>&lt;strong>Pan Africa Science Journal&lt;/strong>, Oscar Donde, Editor in Chief&lt;/li>
&lt;/ul>
&lt;h3 id="tier-2-candidates-electing-four-seats">Tier 2 candidates (electing four seats):&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Clarivate&lt;/strong>, Christine Stohn, Director of Product Management&lt;/li>
&lt;li>&lt;strong>Elsevier&lt;/strong>, Rose L’Huillier, Senior Vice President Researcher Products&lt;/li>
&lt;li>&lt;strong>The MIT Press&lt;/strong>, Nick Lindsay, Journals and Open Access Director&lt;/li>
&lt;li>&lt;strong>Springer Nature&lt;/strong>, Anjalie Nawaratne, VP Data Transformation &amp;amp; Chief Business Architect&lt;/li>
&lt;li>&lt;strong>Wiley&lt;/strong>, Allyn Molina, Group Vice President, Research Publishing&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="here-are-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2022-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2022-slate/">Here are the candidates&amp;rsquo; organisational and personal statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process by voting in the election&lt;/h2>
&lt;p>If your organisation is a voting member in good standing of Crossref as of September 6th, 2022, you are eligible to vote when voting opens on September 20th, 2022.&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>Your organisation’s designated voting contact will receive an email the week of September 19th with the Formal Notice of Meeting and Proxy Form with concise instructions on how to vote. You will also receive a username and password with a link to our voting platform.&lt;/p>
&lt;p>The election results will be announced at the &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2022" target="_blank">LIVE22 online meeting&lt;/a> on October 26th, 2022. Save the date! Incoming members will take their seats at the March 2023 board meeting.&lt;/p></description></item><item><title>Accessibility for Crossref DOI Links: Call for comments on proposed new guidelines</title><link>https://www.crossref.org/blog/accessibility-for-crossref-doi-links-call-for-comments-on-proposed-new-guidelines/</link><pubDate>Tue, 06 Sep 2022 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/accessibility-for-crossref-doi-links-call-for-comments-on-proposed-new-guidelines/</guid><description>&lt;p>Our entire community &amp;ndash; members, metadata users, service providers, community organisations and researchers &amp;ndash; &lt;a href="https://www.crossref.org/documentation/member-setup/constructing-your-dois/" target="_blank">create&lt;/a> and/or &lt;a href="https://www.crossref.org/services/metadata-retrieval/" target="_blank">use&lt;/a> DOIs in some way so making them more accessible is a worthy and overdue effort.&lt;/p>
&lt;p>For the first time in five years and only the second time ever, we are recommending some changes to our &lt;a href="https://www.crossref.org/display-guidelines/" target="_blank">DOI display guidelines&lt;/a> (the changes aren’t really for display but more on that below). We don’t take such changes lightly, because we know it means updating established workflows. We appreciate the questions that prompted us to make this recommendation and we know it’s critical that we get community input on the proposed updates.&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>Here is a quick overview:&lt;/p>
&lt;ul>
&lt;li>DOIs and URLs themselves don’t really tell readers much. People with visual impairments rely on screen readers to read out loud the contents of a page. We’re asking for the title of each DOI to be added, in an &lt;a href="https://www.w3.org/WAI/standards-guidelines/aria/" target="_blank">ARIA&lt;/a> (Accessible Rich Internet Applications) attribute, so these users understand what these links are for.&lt;/li>
&lt;li>Accessible text, as this kind of description is known, should be included for all links, but at this time, we’re specifically recommending it for landing pages of newly registered records.&lt;/li>
&lt;li>It’s not required, yet. We’re proposing a 2 year recommendation period and we want your feedback on the particulars, including timing and how we can help. Please take a &lt;a href="https://forms.gle/K6zWQ3f1dmYUkj9T6" target="_blank">short survey&lt;/a> and/or &lt;a href="mailto:feedback@crossref.org">get in touch&lt;/a> and share your thoughts.&lt;/li>
&lt;li>We’ll finalize these recommendations after assessing the feedback. Please check back for updates.&lt;/li>
&lt;/ul>
&lt;h2 id="what-is-changing-when-and-why">What is changing, when and why&lt;/h2>
&lt;p>The proposed updates are meant to improve overall usability, particularly for people with visual impairments, by aligning our guidelines with modern accessibility requirements such as the new &lt;a href="https://www.w3.org/2021/09/UX-Guide-metadata-1.0/principles/" target="_blank">W3C recommendations&lt;/a> and the &lt;a href="https://inclusivepublishing.org/blog/what-does-the-european-accessibility-act-mean-for-global-publishing/" target="_blank">European Accessibility Act&lt;/a>. This means that assistive technologies such as screen readers can interpret DOI links.&lt;/p>
&lt;p>&lt;strong>Why are changes being recommended?&lt;/strong>&lt;/p>
&lt;p>DOIs are &lt;a href="https://www.crossref.org/documentation/member-setup/constructing-your-dois/#whyopaque" target="_blank">unique&lt;/a> and persistent links to items in the scholarly record so it makes sense that they link to the full URLs for the associated content –for example, a journal article. The issue for people who rely on screen readers is that a DOI link doesn’t provide title or other information to give that link context. Users of screen readers need to know what the destination of a link is.&lt;/p>
&lt;p>These users often lack the context that other users have; in fact, they may be presented with links in a document as a list. That&amp;rsquo;s why all links, not just DOI links, need what is called &amp;ldquo;accessible text.” Providing additional information for links requires &lt;a href="https://www.w3.org/WAI/standards-guidelines/aria/" target="_blank">ARIA&lt;/a> (Accessible Rich Internet Applications) techniques. This speaks to the Web Content Accessibility Guidelines (WCAG), the standard guidelines for accessibility across the web, specifically &lt;a href="https://www.w3.org/WAI/WCAG21/Understanding/link-purpose-in-context" target="_blank">success criterion 2.4.4&lt;/a> - Link Purpose (In Context), which aims to ‘help users understand the purpose of each link so they can decide whether they want to follow the link.’&lt;/p>
&lt;p>&lt;strong>For your feedback: recommended draft changes&lt;/strong>&lt;/p>
&lt;p>We recommend the addition of an &lt;em>aria-label&lt;/em> attribute for DOI links, containing as its value the descriptive title of the content represented by the DOI, so that screen readers can interpret DOI links. This means that, &lt;em>while the DOI display itself doesn’t actually change&lt;/em>, the link is enhanced with additional, contextual information for the user of assistive technology, in one of two ways, either:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>an aria-label attribute&lt;/strong>, described as ‘a way to place a descriptive text label on an object,’ identifying the destination, or&lt;/li>
&lt;li>&lt;strong>an aria-describedby attribute&lt;/strong> pointing to where the destination is identified in the surrounding text.&lt;/li>
&lt;/ul>
&lt;p>The updated HTML for a journal article*, for example, would be:&lt;/p>
&lt;p>&lt;code>&amp;lt;a href=&amp;quot;https://doi.org/10.5555/12345678&amp;quot; aria-label=&amp;quot;DOI for Toward a Unified Theory of High-Energy Metaphysics: Silly String Theory&amp;quot;&amp;gt;https://doi.org/10.5555/12345678&amp;lt;/a&amp;gt;&lt;/code>&lt;/p>
&lt;p>Here the aria-label has been set to the value of the ‘title’ property as retrieved from the Crossref REST API at &lt;a href="https://api.crossref.org/v1/works/10.5555/12345678" target="_blank">https://api.crossref.org/v1/works/10.5555/12345678&lt;/a>.&lt;/p>
&lt;p>*Note that fields may vary slightly for different &lt;a href="https://www.crossref.org/documentation/research-nexus/" target="_blank">record types&lt;/a>.&lt;/p>
&lt;p>This proposed solution allows screen readers to read aloud to users the value of the aria-label attribute, instead of the full DOI in the link text.&lt;/p>
&lt;p>&lt;em>At this time, we are recommending the change for landing pages in particular&lt;/em>, but it can and should be applied to wherever DOI links appear, whenever feasible (more on this below).&lt;/p>
&lt;p>Our guidelines will continue to state that the DOI should always be displayed as a full URL link&amp;ndash;that will not change. Neither will &lt;a href="https://www.crossref.org/services/content-registration/">content registration&lt;/a>&amp;ndash;we are not asking for additional information in your deposits.&lt;/p>
&lt;p>&lt;strong>It’s not perfect, but it’s very worthwhile&lt;/strong>&lt;/p>
&lt;p>This recommendation has some limitations worth noting but it must be said that there is no perfect solution.&lt;/p>
&lt;p>DOI links appear in lots of places - PDFs for one notable example. We reviewed and tested the recommendation with Bill Kasdorf, Principal, Kasdorf &amp;amp; Associates, LLC, Richard Orme, CEO, DAISY Consortium, and George Kerscher, Chief Innovations Officer, DAISY Consortium-Senior Officer, Global Literacy, Benetech, who graciously provided their time and expertise. EPUBs and websites proved to be easy to update; other formats, notably PDFs, less so. Widespread adoption of accessible DOIs is so important and we don’t want confusion or frustration to get in the way of making progress. We support and welcome efforts to include an ARIA attribute wherever DOI links appear, but we recommend focusing on landing pages, for now.&lt;/p>
&lt;p>Patrick Vale, Crossref Senior Front End Developer, explains that:&lt;/p>
&lt;blockquote>
&lt;p>”DOI links serve a very specific purpose: to provide the persistent link to an item in the scholarly record. And as such, they present an unusual set of requirements when balancing accurately presenting the information they encode - the persistent link - and making that link accessible, and understandable. With these proposed changes, we hope to strike this balance.“&lt;/p>
&lt;/blockquote>
&lt;p>We know it will be a challenge (more on that below) but we think it’s absolutely a worthwhile effort. Indeed, we are undertaking a project to update our own website to meet these recommendations and to review overall accessibility.&lt;/p>
&lt;p>As Bill Kasdorf notes:&lt;/p>
&lt;blockquote>
&lt;p>“Most people have no idea how many people with visual impairments there are. Not only is it unfair to those people not to provide accessible text for links, the authors and publishers of the linked resource are missing a lot of readers. This update is a great move by Crossref, and every bit aligned with its mission to make scholarly content discoverable and consumable.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;strong>We propose the following timeline, also for your feedback&lt;/strong>&lt;/p>
&lt;p>Once finalized, following community feedback, the updated guidelines will be issued as a recommendation for a suggested period of two years starting next year, 2023. Beginning in 2025, the changes will be required for landing pages of newly registered content (and strongly recommended for existing registered content). Feedback on this approach and timeline is also encouraged.&lt;/p>
&lt;h2 id="help-us-help-you">Help us help you&lt;/h2>
&lt;p>We are conscious that adding descriptive information to DOI links places a significant responsibility on the members and Service Providers creating and hosting these links. Therefore, we are also considering the creation of a tool to help with implementation. Initial discussions suggest this could be a JavaScript helper tool, which could be included on member websites. We also welcome feedback as to how such a tool might be implemented, and how it would best integrate with existing sites and workflows.&lt;/p>
&lt;h2 id="call-for-comments---by-1st-november">Call for comments - by 1st November&lt;/h2>
&lt;p>We hope that this proposal is a welcome one and that the timing is good for moving forward together toward greater accessibility of the scholarly record.
&lt;strong>We welcome questions, feedback and suggestions through 1st November via the &lt;a href="https://forms.gle/7diHy46Cu5J52q417" target="_blank">survey&lt;/a> below or by email to &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>&lt;/strong>&lt;/p>
&lt;iframe src="https://docs.google.com/forms/d/e/1FAIpQLScqLWIycofCUbGXxZRcOjkDM43zsIsfLdO2ZqhVVHiwDQUSeQ/viewform" width="760" height="500" frameborder="0" marginheight="0" marginwidth="0" >Loading...&lt;/iframe>
&lt;h2 id="small-changes-big-impact">Small changes, big impact&lt;/h2>
&lt;p>We’re excited to make changes that improve accessibility and we look forward to the community’s response to our proposal. We will share aggregated feedback in an updated post later this year.&lt;/p>
&lt;h2 id="a-note-on-language">A note on language&lt;/h2>
&lt;p>Multiple sources were consulted to find the most appropriate and inclusive term(s) for users of screen readers in this context. “Print disabled,” for example, seemed to be a good candidate but was ultimately deemed likely to be confusing to a very global publishing audience, who often don’t physically print anything. Sources differ slightly, for example between the US and UK and of course, this English text may well be translated into other languages. Feedback on the terms used here is also very welcome.&lt;/p>
&lt;h2 id="additional-resources">Additional resources&lt;/h2>
&lt;ul>
&lt;li>&lt;a href="https://inclusivepublishing.org/about-the-inclusive-publishing-hub/" target="_blank">The Inclusive Publishing Hub&lt;/a> (DAISY Consortium)&lt;/li>
&lt;li>&lt;a href="https://ncdj.org/style-guide/" target="_blank">National Center on Disability and Journalism&lt;/a> (Arizona State University, US)&lt;/li>
&lt;li>&lt;a href="https://www.gov.uk/government/publications/inclusive-communication/inclusive-language-words-to-use-and-avoid-when-writing-about-disability" target="_blank">Inclusive Language guidance&lt;/a> (UK government)&lt;/li>
&lt;li>&lt;a href="https://apastyle.apa.org/style-grammar-guidelines/bias-free-language/disability" target="_blank">The American Psychological Association (APA) Bias-Free Language Disability Guide&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://hcommons.org/groups/open-access-books-network/forum/topic/accessibility-of-oa-books/?view=all#post-57431" target="_blank">The Open Access Books Network&lt;/a> (OABN)&lt;/li>
&lt;/ul></description></item><item><title>Martin Paul Eve is joining our R&amp;D group as a Principal Developer</title><link>https://www.crossref.org/blog/martin-paul-eve-is-joining-our-rd-group-as-a-principal-developer/</link><pubDate>Fri, 26 Aug 2022 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/martin-paul-eve-is-joining-our-rd-group-as-a-principal-developer/</guid><description>&lt;p>I&amp;rsquo;m delighted to say that &lt;a href="https://en.wikipedia.org/wiki/Martin_Paul_Eve" target="_blank">Martin Paul Eve&lt;/a> will be joining Crossref as a Principal R&amp;amp;D Developer starting in January 2023.&lt;/p>
&lt;p>As a Professor of Literature, Technology, and Publishing at Birkbeck, University of London- Martin has always worked on issues relating to metadata and scholarly infrastructure. In joining the Crossref R&amp;amp;D group, Martin can focus full-time on helping us design and build a new generation of services and tools to help the research community navigate and make sense of the scholarly record.&lt;/p>
&lt;p>&lt;a href="https://eve.gd/2022/08/26/moving-on-my-infrastructural-turn/" target="_blank">Martin himself explains the logic of this move on his own blog&lt;/a>, so I won&amp;rsquo;t attempt to do the same here other than to say:&lt;/p>
&lt;blockquote>
&lt;p>praxis makes perfect.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;em>(mic drop)&lt;/em>&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/labs/bookwheel.png"
alt="Because it is a law that all blog posts having to do with anything related to the digital humanties are required to include a picture of a bookwheel, we present an image generated by DALL·E with the folloiwng prompt: &amp;#39;A bookwheel in the style of the 16th-century illustration by Agostino Ramelli and where the books are replaced by open laptops&amp;#39;" width="500" height="500">&lt;figcaption>
&lt;p>&lt;a href="https://labs.openai.com/s/pAPWg9vK7kIn763OLho9QvZ1" target="_blank">Created with DALL·E, an AI system by OpenAI&lt;/a> with the the prompt: &amp;lsquo;A bookwheel in the style of the 16th-century illustration by Agostino Ramelli and where the books are replaced by open laptops&amp;rsquo;&lt;/p>
&lt;/figcaption>
&lt;/figure></description></item><item><title>Flies in your metadata (ointment)</title><link>https://www.crossref.org/blog/flies-in-your-metadata-ointment/</link><pubDate>Mon, 25 Jul 2022 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/flies-in-your-metadata-ointment/</guid><description>&lt;p>Quality metadata is foundational to the &lt;a href="https://doi.org/10.64000/35qx3-8z834" target="_blank">research nexus&lt;/a> and all Crossref services. When inaccuracies creep in, these create problems that get compounded down the line. No wonder that reports of metadata errors from authors, members, and other metadata users are some of the most common messages we receive into the technical support team (we &lt;a href="https://metadata2020.org/resources/metadata-practices/" target="_blank">encourage&lt;/a> you to continue to report these metadata errors).&lt;/p>
&lt;p>We make members’ metadata openly available via our APIs, which means people and machines can incorporate it into their research tools and services - thus, we all want it to be accurate. Manuscript tracking services, search services, bibliographic management software, library systems, author profiling tools, specialist subject databases, scholarly sharing networks - all of these (&lt;a href="https://metadata2020.org/learn-more/stakeholders/" target="_blank">and more&lt;/a>) incorporate scholarly metadata into their software and services. They use our APIs to help them get the most complete, up-to-date set of metadata from all of our publisher members. And of course, members themselves are able to use our free APIs too (and often do; our members account for the vast majority of overall metadata usage).&lt;/p>
&lt;figure>&lt;img src='https://www.crossref.org/images/documentation/metadata-users-uses.png' alt='Metadata users and uses: metadata from Crossref APIs is used for a variety of purposes by many tools and services' title='' width='75%'>&lt;/figure>
&lt;p>We know many organisations use Crossref metadata. We highlighted several different examples in our &lt;a href="https://www.crossref.org/categories/api-case-study/" target="_blank">API case study blog series&lt;/a> and &lt;a href="https://www.crossref.org/services/metadata-retrieval/user-stories/" target="_blank">user stories&lt;/a>. Now, consider how errors could be (and often are) amplified throughout the whole research ecosystem.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/research--nexus-2021.png"
alt="visualizing the Research Nexus vision" width="75%">
&lt;/figure>
&lt;p>While many inaccuracies in the metadata have clear consequences (e.g., if an author’s name is misspelled or their ORCID iD is registered with a typo, the ability to credit the author with their work can be compromised), there are others, &lt;a href="http://api.crossref.org/works?facet=published:*" target="_blank">like this example of typos in the publication date&lt;/a>, that may seem subtle, but also have repercussions. When we receive reports of metadata quality inaccuracies, we review the claims and work to connect metadata users with our members to investigate and then correct those inaccuracies.&lt;/p>
&lt;p>Thus, while Crossref does not update, edit, or correct publisher-provided metadata &lt;a href="https://doi.org/10.64000/1d2x1-ch923" target="_blank">directly&lt;/a>, we do work to enrich and improve the scholarly record, a goal we’re always striving for. Let’s look at a few common examples and how to avoid them.&lt;/p>
&lt;h2 id="pagination-faux-pas">Pagination faux pas&lt;/h2>
&lt;h3 id="first-page-marked-as-1">First page marked as 1&lt;/h3>
&lt;h4 id="in-the-xml-registered">In the XML registered&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;pages&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;first_page&amp;gt;&lt;/span>1&lt;span class="nt">&amp;lt;/first_page&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;last_page&amp;gt;&lt;/span>1&lt;span class="nt">&amp;lt;/last_page&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/pages&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="related-rest-api-query">Related REST API query&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/works?filter=type:journal-article&amp;amp;select=DOI,title,issue,page&amp;amp;sample=100" target="_blank">https://api.crossref.org/works?filter=type:journal-article&amp;select=DOI,title,issue,page&amp;sample=100&lt;/a>&lt;/p>
&lt;h4 id="more-on-the-problem">More on the problem&lt;/h4>
&lt;p>Very little content begins and ends on page 1. Especially journal articles. But, many members may not know what the page range of the content will be when they register the content with us (perhaps the content in question is an ahead-of-print journal article and the member intends to update this page range later). The issue here is that page range is an important piece of the metadata that we use for citation matching. If the pagination registered with us is incorrect, and it differs from the pagination stated in the citation, our matching process is challenged. Thus, we might fail to establish a citation link between the two works. The page range beginning with page 1 is the most common pagination error that the technical support team sees.&lt;/p>
&lt;blockquote>
&lt;p>More metadata does not mean better metadata.&lt;/p>
&lt;/blockquote>
&lt;h3 id="other-pagination-errors">Other pagination errors&lt;/h3>
&lt;h4 id="in-the-xml-registered-1">In the XML registered&lt;/h4>
&lt;pre tabindex="0">&lt;code class="language-XMLSchema" data-lang="XMLSchema">&amp;lt;item_number item_number_type=&amp;#34;article-number&amp;#34;&amp;gt;1&amp;lt;/item_number&amp;gt;
&lt;/code>&lt;/pre>&lt;h4 id="more-on-the-problem-1">More on the problem&lt;/h4>
&lt;p>Like first pages beginning with 1, few internal article numbers are 1. We see a disproportionate number of article number 1s in the metadata. Again, this can prevent citation matching. Mistakes happen in all aspects of life, including metadata entry. That said, if you, as a member, don’t use internal article numbers or other metadata elements that can be registered, a recommendation we’d make is: &lt;strong>if you don’t know what the metadata element is, omit it&lt;/strong>. More metadata does not mean better metadata. If you’d like to know more about what the elements are, bookmark our &lt;a href="https://data.crossref.org/reports/help/schema_doc/5.3.1/index.html" target="_blank">schema documentation in Oxygen&lt;/a> or review our &lt;a href="https://www.crossref.org/xml-samples/" target="_blank">sample XML files&lt;/a>.&lt;/p>
&lt;h4 id="in-the-xml-registered-2">In the XML registered&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;pages&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;first_page&amp;gt;&lt;/span>121-123&lt;span class="nt">&amp;lt;/first_page&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;last_page&amp;gt;&lt;/span>129&lt;span class="nt">&amp;lt;/last_page&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/pages&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="more-on-the-problem-2">More on the problem&lt;/h4>
&lt;p>This content either begins on page 121, 122, or 123. It cannot start on all three pages. Ironically, registering a first page of 121-123 ensures that we will not match the article if it is included in a citation for another DOI with a first page of 121, 122, or 123.&lt;/p>
&lt;h2 id="author-naming-lapses">Author naming lapses&lt;/h2>
&lt;p>Examples: Titles (Dr., Prof. etc.) in the given_name field; Suffixes (Jr., III, etc.) in the surname field; superscript number, asterisk, or dagger after author names (usually carried over from website formatting that references affiliations); full name in surname field&lt;/p>
&lt;h4 id="in-the-xml-registered-3">In the XML registered&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;first&amp;#34;&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;given_name&amp;gt;&lt;/span>DOCTOR KATHRYN&lt;span class="nt">&amp;lt;/given_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>RAILLY&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;additional&amp;#34;&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;given_name&amp;gt;&lt;/span>DOCTOR JOSIAH S.&lt;span class="nt">&amp;lt;/given_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>CARBERRY&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;first&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>Mahmoud Rizk&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;additional&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>Asta L Andersen(&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="related-rest-api-queries">Related REST API queries&lt;/h4>
&lt;ol>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=professor" target="_blank">https://api.crossref.org/works?query.author=professor&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=doctor" target="_blank">https://api.crossref.org/works?query.author=doctor&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=ingeniero" target="_blank">https://api.crossref.org/works?query.author=ingeniero&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=junior" target="_blank">https://api.crossref.org/works?query.author=junior&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=III" target="_blank">https://api.crossref.org/works?query.author=III&lt;/a>&lt;/li>
&lt;/ol>
&lt;h4 id="more-on-the-problem-3">More on the problem&lt;/h4>
&lt;p>Neither Josiah nor Kathryn’s official given name includes ‘doctor,’ thus it should be omitted from the metadata. Including ‘doctor’ in the metadata and/or capping the authors’ names in the metadata does not result in additional accreditation or convey status. Instead, the result is to muddle the metadata record. As with page numbers in the metadata, &lt;strong>accurate author names are crucial for citation matching&lt;/strong>.&lt;/p>
&lt;h2 id="organisations-as-authors-slip-ups">organisations as authors slip-ups&lt;/h2>
&lt;p>Examples: The contributor role for person names is for persons, not organisational contributors, but we see this violated from time to time. Unfortunately, no persons are being credited with contributing to content that have these errors present in the metadata record.&lt;/p>
&lt;h4 id="in-the-xml-registered-4">In the XML registered&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;first&amp;#34;&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>Society&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;first&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;given_name&amp;gt;&lt;/span>University of Melbourne&lt;span class="nt">&amp;lt;/given_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>University of Melbourne&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/contributors&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="related-rest-api-queries-1">Related REST API queries&lt;/h4>
&lt;ol>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=society" target="_blank">https://api.crossref.org/works?query.author=society&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=university" target="_blank">https://api.crossref.org/works?query.author=university&lt;/a>&lt;/li>
&lt;/ol>
&lt;h4 id="more-on-the-problem-4">More on the problem&lt;/h4>
&lt;p>We love seeing inclusion of organisational contributors in the metadata, when that metadata is correct. Unfortunately, we do see mistakes where organisations are entered as people and people are inadvertently omitted from the metadata record (sometimes omission of people in the contributor list is intentional, but other times it is a mistake). In the XML above, the organisation was entered as an organisational contributor - the organisation itself is being credited with the work. This is sometimes confused with an author affiliation or even a &lt;a href="https://www.crossref.org/community/ror/" target="_blank">ROR ID&lt;/a>. Our &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/affiliations/" target="_blank">schema library&lt;/a> and &lt;a href="https://www.crossref.org/xml-samples/" target="_blank">XML samples&lt;/a> are a great place to start, if you’re interested in learning more about organisational contributors versus author affiliations.&lt;/p>
&lt;h2 id="null-no-nos">Null no-nos&lt;/h2>
&lt;p>Examples: Too many times we see &amp;ldquo;N/A&amp;rdquo;, “null”, &amp;ldquo;none&amp;rdquo; in various fields
(pages, authors, volume/issue numbers, titles, etc.). &lt;strong>If you don’t have or know the metadata, it’s better to omit it&lt;/strong> for optional metadata elements than to include inaccuracies in the metadata record.&lt;/p>
&lt;h4 id="in-the-xml-registered-5">In the XML registered&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;journal_volume&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;volume&amp;gt;&lt;/span>null&lt;span class="nt">&amp;lt;/volume&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;pages&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;first_page&amp;gt;&lt;/span>null&lt;span class="nt">&amp;lt;/first_page&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;last_page&amp;gt;&lt;/span>null&lt;span class="nt">&amp;lt;/last_page&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/pages&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;first&amp;#34;&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;given_name&amp;gt;&lt;/span>Not Available&lt;span class="nt">&amp;lt;/given_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>Not Available&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;person_name&lt;/span> &lt;span class="na">sequence=&lt;/span>&lt;span class="s">&amp;#34;additional&amp;#34;&lt;/span> &lt;span class="na">contributor_role=&lt;/span>&lt;span class="s">&amp;#34;author&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;given_name&amp;gt;&lt;/span>Not Available&lt;span class="nt">&amp;lt;/given_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;surname&amp;gt;&lt;/span>Not Available&lt;span class="nt">&amp;lt;/surname&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/person_name&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="related-rest-api-queries-2">Related REST API queries&lt;/h4>
&lt;ol>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=null" target="_blank">https://api.crossref.org/works?query.author=null&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=none" target="_blank">https://api.crossref.org/works?query.author=none&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works?query.author=Not%20Available" target="_blank">https://api.crossref.org/works?query.author=Not%20Available&lt;/a>&lt;/li>
&lt;/ol>
&lt;h4 id="more-on-the-problem-5">More on the problem&lt;/h4>
&lt;p>Nulls and Not Availables, like many of the examples in this blog, are not simply agnostic when included in the metadata record. &lt;strong>Including nulls in your metadata limits our ability to match references and establish connections&lt;/strong> between research works. These works do not expand and enrich the research nexus; quite the opposite. The incorrect metadata limits our ability to establish relationships between works.&lt;/p>
&lt;h2 id="where-to-go-from-here">Where to go from here?&lt;/h2>
&lt;p>One thing we’ve said throughout this blog that we’ll reiterate here is: accurate metadata is important. It’s important in itself, and the metadata registered with us is heavily used by many systems and services, so think Crossref and beyond. In addition to that expanding perspective, there are practical steps members and metadata users can take to help us:&lt;/p>
&lt;p>As a member registering metadata with us:&lt;/p>
&lt;ol>
&lt;li>make sure we have a current metadata quality contact for your account and update us if there’s a change&lt;/li>
&lt;li>if you receive an email request from us to investigate a potential metadata error, help us&lt;/li>
&lt;li>if you do not know what to enter into a metadata element or helper tool field, please leave it blank; perhaps some of the examples of errors within this blog were placeholders that the responsible members intended to come back to - to correct in time; that’s also a practice to avoid&lt;/li>
&lt;li>if you find a record in need of an update, &lt;a href="https://www.crossref.org/documentation/register-maintain-records/maintaining-your-metadata/updating-your-metadata/" target="_blank">update it&lt;/a> - updates to existing records are always free (we do this to encourage updates and the resulting accurate, rich metadata, so take advantage of it).&lt;/li>
&lt;/ol>
&lt;p>As a metadata user:&lt;/p>
&lt;ol>
&lt;li>if you spot a metadata record that doesn’t seem right, let us know with an email to &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> and/or report it to the member responsible for maintaining the metadata record (if you have a good contact there)&lt;/li>
&lt;li>if you’re eager to confirm the last update of a metadata record, our REST API is a great resource; here’s a handy query to use as a starting point: this one returns records on our Crossref prefix 10.5555 that have been updated in 2022: &lt;a href="https://api.crossref.org/prefixes/10.5555/works?rows=500&amp;amp;filter=from-update-date:2022-01-01,until-pub-date:2022-12-31&amp;amp;mailto=support@crossref.org" target="_blank">https://api.crossref.org/prefixes/10.5555/works?rows=500&amp;filter=from-update-date:2022-01-01,until-pub-date:2022-12-31&amp;mailto=support@crossref.org&lt;/a>&lt;/li>
&lt;/ol>
&lt;p>Making connections between research objects is critical, and inaccurate metadata complicates that process. We’re continually working to better understand this, too. That’s why we’re currently researching &lt;a href="https://doi.org/10.64000/h3w86-2z708" target="_blank">the reach and effects of metadata&lt;/a>. Our technical support team is always eager to assist in correcting errors. We’re also keen on avoiding those mistakes altogether, so if you are uncertain about a metadata element or have questions about anything included in this blog post, please do contact us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>. Or, better yet, post your question in the &lt;a href="https://community.crossref.org/c/tech-support/8" target="_blank">community forum&lt;/a> so all members and users can benefit from the exchange. If you have a question, chances are others do as well.&lt;/p></description></item><item><title>How I think about ROR as infrastructure</title><link>https://www.crossref.org/blog/how-i-think-about-ror-as-infrastructure/</link><pubDate>Fri, 08 Jul 2022 00:00:00 +0000</pubDate><author>Amanda French</author><guid>https://www.crossref.org/blog/how-i-think-about-ror-as-infrastructure/</guid><description>&lt;p>The other day I was out and about and got into a conversation with someone who asked me about my doctoral work in English literature. I&amp;rsquo;ve had the same conversation many times: I tell someone (only if they ask!) that &lt;a href="http://dx.doi.org/10.17613/M66K5R" target="_blank">my dissertation&lt;/a> was a history of the villanelle, and then they cheerfully admit that they don&amp;rsquo;t know what a &lt;a href="https://en.wikipedia.org/wiki/Villanelle" target="_blank">villanelle&lt;/a> is, and then I ask them if they&amp;rsquo;re familiar with Dylan Thomas&amp;rsquo;s poem &lt;a href="https://poets.org/poem/do-not-go-gentle-good-night" target="_blank">&amp;ldquo;Do not go gentle into that good night.&amp;rdquo;&lt;/a> So far, everyone has heard of it &amp;ndash; it&amp;rsquo;s a very well-known poem indeed. I then explain that &amp;ldquo;Do not go gentle into that good night&amp;rdquo; is a villanelle, and that a villanelle is a poetic form something like a sonnet. So far, everyone also knows what a sonnet is, which is why I use that as a comparison, even though a villanelle isn&amp;rsquo;t all that much like a sonnet, in my opinion. They&amp;rsquo;re both poetic forms, however, with a particular standard number of lines and a particular standard rhyme scheme, so in that sense they certainly are alike.&lt;/p>
&lt;p>Oddly enough, I think my early background in the study of poetic form is very much of a piece with my new role here at Crossref as &lt;a href="https://ror.org/blog/2022-06-13-welcome-amanda-french/" target="_blank">Technical Community Manager for ROR, the Research Organisation Registry&lt;/a>. Both poetic form and metadata are invisible to most people, but both are valuable infrastructure. Both poetic form and metadata involve generally-accepted practices and standards that differ between different groups of people and change over time. Both writing formal poetry and creating rich metadata can seem burdensome and rigid to some people, but to my mind, both are generative. A solid underlying foundation allows for all kinds of creativity to flourish on the surface.&lt;/p>
&lt;p>That might be part of why as soon as I heard about ROR I understood its tremendous potential. As someone who&amp;rsquo;s worked in digital humanities and scholarly communication for over fifteen years, I&amp;rsquo;ve long appreciated the value of clean, standard, comprehensive metadata in general. For instance, I explained the origin and value of the &lt;a href="https://www.dublincore.org/" target="_blank">Dublin Core metadata standard&lt;/a> to many a history scholar in the &lt;a href="https://omeka.org" target="_blank">Omeka&lt;/a> workshops I often taught at &lt;a href="https://thatcamp.org" target="_blank">THATCamp&lt;/a>. Later, while overseeing the &lt;a href="https://vtechworks.lib.vt.edu" target="_blank">institutional repository at Virginia Tech University Libraries&lt;/a>, I learned even more about both the importance and the difficulty of creating, acquiring, and providing good metadata. When the pandemic began in 2020, I &lt;a href="https://covidtracking.com/analysis-updates/why-its-hard-to-count-recovered" target="_blank">learned more than I ever wanted to know about messy data&lt;/a> as Community Lead for &lt;a href="https://covidtracking.com" target="_blank">The COVID Tracking Project at &lt;em>The Atlantic&lt;/em>.&lt;/a>&lt;/p>
&lt;p>Data and metadata are, let&amp;rsquo;s admit it, very hard to keep clean and consistent as they travel through multiple systems, and that&amp;rsquo;s why it&amp;rsquo;s important to regularize as much as we can through automatic means such as APIs that use agreed-upon standards. Scholarship is a network of networks, and common identifiers like DOIs and ORCIDs enable the interchange of information in those networks about scholarly outputs and scholars, and thus they enable scholarship itself. What could be more important than that?&lt;/p>
&lt;p>But the organisations that employ, fund, and publish scholarly researchers have had a hard time keeping track of everything &amp;ldquo;their&amp;rdquo; researchers have given to the world. That&amp;rsquo;s the problem that ROR, &lt;strong>&lt;a href="https://ror.org" target="_blank">&amp;ldquo;a community-led registry of open, sustainable, usable, and unique identifiers for every research organisation in the world,&amp;rdquo;&lt;/a>&lt;/strong> can help solve. In an ideal world, universities might use ROR IDs to track the research their faculty have produced, certainly, but they might also discover which universities their faculty&amp;rsquo;s co-authors most often come from. Funders might use ROR IDs to identify the research outputs that have benefited from their funds, certainly, but they might also analyze whether they are funding enough researchers from institutions in rural areas. Publishers might use ROR IDs to offer affiliation searching in their own public interfaces, certainly, but they might also create internal reports on compliance with institution-level transformative Open Access agreements. Once something like ROR is widely adopted, the &lt;a href="https://www.crossref.org/documentation/research-nexus">vision of the Research Nexus&lt;/a> becomes closer to reality: &amp;ldquo;A rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.&amp;rdquo; ROR is all about the &amp;ldquo;organisations&amp;rdquo; part of that alluring vision.&lt;/p>
&lt;p>If you&amp;rsquo;re curious about ROR and want to learn more (hey, that rhymes!), you might want to watch the highly informative presentation from September 2021 &lt;a href="https://www.youtube.com/watch?v=D9Mtqb64OEk" target="_blank">&amp;ldquo;Working with ROR as a Crossref Member&amp;rdquo;&lt;/a>, in which you&amp;rsquo;ll learn several interesting things, including the following:&lt;/p>
&lt;ul>
&lt;li>ROR itself is not an organisation, but an initiative supported jointly by Crossref, DataCite, and the California Digital Library;&lt;/li>
&lt;li>Crossref members cited institutional affiliation identifiers as one of their top priorities in 2019, second only to abstracts;&lt;/li>
&lt;li>The specifics of how one recent ROR integrator, the open access journal publisher &lt;a href="https://hindawi.com" target="_blank">Hindawi&lt;/a>, used the ROR API to create a typeahead widget in its manuscript submission system that replaces user-supplied free text with a standard institution name and a ROR ID behind the scenes, helping them to generate useful internal reports about institutional payments; and&lt;/li>
&lt;li>Crossref supports the submission of ROR IDs in its XML content registration process and makes ROR IDs &lt;a href="https://doi.org/10.64000/1nkjy-15275" target="_blank">available in its API&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>I&amp;rsquo;m also enthusiastically inviting you to &lt;a href="mailto:afrench@crossref.org">get in touch with me&lt;/a> if you&amp;rsquo;d like to learn more about ROR or if you&amp;rsquo;d like to tell me about your previous experience with ROR. And if you don&amp;rsquo;t get in touch with me, please be aware that I might well reach out to you – I&amp;rsquo;m eager to hear what you hope for from ROR, but also what you&amp;rsquo;re skeptical about. For, after all, &lt;a href="https://www.poetryfoundation.org/poems/43333/the-waking-56d2220f25315" target="_blank">I learn by going where I have to go&lt;/a> – don&amp;rsquo;t we all?&lt;/p></description></item><item><title>Seeing your place in the Research Nexus</title><link>https://www.crossref.org/blog/seeing-your-place-in-the-research-nexus/</link><pubDate>Wed, 22 Jun 2022 00:00:00 +0000</pubDate><author>Kornelia Korzec</author><guid>https://www.crossref.org/blog/seeing-your-place-in-the-research-nexus/</guid><description>&lt;p>Having joined the Crossref team merely a week previously, the mid-year community update on June 14th was a fantastic opportunity to learn about the Research Nexus vision. We explored its building blocks and practical implementation steps within our reach, and within our imagination of the future.&lt;/p>
&lt;p>Read on &lt;a href="https://www.youtube.com/watch?v=2Vrw-E8cCcw" target="_blank">(or watch the recording)&lt;/a> for a whistlestop tour of everything – from what on Earth is Research Nexus, through to how it’s taking shape at Crossref, to how &lt;strong>you are&lt;/strong> involved, and finally – to what concerns the community surrounding the vision and how we’re going to address that.&lt;/p>
&lt;h3 id="summary-of-presentations">Summary of presentations&lt;/h3>
&lt;p>&lt;em>&lt;figure>&lt;a href="https://doi.org/10.13003/emo8xxhz">&lt;img src="https://www.crossref.org/images/blog/2022/midyear-research-nexus-cover-slide.jpeg"
alt="screenshot of the first slide of the presentation" width="80%">&lt;/a>&lt;figcaption>
&lt;p>Click on image above to access the presentation.&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/em>&lt;/p>
&lt;p>The idea is simple in principle: scholarly records ought to be transparent – available to examine and learn from for all. Much of scientific production and communication these days has a heavy digital footprint so the Nexus is nothing but simply connecting the loose strands, right? Yet, as the scholarly record is a reflection of the continuous progress made by multiple actors within the context of scientific structures and processes, bringing the Nexus to life is a little short of simple.&lt;/p>
&lt;p>“&lt;em>What we think of as metadata is expanding, and the notion of ‘record types’ is changing&lt;/em>” – said Ginny Hendricks. A great majority of scholarly ‘objects’, whether they are data sets, research articles, monographs, or others, undergo many processes (including review, publication, licensing, correction, derivation) and influence knowledge and practice over time.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/documentation/research-nexus-2023-final.png"
alt="visualizing the Research Nexus vision" width="80%">
&lt;/figure>
&lt;p>Making that progress visible and discoverable will allow for tracing the development of ideas and changes in our thinking over time. Transparency of the complete scholarly records will help to understand the impact of science funding and changing policies. It can support a more robust and comprehensive assessment of research, and contribute to improving integrity within as well as public trust in sciences.&lt;/p>
&lt;p>The Research Nexus concept was first introduced by Jennifer Lin in 2017 as “&lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">Better research through better metadata&lt;/a>”. Important adaptations to the model were needed to break it out of the content-specific schema. Ginny also pointed out that the concept is shared among the scholarly infrastructure community, citing a report from 2015 by OCLC Research on &lt;a href="https://doi.org/10.25333/C3J63N" target="_blank">conscious coordination for stewardship of the evolving scholarly record&lt;/a>.&lt;/p>
&lt;p>Patricia Feeney has given us reasons for optimism in building a robust Nexus. She’s shown areas of greatest growth in metadata reported to Crossref and shared &lt;a href="https://trello.com/b/JaB7xxgw/crossref-metadata" target="_blank">a public roadmap&lt;/a> of types of information we’re asked to enable in the future. We’re seeing a true boom of datasets and peer review reports registrations, and the relationship metadata for our records is improving too. At &lt;a href="https://doi.org/10.64000/b7a98-vbz07" target="_blank">the dawn of defaulting to open references&lt;/a>, 44% of records we hold have associated references and that is growing. Provision of the newly enabled affiliation information (ROR IDs) is on the rise, as is the funder information. Some conversations and questions followed highlighting the need for further guidance in these areas.&lt;/p>
&lt;p>To make a case for enriching metadata records, Martyn Rittman demonstrated examples of traceability of research influence on realities outside academia. He captured recent examples of data citations and other references present not just between scholarly papers, but also in policy documents and popular media. These allow for greater discoverability of literature – but also show the public influence and impact of the research and the work’s context in our wider society.&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/slide-policy-docs.png"
alt="expanding what the Research Nexus covers" width="80%">
&lt;/figure>
&lt;br>
&lt;br>
While Martyn shared our blue-skies aspiration to streamline Crossref’s APIs to offer insight to all these relationships with a single service, Joe Wass grounded those ambitions in the reality of technical work underway. His team’s attention is divided between three main areas. They continue to maintain and de-bug our existing infrastructure. They are developing self-service solutions for members. Finally, they are mapping and planning improved infrastructure, evaluating technology against the Research Nexus vision.&lt;/p>
&lt;p>Bringing it back to the source (of metadata), Rachael Lammey offered a very practical guide to key activities enabling Research Nexus that all members can take on now. She highlighted the benefits of collecting and registering data citations, ROR IDs, and grant funding information. She went on to talk about challenges of subject classification (at a journal level) that our research and development efforts are focusing on at the moment.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/research-nexus-do-now.png"
alt="What Crossref members can do to build the Research Nexus" width="80%">
&lt;/figure>
&lt;h3 id="summary-of-discussions">Summary of discussions&lt;/h3>
&lt;p>Publishing has changed dramatically and our members recognise increasing opportunities for transparency of the scholarly record. Breaking the distant vision of Research Nexus down into actionable chunks made it more relatable for call participants. Many reflected on seeing their place in it properly for the first time. Yet, challenges remain and many were brought to the fore in the discussions.&lt;/p>
&lt;p>The reliability and usability of the technology for registering metadata with Crossref needs to improve. We need to do better in supporting multi-language and multi-alphabet information. Not just developing systems anew, but also streamline the way content is registered and annotated, and continue to disambiguate the competing identifiers. Different record types, chiefly books, present specific challenges in this regard. Finally, making all that metadata accessible and usable is key to enabling insights from the rich data we collectively make available.&lt;/p>
&lt;p>Technology is important, but won’t overcome the barriers that exist in the mindsets. Siloed thinking means that publishers may not be sensitive to benefits that improved relationship metadata could have for colleagues working on assessment, even within the same institutions. Greater guidance or best practices for new identifiers, such as ORCID, ROR, grants, would allow more publishers to get on board with the changes. Researchers often don’t help the cause either – many don’t realise the role and benefits of metadata for their work and are reluctant to provide rich information related to it, perceiving it as a bureaucratic burden.&lt;/p>
&lt;p>In a nutshell, I learnt that – while the concept of Research Nexus is pretty complex – we’re all already participating in making it a reality. I’m grateful to the call participants for sharing their challenges and ideas so generously. It means we can work to address those. I’ll be sure to follow-up on requests for support and clearer guidelines about citing data, recording ROR IDs and grants information in the metadata, and we’ll engage our community on complex topics of record updates (corrections, retractions and versions). Be sure to keep in touch with the conversations on the &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>. I’ll see you there!&lt;/p></description></item><item><title>Announcing our new Head of Strategic Initiatives: Dominika Tkaczyk</title><link>https://www.crossref.org/blog/announcing-our-new-head-of-strategic-initiatives-dominika-tkaczyk/</link><pubDate>Fri, 10 Jun 2022 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/announcing-our-new-head-of-strategic-initiatives-dominika-tkaczyk/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>A year ago, we announced that we were putting the &amp;ldquo;R&amp;rdquo; back in R&amp;amp;D. That was when Rachael Lammey joined the R&amp;amp;D team as the Head of Strategic Initiatives.&lt;/p>
&lt;p>And now, with &lt;a href="https://doi.org/10.64000/jcwr7-q5y75" target="_blank">Rachael assuming the role of Product Director&lt;/a>, I&amp;rsquo;m delighted to announce that &lt;a href="https://www.crossref.org/people/dominika-tkaczyk/" target="_blank">Dominika Tkaczyk&lt;/a> has agreed to take over Rachael&amp;rsquo;s role as the Head of Strategic Initiatives. Of course, you might already know her.&lt;/p>
&lt;p>We will also immediately start recruiting for a new Principal R&amp;amp;D Developer to work with Esha and Dominika on the R&amp;amp;D team.&lt;/p>
&lt;h2 id="what-does-this-mean-for-rd">What does this mean for R&amp;amp;D?&lt;/h2>
&lt;p>Before I talk about what Dominika&amp;rsquo;s move means in practice, I just want to take a moment to thank Rachael for the time she spent working with us. Over the past year, she has injected a massive amount of energy into the group and rebuilt the team&amp;rsquo;s momentum. This is exactly what we asked her to do.&lt;/p>
&lt;p>Rachael&amp;rsquo;s first task was to repatriate her two R&amp;amp;D colleagues, who we had loaned to work on other urgent projects. Dominika was the technical lead on the port and relaunching of the REST API. Esha was the technical lead for the ROR initiative. In addition, Rachael has been working with Esha, Dominika, Paul Davis, and me on several shorter-term strategic projects that are shaping our overall development strategy.&lt;/p>
&lt;ul>
&lt;li>Exploring and implementing a new approach to building content registration front ends. This approach is schema-driven and bakes in localization and accessibility support from the start. The new approach is currently the basis for the grant registration tool that our Product &amp;amp; Tech teams are now testing with our new funder members.&lt;/li>
&lt;li>Exploring and ultimately rejecting a &amp;ldquo;pull-based&amp;rdquo; approach to registering metadata, where Crossref would harvest structured metadata from member landing pages instead of asking members to deposit it with us via XML. You are not really doing R&amp;amp;D unless some of your ideas fail. In this case, we quickly discovered that the logistics of crawling our members’ websites, combined with the sparsity of structured metadata in landing pages, made a pull-based approach fragile and impractical.&lt;/li>
&lt;li>Exploring the use of ML techniques to fill gaps in the journal classification data that is currently in the REST API. Gaining new data science badges in the process.&lt;/li>
&lt;li>Exploring alternative approaches to building community-extendable reporting tools using standard data science tooling and techniques.&lt;/li>
&lt;li>Exploring how we can help reduce support toil by using data science tools like notebooks to create new support tools and self-serve UIs for information frequently requested by members that can otherwise prove difficult to get using our existing tools.&lt;/li>
&lt;li>Looking at extending the matching technology previously developed by labs to try and &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">better match funder grant-information research outputs&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>And this is just a sample of projects Rachael helped promote and prioritize. It is the nature of many of the larger R&amp;amp;D projects that you don&amp;rsquo;t see the immediate results until long after they&amp;rsquo;ve been conceived and put into motion. This means that Rachael has been working on some things over the past year that are not yet public.&lt;/p>
&lt;p>But, with any luck, we may see some significant new developments in how Crossref collects and distributes information about significant updates to the scholarly record- including retractions and withdrawals. We are also likely to see more work to promote data citation amongst our members. And finally, we are likely to see an attempt to create a community-managed and open research classification taxonomy. Of course, as is the case with research projects, there is no guarantee that any of these nascent ideas/projects will make it into a production service. Still, if even one of them does, it will become as vital a part of open scholarly infrastructure as DOIs, ORCIDs, or ROR IDs are now.&lt;/p>
&lt;p>And we will have Rachael and the hard work of the R&amp;amp;D group, important cameos from others, and community input to thank for giving them the initial push to realization. So that&amp;rsquo;s a pretty good track record for just a year in the R&amp;amp;D group.&lt;/p>
&lt;h2 id="passing-the-torch">Passing the torch&lt;/h2>
&lt;p>And this is a track record I&amp;rsquo;m confident that Dominika can match as she takes over Rachael&amp;rsquo;s role.&lt;/p>
&lt;p>Soon after Dominika joined the Crossref R&amp;amp;D team, she started to expand her activities to include more production engineering practice, team leadership, and community outreach. She has also worked extensively with support and outreach- providing them with data science consulting and mentoring in software development. Her new role as the Head of Strategic Initiatives will continue this trend. She will spend less time prototyping software and analyzing data and more time liaising with our members and the broader community to understand their needs and design R&amp;amp;D projects to test approaches to meeting those needs. This means a lot more liaising with other Crossref teams, speaking with our members and the wider community, and participating in working groups and conferences.&lt;/p>
&lt;p>It also probably means a lot less programming and analysis. But programming and building prototypes are critical to the R&amp;amp;D team. And so the first thing we will do is start recruiting for a new Principal R&amp;amp;D Developer to continue working along with Esha on conducting experiments and developing POCs.&lt;/p>
&lt;p>I’m looking forward to the next year. With Rachael taking the role of Product Director and Dominika taking over as the Head of Strategic Initiatives, we are well-positioned to make profound technical and conceptual improvements to Crossref&amp;rsquo;s services while simultaneously working with the community to line up our next strategic priorities.&lt;/p></description></item><item><title>Rethinking staff travel, meetings, and events</title><link>https://www.crossref.org/blog/rethinking-staff-travel-meetings-and-events/</link><pubDate>Tue, 07 Jun 2022 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/rethinking-staff-travel-meetings-and-events/</guid><description>&lt;p>As a distributed, global, and community-led organisation, sharing information and listening to our members both online and in person has always been integral to what we do.&lt;/p>
&lt;p>For many years Crossref has held both in-person and online meetings and events, which involved a fair amount of travel by our staff, board, and community. This changed drastically in March 2020, when we had to stop traveling and stop having in-person meetings and events. Due to the hard work and creativity of our team and the support of our Ambassadors and Sponsors, we were able to &lt;a href="https://doi.org/10.64000/byv2m-9fm07" target="_blank">move to exclusively online meetings and events&lt;/a> and maintain connections with colleagues, members, and much of the scholarly research community.&lt;/p>
&lt;p>Online meetings have benefits compared to in-person ones; they have a much lower carbon footprint, and they can be more inclusive because people don’t have to find the time and money to travel. But there are limitations to online meetings; individual connections made in person do become harder to maintain, and new connections are more difficult to make and grow online. Sometimes just by sitting with someone, meeting their team and drinking their tea, free-flowing conversation leads to real progress.&lt;/p>
&lt;p>But with over 17,000 members in 150 countries, our small staff can’t be everywhere, and we need to consider the personal as well as the environmental impacts.&lt;/p>
&lt;p>When we started work on the 2022 budget last year, our staff and board took the opportunity to think about our approach, with the goal of not going back to ‘normal’. So we asked ourselves, now that we have a better sense of what works and what doesn&amp;rsquo;t, how can we make our travel and in-person meetings have a greater impact on our goals, while also traveling less and reducing our impact on the environment?&lt;/p>
&lt;p>We decided that in the context of our mission and values, we had to take into account three key areas:&lt;/p>
&lt;ol>
&lt;li>The environment and climate change&lt;/li>
&lt;li>Inclusion&lt;/li>
&lt;li>Work/life balance.&lt;/li>
&lt;/ol>
&lt;p>We developed an updated strategy for in-person and online meetings from 2022 onwards along with a set of recommendations and commitments to reduce our carbon footprint. The commitments were approved by the board at its November 2021 meeting.&lt;/p>
&lt;h2 id="our-plan-for-online-and-in-person-meetings">Our plan for online and in-person meetings&lt;/h2>
&lt;p>Online events will generally be aimed at broad groups, in multiple timezones, to inform, update, and test general ideas and assumptions at scale. In contrast, in-person events will be smaller, focusing on deep learning, co-creation, and collaborating through various formats such as workshops, roundtables, or sprints, ideally working toward a specific outcome. These smaller in-person meetings will be scheduled alongside other community events so there will be fewer trips on the whole but each trip more consolidated.&lt;/p>
&lt;p>Each in-person meeting will have stated goals such as recruiting and onboarding a new Sponsor, bringing our Ambassadors together to build relationships and share best practices, or getting experts together in a room to help decide important polices, improve some code, or plan new initiatives. At the moment, we are not planning &amp;lsquo;hybrid&amp;rsquo; events as we don&amp;rsquo;t believe they will help meet our goals.&lt;/p>
&lt;p>While online meetings and webinars provide a &lt;em>breadth&lt;/em> of interactions, in-person meetings can provide greater &lt;em>depth&lt;/em> and opportunities for more meaningful engagement and purposeful discussion, and it is this depth that we have missed over the last two and a half years. Therefore, we are identifying focus countries where we plan on engaging more with local community groups. Each country-level engagement plan includes outreach and communications activities and some in-person meetings.&lt;/p>
&lt;h3 id="factors-and-aims-for-selecting-focus-countries">Factors and aims for selecting focus countries&lt;/h3>
&lt;p>Inclusion is important for us and we are committed to supporting the needs of our community members worldwide. We aim to combine meaningful conversations with informational activities. We want to provide time in the day for technical problem solving and/or a more strategically focused session, both of which have worked well in the past. We hope to learn more about trends in our selected focus countries, including the challenges our members face, local publishing norms, barriers to participation in Crossref, and understand and help to adapt government policies.&lt;/p>
&lt;p>We consider a number of factors when selecting countries with which to focus our activities:&lt;/p>
&lt;ul>
&lt;li>Where we have a relatively large number of members.&lt;/li>
&lt;li>Where we are seeing an increase in new members joining.&lt;/li>
&lt;li>Where we have not undertaken engagement activities in at least 3 years.&lt;/li>
&lt;li>Where we have good contacts to collaborate with, i.e., a national funder, a sponsor or ambassador, a government body, or another organisation aligned with our mission.&lt;/li>
&lt;li>Where we have very few members but where research output is high according to other sources, in order to understand and overcome barriers to participating in Crossref.&lt;/li>
&lt;li>Where we can consolidate multiple engagement activities in one trip, for example run a LIVE (informational) meeting or workshop, develop relationships with a key Sponsor, or discuss national research policy with government representatives.&lt;/li>
&lt;li>Where we can coordinate our engagement efforts alongside other local community events.&lt;/li>
&lt;/ul>
&lt;h2 id="our-environmental-commitments">Our environmental commitments&lt;/h2>
&lt;p>In line with rethinking how we engage with our members and making sure we do so in the most sustainable, inclusive, and impactful way, we are making the following commitments:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Crossref staff will think strategically and consider environmental, inclusion, and work/life balance issues when they plan travel. We will make the most of in-person events by focusing on those that involve interaction, such as listening and learning from our members and users, deepening relationships, co-creating, and forming new alliances&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We will travel less and have fewer face-to-face meetings going forward compared with 2019 as a baseline year. The 2022 travel and events budget was reduced by 40% and set at 60% of the 2019 budget. Travel and in-person events for the first half of 2022 have been limited so we will make this same commitment for 2023 still using 2019 as the baseline.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Crossref will track the carbon footprint of staff travel to meetings and events. We will regularly review the data and find ways to reduce the environmental impact.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Combine stakeholder visits with event trips and vice versa whenever possible (if you do 1 plane trip to a location 1000 miles away instead of 2 trips, you reduce your impact by 0.5t)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>As previously planned before the COVID-19 pandemic, the Crossref LIVE Annual Meetings will remain online only and will be held in different time zones. Having them in different time zones will enable global sharing of updates with a lower environmental impact.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Crossref board meetings will be reduced from three in-person meetings per year to one face-to-face and two online meetings per year.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Fewer staff will attend fewer in-person conferences and will combine them with other travel.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>For Crossref staff meetings, it is important for our distributed staff to meet face-to-face as a whole organisation and as teams. We will plan for one all-staff in person meeting per year (at which there can also be team meetings). Additional team meetings will be based on the reduced travel and meetings budget. Where possible, team meetings will be combined with other meetings (e.g. conferences or other community events).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>While trips that combine meetings may mean longer time away from home, we will still try to avoid staff having to travel or be away on weekends. We will also:&lt;/p>
&lt;ul>
&lt;li>Avoid short-haul flights (under 2.5 to 3 hours) where trains are available.&lt;/li>
&lt;li>Book hotels within walking distance of the event locations (if safe) in order to reduce taxi use.&lt;/li>
&lt;li>Use public transport and trains (if efficient and safe).&lt;/li>
&lt;li>Select hotels that have good sustainability plans in place, seeking out ‘green’ hotels where (if available and within budget).&lt;/li>
&lt;li>Prioritize locations where the fewest number of staff have to travel or travel the shortest distances.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;h3 id="reporting">Reporting&lt;/h3>
&lt;p>From now on, we will:&lt;/p>
&lt;ul>
&lt;li>Track staff travel incl. the number of trips, miles flown, and the carbon impact.&lt;/li>
&lt;li>Estimate the carbon footprint of our two offices, staff home working, our data center, and our cloud infrastructure.&lt;/li>
&lt;li>Track all Crossref-hosted events - in-person and online and review annually (what went well, what can be improved, how to further reduce carbon footprint) as part of the budgeting process.&lt;/li>
&lt;/ul>
&lt;p>Many organisations are now rethinking how to go about travel, conferences, meetings, and work in general. The pandemic may have been the trigger for a big shift in the ways we work and interact, and not all of it was welcome or should continue; however, sometimes it takes a big event to give us the space to sit back, reflect, and change things for the better going forward. As always, we&amp;rsquo;ll evaluate these approaches over time.&lt;/p>
&lt;p>All of this means we may be declining some in-person meetings (and when we do, please don’t take it personally) but we still look forward to engaging with our community in a purposeful way.&lt;/p>
&lt;p>This feels like a good time to give a shout-out to all our &lt;a href="https://www.crossref.org/community/our-ambassadors/">Ambassadors&lt;/a> and &lt;a href="https://www.crossref.org/membership/about-sponsors/">Sponsors&lt;/a> around the world who are very important for insight and engagement, and we will continue to partner with them for both online and in-person meetings.&lt;/p></description></item><item><title>Annual call for board nominations</title><link>https://www.crossref.org/blog/annual-call-for-board-nominations/</link><pubDate>Tue, 31 May 2022 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/annual-call-for-board-nominations/</guid><description>&lt;p>The Crossref Nominating Committee is inviting expressions of interest to join the Board of Directors of Crossref for the term starting in March 2023. The committee will gather responses from those interested and create the slate of candidates that our membership will vote on in an election in September.&lt;/p>
&lt;p>Expressions of interest will be due Friday, June 24th, 2022.&lt;/p>
&lt;h3 id="about-the-our-board-elections">About the our board elections&lt;/h3>
&lt;p>The board is elected through the “one member, one vote” policy wherein every member organisation of Crossref has a single vote to elect representatives to the Crossref board. Board terms are for three years, and this year there are five seats open for election.&lt;/p>
&lt;p>The board maintains a balance of seats, with eight seats for smaller members and eight seats for larger members (based on total revenue to Crossref). This is in an effort to ensure that the diversity of experiences and perspectives of the scholarly community are represented in decisions made at Crossref.&lt;/p>
&lt;p>This year we will elect four of the larger member seats (membership tiers $3,900 and above) and one of the smaller member seats (membership tiers $1,650 and below). You don’t need to specify which seat you are applying for. We will provide that information to the Nominating Committee.&lt;/p>
&lt;p>The election takes place online and voting will open in September. Election results will be shared at the annual meeting in October. New members will commence their term in March 2023.&lt;/p>
&lt;h3 id="about-the-nominating-committee">About the Nominating Committee&lt;/h3>
&lt;p>The Nominating Committee reviews the expressions of interest and selects a slate of candidates for election. The slate put forward will exceed the total number of open seats. The committee considers the statements of interest, organisational size, geography, gender, and experience.&lt;/p>
&lt;p>2022 Nominating Committee:&lt;/p>
&lt;ul>
&lt;li>Abel Packer, SciELO, Brazil, chair*&lt;/li>
&lt;li>Patrick Alexander, Penn State University Press, US&lt;/li>
&lt;li>Nisha Doshi, Cambridge University Press, UK&lt;/li>
&lt;li>Marc Hurlbert, Melanoma Research Alliance , US*&lt;/li>
&lt;li>Kihong Kim, Korean Council of Science Editors, South Korea*&lt;/li>
&lt;/ul>
&lt;p>(*) indicates Crossref board member&lt;/p>
&lt;h3 id="what-does-the-committee-look-for">What does the committee look for&lt;/h3>
&lt;p>The committee looks for skills and experience that will complement the rest of the board. Candidates from countries and regions that are not currently reflected on the board are strongly encouraged to apply. Successful candidates often demonstrate a commitment to or understanding of our &lt;a href="https://www.crossref.org/strategy/" target="_blank">strategic agenda&lt;/a> or the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure&lt;/a>; hold positions within their organisations that may be underrepresented on the board currently; and/or have experience with governance or community involvement. The Nominating Committee will also review the member organisation&amp;rsquo;s &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a>.&lt;/p>
&lt;h3 id="who-can-apply-to-join-the-board">Who can apply to join the board?&lt;/h3>
&lt;p>Any active member of Crossref can apply to join the board. Crossref membership is open to organisations that produce content, such as academic presses, commercial publishers, standards organisations, and research funders.&lt;/p>
&lt;h3 id="board-roles-and-responsibilities">Board roles and responsibilities&lt;/h3>
&lt;p>Crossref’s services provide central infrastructure to scholarly communications. Crossref’s board helps shape the future of our services, and by extension, impacts the broader scholarly ecosystem. We are looking for board members to contribute their experience and perspective.&lt;/p>
&lt;p>The role of the board at Crossref is to provide strategic and financial oversight of the organisation, as well as guidance to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ul>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ul>
&lt;p>The board is representative of our membership base and guides the staff leadership team on trends affecting scholarly communications. The board sets strategic directions for the organisation while also providing oversight into policy changes and implementation. Board members have a fiduciary responsibility to ensure sound operations. Board members do this by attending board meetings, as well as joining more specific board committees.&lt;/p>
&lt;h3 id="what-is-expected-of-board-members">What is expected of board members?&lt;/h3>
&lt;p>Board members attend three meetings each year that typically take place in March, July, and November. Meetings have taken place in a variety of international locations and travel support is provided when needed. Following travel restrictions as a result of COVID-19, the board adopted a plan to convene at least one of the board meetings virtually each year and all committee meetings take place virtually. Most board members sit on at least one Crossref committee. Care is taken to accommodate the wide range of timezones in which our board members live.&lt;/p>
&lt;p>While individuals apply to join the board, the seat that is elected to the board ultimately belongs to the member organisation. The primary board member also names an alternate who may attend meetings in the event that the primary board member is unable to. There is no personal financial obligation to sit on the board. The member organisation must remain in good standing.&lt;/p>
&lt;p>Board members are expected to be comfortable assuming the responsibilities listed above and to prepare and participate in board meeting discussions.&lt;/p>
&lt;h3 id="how-to-apply">How to apply&lt;/h3>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSeh_paZyposW2HNSbwodAtxkdwseELsrJ91bpMfC3w_XfNDbg/viewform" target="_blank">click here to submit your expression of interest&lt;/a>. We ask for a brief statement about how your organisation could enhance the Crossref board and a brief personal statement about your interest and experience with Crossref.&lt;/p>
&lt;p>Please contact me with any questions at &lt;a href="mailto:lofiesh@crossref.org">lofiesh@crossref.org&lt;/a>&lt;/p></description></item><item><title>2022 public data file of more than 134 million metadata records now available</title><link>https://www.crossref.org/blog/2022-public-data-file-of-more-than-134-million-metadata-records-now-available/</link><pubDate>Fri, 13 May 2022 00:00:00 +0000</pubDate><author>Patrick Polischuk</author><guid>https://www.crossref.org/blog/2022-public-data-file-of-more-than-134-million-metadata-records-now-available/</guid><description>&lt;p>In 2020 we released our &lt;a href="https://doi.org/10.64000/wsnyw-yap64" target="_blank">first public data file&lt;/a>, something we’ve turned into &lt;a href="https://doi.org/10.64000/96h9h-b8437" target="_blank">an annual affair&lt;/a> supporting our commitment to the &lt;a href="https://doi.org/10.64000/hzemx-j7n79" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a>. We’ve just posted the 2022 file, which can now be &lt;a href="https://academictorrents.com/details/4dcfdf804775f2d92b7a030305fa0350ebef6f3e" target="_blank">downloaded via torrent&lt;/a> like in years past.&lt;/p>
&lt;p>We aim to publish these in the first quarter of each year, though as you may notice, we’re a little behind our intended schedule. The reason for this delay was that we wanted to &lt;a href="https://community.crossref.org/t/reindexing-a-large-number-of-records-in-the-rest-api/2568" target="_blank">make critical new metadata fields available&lt;/a>, including resource URLs and titles with markup.&lt;/p>
&lt;p>Crossref metadata is always openly available via &lt;a href="https://api.crossref.org/" target="_blank">our API&lt;/a>. We recommend you use this method to incrementally add new and updated records once you’re up and running with an annual public data file. If you’re interested in more frequent and regular “full-file” downloads, consider subscribing to our &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/" target="_blank">Metadata Plus program&lt;/a>. Plus subscribers have access to monthly snapshots in JSON and XML formats.&lt;/p>
&lt;p>Every year our metadata corpus grows. The 2020 file was 65GB and held 112 million records; 2021 came in at 102GB and 120 million records. This year the file weighs in at 160 GB and contains metadata for 134 million records, or all Crossref records registered up to and including April 30, 2022.&lt;/p>
&lt;h3 id="tips-for-using-the-torrent-and-retrieving-incremental-updates">Tips for using the torrent and retrieving incremental updates&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Use &lt;a href="https://academictorrents.com/details/4dcfdf804775f2d92b7a030305fa0350ebef6f3e" target="_blank">the torrent&lt;/a> if you want all of these records. Everyone is welcome to the metadata, but it will be much faster for you and much easier on our APIs to get so many records in one file. Here are some &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-public-data-files-and-plus-snapshots/" target="_blank">tips on how to work with the file&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Use the REST API to incrementally add new and updated records once you’ve got the initial file. Here is &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/tips-for-using-the-crossref-rest-api/" target="_blank">how to get started&lt;/a> (and avoid getting blocked in your enthusiasm to use all this great metadata!).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>‘Limited’ and ‘closed’ &lt;a href="https://www.crossref.org/education/content-registration/descriptive-metadata/references/#00564/" target="_blank">references&lt;/a> are not included in the file or our open APIs. And while bibliographic metadata is generally required, lots of metadata is optional, so that records will vary in quality and completeness.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Questions, comments, and feedback are welcome at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Announcing our new Director of Product: Rachael Lammey</title><link>https://www.crossref.org/blog/announcing-our-new-director-of-product-rachael-lammey/</link><pubDate>Thu, 12 May 2022 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/announcing-our-new-director-of-product-rachael-lammey/</guid><description>&lt;p>Unfortunately, Bryan Vickery has moved onto pastures new. I would like to thank him for his many contributions at Crossref and we all wish him well.&lt;/p>
&lt;p>I’m now pleased to announce that &lt;a href="https://www.crossref.org/people/rachael-lammey">Rachael Lammey&lt;/a> will be Crossref’s new Director of Product starting on Monday, May 16th.&lt;/p>
&lt;p>Rachael’s skills and experience are perfectly suited for this role. She has been at Crossref since 2012 and has deep knowledge and experience of all things Crossref: our mission; our members; our culture; and our services.&lt;/p>
&lt;p>In all her roles at Crossref Rachael has demonstrated how community-focused product development can be done.&lt;/p>
&lt;p>Starting as a Product Manager for Similarity Check and Crossmark, she then led community discussions on text and data mining and taxonomies, introduced our support of preprints, and led the very successful ORCID Auto-update integration. She initiated our important partnership with the Public Knowledge Project including scoping and overseeing the joint plugin development work over the years. She helped to grow the Sponsors program, establish the LIVE informational events, oversaw the founding of our ambassador program, engaged more research funders and institutions, and became a go-to person for data citation expertise in our community.&lt;/p>
&lt;p>In her brief time in &lt;a href="https://doi.org/10.64000/21qxf-gk42" target="_blank">our Research &amp;amp; Development team&lt;/a>, she helped to kick off that group’s reinvigoration and has engaged with numerous new community and technical initiatives. Such relationships—together with her knowledge of our systems and API—have enabled her to be a key driver in the development and adoption of ROR and grants - two of the highest strategic priorities of recent years.&lt;/p>
&lt;p>Rachael says:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;Alignment in planning and focusing on delivering outcomes will be my initial priorities. I&amp;rsquo;m conscious that we have a lot in play and I want to support the product team in their existing and ambitious goals while working with the leadership team and our very diverse community to focus and prioritise our development roadmap. I&amp;rsquo;m really grateful for this opportunity and I am looking forward to working with our members, users, and other open infrastructure organisations in this new capacity&amp;rdquo;.&lt;/p>
&lt;/blockquote>
&lt;p>Our staff and the board are very enthusiastic about Rachael&amp;rsquo;s appointment and we know our community will be too. Please join us in congratulating Rachael!&lt;/p></description></item><item><title>Similarity Check: what’s new with iThenticate v2?</title><link>https://www.crossref.org/blog/similarity-check-whats-new-with-ithenticate-v2/</link><pubDate>Tue, 10 May 2022 00:00:00 +0000</pubDate><author>Fabienne Michaud</author><guid>https://www.crossref.org/blog/similarity-check-whats-new-with-ithenticate-v2/</guid><description>&lt;p>Since we announced last September the &lt;a href="https://doi.org/10.64000/bg7rk-dae91" target="_blank">launch of a new version of iThenticate&lt;/a>, a number of you have upgraded and become familiar with iThenticate v2 and its new and improved features which include:&lt;/p>
&lt;ul>
&lt;li>A faster, more user-friendly and responsive interface&lt;/li>
&lt;li>A preprint exclusion filter, giving users the ability to identify content on preprint servers more easily&lt;/li>
&lt;li>A new “red flag” feature that signals the detection of hidden text such as text/quotation marks in white font, or suspicious character replacement&lt;/li>
&lt;li>A private repository available for browser users, allowing them to compare against their previous submissions to identify duplicate submissions within your organisation&lt;/li>
&lt;li>A content portal, helping users check how much of their own research outputs have been successfully indexed, self-diagnose and fix the content that has failed to be indexed in iThenticate.&lt;/li>
&lt;/ul>
&lt;p>We’ve received some great feedback from iThenticate v2 users and user testers:&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“There are a lot of new and helpful features implemented in version 2 of iThenticate.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Beilstein Institut&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“The updates to the user interface make working with the new version a pleasure. It has a very modern feel and is easy to use, as an app on a phone. We particularly like being able to click on a link and easily exclude a source from view with just a few clicks. The response time and speed of download are also greatly improved which will cut down processing time on our end.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Frontiers&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“I like the ability to be able to exclude content directly from the report.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; American Chemical Society&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>More information for administrators and users is available on the Turnitin website: &lt;a href="https://help.turnitin.com/crossref-similarity-check/v2.htm" target="_blank">iThenticate v2 documentation&lt;/a>.&lt;/p>
&lt;h2 id="upgrading-to-ithenticate-v2">Upgrading to iThenticate v2&lt;/h2>
&lt;p>In September, we started inviting new and existing Similarity Check subscribers using iThenticate in the browser to upgrade to this new version. And now some of the manuscript submission systems have completed their integrations with the new version of iThenticate too, so users of these systems can start to migrate. Morressier users are already using iThenticate v2, and in the next few days, we will be emailing all eJournalPress users. We know the other major manuscript submission systems are also working on their integrations, and we&amp;rsquo;ll be in touch with members using them as soon as they confirm they are ready.&lt;/p>
&lt;h3 id="manuscript-tracking-system-integrations">Manuscript tracking system integrations&lt;/h3>
&lt;p>All Similarity Check subscribers using a manuscript management system will particularly appreciate a closer integration with iThenticate v2 which means that users will be able to view their Similarity Report and investigate sources within their manuscript tracking system.&lt;/p>
&lt;h3 id="ejournalpress">eJournalPress&lt;/h3>
&lt;p>eJournalPress users will also be able to customise their iThenticate v2 settings via a configuration interface and to decide, for example, to include or exclude bibliographies from their Similarity Reports. The new integration will also show the top five matches returned by iThenticate directly in the eJournalPress interface.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/eJournalPress-configuration.png"
alt="eJournalPress configuration settings in iThenticate v2" width="80%">&lt;figcaption>
&lt;p>eJournalPress configuration settings in iThenticate v2&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h3 id="editorial-manager-and-scholarone">Editorial Manager and ScholarOne&lt;/h3>
&lt;p>Aries (Editorial Manager) and Clarivate (ScholarOne) are planning to release their iThenticate v2 integrations later this year and we will be inviting users to upgrade in the coming months.&lt;/p>
&lt;p>Please check &lt;a href="https://community.crossref.org/t/new-version-of-ithenticate-update-for-peer-review-management-system-users/2211" target="_blank">our community forum&lt;/a> for updates on manuscript tracking system integrations.&lt;/p>
&lt;h2 id="more-new-and-improved-features">More new and improved features&lt;/h2>
&lt;h3 id="user-friendly-pdf-report">User-friendly PDF report&lt;/h3>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“The report is clean and easy to read.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; The National Academies of Sciences, Engineering, and Medicine&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“The clickable links will save us a considerable amount of time as they make it easy for the author to understand where the overlap is coming from, meaning we do not need to spend time clarifying overlap reports to the authors. The summary page is also very useful as authors and editors are easily able to see which sections have been included and excluded from the report.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Frontiers&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>The PDF version of the Similarity Report has been completely redesigned and can easily be downloaded, emailed and printed. It contains a summary of the report i.e. word count, character count, number of pages, file size, excluded sections, submission, and report dates as well as the similarity score and &lt;strong>a list of the top sources with clickable links.&lt;/strong>&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/first-page.png"
alt="First page of the Similarity Report in iThenticate v2" width="80%">&lt;figcaption>
&lt;p>First page of the Similarity Report in iThenticate v2&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/clickable-links.png"
alt="Summary and clickable links in the new Similarity Report in iThenticate v2" width="80%">&lt;figcaption>
&lt;p>Summary and clickable links in the new Similarity Report in iThenticate v2&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h3 id="custom-section-exclusion-filter">Custom section exclusion filter&lt;/h3>
&lt;p>In iThenticate v2, &lt;strong>users can now exclude sections that are standard&lt;/strong> such as authors, affiliations, ethics statements, acknowledgments, etc. from the Similarity Report which often impacts similarity scores. You can choose from the templates available and/or create your own custom section exclusions from the admin portal.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/custom-section.png"
alt="Custom section exclusion filter in the iThenticate v2 admin portal" width="80%">&lt;figcaption>
&lt;p>Custom section exclusion filter in the iThenticate v2 admin portal&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/summary-excluded-custom-sections.png"
alt="Summary of excluded custom sections on the iThenticate v2 Similarity Report" width="80%">&lt;figcaption>
&lt;p>Summary of excluded custom sections on the iThenticate v2 Similarity Report&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“The user interface is definitely more responsive than v1, especially when I am looking at the full-text viewing mode, scrolling through the text to compare matches, reading through the box of text in the matching source [&amp;hellip;] I also especially like the options around excluding, I was able to see our submitted work was also taken into the database and showed matches against the papers we’d uploaded already. Going forward, this is a really interesting thing for us, especially if we are looking at duplicated content in the same journal.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Taylor &amp;amp; Francis &lt;/cite>&lt;/p>
&lt;/div>
&lt;h3 id="user-reporting">User reporting&lt;/h3>
&lt;p>Details of user activity including folder names, similarity scores, word count, and file format are now also available in iThenticate v2 and downloadable as Excel and csv. files.&lt;/p>
&lt;h2 id="up-next">Up next&lt;/h2>
&lt;h3 id="product-development">Product development&lt;/h3>
&lt;p>Further enhancements to existing features and interface such as the view full-text mode, user groups, and custom section exclusions are planned for this year. &lt;strong>Paraphrase detection&lt;/strong> and &lt;strong>citation matching&lt;/strong> are currently in development.&lt;/p>
&lt;h3 id="ithenticate-v2-training">iThenticate v2 training&lt;/h3>
&lt;p>&lt;a href="https://help.turnitin.com/crossref-similarity-check/v2.htm" target="_blank">iThenticate v2 documentation&lt;/a> is available from the Turnitin website. Training videos and webinars will be available later on in the year.&lt;/p>
&lt;p>✏️ Do get in touch via &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> if you have any questions about iThenticate v1 or v2 or start a discussion by commenting on this blog post below.&lt;/p></description></item><item><title>Do you want to be a Crossref Ambassador?</title><link>https://www.crossref.org/blog/do-you-want-to-be-a-crossref-ambassador/</link><pubDate>Thu, 14 Apr 2022 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/do-you-want-to-be-a-crossref-ambassador/</guid><description>&lt;h2 id="a-re-cap">A re-cap&lt;/h2>
&lt;p>We kicked off our Ambassador Program in 2018 after consultation with our members, who told us they wanted greater support and representation in their local regions, time zones, and languages.&lt;/p>
&lt;p>We also recognized that our membership has grown and changed dramatically over recent years and that it is likely to continue to do so. We now have over 16,000 members across 140 countries. As we work to understand what’s to come and ensure that we are meeting the needs of such an expansive community, having trusted local contacts we can work closely with is key to ensuring we are more proactive in engaging with new audiences and supporting existing members.&lt;/p>
&lt;p>We know that Crossref still remains inaccessible to many around the world, and in line with our strategic goal to engage communities, we want to lower the barriers to participation. Our Ambassadors are essential to us achieving this goal as we look to develop additional content in languages other than English, identify organisations to work closer with to support local research ecosystems, provide more in-person and online events in local time zones and languages, and do more in terms of open support via our community forum.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/Crossref_AmbsdrsLogo_RGB.png"
alt="Ambassadors program logo" width="350">
&lt;/figure>
&lt;/div>
&lt;h2 id="what-are-our-ambassadors-up-to-now">What are our ambassadors up to now?&lt;/h2>
&lt;p>We currently have a &lt;a href="https://www.crossref.org/community/our-ambassadors/" target="_blank">team&lt;/a> of 30 ambassadors, spanning Indonesia, Turkey, Ukraine, India, Bangladesh, Colombia, Mexico, Tanzania, Cameroon, Nigeria, Russia, Brazil, USA, UAE, Australia, China, Malaysia, Mongolia, Singapore, and Taiwan. The program is reviewed annually, welcoming new faces and sometimes sadly saying goodbye to others. This enables us to continue improving how we work together and ensures the Ambassador team remains a diverse group of committed individuals that have the time and support from Crossref to fully participate in the program.&lt;/p>
&lt;p>Over the last 3 years, we’ve had some great successes alongside a few challenges, not least of which has been working across 15 countries during a pandemic. We have all experienced the additional personal and professional strain that COVID-19 brought along, including shifts in the way we work and anxieties in the way we go about our lives. Of course, it has also meant that all our interactions have been restricted to Zoom, which has many benefits but doesn’t compare to face-to-face interactions when it comes to building strong working relationships, particularly across language and cultural barriers.&lt;/p>
&lt;p>Despite this, our ambassador team helped us run 15 multi-lingual webinars last year, including Content Registration in Arabic, Getting Started with Books in Brazilian Portuguese, and an Introduction to Crossref in Chinese. They also helped us translate various materials and content into other languages, provided feedback on our new developments, took part in beta-testing, provided support to members on our community forum, and participated in calls to contribute to the program&amp;rsquo;s future.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I love helping people get to know Crossref&amp;rsquo;s products and services.&lt;/p>
&lt;/blockquote>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I was proud to work as Ambassador and give an online Chinese webinar to introduce Crossref and the services in Oct. 2021.&lt;/p>
&lt;/blockquote>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I am glad to be of help to Spanish speakers who are not able to grasp all the Crossref information correctly because of a language barrier or because they don&amp;rsquo;t have the time to read and explore all the information available.&lt;/p>
&lt;/blockquote>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Muy contento de poder formar parte como Embajador y con ello poder promover el uso y aprovechamiento de los productos de Crossref.&lt;/p>
&lt;/blockquote>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>I feel so blessed meeting with many diverse friends in Crossref ranging from Europe to Asia continents.&lt;/p>
&lt;/blockquote>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Feeling happy by giving back knowledge to my regional community.&lt;/p>
&lt;/blockquote>
&lt;/div>
&lt;h2 id="the-future-is-ours-to-co-create">The future is ours to co-create&lt;/h2>
&lt;p>As countries are slowly dropping restrictions and we are taking our first cautious steps into a potential ‘post-pandemic’ world, our Community Engagement and Communication team has been looking at what this means for our activities in 2022 and beyond.&lt;/p>
&lt;p>A big part of this is identifying local communities and groups to engage with to learn what challenges our members are facing, what barriers to participation in Crossref still exist, and how we can overcome these together. This practice is also fundamental to our vision of the Research Nexus––a rich and reusable open network of relationships connecting research organisations, people, things, and actions––which can only become a reality if everyone can fully contribute to the scholarly record.&lt;/p>
&lt;p>As such, we would like to expand our Ambassador Program and particularly encourage applications from those based in the following countries:&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>Argentina&lt;br>
Chile&lt;br>
Canada&lt;br>
Croatia&lt;br>
El Salvador&lt;br>
Germany&lt;br>
Ghana&lt;/p>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>Iraq&lt;br>
Kenya&lt;br>
Nicaragua&lt;br>
Nigeria&lt;br>
Peru&lt;br>
Poland&lt;br>
Vietnam&lt;/p>
&lt;/div>
&lt;/div>
&lt;p>By being one of our ambassadors, you will become a key part of the Crossref community; our first port of call for updates or to test out new products or services, become well connected to our wide network of members, and work closely with us to make scholarly communications better for all.&lt;/p>
&lt;p>If you are interested in participating, please read more on our &lt;a href="https://www.crossref.org/community/ambassadors/">Ambassadors page&lt;/a>. You can submit an &lt;a href="https://www.crossref.org/community/ambassadors/#apply-to-become-an-ambassador">application&lt;/a> letting us know why you are interested, how you work with Crossref currently, and a bit more about yourself. We will then follow up with you to discuss your ideas and the program in more detail.&lt;/p>
&lt;p>The Ambassador Program is quite flexible, so you can choose how and when you contribute based on your comfort levels and other commitments. However, it does come with some minimum requirements of attending two team calls a year, being responsive and letting us know if anything is preventing you from participating, and completing our annual feedback survey so we can continue to improve the program going forward.
A good level of English and a firm understanding of our services and systems at Crossref is also a must to participate fully in the program and provide support to others in your local community. If you have just joined Crossref or want to learn more about how to work with us, then the Ambassador program may be too much for you right now, but our documentation has lots of helpful information and step-by-step guides, and you could also look at attending one of our events or joining our community forum.&lt;/p>
&lt;p>If you have any questions, you can always contact us at &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>. We look forward to hearing from you!&lt;/p></description></item><item><title>Amendments to membership terms to open reference distribution and include UK jurisdiction</title><link>https://www.crossref.org/blog/amendments-to-membership-terms-to-open-reference-distribution-and-include-uk-jurisdiction/</link><pubDate>Mon, 04 Apr 2022 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/amendments-to-membership-terms-to-open-reference-distribution-and-include-uk-jurisdiction/</guid><description>&lt;h2 id="tldr">Tl;dr&lt;/h2>
&lt;p>Forthcoming amendments to Crossref&amp;rsquo;s membership terms will include:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Removal of &amp;lsquo;reference distribution preference&amp;rsquo; policy: &lt;strong>all references in Crossref will be treated as open metadata from 3rd June 2022.&lt;/strong>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>An addition to sanctions jurisdictions: &lt;strong>the United Kingdom will be added to sanctions jurisdictions that Crossref needs to comply with.&lt;/strong>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;br>
Sponsors and members have been emailed today with the 60-day notice needed for changes in terms.
&lt;h3 id="reference-distribution-preferences">Reference distribution preferences&lt;/h3>
&lt;p>In 2017, when we consolidated our metadata services under Metadata Plus, we made it possible for members to set a preference for the &lt;a href="https://www.crossref.org/documentation/content-registration/descriptive-metadata/references/#00564">distribution of references&lt;/a> to Open, Limited, or Closed. Prior to the 2017 change, we acted as a broker of 1:1 feeds of parts of metadata for parts of our community - clearly a role that was not scalable.&lt;/p>
&lt;p>We are well underway to pay back technical debt on our 20-year-old metadata system and effectively rearchitect it. We therefore recently needed to decide whether to rewrite code for a capability that hardly any member was using. Just one member has chosen Closed, and Limited was the default for a while, but the vast majority of our members now prefer Open distribution. Additionally, bringing references in line with other metadata significantly simplifies this work and will speed up the technical development.&lt;/p>
&lt;p>The Crossref Board discussed the issue in our meeting on 10th March 2022, and voted to remove the reference distribution policy set in 2017. &lt;a href="https://www.crossref.org/board-and-governance/#motions">All board motions&lt;/a> go on our website, and the wording of this particular motion is:&lt;/p>
&lt;blockquote>
&lt;p>Resolve that, based on a technical assessment, we will change the reference distribution policy so that all references registered with Crossref are treated the same as other metadata, following a planned transition.&lt;/p>
&lt;/blockquote>
&lt;p>This motion means that 60 days from today&amp;mdash;3rd June 2022&amp;mdash;all references in Crossref will be open and after that available through our API. As with all other metadata, if members cannot make references available, or do not want them openly distributed, they can choose not to deposit them. However, depositing references is necessary in order to retrieve citation links from our members-only Cited-by API.&lt;/p>
&lt;p>Check the documentation for information on how to &lt;a href="https://www.crossref.org/documentation/content-registration/descriptive-metadata/references/">deposit references&lt;/a> and use &lt;a href="https://www.crossref.org/documentation/cited-by/">Cited-by&lt;/a>. Also look up your &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation dashboard&lt;/a> to see if you are already registering references and your current distribution setting.&lt;/p>
&lt;h3 id="sanctions-jurisdictions">Sanctions jurisdictions&lt;/h3>
&lt;p>Following the UK departing from the European Union, we needed to add the United Kingdom as a separate jurisdiction that we must comply with, alongside the United Nations, the United States of America, and the European Union.&lt;/p>
&lt;p>Where there are either relevant financial or governance-based sanctions against individuals, organisations, geographic regions, or whole countries, Crossref is legally bound to comply with these four different jurisdictions. These laws supersede our own governing bylaws.&lt;/p>
&lt;p>We have launched a new &lt;a href="https://www.crossref.org/operations-and-sustainability">operations and sustainability&lt;/a> section of our website, which includes &lt;a href="https://www.crossref.org/operations-and-sustainability/membership-operations/sanctions/">a sanctions page&lt;/a> which we will keep updated with any changes and actions we&amp;rsquo;re taking.&lt;/p>
&lt;h2 id="the-specific-terms-that-will-change">The specific terms that will change&lt;/h2>
&lt;p>The complete membership terms are &lt;a href="https://www.crossref.org/membership/terms/">online here&lt;/a>. In the text below, any text to be removed is shown in &amp;lsquo;strike-through&amp;rsquo; text and any additions are in bold. These new terms will be in effect from 3rd June 2022.&lt;/p>
&lt;hr>
&lt;p>&lt;strong>5. Distribution of Metadata by Crossref.&lt;/strong> Without limiting the provisions of Section 4 above, the Member acknowledges and agrees that&lt;del>, subject to the Member&amp;rsquo;s reference distribution preference,&lt;/del>all Metadata and Identifiers registered with Crossref are made available for reuse without restriction through (but not limited to) public APIs and search interfaces, which enhances discoverability of Content. Metadata and Identifiers may also be licensed to third party subscribers along with an agreement for Crossref to provide third parties with certain higher levels of support and service. &lt;del>For the avoidance of doubt, the scope of Crossref&amp;rsquo;s distribution (if any) of a Member&amp;rsquo;s references is based on such Member&amp;rsquo;s reference distribution preference, as established by the Member in accordance with the &amp;ldquo;Reference Distribution&amp;rdquo; page on the Website.&lt;/del>&lt;/p>
&lt;p>&lt;strong>20. Compliance.&lt;/strong> Each of the Member and Crossref shall perform under this Agreement in compliance with all laws, rules, and regulations of any jurisdiction which is or may be applicable to its business and activities, including anti-corruption, copyright, privacy, and data protection laws, rules, and regulations.&lt;/p>
&lt;p>The Member warrants that neither it nor any of its affiliates, officers, directors, employees, or members is (i) a person whose name appears on the list of Specially Designated Nationals and Blocked Persons published by the Office of Foreign Assets Control, U.S. Department of Treasury (“OFAC”), (ii) a department, agency or instrumentality of, or is otherwise controlled by or acting on behalf of, directly or indirectly, any such person; (iii) a department, agency, or instrumentality of the government of a country subject to comprehensive U.S. economic sanctions administered by OFAC; or (iv) is subject to sanctions by the United Nations, &lt;strong>the United Kingdom,&lt;/strong> or the European Union.&lt;/p>
&lt;hr>
&lt;p>As always, please get in touch with us via &lt;a href="mailto:member@crossref.org">member@crossref.org&lt;/a> with any questions.&lt;/p></description></item><item><title>With a little help from your Crossref friends: Better metadata</title><link>https://www.crossref.org/blog/with-a-little-help-from-your-crossref-friends-better-metadata/</link><pubDate>Thu, 31 Mar 2022 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/with-a-little-help-from-your-crossref-friends-better-metadata/</guid><description>&lt;p>We talk so much about more and better metadata that a reasonable question might be: what is Crossref doing to help?&lt;/p>
&lt;p>Members and their service partners do the heavy lifting to provide Crossref with metadata and we don’t change what is supplied to us. One reason we don’t is because members can and often do change their records (important note: updated records do not incur fees!). However, we do a fair amount of behind the scenes work to check and report on the metadata as well as to add context and relationships. As a result, some of what you see in the metadata (and some of what you don’t) is facilitated, added or updated by Crossref.&lt;/p>
&lt;p>Much of the work is automated but some of it still requires manual intervention (sound familiar?). Here’s an overview:&lt;/p>
&lt;h2 id="before-registration">Before registration&lt;/h2>
&lt;p>Our &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/" target="_blank">open APIs&lt;/a> allow for Crossref metadata to be used throughout research and scholarly communications systems and services, before and after records are registered with us. Those who have used a search function in something like a manuscript submission system, rather than having to hand key or copy and paste the information, will appreciate how these integrations reduce time, effort and the likelihood of errors in collecting metadata well before it gets to Crossref.&lt;/p>
&lt;p>For one example, it’s very common for members to use the metadata to add DOIs to reference lists when preparing deposits. Of course, new members first need a &lt;a href="https://www.crossref.org/documentation/member-setup/constructing-your-dois/" target="_blank">prefix&lt;/a> (and a memberID and name, but more on that later) in order to register content. We also provide a &lt;a href="https://www.crossref.org/documentation/member-setup/constructing-your-dois/suggested-doi-registration-workflow-including-suffix-generator/" target="_blank">suffix generator&lt;/a> for help in &lt;a href="https://www.crossref.org/documentation/member-setup/constructing-your-dois/" target="_blank">constructing DOIs&lt;/a>. If you’re not sure how best to make use of existing metadata in deposits, we’ve got &lt;a href="https://www.crossref.org/services/metadata-retrieval/" target="_blank">a few options&lt;/a> for you. Questions are welcome.&lt;/p>
&lt;p>We don’t often put it this way but we should: Crossref members rely on the metadata as much, if not more, than the rest of the community. More and better metadata directly benefits our members.&lt;/p>
&lt;h2 id="upon-registration">Upon registration&lt;/h2>
&lt;p>There are a number of ways we work with the metadata when deposits are received.&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Checking for uniqueness&lt;/strong> In order to avoid duplicate records, we check to make sure that a title or work hasn&amp;rsquo;t been registered before. Depending on what we find, a conflict report or failed registration may result.&lt;/li>
&lt;li>&lt;strong>Adding DOIs to references&lt;/strong> When references come to us without DOIs, we’ll try to match and add them.&lt;/li>
&lt;li>&lt;strong>&lt;a href="https://www.crossref.org/community/orcid/" target="_blank">ORCID auto-update&lt;/a>&lt;/strong> We automatically update authors’ ORCID records (with their permission of course) whenever deposits include their ORCID iDs.&lt;/li>
&lt;li>&lt;strong>Preprint to VoR reports&lt;/strong> We compare title information and provide &lt;a href="https://www.crossref.org/documentation/content-registration/content-type-markup-guide/posted-content-includes-preprints/#00094" target="_blank">notifications&lt;/a> of matching records to members depositing preprints, to help them fulfill their obligation to link to Versions of Record (VoRs), where they exist.&lt;/li>
&lt;li>&lt;strong>&lt;a href="https://www.crossref.org/documentation/content-registration/structural-metadata/relationships/" target="_blank">Relationships&lt;/a>&lt;/strong> Like preprint to VoR links, components are another kind of relationship. These might be supplementary material such as figures we can link to the ‘parent’ record.&lt;/li>
&lt;li>&lt;strong>Funding data&lt;/strong> When members register only a funder name as part of the information on who funded the work, we’ll try to match it to its identifier from the &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">Funder Registry&lt;/a>, to support better linking between funders and works.&lt;/li>
&lt;li>&lt;strong>Timestamps&lt;/strong> We add date-times for first created and last updated to member-supplied timestamps.&lt;/li>
&lt;li>&lt;strong>Count of references&lt;/strong> That’s right, we count all the references for each record that includes them and add the total to the metadata.&lt;/li>
&lt;/ul>
&lt;h2 id="after-registration">After registration&lt;/h2>
&lt;p>Once registered, we check, report on and update metadata in a few ways.&lt;/p>
&lt;ul>
&lt;li>&lt;strong>&lt;a href="https://www.crossref.org/documentation/reports/doi-crawler-report/" target="_blank">Link checking&lt;/a>&lt;/strong> We email each member a monthly Resolution Report with details of the number of failed and successful resolutions for their DOIs. If someone in the community reports a DOI that isn’t registered, we email the member a DOI Error Report.&lt;/li>
&lt;li>&lt;strong>Citation counts and matches&lt;/strong> Citation counts for records of members participating in our &lt;a href="https://www.crossref.org/services/cited-by/" target="_blank">Cited-by service&lt;/a> are openly available in our REST API. The matching citations themselves are available to members, for their own records only.&lt;/li>
&lt;li>&lt;strong>&lt;a href="https://doi.org/10.64000/7hff7-sc238" target="_blank">Title transfers&lt;/a>&lt;/strong> Title, prefix and DOI transfers are common and require assistance from our team.&lt;/li>
&lt;li>&lt;strong>MemberID&lt;/strong> It’s not uncommon for members to have more than one prefix. The memberID means users of the REST API can query for records associated with all of a member’s prefixes.&lt;/li>
&lt;li>&lt;strong>Digital preservation&lt;/strong> We handle the infrequent but critical update of URLs that are necessary when titles are triggered for digital preservation. We also preserve the metadata itself, with both &lt;a href="https://clockss.org/" target="_blank">CLOCKSS&lt;/a> and &lt;a href="https://www.portico.org/" target="_blank">Portico&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>Of course, since records are often redeposited with updates (note, deposit fees are only charged once per record), some of these processes on our side are repeated as necessary.&lt;/p>
&lt;p>This list isn’t exhaustive and other needs and opportunities will emerge. For example, we are looking at matching to add &lt;a href="https://doi.org/10.64000/9aaza-a3158" target="_blank">ROR&lt;/a> IDs, as we do for funderIDs, and doing some research into how we might determine and assert subject classifications at the work-level. If you&amp;rsquo;re interested in more about this kind of work, you&amp;rsquo;ll want to read this &lt;a href="https://doi.org/10.64000/ske16-xve54" target="_blank">recent post&lt;/a> by my Labs colleague Dominika on matching grants to outputs.&lt;/p>
&lt;p>&lt;a href="mailto:feedback@crossref.org">Get in touch&lt;/a> if you have questions or for more information.&lt;/p></description></item><item><title>Perspectives: Bruna Erlandsson on scholarly communications in Brazil</title><link>https://www.crossref.org/blog/perspectives-bruna-erlandsson-on-scholarly-communications-in-brazil/</link><pubDate>Mon, 28 Mar 2022 00:00:00 +0000</pubDate><author>Bruna Erlandsson</author><guid>https://www.crossref.org/blog/perspectives-bruna-erlandsson-on-scholarly-communications-in-brazil/</guid><description>&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2022/perspectives.png" alt="sound bar logo" width="150px" class="img-responsive" />
&lt;/div>
&lt;p>Join us for the first in our &lt;em>Perspectives&lt;/em> blog series. In this series of blogs, we will be meeting different members of our diverse, global community at Crossref. We learn more about their lives, how they came to know and work with us, and we hear insights about the scholarly research landscape in their country, challenges they face, and plans for the future.&lt;/p>
&lt;p>In our first blog, we meet Bruna Erlandsson, Crossref Ambassador in Brazil, co-owner of Linceu Editorial, and client services manager at ABEC Brasil. Bruna has dedicated her career to scholarly publishing and has worked with Crossref for many years. We invite you to have a read and a listen below to meet Bruna!&lt;/p>
&lt;div class='shortcode-row '> &lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div
id="video-player-perspectives-bruna-english"
class="vimeo-player-container"
data-player-id="perspectives-bruna-english"
data-default-video="1030565718"
>
&lt;div class="video-languages">
&lt;pre>&lt;code>&amp;lt;a type=&amp;quot;button&amp;quot; style=&amp;quot;cursor:pointer;&amp;quot; class=&amp;quot;video-language-button&amp;quot; data-videoid=&amp;quot;1030565718&amp;quot; data-playerid=&amp;quot;video-player-perspectives-bruna-english&amp;quot;&amp;gt;Portuguese&amp;lt;/a&amp;gt;
&lt;/code>&lt;/pre>
&lt;/div>
&lt;div id="vimeo-perspectives-bruna-english">&lt;/div>
&lt;/div>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;div
id="video-player-perspectives-bruna-portuguese"
class="vimeo-player-container"
data-player-id="perspectives-bruna-portuguese"
data-default-video="1030565745"
>
&lt;div class="video-languages">
&lt;pre>&lt;code>&amp;lt;a type=&amp;quot;button&amp;quot; style=&amp;quot;cursor:pointer;&amp;quot; class=&amp;quot;video-language-button&amp;quot; data-videoid=&amp;quot;1030565745&amp;quot; data-playerid=&amp;quot;video-player-perspectives-bruna-portuguese&amp;quot;&amp;gt;English&amp;lt;/a&amp;gt;
&lt;/code>&lt;/pre>
&lt;/div>
&lt;div id="vimeo-perspectives-bruna-portuguese">&lt;/div>
&lt;/div>
&lt;/div>&lt;/div>
&lt;p>&lt;br>&lt;br>&lt;/p>
&lt;blockquote>
&lt;p>Tell us a bit about your organisation, your objectives, and your role&lt;br>
​​Conte-nos um pouco sobre sua organização, seus objetivos e sua função&lt;/p>
&lt;/blockquote>
&lt;p>I am a co-founder of the company Linceu Editorial, dedicated to publishing scientific and technological research in ethical, creative, and innovative ways. We strive to provide quality editorial services that meet standard industry requirements and best practices, increase visibility, attract readers and potential authors, and ensure their work is properly cited. My personal goal is to be recognized by the scientific community for providing excellent service to our clients.&lt;/p>
&lt;p>Sou sócia proprietária da empresa Linceu Editorial, que se dedica à editoração de artigos científicos de inúmeras revistas, de forma ética, criativa e inovadora. Buscamos atribuir aos periódicos de nosso portfólio os requisitos de qualidade editorial alinhados às melhores práticas editoriais, de forma que aumentem sua visibilidade e atraiam leitores, potenciais autores e, não menos importante, que recebam citações em seus artigos. Meu objetivo pessoal é obter reconhecimento da comunidade científica por meio de uma prestação de serviço em nível de excelência.&lt;/p>
&lt;blockquote>
&lt;p>What is one thing that others should know about your country and its research activity?&lt;br>
O que os outros deveriam saber sobre seu país e sua atividade de pesquisa?&lt;/p>
&lt;/blockquote>
&lt;p>Brazil is the South American leader in publishing scientific articles in Open Access journals. However, it faces challenges due to the absence of a more comprehensive public policy to support scientific editors. As a result, most journals are produced by teaching and/or research institutions or scientific associations with volunteer editorial teams that, although lacking professional journal production skills, produce high-quality journals. Only a tiny percentage of Brazilian journals are published through commercial publishers.&lt;/p>
&lt;p>O Brasil é o líder sul-americano na publicação de artigos científicos, com destaque para as revistas em acesso aberto. No entanto, enfrenta desafios em função da ausência de uma política pública mais abrangente para apoio aos editores científicos. A maior parte dos periódicos é produzida por instituições de ensino/pesquisa ou Sociedades Científicas, tendo uma equipe editorial voluntária e carecendo de profissionalização em sua produção, embora, em muitos casos, apresentem boa qualidade. Apenas uma pequena porcentagem de periódicos brasileiros é publicada por meio de um publisher comercial.&lt;/p>
&lt;blockquote>
&lt;p>Are there trends in scholarly communications that are unique to your part of the world?&lt;br>
Existem tendências nas comunicações acadêmicas que são únicas em sua parte do mundo?&lt;/p>
&lt;/blockquote>
&lt;p>I wouldn&amp;rsquo;t say unique. However, adherence to Open Science practices, such as preprints and making research data available, is already part of the editorial culture. On the other hand, open peer review is not yet well accepted by everyone in the scientific community, and only a few journals adopt it. In addition, in some areas of research, such as Education and Social Science, researchers are very active - on forums, in discussions lists and attending the same conferences - so there’s this feeling that ‘everyone knows everyone’ which can then lead to potential conflicts of interest and apprehensiveness around open peer review, particularly when it comes to publishing a negative review.&lt;/p>
&lt;p>Eu não diria única, mas penso que, no Brasil, a adesão às práticas da ciência aberta, como publicação em preprint e disponibilização de dados de pesquisa, já fazem parte da cultura editorial. Por outro lado, a revisão aberta ainda não é bem aceita por toda comunidade científica, sendo poucos os periódicos que o adotam. Além disso, em algumas áreas de conhecimento com grande produção local, como por exemplo a Ciências Sociais e Educação, a interação entre membros da comunidade é muito grande, visto que são pesquisadores muito ativos em fóruns, listas de discussões e conferências da área, causando a sensação de que &amp;ldquo;todo mundo conhece todo mundo&amp;rdquo;, resultando em um possível conflito de interesse, visto que existe um grande receio em publicar um parecer aberto, especialmente se o caso for um parecer negativo.&lt;/p>
&lt;blockquote>
&lt;p>What about any political policies, challenges, or mandates that you have to consider in your work?&lt;br>
E as políticas, desafios ou mandatos políticos que você deve considerar em seu trabalho?&lt;/p>
&lt;/blockquote>
&lt;p>In Latin America we have a large indexing database, Redalyc, and a digital library of Open Access journals, which has recently excluded a number of journals for charging APCs (Article Processing Charges), upon the understanding that this would go against their Diamond Open Access requirement.&lt;/p>
&lt;p>However, in Brazil - in general - the understanding of Open Access is not so limited. Charging APCs are in fact encouraged by many as a form of self-sustainability of the journal while still being Open Access.&lt;/p>
&lt;p>As for challenges, one of the biggest is whether or not to publish in English. Although the number of Brazilian journals that publish exclusively in English or both languages (Portuguese and English) is remarkably high. There is still however a belief that local science is only of interest to the local public, and so some question whether there is a value in publishing in English (or other languages). For example, if an author writes a research paper about a small riverside community in the countryside of Acre state in Brazil, they might ask why someone outside the country would be interested in reading that.&lt;/p>
&lt;p>Aqui na América Latina, temos uma grande base indexadora, Redalyc, e biblioteca digital de periódicos de Acesso Aberto que, recentemente, excluíu da base um número considerável de periódicos que cobrassem qualquer tipo de taxa de publicação, por entender que isso iria contra os requisitos de seu modelo de Acesso Aberto Diamante (periódicos em acesso aberto livre de taxa de publicação).&lt;/p>
&lt;p>No entanto, no Brasil, em geral, o entendimento é outro, a cobrança de taxas de processamento não descaracteriza o acesso aberto, sendo, na verdade, encorajado por muitos como uma forma de auto-sustentabilidade do periódico.&lt;/p>
&lt;p>Já em relação a desafios, acredito que um dos maiores é a questão de publicar ou não em inglês. Embora seja notável o número de periódicos brasileiros que publicam exclusivamente em inglês ou ainda nos dois idiomas (português e inglês), existe ainda a crença de que a ciência local só teria interesse do público local, criando assim o questionamento se há ou não o valor em publicar em outro idioma. Por exemplo, se uma pesquisa estuda algo sobre uma comunidade ribeirinha no interior do estado do Acre, aqui no Brasil, é comum existir a dúvida se algo tão específico seria do interesse de alguém de fora do nosso país.&lt;/p>
&lt;blockquote>
&lt;p>How would you describe the value of being part of the Crossref community; what impact has your participation had on your goals?&lt;br>
Como você descreveria o valor de fazer parte da comunidade Crossref; que impacto teve sua participação em seus objetivos?&lt;/p>
&lt;/blockquote>
&lt;p>I get immense value from being part of the Crossref community. Being a Crossref Ambassador brings greater recognition and legitimacy to my role working with editors and adds value to my company’s services as well. The title of Ambassador enhances trust in my opinions, presentations, and when providing support and clarification to those asking questions. However it also comes with a great responsibility to do this well, which motivates me to always keep up to date with developments at Crossref. Through the Ambassador Program I have given several webinars for Crossref and the Associação Brasileira de Editores Científicos (ABEC Brasil), which provide much needed information and support to Portuguese speaking Crossref members as well as enhancing the visibility of my professional activities at Linceu Editorial.&lt;/p>
&lt;p>É um valor enorme fazer parte da comunidade Crossref! Ser Embaixadora do Crossref traz um reconhecimento entre os editores e agrega valor aos serviços de minha empresa. Esse título assegura confiabilidade em minhas opiniões, apresentações, e esclarecimentos de dúvidas, o que traz junto uma grande responsabilidade que me motiva a me manter sempre atualizada com tudo em relação ao Crossref. Através do Programa de Embaixadores eu ministrei diversos webinários para a Crossref e também para a Associação Brasileira de Editores Científicos (ABEC Brasil), fornecendo muitas informações necessárias para os membros da Crossref que falam português, e também isso tudo acaba por retornar em visibilidade para as minhas atividades profissionais na Linceu Editorial.&lt;/p>
&lt;blockquote>
&lt;p>For you, what would be the most important thing Crossref could change (do more of/do better in)?&lt;br>
Para você, qual seria a coisa mais importante que o Crossref poderia mudar (fazer mais/fazer melhor)?&lt;/p>
&lt;/blockquote>
&lt;p>I think there is still a need for more multilingual training both online and face-to-face, which has been particularly lacking during the pandemic, to provide more information on Crossref services beyond Content Registration. For example Similarity Check is a service that people still have a lot of questions about (such as ‘what is the magic similarity percentage score to identify plagiarism?’ Answer - there isn’t one!). Crossmark is another service where I believe people could benefit from more training on it’s importance in the publication process, not only in cases of retraction but also in guaranteeing that the article is up-to-date and trustworthy. In Brazil many people use Open Journal Systems (OJS) and so the development of Crossref service specific plugins and training on how to use them is really useful!&lt;/p>
&lt;p>Acho que ainda há necessidade de mais treinamentos multilíngues, tanto online quanto presencial – o que tem sido particularmente escasso durante a pandemia – para fornecer mais informações sobre os serviços do Crossref além do Registro de Conteúdo. Por exemplo, o Similarity Check é um serviço sobre o qual as pessoas ainda têm muitas dúvidas (como &amp;lsquo;qual é a porcentagem de similaridade aceitável para identificar plágio?&amp;rsquo; Resposta - não existe!). O Crossmark é outro serviço onde acredito que as pessoas poderiam se beneficiar de mais treinamento sobre sua importância no processo de publicação, não apenas em casos de retratação, mas também para garantir que o artigo esteja sempre atualizado e confiável. No Brasil muitas pessoas usam o Open Journal Systems (OJS) e por isso o desenvolvimento de plugins específicos do serviço Crossref e treinamento sobre como usá-los seriam muito úteis!&lt;/p>
&lt;blockquote>
&lt;p>Which other organisations do you collaborate with or are pivotal to your work in open science?&lt;br>
Com quais outras organizações você colabora ou é fundamental para o seu trabalho em ciência aberta?&lt;/p>
&lt;/blockquote>
&lt;p>I contribute to ABEC Brasil in a variety of ways including speaking on short courses about Crossref, designing content for lectures as part of an online program called ABEC Educação (which will be launched soon), and as a volunteer consultant to answer a variety of questions from editors regarding content registration at Crossref.&lt;/p>
&lt;p>Contribuo com a ABEC Brasil, participando tanto como ministrante de minicursos sobre ferramentas Crossref quanto como conteudista de um curso no Programa EaD ABEC Educação (que será lançado em breve), além de como consultora voluntária para atender a diversas dúvidas de editores em relação a depósito de conteúdo.&lt;/p>
&lt;blockquote>
&lt;p>What are the post-pandemic challenges you are facing and how are you adapting to them?&lt;br>
Quais são os desafios pós-pandemia que você está enfrentando e como você está se adaptando a eles?&lt;/p>
&lt;/blockquote>
&lt;p>Considering the current situation in Brazil, I don’t think I would consider us having reached ‘post-pandemic’ just yet. Although vaccination is taking place successfully, there are still many uncertainties and fears. A good example of this is Crossref LIVE Brazil which was canceled at the start of the pandemic and at the moment we still don’t know when we will be able to reschedule this. It still feels too risky to bring a number of speakers from abroad to Brazil and too soon to hold such a large in-person event.&lt;/p>
&lt;p>However, if I had to highlight one challenge I&amp;rsquo;ve been facing, it would be something more personal rather than work-related. Beyond a shadow of a doubt, it would be the lack of human contact! It has been really hard to get use to not gathering together with family and friends and not being able to travel, meet new people, and experience new cultures. To deal with it, I spend my free time planning the places I will go to and people I will visit as soon as this whole situation is over!&lt;/p>
&lt;p>Para ser honesta, considerando a realidade atual no Brasil, eu ainda não considero o momento atual &amp;ldquo;pós-pandemia&amp;rdquo;. Embora a vacinação esteja ocorrendo com sucesso, ainda existem muitas incertezas e medos. Um exemplo bem claro é o Crossref Live in Brazil, que foi cancelado assim que a pandemia foi &amp;ldquo;anunciada&amp;rdquo; e, até hoje, não sabemos quando ocorrerá, pois ainda soa muito arriscado trazer palestrantes de fora para o Brasil e também se encontrar com diversas pessoas em um evento presencial.&lt;/p>
&lt;p>No entanto, se eu tivesse que destacar um desafio que tenho enfrentado, seria algo mais pessoal e não relacionado ao trabalho. E, sem sombras de dúvidas, seria a falta de contato humano! Está sendo realmente complicado se acostumar em não encontrar amigos e familiares, e também não poder viajar e conhecer novos lugares, pessoas e culturas – o jeito que encontrei para lidar com isso é gastar meu tempo livre planejando todos os lugares que irei e todas as pessoas que visitarei assim que essa situação toda passar.&lt;/p>
&lt;blockquote>
&lt;p>What are your plans for the future?&lt;br>
Quais são seus planos para o futuro?&lt;/p>
&lt;/blockquote>
&lt;p>My plans for the future include continuously learning more and more about scholarly publishing including the various services that Crossref provides. I want to be able to help publishers implement valuable tools into their workflows such as Similarity Check and Crossmark, and contribute to greater scientific dissemination of Brazilian research so that Brazilian journals can get the global recognition, visibility and value they deserve.&lt;/p>
&lt;p>Meus planos para o futuro incluem aprender cada vez mais e mais sobre publicação científica, incluindo os vários serviços que o Crossref oferece. Quero poder ajudar os editores a implementar ferramentas valiosas em seus fluxos de trabalho, como Similarity Check e Crossmark, e contribuir para uma maior divulgação científica das pesquisas brasileiras para que os periódicos brasileiros possam obter o reconhecimento global, visibilidade e valor que merecem.&lt;/p>
&lt;p>Thank you, Bruna!&lt;br>
Obrigado, Bruna!&lt;/p></description></item><item><title>Outage of March 24, 2022</title><link>https://www.crossref.org/blog/outage-of-march-24-2022/</link><pubDate>Thu, 24 Mar 2022 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/outage-of-march-24-2022/</guid><description>&lt;p>So here I am, apologizing again. Have I mentioned that I hate computers?&lt;/p>
&lt;p>We had &lt;a href="https://status.crossref.org/incidents/gwxd1yqdw304" target="_blank">a large data center outage&lt;/a>. It lasted 17 hours. It meant that pretty much all Crossref services were unavailable - our main website, our content registration system, our reports, our APIs. 17 hours was a long time for us - but it was also an inconvenient time for numerous members, service providers, integrators, and users. We apologise for this.&lt;/p>
&lt;p>Like the &lt;a href="https://doi.org/10.64000/sen6x-c2c16" target="_blank">outage last October&lt;/a>, the issue was related to the data center that we are trying to leave. However, unlike last time, our single nearby network admin wasn&amp;rsquo;t in surgery at the time. Tim was alerted in the early hours of his morning and was able get up and immediately investigate.&lt;/p>
&lt;p>Despite having both secondary and tertiary backup connections, neither activated appropriately.&lt;/p>
&lt;p>The problem was with incomplete BGP (Border Gateway Protocol) settings on our primary connection&amp;rsquo;s network provider’s side. We never noticed this because our backup connection had the correct and complete BGP settings. But our backup circuit went down (we don’t know why yet), and when the router with complete settings went down, only the router with the incomplete settings was available and so everything went down.&lt;/p>
&lt;p>We hadn’t yet fully configured the tertiary connection to cut over automatically. This meant cutting over to the tertiary during the outage would have required manual and potentially error-prone reconfiguration. Not something we wanted to do in a hurry with a sleep-deprived network admin.&lt;/p>
&lt;p>It’s not an excuse at all. But we are currently down two people in our infrastructure group. One of our infrastructure staff recently left for a startup, and we are already hiring a new third position. In short, our one-long-suffering sysadmin had to field this all by himself. But hey - &lt;a href="https://www.crossref.org/jobs/2022-03-15-head-of-infrastructure/">we are hiring a Head of Infrastructure&lt;/a>, and if you are interested you can now see the work you&amp;rsquo;d have cut out for you!&lt;/p>
&lt;p>So things are back up and we’ve resolved the incident but we are carefully and cautiously monitoring. We will further analyze what went wrong and post an update when we have a clearer picture.&lt;/p>
&lt;p>I apologize for the downstream pain this outage will have inevitably caused. We realize that many people will now be scrambling to clean things up after this lengthy outage.&lt;/p>
&lt;p>More when I have it… but for now I&amp;rsquo;ll mostly be curled up in a ball.&lt;/p></description></item><item><title>Announcing the ROR Sustaining Supporters program</title><link>https://www.crossref.org/blog/announcing-the-ror-sustaining-supporters-program/</link><pubDate>Wed, 23 Mar 2022 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/announcing-the-ror-sustaining-supporters-program/</guid><description>&lt;p>In collaboration with California Digital Library and DataCite, Crossref guides the operations of the &lt;a href="https://ror.org" target="_blank">Research Organisation Registry (ROR)&lt;/a>. ROR is community-driven and has an independent sustainability plan involving grants, donations, and in-kind support from our staff.&lt;/p>
&lt;p>ROR is a vital component of the Research Nexus, our vision of a fully connected open research ecosystem. It helps people identify, connect, and analyze the affiliations of those contributing to, producing, and publishing all kinds of research objects. Crossref added support for ROR to its schema and REST API in 2021 and we are &lt;a href="https://doi.org/10.64000/9aaza-a3158" target="_blank">asking Crossref members&lt;/a> to use ROR IDs for author affiliations in the metadata they deposit with Crossref. But this post is about how the Crossref community can support ROR in another way.&lt;/p>
&lt;p>All three lead organisations&amp;mdash;as well as the ROR initiative&amp;mdash;have publicly committed to the &lt;a href="http://openscholarlyinfrastructure.org/" target="_blank">POSI Principles&lt;/a> and we know that our diverse and global community is increasingly interested in showing its support for open scholarly infrastructure too. Now there&amp;rsquo;s an opportunity to show that support; the following blog by Maria Gould, cross-posted from the &lt;a href="https://ror.org/blog/2022-02-28-help-sustain-ror/" target="_blank">ROR blog&lt;/a>, explains how.&lt;/p>
&lt;hr>
&lt;h2 id="ror-begins-a-new-round-of-community-fundraising">ROR begins a new round of community fundraising&lt;/h2>
&lt;p>Since ROR launched in 2019, we have been charting a path to sustainability that leverages our broad community network and diversifies our funding sources. ROR is currently funded through a combination of in-kind support from its three operating organisations, project-based grant funds, and financial contributions from community members.&lt;/p>
&lt;p>While ROR aims to minimize overhead and contain costs, it still requires resources to build and maintain the registry&amp;rsquo;s infrastructure, especially as adoption continues to grow. ROR has been working to establish independent revenue streams that complement ROR&amp;rsquo;s in-kind support, avoid dependence on grant funds, and ensure the registry data remains openly available.&lt;/p>
&lt;p>This year, ROR is initiating a new round of community fundraising. Building on the &lt;a href="https://ror.org/blog/2019-10-16-help-sustain-ror" target="_blank">community fundraising campaign&lt;/a> we ran during 2019-2021, we are renewing a call for organisations to commit to supporting ROR financially. We are launching a Sustaining Supporters program that opens up new ways for organisations to participate in the collective funding of ROR.&lt;/p>
&lt;h2 id="ror-sustaining-supporters-program">ROR Sustaining Supporters program&lt;/h2>
&lt;p>With the Sustaining Supporters program, organisations are encouraged to support ROR&amp;rsquo;s operating expenses on a recurring annual basis. Any organisation that signs up to support ROR through the end of 2022 will be recognized as a Founding Supporter and receive a supporter badge that can be displayed on their website.&lt;/p>
&lt;p>We want to make the process of contributing to ROR as easy as possible. To ensure this is the case, organisations can support ROR at any amount that works for their budget and capacity. Also, to simplify the invoicing process, organisations that are already members of &lt;a href="https://crossref.org/" target="_blank">Crossref&lt;/a> or &lt;a href="https://datacite.org/" target="_blank">DataCite&lt;/a> can choose to receive an invoice directly from Crossref and DataCite for their ROR contributions. However, if organisations prefer, they can also be invoiced directly from ROR.&lt;/p>
&lt;h2 id="why-support-ror">Why support ROR&lt;/h2>
&lt;p>ROR aims to be an example of the power and potential of community-funded open infrastructure. ROR is committed to providing open, stakeholder-governed infrastructure for research organisation identifiers and associated metadata. Implementation of ROR IDs in scholarly infrastructure and metadata enables more efficient discovery and tracking of research outputs across institutions and funding bodies.&lt;/p>
&lt;p>The Sustaining Supporters program is the next step in ROR&amp;rsquo;s sustainability journey. ROR is continuing to explore future potential paid service tiers designed for those organisations and companies that rely heavily on our infrastructure, which would complement the supporters program. However, rest assured that any paid services will not impact the availability of ROR data or our commitment to supporting our community, in line with our commitment to the &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a>.&lt;/p>
&lt;p>We&amp;rsquo;ve all seen key infrastructure components disappear, be enclosed, or get acquired. We are also realistic about how much effort and cost is involved in sustaining key components of open infrastructure that the scholarly community depends on. And we are committed to doing this right. That means not just sustaining core infrastructures, but investing in them so that they can evolve alongside community needs.&lt;/p>
&lt;p>ROR is a free resource for the research community. However, this shared infrastructure does require a collective funding approach that can sustain it as a common good.&lt;/p>
&lt;h2 id="join-us">Join us!&lt;/h2>
&lt;p>This is an exciting moment to be part of ROR&amp;rsquo;s growth. Let&amp;rsquo;s fund open infrastructure together!&lt;/p>
&lt;p>If your organisation is interested in supporting ROR and helping to fund open, community-led infrastructure, &lt;a href="https://ror.org/sustain/" target="_blank">sign up here&lt;/a>.&lt;/p></description></item><item><title>Follow the money, or how to link grants to research outputs</title><link>https://www.crossref.org/blog/follow-the-money-or-how-to-link-grants-to-research-outputs/</link><pubDate>Tue, 22 Mar 2022 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/follow-the-money-or-how-to-link-grants-to-research-outputs/</guid><description>&lt;p>The ecosystem of scholarly metadata is filled with relationships between items of various types: a person authored a paper, a paper cites a book, a funder funded research. Those relationships are absolutely essential: an item without them is missing the most basic context about its structure, origin, and impact. No wonder that finding and exposing such relationships is considered very important by virtually all parties involved. Probably the most famous instance of this problem is finding citation links between research outputs. Lately, another instance has been drawing more and more attention: linking research outputs with grants used as their funding source. How can this be done and how many such links can we observe?&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>We looked for links between research outputs and grants registered with Crossref.&lt;/li>
&lt;li>Grant DOIs alone are not enough for linking research outputs with grants, because the funding information in research outputs typically does not contain grant DOIs (yet). Award numbers alone are also not enough because they are not globally unique.&lt;/li>
&lt;li>We used either grant DOIs (if available) or the combination of award number and funder information to match grants to research outputs.&lt;/li>
&lt;li>In total, we found 20,834 links between research outputs and registered grants, involving 17,082 research outputs and 3,858 grants (10% of all registered grants)&lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>.&lt;/li>
&lt;li>Erroneous and incomplete metadata, especially involving award numbers, is the main factor that prevents linking research outputs to grants.&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>The ecosystem of scholarly metadata is filled with relationships between items of various types: a person authored a paper, an author works at a university, a paper cites a book, a book contains a chapter, a funder funded research. Those relationships are absolutely essential: an item without them is missing the most basic context about its structure, origin, and impact.&lt;/p>
&lt;p>No wonder that finding and exposing relationships between items in the scientific ecosystem is considered very important by virtually all parties involved. Probably the most famous instance of this problem is finding citation links between research outputs. Another, relatively new example, is linking research outputs with grants used as their funding source.&lt;/p>
&lt;p>At Crossref, for some time now we have been seeing a steady growth of funder membership and grant registration. We are aware that the possibility of finding relationships between grants and research outputs is a big reason why funders are registering grants with us in the first place. Being able to see which research outputs are being supported by which grants helps reduce the reporting burden on researchers, funders, and institutions alike, especially now with the addition of &lt;a href="https://doi.org/10.64000/1nkjy-15275" target="_blank">ROR IDs&lt;/a> to help complete the picture. Exposing relationships between research outputs and grants also increases the transparency of funding sources of the research, making it easier to assess and trust scientific findings.&lt;/p>
&lt;p>But how can we find those relationships and how many of them can we already observe? Thankfully our REST API, &lt;a href="https://doi.org/10.64000/tynar-j7a72" target="_blank">recently equipped with the grant metadata&lt;/a>, can help us answer these questions.&lt;/p>
&lt;h2 id="the-perfect-scenario">The perfect scenario&lt;/h2>
&lt;p>Imagine a world where the metadata of any scientific output states all relationships with other items existing in the scientific ecosystem, and those related items are always referred to by their persistent identifiers, allowing all this information to be accessed in a fully machine-readable way&amp;hellip; Lovely, right?&lt;/p>
&lt;p>In the case of citations, in such a perfect world every bibliographic reference has a DOI of the cited item. And in the case of funding information, a scientific paper contains grant DOIs, stating the funded-by relationships between the paper and the grants.&lt;/p>
&lt;p>But, as the last two years have painfully taught us all, life is not all rainbows and unicorns.&lt;/p>
&lt;h2 id="the-reality-kicks-in">The reality kicks in&lt;/h2>
&lt;p>We know that around &lt;a href="https://doi.org/10.64000/txft6-s1481" target="_blank">71% of bibliographic references are deposited with Crossref without a DOI of the cited item&lt;/a>. This means that if we want to establish citation links between items, we need to match the bibliographic references using the provided metadata, which is not a trivial task and can potentially introduce errors.&lt;/p>
&lt;p>And the situation with the funding information and grant DOIs is even worse.&lt;/p>
&lt;h3 id="problem-1-our-schema-does-not-allow-the-publishers-to-attach-grant-dois-to-research-outputs">Problem #1: our schema does not allow the publishers to attach grant DOIs to research outputs&lt;/h3>
&lt;p>This issue is 100% on us. Because grant DOIs are relatively new, our deposit schema does not yet allow to specify the grant DOI in the funding information of a research output, even if the publisher wanted to. We are working on changing this.&lt;/p>
&lt;p>Interestingly, it looks like persistent identifiers always find a way. Within over 7.4 million research outputs with funding information, we noticed 6 cases where a grant DOI was provided as an award number. For example in &lt;a href="https://api.crossref.org/works/10.1093/nar/gkaa994" target="_blank">10.1093/nar/gkaa994 &lt;/a>we have the following:&lt;/p>
&lt;pre tabindex="0">&lt;code>funder: [
{
name: &amp;#34;Wellcome Trust&amp;#34;,
award: [&amp;#34;10.35802/108758&amp;#34;],
doi-asserted-by: &amp;#34;publisher&amp;#34;,
DOI: &amp;#34;10.13039/100010269&amp;#34;
}, ...
]
&lt;/code>&lt;/pre>&lt;p>This may not be 100% correct from the schema perspective, but it is very useful when one is interested in linking grants to research outputs!&lt;/p>
&lt;p>But those cases are extremely rare outliers. For the vast majority of the outputs, grant DOIs are not present in the metadata. This means that, just like in the case of bibliographic references, we have to use the metadata to match funding information to grants.&lt;/p>
&lt;p>Funding information is typically given as a pair: award number, funder information. Grants contain similar metadata. One might be tempted to use only the award number for linking, as in some cases it can look like a grant identifier.&lt;/p>
&lt;p>Let&amp;rsquo;s consider an example. We want to find all papers funded by grant &lt;a href="https://api.crossref.org/works/10.37807/gbmf7622" target="_blank">10.37807/gbmf7622&lt;/a>. The award number is &lt;code>GBMF7622&lt;/code>. A simple approach might be to search for items with this award number in Crossref&amp;rsquo;s REST API, which returns 12 results&lt;sup id="fnref:2">&lt;a href="#fn:2" class="footnote-ref" role="doc-noteref">2&lt;/a>&lt;/sup>. However, one of the resulting items is the grant itself&lt;sup id="fnref:3">&lt;a href="#fn:3" class="footnote-ref" role="doc-noteref">3&lt;/a>&lt;/sup>. So excluding that, it seems like there are 12-1=11 research outputs funded by this grant.&lt;/p>
&lt;p>Simple and easy, right? Well, think again.&lt;/p>
&lt;h3 id="problem-2-award-numbers-are-not-unique">Problem #2: award numbers are not unique&lt;/h3>
&lt;p>Let&amp;rsquo;s look at another example grant: &lt;a href="https://api.crossref.org/works/10.46936/10.25585/60000600" target="_blank">10.25585/60000600&lt;/a>. Its award number is &lt;code>2817&lt;/code> and the funder is the &lt;a href="https://api.crossref.org/funders/10.13039/100000015" target="_blank">US Department of Energy&lt;/a>.&lt;/p>
&lt;p>When we search for this award we get 10 results&lt;sup id="fnref:4">&lt;a href="#fn:4" class="footnote-ref" role="doc-noteref">4&lt;/a>&lt;/sup>. Like before, one of them is our grant. After examining the remaining 9 we will see that:&lt;/p>
&lt;ul>
&lt;li>3 items have been funded by the &lt;a href="https://api.crossref.org/funders/10.13039/100015911" target="_blank">Joint Genome Institute&lt;/a>, which according to the Funder Registry has been incorporated into &lt;a href="https://api.crossref.org/funders/10.13039/100006151" target="_blank">Basic Energy Sciences&lt;/a>, which is a descendant of the &lt;a href="https://api.crossref.org/funders/10.13039/100000015" target="_blank">US Department of Energy&lt;/a>&lt;/li>
&lt;li>2 items have been funded by &lt;a href="https://api.crossref.org/funders/10.13039/100001819" target="_blank">International Rett Syndrome Foundation&lt;/a> from the US&lt;/li>
&lt;li>2 items have been funded by &lt;a href="https://api.crossref.org/funders/10.13039/501100003074" target="_blank">Agencia Nacional de Promoción Científica y Tecnológica&lt;/a> from Argentina&lt;/li>
&lt;li>1 item has been funded by &lt;a href="https://api.crossref.org/funders/10.13039/501100007113" target="_blank">Arak University of Medical Sciences&lt;/a> from Iran&lt;/li>
&lt;li>1 item has been funded by &lt;a href="https://api.crossref.org/funders/10.13039/501100004883" target="_blank">Shahrekord University&lt;/a> also from Iran&lt;/li>
&lt;/ul>
&lt;p>So among only 9 items mentioning the same award number we have in fact 5 different grants. Our input grant should probably be linked only to the three items mentioning Joint Genome Institute. The main problem illustrated here is that the award numbers are not globally unique, and thus should not be treated like identifiers.&lt;/p>
&lt;p>Indeed, within 38,326 grants registered so far, we have 37,608 distinct award numbers, and among those, there are 716 award numbers, each of which appears in multiple grants. This issue comes in two flavours: conflicts between and within funders.&lt;/p>
&lt;h4 id="between-funder-award-number-conflicts">Between-funder award number conflicts&lt;/h4>
&lt;p>A conflict between funders is when more than one funder uses the same award number for one of their grants. This is expected - award numbers are assigned by funders internally and are not designed to be a globally unique identifier.&lt;/p>
&lt;p>Out of 716 award numbers that appear in multiple grants, 12 are numbers that appear in grants of different funders. For example, there are two grants with the award number &lt;code>105626&lt;/code>:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://api.crossref.org/works/10.48050/pc.gr.10753" target="_blank">Systemic MFG-E8 Blockade as Melanoma Therapy&lt;/a> funded by Melanoma Research Alliance&lt;/li>
&lt;li>&lt;a href="https://api.crossref.org/works/10.35802/105626" target="_blank">Institutional Strategic Support Fund Phase2 FY2014/16&lt;/a> funded by Wellcome Trust&lt;/li>
&lt;/ul>
&lt;p>Because of those conflicts, we cannot simply rely on the award numbers for linking grants to research outputs. Instead, we have to use more information to be sure that the links are correctly established.&lt;/p>
&lt;h4 id="within-funder-award-number-conflicts">Within-funder award number conflicts&lt;/h4>
&lt;p>To our big surprise, it turns out that the majority of the award number conflicts happen not between different funders, but within the grants of a single funder. Out of 716 award numbers that appear in multiple grants, 704 appear in multiple grants of a single funder only. Such situations are not expected and could indicate an error or some other systematic issue with the data.&lt;/p>
&lt;p>Interestingly, out of those 704 award numbers, 700 are associated with the US Department of Energy. We&amp;rsquo;ve followed up with them in order to clarify or resolve this. The US Department of Energy pointed out a fundamental issue with the data model: currently a grant deposited with Crossref has to have at least one funder DOI, and no other way of identifying the associated organisation is allowed. At the same time, some of the facilities that should appear in their grants&amp;rsquo; metadata are not funders at all and thus cannot be identified by a funder DOI. In the future, they plan to identify those facilities in their grant metadata by providing ROR IDs.&lt;/p>
&lt;p>Because of within-funder award number conflicts, in some cases it might be difficult to distinguish between two grants with the same award number and funder. A solution might be to use additional information or simply not accept any links if a research output cannot be reliably linked to one grant only.&lt;/p>
&lt;h2 id="our-linking-approach">Our linking approach&lt;/h2>
&lt;p>Based on all those observations, we adopted the following approach:&lt;/p>
&lt;ol>
&lt;li>We iterated over all registered grants, for each we performed the following steps:
&lt;ul>
&lt;li>We used &lt;code>award.number:&amp;lt;grant DOI&amp;gt;&lt;/code> filter in the REST API to find all items listing a given grant&amp;rsquo;s DOI as the award number. Because this is based on the grant&amp;rsquo;s persistent identifier, we recorded those links without any further verification.&lt;/li>
&lt;li>We used the &lt;code>award.number:&amp;lt;grant award number&amp;gt;&lt;/code> filter in the REST API to find all items listing grant&amp;rsquo;s award number in the funding information. Each resulting item was then verified by comparing the funder information in the item to the funder information in the grant. We recorded the link between the grant and the candidate item only if the verification succeeded.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>In the final step, we examined all recorded links to make sure that each pair (research output, award number) is linked to at most one grant. Links violating this rule were flagged as not reliable.&lt;/li>
&lt;/ol>
&lt;p>We used different techniques to verify the funder information between the research output (item) and the grant, depending on what information is available. Grants always have the funder DOI. The item, however, can have the funder DOI, the funder name, or both.&lt;/p>
&lt;p>If the funder DOI was available on both sides, the following rules were used for the funder verification (ordered by decreasing confidence):&lt;/p>
&lt;ul>
&lt;li>Both the item and the grant contain the same funder DOI, for example, &lt;a href="https://api.crossref.org/works/10.35802/089928" target="_blank">10.35802/089928&lt;/a> and &lt;a href="https://api.crossref.org/works/10.1242/jcs.196758" target="_blank">10.1242/jcs.196758&lt;/a>&lt;/li>
&lt;li>The funder in the item replaced or was replaced by the funder in the grant (according to the Funder Registry), for example, &lt;a href="https://api.crossref.org/works/10.35802/104848" target="_blank">10.35802/104848&lt;/a> and &lt;a href="https://api.crossref.org/works/10.1136/medethics-2020-106821" target="_blank">10.1136/medethics-2020-106821&lt;/a>&lt;/li>
&lt;li>The funder in the paper is an ancestor or a descendant of the funder in the grant (according to the Funder Registry), for example, &lt;a href="https://api.crossref.org/works/10.46936/sthm.proj.2010.40084/60004575" target="_blank">10.46936/sthm.proj.2010.40084/60004575&lt;/a> and &lt;a href="https://api.crossref.org/works/10.1016/j.heliyon.2018.e00629" target="_blank">10.1016/j.heliyon.2018.e00629&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>If the funder DOI was not available in the item, the following rules were used for the funder verification (ordered by decreasing confidence):&lt;/p>
&lt;ul>
&lt;li>The funder name in the paper is the same (ignoring the case) as the funder name in the grant, for example, &lt;a href="https://api.crossref.org/works/10.35802/110166" target="_blank">10.35802/110166&lt;/a> and &lt;a href="https://api.crossref.org/works/10.12688/wellcomeopenres.14645.4" target="_blank">10.12688/wellcomeopenres.14645.4&lt;/a>&lt;/li>
&lt;li>The funder name in the item is the same (ignoring the case) as the name of the funder that replaced/was replaced by the funder in the grant, for example, &lt;a href="https://api.crossref.org/works/10.35802/206194" target="_blank">10.35802/206194&lt;/a> and &lt;a href="https://api.crossref.org/works/10.1172/jci.insight.96381" target="_blank">10.1172/jci.insight.96381&lt;/a>&lt;/li>
&lt;li>The funder name in the item is the same (ignoring the case) as the name of the ancestor/descendant of the funder in the grant, for example, &lt;a href="https://api.crossref.org/works/10.46936/cpbl.proj.2001.2191/60002922" target="_blank">10.46936/cpbl.proj.2001.2191/60002922&lt;/a> and &lt;a href="https://api.crossref.org/works/10.1109/tkde.2016.2628180" target="_blank">10.1109/tkde.2016.2628180&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Note that this is in fact very similar to &lt;a href="https://doi.org/10.64000/pdm9z-20m09" target="_blank">our reference matching approach&lt;/a>. In both cases, first we search for candidate items, and then verify the candidates by comparing the metadata. The actual metadata used for the verification varies, because different information is typically given in the bibliographic reference and the funding information.&lt;/p>
&lt;h2 id="what-we-found">What we found&lt;/h2>
&lt;p>This procedure applied to the entire Crossref dataset resulted in 20,846 links between research outputs and grants&lt;sup id="fnref:5">&lt;a href="#fn:5" class="footnote-ref" role="doc-noteref">5&lt;/a>&lt;/sup>. Of those, 12 were flagged as unreliable, because they involved more than one grant linked to the same item and award number. The rest of this section focuses on the remaining 20,834 links.&lt;/p>
&lt;p>Within the 20,834 links, we have 17,082 research outputs and 3,858 (10.1%) grants.&lt;/p>
&lt;p>Here is the breakdown into the verification approaches used:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Verification&lt;/th>
&lt;th style="text-align: right">#links&lt;/th>
&lt;th style="text-align: right">%links&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>The item contains grant DOI - no verification&lt;/td>
&lt;td style="text-align: right">6&lt;/td>
&lt;td style="text-align: right">&amp;lt;0.1%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Funder DOIs are the same&lt;/td>
&lt;td style="text-align: right">8,364&lt;/td>
&lt;td style="text-align: right">40.1%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Funder DOIs are related with a replaced/was replaced by relationship&lt;/td>
&lt;td style="text-align: right">3,704&lt;/td>
&lt;td style="text-align: right">17.8%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Funder DOIs are related with an ancestor/descendant relationship&lt;/td>
&lt;td style="text-align: right">7,718&lt;/td>
&lt;td style="text-align: right">37.0%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Funder names are the same&lt;/td>
&lt;td style="text-align: right">591&lt;/td>
&lt;td style="text-align: right">2.8%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>The name of the funder in the item is the same as the name of the funder that replaced/was replaced by the funder in the grant&lt;/td>
&lt;td style="text-align: right">364&lt;/td>
&lt;td style="text-align: right">1.7%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>The name of the funder in the item is the same as the name of the ancestor or descendant of the funder in the grant&lt;/td>
&lt;td style="text-align: right">87&lt;/td>
&lt;td style="text-align: right">0.4%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>In most cases, just using the funder DOIs for the verification was enough. Verifying by the funder name added 1,042 links, which is 5% of all links.&lt;/p>
&lt;p>And here are statistics for individual funders. Only funders with at least 10 deposited grants are listed in the table. The table shows the number of detected links, the number of distinct research outputs linked, the total number of outputs mentioning the given funder DOI, and the number of grants.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Funder&lt;/th>
&lt;th style="text-align: right">#links&lt;/th>
&lt;th style="text-align: right">#linked research outputs&lt;/th>
&lt;th style="text-align: right">#total outputs with funder DOI&lt;/th>
&lt;th style="text-align: right">#grants&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Japan Science and Technology Agency&lt;/td>
&lt;td style="text-align: right">11,922&lt;/td>
&lt;td style="text-align: right">10,411&lt;/td>
&lt;td style="text-align: right">25,779&lt;/td>
&lt;td style="text-align: right">9,383&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Wellcome Trust (including both funder DOIs 10.13039/100004440 and 10.13039/100010269)&lt;/td>
&lt;td style="text-align: right">8,001&lt;/td>
&lt;td style="text-align: right">6,246&lt;/td>
&lt;td style="text-align: right">49,492&lt;/td>
&lt;td style="text-align: right">17,534&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>James S. McDonnell Foundation&lt;/td>
&lt;td style="text-align: right">463&lt;/td>
&lt;td style="text-align: right">457&lt;/td>
&lt;td style="text-align: right">2,534&lt;/td>
&lt;td style="text-align: right">557&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Melanoma Research Alliance&lt;/td>
&lt;td style="text-align: right">152&lt;/td>
&lt;td style="text-align: right">150&lt;/td>
&lt;td style="text-align: right">894&lt;/td>
&lt;td style="text-align: right">392&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Asia-Pacific Network for Global Change Research&lt;/td>
&lt;td style="text-align: right">100&lt;/td>
&lt;td style="text-align: right">100&lt;/td>
&lt;td style="text-align: right">838&lt;/td>
&lt;td style="text-align: right">539&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>ALS Association&lt;/td>
&lt;td style="text-align: right">84&lt;/td>
&lt;td style="text-align: right">78&lt;/td>
&lt;td style="text-align: right">909&lt;/td>
&lt;td style="text-align: right">434&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>U.S. Department of Energy&lt;/td>
&lt;td style="text-align: right">56&lt;/td>
&lt;td style="text-align: right">52&lt;/td>
&lt;td style="text-align: right">97,482&lt;/td>
&lt;td style="text-align: right">8,462&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Gordon and Betty Moore Foundation&lt;/td>
&lt;td style="text-align: right">51&lt;/td>
&lt;td style="text-align: right">50&lt;/td>
&lt;td style="text-align: right">5,928&lt;/td>
&lt;td style="text-align: right">94&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Cancer Society&lt;/td>
&lt;td style="text-align: right">3&lt;/td>
&lt;td style="text-align: right">3&lt;/td>
&lt;td style="text-align: right">7,276&lt;/td>
&lt;td style="text-align: right">107&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Children&amp;rsquo;s Tumor Foundation&lt;/td>
&lt;td style="text-align: right">1&lt;/td>
&lt;td style="text-align: right">1&lt;/td>
&lt;td style="text-align: right">759&lt;/td>
&lt;td style="text-align: right">630&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Parkinson Disease Association&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">181&lt;/td>
&lt;td style="text-align: right">12&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Neurofibromatosis Therapeutic Acceleration Program&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">101&lt;/td>
&lt;td style="text-align: right">68&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>International Anesthesia Research Society&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">94&lt;/td>
&lt;td style="text-align: right">34&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Australian National Data Service&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">0&lt;/td>
&lt;td style="text-align: right">92&lt;/td>
&lt;td style="text-align: right">67&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Note that the fourth column reports the total number of outputs registered with Crossref and mentioning the given funder DOI, including grants, journal papers and all other record types.&lt;/p>
&lt;p>It is interesting to compare the number of linked research outputs for a given funder with the total number of research outputs mentioning a given funder DOI. In general, for a funder that registers grants, the more research outputs mentioning this funder, the more links we should be able to find.&lt;/p>
&lt;p>And for some funders (Japan Science and Technology Agency, Melanoma Research Alliance, Asia-Pacific Network for Global Change Research, Wellcome Trust, James S. McDonnell Foundation), the number of linked outputs is indeed high, as compared with how many outputs mention the funder in the first place. This suggests our procedure was quite successful in linking outputs funded by these funders, meaning that in general the metadata in their grants and the funding information in the research outputs match.&lt;/p>
&lt;p>On the other hand, we have a few funders for which we managed to link only a very small fraction of research outputs. There are several potential explanations here. A simple one is that not all relevant grants have been deposited yet. For example, a funder might be registering new grants only, whereas many research outputs mention older, not yet registered grants. It is also possible that there are systematic differences in how the publishers deposit the funding information in articles and other outputs, and how it is given in grants. Such differences might prevent us from establishing links, contributing to the overall low percentage of linked grants.&lt;/p>
&lt;h3 id="the-importance-of-being-precise">The importance of being precise&lt;/h3>
&lt;p>Here are some examples of existing links that should&amp;rsquo;ve been found, but were not.&lt;/p>
&lt;p>The award number in grant &lt;a href="https://api.crossref.org/works/10.48105/pc.gr.93156" target="_blank">10.48105/pc.gr.93156&lt;/a> is &lt;code>CTF-2020-01-004&lt;/code>. This article: &lt;a href="https://api.crossref.org/works/10.3390/ijms22094716" target="_blank">10.3390/ijms22094716&lt;/a> mentions award number &lt;code>2020‐01‐004&lt;/code> and the same funder (Children&amp;rsquo;s Tumor Foundation). It is very probable that this is the same grant, but our procedure expects exactly the same award number, and so the two were not linked.&lt;/p>
&lt;p>Paper &lt;a href="https://api.crossref.org/works/10.1128/genomea.00159-18" target="_blank">10.1128/genomea.00159-18&lt;/a> contains award number &lt;code>1931&lt;/code> and U.S. Department of Energy as the funder. There are two grants with the same award number and funder: &lt;a href="https://api.crossref.org/works/10.46936/10.25585/60001053" target="_blank">10.46936/10.25585/60001053&lt;/a> and &lt;a href="https://api.crossref.org/works/10.46936/genr.proj.2000.1931/60002530" target="_blank">10.46936/genr.proj.2000.1931/60002530&lt;/a>. It is difficult to choose between them, and these links were marked as unreliable.&lt;/p>
&lt;p>These examples could be signs of systematic errors and/or discrepancies that effectively prevent linking of those funders&amp;rsquo; grants.&lt;/p>
&lt;h2 id="whats-next">What&amp;rsquo;s next&lt;/h2>
&lt;p>In problems such as linking grants to research outputs, there are typically two key ingredients of the success, which at the same time are the main areas of improvement: the quality of the metadata, and the strength of the linking approach.&lt;/p>
&lt;p>The metadata could be improved greatly by addressing existing discrepancies between grants and research outputs and allowing (and encouraging!) the publishers to provide grant DOIs in the funding information. Thankfully, we are not alone in those efforts. Both this recent &lt;a href="https://doi.org/10.54900/rgrtzxx-nj4c28m-cef53" target="_blank">Upstream blog&lt;/a> from Alexis-Michel Mugabushaka, and this &lt;a href="https://scholarlykitchen.sspnet.org/2022/03/07/accelerating-open-research-a-multi-stakeholder-discussion/" target="_blank">Scholarly Kitchen post&lt;/a> from Robert Harrington call for the development and adoption of grant DOIs in scholarly metadata.&lt;/p>
&lt;p>In terms of the linking approach, there are some ideas that could be used to further improve the linking accuracy and completeness:&lt;/p>
&lt;ul>
&lt;li>The verification by funder name could be fuzzy and allow for minor variations like typos or additional words.&lt;/li>
&lt;li>Apart from &lt;em>replaced/replaced by&lt;/em> and &lt;em>ancestor/descendant&lt;/em>, there are other relationships between funders in the Funder Registry: &lt;em>continuation of&lt;/em>, &lt;em>incorporates/incorporated into&lt;/em>, &lt;em>merged with&lt;/em>, &lt;em>renamed as&lt;/em>, &lt;em>split into/split from&lt;/em>. We could also consider those relationships during the funder validation.&lt;/li>
&lt;li>Apart from the funder information, there is other information that could be potentially used for verification, for example, the names of the authors and the investigators, the domain, or keywords.&lt;/li>
&lt;/ul>
&lt;p>If you have any questions, do &lt;a href="mailto:feedback@crossref.org">get in touch&lt;/a>!&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>All numbers are as of March 8, 2022&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:2">
&lt;p>&lt;a href="https://api.crossref.org/works?filter=award.number:gbmf7622" target="_blank">https://api.crossref.org/works?filter=award.number:gbmf7622&lt;/a>&amp;#160;&lt;a href="#fnref:2" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:3">
&lt;p>&lt;a href="https://api.crossref.org/works?filter=award.number:gbmf7622,type:grant" target="_blank">https://api.crossref.org/works?filter=award.number:gbmf7622,type:grant&lt;/a>&amp;#160;&lt;a href="#fnref:3" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:4">
&lt;p>&lt;a href="https://api.crossref.org/works?filter=award.number:2817" target="_blank">https://api.crossref.org/works?filter=award.number:2817&lt;/a>&amp;#160;&lt;a href="#fnref:4" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;li id="fn:5">
&lt;p>The code and data available here: &lt;a href="https://gitlab.com/crossref/labs_data_analyses/-/tree/master/analyses/22-01-26-grants-matching" target="_blank">https://gitlab.com/crossref/labs_data_analyses/-/tree/master/analyses/22-01-26-grants-matching&lt;/a>&amp;#160;&lt;a href="#fnref:5" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>A Registry of Editorial Boards - a new trust signal for scholarly communications?</title><link>https://www.crossref.org/blog/a-registry-of-editorial-boards-a-new-trust-signal-for-scholarly-communications/</link><pubDate>Wed, 09 Mar 2022 00:00:00 +0000</pubDate><author>Fabienne Michaud</author><guid>https://www.crossref.org/blog/a-registry-of-editorial-boards-a-new-trust-signal-for-scholarly-communications/</guid><description>&lt;h2 id="background">Background&lt;/h2>
&lt;p>Perhaps, like us, you&amp;rsquo;ve noticed that it is not always easy to find information on who is on a journal&amp;rsquo;s editorial board and, when you do, it is often unclear when it was last updated. The editorial board details might be displayed in multiple places (such as the publisher&amp;rsquo;s website and the platform where the content is hosted) which may or may not be in sync and retrieving this information for any kind of analysis always requires manually checking and exporting the data from a website (as illustrated by the &lt;a href="https://osf.io/preprints/socarxiv/jvzq7" target="_blank">Open Editors research&lt;/a> and its &lt;a href="https://openeditors.ooir.org" target="_blank">dataset&lt;/a>).&lt;/p>
&lt;p>For well-established as well as early career researchers, membership of an editorial board demonstrates their contribution to their community, brings prestige, improves (or maintains) their professional profile and often increases their chances of being published.&lt;/p>
&lt;p>Whilst most journal websites only give the names of the editors, others possibly add a country, some include affiliations, very few link to a professional profile, an ORCID ID. Even when it&amp;rsquo;s clear when the editorial board details were updated, it&amp;rsquo;s hardly ever possible to find past editorial boards information and almost none lists declarations of competing interest.&lt;/p>
&lt;p>We hear of instances where a researcher&amp;rsquo;s name has been listed on the board of a journal without their knowledge or agreement, potentially to deceive other researchers into submitting their manuscripts. Regular reports of &lt;a href="https://www.nature.com/articles/d41586-021-03035-y" target="_blank">impersonation&lt;/a>, &lt;a href="https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3001133" target="_blank">nepotism&lt;/a>, &lt;a href="https://arxiv.org/pdf/2112.13322.pdf" target="_blank">collusion&lt;/a> and &lt;a href="https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0127362" target="_blank">conflicts of interest&lt;/a> have become a cause for concern.&lt;/p>
&lt;p>Similarly, recent studies on &lt;a href="https://link.springer.com/article/10.1007%2Fs12630-019-01378-9" target="_blank">gender representation&lt;/a> and &lt;a href="https://www.biorxiv.org/content/10.1101/2021.02.15.431321v1" target="_blank">gender and geographical disparity&lt;/a> on editorial boards have highlighted the need to do better in this area and provide trusted, reliable and coherent information on editorial board members in order to add transparency, prevent unethical behaviour, maintain trust, promote and support research integrity.&lt;/p>
&lt;h2 id="registry-of-editorial-boards">Registry of Editorial Boards&lt;/h2>
&lt;p>We are proposing the creation of some form of Registry of Editorial Boards to encourage best practice around editorial boards&amp;rsquo; information and governance that can easily be accessed and used by the community.&lt;/p>
&lt;h3 id="what-we-have-in-mind">What we have in mind&lt;/h3>
&lt;p>A Registry of Editorial Boards could be a new trust-signal for Crossref members and details would be included on a member&amp;rsquo;s Participation Report.&lt;/p>
&lt;p>Crossref members would register and maintain this information for their journal titles in a similar way as they currently manage their metadata. Only the owner of the title, or their trusted service provider, would be able to update it.  Editors would be linked by ORCID iD and ROR and Crossref would use &amp;lsquo;autoupdate&amp;rsquo; to push editorship information to ORCID profiles, saving researchers time. The information would be made available via Crossref&amp;rsquo;s API.&lt;/p>
&lt;p>This new service would introduce more transparency and automation to the editorial process and connect content platforms (i.e. peer review management systems, publishers&amp;rsquo; websites, ORCID and other author register systems, ROR, bibliographic databases, etc.) and make available current and historical information on editorial boards including metadata on the editorial boards&amp;rsquo; full affiliations.&lt;/p>
&lt;h3 id="the-benefits-for-the-community">The benefits for the community&lt;/h3>
&lt;p>The benefits would be wide-ranging for the different stakeholders in the scholarly communications community, from publishers, researchers, institutions, funders, bibliometricians to librarians including:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>providing those involved in the peer review process and research ethics a single, authoritative and up-to-date resource on editorial boards&lt;/p>
&lt;/li>
&lt;li>
&lt;p>reducing fraudulent claims to be or to have been on an editorial board of a publication in order to be published or publish others&lt;/p>
&lt;/li>
&lt;li>
&lt;p>connecting and automating editorship role updates with e.g. ORCID, ROR, etc.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>generating a detailed analysis of the publication practices of editorial board members and their close contacts &lt;/p>
&lt;/li>
&lt;li>
&lt;p>assessing any relationships between authors, reviewers and editorial board members for conflict of interest, etc.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>supporting researchers responding to a request to join an editorial board, making proactive approaches to a journal or wanting to ensure that an editorial board is representative of its community and assess its levels of diversity and inclusivity&lt;/p>
&lt;/li>
&lt;li>
&lt;p>providing increased visibility to researchers, particularly to early career researchers&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="your-feedback">Your feedback&lt;/h2>
&lt;p>Before we progress further, we would like to fully understand what the needs of the community are and whether members would be willing and have the capacity to participate and contribute regularly in registering and maintaining details of their editorial boards.&lt;/p>
&lt;p>✏️  Please let us know what your thoughts and experience are with editorial boards by completing this brief &lt;a href="https://forms.gle/UQpbsTgjQnEY43FT6" target="_blank">survey&lt;/a> by 31 March 2022.&lt;/p></description></item><item><title>POSI fan tutte</title><link>https://www.crossref.org/blog/posi-fan-tutte/</link><pubDate>Tue, 08 Mar 2022 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/posi-fan-tutte/</guid><description>&lt;p>Just over a year ago, Crossref &lt;a href="https://doi.org/10.64000/hzemx-j7n79" target="_blank">announced&lt;/a> that our board had adopted the &lt;a href="http://openscholarlyinfrastructure.org/" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&lt;/a>.&lt;/p>
&lt;p>It was a well-timed announcement, as 2021 yet again showed just how dangerous it is for us to assume that the infrastructure systems we depend on for scholarly research will not disappear altogether or adopt a radically different focus. We adopted POSI to ensure that Crossref would not meet the same fate.&lt;/p>
&lt;p>POSI proposes three areas that an Open Infrastructure organisation can address to garner the trust of the broader scholarly community: accountability (governance), funding (sustainability), and protection of community interests (insurance). POSI also proposes a set of concrete commitments that an organisation can make to build community trust in each area. There are 16 such commitments.&lt;/p>
&lt;p>In our announcement of Crossref’s adoption of POSI, we made two critical points:&lt;/p>
&lt;ul>
&lt;li>One doesn’t have to meet all the commitments of POSI already to adopt it. For one thing, this would make it impossible for new organisations to adopt POSI. So instead, we should view the adoption of the POSI principles as a “statement of intent” against which stakeholders can measure an organisation&amp;rsquo;s progress.&lt;/li>
&lt;li>That, conversely, meeting all of the POSI principles doesn’t mean an organisation can relax. It is always possible for an organisation to regress on a particular commitment. For example, an emergency expenditure might mean that the organisation no longer maintains a 12-month contingency fund and therefore has to replenish it.&lt;/li>
&lt;/ul>
&lt;p>With these two points made, we ended our announcement with a candid self-audit against the principles. We concluded that Crossref was already entirely or partially meeting the requirements of 15 of the 16 POSI commitments. And adopting the 16th commitment would just formalize a direction Crossref had already been heading toward for several years. We also said that we would update our self-audit regularly.&lt;/p>
&lt;p>But before we continue with the Crossref POSI audit update, we should talk about the immediate aftermath of our adopting the principles.&lt;/p>
&lt;p>Since Crossref adopted POSI, &lt;a href="https://openscholarlyinfrastructure.org/posse/" target="_blank">nine other organisations have made the same commitment&lt;/a> and conducted similar self-audits. We affectionately call them the “POSI Posse”.&lt;/p>
&lt;ul>
&lt;li>Dryad&lt;/li>
&lt;li>ROR&lt;/li>
&lt;li>JOSS&lt;/li>
&lt;li>OurResearch&lt;/li>
&lt;li>OpenCitations&lt;/li>
&lt;li>DataCite&lt;/li>
&lt;li>OA Switchboard&lt;/li>
&lt;li>Sciety&lt;/li>
&lt;li>Europe PMC&lt;/li>
&lt;/ul>
&lt;p>These organisations represent a critical part of the hidden infrastructure that scholarly research depends on every day. By committing to POSI, they are helping ensure their accountability to the research community. They are also emphasizing that stakeholders must participate in the governance and stewardship of organisations running that infrastructure.&lt;/p>
&lt;p>But perhaps most importantly- these ten organisations that have publicly committed to adopting POSI will not suddenly disappear or change priorities without giving the community time to react and, if need be, intervene.&lt;/p>
&lt;p>There are also more quotidian advantages to these organisations adopting POSI. Adopting the principles makes it easier for the respective organisations to collaborate to make research infrastructure more effective and efficient. The foundation of effective collaboration is trust. And, so by agreeing that we share basic principles of operation, we virtually eliminate a whole slew of negotiations that typically need to occur before two organisations trust each other enough to collaborate closely on projects.&lt;/p>
&lt;p>One of &lt;a href="https://www.crossref.org/strategy/#collaborate-and-partner">Crossref’s strategic priorities&lt;/a> is to “collaborate and partner” with other organisations on improving our open scholarly infrastructure. And the easiest way to collaborate with us is to adhere to the same principles. So we look forward to more scholarly infrastructure organisations adopting POSI in 2022 so that, together, we can make research infrastructure work better.&lt;/p>
&lt;p>Establishing this level of trust has already paid significant dividends with the Research Organisation Registry (ROR) - a relatively new infrastructure project founded jointly by DataCite, CDL, and Crossref.&lt;/p>
&lt;p>Having nine organisations adopt POSI so soon after our announcement was a wonderful feeling. It is hard for us to convey how happy we are about this without gushing.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/images/staff/geoff_720px.jpg">Here is a picture of me gushing.&lt;/a>&lt;/p>
&lt;p>But now we have some outstanding business to update our self-audit.&lt;/p>
&lt;p>This post is the first of our regular updates on our progress (or regress) on meeting the POSI principles.&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>We didn’t regress on any commitment. We’ve improved a little bit where we were not meeting the POSI principles, but we have still not met all our POSI commitments.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Area&lt;/th>
&lt;th>Commitment&lt;/th>
&lt;th style="text-align: center">2020&lt;/th>
&lt;th style="text-align: center">2021&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;strong>Governance&lt;/strong>&lt;/td>
&lt;td>Coverage across the research enterprise&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Non-discriminatory membership&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Transparent operations&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Cannot lobby&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Living will&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Formal incentives to fulfill mission &amp;amp; wind-down&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Stakeholder-governed&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-red'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Sustainability&lt;/strong>&lt;/td>
&lt;td>Time-limited funds are used only for time-limited activities&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Goal to generate surplus&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Goal to create a contingency fund to support operations for 12 months&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Mission-consistent revenue generation&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Revenue based on services, not data&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Insurance&lt;/strong>&lt;/td>
&lt;td>Available data (within constraints of privacy laws)&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Patent non-assertion&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Open source&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>Open data (within constraints of privacy laws)&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-yellow'>&lt;/i>
&lt;/td>
&lt;td style="text-align: center">&lt;i class='fa fas fa-circle font-small font-crossref-green'>&lt;/i>
&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;h2 id="details">Details&lt;/h2>
&lt;h3 id="stakeholder-governance-moves-from-red-to-yellow">Stakeholder governance moves from red to yellow&lt;/h3>
&lt;p>Our only red mark in our POSI self-audit was against the principle of stakeholder governance. Our board did not yet reflect our members&amp;rsquo; diversity or the broader stakeholder community. In particular, as funders have become more central to shaping the scholarly communications landscape, it seemed important that Crossref have funder representation in our governance.&lt;/p>
&lt;p>So this year, the Crossref nominations committee was charged with proposing a board slate that addressed some of our representational gaps. They did this, and as a direct result, two of the members elected to next year&amp;rsquo;s board were a funder (Melanoma Research Alliance) and a significant preprint platform (Center for Open Science).&lt;/p>
&lt;p>These new additions to our board mark a significant improvement in stakeholder governance, but we can do more. Researchers and research institutions are also substantial Crossref stakeholders. We need to have a better representation of their concerns.&lt;/p>
&lt;p>Also, there are still members of the scholarly communications community who depend on Crossref but cannot afford to join it because our fees are too high for them. Since membership is a prerequisite to participation in Crossref governance, we are also placing emphasis on figuring out how to further extend Crossref membership to those who still cannot afford it, through programs like Sponsorship, country-level journal gap analyses work, and a forthcoming fee review. So this is a source of stakeholder governance inequity that may be best handled by our membership &amp;amp; fees committee rather than our nominations committee.&lt;/p>
&lt;p>In short, we’ve made progress on our stakeholder governance commitment. Still, we need to do more- so we are updating our adherence to the POSI stakeholder governance principle from red to yellow.&lt;/p>
&lt;p>Another place where we have improved things is under the banner of “transparency.” But here, we see one of the shortcomings of the ‘traffic light” representation used in the self-audit. The degree that one meets a commitment falls along a gradient. And this gradient cannot be represented accurately in the ternary classification of red/yellow/green. So while last year we marked ourselves as “green” under the commitment to transparency, over the past year we have become &lt;em>greener.&lt;/em> We did this by creating sections on our website that provide further detail on our governance and finances- even including the 990 forms that are required by US tax authorities for non-profits when they submit their taxes. So what do we do here? Make it neon-green? Make it &lt;a href="https://en.wikipedia.org/wiki/Blink_element" target="_blank">blink&lt;/a>?&lt;/p>
&lt;h3 id="sustainability-moves-from-yellow-to-chartreuse-stays-yellow">Sustainability &lt;del>moves from yellow to chartreuse&lt;/del> stays yellow&lt;/h3>
&lt;p>In our first self-audit, we had several yellow marks- places where we were doing OK, but where we needed to make improvements.&lt;/p>
&lt;p>The first yellow mark involved one of the principles of “sustainability,” which stipulates that an organisation should have a goal to create a contingency fund to support operations for 12 months. At the time, we had a contingency fund of 9 months. The board instructed the finance committee to develop a plan for meeting the new 12-month goal. To do this, the board decided to create three funds. The first is fairly flexible and holds operating expenses for three months. Staff leadership can use this fund at their discretion to manage cash flow issues and support budgeted expenses. The second fund is the fund that holds operating expenses for 12 months. This fund is board-restricted and is only meant to be used in emergencies to help with substantial changes in our financial position or to, in extremis, fund an orderly wind-down of Crossref’s operations. Furthermore, the board’s investment committee established guidelines for investing our operating and investment surpluses. Any surpluses are first applied to supporting the 3-month fund. Once that funding goal is met, any surpluses are applied to the 12-month fund. And once both the 3-month and 12-month funding goals are met, any further surpluses will be put into another board-restricted fund that can be used to fund new investments or new Crossref initiatives.&lt;/p>
&lt;p>But again, the simple yellow mark against this item does not capture this level of detail. We only get to turn it green once we have the 12-month fund in place.&lt;/p>
&lt;p>It looks like we will meet the goal in 2022, but it is hard to say exactly when. If we did shades of color- we might make it chartreuse. But nobody wants to see &lt;a href="https://en.wikipedia.org/wiki/Chartreuse_%28color%29" target="_blank">chartreuse&lt;/a>. So while we have made significant progress here, our commitment to maintaining a 12-month contingency fund remains yellow until we have reached our goal.&lt;/p>
&lt;h3 id="patent-non-assertion-stays-yellow">Patent non-assertion stays yellow&lt;/h3>
&lt;p>The second yellow mark was against our publishing a patent-non-assertion statement. This feels like a missed opportunity because it will be straightforward for us to do, but we have not yet done it. We have never applied for patents, and we don’t intend to start. In short, nothing is blocking us from doing this other than our natural reluctance to have to draft anything that involves lawyers. Our lawyers are very nice people, but everything we have to draft with them makes our eyes glaze over. We need to get this done ASAP in 2022.&lt;/p>
&lt;h3 id="open-source-remains-yellow">Open source remains yellow&lt;/h3>
&lt;p>The third yellow mark makes me cringe because, as technical director, it is firmly in my bailiwick. We have committed to open-sourcing all of our code. In last year’s self-audit, I predicted that we should be able to open all of our code within 12 to 18 months. I was wrong. That means this commitment remains yellow. And what’s more- it is likely to remain yellow for a year or two. Let me try and explain why.&lt;/p>
&lt;p>First, I should note that all new services that we’ve written since 2007 have been released as open-source (under an MIT license). These include our REST API, Crossmark, Metadata Search, and Event Data. You can find all our open-source code on &lt;a href="https://gitlab.com/crossref/" target="_blank">Gitlab&lt;/a>.&lt;/p>
&lt;p>This leaves us with our “content system” with its legacy code, which handles content registration, OAI-PMH, OpenURL, and XML APIs. This code was originally developed for Crossref by a third party (who I won’t name because they are in no way to blame for our predicament). Crossref only took over the development of the code base internally ~ 2010. But the system has accumulated over twenty years of technical debt and includes many once-common engineering practices that are deprecated (to put it delicately). Additionally, the code is a labyrinth of dependencies on very old libraries under very old licenses.&lt;/p>
&lt;p>And although we have spent much of the past two years replacing critical parts of the system’s authentication and authorization code, I am certain that there remain swathes of code that, under scrutiny, would prove a security nightmare.&lt;/p>
&lt;p>Now we know that so-called “security through obscurity” is bad practice. Our legacy code base illustrates the point. We had credentials embedded in the code. We had backdoors and application-level root access. We had countless places where we didn’t sanitize input. But the code was private- and so it gave developers a false sense of confidence when they occasionally made these shortcuts in the interest of developing new features more quickly. And in those early days of hyper-growth, we often had to develop things very, very quickly. Technical debt, like any debt, is a tradeoff.&lt;/p>
&lt;p>As I said- we’ve cleaned a ton of this stuff up. For example, we’ve replaced our primary authentication system. But this experience has made us better appreciate just how difficult it would be to harden a system this old.&lt;/p>
&lt;p>And besides, we are already replacing it - albeit incrementally. We have been extracting and rewriting key components of the old system, and we plan to continue to extract and rewrite until there is nothing left of the old code. All this new code is, naturally, open-source. And it follows modern security practices.&lt;/p>
&lt;p>And so we face a difficult choice- do we try and fix code that is hard to fix and that we are replacing anyway- or do we just focus just on replacing the code and making sure the new, open-source code follows modern security best -practices? We’ve chosen to take the latter route. But it does mean this entry will have a yellow circle next to it for a few more years as we replace things.&lt;/p>
&lt;h3 id="open-data-moves-from-yellow-to-green">Open data moves from yellow to green&lt;/h3>
&lt;p>And this brings us to our final yellow mark- which was next to the principle of open data. The root of the problem is that what we colloquially call “Crossref metadata” is a mix of elements, some of which come from our members, some from third parties, and some from Crossref itself. These elements, in turn, each have different copyright implications.&lt;/p>
&lt;p>On top of this, Crossref has terms and conditions for its members and terms and conditions for specific services. These terms and conditions grant Crossref the right to do things with some classes of metadata and not do things with other classes of metadata - regardless of copyright.&lt;/p>
&lt;p>The net result is that users can freely use and redistribute any metadata they retrieve via our APIs or in our periodic public data files. But it also means we cannot just slap a CC0 waiver on all the data. Instead, we have to specify exactly what copyright and terms apply to each class of data. We’d never done this in a clear and accessible way, so some of our users were understandably concerned that maybe we were hedging or perhaps the reuse rights were unclear. But we are not hedging; they are clear. They just weren&amp;rsquo;t documented. &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/rest-api-metadata-license-information/">And now they are&lt;/a>. In human-readable form. And soon-to-be in machine-readable form. So we can move this from yellow to green.&lt;/p>
&lt;h2 id="reflections-on-the-year-since-our-adoption-of-posi">Reflections on the year since our adoption of POSI&lt;/h2>
&lt;p>When the Crossref board adopted POSI last year, frankly, a few of us were surprised. We never doubted Crossref’s direction as an open infrastructure organisation, but we were not sure that others would see the value in making a public commitment to the principles. We’d heard some people say that they thought adopting them would be seen as “Virtue Signaling.” Which, to be fair, it is. This shouldn’t be surprising or contentious. Our entire scholarly communication system is based on virtue signaling. But, of course, the term “virtue signaling” (with scare quotes) is also sometimes used to insinuate that such signaling is disingenuous and designed primarily for marketing purposes. And that would be a real danger. But the principles were drafted with a built-in safeguard against disingenuous use. The commitments POSI lists are practical things that can be verified by anyone. Is our data open? Does the diversity of our board reflect the diversity of our stakeholders?&lt;/p>
&lt;p>So from the start, we knew that the community would be able to hold us to our commitments. And knowing that made it imperative that we develop a mechanism and process for tracking whether we were meeting them. Thus was born the self-audit.&lt;/p>
&lt;p>And the self-audit, in turn, has served as a &lt;a href="https://en.wikipedia.org/wiki/Forcing_function" target="_blank">forcing function&lt;/a> to ensure that we didn’t just launch a proclamation and then forget about it. We needed to integrate our POSI commitments into all aspects of our day-to-day work. As such, “Live up to POSI” is now a prominent part of &lt;a href="https://www.crossref.org/strategy/#live-up-to-posi">Crossref’s Strategic Agenda&lt;/a>. POSI has become a fundamental part of our planning and our &lt;a href="https://trello.com/b/02zsQaeA/crossref-roadmap" target="_blank">public product roadmap&lt;/a>. POSI has even become a part of our internal staff annual development plans.&lt;/p>
&lt;p>Adopting POSI has changed the way we work. It has changed the way the board works. It has changed the way staff works.&lt;/p>
&lt;p>And we hope that it is having a similar effect on our fellow POSI Posse.&lt;/p>
&lt;h2 id="but-how-about-changing-the-way-posi-works">But how about changing the way POSI works?&lt;/h2>
&lt;p>Now that Crossref and the nine other members of the POSI Posse have had a year of considering and/or living up to the POSI standards, what would we change? What would we add?&lt;/p>
&lt;p>A few themes have started to emerge as we’ve fielded questions from the current POSI Posse and others who have expressed an interest in adopting POSI.&lt;/p>
&lt;ul>
&lt;li>How does POSI apply to non-membership organisations?&lt;/li>
&lt;li>Can POSI apply to commercial organisations?&lt;/li>
&lt;li>How could POSI be extended to apply to open infrastructure organisations &lt;em>outside&lt;/em> of scholarly communication?&lt;/li>
&lt;li>How in the hell do you pronounce “POSI?”&lt;/li>
&lt;/ul>
&lt;p>We’ve tried to answer some of these questions in &lt;a href="https://openscholarlyinfrastructure.org/faq/" target="_blank">the POSI FAQ&lt;/a>, but can we update POSI so that we don’t need the FAQ? Or at least so that we can start a new FAQ?&lt;/p>
&lt;p>And, critically, if we change POSI, how do we ensure we make it stronger and not weaker? Because, to be candid, some of the questions that we’ve fielded have come from parties concerned that POSI is too restrictive. That, for example, the stipulation that revenue should be based on services and not on data makes for inflexible business models. Yes. It does. Deliberately.&lt;/p>
&lt;p>Because one of the biggest barriers to a community being able to &lt;a href="https://en.wikipedia.org/wiki/Fork_%28software_development%29" target="_blank">fork&lt;/a> digital infrastructure is closed (incl. fee-based) data. And one of the fundamental positions of POSI is one the authors learned from open-source communities. This is that these efforts can fail no matter how much care you take to ensure financial sustainability and how much care you take to ensure community-based governance. The ultimate power the open-source community has is to take the code and fork it. This is the insurance policy that helps keep open source projects honest. And we have tried our best to bake this lesson into the POSI principles. We don’t want to weaken POSI. They are, after all, principles.&lt;/p>
&lt;p>So in 2022, we look forward to more organisations endorsing POSI. And the current POSI Posse has started a conversation about how we can strengthen the principles and also extend them so that they can more easily be applied to different kinds of organisations and perhaps even in different sectors. A summary of these discussions will be published in the coming weeks.&lt;/p>
&lt;p>But how will we open these conversations to the broader community? How will we engage those who have yet to adopt the principles but are interested in doing so? What about those interested but perhaps only if they are adapted in some way?&lt;/p>
&lt;p>We already have a mechanism for soliciting feedback, questions, and suggestions concerning POSI. However, it is a relatively primitive system, based on either sending email to one of the POSI Posse or &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">raising a GitLab ticket&lt;/a>. It was the best we could do in the short time we had to put together the POSI site. An &lt;a href="https://en.wikipedia.org/wiki/Minimum_viable_product" target="_blank">MVP&lt;/a>, if you will. The feedback mechanism served us well over the past year; we engaged with many interested parties and even managed to help nine of them adopt the principles.&lt;/p>
&lt;p>But as with all things POSI - there is room for improvement. And so, we hope to have a more user-friendly way to solicit public feedback and hold discussions. This feedback and our own experiences with adopting POSI over the past year will, in turn, inform our efforts at revising POSI to take into account the things we’ve learned since POSI was originally written.&lt;/p>
&lt;p>So look out for announcements on &lt;a href="https://openscholarlyinfrastructure.org/" target="_blank">the POSI site&lt;/a>. And we look forward to another year of expanding the list of POSI adopters and continuing our own POSI progress. If you’re POSI-curious, get in touch with any of the ten POSI adopters to start a conversation about your own path towards truly open infrastructure.&lt;/p></description></item><item><title>Image integrity: Help us figure out the scale of the problem</title><link>https://www.crossref.org/blog/image-integrity-help-us-figure-out-the-scale-of-the-problem/</link><pubDate>Mon, 07 Feb 2022 00:00:00 +0000</pubDate><author>Fabienne Michaud</author><guid>https://www.crossref.org/blog/image-integrity-help-us-figure-out-the-scale-of-the-problem/</guid><description>&lt;h2 id="some-context">Some context&lt;/h2>
&lt;p>The &lt;a href="https://www.crossref.org/working-groups/similarity-check/" target="_blank">Similarity Check Advisory Group&lt;/a> met a number of times last year to discuss current and emerging originality issues with text-based content. During those meetings, the topic of image integrity was highlighted as an area of growing concern in scholarly communications, particularly in the life sciences.&lt;/p>
&lt;p>Over the last few months, we have also read with interest the &lt;a href="https://osf.io/kgyc6/" target="_blank">recommendations for handling image integrity issues&lt;/a> by the STM Working Group on Image Alteration and Duplication Detection, followed closely image integrity sleuths such as &lt;a href="https://www.nature.com/articles/d41586-020-01363-z" target="_blank">Elizabeth Bik&lt;/a> and have, like many of you, noticed that image manipulation is increasingly given as &lt;a href="https://retractionwatch.com/2021/11/11/exclusive-university-of-glasgow-seeking-retraction-of-multiple-papers-after-findings-of-image-manipulation/" target="_blank">the reason for retractions&lt;/a>.&lt;/p>
&lt;p>Image integrity issues are often associated with &lt;a href="https://publicationethics.org/resources/forum-discussions/publishing-manipulation-paper-mills" target="_blank">paper mill activity&lt;/a> but can also originate from an individual’s intentional or unintentional unethical behaviour. Currently, such issues with figures and images are being identified manually or by using an image integrity tool, comparing images within the same article and/or the publisher’s past publications only - and we know that this is a source of frustration for the Crossref members we have spoken to.&lt;/p>
&lt;h2 id="what-next-">What next ?&lt;/h2>
&lt;p>As reported &lt;a href="https://www.nature.com/articles/d41586-021-03807-6" target="_blank">in Nature last December&lt;/a>, we believe Crossref is in a unique position to spearhead a cross-publisher solution, similar to what we do for text-based originality checking, as part of our Similarity Check service.&lt;/p>
&lt;p>Before we start exploring potential software options, &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSduZO93xDjTFwSD6HmDNvaJnNbt_OGmHDH_72kD-OCZSNgKEw/viewform" target="_blank">we need your help&lt;/a> to understand:&lt;/p>
&lt;ul>
&lt;li>the scale of the issues and whether these are focused on specific disciplines&lt;/li>
&lt;li>the type of issues we should prioritise e.g. duplication, beautification, rotation, plagiarism, GAN-generated images/deep-fakes, etc.&lt;/li>
&lt;li>what software (if any) members are using or trialling&lt;/li>
&lt;li>whether a cross-publisher service with the collective benefit of shared images would be of sufficient interest to the community&lt;/li>
&lt;/ul>
&lt;p>✏️ Let us know what your experience and thoughts are on image integrity by completing &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSduZO93xDjTFwSD6HmDNvaJnNbt_OGmHDH_72kD-OCZSNgKEw/viewform" target="_blank">this survey&lt;/a>.&lt;/p>
&lt;p>We’re planning to complete our research and share with you the results along with our proposed next steps soon.&lt;/p></description></item><item><title>Hiccups with credentials in the Test Admin Tool</title><link>https://www.crossref.org/blog/hiccups-with-credentials-in-the-test-admin-tool/</link><pubDate>Wed, 26 Jan 2022 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/hiccups-with-credentials-in-the-test-admin-tool/</guid><description>&lt;h3 id="tldr">TL;DR&lt;/h3>
&lt;p>We inadvertently deleted data in our authentication sandbox that stored member credentials for our Test Admin Tool - test.crossref.org. We’re restoring credentials using our production data, but this will mean that some members have credentials that are out-of-sync. Please contact &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> if you have issues accessing test.crossref.org.&lt;/p>
&lt;h3 id="2025-update">2025 update&lt;/h3>
&lt;p>We&amp;rsquo;re working to scale back our support for the test admin tool. We will continue to support our &lt;a href="https://www.crossref.org/02publishers/parser.html" target="_blank">XML parser&lt;/a> for anyone wanting to test their XML. If you’re a service provider and would like to test your integrations, which we will continue to support, you may POST submissions to our test system using &lt;a href="https://test.crossref.org/servlet/deposit" target="_blank">https://test.crossref.org/servlet/deposit&lt;/a>. You’ll need to email us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> so we can configure an account within the test system before you test your integration.&lt;/p>
&lt;h3 id="the-details">The details&lt;/h3>
&lt;p>Earlier today the credentials in our authentication sandbox were inadvertently deleted. This was a mistake on our end that has resulted in those credentials no longer being stored for our members using our Test Admin Tool - test.crossref.org.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/test_admin_tool_error.png"
alt="access problems test Admin Tool" width="50%">
&lt;/figure>
&lt;/div>
&lt;p>To be clear, this error has had no impact on the production Admin Tool - doi.crossref.org - or any member’s access to registering content therein. If you’re a member who registers content with us using our helper tools (e.g., the web deposit form) or OJS, you’re likely unfamiliar with the Test Admin Tool, and this issue will not affect you or your registration of content.&lt;/p>
&lt;p>We don’t configure all member accounts for the Test Admin Tool, so, fortunately, this is an issue for the minority of our members. That said, for those members who do use the Test Admin Tool, this is not a trivial problem. And, we’re going to dedicate additional resources across the organisation to ensure it is fixed.&lt;/p>
&lt;h3 id="next-steps">Next steps&lt;/h3>
&lt;p>We’ve repopulated the credentials in the Test Admin Tool based on our production accounts. It was our best option. While we don’t know your current credentials, our support and membership teams do know that the majority of our members using the Test Admin Tool have historically shared credentials between the Test Admin Tool and our production Admin Tool - doi.crossref.org. That means that many of you will be able to access the Test Admin Tool using those shared credentials; but some of you - who have used different credentials between the two systems - will not.&lt;/p>
&lt;p>We also know that for many of you testing submissions is an integral step in your workflow, so we’ve determined this is an all-hands-on-deck situation and our staff, across the organisation, will be assisting members who have issues with access to test.crossref.org. Starting today, we’re actively monitoring submissions to the Test Admin Tool for access errors through Friday, 11 February. We’ll be proactively contacting affected members to reset their passwords. If you encounter problems before we reach out to you, please do contact us at at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> and include ‘Accessing Test Admin Tool’ in your subject line.&lt;/p></description></item><item><title>A ROR-some update to our API</title><link>https://www.crossref.org/blog/a-ror-some-update-to-our-api/</link><pubDate>Wed, 19 Jan 2022 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/a-ror-some-update-to-our-api/</guid><description>&lt;p>Earlier this year, Ginny posted &lt;a href="https://doi.org/10.64000/9aaza-a3158" target="_blank">an exciting update on Crossref’s progress with adopting ROR&lt;/a>, the Research Organisation Registry for affiliations, announcing that we&amp;rsquo;d started the collection of &lt;a href="https://www.ror.org" target="_blank">ROR&lt;/a> identifiers in our metadata input schema. 🦁&lt;/p>
&lt;p>The capacity to accept ROR IDs to help reliably identify institutions is really important but the real value comes from their open availability alongside the other metadata registered with us, such as for publications like journal articles, book chapters, preprints, and for other objects such as grants. So today&amp;rsquo;s news is that ROR IDs are now connected in Crossref metadata and openly available via our APIs. 🎉&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2022/research--nexus-2021.png"
alt="visualizing the Research Nexus vision" width="50%">
&lt;/figure>
&lt;/div>
&lt;p>This means ROR can be used by and within all the tools services that integrate with Crossref APIs to analyse, search, recommend, or evaluate research. It’s an important element of &lt;strong>the Research Nexus&lt;/strong>, our vision of a fully connected open research ecosystem, and helps identify, share, and link the affiliations of those producing and publishing different types of research or receiving grants.&lt;/p>
&lt;p>Now that this metadata is available, it helps confer the downstream benefits of ROR for different (and interconnected) groups:&lt;/p>
&lt;ul>
&lt;li>It makes it easier for institutions to find and measure their research output by the articles their researchers have published, or perhaps make it easier to track the grants they’ve received.&lt;/li>
&lt;li>Funders need to be able to discover and track the research and researchers they have supported.&lt;/li>
&lt;li>Academic librarians need to easily find all of the publications associated with their campus.&lt;/li>
&lt;li>Journals need to know where authors are affiliated so they can determine eligibility for institutionally sponsored publishing agreements.&lt;/li>
&lt;li>Editors can use more accurate information on author and reviewer institutions during the peer review process, which can help avoid potential conflicts of interest.&lt;/li>
&lt;/ul>
&lt;p>Those are just a handful of use cases, which is why disseminating ROR affiliation identifiers via our APIs is so important; it lets others choose to do what they need to with the information, without restriction.&lt;/p>
&lt;h2 id="the-story-so-far">The story so far&lt;/h2>
&lt;p>A growing number of our members have started to include ROR in the metadata they register with us, so we’re excited to be able to see this via simple API queries.&lt;/p>
&lt;p>At the time of writing we can see &lt;a href="http://api.crossref.org/works?filter=has-ror-id:t&amp;amp;facet=publisher-name:*" target="_blank">nearly 4,000 RORs being registered by these 21 members&lt;/a> (we&amp;rsquo;ve removed test accounts). Note that many of these are being baked into metadata being registered for grant records, also &lt;a href="https://doi.org/10.64000/tynar-j7a72" target="_blank">recently released and now findable&lt;/a> through the REST API:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Wellcome&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">2821&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Natural Resources Canada/CMSS/Information Management&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">277&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;University of Szeged&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">139&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;RTI Press&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">104&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;American Cancer Society&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">103&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;University of Missouri Libraries&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">77&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Keldysh Institute of Applied Mathematics&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">52&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Boise State University, Albertsons Library&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">52&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Australian Research Data Commons (ARDC)&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">52&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;The Neurofibromatosis Therapeutic Acceleration Program&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">49&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Boise State University&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">12&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;The ALS Association&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">11&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Children&amp;#39;s Tumor Foundation&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">9&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Episteme Health Inc&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">3&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;The University of the Witwatersrand&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">2&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Office of Scientific and Technical Information&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">2&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;AGH University of Science and Technology Press&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">2&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;York University Libraries&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;SZTEPress&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Masaryk University Press&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Institut für Germanistik der Universität Szeged&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Our grants schema accommodated ROR first, so it&amp;rsquo;s the funder members and grant records that dominate the adoption of ROR&amp;hellip; so far! But there are a few articles and reports there too already. &lt;a href="https://api.crossref.org/works?filter=has-ror-id:t&amp;amp;facet=type-name:*" target="_blank">These record types&lt;/a> include ROR in their records:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Grant&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">3047&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Report&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">382&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Dissertation&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">164&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Journal Article&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">140&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Conference Paper&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">22&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Posted Content&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">12&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Dataset&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">7&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Monograph&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">6&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Book&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">3&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Chapter&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">2&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Proceedings Series&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Peer Review&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Journal Issue&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Book Set&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;Book Series&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="mi">1&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>We can currently see &lt;a href="https://api.crossref.org/works?filter=has-ror-id:t&amp;amp;facet=ror-id:*" target="_blank">205 different ROR IDs in Crossref metadata&lt;/a>, with the most frequently provided ROR ID being: &lt;a href="https://ror.org/02jx3x895" target="_blank">https://ror.org/02jx3x895&lt;/a>, or &lt;strong>University College London&lt;/strong> as it’s also known as.&lt;/p>
&lt;p>If you’re a Crossref member keen to assert affiliation identification in your content, our recent webinar, &lt;a href="https://www.youtube.com/watch?v=D9Mtqb64OEk" target="_blank">Working with ROR as a Crossref member: what you need to know&lt;/a>, covers all the detail.&lt;/p>
&lt;p>Interested in using the information? Dig into our &lt;a href="https://api.crossref.org/swagger-ui/index.html" target="_blank">REST API documentation&lt;/a> and into the API itself, use the polite pool if you can (i.e. identify yourself). There’s also a wealth of information on the &lt;a href="https://ror.readme.io/" target="_blank">ROR support site&lt;/a> or being shared among &lt;a href="https://ror.org/integrations/" target="_blank">integrators&lt;/a> in the growing ROR community.&lt;/p>
&lt;p>Join us in doing more with ROR!&lt;/p></description></item><item><title>Event Data now with added references</title><link>https://www.crossref.org/blog/event-data-now-with-added-references/</link><pubDate>Wed, 10 Nov 2021 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/event-data-now-with-added-references/</guid><description>&lt;p>Event Data is our service to capture online mentions of Crossref records. We monitor data archives, Wikipedia, social media, blogs, news, and other sources. Our main focus has been on gathering data from external sources, however we know that there is a great deal of Crossref metadata that can be made available as events. Earlier this year we started adding &lt;a href="https://doi.org/10.64000/gsrh2-34428" target="_blank">relationship metadata&lt;/a>, and over the last few months we have been working on bringing in citations between records.&lt;/p>
&lt;p>Our members deposit references alongside other metadata, and we have a lot of them. In fact, we have over 1.2 billion, with hundreds of thousands of new references added each day. While our metadata APIs make it easy to see which works are cited, it is much more difficult to find a list of citations to a specific work. We can make this easier by presenting citations as events in Event Data. Now that the huge majority of our members have responded positively to the &lt;a href="https://i4oc.org/" target="_blank">Initiative for Open Citations (I4OC)&lt;/a> campaign and Crossref’s open-by-default reference policy, the move to make this data available via Event Data is a natural step.&lt;/p>
&lt;h2 id="a-bumpy-ride-but-we-got-there">A bumpy ride, but we got there&lt;/h2>
&lt;p>Adding such a large amount of data means a significant increase in the data coming into Event Data, which has presented some challenges. We’ve known for some time that Event Data is &lt;a href="https://doi.org/10.64000/eyfwr-z5148" target="_blank">not very stable&lt;/a>, but we expected it to cope with the new data coming in. We have mitigated by initially only looking at new data, not trying to immediately back-fill with old references. Unfortunately, even with this limitation it hasn’t been a smooth ride, and our first effort to put references into Event Data uncovered bugs we didn’t know about and we had to walk back the changes.
We tried again and found that we were hitting rate limits for our own APIs. This is a sure sign of technical debt: we shouldn’t need to be shifting large amounts of our own data from one place to another, and not at rates that could be putting stress on APIs used by others in the community.&lt;/p>
&lt;p>We have managed to work around these problems and I’m pleased to say that we are now adding metadata from reference lists to Event Data. They can be accessed via the Event Data API:
&lt;a href="https://api.eventdata.crossref.org/v1/events?rows=10&amp;source=crossref&amp;relation-type=references&amp;from-collected-date=2021-10-01">&lt;a href="https://api.eventdata.crossref.org/v1/events?rows=10&amp;amp;source=crossref&amp;amp;relation-type=references&amp;amp;from-collected-date=2021-10-01" target="_blank">https://api.eventdata.crossref.org/v1/events?rows=10&amp;source=crossref&amp;relation-type=references&amp;from-collected-date=2021-10-01&lt;/a>&lt;/a>&lt;/p>
&lt;h2 id="where-to-next">Where to next?&lt;/h2>
&lt;p>There remains work to be done. We would like to backfill references, and there is also further work to include relationships to objects that have identifiers other than Crossref records (genes, proteins, ArXiv identifiers, and so on). Our work on &lt;a href="https://doi.org/10.64000/g3twz-j0z04" target="_blank">investigating sources&lt;/a> is proceeding and we will be looking to add more next year. While possible, these steps will be costly and time-consuming if we proceed without significant changes to the infrastructure supporting Event Data.&lt;/p>
&lt;p>When we started Event Data the volumes of data were much smaller and our infrastructure coped well, but as &lt;a href="https://doi.org/10.64000/gsrh2-34428" target="_blank">we’ve said here before&lt;/a>, it’s in need of an overhaul. In fact, our recent experience and some other considerations are making us look at some very fundamental changes in how we record events.&lt;/p>
&lt;p>We are therefore working on a new data model that will allow events to be stored alongside the rest of our metadata. This work is still in the early stages, but if we are successful it will mean that we won’t need to move data between databases. It will also make it easier to provide access to all of our reference metadata along with other relationships that we’re not currently able to provide, and give us the capacity to add new data sources.&lt;/p>
&lt;h2 id="open-references">Open references&lt;/h2>
&lt;p>&lt;em>[EDIT 6th June 2022 - all references are now open by default with the March 2022 board vote to remove any restrictions on reference distribution].&lt;/em>&lt;/p>
&lt;p>It is worth noting that only &lt;em>open&lt;/em> references will be available via Event Data. This covers 88% of works with references at present. Members have the option to deposit references with &lt;em>limited&lt;/em> visibility, meaning only &lt;a href="https://www.crossref.org/documentation/metadata-plus/">Metadata Plus&lt;/a> users can access them; or &lt;em>closed&lt;/em> visibility, meaning that only the member who owns the cited work can retrieve the citation, via &lt;a href="https://www.crossref.org/documentation/cited-by/">Cited-by&lt;/a>.&lt;/p>
&lt;p>We encourage our members to make their references open and deposit them as metadata. It makes them usable downstream by thousands of tools that researchers use. Including open references also improves the quality of metadata, and there are reciprocal benefits for the large number of members who openly share their reference data: they contribute to a large, openly available pool of data with many applications that advance research, and drives usage of the content published by our members.&lt;/p>
&lt;p>If you are a Crossref member and unsure whether your reference metadata is open or not, check your &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a>. This will tell you the percentage of your records with deposited references, and the percentage of those that are open. You can change the reference visibility preference for each DOI prefix that you own by contacting our &lt;a href="https://support.crossref.org/hc/en-us/requests/new?ticket_form_id=360001642691" target="_blank">support team&lt;/a>. For guidance on how to deposit references, &lt;a href="https://www.crossref.org/documentation/register-maintain-records/maintaining-your-metadata/add-references/">see our user documentation&lt;/a>.&lt;/p></description></item><item><title>Come and get your grant metadata!</title><link>https://www.crossref.org/blog/come-and-get-your-grant-metadata/</link><pubDate>Mon, 08 Nov 2021 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/come-and-get-your-grant-metadata/</guid><description>&lt;p>&lt;strong>Tl;dr&lt;/strong>: Metadata for the (currently 26,000) grants that have been registered by our funder members is now available via the REST API. This is quite a milestone in our program to include funding in Crossref infrastructure and a step forward in our mission to connect all.the.things. This post gives you all the queries you might need to satisfy your curiosity and start to see what&amp;rsquo;s possible with deeper analysis. So have the look and see what useful things you can discover.&lt;/p>
&lt;h2 id="how-it-started">How it started&lt;/h2>
&lt;p>Back in 2017 we posted the outcomes of some discussions with a &lt;a href="https://www.crossref.org/working-groups/funders/">newly-reformed Funder Advisory Group&lt;/a>, &lt;a href="https://doi.org/10.64000/5cfh1-1wa10" target="_blank">plotting Crossref&amp;rsquo;s path&lt;/a>. In 2018, &lt;a href="https://doi.org/10.64000/xqr28-ee750" target="_blank">Wellcome described their rationale for supporting the grants effort&lt;/a> with the help of Europe PMC, and in 2019 the sub-groups of the Advisory Board put out &lt;a href="https://doi.org/10.64000/g2yk3-hgv34" target="_blank">a call for feedback on the metadata plan&lt;/a> as the fee model they created was also approved by our board.&lt;/p>
&lt;p>Since late 2019, research funders have been registering metadata and identifiers for their grants with us. We currently have a healthy 26k grants registered with us, via 13 funding organisations. I’d specifically highlight Wellcome for volume (&lt;a href="http://blog.europepmc.org/2020/06/global-grant-ids-in-europe-pmc.html" target="_blank">registering via Europe PMC&lt;/a>), and the Australian Research Data Commons (ARDC) who was the first funder that included ROR IDs in their grant metadata, really getting the value of connecting all related entities and contributors.&lt;/p>
&lt;p>The reasons for registering grants with Crossref? Let&amp;rsquo;s recap:&lt;/p>
&lt;ul>
&lt;li>Support of open data and information about grants&lt;/li>
&lt;li>Streamlined discovery of funded content&lt;/li>
&lt;li>Improved analytics and data quality&lt;/li>
&lt;li>More complete picture of outputs and impact&lt;/li>
&lt;li>Better value from investments in reporting services&lt;/li>
&lt;li>Improved timeliness, completeness and accuracy of reporting: save time for researchers&lt;/li>
&lt;li>More complete information to support analysis and evaluation without relying on manual data entry&lt;/li>
&lt;/ul>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/funder-visual.png" width="75%">
&lt;/figure>
&lt;h2 id="how-its-going">How it&amp;rsquo;s going&lt;/h2>
&lt;p>For grant information to be used, it’s key that it is is openly available and disseminated as widely as possible. That work starts with funders registering their grants, and continues with us. Now that we’ve completed the REST API&amp;rsquo;s &lt;a href="https://doi.org/10.64000/nxwqn-x9m73" target="_blank">Elasticsearch migration&lt;/a>, we’re happy to announce that all our grant information is now available via our REST API.&lt;/p>
&lt;p>Here’s a snippet of the kind of metadata you can see related to the grants registered with us. This is information related to grant record &lt;a href="https://doi.org/10.35802/218300" target="_blank">https://doi.org/10.35802/218300&lt;/a>, found using &lt;a href="https://api.crossref.org/works/10.35802/218300" target="_blank">this request (https://api.crossref.org/works/10.35802/218300)&lt;/a> which you can use to see the full metadata record:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;publisher&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;Wellcome&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;award&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;107769&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;DOI&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;10.35802/107769&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;type&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;grant&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;created&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;date-parts&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="mi">2019&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="mi">9&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="mi">25&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">]&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">],&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;date-time&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;2019-09-25T07:17:20Z&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;timestamp&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="mi">1569395840000&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;source&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;Crossref&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;prefix&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;10.35802&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;member&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="s2">&amp;#34;13928&amp;#34;&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;project&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;project-title&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;title&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;Initiative to Develop African Research Leaders (IDeAL)&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">],&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;project-description&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;description&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;Research is key in tackling the heath challenges that Africa faces. In KWTRP we have been committed to building sustainable capacity alongside an active and diverse research programme covering social science, health services research, epidemiology, laboratory science including molecular biology and bioinformatics. Our strategy has been successful in delivering high quality PhD training, leveraging individual funding and programme funding in order to place students in productive groups and provide high quality supervision and mentorship. Here we plan to consolidate and build on these outputs to address long-term sustainability. We will emphasise the full career path needed to generate research leaders. KWTRP aims to address capacity building for research through an initiative that employs a progressive and long term outlook in the development of local research leadership. The overall aim of the \&amp;#34;Initiative to Develop African Research Leaders\&amp;#34; (IDeAL) is to build a critical mass of African researchers who are technically proficient as scientists and well-equipped to independently lead science at international level, able to engage with funders, policy makers and governments, and to act as supervisors and mentors for the next generation of researchers.&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;#34;language&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;en&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">},&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>If you dig in, you can see information about the project, investigators (including their ORCID iDs), the funder, award type, amount, description of the grant, and a link to the public page showing information about the grant. More information on the required and optional fields is available in our &lt;a href="https://www.crossref.org/documentation/content-registration/content-type-markup-guide/grants/">grants markup guide&lt;/a>.&lt;/p>
&lt;p>Here are some examples of the kind of things you can now ask:&lt;/p>
&lt;h4 id="show-me-who-is-registering-grants">Show me who is registering grants:&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/types/grant/works?rows=0&amp;amp;facet=funder-name:*" target="_blank">https://api.crossref.org/types/grant/works?rows=0&amp;amp;facet=funder-name:*&lt;/a>&lt;/p>
&lt;h4 id="show-me-all-of-the-grants-registered-by-wellcome">Show me all of the grants registered by Wellcome:&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/works?query.funder-name=Wellcome&amp;filter=type:grant">&lt;a href="https://api.crossref.org/works?query.funder-name=Wellcome&amp;amp;filter=type:grant" target="_blank">https://api.crossref.org/works?query.funder-name=Wellcome&amp;filter=type:grant&lt;/a>&lt;/a>&lt;/p>
&lt;h4 id="show-me-all-of-the-grants-associated-with-the-investigator-name-caldas">Show me all of the grants associated with the investigator name Caldas:&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/works?query.contributor=Caldas&amp;filter=type:grant">&lt;a href="https://api.crossref.org/works?query.contributor=Caldas&amp;amp;filter=type:grant" target="_blank">https://api.crossref.org/works?query.contributor=Caldas&amp;filter=type:grant&lt;/a>&lt;/a>&lt;/p>
&lt;p>And bibliographic queries finding entries in&amp;hellip;&lt;/p>
&lt;h4 id="award-number">Award number:&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/works?query.bibliographic=7196&amp;filter=type:grant">&lt;a href="https://api.crossref.org/works?query.bibliographic=7196&amp;amp;filter=type:grant" target="_blank">https://api.crossref.org/works?query.bibliographic=7196&amp;filter=type:grant&lt;/a>&lt;/a>&lt;/p>
&lt;h4 id="project-title">Project title:&lt;/h4>
&lt;p>&lt;a href="https://api.crossref.org/works?query.bibliographic=RIZ1&amp;filter=type:grant">&lt;a href="https://api.crossref.org/works?query.bibliographic=RIZ1&amp;amp;filter=type:grant" target="_blank">https://api.crossref.org/works?query.bibliographic=RIZ1&amp;filter=type:grant&lt;/a>&lt;/a>&lt;/p>
&lt;h2 id="more-to-do">More to do&lt;/h2>
&lt;p>This is a milestone but it&amp;rsquo;s not the end of the story. We have more to add relationships, encourage the use of this metadata amongst publishers and their platforms, and to add grant records to our tools such as Participation Reports and Metadata Search. But in the meantime, feel free to &lt;a href="https://www.crossref.org/contact">get in touch&lt;/a> if you have queries about registering grants with us or about using the related metadata in your tools and services.&lt;/p>
&lt;p>This information will grow over time as more funders join Crossref and add their grant metadata and as more analyses is possible. We&amp;rsquo;re looking forward to the next steps!&lt;/p></description></item><item><title>Update on the outage of October 6, 2021</title><link>https://www.crossref.org/blog/update-on-the-outage-of-october-6-2021/</link><pubDate>Wed, 27 Oct 2021 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/update-on-the-outage-of-october-6-2021/</guid><description>&lt;p>In &lt;a href="https://doi.org/10.64000/e3xe5-wae58" target="_blank">my blog post on October 6th&lt;/a>, I promised an update on what caused the outage and what we are doing to avoid it happening again. This is that update.&lt;/p>
&lt;p>Crossref hosts its services in a hybrid environment. Our original services are all hosted in a data center in Massachusetts, but we host new services with a cloud provider. We also have a few R&amp;amp;D systems hosted with Hetzner.&lt;/p>
&lt;p>We know an organisation our size has no business running its own data center, and we have been slowly moving services out of the data center and into the cloud.&lt;/p>
&lt;p>For example, over the past nine months, we have moved our authentication service and our &lt;a href="https://api.crossref.org" target="_blank">REST APIs&lt;/a> to the cloud.&lt;/p>
&lt;p>And, we are working on moving the other existing services too. For example, we are in the midst of moving &lt;a href="https://www.crossref.org/services/event-data/" target="_blank">Event Data&lt;/a> and, our next target, after Event Data, is the content registration system.&lt;/p>
&lt;p>All new services are deployed to the cloud by default.&lt;/p>
&lt;p>While moving services out of the data center, we have also been trying to shore up the data center to ensure it continues to function during the transition. One of the weaknesses we identified in the data center was that the same provider managed both our primary network connection &lt;em>and&lt;/em> our backup connection (albeit- on entirely different physical networks). We understood that we really needed a separate provider to ensure adequate redundancy, and we had already had a third network drop installed from a different provider. But, unfortunately, it had not yet been activated and connected.&lt;/p>
&lt;p>Meanwhile, our original network provider for the first two connections informed us months ago that they would be doing some major work on our &lt;em>backup&lt;/em> connection. However, they assured us that it would not affect the primary connection- something we confirmed with them repeatedly since we knew our replacement backup connection was not yet active.&lt;/p>
&lt;p>But, the change our provider made &lt;em>did&lt;/em> affect &lt;em>both&lt;/em> the backup (as intended) and the primary (not intended). They were as surprised as we were, which kind of underscores why we want two separate providers as well as two separate network connections.&lt;/p>
&lt;p>So both our primary and secondary networks went down while we had not yet activated our replacement secondary network.&lt;/p>
&lt;p>Also, our only &lt;em>local&lt;/em> infrastructure team member was in surgery at the time (He is fine. It was routine. Thanks for asking).&lt;/p>
&lt;p>This meant we had to send a local developer to the data center, but the data center’s authentication process had changed since the last time said developer had visited (pre-pandemic). So, yeah, it took us a long time to even get into the data center.&lt;/p>
&lt;p>By then, our infrastructure team member was out of surgery and on the phone with our network provider, who realized their mistake and reverted everything. This whole process (getting network connectivity restored, not the surgery) took almost two hours.&lt;/p>
&lt;p>Unfortunately, the outage didn’t just affect services hosted in the data center. It also affected our cloud-hosted systems. This is because all of our requests were still routed to the data center first, after which those destined for the cloud were split out and redirected. This routing made sense when the bulk of our requests were for services hosted in the data center. But, within the past month, that calculus had shifted. Most of our requests now are for cloud-based services. We were scheduled to switch to routing traffic through our cloud provider first, and had this been in place, many of our services would have continued running during the data center outage.&lt;/p>
&lt;p>It is very tempting to stop this explanation here and leave people with the impression that:&lt;/p>
&lt;ul>
&lt;li>The root cause of the outage was the unpredicted interaction between the maintenance on our backup line and the functionality of our primary line;&lt;/li>
&lt;li>Our slowness to respond was exclusively down to one of the two members of our infrastructure staff being (&lt;em>cough&lt;/em>) indisposed at the time.&lt;/li>
&lt;/ul>
&lt;p>But the whole event uncovered several other issues as well.&lt;/p>
&lt;p>Namely:&lt;/p>
&lt;ul>
&lt;li>Even if one of our three lines had stayed active, the routers in the data center would not have cut over to the redundant working system because we had misconfigured them and we had not tested them;&lt;/li>
&lt;li>We did not keep current documentation on the changing security processes for accessing the data center;&lt;/li>
&lt;li>Our alerting system does not support the kind of escalation logic, and coverage-scheduling that would have allowed us to automatically detect when our primary data center administrator didn’t respond (being in surgery and all) and redirect alerts and warnings to secondary responders; and&lt;/li>
&lt;li>We need to accelerate our move out of the data center.&lt;/li>
&lt;/ul>
&lt;p>What are we doing to address these issues?&lt;/p>
&lt;ul>
&lt;li>Completing the installation of the backup connection with a second provider;&lt;/li>
&lt;li>Scheduling a test of our router’s cutover processes where we will actually pull the plug on our primary connection to ensure that failover is working as intended. We will give users ample warning before conducting this test;&lt;/li>
&lt;li>Revising our emergency contact procedures and updating our documentation for navigating our data center’s security process;&lt;/li>
&lt;li>Replacing our alerting system with one that gives us better control over escalation rules; and&lt;/li>
&lt;li>Adding a third FTE to the infrastructure team to help us accelerate our move to the cloud and to implement infrastructure management best practices.&lt;/li>
&lt;/ul>
&lt;p>October 6th, 2021, was a bad day. But we’ve learned from it. So if we have a bad day in the future, it will at least be different.&lt;/p></description></item><item><title>More new faces at Crossref</title><link>https://www.crossref.org/blog/more-new-faces-at-crossref/</link><pubDate>Thu, 21 Oct 2021 00:00:00 +0000</pubDate><author>Lindsay Russell</author><guid>https://www.crossref.org/blog/more-new-faces-at-crossref/</guid><description>&lt;p>Looking at the road ahead, we’ve set some &lt;a href="https://www.crossref.org/strategy">ambitious goals&lt;/a> for ourselves and continue to see new members join from around the world, now numbering 16,000. To help achieve all that we plan in the years to come, we’ve grown &lt;a href="https://www.crossref.org/people/" target="_blank">our teams&lt;/a> quite a bit over the last couple of years, and we are happy to welcome Carlos, Evans, Fabienne, Mike, Panos, and Patrick.&lt;/p>
&lt;p>Our Software Development team has seen the most growth with the addition of Carlos, Mike, Panos, and Patrick; collectively, they bring specialist skills that are helping us to pay down technical debt, modernize our underlying infrastructure, and prepare for a consistent front-end experience. As a member of the Product team, Fabienne has a fresh take on our Similarity Check service, steering the upgrade to iThenticate v2. And Evans brings a scientific researcher perspective to our Member Experience team along with experience as a member who’s worked with our tools.&lt;/p>
&lt;p>And now some words from each of them.&lt;/p>
&lt;h2 id="carlos-del-ojo-elias">Carlos Del Ojo Elias&lt;/h2>
&lt;div style="float:left;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/carlos-bw-blog.jpg"
alt="image of Carlos" width="300px">
&lt;/figure>
&lt;/div>
&lt;p>I am a computer scientist with a master’s degree in Bioinformatics. Previously I used to work as a security auditor. I&amp;rsquo;ve got experience in research and software development both in academia and industry. It&amp;rsquo;s very exciting for me to join Crossref as a Senior software developer on the technology team. My current project involves working on the authentication and authorization subsystems, exploring state-of-the-art technologies in order to improve our services. I have always enjoyed contributing to the open-source community, so it is a pleasure for me to work in an organisation that promotes the principles of openness and transparency of software and data. &lt;br>&lt;br>&lt;/p>
&lt;hr>
&lt;h2 id="evans-atoni">Evans Atoni&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/evans-bw-blog.jpg"
alt="image of Evans" width="250px">
&lt;/figure>
&lt;/div>
&lt;p>I am a member of the Technical Support team having joined Crossref just a few weeks ago. I’m passionate about advancing open access and POSI. Helping our members sort through knotty technical queries and building relations with them to service their very diverse needs is what I’m most excited about in my role. In my spare time, I enjoy anything outdoors, family time, and traveling. I work remotely from Nairobi, Kenya. &lt;br>&lt;br>&lt;br>&lt;br>&lt;br>&lt;br>&lt;br>&lt;br>&lt;/p>
&lt;hr>
&lt;h2 id="fabienne-michaud">Fabienne Michaud&lt;/h2>
&lt;div style="float:left;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/fabienne-blog.jpeg"
alt="image of Fabienne" width="300px">
&lt;/figure>
&lt;/div>
&lt;p>I joined Crossref in April 2021 as a Product Manager for scholarly stewardship which includes the content comparison tool Similarity Check and I am thrilled to be a member of such a lovely, supportive and international team. I have a background in teaching and have worked in academic, research, and not-for-profit libraries in the UK for over 20 years in academic liaison, customer services, and management roles. These experiences have given me a user-centered approach and a drive to find collaborative, reliable, and pertinent technological solutions to support the research and scholarly community. Since starting at Crossref and, through my work with the Similarity Check Advisory Group, I have developed a good understanding of the current ethical issues facing the publishing sector (such as paper mills and other manipulations of the publication process) and a particular interest in how AI and automation tools can play a part in addressing these challenges. &lt;br>&lt;br>&lt;/p>
&lt;hr>
&lt;h2 id="mike-gill">Mike Gill&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/mike-gill-blog.jpeg"
alt="image of Mike" width="300px">
&lt;/figure>
&lt;/div>
&lt;p>I’ve been a software developer for twenty years, having studied software engineering at university. During my career, I have worked mostly in the banking and engineering industries so this is my first time working in scholarly publishing. I confess that before joining Crossref I wasn’t aware that the community existed so I was excited to see how I could ply my trade in this new (to me!) field. The role also appealed as, having primarily been a team leader/line manager in my recent career, this was an opportunity to be hands-on again and work with modern languages such as Kotlin. In the end, though, what really sealed it for me was reading on the Crossref website that ‘we take the work seriously but not necessarily ourselves’ which basically sums me up. So I knew I’d be in good company and that has proven to be the case!&lt;/p>
&lt;hr>
&lt;h2 id="panos-pandis">Panos Pandis&lt;/h2>
&lt;div style="float:left;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/panos3.jpg"
alt="image of Panos" width="300px">
&lt;/figure>
&lt;/div>
&lt;p>I joined Crossref as a Senior Software Developer in 2020, in the middle of the coronavirus pandemic. Moving to Crossref has been a much-needed breath of fresh air. I&amp;rsquo;m a big fan of open-source, and at Crossref, it just feels like home. Even more so after our recent commitment to the Principles of Open Scholarly Infrastructure (POSI). My main focus at the moment is Crossref&amp;rsquo;s Event Data service. I&amp;rsquo;m fascinated by the potential of Event Data and the broad audience I get to support and communicate with through the project. So if you spot me in a room, feel free to ask me anything about Clojure/Kotlin, Event Data, obscure technology, or kombucha recipes.&lt;/p>
&lt;p>&lt;br>&lt;br>&lt;br>&lt;br>&lt;/p>
&lt;hr>
&lt;h2 id="patrick-vale">Patrick Vale&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/patrick-bw-blog.jpg"
alt="image of Patrick" width="300px">
&lt;/figure>
&lt;/div>
&lt;p>I&amp;rsquo;m delighted to have joined Crossref as the first Frontend Developer. My role covers the inauguration of a scalable framework in which we can build future User Interfaces, and generally making people&amp;rsquo;s lives easier as they interact with our products and services - if a human uses it, I&amp;rsquo;m interested! It&amp;rsquo;s my intention to provide a platform on which we can quickly iterate to build and adapt our interfaces to suit the rapidly changing needs of our community. It&amp;rsquo;s been a pleasure to learn about the impact Crossref has across the scholarly spectrum; and to work with a team of open, practical, and downright friendly colleagues is a privilege. Outside of work, I enjoy cycling, growing things, and most recently, avoiding two small cats while moving from anywhere to anywhere around the house. &lt;br>&lt;br>&lt;br>&lt;br>&lt;/p>
&lt;p>Your contributions have been impactful and it will be fun to see all that you will surely contribute to our road ahead!&lt;/p>
&lt;p> &lt;/p></description></item><item><title>Outage of October 6, 2021</title><link>https://www.crossref.org/blog/outage-of-october-6-2021/</link><pubDate>Wed, 06 Oct 2021 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/outage-of-october-6-2021/</guid><description>&lt;p>On October 6 at ~14:00 UTC, our data centre outside of Boston, MA went down. This affected most of our network services- even ones not hosted in the data centre. The problem was that both of our primary and backup network connections went down at the same time. We&amp;rsquo;re not sure why yet. We are consulting with our network provider. It took us 2 hours to get our systems back online.&lt;/p>
&lt;p>We are going to reprocess content that was in the process of being registered at the time of the outage in order to make sure everything gets registered correctly. This may take a few days to complete.&lt;/p>
&lt;h3 id="why-did-we-have-such-a-complete-outage-and-why-did-it-take-us-so-long-to-fix-it">Why did we have such a complete outage and why did it take us so long to fix it?&lt;/h3>
&lt;ol>
&lt;li>
&lt;p>We still run a significant amount of our infrastructure in a data centre outside of Boston that we manage ourselves. Even though we&amp;rsquo;ve been moving many of our services to the cloud, all our traffic was still routed through the data centre - so when it went down, most of our cloud services were unavailable as well.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>It took us a long time to fix this because our infrastructure team only has two people in it. Only one of them is located near the data centre and was at the doctor’s when the outage occurred. Although we were alerted to the problem immediately, we had to send one of our development team members to the data centre to diagnose and fix the problem.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>We have been aware of these weaknesses in our system since I took the role of director of technology in 2019, and we have been putting most of our efforts over the past two years into fixing them.&lt;/p>
&lt;p>We know that an organisation of our size has no business trying to run and maintain a physical data centre ourselves. One of the strengths of cloud-based systems is that they can be administered from anywhere and don&amp;rsquo;t require anyone to physically go to a data centre to replace failed hardware or check that network connections are, in fact, live. We&amp;rsquo;ve been trying to move to the cloud as fast as we can. All new services that we build are cloud-based. At the same time we&amp;rsquo;ve been moving systems out of the data centre - starting with those that put the biggest load on our systems. To further aid this process we have budgeted to add an FTE to the infrastructure team in 2022.&lt;/p>
&lt;p>What is really painful about this event is that we had just completed the last bit of work we needed to do before changing our traffic routing so that it would hit the cloud first instead of the data centre first. This would not have avoided the outage we just experienced, but it would have made it a bit less severe.&lt;/p>
&lt;p>What is even more painful is that we had recently installed a &lt;em>third&lt;/em> network connection with an entirely different provider because we were worried about just this kind of situation. But this third connection wasn’t yet active.&lt;/p>
&lt;p>We already have a long list of tickets that we’ve created to address problems we faced in recovering from this outage. The list will undoubtedly grow as we complete a postmortem over the next few days. I will report back when we have more detail of what happened and have a solid plan for how to avoid anything similar in the future.&lt;/p>
&lt;p>We know that an outage of this severity and duration has caused a lot of people who depend on our services extra work and anxiety. For this, we apologise profusely.&lt;/p>
&lt;p>But at least we didn’t need to use an &lt;a href="https://twitter.com/cullend/status/1445156376934862848" target="_blank">angle grinder&lt;/a>.&lt;/p></description></item><item><title>2021 Board Election</title><link>https://www.crossref.org/blog/2021-board-election/</link><pubDate>Tue, 28 Sep 2021 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2021-board-election/</guid><description>&lt;p>We are pleased to share the 2021 board election slate. Crossref’s Nominating Committee received over 60 submissions from members worldwide to fill five open board seats. It was a fantastic group of applicants and showed the strength of our membership community.&lt;/p>
&lt;p>There are five seats open for election (three small, two large), and the &lt;a href="https://www.crossref.org/committees/nominating/">Nominating Committee&lt;/a> presents the following slate.&lt;/p>
&lt;h2 id="the-2021-slate">The 2021 slate&lt;/h2>
&lt;p>Candidate organisations, in alphabetical order, for the Small category (three seats available):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>California Digital Library, University of California&lt;/strong>, Lisa Schiff&lt;/li>
&lt;li>&lt;strong>Center for Open Science&lt;/strong>, Nici Pfeiffer&lt;/li>
&lt;li>&lt;strong>Melanoma Research Alliance&lt;/strong>, Kristen Mueller&lt;/li>
&lt;li>&lt;strong>Morressier&lt;/strong>, Sebastian Rose&lt;/li>
&lt;li>&lt;strong>NISC&lt;/strong>, Mike Schramm&lt;/li>
&lt;/ul>
&lt;p>Candidate organisations, in alphabetical order, for the Large category (two seats available):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>AIP Publishing (AIP)&lt;/strong>, Penelope Lewis&lt;/li>
&lt;li>&lt;strong>American Psychological Association (APA)&lt;/strong>, Jasper Simons&lt;/li>
&lt;li>&lt;strong>Association for Computing Machinery (ACM)&lt;/strong>, Scott Delman&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="here-are-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2021-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2021-slate/">Here are the candidates&amp;rsquo; organisational and personal statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process by voting in the election&lt;/h2>
&lt;p>If your organisation is a voting member in good standing of Crossref as of September 20, 2021, you are eligible to vote when voting opens on September 29, 2021.&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>On September 29, 2021, your organisation&amp;rsquo;s designated voting contact will receive an email with the Formal Notice of Meeting and Proxy Form with concise instructions on how to vote. You will also receive a user name and password with a link to our voting platform.&lt;/p>
&lt;p>The election results will be announced at the &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2021" target="_blank">LIVE21 online meeting&lt;/a> on November 9, 2021. Save the date!&lt;/p></description></item><item><title>Similarity Check news: iThenticate v2.0 ready for launch</title><link>https://www.crossref.org/blog/similarity-check-news-ithenticate-v2.0-ready-for-launch/</link><pubDate>Mon, 20 Sep 2021 00:00:00 +0000</pubDate><author>Fabienne Michaud</author><guid>https://www.crossref.org/blog/similarity-check-news-ithenticate-v2.0-ready-for-launch/</guid><description>&lt;p>Crossref Similarity Check news: iThenticate v2.0 ready for launch&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.64000/ad7s6-ag751" target="_blank">Last year&lt;/a>, we announced the upcoming launch of a new version of iThenticate, the product from Turnitin that powers &lt;a href="https://www.crossref.org/services/similarity-check">Crossref Similarity Check&lt;/a>. We know some of you have been waiting a long time for this upgrade and we are very happy to share with you that we are now ready to release it.&lt;/p>
&lt;p>We will be rolling out this new version in stages, so not everyone will be able to upgrade to the new version immediately. We&amp;rsquo;ll start with new Crossref Similarity Check subscribers who use iThenticate in the browser, and one member who uses iThenticate via the eJournalPress API integration.&lt;/p>
&lt;p>Next month, we will reach out to existing Crossref Similarity Check subscribers who use iThenticate in the browser (rather than through a manuscript tracking system), and further eJournalPress users.  From then on, we&amp;rsquo;ll be contacting those of you who use Similarity Check through your manuscript tracking system, as and when your providers are ready to work with the new version.&lt;/p>
&lt;h2 id="crossref-similarity-check---first-things-first">Crossref Similarity Check - first things first&lt;/h2>
&lt;p>Crossref Similarity Check is a content comparison tool, powered by iThenticate and produced by Turnitin, to check the originality of scholarly works and detect potential cases of plagiarism. Crossref members are eligible for this service, which offers them a reduced rate for document checking (plus enhanced functionality) in exchange for making their own published content available to be indexed into the iThenticate database.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/services/similarity-check">The Crossref Similarity Check service&lt;/a> continues to grow in membership (1,531 members in 2020; 1,964 members in 2021, to date) and in the number of documents checked (1,922,621 manuscripts checked between January and July 2020 and 2,419,612 over the same period this year).&lt;/p>
&lt;p>Just as with the current version of iThenticate, Crossref Similarity Check subscribers will be able to compare documents against a vast database of internet sources and over 78 million full-text documents contributed by the Crossref members that use the service:&lt;/p>
&lt;ul>
&lt;li>&lt;em>Crossref&lt;/em> - research articles, books, and conference proceedings provided by publishers of scholarly content all over the world&lt;/li>
&lt;li>&lt;em>Crossref posted content&lt;/em> - preprints, eprints, working papers, reports, dissertations, and many other types of content that has not been formally published but has been registered with Crossref&lt;/li>
&lt;li>&lt;em>Internet&lt;/em> - a database of archived and live publicly-available web pages, including billions of pages of existing content, and with tens of thousands of new pages added each day&lt;/li>
&lt;li>&lt;em>Publications&lt;/em> - third-party periodical, journal, and publication content including many major professional journals, periodicals, and business publications from sources other than Crossref Similarity Check members&lt;/li>
&lt;li>&lt;em>Your Indexed Documents&lt;/em> - other documents you have uploaded for checking (within your Crossref Similarity Check user account only, and not added to iThenticate&amp;rsquo;s main indexes)&lt;/li>
&lt;/ul>
&lt;h2 id="whats-new">What&amp;rsquo;s new&lt;/h2>
&lt;p>We are delighted to introduce the following new features and enhancements with iThenticate v2.0:&lt;/p>
&lt;ul>
&lt;li>Increased document upload capacity&lt;/li>
&lt;li>Suspicious and hidden character detection&lt;/li>
&lt;li>Preprint exclusion filter&lt;/li>
&lt;li>Refreshed and responsive interface&lt;/li>
&lt;li>Similarity reports - save and share&lt;/li>
&lt;li>Annotations&lt;/li>
&lt;li>Content portal&lt;/li>
&lt;li>Improved API&lt;/li>
&lt;/ul>
&lt;h3 id="increased-document-upload-capacity">Increased document upload capacity&lt;/h3>
&lt;p>This new version of iThenticate has an increased document upload capacity of up to 800 pages/200 MB and a Google Drive document upload functionality. Please note that per-document &lt;a href="https://www.crossref.org/fees/#similarity-check-fees">fees&lt;/a> allow for a maximum of 25,000 25,000 &lt;del>characters&lt;/del> (EDIT 21/11/4: words), as one billable unit (25,001-50,000 25,000 &lt;del>characters&lt;/del> (EDIT 21/11/4: words) is two billing units, and so on).&lt;/p>
&lt;h3 id="suspicious-or-hidden-character-detection">Suspicious or hidden character detection&lt;/h3>
&lt;p>A new &amp;lsquo;Red flag&amp;rsquo; feature, highlighted at the top right hand side of the Similarity report and with in-line markers, signals the detection of hidden text such as text/quotation marks in white font or suspicious character replacement  e.g., the substitution of a Latin e for a Cyrillic е or a Latin o for a Greek ο, which may have been deliberately added to avoid text-matching detection.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/red-flag.png"
alt="Red flag feature: Hidden characters in the iThenticate v2.0 Similarity report" width="80%">&lt;figcaption>
&lt;p>Red flag feature: Hidden characters in the iThenticate v2.0 Similarity report&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h3 id="preprint-exclusion-filter">Preprint exclusion filter&lt;/h3>
&lt;p>Increasingly, authors are making available a preprint of their article, either before or at the same time as submitting it to a journal. With Turnitin, we have therefore developed a new exclusion filter for &amp;lsquo;Preprint Sources&amp;rsquo;, which can be applied directly from your Similarity report.&lt;/p>
&lt;h3 id="refreshed-and-responsive-interface">Refreshed and responsive interface&lt;/h3>
&lt;p>The new iThenticate has a cleaner, more intuitive and accessible interface, with responsive design for ease of use on different screen sizes. The Similarity report is no longer a static image but a text that can be searched, copied and pasted. The display of matches has been improved and simplified with two views only: &amp;lsquo;Sources overview&amp;rsquo; and &amp;lsquo;All sources&amp;rsquo;.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/similarity-report.png"
alt="Similarity report in iThenticate v2.0" width="80%">&lt;figcaption>
&lt;p>Similarity report in iThenticate v2.0&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h3 id="similarity-reports---save-and-share">Similarity reports - save and share&lt;/h3>
&lt;p>You can now save Similarity reports as a PDF file and share them via email through the iThenticate interface with authors. Please note: this is still work in progress and enhancements to this feature will be released in the coming months.&lt;/p>
&lt;h3 id="annotations">Annotations&lt;/h3>
&lt;p>Annotations in Similarity reports is a brand new feature available in private mode only (in shared folders) in this initial release. Annotations will display the date, time and comments and can be edited or deleted as required. These private annotations will not be included in the &amp;lsquo;save and share&amp;rsquo; features mentioned above. Public, shareable, annotations will be included in a future release.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/annotations.png"
alt="Private annotations in the new Similarity report" width="80%">&lt;figcaption>
&lt;p>Private annotations in the new Similarity report&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h3 id="content-portal">Content portal&lt;/h3>
&lt;p>The new &amp;lsquo;Content portal&amp;rsquo; is a useful tool to check how much of your own published content has been successfully indexed into the iThenticate database and is now searchable. It will also help you self-diagnose and fix the content that has failed to be indexed.&lt;/p>
&lt;h3 id="improved-api-for-subscribers-who-integrate-similarity-check-with-their-manuscript-tracking-system">Improved API for subscribers who integrate Similarity Check with their manuscript tracking system&lt;/h3>
&lt;p>API users will benefit from a new integration with manuscript tracking systems which will allow the display of the largest matching word count and the top 5 source matches alongside the Similarity score.&lt;/p>
&lt;h2 id="whats-next">What&amp;rsquo;s next&lt;/h2>
&lt;p>We&amp;rsquo;re expecting a number of new features and enhancements to iThenticate version 2.0 as well as further manuscript tracking system API integrations in the coming months:&lt;/p>
&lt;ul>
&lt;li>User/usage reporting functionality&lt;/li>
&lt;li>Editorial Manager API integration&lt;/li>
&lt;li>Further enhancements to the Similarity report user interface&lt;/li>
&lt;li>Parent/child account management reporting, to assist Crossref Sponsors&lt;/li>
&lt;li>Public vs. private annotations&lt;/li>
&lt;li>Document resubmission flow&lt;/li>
&lt;li>Customisable welcome email&lt;/li>
&lt;/ul>
&lt;h2 id="well-keep-you-posted">We&amp;rsquo;ll keep you posted&lt;/h2>
&lt;p>We will post updates here as soon as new features, enhancements and API integrations are  available and/or we are ready to upgrade the next group of members.&lt;/p>
&lt;p>We&amp;rsquo;ll be contacting subscribers in stages to upgrade you to the new version, so keep your eyes open for an email from us. As you know, you have to supply full-text Similarity Check URLs in your Crossref metadata for over 90% of your own published content in order to be eligible for the service. We&amp;rsquo;ll be checking that anyone who wants to upgrade to v2.0 is still at 90% or above. You can check this yourself in advance on our &lt;a href="https://www.crossref.org/documentation/similarity-check/participate/eligibility/">eligibility checker&lt;/a> - if you&amp;rsquo;ve fallen below 90%, the tool will give you instructions for adding your missing full-text Similarity Check URLs.&lt;/p>
&lt;p>In the meantime, you will find the Similarity Check service documentation for the &lt;a href="https://www.crossref.org/documentation/similarity-check">current&lt;/a> version of iThenticate on our website. The documentation for the new version can be found on the &lt;a href="https://help.turnitin.com/crossref-similarity-check.htm" target="_blank">Crossref Similarity Check site provided by Turnitin&lt;/a>.&lt;/p>
&lt;p>✏️ Do get in touch via &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> if you have any questions or suggestions or start a discussion on &lt;a href="https://community.crossref.org" target="_blank">our Community Forum&lt;/a>&lt;/p></description></item><item><title>Lesson learned, the hard way: Let’s not do that again!</title><link>https://www.crossref.org/blog/lesson-learned-the-hard-way-lets-not-do-that-again/</link><pubDate>Wed, 08 Sep 2021 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/lesson-learned-the-hard-way-lets-not-do-that-again/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>We missed an error that led to resource resolution URLs of some 500,000+ records to be incorrectly updated. We have reverted the incorrect resolution URLs affected by this problem. And, we’re putting in place checks and changes in our processes to ensure this does not happen again.&lt;/p>
&lt;h2 id="how-we-got-here">How we got here&lt;/h2>
&lt;p>Our technical support team was contacted in late June by Wiley about updating resolution URLs for their content. It&amp;rsquo;s a common request of our technical support team, one meant to make the URL update process more efficient, but this was a particularly large request. Shortly thereafter, we were provided with nearly 1,200 separate files by Atypon on behalf of Wiley in order to update the resolution URLs of ~9 million records. We manually spot checked over 50 of these files, because, prior to this issue, our technical support team did not have a mechanism to automatically check for errors. That labor intensive review did not turn up any problems. That is, those 50 samples had no errors with the headers, like were found later.&lt;/p>
&lt;p>Among the files we didn’t check, there were headers included in the files with different owning &lt;code>fromPrefix&lt;/code> and acquiring &lt;code>toPrefix&lt;/code> members’ DOI prefixes. In a URL update request, the prefixes should always be the same.&lt;/p>
&lt;p>And still other files included requests to update records with DOIs that had never even been registered. Here are some examples:&lt;/p>
&lt;p>&lt;sub>H:email=support@crossref.org;fromPrefix=&lt;strong>10.5555&lt;/strong>;toPrefix=&lt;strong>10.5555&lt;/strong>&lt;br>
10.5555/doi1 &lt;a href="http://www.newurl.com/whatever" target="_blank">http://www.newurl.com/whatever&lt;/a>&lt;br>
10.5555/doi2 &lt;a href="http://www.newurl.com/whatever2" target="_blank">http://www.newurl.com/whatever2&lt;/a>&lt;/sub>&lt;/p>
&lt;p>In the example above, these fictional DOIs are both under prefix 10.5555. Thus, the result of this request will ONLY be that the resolution URLs of DOI 10.5555/doi1 and 10.5555/doi2 are updated in the metadata.&lt;/p>
&lt;p>&lt;sub>H:email=support@crossref.org;fromPrefix=&lt;strong>10.5555&lt;/strong>;toPrefix=&lt;strong>10.9876&lt;/strong> &lt;br>
10.5555/doi1 &lt;a href="http://www.newurl.com/whatever" target="_blank">http://www.newurl.com/whatever&lt;/a>&lt;br>
10.5555/doi2 &lt;a href="http://www.newurl.com/whatever2" target="_blank">http://www.newurl.com/whatever2&lt;/a>&lt;/sub>&lt;/p>
&lt;p>In this second example, these fictional DOIs are both under prefix 10.5555, but because the &lt;code>toPrefix&lt;/code> in the header differs from the &lt;code>fromPrefix&lt;/code>, the result of this request will be that the resolution URLs of 10.5555/doi1 and 10.5555/doi2 are updated in the metadata AND the owning prefix of both records will be transferred from prefix 10.5555 to prefix 10.9876.&lt;/p>
&lt;p>We kicked off the URL update request on 30 June and all legitimate DOIs whose files were free of errors were updated by 7 July (yes, it takes about a week to update the resolution URLs for ~9 million records).&lt;/p>
&lt;p>On 9 July, Peter Strickland of the International Union of Crystallography, one of 22 members affected by this mistake, contacted us to enquire how/why much of their content was resolving to incorrect URLs and why ownership of their content appeared within our &lt;a href="https://search.crossref.org/" target="_blank">search interface&lt;/a> to be Wiley. Peter was rightly concerned. We were, too. Our technical support team quickly elevated this issue, because, frankly, this is not the first time our finicky URL update process has caused unwanted metadata updates, albeit not quite at this volume.&lt;/p>
&lt;h2 id="how-we-investigated-the-problem">How we investigated the problem&lt;/h2>
&lt;p>We rallied our internal team. We investigated and discovered that we believed that some ~600,000 DOIs were erroneously included and updated in the requested 1,200 files. We later extended that estimate to include other conditions, in order to be as cautious as we could, to over 1 million DOIs. In the end, we determined that the incorrect files attempted updates of 1,228,041 DOIs. Due to the errors in the files (i.e., erroneous headers and non-registered DOIs), we only actually updated and then reverted 520,512 DOIs. The other 700,000+ DOIs were never updated (because of errors in the original files provided to us) or simply had never been registered with us.&lt;/p>
&lt;p>Prior to this mistake, Crossref had never reverted a member’s metadata update before. To be clear, and as I said above, we have had other URL update mistakes over the years, like this one; they were just smaller in scale. We knew there were holes in our process that needed to be plugged. And we knew we needed a better solution for members to manage these updates themselves without our manual intervention. So, while there were mistakes made in the files supplied to us, this was our error and we’re fixing it; more on that below.&lt;/p>
&lt;p>For this situation, we quickly realized that reversion of the metadata update was the best option for us, albeit we did not have an existing process in place to execute that reversion. That’s because we only keep the current version of each metadata record. We couldn’t back out of the change; we couldn’t simply restore these records to the metadata registered with us as of late June, because we no longer had an easily accessible, central record of those previous resolution URLs. What we did have was a record of all the previous submissions made against each DOI, so our technical team, focused their efforts there.&lt;/p>
&lt;h2 id="how-we-fixed-all-those-records">How we fixed all those records&lt;/h2>
&lt;p>We had two errors to correct: the ownership transfers (those records that had inadvertent and mismatched from/to prefixes) and the incorrect resolution URLs. We reverted all of the ownership transfers on 9 July and then double and triple checked that ownership during the week of 12 July to ensure we didn’t miss anything.&lt;/p>
&lt;p>The resolution reversion was more complicated. We invested in creating a patch to identify the records that had been updated by our team, and then extract the last legitimate resolution URL registered with us by the owning member in order to revert the metadata for each record. In order to provide confidence that this mistake was contained, we also built a check into the patch to ensure that those DOIs that did have their ownership temporarily transferred were not updated during the few days that ownership was incorrect. That check helped us determine that none of the 520,512 DOIs were incorrectly updated beyond this mistaken URL update request.&lt;/p>
&lt;p>The technical team built and tested this patch. The tests turned up gaps in the patch, so we refined it during the week of 2021 July 12. We kicked off the reversion of these records on Monday, 19 July at 20:05 UTC and the patch completed all reversions at 20:14 UTC, Thursday, 22 July.&lt;/p>
&lt;p>In the end, we successfully reverted all of the resolution URLs for those 520,512 DOIs we identified; provided &lt;a href="https://status.crossref.org/incidents/5cn1m2nw88rd" target="_blank">daily updates&lt;/a> and apologies to the 22 affected members; together we worked some longer hours; and persevered.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/wiley-urls-slack.png"
alt="Ed updates everyone internally on the situation and thanks all the people who worked together to resolve the issue" width="80%">&lt;figcaption>
&lt;p>Ed updates everyone internally on the situation and thanks all the people who worked together to resolve the issue&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h2 id="next-up">Next up&lt;/h2>
&lt;p>We don&amp;rsquo;t want this to ever happen again. Like, never. We clearly need to make changes to our internal processes to prevent this in the future.&lt;/p>
&lt;p>Here’s what’s ahead:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>We are building &lt;a href="https://gitlab.com/crossref/user_stories/-/issues/651" target="_blank">a checker&lt;/a> that we can run URL update files through to automate and our checks. This means we will be able to check every single file in a large batch, rather than relying on manual and labor intensive spot-checking;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>As said above, one compounding issue in this mistake was the mismatched from/to prefixes in the file headers. Our technical support team uses the same file headers to transfer ownership/stewardship of a record or set of records between members AND to update resolution URLs. These two tasks are almost never legitimately completed in the same file. That is, there is usually a lag between ownership transfers and resolution URL updates (most members will request an ownership transfer and then a month or two later update their URLs). Because of this, simply &lt;a href="https://gitlab.com/crossref/user_stories/-/issues/650" target="_blank">decoupling these two tasks&lt;/a> (feel free to follow our work at this link) would help eliminate a glaring risk, so we’re working on that too;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Lastly, we’re researching ways we can &lt;a href="https://gitlab.com/crossref/issues/-/issues/1444" target="_blank">streamline resource resolution URL updates&lt;/a>. You can also monitor our progress on this one. No promises or specifics yet, but we’re eager to reduce toil on our technical support team, avoid problems like this one, and provide members safe and straightforward ways to better update your metadata.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Thanks for the support of the whole Crossref team and our community - and for reading this far! Never a dull moment&amp;hellip;&lt;/p></description></item><item><title>Crossref Conversations: audio blog about helping open science</title><link>https://www.crossref.org/blog/crossref-conversations-audio-blog-about-helping-open-science/</link><pubDate>Fri, 20 Aug 2021 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/crossref-conversations-audio-blog-about-helping-open-science/</guid><description>&lt;p>Crossref Conversations is an audio blog we&amp;rsquo;re trying out that will cover various topics important to our community. This conversation is between colleagues Anna Tolwinska and Rosa Morais Clark, discussing how we can make research happen faster, with fewer hurdles, and how Crossref can help. Our members have been asking us how Crossref can support open science, and we have a few insights to share. So we invite you to have a listen.&lt;/p>
&lt;p>[&lt;em>UPDATE: Since this recording ROR IDs are now part of the Crossref schema.&lt;/em>]&lt;/p>
&lt;iframe title="Crossref Conversations: Anna and Rosa in conversation about open science" allowtransparency="true" height="150" width="100%" style="border: none; min-width: min(100%, 430px);" scrolling="no" data-name="pb-iframe-player" src="https://www.podbean.com/player-v2/?i=49ggy-1085496-pb&amp;from=pb6admin&amp;share=1&amp;download=1&amp;rtl=0&amp;fonts=Arial&amp;skin=f6f6f6&amp;font-color=auto&amp;btn-skin=60a0c8">&lt;/iframe>
&lt;h2 id="helpful-links">Helpful links&lt;/h2>
&lt;p>Here are links to all the sources mentioned in the recording.&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://docs.google.com/document/d/1wSmDfoX1SjeSnsX6L5JKzB3pMBLicEcClEBOQIEKJro/edit?usp=sharing" target="_blank">Recording transcript&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/blog/">Lots of great information on our blog&lt;/a>&lt;/li>
&lt;li>&lt;a href="mailto:feedback@crossref.org">Send questions to: feedback@crossref.org&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://community.crossref.org/" target="_blank">Let&amp;rsquo;s continue the conversation on our Community Forum&lt;/a>&lt;/li>
&lt;li>&lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 20/20 - great information about how richer more open metadata can make research happen faster&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.24343/C34W2H" target="_blank">Crossref’s Board votes to adopt the Principles of Open Scholarly Infrastructure (POSI)&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.64000/2dkpt-h4159" target="_blank">Helping researchers identify content they can text mine&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Thanks for listening!&lt;/p></description></item><item><title>Some rip-RORing news for affiliation metadata</title><link>https://www.crossref.org/blog/some-rip-roring-news-for-affiliation-metadata/</link><pubDate>Mon, 26 Jul 2021 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/some-rip-roring-news-for-affiliation-metadata/</guid><description>&lt;p>We’ve just added to our input schema the ability to include affiliation information using ROR identifiers. Members who register content using XML can now include ROR IDs, and we’ll add the capability to our manual content registration form, participation reports, and metadata retrieval APIs in the near future. And we are inviting members to a &lt;a href="https://crossref.zoom.us/webinar/register/WN_M5EFzTZCSBqsnbWiMMmMLQ" target="_blank">Crossref/ROR webinar&lt;/a> on 29th September at 3pm UTC.&lt;/p>
&lt;h2 id="the-background">The background&lt;/h2>
&lt;p>We’ve been working on the &lt;a href="https://ror.org" target="_blank">Research Organisation Registry (ROR)&lt;/a> as a community initiative for the last few years. Along with the California Digital Library and DataCite, our staff has been involved in setting the strategy, planning governance and sustainability, developing technical infrastructure, hiring/loaning staff, and engaging with people in person and online. In our view, it’s the best current model of a collaborative initiative between like-minded &lt;a href="http://openscholarlyinfrastructure.org" target="_blank">open scholarly infrastructure (OSI)&lt;/a> organisations.&lt;/p>
&lt;p>Last year, Project Manager Maria Gould described &lt;a href="https://doi.org/10.64000/wxc0w-hcq28" target="_blank">the case for publishers adopting ROR&lt;/a> and ROR was ranked the number one priority at our last in-person annual meeting. Now it’s time that Crossref’s services themselves took up the baton to meet the growing demand.&lt;/p>
&lt;p>The inclusion of ROR in the Crossref metadata will help everyone in the scholarly ecosystem make critical connections more easily. For example, research institutions need to monitor and measure their output by the articles and other resources their researchers have produced. Journals need to know with which institutions authors are affiliated to determine eligibility for institutionally sponsored publishing agreements. Funders need to be able to discover and track the research and researchers they have supported. Academic librarians need to easily find all of the publications associated with their campus.&lt;/p>
&lt;p>Earlier this month, GRID and ROR &lt;a href="https://www.digital-science.com/press-release/grid-passes-torch-to-ror" target="_blank">announced&lt;/a> that after working together to seed the community-run Research Organisation Registry, GRID would be retiring from public service and handing the proverbial torch over to ROR as the scholarly community’s reliable universal open identifier for affiliations. That means that our members who have been using GRID now need to consider their move to ROR and think about how they can add ROR IDs into the metadata that they manage and share through Crossref.&lt;/p>
&lt;h2 id="the-plan">The plan&lt;/h2>
&lt;p>We’ve been able to include ROR IDs for our grant metadata schema as affiliation information for two years, since July 2019. And the Australia Research Data Commons (ARDC) was the first member to add ROR IDs to the Crossref system in 2020. In early July, we completed the work to accept ROR IDs for affiliation assertions for all other types of records with an &lt;code>affiliation&lt;/code> or &lt;code>institution&lt;/code> element, such as journal articles, book chapters, preprints, datasets, dissertations, and many more.&lt;/p>
&lt;p>Next, we will commence the plans to support ROR in our other tools and services, such as Participation Reports. We’ll work on alignment with the Open Funder Registry and share our plans to collect the information via the &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">new user interface we’re developing for registering and managing metadata&lt;/a>. Open Journal Systems (OJS) already has a ROR Plugin, developed by the German National Library of Science and Technology (TIB). This supports the collection of ROR IDs and future releases of this plugin and the OJS DOI plugin will allow including ROR IDs in the metadata sent to Crossref, to support thousands of our members to share ROR IDs via their Crossref metadata.
We also aim to add ROR to our metadata retrieval options, including the REST API, which recently saw the start of an unblocking with our &lt;a href="https://doi.org/10.64000/nxwqn-x9m73" target="_blank">move to a more robust technical foundation&lt;/a>.&lt;/p>
&lt;h2 id="the-call-for-participation">The call for participation&lt;/h2>
&lt;p>Many Crossref publishers, funders, and service providers are already planning to integrate ROR with their systems, &lt;a href="https://ror.readme.io/docs/map-other-organisation-id-types-to-ror" target="_blank">map their affiliation data to ROR&lt;/a>, and include ROR in Crossref metadata. In addition to publishers and funders, libraries, repositories, and other stakeholders are developing support for ROR. For example, the &lt;a href="https://journalcheckertool.org" target="_blank">Plan S Journal Checker tool&lt;/a> uses ROR IDs to let people check whether a particular journal is compliant with an author&amp;rsquo;s funder and institutional open access policies. In addition, the ROR website shows a growing list of &lt;a href="https://ror.org/integrations" target="_blank">active and in-progress ROR integrations&lt;/a>.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/crossref-ror-workflow-diagram.png" width="100%">
&lt;/figure>
&lt;p>Crossref members registering research grants via Altum’s ProposalCentral system can already add ROR IDs. Now those registering articles, books, preprints, datasets, dissertations, and other research objects, can start including much clearer and all-important affiliation metadata as part of their content registration going forward. As with all newly-introduced metadata elements, we recommend adding ROR IDs from now and ongoing, but planning a distinct project to backfill older records. We know that more than 80% of records have been updated and enriched at least once with additional and cleaner metadata, so as members do this routinely, they can include ROR IDs alongside updating URLs, license or funding information, and other metadata.&lt;/p>
&lt;p>For information on how ROR will be supported in the Crossref metadata, take a look at &lt;a href="https://gitlab.com/crossref/schema/-/releases/0.2.0" target="_blank">our latest schema release (version 5.3.0) &lt;/a> or in this &lt;a href="https://gitlab.com/crossref/schema/-/blob/master/best-practice-examples/journal.article5.3.0.xml" target="_blank">journal article example XML&lt;/a>.&lt;/p>
&lt;p>Join the discussion in our forum below and register for the &lt;a href="https://www.crossref.org/services/content-registration/">Crossref/ROR webinar on September 29th at 3pm UTC&lt;/a> to learn all you need to know about incorporating ROR into your Crossref metadata.&lt;/p></description></item><item><title>RFP: Help evaluate the reach and effects of metadata</title><link>https://www.crossref.org/blog/rfp-help-evaluate-the-reach-and-effects-of-metadata/</link><pubDate>Wed, 21 Jul 2021 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/rfp-help-evaluate-the-reach-and-effects-of-metadata/</guid><description>&lt;p>UPDATE, 14 October 2021:&lt;/p>
&lt;p>We received several excellent proposals in response to this RFP and we’d like to thank everyone involved for their time and enthusiasm.&lt;/p>
&lt;p>We are excited to announce the two projects that have been selected, to run through early 2023. Stay tuned!&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>&lt;strong>With or Without: Measuring Impacts of Books Metadata&lt;/strong>&lt;br>
This project will test the premise that academic books metadata improves discoverability and usage by assessing the impact of book chapter records with DOIs (unique from metadata associated with the entire book) with associated chapter and book attributes. The study aims to prove or disprove its hypothesis and rank metadata attributes by their association with successful content discovery and access. The findings will be considered alongside similar metadata research in order to develop a metadata efficacy framework, which can be used to determine the return on metadata investments by publishers and service providers.&lt;/p>
&lt;p>&lt;strong>Lettie Y. Conrad and Michelle Urberg&lt;/strong>, Independent consultants&lt;/p>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>&lt;strong>Metadata For Everyone&lt;/strong>&lt;br>
This project will explore the metadata quality, consistency and completeness from various individual journals and communities. The project will pay special attention to elements that are most likely to vary across cultures, such as names and those that are potentially multi-lingual, with the understanding that metadata issues do not affect nor impact all communities in the same way.&lt;/p>
&lt;p>&lt;strong>Juan Pablo Alperin&lt;/strong>, Associate Director of Research, Public Knowledge Project &amp;amp; Co-Director, Scholarly Communications Lab&lt;br>
&lt;strong>Mike Nason&lt;/strong>, Scholarly Communications &amp;amp; Publishing Librarian, University of New Bruinswick Libraries&lt;br>
&lt;strong>Marco Tullney&lt;/strong>, Head of Publishing Services &amp;amp; Coordination Open Access at TIB – Leibniz Information Centre for Science and Technology&lt;/p>
&lt;/div>
&lt;/div>
&lt;hr>
&lt;p>We’re excited (and a little nervous) to launch a new research project designed to assess the effects of metadata on research communications. We’re expecting this effort to be a significant contribution to the
&lt;a href="https://doi.org/10.3897/rio.5.e38698" target="_blank">existing research&lt;/a> on the topic and we’re really looking forward to getting started. We’re also a little nervous because of course we don’t know what the conclusions will be (after all, if we did, we wouldn’t be starting this project).&lt;/p>
&lt;h3 id="assume-nothing">Assume nothing&lt;/h3>
&lt;p>It seems logical and very widely accepted that more and better metadata leads to good things. Does it? If so, how and how do we know that? What does the ‘before and after’ look like when metadata is corrected or enhanced? There are so many questions, so many stakeholders and enough variation around record types (books come to mind) and disciplines (hello citation styles) that the topic warrants all the attention it gets and more. This project is designed to be very broad in scope, sampling from various criteria, and is expected to last about a year.&lt;/p>
&lt;h3 id="interested-in-getting-involved">Interested in getting involved?&lt;/h3>
&lt;p>If you’re a researcher involved in scientometrics or bibliometrics or if you’re a consultant with
experience in original research, please have a read of &lt;a href="https://www.crossref.org/pdfs/metadata-reach-and-return-rfp-2021.pdf">the RFP&lt;/a> and get in touch with a statement of interest by 1st September or with questions in the meantime. We’re looking for an individual, research group or organisation that will work with us over the course of the project to define terms, finalize the approach, analyze the data and communicate the results, whatever they may be.&lt;/p>
&lt;p>&lt;strong>RFP responses are requested by 1st September&lt;/strong> so don’t hesitate to &lt;a href="mailto:feedback@crossref.org?subject=Metadata reach and return RFP">get in touch&lt;/a> with questions.&lt;/p>
&lt;p>If you’re interested in the project but not in responding to the RFP, you may still be able to help. We
would appreciate wide circulation of this announcement to help us find qualified respondents to the RFP so
please do share this with your network. And, of course, we hope you stay tuned for the outcome of the
work. Check back with us on that in about a year&amp;hellip;&lt;/p></description></item><item><title>Behind the scenes improvements to the REST API</title><link>https://www.crossref.org/blog/behind-the-scenes-improvements-to-the-rest-api/</link><pubDate>Tue, 06 Jul 2021 00:00:00 +0000</pubDate><author>Patrick Polischuk</author><guid>https://www.crossref.org/blog/behind-the-scenes-improvements-to-the-rest-api/</guid><description>&lt;p>UPDATE, 24 August 2021: All pools have been migrated to the new Elasticsearch-backed API, which already appears to be more stable and performant than the outgoing Solr API. Please report any issues via our &lt;a href="https://gitlab.com/crossref/issues" target="_blank">Crossref issue repository in Gitlab&lt;/a>.&lt;/p>
&lt;p>UPDATE, 9 August 2021: The cutovers for the polite and Plus pools are delayed again. We&amp;rsquo;re still working to ensure acceptable performance and stability before serving responses from the new application and infrastructure. Each cutover is currently delayed by one more week&amp;ndash;the polite pool is scheduled for 2021 August 17 and the Plus pool is scheduled for 2021 August 24.&lt;/p>
&lt;p>UPDATE, 2 August 2021: The cutovers for the polite and Plus pools are delayed. We&amp;rsquo;ve been mirroring traffic to the new polite pool and want to ensure acceptable performance and stability before serving responses from the new application and infrastructure. Each cutover is currently delayed by one week&amp;ndash;the polite pool is scheduled for 2021 August 10 and the Plus pool is scheduled for 2021 August 17.&lt;/p>
&lt;p>UPDATE, 13 July 2021: The first stage of the cutover is complete, so requests to the public pool are now being served by the new REST API. We took a slightly different approach to performing the cutover, so the &amp;ldquo;&lt;a href="https://doi.org/10.64000/nxwqn-x9m73#documentation" target="_blank">Documentation&lt;/a>&amp;rdquo; and &amp;ldquo;&lt;a href="https://doi.org/10.64000/nxwqn-x9m73#temporary-domain" target="_blank">Temporary domain&lt;/a>&amp;rdquo; sections below have been updated.&lt;/p>
&lt;hr>
&lt;p>Our &lt;a href="https://api.crossref.org" target="_blank">REST API&lt;/a> is the primary interface for anybody to fetch the metadata of content registered with us, and we&amp;rsquo;ve been working hard on a more robust REST API service that&amp;rsquo;s about to go live.&lt;/p>
&lt;p>The REST API is free to use and it gets around 300 million requests each month (we encourage users to adhere to our &lt;a href="https://github.com/CrossRef/rest-api-doc#etiquette" target="_blank">etiquette guidelines&lt;/a> to keep things running smoothly). It is used for &lt;a href="https://doi.org/10.64000/3gpwy-1qd71" target="_blank">bibliometric studies&lt;/a>, by &lt;a href="https://doi.org/10.64000/ggq0z-30r05" target="_blank">platforms like Dimensions&lt;/a>, by &lt;a href="https://doi.org/10.64000/49mpx-hpr56" target="_blank">organisations like the National Library of Sweden&lt;/a>, and to support &lt;a href="https://www.crossref.org/services/metadata-retrieval/user-stories/">countless other efforts&lt;/a>.&lt;/p>
&lt;p>We also offer enhanced access to our APIs and other services with &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus&lt;/a>, and we recommend it for production services and others that benefit from guaranteed up-time, a higher rate limit, and priority support from our helpful staff.&lt;/p>
&lt;p>For a while now, we&amp;rsquo;ve been working to migrate the REST API from &lt;a href="https://solr.apache.org/" target="_blank">Solr&lt;/a> to &lt;a href="https://www.elastic.co/" target="_blank">Elasticsearch&lt;/a> and from our datacenter to a cloud platform in order to address issues of scalability and extensibility.&lt;/p>
&lt;p>We&amp;rsquo;re pleased to announce that we&amp;rsquo;ll be cutting over to the Elasticsearch-backed version of the REST API over the next few weeks, beginning July 13. This cutover will occur one pool at a time&amp;ndash;the public pool will be migrated first, followed by the polite pool on August 3, and the Plus pool on August 10 (see &amp;rsquo;etiquette&amp;rsquo; link above if you&amp;rsquo;re unfamiliar with our different pools). &lt;em>Please note updates at the top of this post for changes to the original schedule.&lt;/em>&lt;/p>
&lt;p>We&amp;rsquo;ve thoroughly tested the functionality and performance of the new REST API, and we&amp;rsquo;d like to invite you to test it out before we move production traffic to the new service. Try out your favorite API queries at &lt;a href="https://api.production.crossref.org" target="_blank">https://api.production.crossref.org/&lt;/a>. &lt;/p>
&lt;h2 id="feature-parity-but-note-a-few-differences">Feature parity, but note a few differences&lt;/h2>
&lt;p>One of our primary objectives was to maintain feature parity between the old and new services, avoiding any breaking changes that might cause problems for existing services integrating with the REST API. We implemented a regression test suite which has given us the confidence to make such a foundational change. During the course of this project, we found it necessary and a good opportunity to make a few modifications. In each case, we analyzed usage and aimed to avoid making any breaking changes. We hope these represent improvements to the behavior and consistency of the REST API.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>The &lt;code>group-title&lt;/code> filter uses exact matching. This filter previously worked but was undocumented and unsupported.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>directory&lt;/code> filter is deprecated. This was meant to be an experimental, unsupported filter, and the data has not met the standard we require.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>affiliation&lt;/code> facet returns counts of affiliation strings rather than counts of terms within affiliation fields (thus resolving &lt;a href="https://github.com/CrossRef/rest-api-doc/issues/405" target="_blank">this Github issue&lt;/a>).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Cursors may be used to page through results from the /members, /funders, and /journals routes, in addition to /works.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>While we suggest that everyone use cursors for pagination, we still support the &lt;code>offset&lt;/code> functionality. We have introduced a limit of 80000 for offset values for the /members /funders and /journals routes&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;code>offset&lt;/code> behavior is slightly changed, now applying to the sum of rows and offsets rather than just offsets.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>published&lt;/code> field is now present in API responses.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>/licenses&lt;/code> route returns paged results.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Sorting by &lt;code>submitted&lt;/code> is no longer supported. This was never officially supported or documented.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>/quality&lt;/code> route has been removed. This was an undocumented, experimental feature.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Funder name in &lt;code>/works&lt;/code> metadata is the name provided by the publisher.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Empty &lt;code>relation&lt;/code> fields correctly return an empty object.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Only &lt;code>ISBN&lt;/code> and &lt;code>isbn-type&lt;/code> for a record will be returned. ISBNs for associated volumes will be omitted.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>institution&lt;/code> field is a list.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;code>query&lt;/code> uses different &lt;a href="https://en.wikipedia.org/wiki/Stop_word" target="_blank">stop word&lt;/a> defaults, though we expect querying to remain roughly the same.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>API responses may feature slightly different scores, as they come from different backends.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="some-technical-notes-on-the-cutover">Some technical notes on the cutover&lt;/h2>
&lt;h3 id="documentation">Documentation&lt;/h3>
&lt;p>The above changes are documented in our new REST API documentation, which is now automatically generated via Swagger, resulting in more comprehensive coverage and more efficient feature development. During the cutover, the right documentation for you will depend on which pool you are using. The documentation for the new API can be found by visiting the &lt;a href="https://api.crossref.org" target="_blank">API in a browser&lt;/a>, or by navigating to &lt;a href="https://api.crossref.org/help" target="_blank">https://api.crossref.org/help&lt;/a>; and the docs for the old API remain here: &lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">https://github.com/CrossRef/rest-api-doc&lt;/a>. The Github-hosted documentation will be deprecated once the cutover is complete.&lt;/p>
&lt;p>This may not come as news, but bears repeating as we mentioned GitHub. We have moved our source code repositories from GitHub to &lt;a href="https://gitlab.com/crossref" target="_blank">GitLab&lt;/a>, including all of our issue tracking.&lt;/p>
&lt;h3 id="temporary-domain">Temporary domain &lt;/h3>
&lt;p>UPDATE: We ended up performing the public pool cutover via reverse proxies instead of redirects&amp;ndash;please disregard the note about temporary domains below. The &lt;code>api.crossref.org&lt;/code> domain will remain the domain regardless of which pool you&amp;rsquo;re using or where we are in the cutover process.&lt;/p>
&lt;p>&lt;del>Please note that the &lt;code>api.production.crossref.org&lt;/code> domain is a temporary domain we are using during this cutover period. Traffic will be redirected to the new service one pool at a time via a &lt;code>307&lt;/code> http redirect. Once the cutover is complete, we will go back to using the &lt;code>api.crossref.org&lt;/code> domain. Do not update any software, scripts, libraries, tools, etc. to use the temporary domain.&lt;/del>&lt;/p>
&lt;h3 id="differences-in-query-results">Differences in query results&lt;/h3>
&lt;p>Due to inherent differences in how Solr and Elasticsearch perform queries and rank results, you may see slightly different results when comparing the old and new services. If for whatever reason your workflow involves using multiple API pools (which we don&amp;rsquo;t recommend), you may see inconsistent results. &lt;/p>
&lt;h3 id="cursor-behavior">Cursor behavior&lt;/h3>
&lt;p>&lt;a href="https://github.com/CrossRef/rest-api-doc#deep-paging-with-cursors" target="_blank">Cursors&lt;/a> may break if your script is paging through results at the exact moment the cutover is performed, and you should retry your request once the release is complete. We will post the precise maintenance window to &lt;a href="https://status.crossref.org/" target="_blank">https://status.crossref.org/&lt;/a>.&lt;/p>
&lt;h3 id="filing-issues">Filing issues&lt;/h3>
&lt;p>Feature requests and bug reports should be filed into the &lt;a href="https://gitlab.com/crossref/issues" target="_blank">Crossref issue repository in Gitlab&lt;/a> during this testing phase and once the new Elasticsearch-backed API is live in production.&lt;/p>
&lt;h2 id="coming-next">Coming next&lt;/h2>
&lt;p>While we hope the benefits of improved stability and extensibility are as exciting to you as they are to us, &amp;ldquo;feature parity&amp;rdquo; may not be the most thrilling message for our API users. In truth, one of the more exciting aspects of completing this migration is the end of the code freeze we instituted at the start of this effort. Now, we can work on new feature development and a continuous stream of bug fixes. We also improved the automatic test coverage as part of the work, meaning we can deliver features with greater confidence.&lt;/p>
&lt;p>The first new feature we&amp;rsquo;ll be delivering via the REST API will be support for the &amp;ldquo;grants&amp;rdquo; record type, allowing for the retrieval of metadata for &lt;a href="https://www.crossref.org/community/grants/">grants that have been registered with us&lt;/a>, now numbering over 20,000 from 8 different funder members. This work is well underway and will be released once we are confident that the new REST API is stable in production. From there, we&amp;rsquo;ll continue to select the highest priority issues from our &lt;a href="https://gitlab.com/groups/crossref/-/boards/1270983?scope=all&amp;amp;utf8=%E2%9C%93&amp;amp;label_name[]=Planning%3A%3ABacklog&amp;amp;label_name[]=Service%3A%3AREST%20API" target="_blank">REST API backlog&lt;/a>.&lt;/p>
&lt;p>As always, should you have any questions about our REST API, check out the &lt;a href="https://www.crossref.org/services/metadata-retrieval/">metadata retrieval section&lt;/a> of our website, start a &lt;a href="https://community.crossref.org/c/metadata-retrieval/27" target="_blank">discussion on our community forum&lt;/a>, file a Gitlab issue as mentioned above, or you can contact us via &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p></description></item><item><title>DOAJ and Crossref sign agreement to remove barriers to scholarly publishing for all</title><link>https://www.crossref.org/blog/doaj-and-crossref-sign-agreement-to-remove-barriers-to-scholarly-publishing-for-all/</link><pubDate>Mon, 21 Jun 2021 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/doaj-and-crossref-sign-agreement-to-remove-barriers-to-scholarly-publishing-for-all/</guid><description>&lt;p>&lt;em>22 June 2021, London, UK and Boston, MA, USA&lt;/em> — The future of global open access publishing received a boost today with the signing of a Memorandum of Understanding between the Directory of Open Access Journals (DOAJ) and Crossref. The MOU formalizes an already strong partnership between the two organisations and furthers their shared pursuit of an open scholarly communications ecosystem that is inclusive of emerging publishing communities.&lt;/p>
&lt;p>Both organisations aim to encourage the dissemination and use of scholarly research using open infrastructure, online technologies, regional and international networks, and community partners - all supporting local institutional capacity and sustainability around the world.&lt;/p>
&lt;blockquote>
&lt;p>“DOAJ is delighted to be formalizing today’s agreement with Crossref, an organisation we are already closely aligned with. Together we stand a greater chance of encouraging an open, fair, and fully inclusive future for scholarly publishing,” said &lt;a href="https://doaj.org/about/team" target="_blank">Lars Bjørnshauge&lt;/a>, DOAJ Founder and Managing Director.&lt;/p>
&lt;/blockquote>
&lt;p>The agreement will enable content from journals indexed on DOAJ to be more easily identified through the use of Crossref metadata. The MOU also covers the exchange of a variety of services and information and greater coordination of technical and strategic requirements between DOAJ and Crossref. Included too is the development of outreach and training materials, coordination of service and feature development, as well as research studies to explore the overlaps and gaps in the journals and metadata covered by each organisation.&lt;/p>
&lt;blockquote>
&lt;p>“As academic-led journals continue to grow in number and geographic reach, it’s important we support this community more effectively. Our partnership with DOAJ means we can share strategies, data, and resources in order to lower barriers for emerging publishers around the world,” said &lt;a href="https://www.crossref.org/people">Ginny Hendricks&lt;/a>, Crossref’s Director of Member &amp;amp; Community Outreach.&lt;/p>
&lt;/blockquote>
&lt;h3 id="about-doaj">About DOAJ&lt;/h3>
&lt;p>DOAJ is a community curated online directory that indexes and provides access to high quality, open access, peer reviewed journals. DOAJ deploys more than one hundred carefully selected volunteers from among the community of library and other academic disciplines to assist in the curation of open access journals. This independent database contains over 15,000 peer-reviewed open access journals covering all areas of science, technology, medicine, social sciences, arts and humanities. DOAJ is financially supported worldwide by libraries, publishers and other like-minded organisations. DOAJ services (including the evaluation of journals) are free for all, and all data provided by DOAJ are harvestable via OAI/PMH and the API. See &lt;a href="https://doaj.org" target="_blank">doaj.org&lt;/a> for more information.&lt;/p>
&lt;h3 id="about-crossref">About Crossref&lt;/h3>
&lt;p>Crossref makes research objects easy to find, cite, link, assess, and reuse. We’re a not-for-profit membership organisation that exists to make scholarly communications better. We rally the community; tag and share metadata; run an open infrastructure; play with technology; and make tools and services—all to help put research in context. Visit &lt;a href="https://www.crossref.org" target="_blank">crossref.org&lt;/a> for further information.&lt;/p>
&lt;p>Please contact &lt;a href="mailto:louise@doaj.org">louise@doaj.org&lt;/a> or &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a> with any questions.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/doaj-crossref-twitter-post-new.png" width="80%">
&lt;/figure></description></item><item><title>Event Data: Help us fill in the gaps</title><link>https://www.crossref.org/blog/event-data-help-us-fill-in-the-gaps/</link><pubDate>Fri, 11 Jun 2021 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/event-data-help-us-fill-in-the-gaps/</guid><description>&lt;p>&lt;em>UPDATE August 2, 2021: This work was awarded to Laura Paglione of the &lt;a href="https://sphericalcowgroup.com" target="_blank">Spherical Cow Group&lt;/a>.&lt;/em>&lt;/p>
&lt;p>To date, we have collected around &lt;a href="http://api.eventdata.crossref.org/v1/events?rows=0" target="_blank">740 million&lt;/a> events from 12 different source since we launched our &lt;a href="https://www.crossref.org/services/event-data/" target="_blank">Event Data service&lt;/a> service in 2017. Each event is an online mention of the research associated with a DOI, either via the DOI directly or using the associated URL. However, we know that there is much more out there. Because of this, we would like to explore where we could expand.&lt;/p>
&lt;p>We invite proposals to conduct a gap analysis for Event Data sources, looking at what we currently collect and seeing what more could be added. For the most relevant new sources, we are seeking an estimate of the effort to include them, and establish whether it is possible: we know that there are sources that are paywalled or with restrictive licensing not compatible with Event Data.&lt;/p>
&lt;p>The aim of the project is to identify a list of potential new sources. With community input, we will look to add a number of these to Event Data in the future based on needs and priorities.&lt;/p>
&lt;p>For full details of the requirements and how to make a proposal, see &lt;a href="https://www.crossref.org/pdfs/event-data-gap-analysis-rfi.pdf">here&lt;/a>. The deadline for proposals is 11 July 2021 and we anticipate that the work will be completed by the end of October 2021.&lt;/p></description></item><item><title>An Advisory Group for Preprints</title><link>https://www.crossref.org/blog/an-advisory-group-for-preprints/</link><pubDate>Wed, 09 Jun 2021 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/an-advisory-group-for-preprints/</guid><description>&lt;p>We are delighted to announce the formation of a new &lt;a href="https://www.crossref.org/working-groups/preprints" target="_blank">Advisory Group&lt;/a> to support us in improving preprint metadata. Preprints have grown in popularity over the last few years, with increasing focus brought by the need to rapidly disseminate knowledge in the midst of a global pandemic. We have supported metadata deposits for preprints under the record type &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/posted-content-includes-preprints/" target="_blank">‘posted content’&lt;/a> since 2016, and members currently register a total of around 17,000 new preprints metadata records each month.&lt;/p>
&lt;p>As preprints develop and different practices arise, we are keen to re-examine the metadata schema: to do this properly we need community input. We want to ensure that the schema is fit for purpose and supports the diversity of ways in which preprints are posted, linked with other objects, and used. Metadata schema need regular review, and this is just one example of a number of areas we are looking to update. Several topics we see as a high priority for preprints are better notification for when a preprint has been withdrawn or removed, accurate recording of versioning, and better indication of preprint server names.&lt;/p>
&lt;p>We have invited a number of organisations we know to be active in this area, and are looking forward to some very positive discussions. Participants span five continents and include members who post preprints, indexing services, and others with significant experience in the area of preprints. The first meeting took place earlier this week and brought up a diverse range of themes that will be tackled in future meetings.&lt;/p></description></item><item><title>Time to put the "R" back in "R&amp;D"</title><link>https://www.crossref.org/blog/time-to-put-the-r-back-in-rd/</link><pubDate>Mon, 07 Jun 2021 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/time-to-put-the-r-back-in-rd/</guid><description>&lt;p>It is time to put the &amp;lsquo;R&amp;rsquo; back into R&amp;amp;D.&lt;/p>
&lt;p>The Crossref R&amp;amp;D team was originally created to focus on the kinds of research projects that have allowed Crossref to make transformational technology changes, launch innovative new services, and engage with entirely new constituencies. Some Illustrious projects that had their origins in the R&amp;amp;D group include:&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/labs-logo-ribbon.svg" width="100%">
&lt;/figure>
&lt;/div>
&lt;ul>
&lt;li>DOI Content Negotiation&lt;/li>
&lt;li>Similarity Check (originally CrossCheck)&lt;/li>
&lt;li>ORCID (originally Author DOIs)&lt;/li>
&lt;li>Crossmark&lt;/li>
&lt;li>The Open Funder Registry&lt;/li>
&lt;li>The Crossref REST API&lt;/li>
&lt;li>Linked Clinical Trials&lt;/li>
&lt;li>Event Data&lt;/li>
&lt;li>Grant registration&lt;/li>
&lt;li>ROR&lt;/li>
&lt;/ul>
&lt;p>And for each project that has graduated, there have been several that have not. Some projects were simply designed to gather data. Others just didn’t generate enough interest. You are not truly experimenting if you don’t fail occasionally too.&lt;/p>
&lt;p>Recently we’ve been doing very little experimenting of any kind. Instead, the R&amp;amp;D team has mostly been seconded to the software development team to help them through a period of organisational and process change. We would not have made it through the past two years without their help.&lt;/p>
&lt;p>But now we’re ready to focus on more ‘R’ and less ‘D’. And to that end, we are increasing the size of the team as well. &lt;a href="https://www.crossref.org/people/rachael-lammey/">Rachael Lammey&lt;/a> will be joining the team as Head of Strategic Initiatives. She will work alongside our Principal R&amp;amp;D Developers, &lt;a href="https://www.crossref.org/people/esha-datta/">Esha Datta&lt;/a> and &lt;a href="https://www.crossref.org/people/dominika-tkaczyk/">Dominika Tkaczyk&lt;/a>. Together they will be able to engage with new communities and immediately start experimenting with ways in which Crossref might be able to address their needs and use-cases.&lt;/p>
&lt;p>We hope to soon add to our list of distinguished R&amp;amp;D project alumni.&lt;/p>
&lt;h2 id="rationale--details">Rationale &amp;amp; details&lt;/h2>
&lt;div style="float:left;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/creature1.svg" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>The Crossref R&amp;amp;D group (AKA &amp;ldquo;Labs&amp;rdquo;) has been the incubator of many services that are now in production and which form a fundamental part of Crossref&amp;rsquo;s identity and value. Similarity Check, ORCID, Crossmark, Open Funder Registry, The REST API, Linked Clinical Trials, and Event Data all started as R&amp;amp;D projects. More recently the enhancement of our reference matching infrastructure and the development and launch of ROR were also R&amp;amp;D projects.&lt;/p>
&lt;p>And prior to the formation of the outreach group in 2015, the R&amp;amp;D group also led a critical function engaging with communities that, at the time, Crossref only had tangential connections with: &lt;a href="https://pkp.sfu.ca/" target="_blank">PKP&lt;/a>; &lt;a href="https://doaj.org/" target="_blank">DOAJ&lt;/a>; funders; and the data and altmetrics communities.&lt;/p>
&lt;p>But since the R&amp;amp;D group merged with the technology team back in 2019, we have done very little &amp;ldquo;R.&amp;rdquo; and very little community engagement of our own. Instead, the R&amp;amp;D team has supported the development team through a period of major cross-cutting projects and organisational change. Dominika has led the REST API rewrite and Esha&amp;mdash;when she is not acting as technical lead on ROR&amp;mdash;has also worked on the API rewrite and has kept Crossref metadata search on its feet. We would not have been able to make it through the past few years without their help.&lt;/p>
&lt;p>Throughout this period, Rachael Lammey has continued the vital work of identifying, engaging with, and advocating for members of our community who we previously didn&amp;rsquo;t even know were members of our community.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/creature2.svg" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>The strength of the R&amp;amp;D group was that it combined outreach, product, and development functions. It was not only able to engage with new constituencies, but to quickly experiment with ways in which Crossref might be able to serve them. Previously, members of the R&amp;amp;D team would return from a conference or workshop that no Crossref member had ever attended before with a set of new contacts and ideas for new services and tools. They&amp;rsquo;d form interest groups and develop prototypes. Sometimes the interest groups would lead nowhere and sometimes the prototypes would be discarded. But critically, some of them would turn into the major services and organisations that now form a foundational part of open scholarly infrastructure.&lt;/p>
&lt;p>And this is why it makes so much sense for Rachael to join the R&amp;amp;D team. The group is most effective when it is able to engage with new communities and immediately start experimenting with ways in which Crossref might be able to address their needs and use-cases. Rachael&amp;rsquo;s extensive experience in both product management and outreach&amp;mdash;combined with Esha and Dominika&amp;rsquo;s experience leading development projects&amp;mdash;is exactly what we need to reinvigorate the group and put the R back into R&amp;amp;D.&lt;/p>
&lt;p>To kick off, we are going to be working on some small-ish, discrete projects. These include:&lt;/p>
&lt;ul>
&lt;li>Better matching and linking of preprints to published articles;&lt;/li>
&lt;li>Extending our journal title classification to cover all journal and conference proceedings titles; and&lt;/li>
&lt;li>Tools to allow us to community-source structured metadata correction information and feed it back to our members.&lt;/li>
&lt;/ul>
&lt;div style="float:left;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/creature3.svg" width="80%">
&lt;/figure>
&lt;/div>
&lt;p>We will consult with and update the community on the kinds of projects we are working on through regular tech updates and a revitalised &lt;a href="https://www.crossref.org/labs">Labs&lt;/a> area of our website.&lt;/p>
&lt;p>Oh- and we will certainly be designing some new Labs creatures. &lt;br>
&amp;ndash;G&lt;/p></description></item><item><title>The road ahead: our strategy through 2025</title><link>https://www.crossref.org/blog/the-road-ahead-our-strategy-through-2025/</link><pubDate>Thu, 03 Jun 2021 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/the-road-ahead-our-strategy-through-2025/</guid><description>&lt;p>This announcement has been in the works for some time, but everything seems to take longer when there is a pandemic going on, including finding time and headspace to plan out our strategy for the next few years.&lt;/p>
&lt;p>Over the last year or so we have had our heads down addressing how to scale our 20-yr-old system and operation &amp;ndash; and adapting to new ways of working. But we&amp;rsquo;ve also spent time talking to people, forging alliances, looking ahead, and making plans. So we&amp;rsquo;re happy to now let everyone know exactly what we&amp;rsquo;ve been up to lately, what we are heading towards in 2025, and what projects and programs are prioritised on our near-term agenda.&lt;/p>
&lt;h2 id="tldr">Tl;dr&lt;/h2>
&lt;ul>
&lt;li>Introducing the &lt;a href="https://www.crossref.org/strategy">new Crossref strategy through 2025&lt;/a>, extending the one we published in 2018&lt;/li>
&lt;li>There are now two additional strategic goals, to make six: bolstering our team; living up to POSI&lt;/li>
&lt;li>Good progress has been made in reducing operational and technical debt - a lot of learning too&lt;/li>
&lt;li>We&amp;rsquo;re unblocking stuff to get more done, including expanding R&amp;amp;D (more on that next week)&lt;/li>
&lt;li>We have &lt;a href="http://bit.ly/crossref-roadmap" target="_blank">a new public roadmap&lt;/a> 🎉&lt;/li>
&lt;li>Come to next week&amp;rsquo;s mid-year update webinar to hear what&amp;rsquo;s happening and up next.&lt;/li>
&lt;/ul>
&lt;h2 id="the-emergence-of-a-strategic-agenda">The emergence of a strategic agenda&lt;/h2>
&lt;p>2018 seems like a decade ago, doesn&amp;rsquo;t it? Back then we set out a &lt;a href="https://www.crossref.org/strategy/archive-2018">2018-2021 strategic direction&amp;mdash;now archived&lt;/a>&amp;mdash;that described four goals: adapt to expanding constituencies; simplify and enrich services; selectively collaborate and partner with others; and improve our metadata quality and comprehensiveness. These themes were formed from the output of a &lt;a href="https://doi.org/10.64000/fhxf7-cnw95" target="_blank">planning exercise&lt;/a> with our board in mid-2017 which tackled scenarios that remain true today, including: the increasing diversity in scholarly publishing (library-publishing, academic-led journals, shifting geographic dominance, etc.); the growth in preprints and other content formats; the sustainability of scholarly publishing (who is funding it and whether that is an expanding or shrinking pool); and the increase in policy and regulation in this space.&lt;/p>
&lt;p>That meeting was the catalyst for embracing openness and a broader set of constituents. It was also decisive about Crossref’s role in this evolving community to focus on our core competencies, defined as:&lt;/p>
&lt;blockquote>
&lt;ol>
&lt;li>A reputation as a trusted, neutral one-stop source of metadata and services&lt;/li>
&lt;li>Managing scholarly infrastructure with technical knowledge and innovation&lt;/li>
&lt;li>Convening and facilitating scholarly community collaboration.&lt;/li>
&lt;/ol>
&lt;/blockquote>
&lt;p>So you can see how we got to focusing on metadata, services, infrastructure, and broad community collaboration.&lt;/p>
&lt;h2 id="ahh-2019-such-an-innocent-time">Ahh, 2019, such an innocent time&lt;/h2>
&lt;p>When we wrote our post at the end of 2019 &lt;a href="https://doi.org/10.64000/mmdqs-23829" target="_blank">A turning point is a time for reflection&lt;/a> we highlighted&amp;mdash;with data&amp;mdash;how different the Crossref community is nowadays. The post also linked to &lt;a href="https://docs.google.com/presentation/d/1RsqtnHssBkaFNphdWoq20_ewruYP04n8j_dYB9wvphM/edit#slide=id.g65af51c04a_1_238" target="_blank">the results of our &amp;lsquo;value&amp;rsquo; research project&lt;/a> and a &lt;a href="https://doi.org/10.13003/y8ygwm5" target="_blank">fact file&lt;/a> which had even more hard data and posed the question &lt;strong>Which Crossref initiatives should be top or bottom priorities?&lt;/strong>. To answer that, the LIVE19 annual meeting group voted (using betting chips) on priority initiatives, with the following results:&lt;/p>
&lt;ol>
&lt;li>Support and implement ROR   &lt;i class='fa fa fa-trophy font-medium font-crossref-yellow'>&lt;/i>
&lt;/li>
&lt;li>Metadata best practices and principles&lt;/li>
&lt;li>Support for multiple languages&lt;/li>
&lt;li>Address technical and operational debt&lt;/li>
&lt;li>Schema updates such as JATS and CRediT&lt;/li>
&lt;li>Engagement with funders&lt;/li>
&lt;/ol>
&lt;p>We all know what happened next: the collective health and social trauma of the COVID-19 pandemic. All of us struggled. You all did too. Homeschooling, homeworking, homestaying. Caring for&amp;mdash;and even saying goodbye to&amp;mdash;sick friends and family. Also &lt;a href="https://doi.org/10.64000/3y94q-ftp55" target="_blank">beloved colleagues&lt;/a>. Alongside these unfamiliar new stresses, members were joining in growing numbers, funders kept joining to register grants, conferences went online and we loved them (before then hating them), the number of records we hosted kept going up, and publishing (especially preprints) skyrocketed.&lt;/p>
&lt;p>The plan hasn&amp;rsquo;t actually changed much. Those charts in the 2019 fact file still make for remarkable reading as those same trends continue. We simply haven&amp;rsquo;t had time to update people on where we are with plans. So it&amp;rsquo;s high time we give an update on these priorities as well as contextualise them in longer-term goals.&lt;/p>
&lt;h2 id="but-first-some-framing">But first, some framing&lt;/h2>
&lt;p>The chart below shows the approach we took to organise our thinking. A lot of it isn&amp;rsquo;t new; we have had the current &lt;a href="https://www.crossref.org/commuinity/about/">mission statement, key messages&lt;/a> (rally, tag, run, play, make), and &lt;a href="https://www.crossref.org/truths/">truths&lt;/a> since the rebranding work in 2015/2016. More recently, we have added POSI to our values, describing the principles and rules by which we operate as a committed open scholarly infrastructure organisation.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/crossref-strategic-framework.png" width="100%">
&lt;/figure>
&lt;br>
We already have a lot of 'words'. So why do we also need a vision statement and where do the goals fit in? In order to prioritize the things we will work on first, we need to be able to track everything to a higher vision, ensuring that everything we do is working toward an agreed destination. When we have organisation-wide goals, it means that everyone is clear on the direction, is able to prioritize individual and team work, and can see how their contribution fits in. This, in turn, instills confidence, and motivation - amongst staff as well as members and users.
&lt;p>Our working vision statement (feedback needed!) is:&lt;/p>
&lt;blockquote>
&lt;p>We envision a rich and reusable open network of relationships connecting research organisations, people, things, and actions; a scholarly record that the global community can build on forever, for the benefit of society.&lt;/p>
&lt;/blockquote>
&lt;p>A vision is, of course, shared. It isn&amp;rsquo;t Crossref-specific but describes the world in which we all want to work together in future.&lt;/p>
&lt;h2 id="now-for-those-contextual-six-goals">Now for those contextual six goals&lt;/h2>
&lt;p>Full details are on the new &lt;a href="https://www.crossref.org/strategy">strategy&lt;/a> page but here&amp;rsquo;s a summary below.&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>&lt;a href="https://www.crossref.org/strategy/#bolster-the-team"> &lt;img src="https://www.crossref.org/images/community-images/bolster.svg" alt="Bolster the team" width="100%" align="left" /> &lt;/a>
&lt;br>
This goal is all about people, support, culture, and resilience. Not just because we&amp;rsquo;re coming through a panedmic, but also because we&amp;rsquo;re growing and we need to be able to scale and manage growth more purposefully, with appropriate policies, fees, and resources.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/strategy/#live-up-to-posi"> &lt;img src="https://www.crossref.org/images/community-images/live.svg" alt="Live up to POSI" width ="100%" align="left"/> &lt;/a>
&lt;br>
We published a &lt;a href="https://doi.org/10.64000/hzemx-j7n79" target="_blank">POSI self-assessment&lt;/a> earlier this year and like-minded initiatives are &lt;a href="http://openscholarlyinfrastructure.org/posse/" target="_blank">following suit&lt;/a>. This is a stated goal because we want to be held publicly accountable to the Principles of Scholarly Infrastructure standards of governance, insurance, and sustainability.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/strategy/#engage-with-expanding-communities"> &lt;img src="https://www.crossref.org/images/community-images/engage.svg" alt="Engage with expanding communities" width="100%" align="left"/> &lt;/a>
&lt;br>
This goal centres on growth, strengthening relationships, community facilitation, and content. Working with a growing number of Sponsors helps us lower barriers to participation around the world, including in languages other than English. Expanding the support we offer for research funders and institutions are priorities.&lt;/p>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;p>&lt;a href="https://www.crossref.org/strategy/#improve-our-metadata"> &lt;img src="https://www.crossref.org/images/community-images/improve.svg" alt="Improve our metadata" width ="100%" align="left" /> &lt;/a>
&lt;br>
This goal involves researching and communicating the value of richer, connected, and reusable, open metadata, and incentivising people to meet best practices, while also making it possible (and easier) to do so.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/strategy/#collaborate-and-partner"> &lt;img src="https://www.crossref.org/images/community-images/collab.svg" alt="Collaborate &amp; partner" width ="100%" align="left"/> &lt;/a>&lt;br>
&lt;br>
We&amp;rsquo;ve always collaborated but we want to work even more closely with like-minded organisations to solve problems together. Perhaps in future we could also partner with others to find operating efficiencies for our overlapping stakeholders.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/strategy/#simplify-and-enrich-services"> &lt;img src="https://www.crossref.org/images/community-images/simplify.svg" alt="Simplify &amp; enrich services" width ="100%" align="left"/> &lt;/a>&lt;br>
&lt;br>
This goal is all about focus. And about delivering easy-to-use tools that are critically important for our community. A lot of invisible work has been happening behind the scenes; we&amp;rsquo;ve been strengthening (and will continue to strengthen) our code-base (while opening up all code) in order to unblock some of the initiatives we know people have been waiting for.&lt;/p>
&lt;/div>
&lt;/div>
&lt;p>Read more about what projects are included in the above goals in our full &lt;a href="https://www.crossref.org/strategy">2025 strategic agenda&lt;/a>.&lt;/p>
&lt;h2 id="youre-invited-to-a-mid-year-update-webinar">You&amp;rsquo;re invited to a mid-year update webinar&lt;/h2>
&lt;p>Rather than saving everything for our annual&amp;mdash;usually November&amp;mdash;meeting, we&amp;rsquo;ll also do a mid-year update and plan to do so in May or June every year from now on, in addition to the November updates which include the board election and governance and budget information.&lt;/p>
&lt;p>This year, we&amp;rsquo;re covering some of the main product development work we have completed, underway, and planned for the next quarter. We&amp;rsquo;ll run it live twice - once for those nearby The Americas timezones (&lt;a href="https://bit.ly/theroadahead-June8" target="_blank">June 8th 3pm UTC&lt;/a>) and once for those nearby Asia Pacific timezones (&lt;a href="https://bit.ly/theroadahead-June9" target="_blank">June 9th 6am UTC&lt;/a>). We have a lot to cover in 90 minutes&amp;mdash;including unveiling [our public roadmap[(http://bit.ly/crossref-roadmap)]&amp;mdash;but we&amp;rsquo;re going to try really hard to have a few minutes to discuss questions too.&lt;/p>
&lt;blockquote>
&lt;p>In the meantime, or indeed anytime, join the discussion over on our community forum - see the discussion below and join in on our &lt;a href="https://community.crossref.org/t/the-road-ahead-our-strategy-through-2025-crossref" target="_blank">forum&lt;/a>.&lt;/p>
&lt;/blockquote>
&lt;p>We want to be held accountable to these goals so we’re reliant on you, as a community, to let us know what you think of our &lt;a href="https://www.crossref.org/strategy">2025 strategic agenda&lt;/a>. As always; we’re grateful for your support and advice.&lt;/p></description></item><item><title>Our annual open call for board nominations</title><link>https://www.crossref.org/blog/our-annual-open-call-for-board-nominations/</link><pubDate>Thu, 27 May 2021 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/our-annual-open-call-for-board-nominations/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/committees/nominating/">Crossref&amp;rsquo;s Nominating Committee&lt;/a> is inviting expressions of interest to join the Board of Directors of Crossref for the term starting in 2022. The committee will gather responses from those interested and create the slate of candidates that our membership will vote on in an election in September. Expressions of interest will be due Friday, June 25th, 2021.&lt;/p>
&lt;h3 id="board-roles-and-responsibilities">Board roles and responsibilities&lt;/h3>
&lt;p>The role of the board at Crossref is to provide strategic and financial oversight of the organisation, as well as guidance to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ul>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ul>
&lt;p>The board is representative of our membership base and guides the staff leadership team on trends affecting scholarly communications. The board sets strategic directions for the organisation while also providing oversight into policy changes and implementation. Board members have a fiduciary responsibility to ensure sound operations. Board members do this by attending board meetings, as well as joining more specific board committees.&lt;/p>
&lt;p>Crossref’s services provide central infrastructure to scholarly communications. Crossref’s board helps shape the future of our services, and by extension, impacts the broader scholarly ecosystem. We are looking for board members to contribute their experience and perspective.&lt;/p>
&lt;h3 id="who-can-apply-to-join-the-board">Who can apply to join the board?&lt;/h3>
&lt;p>Any active member of Crossref can apply to join the board. Crossref membership is open to organisations that produce content, such as academic presses, commercial publishers, standards organisations, and research funders. In fact, this year the board has specifically included in the committee’s remit to “propose at least one name from a funder member for the current round of elections.”&lt;/p>
&lt;p>There is a link at the bottom of this post to submit your expression of interest.&lt;/p>
&lt;h3 id="what-is-expected-of-board-members">What is expected of board members?&lt;/h3>
&lt;p>Board members attend three meetings each year that typically take place in March, July, and November. Meetings have taken place in a variety of international locations and travel support is provided when needed. Following travel restrictions as a result of COVID-19, the board adopted a plan to convene at least one of the board meetings virtually each year and all committee meetings take place virtually. Most board members sit on at least one Crossref committee. Care is taken to accommodate the wide range of timezones in which our board members live.&lt;/p>
&lt;p>While the expressions of interest are specific to an individual, the seat that is elected to the board belongs to the member organisation. The primary board member also names an alternate who may attend meetings in the event that the primary board member is unable to. There is no personal financial obligation to sit on the board. The member organisation must remain in good standing.&lt;/p>
&lt;p>Board members are expected to be comfortable assuming the responsibilities listed above and to prepare and participate in board meeting discussions.&lt;/p>
&lt;h3 id="about-the-election">About the election&lt;/h3>
&lt;p>The board is elected through the “one member, one vote” policy wherein every member organisation of Crossref has a single vote to elect representatives to the Crossref board. Board terms are for three years, and this year there are five seats open for election.&lt;/p>
&lt;p>The board maintains a balance of seats, with eight seats for smaller members and eight seats for larger members (based on total revenue to Crossref). This is in an effort to ensure that the diversity of experiences and perspectives of the scholarly community are represented in decisions made at Crossref.&lt;/p>
&lt;p>This year we will elect two of the large member seats (membership tiers $3,900 and above) and three of the small member seats (membership tiers $1,650 and below). You don’t need to specify which seat you are applying for. We will provide that information to the nominating committee.&lt;/p>
&lt;p>The election takes place online and voting will open in September. Election results will be shared at the November board meeting and new members will commence their term in 2022.&lt;/p>
&lt;h3 id="about-the-nominating-committee">About the nominating committee&lt;/h3>
&lt;p>The nominating committee will review the expressions of interest and select a slate of candidates for election. The slate put forward will exceed the total number of open seats. The committee considers the statements of interest, organisational size, geography, gender, and experience.&lt;/p>
&lt;p>2021 Nominating Committee:&lt;/p>
&lt;ul>
&lt;li>Liz Allen, F1000/Taylor &amp;amp; Francis, London, UK, committee chair&lt;/li>
&lt;li>Melissa Harrison, eLife, Cambridge, UK&lt;/li>
&lt;li>Andrew Joseph, Wits University Press, Johannesburg, South Africa&lt;/li>
&lt;li>Abel Packer, SciELO, São Paulo, Brazil&lt;/li>
&lt;li>Lisa Scott, New England Journal of Medicine, Boston, USA&lt;/li>
&lt;/ul>
&lt;h3 id="how-do-you-apply-to-join-the-board">How do you apply to join the board?&lt;/h3>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSe1UxsGdUkBL7z8ByfKviQAoJcmbnb5zk1qYzIp0XikzsXkbg/viewform?usp=sf_link" target="_blank">click here to submit your expression of interest&lt;/a> or contact &lt;a href="mailto:lofiesh@crossref.org">me&lt;/a>.&lt;/p></description></item><item><title>Service Provider perspectives: A few minutes with our publisher hosting platforms</title><link>https://www.crossref.org/blog/service-provider-perspectives-a-few-minutes-with-our-publisher-hosting-platforms/</link><pubDate>Mon, 24 May 2021 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/service-provider-perspectives-a-few-minutes-with-our-publisher-hosting-platforms/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/community/service-providers/">Service Providers&lt;/a> work on behalf of our members by creating, registering, querying and/or displaying metadata. We rely on this group to support our schema as it evolves, to roll out new and updated services to members and to work closely with us on a variety of matters of mutual interest. Many of our Service Providers have been with us since the early days of Crossref. Others have joined as scholarly communications has grown and services have evolved. Though fewer than 20 in number, their impact far outweighs the size of the group.&lt;/p>
&lt;p>They, like us, work with a great variety of members and have a broad view into publishing trends. In this post, we focus on views from some of the publishing hosting platform Service Providers, who&amp;rsquo;ve taken the time to share their thoughts on a few questions:&lt;/p>
&lt;h4 id="what-is-the-biggest-change-youve-experienced-working-with-publisher-metadata-over-the-last-few-years-and-how-have-you-adapted-to-it">What is the biggest change you&amp;rsquo;ve experienced working with publisher metadata over the last few years and how have you adapted to it?&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>It has become more and more important that not only the DOIs are registered with the minimum of necessary metadata to get the DOIs registered, but that a most complete set of metadata is being sent along &amp;ndash; including author identifiers, funding information, abstracts, licenses, to support other Crossref services and improve discoverability.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; de Gruyter&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Our clients are increasingly aware of the key role metadata plays in the effective dissemination of research. With an increasing number of published articles and a clear domination of &amp;ldquo;search engines&amp;rdquo; and aggregation of content, metadata is the primary means of making sure that publications reach the right audience. Publishers&amp;rsquo; value-add includes not just copy editing, formatting, and packaging, but also now creating journal articles for the digital age that are discoverable and well linked to the research corpus. Furthermore, we sense a clear move toward standardization, which goes beyond the structure to introduce standardized semantics: adopting common taxonomies for classifying content in different dimensions.  Our response is to introduce effective, automated and consistent services that capture, and surface metadata throughout the value chain from authoring to publication and search.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Atypon&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Highwire&amp;rsquo;s publishers are always looking to use the latest DTD (Document Type Definition) for the content to stay up to current standards. Currently this would be JATS 1.2. They are choosing to remain current so that they can stay on top of all or new metadata that can enrich their deposits. We have handled this well and offer support for the latest version of DTD when they are released, but some publishers are not always familiar with what can/should be deposited with their content and this can be a learning process for them.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; MPS Limited&lt;/cite>&lt;/p>
&lt;/div>
&lt;h4 id="how-do-you-explain-to-clients-and-others-why-correct-quality-metadata-is-important">How do you explain to clients (and others!) why correct, quality metadata is important?&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>In the digital age, metadata is the key to enabling effective content consumption. Publications that cannot be effectively discovered are of little value. We can only increase the impact of research with &amp;ldquo;discoverable&amp;rdquo; and &amp;ldquo;machine readable&amp;rdquo; publications. So ensuring correct and quality metadata is the key to optimizing not only the processing (finding the right journal, editor, reviewers) but also to positioning each publication properly.  As the volume of published scientific research increases, article metadata is the way forward &amp;mdash; it  brings &amp;ldquo;order&amp;rdquo; and enables our community to manage this volume.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Atypon&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Highwire always positions itself as &amp;ldquo;good content in&amp;rdquo; means &amp;ldquo;good content out&amp;rdquo;. This is true for our own content stores. Strong and valid metadata will result in valid and strong deposits. We explain this to all new clients on-boarded with Highwire and the use of current standards and for current client projects where content should/can be enriched through re-load.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; MPS Limited&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Getting our journals to care about metadata is a two step process: First, make sure they understand how metadata will help their journal succeed (i.e. why it matters to them). Second, make it easy for them to produce metadata while minimizing the cost, time, or complexity of their workflow.
The first step – making a case for why metadata matters – is often easier than you&amp;rsquo;d think. At the very least, most journal editors understand that metadata, e.g., JATS or DOI registration, is an important signifier of professionalism / prestige. In other words, they see that top journals publish metadata and want the same for their journal.
From a more technical standpoint, metadata is important because that&amp;rsquo;s the format computers understand and, like it or not, the publishing ecosystem relies on computers to deliver all sorts of critical services – such as indexing, archiving, and discoverability. So, if you&amp;rsquo;re not publishing metadata, you&amp;rsquo;re likely missing the benefit of these services. The second step – making it easy to produce metadata – is more difficult. Journal editors generally understand metadata matters but often lack the technical skills or resources necessary to create metadata.
This is where a platform, such as Scholastica, can be very helpful. Because platforms work with many journals, they can invest in tools to automate the creation of metadata, reducing costs for all their clients. For example, most platforms offer integrations to support automatic DOI registration. At Scholastica, we&amp;rsquo;re pushing this idea even further with automatic integration to more complicated services such as PubMed Central. By reducing cost and complexity, we can help new or small-budget journals have the same quality metadata normally reserved for large, established journals.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Scholastica&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>We are sending other publishers&amp;rsquo; metadata to academic libraries and distribution channels. Erroneous metadata will have a direct impact on how discoverable a title may be. The more uniform and correct the metadata, the better it will be indexed in other places.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; de Gruyter&lt;/cite>&lt;/p>
&lt;/div>
&lt;h4 id="what-is-the-one-industry-development-or-trend-youre-most-excited-about-for-the-near-future-and-why">What is the one industry development or trend you’re most excited about for the near future and why?&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Open Science and the ability to deliver research with the tools for reproducing it is the most exciting and game changing trend. Technology has enabled the output of science to transition from two-dimensional printed text delivery into globally accessible and responsive web-based delivery. We are now taking the next steps to further leverage web technology to enhance research output with rich assets ranging from audio and video, datasets, executable code, high-resolution imagery, interactive applications and more. As more assets accompany research publications, viewing these assets as modular, individually citable, and reusable becomes a requirement. We are reviewing the whole research output flow from authoring to publishing, and most importantly to its dissemination through the myriad of discovery tools now available.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Atypon&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>The move of everything to the cloud &amp;ndash; this is changing and improving our infrastructure, our possibility to scale and to stay on top of technological development.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; de Gruyter&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Thanks very much to the interviewees for their time and thoughts. We look forward to working with our entire Service Provider group on questions like these and many more. If you&amp;rsquo;d like more details, you can read about our &lt;a href="https://www.crossref.org/community/service-providers/">Service Provider program&lt;/a> or contact &lt;a href="mailto:feedback@crossref.org">me&lt;/a> for more information.&lt;/p></description></item><item><title>Next steps for Content Registration</title><link>https://www.crossref.org/blog/next-steps-for-content-registration/</link><pubDate>Mon, 17 May 2021 00:00:00 +0000</pubDate><author>Sara Bowman</author><guid>https://www.crossref.org/blog/next-steps-for-content-registration/</guid><description>&lt;div class="shortcode-divwrap yellow-highlight">
&lt;span>UPDATE, December 2025: &lt;em>The legacy Metadata Manager interace will be &lt;a href="https://doi.org/10.64000/ys7s6-pwn71" target="_blank">switched off on 1 January 2026&lt;/a>. We have been in touch with affected members throughout the year with guidance and resources on making the switch to our &lt;a href="https://www.crossref.org/documentation/register-maintain-records/record-registration-form/">newest helper tool&lt;/a> or &lt;a href="https://www.crossref.org/documentation/register-maintain-records/choose-content-registration-method/">alternative content registration methods&lt;/a>.&lt;/em>&lt;/span>
&lt;/div>
---
&lt;p>Hi, I’m Sara, one of the Product Managers here at Crossref. I joined the team in April 2020, primarily tasked with looking after Content Registration mechanisms. Prior to Crossref, I worked on open source software to support scientific research. I’ve learned a lot in the last year about how our community works with us, and I’m looking forward to working more closely with you in the coming year to improve Content Registration tools.&lt;/p>
&lt;p>Just over a year ago, we updated you on the status of &lt;a href="https://doi.org/10.64000/1a52b-7pf27" target="_blank">Metadata Manager&lt;/a>. TL;DR: We learned that our approach with the tool wasn’t flexible enough to easily and quickly add other record types or update the input schema, and paused new development. We’re back with another update on Metadata Manager and our strategy for Content Registration user interfaces (UIs) going forward.&lt;/p>
&lt;h2 id="our-helper-tools-for-content-registration">Our helper tools for Content Registration&lt;/h2>
&lt;p>The bulk of content registered with us is done so programmatically; that is, our members’ (or their service providers’) machines talking to our machines using our APIs. But, there are plenty of our members that don’t have the technical expertise to work with us this way. For those members, we provide various helper tools to assist with manual content registration.&lt;/p>
&lt;p>We offer a variety of interfaces for registering many different types of content, including Web Deposit form for most record types, Metadata Manager for journal content, and Simple Text Query to register references. Each of these has its own use cases and limitations, leading to a confusing and inconsistent experience for members who are manually depositing metadata. From our perspective, maintaining this many interfaces in different codebases is inefficient, in part because an update to the schema likely leads to separate updates in each of them. A unified user interface to register content would both improve and simplify the user experience for you, our community, and make updates quicker and more efficient. The original goal of Metadata Manager was to be this unified interface. But we’ve learned that the approach we took was flawed: there have been problems reported by users, and the tool itself isn’t flexible enough to easily and quickly add new record types or support new fields when our input schema changes.&lt;/p>
&lt;h2 id="a-new-approach-to-helper-tools">A new approach to helper tools&lt;/h2>
&lt;p>So we’ve decided to build something new and retire the old. We’ll be focusing on creating a brand new Content Registration user interface that will eventually replace Metadata Manager, the Web Deposit form, and Simple Text Query. And what we’ve learned from our experiences with Metadata Manager and Web Deposit has greatly influenced our strategy going forward. The new tool will:&lt;/p>
&lt;h3 id="have-a-community-focus">Have a Community focus&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Design for small&lt;/strong> - Our membership demographic is evolving. A large (and growing) number of our members are very small, often with a single publication and no technical resources. Creating XML can be a barrier to participating in Crossref, and our helper tools are designed to lower that barrier.&lt;/li>
&lt;li>&lt;strong>Accessibility and localization support&lt;/strong> - All of our UIs should support major international accessibility guidelines and translation into local languages, to meet the needs of our global membership.&lt;/li>
&lt;li>&lt;strong>Open source code&lt;/strong> - Build in the open, so that others can contribute. This could mean an entire UI that we haven’t prioritized, or adding a new translation file, or tweaking some CSS.&lt;/li>
&lt;/ul>
&lt;h3 id="follow-user-centered-design-processes">Follow user-centered design processes&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Unified user interface&lt;/strong> - Improve user experience and simplify tools and services by providing members with one place to go to register content via a UI.&lt;/li>
&lt;li>&lt;strong>Rapid iteration&lt;/strong> - Focus on a technical solution that allows for rapid development of UIs to support new record types and updates to our schema.&lt;/li>
&lt;li>&lt;strong>Building the right features for the right users&lt;/strong> - The needs of our large members and smaller members are different. Experience has shown us that the core audience for a helper tool is smaller members; we’ll tailor the features to solve the challenges of our smaller members.&lt;/li>
&lt;/ul>
&lt;h3 id="allow-us-to-build-content-for-the-future">Allow us to build content for the future&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Tactical approach to record types&lt;/strong> - Quickly build UIs in a strategic order. We can’t build support for every record type at once, so we want to identify and build in the areas of highest impact/lowest effort first.&lt;/li>
&lt;li>&lt;strong>Deliberate approach to supported fields&lt;/strong> - Not all members will supply metadata for all fields in our schema. Building a UI to support all fields for a specific record type before moving on to another slows progress on that next record type. We’ll identify the most-used and most-useful fields to support first, and add more in a future iteration if needed.&lt;/li>
&lt;/ul>
&lt;h2 id="deprecating-metadata-manager">Deprecating Metadata Manager&lt;/h2>
&lt;p>In order to free up the resources to develop the new Content Registration UIs, we need to stop doing other things - that means not adding to, supporting, or bug-fixing other Content Registration tools. We’re setting an aggressive goal of sunsetting Metadata Manager by the end of 2021, with a commitment to a smooth transition to our new tool. This means that new members should not start using Metadata Manager. New members who need a helper tool have a few choices:&lt;/p>
&lt;ul>
&lt;li>those who use the OJS platform from PKP to host their journals (OJS V3 and above) should use the third party Crossref OJS plugin to register their content.&lt;/li>
&lt;li>other new members should use the Web Deposit form&lt;/li>
&lt;li>current members who are using Metadata Manager may continue to do so, but are advised that we won’t be doing bug fixes or further development on the tool, and that support will be scaled back. If possible, you should transition over to using the Web Deposit form.&lt;/li>
&lt;/ul>
&lt;p>This wasn’t a decision made lightly, but one made after considering multiple options and all the data available to us about member usage and internal resources.&lt;/p>
&lt;p>To highlight some of the data that led to this decision: the Support team tracks the types of support tickets they handle. In 2020, the 3rd most common ticket type was Metadata Manager-related. But less than 4% of metadata records registered with us are registered using Metadata Manager. Supporting Metadata Manager requires resources disproportionate to the amount of use the tool gets. For comparison, twice as many records are registered using the Web Deposit Form, but it generates far fewer Support tickets. To fix the bugs and issues reported about Metadata Manager requires an equally disproportionate amount of developer resources. So far, we have been unable to free up resources we would need to fix them all. Continuing to maintain this tool is effectively preventing us from building something new that will better meet the needs of our smaller members.&lt;/p>
&lt;p>We know this will surprise and concern some of you, especially heavy users of Metadata Manager. We’re committed to making this a smooth transition, and over the coming months, we’ll provide more guidance to help current members migrate to our other tools.&lt;/p>
&lt;h2 id="involving-the-community">Involving the community&lt;/h2>
&lt;p>Building a tool that allows us to create and adapt content registration forms based on example input files is an exciting new approach - one that will allow us to better serve the needs of our smaller members across multiple record types and support those who want to adapt our tools to their own needs. We’ve already begun work on a proof-of-concept tool aligned with this new strategy and I’m excited to drive it to production. As this project develops, we’ll keep in close contact with members, conducting user interviews, feedback sessions, and using usage data to help guide our decision-making on features and design. As we’ll be building in the open, we’ll have prototypes to share along the way as we iterate to produce a tool that will stand the test of time as well as scale to support even more content and members in future. We welcome your feedback over on our &lt;a href="https://community.crossref.org/t/feedback-on-new-helper-tool/1721" target="_blank">Community Forum&lt;/a>, where we’ve set up a dedicated category to discuss this topic.&lt;/p></description></item><item><title>Doing more with relationships - via Event Data</title><link>https://www.crossref.org/blog/doing-more-with-relationships-via-event-data/</link><pubDate>Fri, 14 May 2021 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/doing-more-with-relationships-via-event-data/</guid><description>&lt;p>Crossref aims to &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">link research together&lt;/a>, making related items more findable, increasing transparency, and showing how ideas spread and develop. There are a number of moving parts in this effort: some related to capturing and storing linking information, others to making it available.&lt;/p>
&lt;p>By including relationship metadata in Event Data, we are taking a big step to improve the visibility of a large number of links between metadata. We know this is long-promised and we’re pleased that making this valuable metadata available supports a number of important initiatives. We will also be backfilling, so all previously deposited relationships will eventually become available as events. The first step will be to add relationships between items that have DOIs, such as between a research article and a related review report or dataset.&lt;/p>
&lt;h2 id="what-are-relationships">What are relationships?&lt;/h2>
&lt;p>When members register metadata with us, they have the possibility to identify other works, items, and websites that they know are related. This might be supplementary material or previous versions of a work (especially for preprints and working papers). Equally, identifiers for a protein, gene, or organism used in the research can be included. These are recorded as ‘relationships’ and can be &lt;a href="https://crossref.org/services/metadata-retrieval/" target="_blank">accessed in the same way as the rest of the metadata&lt;/a> we hold about registered content.&lt;/p>
&lt;h2 id="some-examples">Some examples&lt;/h2>
&lt;h4 id="relationships-in-the-metadata-show-links-to-the-published-article-from-this-biorxiv-preprinthttpsdoiorg10110120200521109546-in-the-crossref-rest-apihttpsapicrossreforgworks10110120200521109546">Relationships in the metadata show links to the published article from &lt;a href="https://doi.org/10.1101/2020.05.21.109546" target="_blank">this bioRxiv preprint&lt;/a>. In the &lt;a href="https://api.crossref.org/works/10.1101/2020.05.21.109546" target="_blank">Crossref Rest API&lt;/a>:&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;relation&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;is-preprint-of&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id-type&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;doi&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;10.1038/s41467-020-17892-0&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;asserted-by&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;subject&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">],&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;cites&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[]&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="and-now-in-event-datahttpapieventdatacrossreforgv1eventsmailtomrittmancrossreforgsubj-id10110120200521109546">And now in &lt;a href="http://api.eventdata.crossref.org/v1/events?mailto=mrittman@crossref.org&amp;amp;subj-id=10.1101/2020.05.21.109546" target="_blank">Event Data&lt;/a>:&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;subj&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;pid&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.1101/2020.05.21.109546&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;url&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.1101/2020.05.21.109546&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;work_type_id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;posted-content&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;obj&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;pid&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.1038/s41467-020-17892-0&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;url&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.1038/s41467-020-17892-0&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;method&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;doi-literal&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;verification&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;literal&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;work-type-id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;journal-article&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="linking-to-a-dataset-in-the-dryad-digital-repository-by-a-recent-elife-articlehttpsdoiorg107554elife19920-in-the-crossref-metadata">Linking to a dataset in the Dryad Digital Repository by &lt;a href="https://doi.org/10.7554/elife.19920" target="_blank">a recent eLife article&lt;/a>. In the Crossref metadata:&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;relation&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;is-supplemented-by&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id-type&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;doi&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;10.5061/dryad.s58qh&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;asserted-by&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;subject&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">],&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;references&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id-type&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;doi&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;10.5061/dryad.s58qh&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;asserted-by&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;subject&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">],&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;cites&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="p">[]&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;h4 id="and-now-in-event-data">And now in Event Data:&lt;/h4>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;subj&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;pid&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.7554/elife.19920&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;url&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.7554/elife.19920&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;work_type_id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;journal-article&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s2">&amp;#34;obj&amp;#34;&lt;/span>&lt;span class="err">:&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;pid&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.5061/dryad.s58qh&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;url&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;https://doi.org/10.5061/dryad.s58qh&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;method&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;doi-literal&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;verification&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;literal&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nt">&amp;#34;work-type-id&amp;#34;&lt;/span>&lt;span class="p">:&lt;/span> &lt;span class="s2">&amp;#34;Dataset&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>If you are interested in relationships for a single DOI, we still recommend checking the metadata of that record, however Event Data is a great option for looking across multiple records. For example, to check for relationships across a prefix, in a given time period, or for a specific type of relationship.&lt;/p>
&lt;h2 id="data-citation">Data citation&lt;/h2>
&lt;p>Data citations can be included in data deposits in relationship metadata, usually using the ‘is-supplemented-by’ relationship. By creating an event from each relationship, the links between journal articles and books, and the data they rely on are more visible. This makes the data much easier to locate.&lt;/p>
&lt;p>Many datasets have DOIs which are usually recorded with &lt;a href="https://datacite.org/" target="_blank">DataCite&lt;/a>, meaning you are unlikely to find them via searches of Crossref metadata. Making data citation relationship metadata available in Event Data means it will be available in the same format as citations from datasets to articles (which DataCite sends to Event Data) and citations from articles to datasets from Crossref reference metadata (more to come on this later this year). It also means we will convert this information into &lt;a href="https://documentation.ardc.edu.au/cpg/scholix" target="_blank">Scholix&lt;/a> format so that it can be harvested and combined with other sets of Scholix-compliant article/data links. Data citations will therefore be available for the community to identify, share, link and recognise research data. We’re working with initiatives like &lt;a href="https://makedatacount.org/" target="_blank">Make Data Count&lt;/a> and &lt;a href="https://www.stm-researchdata.org/" target="_blank">STM’s research data program&lt;/a> to support the growing uptake of good data citation practices. This is a big step forward in making data citation happen for the community; we have more to do, but Crossref is committed to completing this work as a strategic priority.&lt;/p>
&lt;h2 id="whats-next">What’s next?&lt;/h2>
&lt;p>In this first stage we are adding relationships that link two objects with a DOI, and later this year we will bring in relationships using other identifiers such as accession numbers and URIs. That will make it more straightforward to ask questions of Event Data such as which organisms have relationships to which works with a DOI.&lt;/p>
&lt;h2 id="more-info-and-staying-in-touch">More info and staying in touch&lt;/h2>
&lt;ul>
&lt;li>Find out more about Event Data in our &lt;a href="https://www.crossref.org/education/event-data/">support documentation&lt;/a> or check out tickets in the &lt;a href="https://gitlab.com/crossref/issues/-/issues?scope=all&amp;amp;utf8=%e2%9c%93&amp;amp;state=opened&amp;amp;label_name[]=Service%3A%3AEvent%20Data" target="_blank">GitLab repo&lt;/a>.&lt;/li>
&lt;li>Keep informed and ask us anything via our &lt;a href="https://community.crossref.org/c/crossref-services/event-data/17" target="_blank">community forum for Event Data discussion&lt;/a>&lt;/li>
&lt;/ul></description></item><item><title>Open-source code: giving back</title><link>https://www.crossref.org/blog/open-source-code-giving-back/</link><pubDate>Fri, 30 Apr 2021 00:00:00 +0000</pubDate><author>Joel Schuweiler</author><guid>https://www.crossref.org/blog/open-source-code-giving-back/</guid><description>&lt;h2 id="tldr">TL:DR;&lt;/h2>
&lt;ul>
&lt;li>Hi, I&amp;rsquo;m Joel&lt;/li>
&lt;li>GitLab UI unsatisfactory&lt;/li>
&lt;li>Wrote a UI to use the API&lt;/li>
&lt;li>Wrote a missing API&lt;/li>
&lt;li>Open company contributes changes back to another open company&lt;/li>
&lt;li>Now have a method for getting work done much easier&lt;/li>
&lt;li>Hurrah!&lt;/li>
&lt;/ul>
&lt;p>I&amp;rsquo;m Joel, a Senior Site Reliability Engineer here at Crossref. I have a long background in open source, software development, and solving unique problems. One of my earliest computer influences was my father. He wrote software to support scientists in search of things like the top quark, the most massive of all observed elementary particles.&lt;/p>
&lt;p>One day my father came home with over 40 floppy disks, excited to have this cool, free operating system called Linux. Together we installed Linux and ended up with a fully functional computer. Learning and using Linux opened up an entirely new world to me of amazing open-source software that I could use freely. As I enjoyed all this new software now available to me, I tried to fix any bugs or problems I&amp;rsquo;d encounter and report solutions for them to the software developers. It felt great to be able to contribute back so others could benefit.&lt;/p>
&lt;p>Software teams tend to manage their workflow by writing issues, reviewing them to make sure they make sense and have an achievable goal, estimate how much time it will take to complete, and finally––the crucial step––putting the issues in the order in which they should be completed. To manage my work, I’ve always used Jira––a product designed to help teams of all types prioritize work––and for the first time in over a decade, I find myself not using it in my work.&lt;/p>
&lt;h2 id="product-development-tracking-with-gitlab">Product development tracking with GitLab&lt;/h2>
&lt;p>The Crossref team took the decision a few years ago to move all our development and product tracking work via &lt;a href="https://gitlab.com/crossref/" target="_blank">GitLab&lt;/a>––a commercial open-source product anyone can use to help keep track of software throughout the development life cycle––with an open-by-default policy. Work is tracked using the issues feature of Gitlab. GitLab will host it, so you don&amp;rsquo;t have worry about maintenance and backups. One major drawback I discovered with GitLab, is a lack of maturity when it comes to doing light project management work.&lt;/p>
&lt;blockquote>
&lt;p>This is where the trouble begins with GitLab.&lt;/p>
&lt;/blockquote>
&lt;p>In the &lt;em>board&lt;/em> view of your issues, you can transition your issues from &lt;code>waiting&lt;/code>, to &lt;code>in progress&lt;/code>, from &lt;code>in progress&lt;/code> to &lt;code>done&lt;/code>. The problem with this view is its width-restricted, and things like tags on issues, which are used to help categorize, take up valuable vertical space. With enough tags and a long enough subject line, you can only see five issues at a time on a MacBook Pro monitor, for example.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/gitlab-board-view.png"
alt="GitLab board view graphic" width="80%">
&lt;/figure>
&lt;/div>
&lt;p>In the &lt;em>list&lt;/em> view of your issues, you get a clean compact view; the perfect view to order issues. However there&amp;rsquo;s one major flaw, it&amp;rsquo;s paginated. (You know when you&amp;rsquo;re shopping and they make you click to see another page of goods? Yes, like that.) The problem with GitLab&amp;rsquo;s implementation is you can drag and drop issues on a given page, but there is no way to move the issues to another page in the list of results. Additionally, all newly-created issues are added to the end of the list.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/gitlab-list-view.png"
alt="GitLab list view graphic" width="80%">
&lt;/figure>
&lt;/div>
&lt;h2 id="the-solution">The solution&lt;/h2>
&lt;p>I went about finding a solution by visiting GitLab&amp;rsquo;s own public issue page and found that requests requiring user interface (UI) changes would languish; in some cases, they would go years without getting approval. Instead of putting in all the work to open an issue with them, only to have it be discarded or ignored, I decided to look for another way.&lt;/p>
&lt;p>GitLab has an API, what more could I need? I discovered I could log in and get a list of all the issues, by project, and by group. &amp;ldquo;This is perfect!&amp;rdquo;, I thought. I can write my own UI around it. It took three evenings writing a UI that was satisfactory to me. When I started writing javascript to interact with the UI, I learned that the &amp;rsquo;re-ordering of issues&amp;rsquo; didn&amp;rsquo;t actually have an API. Further investigation lead me to the issue tracker where I found an issue by a GitLab employee asking for the same functionality––the ability to re-order issues.&lt;/p>
&lt;p>While in a chatroom for GitLab development, I was genuinely surprised by my experience. There was quick attentive help on locating the file I would need to implement the change, they set up a development environment, and even helped submit tests for my code while I worked on updating documentation and writing a changelog entry. It felt like GitLab must’ve designated an employee to work with the community on submitting improvements. In no time, the API for re-ordering was implemented. After the scheduled monthly release of GitLab rolled out with my new API, I was able to easily re-order issues.&lt;/p>
&lt;p>GitLab&amp;rsquo;s response when help was needed along the way was impressive. Now there is a much easier method for getting work done that everyone can use. It’s rewarding when you can contribute back to the community for all to benefit.&lt;/p>
&lt;p>Is GitLab as polished as Jira? No. Did they embrace me making changes by being open from the start and providing help along the way? Yes. Do I see Jira shifting its culture to match? Unlikely.&lt;/p>
&lt;p>By emulating GitLab, an open organisation like Crossref has a shot at encouraging community development.&lt;/p></description></item><item><title>Stepping up our deposit processing game</title><link>https://www.crossref.org/blog/stepping-up-our-deposit-processing-game/</link><pubDate>Mon, 08 Mar 2021 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/stepping-up-our-deposit-processing-game/</guid><description>&lt;p>Some of you who have submitted content to us during the first two months of 2021 may have experienced content registration delays. We noticed; you did, too.&lt;/p>
&lt;p>The time between us receiving XML from members, to the content being registered with us and the DOI resolving to the correct resolution URL, is usually a matter of minutes. Some submissions take longer - for example, book registrations with large reference lists, or very large files from larger publishers can take up to 24 to 48 hours to process.&lt;/p>
&lt;p>However, in January and February 2021 we saw content registration delays of several days for all record types and all file sizes.&lt;/p>
&lt;h2 id="tell-me-more">Tell me more&lt;/h2>
&lt;p>Januaries and Februaries are usually busy at Crossref. Journal &lt;a href="https://doi.org/10.64000/91cyc-vka68" target="_blank">ownership changes hands&lt;/a>. Members migrate from one platform to another (and can need to update tens of thousands of their resolution URLs). And, many of you are registering your first issues, books, or conferences of the year. Others of you have heard the calls of &lt;a href="https://i4oc.org/" target="_blank">The Initiative for Open Citations (I4OC)&lt;/a> and &lt;a href="https://i4oa.org/" target="_blank">The Initiative for Open Abstracts (I4OA)&lt;/a> and are enriching your metadata accordingly (thank you!). Tickets into our support and membership colleagues peak for the year. But did we see significantly more submissions this year?&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/deposit_submissions_19_20_21-2.png" width="80%">
&lt;/figure>
&lt;/div>
&lt;br/>
&lt;p>As you can see, we did see larger-than-normal numbers of submissions in the first two months of the year. For the entire month of January 2021, we received nearly 1 million more submissions into our admin tool deposit queue than we did in January 2020 (2,757,781 in 2021 versus 1,848,261 in 2020). Under normal circumstances, this would lead to an increase in our processing times, so there’s that to consider. But there was also something else at play this year. We desperately needed to upgrade our load balancer, and so we did. Unfortunately, unforeseen at the time, these upgrades caused hiccups in our deposit processing and slowed down submissions even further, building up the number of unprocessed submissions in the queue.&lt;/p>
&lt;p>When we saw the impact this was having we suspended the load balancer work until things were stable again. We also increased the resources serving our queue to bring it back down to normal. To make sure we don&amp;rsquo;t face the same problem again, we have put in better tools to detect trends in queue usage- tools which, in turn, will allow us to anticipate problems in the queue instead of reacting to them after they&amp;rsquo;ve already occurred. And as a longer-term project, we are addressing two decades of technical debt and rearchitecting our system so that our entire system is much more efficient.&lt;/p>
&lt;h2 id="gory-technical-details">Gory technical details&lt;/h2>
&lt;p>As part of our effort to resolve our technical debt, we&amp;rsquo;re looking to transition more of our services to the cloud. To accomplish this, we first needed to upgrade our internal traffic handling capabilities to route things to their new locations better. This upgrade caused some unforeseen and hard to notice problems, like the queue being stalled. Since the queue still showed things in process, it wasn&amp;rsquo;t immediately apparent that things were not processing (normally the processing on the queue will clear a thread if a significant problem occurs).&lt;/p>
&lt;p>We &lt;a href="https://status.crossref.org/incidents/z9hg0xmtnff7" target="_blank">initially noticed&lt;/a> a problem on 5 February and thought we had a fix in place on the 10th. But, we &lt;a href="https://status.crossref.org/incidents/c49vrqhftxh5" target="_blank">again realized&lt;/a> on 16 February that the underlying problem had recurred, and we needed a closer investigation.&lt;/p>
&lt;p>For many reasons it took us too much time to realize the connection, until people started complaining.&lt;/p>
&lt;p>While our technical team worked on those load balancer upgrades, some of your submissions lingered for days in the deposit queue. In a few examples, larger submissions took over a week to complete processing. Total pending submissions began to push nearly 100,000, an unusually large backlog. We called an emergency meeting, paused all related work, and dedicated additional time and resources to processing all pending submissions. On 22 February, we completed working through the backlog of pending submissions and new submissions were being processed at normal levels. As we finish up this blog on 2 March, there are less than 3,000 pending submissions in the queue, the oldest of which has been there for less than three hours.&lt;/p>
&lt;p>This brings us back to the entire rationale for what we are doing with the load balancer - which, ironically, was to move some services out of the data centre so that we could free-up resources and scale things more dynamically to match the ebbs and flows of your content registration.&lt;/p>
&lt;p>But before we proceed, we&amp;rsquo;ll be looking at what happened. The bumps associated with upgrading ancient software were expected, so we were looking for side effects. We just didn&amp;rsquo;t look in the right place. And we should have detected that the queues had stalled well before people started to report it to us. A lot of our queue management is still manual. This means we are not adjusting it 24x7. So if something does come in when we are not around, it can exacerbate problems quickly.&lt;/p>
&lt;h2 id="what-are-we-going-to-do-about-it">What are we going to do about it?&lt;/h2>
&lt;p>In a word: much. We know that timely deposit processing is critical. We can and will do better.&lt;/p>
&lt;p>First off, we have increased the number of concurrently processing threads dedicated to metadata uploads in our deposit queue from 20 to 25. That’s a permanent increase. A million more submissions in a month necessitates additional resources, but that’s only a short-term patch. And we were only able to make this change recently due to some index optimizations we implemented late last year.&lt;/p>
&lt;p>One of the other things that we&amp;rsquo;ve immediately put into place is a better system for measuring trends in our queue usage so that we can, in turn, anticipate rather than react to surges in the queue. And, of course, the next step will be to automate this queue management.&lt;/p>
&lt;p>All this is part of an overall, multi-year effort to address a boat-load of technical debt that we&amp;rsquo;ve accumulated over two decades. Our system was designed to handle a few million DOIs. It has been incrementally poked and prodded to deal with well over a hundred million. But it is suffering.&lt;/p>
&lt;p>Anybody who is even semi-technically-aware might be wondering what all the fuss is about? Why can&amp;rsquo;t we fix this relatively easily? After all, 130 million records&amp;mdash;though a significant milestone for Crossref&amp;mdash;does not in any way qualify as &amp;ldquo;big data.&amp;rdquo; All our DOI records fit onto an average sized micro-SD card. There are open source toolchains that can manage data many, many times this size. We&amp;rsquo;ve occasionally used these tools to load and analyse all our DOI records on a desktop computer. And it has taken in just a few minutes (admittedly using a beefier-than-usual desktop computer). So how can a queue with just 100,000 items in it take so long to process?&lt;/p>
&lt;p>Our scale problem isn&amp;rsquo;t so much about the number of records we process. It is about the 20 years of accumulated processing rules and services that we have in place. Much of it undocumented and the rationale for which has been lost over the decades. It is this complexity that slows us down.&lt;/p>
&lt;p>And one of the challenges we face as we move to a new architecture is deciding which of these rules and services are &amp;ldquo;essential complexity&amp;rdquo; and which are not. For example, we have very complex rules for verifying that submissions contain a correct journal title. These rules involve a lot of text matching and, until they are successfully completed, they block the rest of the registration process.&lt;/p>
&lt;p>But the workflow these rules are designed for is one that was developed before ISSNs were widely deposited and before we had our own, internal title identifiers for items that do not have an ISSN. And so a lot of this process is probably anachronistic. It is not clear which (if any) parts of it are still essential.&lt;/p>
&lt;p>We have layers upon layers of these kinds of processing rules, many of which are mutually dependent and which are therefore not easily amenable to the kind of horizontal scaling that is the basis for modern, scalable data processing toolchains. All this means that, as part of moving to a new architecture, we also have to understand which rules and services we need to move over and which ones have outlived their usefulness. And we need to understand which remaining rules can be decoupled so that they can be run in parallel instead of in sequence.&lt;/p>
&lt;blockquote>
&lt;p>Pro tip: Due to the current checks performed in our admin tool, for those of you submitting XML, the most efficient way to do so is by packaging the equivalent of a journal issue&amp;rsquo;s worth of content in each submission (i.e., ten to twelve content items - a 1 MB submission is our suggested file size when striving for efficient processing)&lt;/p>
&lt;/blockquote>
&lt;p>Which brings us conveniently back to queues. We did not react soon enough to the queue backing up. We can do much better at monitoring and managing our existing registration pipeline infrastructure. But we are not fooling ourselves into thinking this will deal with the systemic issue.&lt;/p>
&lt;p>We recognize that, with current technology and tools, it is absurd that a queue of 100,000 items should take so long to process. It is also important that people know that we are addressing the root of the issues as well. And that we&amp;rsquo;re not succumbing to the now-legendary anti-pattern of trying to rewrite our system from scratch. Instead we are building a framework that will allow us to incrementally extract the essential complexity of our existing system and discard some of the anachronistic jetsam that has accumulated over the years.&lt;/p>
&lt;p>Content Registration should typically take seconds. We wanted to let you know, that we know, and we are working on it.&lt;/p></description></item><item><title>Discuss all things metadata in our new community forum</title><link>https://www.crossref.org/blog/discuss-all-things-metadata-in-our-new-community-forum/</link><pubDate>Thu, 11 Feb 2021 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/discuss-all-things-metadata-in-our-new-community-forum/</guid><description>&lt;p>TL;DR: We have a Community Forum (yay!), you can come and join it here: &lt;a href="https://community.crossref.org/" target="_blank">community.crossref.org&lt;/a>.&lt;/p>
&lt;p>Community is fundamental to us at Crossref, we wouldn’t be where we are or achieve the great things we do without the involvement of you, our diverse and engaged members and users. Crossref was founded as a collaboration of publishers with the shared goal of making links between research outputs easier, building a foundational infrastructure making research easier to find, cite, link, assess, and re-use. It is at the very core of what we do and who we are. Our global community now includes publishers, libraries, government agencies, funders, researchers, universities, ambassadors, and more from over 140 countries. We are also actively part of the larger scholarly research community, which includes other &lt;a href="http://openscholarlyinfrastructure.org/" target="_blank">open scholarly infrastructure&lt;/a> organisations, metadata users and aggregators, open science initiatives, and others with shared aims and values.&lt;/p>
&lt;h2 id="what-do-we-mean-by-community">What do we mean by &amp;lsquo;community&amp;rsquo;?&lt;/h2>
&lt;p>‘Community’ is often one of those words which gets bandied around without much thought given to its meaning. At Crossref, we are aware that expertise lies within our broad, global community and we engage with them (you!) in a variety of ways to ensure that decisions we make are community-led and that what we do, as well as what we don’t do, are in line with the views of our members and developed with your insights and input. We do this via our working groups, committees, ambassador program, beta-testing groups, in-person and online events, webinars, and on-going dialogues and feedback via our support channels and even social media. We are also involved in a number of collaborative projects with other organisations such as &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a>, &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a>, &lt;a href="https://makedatacount.org/" target="_blank">Make Data Count&lt;/a>, &lt;a href="https://www.pidapalooza.org/" target="_blank">PIDapooloza&lt;/a>, and the &lt;a href="https://project-freya.eu/mission.html" target="_blank">FREYA&lt;/a> project to name but a few.&lt;/p>
&lt;p>Community is more than just signing up to be a Crossref member. It’s more than just attending an event or a webinar, or levelling up to include the use of a service like Crossmark or Similarity Check –– it’s really engaging with us and creating something together of shared value for the scholarly community. As an organisation, we’ve been so thrilled that there is a new group dedicated to highlighting community managers and our work. We are working with –– and learning a lot from –– the &lt;a href="https://www.cscce.org" target="_blank">Centre for Scientific Collaboration &amp;amp; Community Engagement&lt;/a> to improve the way we interact and involve people in Crossref. The model below shows a trajectory towards true collaboration that we aim to follow in the coming months and years.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/cscce-participation-model.png"
alt="Cite as: Center for Scientific Collaboration and Community Engagement. (2020) The CSCCE Community Participation Model – A framework for member engagement and information flow in STEM communities. Woodley and Pratt doi: 10.5281/zenodo.3997802" width="80%">&lt;figcaption>
&lt;p>Cite as: &lt;a href="https://doi.org/10.5281/zenodo.3997802" target="_blank">Center for Scientific Collaboration and Community Engagement. (2020) The CSCCE Community Participation Model – A framework for member engagement and information flow in STEM communities. Woodley and Pratt doi: 10.5281/zenodo.3997802&lt;/a>&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>In the current climate, there are additional challenges and limitations on how we interact with all the various communities that we as individuals are a part of, both professionally and personally. I wrote in my last blog about &lt;a href="https://doi.org/10.64000/byv2m-9fm07" target="_blank">how we have moved our events online&lt;/a> and thought about new ways to better connect and engage with our community virtually. One of those ways is our Community Forum.&lt;/p>
&lt;h2 id="the-purpose-of-our-community-forum">The purpose of our community forum&lt;/h2>
&lt;p>Hosted on the open-source discussion platform &lt;a href="https://www.discourse.org/about" target="_blank">Discourse&lt;/a>, you can find our forum at &lt;a href="https://community.crossref.org/" target="_blank">community.crossref.org&lt;/a>. The goal of the community forum is to create an inclusive, open space where Crossref members, ambassadors, sponsors, service providers, and others who share a passion for scholarly infrastructure, can connect. This enables collaborative problem-solving, the sharing of expertise and experiences across time zones and languages, and allows members to post questions to be answered by other community members or even our staff. Members of the community engage via creating posts, commenting on existing content in the forum, volunteering for working groups or beta-testing projects, helping to co-create materials that include translations and shared FAQs, giving feedback on new developments, and joining online events and webinars. Throughout these interactions, we expect that those who use the community forum will form relationships –– a collective working together to advance their work with Crossref and shape the future of scholarly infrastructure.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/community-forum.png" width="80%">
&lt;/figure>
&lt;/div>
&lt;br/>
&lt;p>When I joined Crossref as Community Manager over three years ago, the idea of a forum had already begun to take shape, but it wasn’t quite there just yet. There was additional research and consultation with the community to be done to check this was the approach we wanted to take.&lt;/p>
&lt;p>This involved speaking to others working in scholarly communications about forums they were involved in running or were an active participant of –– check out the &lt;a href="https://forum.pkp.sfu.ca/" target="_blank">PKP forum&lt;/a> for instance if you haven’t already –– and having numerous valuable conversations about successes, potential downfalls, and realistic expectations. The most important –– and commonly cited –– takeaway is that building an online community takes time. We are still at the start of this journey. It will only work if it is a place of value for all and a place where people feel a sense of belonging and co-ownership.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2021/community-forum-post.png" width="80%">
&lt;/figure>
&lt;/div>
&lt;h2 id="preparing-to-rollout-the-forum">Preparing to rollout the forum&lt;/h2>
&lt;p>We tested the platform with a small group of beta-testers and also sent out a survey to over 1,700 of our members, taking a sample with a geographical and organisational spread. The responses thankfully held no major surprises and reinforced our belief that this is something of use to people.&lt;/p>
&lt;h3 id="key-research-findings">Key research findings&lt;/h3>
&lt;ul>
&lt;li>77% of respondents had previously contacted our Support team for help resolving an issue.&lt;/li>
&lt;li>90% stated either ‘yes’ or ‘maybe’ to whether they would use a community forum to post their questions, though over half have never used a forum before.&lt;/li>
&lt;li>Most common reasons of importance for joining are &amp;lsquo;Community support in solving issues or answering questions&amp;rsquo;, &amp;lsquo;To locate FAQs and quickly find answers to common issues&amp;rsquo;, and &amp;lsquo;To connect with others working in a similar role and/or with similar interests&amp;rsquo;&lt;/li>
&lt;li>Most commonly-stated things that would discourage or limit member’s participation would be how time-consuming and complex the forum is to use, and any potential language barriers.&lt;/li>
&lt;/ul>
&lt;h2 id="things-you-can-do-on-the-forum">Things you can do on the forum&lt;/h2>
&lt;p>We hope this will provide a much more open level of support for the community, enabling us to bring out all those great questions and thoughtful conversations we receive via our Support channels into the public sphere, where we can all benefit from these rich exchanges. Ultimately our goal for the future is that this space is owned by you, the Crossref community. This is a platform for you to connect and build relationships with others working in scholarly communications: metadata fanatics, identifier aficionados, developer gurus, and open research enthusiasts - we welcome you all!&lt;/p>
&lt;ul>
&lt;li>Share what activities or projects you are working on and get input from others.&lt;/li>
&lt;li>Share issues that you need some help resolving, post a question to the forum in your native language and get help from another community member.&lt;/li>
&lt;li>Give us feedback on our plans and help us shape future developments at Crossref.&lt;/li>
&lt;li>Test out new tools and services.&lt;/li>
&lt;li>Find out about upcoming events and webinars, and share any you think are of interest to the community.&lt;/li>
&lt;li>Help us identify better ways of working together through Crossref and co-create new materials and projects.&lt;/li>
&lt;/ul>
&lt;h2 id="how-to-get-started">How to get started&lt;/h2>
&lt;p>So, how do I sign up you ask? Simply head over to &lt;a href="https://community.crossref.org/" target="_blank">community.crossref.org&lt;/a> and set up an account. There&amp;rsquo;s a useful How-To guide available on &lt;a href="https://community.crossref.org/t/about-the-welcome-to-the-crossref-community-forum-category/1026" target="_blank">our welcome post&lt;/a>, as well as some Community Guidelines all our members should follow.&lt;/p>
&lt;p>Do you have a question about registering or updating your metadata? Then head over to the &lt;a href="https://community.crossref.org/c/content-registration/24" target="_blank">Content Registration category&lt;/a> and post your query to the group. Want to find out about getting started with Similarity Check service? Then take a look at our &lt;a href="https://community.crossref.org/c/crossref-services/similarity-check/22" target="_blank">Similarity Check topic&lt;/a> in our services category. Or maybe you want to know more about upcoming multilingual webinars at Crossref, or perhaps you have one of your own you’d like to share? Then check out the &lt;a href="https://community.crossref.org/c/crossref-calendar/10" target="_blank">Community Calendar&lt;/a>.&lt;/p>
&lt;p>We’re also looking for talented linguists out there to help us translate our welcome email template into multiple languages so that anyone joining the community can get a welcome in their native language. To join in, visit &lt;a href="https://community.crossref.org/t/help-us-translate-our-welcome-email/1527?u=vanessa" target="_blank">my post&lt;/a> in our ‘Questions from Crossref’ category.&lt;/p>
&lt;p>We look forward to seeing you in the community soon!&lt;/p></description></item><item><title>Event Data: A Plan of Action</title><link>https://www.crossref.org/blog/event-data-a-plan-of-action/</link><pubDate>Mon, 01 Feb 2021 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/event-data-a-plan-of-action/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> uncovers links between Crossref-registered DOIs and diverse places where they are mentioned across the internet. Whereas a citation links one research article to another, events are a way to create links to locations such as news articles, data sets, Wikipedia entries, and social media mentions. We&amp;rsquo;ve collected events for several years and make them openly available via &lt;a href="https://api.eventdata.crossref.org" target="_blank">an API&lt;/a> for anyone to access, as well as creating &lt;a href="https://www.crossref.org/education/event-data/transparency/">open logs&lt;/a> of how we found each event. &lt;a href="https://www.crossref.org/education/event-data/use/#00632">Some organisations&lt;/a> are already using Event Data and we are keen for more to come on board.&lt;/p>
&lt;p>Last year we gave an &lt;a href="https://doi.org/10.64000/7e781-dzw34" target="_blank">update on Event Data&lt;/a> with apologies for being so quiet and a promise of more information at a later date. It&amp;rsquo;s been some time, so here goes&amp;hellip;&lt;/p>
&lt;p>I joined Crossref in the middle of last year as a Product Manager and was tasked with looking into Event Data. The first thing I found was a large amount of enthusiasm for Event Data, both within Crossref and further afield. The idea of gathering information beyond the metadata deposited by our members is popular, and creates valuable connections between DOIs and a range of other sources. Interest spans the spectrum of academic research, publishing, bibliometrics, and beyond.&lt;/p>
&lt;p>At the same time, I found a project with a very solid, well-built code base but unstable performance. After being put into production in 2018, we didn&amp;rsquo;t provide sufficient support. Coupled with staff changes and other competing priorities, Event Data hasn&amp;rsquo;t had the opportunity to live up to early expectations.&lt;/p>
&lt;p>To address these issues, we have embarked on a plan to make the server infrastructure more robust, improve monitoring, and make sure that the future of Event Data makes the best use of the resources we have without over-stretching. It means working with the community to determine the most essential aspects of Event Data, and providing support where it&amp;rsquo;s needed.&lt;/p>
&lt;p>The steps below are not necessarily sequential and some depend on the completion of work in other parts of Crossref, but they outline the priorities we have for Event Data in 2021.&lt;/p>
&lt;h2 id="the-plan">The Plan&lt;/h2>
&lt;h3 id="stability">Stability&lt;/h3>
&lt;p>Since we put in place our original Event Data infrastructure, the amount of incoming data has grown, and at an ever-increasing rate. In 2017 we were creating 2 million new events per month, that number is now over 20 million. We have known for some time that we need to refresh the infrastructure, but didn&amp;rsquo;t have the resources to move forward: now we do.&lt;/p>
&lt;p>In the first part of the plan we will renew the server infrastructure that underpins Event Data. Maybe not a headline-grabbing move, but the aim is to reduce downtime and pull in missing data. Through improving our monitoring and shortening the response time when things go wrong, we will be able to ensure that events are added on a regular basis and the API can reliably handle requests.&lt;/p>
&lt;p>We&amp;rsquo;ve made the first steps in this direction by upgrading our API infrastructure and making some other tweaks to improve performance. There is still work to do, but we&amp;rsquo;ve already seen a &lt;a href="https://status.crossref.org" target="_blank">significant improvement in performance&lt;/a> with nearly &amp;gt;99.99% uptime in December.&lt;/p>
&lt;h3 id="consolidation">Consolidation&lt;/h3>
&lt;p>The second component of the plan is to review performance and data quality. We will evaluate the event sources, update artefacts (such as the lists of publisher landing pages and news websites, and review performance reporting. This will help us to have a better understanding of Event Data in its current form: if the stability component is about improving what comes in and goes and out, this part will give us increased confidence in what Event Data already contains.&lt;/p>
&lt;h3 id="future-roadmap">Future roadmap&lt;/h3>
&lt;p>While the two steps above are being carried out, we will revisit the applications of Event Data and talk to organisations that currently use it or have expressed an interest. These conversations will feed into future development in which we will evaluate new sources and other ways to optimize the service.&lt;/p>
&lt;p>Central to the roadmap will be continued support of the data citation endpoint in &lt;a href="https://documentation.ardc.edu.au/cpg/scholix" target="_blank">Scholix&lt;/a> format, which we run in close collaboration with DataCite. Additionally, we will add new data from &lt;a href="https://www.crossref.org/education/content-registration/structural-metadata/relationships/">relationships&lt;/a> between Crossref works, for example a preprint is matched to a journal article, or where there are corrections, retractions, or translations of works.&lt;/p>
&lt;p>We expect to continue supporting the current sources of events and where there are organisations with either a strong interest in a particular source or a database of events that they can send directly, we are keen to build collaborations. Event Data, like everything that Crossref does, is a community-based effort.&lt;/p>
&lt;h2 id="staying-in-touch">Staying in touch&lt;/h2>
&lt;p>To join the conversation about Event Data and keep informed, head over to our &lt;a href="https://community.crossref.org/c/crossref-services/event-data/17" target="_blank">Community pages&lt;/a>. You can also check out our &lt;a href="https://gitlab.com/crossref/issues/-/issues?scope=all&amp;amp;utf8=%e2%9c%93&amp;amp;state=opened&amp;amp;label_name[]=Service%3A%3AEvent%20Data" target="_blank">Gitlab pages&lt;/a>. At the end of last year we updated the &lt;a href="https://www.crossref.org/documentation/event-data/">Education pages&lt;/a> where you can learn more about Event Data.&lt;/p></description></item><item><title>New public data file: 120+ million metadata records</title><link>https://www.crossref.org/blog/new-public-data-file-120-million-metadata-records/</link><pubDate>Tue, 19 Jan 2021 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/new-public-data-file-120-million-metadata-records/</guid><description>&lt;p>2020 wasn&amp;rsquo;t all bad. In April of last year, we released our &lt;a href="https://www.crossref.org/blog/free-public-data-file-of-112-million-crossref-records/" target="_blank">first public data file&lt;/a>. Though Crossref metadata is always openly available––and our board recently cemented this by voting to adopt the &lt;a href="https://www.crossref.org/blog/crossrefs-board-votes-to-adopt-the-principles-of-open-scholarly-infrastructure/" target="_blank">Principles of Open Scholarly Infrastructure (POSI)&amp;lt;/agic––we&amp;rsquo;ve decided to release an updated file. This will provide a more efficient way to get such a large volume of records. The file (JSON records, 102.6GB) is &lt;a href="https://academictorrents.com/details/e4287cb7619999709f6e9db5c359dda17e93d515" target="_blank">now available&lt;/a>, with thanks once again to Academic Torrents.&lt;/p>
&lt;p>Use of our open APIs continues to grow, as does the metadata. Last year&amp;rsquo;s file was 112 million records and 65GB. Just nine months later (though it feels longer than that!), the new file is over 120 million records and over 102GB. That&amp;rsquo;s all of the Crossref records ever registered up to and including January, 7, 2021. We continue to see around 10% growth in records each year––and while journal articles account for most of the volume, preprints and book chapters are two of our fast-growing record types. In addition to the growth in the number of records, many of the records are getting bigger and better as members look at their &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation report&lt;/a> and understand the value of enriching metadata records for distribution throughout the scholarly ecosystem. &lt;a href="https://www.elsevier.com/connect/advancing-responsible-research-assessment" target="_blank">Elsevier recently opened its references&lt;/a>, enriching over 12 million records.  A number of members, including Royal Society, Sage, Emerald, OUP, World Scientific and more have started adding &amp;lt;a href=&amp;quot;/blog/open-abstracts-where-are-we/&amp;quot; target=&amp;quot;_blank&amp;quot;gicabstracts &lt;/a> which now number over 9 million.&lt;/p>
&lt;h3 id="help-us-help-youusing-the-torrent-and-other-important-notes">Help us help you––using the torrent and other important notes&lt;/h3>
&lt;p>We decided to release these public data files largely to help support COVID-19 research efforts but of course use cases for Crossref metadata vary widely and a few pointers should help all users:&lt;/p>
&lt;ul>
&lt;li>Use &lt;a href="https://academictorrents.com/details/e4287cb7619999709f6e9db5c359dda17e93d515" target="_blank">the torrent &lt;/a> if you want all of these records. Everyone is welcome to the metadata but it will be much faster for you and much easier on our APIs to get so many records in one file.&lt;/li>
&lt;li>Use the REST API to incrementally add new and updated records once you&amp;rsquo;ve got the initial file. &lt;a href="https://github.com/CrossRef/rest-api-doc/blob/master/api_tips.md" target="_blank">Here is how to get started &lt;/a> (and avoid getting blocked in your enthusiasm to use all this great metadata!).&lt;/li>
&lt;li>&amp;lsquo;Limited&amp;rsquo; and &amp;lsquo;closed&amp;rsquo; &amp;lt;a href=&amp;quot;/education/content-registration/descriptive-metadata/references/#00564/&amp;quot; target=&amp;quot;_blank&amp;quot;gicreferences&lt;/a> are not included in the file or our open APIs. And, while bibliographic metadata is generally required, lots of metadata is optional, so records will vary in quality and completeness.&lt;/li>
&lt;/ul>
&lt;p>Questions, comments and feedback are welcome at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p>
&lt;p>Here&amp;rsquo;s hoping 2021 is a better year for us all! Stay well.&lt;/p></description></item><item><title>A tribute to our Kirsty</title><link>https://www.crossref.org/blog/a-tribute-to-our-kirsty/</link><pubDate>Wed, 16 Dec 2020 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/a-tribute-to-our-kirsty/</guid><description>&lt;p>Our colleague and friend, &lt;a href="https://www.crossref.org/people/kirsty-meddings">Kirsty Meddings&lt;/a>, passed away peacefully on 10th December at home with her family, after a sudden and aggressive cancer. She was a huge part of Crossref, our culture, and our lives for the last twelve years.&lt;/p>
&lt;p>Kirsty Meddings is a name that almost everyone in scholarly publishing knows; she was part of a generation of Oxford women in publishing technology who have progressed through the industry, adapted to its changes, spotted new opportunities, and supported each other throughout. We hope this post will do justice to her memory in our profession.&lt;/p>
&lt;h2 id="kirstys-early-career">Kirsty&amp;rsquo;s early career&lt;/h2>
&lt;p>After completing her degree in English and Spanish American Literature at Warwick University, Kirsty started her career in scholarly communications and publishing at Blackwell’s Information Services. She was there for a year before joining CatchWord, an online journal start-up, in 1998, as Electronic Publisher and Account Manager and in 1999 was promoted to the new role of Library Relations Manager.&lt;/p>
&lt;p>CatchWord was acquired by Ingenta and Kirsty moved into product management working on integrating the CatchWord and Ingenta platforms and launching IngentaConnect in 2004. Ingenta became Publishing Technology in 2005 and Kirsty was Product Development Manager working with engineering, business development, and users on developing online products and services. She was also involved in a range of community initiatives including COUNTER, KBART, and ICEDIS.&lt;/p>
&lt;h2 id="joining-crossref">Joining Crossref&lt;/h2>
&lt;figure>&lt;a href="https://www.crossref.org/people/kirsty-meddings/">&lt;img src="https://www.crossref.org/images/staff/kirsty_720px.jpg"
alt="Kirsty&amp;rsquo;s professional headshot" width="50%">&lt;/a>&lt;figcaption>
&lt;p>Kirsty&amp;rsquo;s professional headshot&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>She was an early pioneer in electronic and online publishing - an innovator who understood scholarly publishing, technology, libraries, and people - a powerful combination. And Crossref was quick to offer her a role.&lt;/p>
&lt;p>In Kirsty’s introduction to Crossref she was described by the recruiter as:&lt;/p>
&lt;blockquote>
&lt;p>An experienced and highly capable individual with a solid background in product development, marketing and customer service issues related to the supply of scholarly electronic content from publishers to library and end user audiences. A good communicator and team worker with sound technical understanding and an excellent grasp of publishing industry issues.&lt;/p>
&lt;/blockquote>
&lt;p>This adequately captures Kirsty’s impressive professional achievements, but not her personality. Kirsty was a Product Manager at Crossref for 12 years and was a valued and loved friend and colleague. Committed to Crossref&amp;mdash;its values and people&amp;mdash;she was funny, human, and always asked tough questions.&lt;/p>
&lt;p>She joined us on October 27th, 2008 as our first Product Manager and the third UK employee. In her time at Crossref, Kirsty made a major impact, working on a range of important projects and services - particularly new, innovative services. Not long after she started at Crossref, she wrote a “day in the life” profile for the journal Serials that perfectly captures what it was like in 2009 at Crossref Oxford (there were three of us in Oxford and only ten total staff at Crossref): &lt;em>Meddings, K., 2009. Mini-profile: a day in the life of a product manager. Serials, 22(1), pp.5–6. DOI: &lt;a href="http://doi.org/10.1629/225" target="_blank">http://doi.org/10.1629/225&lt;/a>&lt;/em>&lt;/p>
&lt;p>Her own biography, on her staff &lt;a href="https://www.crossref.org/people/kirsty-meddings/">page&lt;/a>, states:&lt;/p>
&lt;blockquote>
&lt;p>Kirsty Meddings has been involved in a diverse set of initiatives that have kept her busy since 2008. She has spent most of her career in scholarly communications, in a variety of marketing and product development roles for intermediaries and technology suppliers. She speaks conversational geek and competent publishing, and is working towards fluency in both.&lt;/p>
&lt;/blockquote>
&lt;p>See? Funny!&lt;/p>
&lt;h2 id="professional-achievements">Professional achievements&lt;/h2>
&lt;p>Kirsty started out working on CrossCheck, now Similarity Check, the plagiarism screening service that launched in 2008. The service was in need of some attention and better organisation - Kirsty got stuck in, whipped it into shape and it has gone on to be one of Crossref’s most widely-adopted services. &lt;a href="https://cdn.ymaws.com/www.ismte.org/resource/resmgr/eon/august_2011.pdf" target="_blank">This article&lt;/a> that Kirsty wrote for ISMTE’s publication, EON, remains useful nearly 10 years after it was written! Kirsty successfully managed the partnership with Turnitin (starting as iParadigms), the technical provider for Similarity Check, for many years. Colleagues there are mourning her loss too.&lt;/p>
&lt;p>Kirsty was instrumental in launching Crossmark, which became a production service in 2012. After a few changes of hands, she resumed work on the service in recent years, and announced &lt;a href="https://doi.org/10.64000/h2vh2-35t60" target="_blank">the removal of Crossmark fees&lt;/a> to better support uptake in 2020.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/news/2016-05-17-crossref-publishers-deliver-win-for-clinical-trial-openness/">The addition of clinical trial information to the Crossref metadata&lt;/a> was a community-driven initiative, developed from the concept of &lt;a href="https://blogs.biomedcentral.com/on-medicine/2014/01/31/threaded-publications-one-step-closer/" target="_blank">threaded publications&lt;/a>. There were/are lots of moving parts in this initiative, and in many ways it was one of the precursors to the idea of the Research Nexus: linking via metadata and relationships to provide a clearer picture of the ecosystem that exists around a research object.&lt;/p>
&lt;p>What was once FundRef (ahh, those logos!) has matured into the Open Funder Registry under Kirsty’s stewardship. In collaboration with Elsevier, the registry has grown from &lt;a href="https://www.crossref.org/news/2013-05-28-crossrefs-fundref-launches-publishers-and-funders-track-scholarly-output/" target="_blank">an initial 4,000 funders&lt;/a>, to over 25,000 and we can see &lt;a href="http://api.crossref.org/works?filter=has-funder:t&amp;amp;rows=0" target="_blank">over 5 million works&lt;/a> registered with Crossref that are linked to at least one funder. More recently, Kirsty was the Product Manager for the registration of research grants with Crossref, working with our Funder Advisory Group, and she was starting to work with CDL and DataCite to absorb the Funder Registry into ROR.&lt;/p>
&lt;p>In 2018, Kirsty launched our first ever dashboard for member best practice. She led the development and design of Participation Reports and the decision of which checks would be most important for the scholarly community to assess. This has quickly become one of Crossref’s most valuable and used tools.&lt;/p>
&lt;h2 id="public-speaking">Public speaking&lt;/h2>
&lt;p>Kirsty always spoke with authority across a range of topics, appearing totally calm even if she was nervous. Among many talks, she spoke at the STM seminar on &lt;a href="https://www.stm-assoc.org/events/publication-ethics-and-research-integrity/" target="_blank">Publication Ethics and Research Integrity&lt;/a>, ISMTE, UKSG, ALPSP seminars, the COPE Forum, ran numerous CrossCheck, CrossMark, FundRef and TDM webinars, and a recent online LIVE event.&lt;/p>
&lt;p>She was a frequent presenter at many of Crossref annual meetings, and enjoyed the opportunity to meet and catch up with our members, the board, and the community (many of whom always ask after her). Checking in after conferences on who said what, who’s moving where, what feedback we had, and picking up on opportunities for further collaboration were all things that we looked forward to sharing.&lt;/p>
&lt;p>To use UKSG’s own words, Kirsty was always a staunch supporter of the organisation - attending, exhibiting, and speaking at many UKSG conferences and events over her whole career. She was also a legend at the dinners, on the dance floor, and in the bar. At the 2019 conference she tallied the votes at the quiz night - Kirsty loved a quiz! We had an all-staff end-of-year quiz via zoom last week and it was just not the same without her.&lt;/p>
&lt;p>Here are &lt;a href="https://www2.slideshare.net/search/slideshow?searchfrom=header&amp;amp;q=Kirsty&amp;#43;meddings" target="_blank">Kirsty&amp;rsquo;s slides on SlideShare&lt;/a>, some &lt;a href="https://www.google.com/search?q=kirsty&amp;#43;meddings&amp;#43;Crossref&amp;amp;source=lnms&amp;amp;tbm=vid&amp;amp;sa=X&amp;amp;ved=2ahUKEwjf78Soq9DtAhVTolwKHXzhBOoQ_AUoA3oECAYQBQ&amp;amp;biw=1440&amp;amp;bih=707" target="_blank">videos of Kirsty&amp;rsquo;s talks on YouTube&lt;/a>, and her &lt;a href="https://orcid.org/0000-0001-9205-2956" target="_blank">ORCID record&lt;/a> which lists her published works.&lt;/p>
&lt;h2 id="strong-friendships">Strong friendships&lt;/h2>
&lt;p>One of the most rewarding experiences of working at Crossref is meeting up with the whole team and with our members. Jetlag, hunting out coeliac-friendly food, staying up far too late chatting, trying to fit in exploring bits of cities around board and other meetings, presenting, organizing, thinking, laughing (I’m sure to the annoyance of other plane passengers)&amp;mdash;these experiences were all part and parcel of working with Kirsty, and where many of us cemented connections with her.&lt;/p>
&lt;p>We started a &lt;a href="https://www.kudoboard.com/boards/DY47xRTo" target="_blank">message board&lt;/a> and within days it was populated with numerous stories, poems, and photos from so many friends and colleagues on whom Kirsty made such a lasting and loving impression.&lt;/p>
&lt;figure>&lt;a href="https://www.kudoboard.com/boards/DY47xRTo">&lt;img src="https://www.crossref.org/images/blog/kirsty-messages.jpg"
alt="Kirsty&amp;rsquo;s message board" width="25%">&lt;/a>&lt;figcaption>
&lt;p>Kirsty&amp;rsquo;s message board&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>It’s impossible to capture someone’s character in a blog, but some of the words that carry across the messages that people have shared are empathy, compassion, honesty, intelligence, brilliance, sincerity, laughter, human, passion, openness, and fun. We’ll miss her immensely.&lt;/p>
&lt;p>Kirsty was somewhat of an expert in grief. She lost her first husband, James Culling, to leukemia in December 2012, leaving her a widow with two sons, Dan, 7 at the time, and Luke, just 6-months old. A few years later, through the charity Widowed And Young (WAY), she met Martin Eggleston. Martin and his daughter Amy joined Kirsty, Dan, and Luke, and they created a very happy blended family. Some of us went to their wedding and it was an incredible event full of love and laughter - and of course music. Always music.&lt;/p>
&lt;p>Kirsty represented us, along with Rachael, at the funeral of another colleague last year, &lt;a href="https://doi.org/10.64000/eqnnm-c0659" target="_blank">Christine Hone&lt;/a>, in Amsterdam. Kirsty helped all of us get through the grief then. And because she made it okay to grieve and to talk about grief, it is heartbreaking and also comforting that she is indirectly helping us all now to be better able to handle her own death.&lt;/p>
&lt;h3 id="how-we-can-honour-kirstys-memory">How we can honour Kirsty’s memory&lt;/h3>
&lt;p>We heard that Kirsty’s last words were “I’m listening”. Which is just so fitting. She was always ready with an ear, a shoulder to support us all, and indeed she demanded that we express ourselves honestly.&lt;/p>
&lt;p>If you want to share memories of Kirsty, you can join others who have done so &lt;a href="https://www.kudoboard.com/boards/DY47xRTo" target="_blank">on the message board&lt;/a> or just take a few minutes to read through.&lt;/p>
&lt;p>And there is a &lt;a href="https://www.justgiving.com/fundraising/kirsty-meddings" target="_blank">justgiving page&lt;/a> in memory of Kirsty for Maggie&amp;rsquo;s Oxford, a branch of a cancer support charity who helped her and her family through James&amp;rsquo;s death and is now helping her family again.&lt;/p>
&lt;p>Professionally, Kirsty made major contributions at Crossref and in scholarly communications in general. More importantly, she had a profound impact on a personal level with many people. Our thoughts are with Martin, Dan, Amy, and Luke, and also with Kirsty’s mum Val, her brother Colin, her in-laws, her close friends, and all the people who&amp;mdash;like the rest of us&amp;mdash;are better for knowing her, and will never forget her.&lt;/p></description></item><item><title>Fast, citable feedback: Peer reviews for preprints and other record types</title><link>https://www.crossref.org/blog/fast-citable-feedback-peer-reviews-for-preprints-and-other-record-types/</link><pubDate>Wed, 09 Dec 2020 00:00:00 +0000</pubDate><author>Martyn Rittman</author><guid>https://www.crossref.org/blog/fast-citable-feedback-peer-reviews-for-preprints-and-other-record-types/</guid><description>&lt;p>Crossref has supported depositing metadata for preprints &lt;a href="https://doi.org/10.64000/5tcfp-vf140" target="_blank">since 2016&lt;/a> and peer reviews &lt;a href="https://www.crossref.org/news/2018-06-05-introducing-metadata-for-peer-review/">since 2018&lt;/a>. Now we are putting the two together, in fact we will permit peer reviews to be registered for any &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/">record type&lt;/a>.&lt;/p>
&lt;p>Currently, peer reviews can be registered for journal articles, but that means that they can only be related to some of the content our members deposit. Preprints, books, chapters, working papers, dissertations, and a host of other works can also be registered with Crossref. A number of these frequently undergo some form of review and many of our members and voices in the community have called for us to widen the net on peer reviews, including journal publishers, book publishers, review platforms, and preprint servers. We&amp;rsquo;ve listened and taken action, and from now on Crossref members can add &lt;a href="https://www.crossref.org/education/content-registration/structural-metadata/relationships/">relationship metadata&lt;/a> that links peer reviews to any record type. The metadata will also contain &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-record-types/peer-reviews/">the type of review&lt;/a>, stating whether it is a referee report, author response, or community comment, etc. This allows accurate reporting on whether the peer review is happening within a traditional editorial process or elsewhere.&lt;/p>
&lt;h2 id="reviews-for-preprints">Reviews for preprints&lt;/h2>
&lt;p>In the last decade there has been an increase in the number of disciplines using preprints. Since enabling registration of preprint metadata, it has become our fastest-growing record type. Preprints, working papers, and other forms of early publication help to accelerate dissemination of the latest research and discovery. They can also promote discussion on important topics, and help authors to improve papers before an editorial decision for journal publication. During the COVID-19 pandemic, preprints have become invaluable for speeding the publication of vital research and case studies.&lt;/p>
&lt;p>On the other hand, preprints do not undergo formal review and editorial approval, leading to concerns about the dissemination of false information. While the issue of misinformation in preprints has been discussed for some time, the COVID-19 pandemic has brought it more sharply into focus. organisations that post preprints need to balance the benefits of rapid dissemination with promoting their responsible use.&lt;/p>
&lt;p>To support the feedback process, preprint servers along with a growing number of other platforms and services offer scholars the opportunity to post public comments on preprints. By doing this, they give extra context for readers, provide suggestions for authors, and raise awareness of work that could be flawed or too preliminary.&lt;/p>
&lt;p>Another growing trend is journal publishers adopting editorial processes that involve preprint-first options and open peer review. As Dr. Stephanie Dawson from ScienceOpen says:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;We have long believed in rewarding reviewers by assigning Crossref DOIs to their open reviews to make them citable objects and we were one of the first users of Crossref&amp;rsquo;s peer review schema. However, a large percentage of the articles reviewed on ScienceOpen are publicly available preprints. The &lt;em>UCL Open: Environment&lt;/em> journal hosted on the platform, for example, is based on a workflow of open peer review of preprints. Our customers, editors, reviewers and authors are therefore extremely happy that these reviews can now also be assigned a Crossref peer review DOI for more accountability and transparency in scholarly publishing.&amp;rdquo;&lt;/p>
&lt;/blockquote>
&lt;p>At Crossref, we&amp;rsquo;re continually looking to support more record types and relations between them to build trust, support reproducibility and increase discoverability of content. This is another small step in building the &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">research nexus&lt;/a> and we look forward to working with members depositing peer reviews of preprints.&lt;/p></description></item><item><title>404: Support team down for essential maintenance</title><link>https://www.crossref.org/blog/404-support-team-down-for-essential-maintenance/</link><pubDate>Fri, 04 Dec 2020 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/404-support-team-down-for-essential-maintenance/</guid><description>&lt;p>2020 has been a very challenging year, and we can all agree that everyone needs a break. Crossref will be providing very limited technical and membership support from 21st December to 3rd January to allow our staff to rest and recharge. We’ll be back on January 4th raring to answer your questions. Amanda explains more about why we made this decision.&lt;/p>
&lt;p>As we all know, 2020 has been an unprecedented year, with the COVID-19 pandemic affecting lives across the globe.&lt;/p>
&lt;p>It’s been amazing to watch our members pivot their working practices and continue to publish content and register it with Crossref to keep the wheels of research and scholarly communications moving.&lt;/p>
&lt;p>Since January, we’ve seen 9,079,082 items registered with Crossref, up 13% on 2019. 2628 new members have also joined during that time and we now have almost 13.5k members from 139 countries. We’ve seen over 337 million requests to our REST API on average per month in 2020, a 9% increase over 2019 (and over 600 million total metadata queries on average per month across all our APIs and services).&lt;/p>
&lt;p>Of course, all this activity brings an increasing number of requests for help and support. Since the start of 2020, we have answered almost 24,000 support tickets from the community. Sometimes these just need a quick answer or a link to our documentation. Sometimes it&amp;rsquo;s a straightforward new member application or a routine query. But sometimes a prospective member needs a lots of advice, sometimes a long-standing member or user needs in-depth investigations and consultancy. Sometimes the request highlights a problem in one of our systems that needs input from our product and development colleagues. But either way, it’s keeping our small team of five full-time employees very busy.&lt;/p>
&lt;p>Vanessa &lt;a href="https://doi.org/10.64000/byv2m-9fm07" target="_blank">wrote&lt;/a> earlier in the year about how our Community Outreach team has changed its working practices this year. As Head of Member Experience I’ve been incredibly impressed by the way our membership, support and billing staff have done the same - remaining really focused on the needs of the Crossref community while (at the same time) balancing this with the demands of working from home, childcare, home-schooling, and supporting those affected by the pandemic in their own community. Isaac’s thoughtful &lt;a href="https://community.crossref.org/t/my-first-week-working-from-home-during-the-coronavirus-covid-19-pandemic/1236" target="_blank">post on our forum&lt;/a> about his first week working at home because of the pandemic really highlighted some of these challenges.&lt;/p>
&lt;p>We take work/life balance seriously at Crossref. We want to make sure that we’re are able to continue to help the Crossref community effectively in 2021, but are also able to continue to look after ourselves, our families, and our own communities in this difficult time. We all hope that 2021 will be a very different year, but there’s still likely to be disruption ahead for all of us, and one thing is sure: there will continue to be plenty more requests coming in for our small team to stay on top of in the meantime.&lt;/p>
&lt;p>With this in mind, we want to make sure that our support staff are able to properly rest and recharge during what is a holiday period for many of us coming up. We’ll be operating with just one person each on the technical support and membership support side between 23rd December and 3rd January. This means that while we’ll be able to answer urgent queries, &lt;strong>non-urgent questions will be left unanswered until 4th January. And we’ll not take on any new members between 21st December and 3rd January too.&lt;/strong>&lt;/p>
&lt;p>We know many of you will be continuing to work during this period. If you have a non-urgent question, do take a look at our &lt;a href="https://www.crossref.org/education/">support documentation&lt;/a> in the meantime, or see if other members (or our amazing Ambassadors) are able to &lt;a href="https://community.crossref.org/" target="_blank">help on our forum&lt;/a>. If you can’t find what you’re looking for and it&amp;rsquo;s urgent, we hope that the limited staff who are on call will still be able to help you out.&lt;/p>
&lt;p>Colleagues in the US have recently celebrated their Thanksgiving, and I remain enormously thankful for our team here at Crossref, and for you all in the scholarly community for your enthusiasm for working together collectively to help the world find, cite, link, assess, and reuse scholarly content. We all really appreciate your patience while we reset ready for 2021. Happy Holidays!&lt;/p></description></item><item><title>Crossref’s Board votes to adopt the Principles of Open Scholarly Infrastructure</title><link>https://www.crossref.org/blog/crossrefs-board-votes-to-adopt-the-principles-of-open-scholarly-infrastructure/</link><pubDate>Wed, 02 Dec 2020 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/crossrefs-board-votes-to-adopt-the-principles-of-open-scholarly-infrastructure/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>On November 11th 2020, the Crossref Board voted to adopt the “Principles of Open Scholarly Infrastructure” (POSI). POSI is a list of sixteen commitments that will now guide the board, staff, and Crossref’s development as an organisation into the future. It is an important public statement to make in Crossref’s twentieth anniversary year. Crossref has followed principles since its founding, and meets most of the POSI, but publicly committing to a codified and measurable set of principles is a big step. If &lt;a href="https://doi.org/10.64000/mmdqs-23829" target="_blank">2019 was a reflective turning point&lt;/a>, and mid-2020 was about &lt;a href="https://doi.org/10.64000/85qb8-4m872" target="_blank">Crossref committing to open scholarly infrastructure&lt;/a> and collaboration, this is now announcing a very deliberate path. And we’re just a little bit giddy about it.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/people/geoffrey-bilder/">Here is a picture of me being “giddy.”&lt;/a>&lt;/p>
&lt;p>If you just want to see the principles that the board has endorsed, you can see them here:&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.24343/C34W2H" target="_blank">https://doi.org/10.24343/C34W2H&lt;/a>&lt;/p>
&lt;p>But if you also want some background and want to understand some of the implications of Crossref adopting the principles, read on…&lt;/p>
&lt;p>Warning - this is a long post.&lt;/p>
&lt;!--more-->
&lt;h2 id="background-and-origins">Background and Origins&lt;/h2>
&lt;p>Some of you may be surprised that we’ve done this - simply because you always assumed we operated under these principles anyway. And we have. Mostly.&lt;/p>
&lt;p>The “Principles of Open Scholarly Infrastructure” were largely inspired by a set of uncodified rules and norms that Crossref had been operating under for years. So how did we get to this circular situation where we are making a big announcement about adopting something we have largely been doing anyway?&lt;/p>
&lt;p>Six years ago I met with Cameron Neylon and Jennifer Lin when they were still at PLOS and we decided that we wanted to write a blog post about&amp;hellip;&lt;/p>
&lt;p>Well, it doesn’t really matter.&lt;/p>
&lt;p>We never finished writing that blog post because we got distracted by an issue that we kept seeing which was that services that the scholarly community depended on were increasingly taking directions that seemed antithetical to the community’s interests.&lt;/p>
&lt;p>We were concerned because the scholarly community was becoming increasingly distrustful of infrastructure services. We wondered if there were any practices that we could point to that might mitigate the risk of infrastructure being co-opted and that would help build trust. Fortunately, we had two great models to look at:&lt;/p>
&lt;ul>
&lt;li>Crossref, which had a set of informal rules and norms that it had followed since its founding (e.g., transparency of operations, being business-model neutral, one member one vote).&lt;/li>
&lt;li>ORCID, an organisation that was spun-out of Crossref and which had adopted &lt;a href="https://orcid.org/about/what-is-orcid/principles" target="_blank">a written set of principles&lt;/a>, based largely on codifying practices that they had seen at Crossref.&lt;/li>
&lt;/ul>
&lt;p>And so we wrote these practices up and added a few that we thought were missing. And we posted a different blog post to the one we had originally planned. It was titled “&lt;a href="https://cameronneylon.net/blog/principles-for-open-scholarly-infrastructures/" target="_blank">The Principles of Open Scholarly Infrastructures.&lt;/a>” And the blog post became &lt;a href="https://www.google.com/search?channel=fs&amp;amp;client=ubuntu&amp;amp;q=%E2%80%9CPrinciples&amp;#43;for&amp;#43;Open&amp;#43;Scholarly&amp;#43;Infrastructures.%E2%80%9D" target="_blank">popular&lt;/a>. &lt;a href="https://www.youtube.com/watch?v=oWPZkZ180Ho" target="_blank">And we did a bunch of talks about the Principles&lt;/a>. And, much to our surprise, POSI has influenced the directions and policies of a number of organisations and initiatives since, including &lt;a href="https://sparcopen.org/our-work/good-practice-principles-for-scholarly-communication-services/" target="_blank">SPARC&lt;/a>, &lt;a href="https://investinopen.org/blog/invest-in-open-infrastructure-launches/" target="_blank">Invest in Open Infrastructure&lt;/a>, &lt;a href="http://theodi.org/wp-content/uploads/2020/04/OPEN_Designing-sustainable-data-institutions_ODI_2020.pdf" target="_blank">Open Data Institute&lt;/a>, &lt;a href="http://oaswitchboard.org" target="_blank">OA Switchboard&lt;/a>, and others.&lt;/p>
&lt;p>Elsewhere, community organisations and likeminded community members helped further develop the implementation of POSI through discussions at FORCE11 and through additional blog posts and books. Some, like Dryad and ROR, started to work to align their organisational structure to embrace POSI.&lt;/p>
&lt;p>And this left Crossref in a strange position. Although we were largely the inspiration for these Principles - we ourselves had never codified and adopted them.&lt;/p>
&lt;h2 id="motivations-why-now">Motivations. Why Now?&lt;/h2>
&lt;h3 id="because-it-is-the-right-thing-to-do-for-those-that-currently-depend-on-crossref">Because it is the right thing to do for those that currently depend on Crossref&lt;/h3>
&lt;p>It is a healthy thing for the organisation to do. Adopting these principles strengthens Crossref’s governance. After twenty years, Crossref infrastructure has become critical to a broad segment of the community. As our membership profile changes, and as our broader stakeholder community expands, we need to explicitly evolve our governance to reflect stakeholders. And it would be irresponsible to continue to have our governance guided by a set of informal conventions. Particularly in the context of a global political period where we’ve seen the informal operating conventions and policy understandings of at least two major democracies ignored or discarded.&lt;/p>
&lt;h3 id="because-it-could-help-make-the-creation-of-new-sustainable-open-scholarly-infrastructure-easier-and-less-expensive">Because it could help make the creation of new, sustainable, open scholarly infrastructure easier and less expensive&lt;/h3>
&lt;p>There is a lot of new interest in open scholarly infrastructure. New infrastructure services and systems are being proposed almost every month. Many of them seek extensive advice and consulting from Crossref. A subset of these are incubated through Crossref. And a subset of these become Crossref services. Others are spun out as separate organisations (e.g., &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a>) or were specifically initiated as collaborations (e.g., &lt;a href="https://ror.org" target="_blank">ROR&lt;/a>).&lt;/p>
&lt;p>Our experience has been that the vast majority of work involved in these infrastructure projects was in establishing trust amongst the stakeholder community. We think that Crossref adopting the principles will help to address fundamental questions about accountability and sustainability that are inevitably raised when a new constituency approaches Crossref with an idea for collaborating on a new or existing infrastructure service. In short, adopting the principles will make future collaboration easier.&lt;/p>
&lt;h2 id="adopting-the-principles-plus-ça-change">Adopting the Principles: Plus ça change&lt;/h2>
&lt;p>The Principles of Open Scholarly Infrastructure (POSI) proposes three areas that an Open Infrastructure organisation can address in order to garner the trust of the broader scholarly community: accountability (governance), funding (sustainability), and protection of community interests (insurance).&lt;/p>
&lt;p>POSI proposes a set of concrete commitments that an organisation can make to build trust in each of these areas. There are 16 such commitments. Of these 16 commitments, Crossref is already completely or partially meeting the requirements of 15. And adopting the 16th commitment just formalises a direction Crossref has been heading toward for several years.&lt;/p>
&lt;p>Critically, “adopting” POSI does not mean that we have to instantly meet all of the criteria. After all, when ORCID adopted its principles, it didn’t meet &lt;em>any&lt;/em> of them. They were adopted to make a statement of intent. And they were publicly adopted so that the community could measure the organisation&amp;rsquo;s progress as well as to allow the community to detect if ORCID started to stray from its stated intentions.&lt;/p>
&lt;p>Adopting the principles is akin to adopting a mission statement or a vision statement. It is an aspirational guide, not a description of the &lt;em>status quo&lt;/em>.&lt;/p>
&lt;p>Having said that, the principles are more concrete than a mission or vision statement, and this makes them easier to measure.&lt;/p>
&lt;p>It is also important to note that the criteria are designed to balance each other. So, for example, one would not want to change the governance or business model to better support the mission if doing so would also threaten the sustainability of the organisation.&lt;/p>
&lt;p>And finally, meeting a commitment is an ongoing process - it is not a one-off event. The organisation needs to keep measuring their performance against the principles in order to make sure that they have not inadvertently regressed.&lt;/p>
&lt;h2 id="implications">Implications&lt;/h2>
&lt;p>Before adopting the principles, we did a candid self-audit to see which ones we thought we currently met and which ones we still needed to work on.&lt;/p>
&lt;p>The three areas and sixteen commitments that are proposed in POSI are all designed to ensure that an infrastructure can not be co-opted by a particular party or interest group.&lt;/p>
&lt;p>And the last area, “Insurance,” is the backstop that makes sure that, if some in the community feel that the infrastructure organisation has gone in a radically wrong direction, they can recreate the infrastructure as it was when they were comfortable with it, and they will not be hindered by practices or policies that lock them into the existing organisation.&lt;/p>
&lt;p>This “insurance” is very much inspired by Crossref. Crossref itself was built, in part, to make sure that publishers were not locked into platforms and that journals and societies were not locked into publishers. Using the indirect Crossref DOI linking mechanism ensures that content can move between platforms and publishers without breaking vital citation links. Moving between platforms or publishers is never easy. And it isn’t cheap. But using Crossref DOIs for citation links at least makes it possible.&lt;/p>
&lt;p>Crossref has an extra insurance level as well. It is built on the DOI and Handle infrastructure. If Crossref were to take a direction that some of its members found unacceptable, those members could join another DOI Registry agency more amenable to them. It wouldn’t be easy. It wouldn’t be cheap. But it would be possible.&lt;/p>
&lt;p>And this knowledge helps keep Crossref grounded and attuned to the needs and concerns of its members. We know that our members are not “trapped” with us. We don’t take lightly the trust placed in us. And we know that there is trust still to build with various corners of our community. And it is this knowledge that helps keep us from developing the disdainful, take-it-or-leave-it, attitude that can be the cliché characteristic of infrastructure organisations.&lt;/p>
&lt;p>So the fundamental, overarching goal of POSI is to set out principles that ensure that the stakeholders of an infrastructure organisation have a clear say in setting its agenda and priorities and that, in extremis, the stakeholders can leave and create an alternative infrastructure if the original organisation becomes unresponsive, hostile, or disappears.&lt;/p>
&lt;p>As we look at how Crossref currently maps to the principles, please keep in mind three things:&lt;/p>
&lt;ol>
&lt;li>If we have marked something as green, that doesn’t mean we think we do this perfectly. It simply means that we already have internal processes that focus on this commitment and we have evidence that these processes have thus far been working.&lt;/li>
&lt;li>The fact that something is green and has “thus-far been working” does not mean that we should rest easy. We could regress. Our processes need to be able to detect and address regressions.&lt;/li>
&lt;li>The commitments are supposed to be balanced. So we don’t want to do something to turn something green if it has an irreversible impact on another commitment. So, for example, we should not address a shortfall in the contingency fund by generating revenue in a way that ultimately hurts Crossref’s mission.&lt;/li>
&lt;li>The implication of #3 above is that it may take us some time to meet all of the commitments. But again, the community can measure our progress against meeting the commitments.&lt;/li>
&lt;/ol>
&lt;h2 id="so-how-does-crossref-currently-meet-posi">So how does Crossref currently meet POSI?&lt;/h2>
&lt;h3 id="governance">Governance&lt;/h3>
&lt;pre>&lt;code>🟢 Coverage across the research enterprise.
🟢 Non-discriminatory membership
🟢 Transparent operations
🟢 Cannot lobby
🟢 Living will
🟢 Formal incentives to fulfil mission &amp;amp; wind-down
🔴 Stakeholder Governed
&lt;/code>&lt;/pre>
&lt;h3 id="sustainability">Sustainability&lt;/h3>
&lt;pre>&lt;code>🟢 Time-limited funds are used only for time-limited activities.
🟢 Goal to generate surplus
🟡 Goal to create contingency fund to support operations for 12 months
🟢 Mission-consistent revenue generation
🟢 Revenue based on services, not data
&lt;/code>&lt;/pre>
&lt;h3 id="insurance">Insurance&lt;/h3>
&lt;pre>&lt;code>🟢 Available data (within constraints of privacy laws)
🟡 Patent non-assertion
🟡 Open source
🟡 Open data (within constraints of privacy laws)
&lt;/code>&lt;/pre>
&lt;h3 id="governance-1">Governance&lt;/h3>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>If an infrastructure is successful and becomes critical to the community, we need to ensure it is not co-opted by particular interest groups. Similarly, we need to ensure that any organisation does not confuse serving itself with serving its stakeholders. How do we ensure that the system is run “humbly”, that it recognises it doesn’t have a right to exist beyond the support it provides for the community and that it plans accordingly? How do we ensure that the system remains responsive to the changing needs of the community?&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>In the area of governance, Crossref clearly meets six of the seven criteria listed. We will discuss these first.&lt;/p>
&lt;h4 id="-coverage-across-the-research-enterprise">🟢 Coverage across the research enterprise&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>it is increasingly clear that research transcends disciplines, geography, institutions and stakeholders. The infrastructure that supports it needs to do the same.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref includes members who publish in the STM, HSS and Professional spheres. There are still some gaps in our coverage (e.g., monographs, law), but this is not through policy or lack of trying.&lt;/p>
&lt;p>Crossref has members in 139 countries and has agreements with people in 150 countries. However note that geographic diversity is &lt;em>not&lt;/em> the same as language diversity. Although we have members in many countries, the vast majority of our registered content is still in English. This does not reflect the trends in research outputs. We still need to do a lot of work to support non-English publications and non-English speaking members. But we have already identified this as a priority and are working on a number of initiatives to better support research communication in languages other than English.&lt;/p>
&lt;h4 id="-non-discriminatory-membership">🟢 Non-discriminatory membership&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>we see the best option as an “opt-in” approach with a principle of non-discrimination where any stakeholder group may express an interest and should be welcome. The process of representation in day to day governance must also be inclusive with governance that reflects the demographics of the membership&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>It is first worth noting that “non-discriminatory” does not mean that we cannot have standards, obligations, and rules that all members of Crossref have to adhere to. It simply means that said rules are clear and that we apply them uniformly.&lt;/p>
&lt;p>Crossref has always had catholic membership criteria. Although we have until now historically defined ourselves as a primarily “publisher” organisation, we define “publisher” loosely as anybody who produces content that commonly references or is referenced by scholarly literature. Historically, this has included NGOs, IGO’s, standards bodies, institutional archives, and professional publishers. More recently it has expanded to include preprint archives and funders.&lt;/p>
&lt;p>The requirements for joining Crossref are few. We admit any applicant who:&lt;/p>
&lt;ul>
&lt;li>Agrees to the obligations of membership.&lt;/li>
&lt;li>Can pay the fees.&lt;/li>
&lt;/ul>
&lt;p>In practice we have historically had a policy of rejecting individuals as members. But even this is probably a pointless distinction as many of our members are “organisations” consisting of one person.&lt;/p>
&lt;p>And fundamental to Crossref’s governance is that a member’s influence in the governance of Crossref is not tied to the level of financial investment they make in the organisation. All members have the same single vote. All board members have one vote.&lt;/p>
&lt;p>Recently, we have also made changes to our governance and election process. The first to introduce contested elections for the board. The second to ensure that board membership was proportionally balanced amongst the membership tiers. Even as recently as 2017, when the Board established a Governance Committee, the idea of weighting votes to membership tiers was roundly rejected - on principle.&lt;/p>
&lt;p>This is not to say that we can relax on this point. For example, as more funders and institutions join Crossref, we will need to make sure that our governance reflects that. We talk about this more in the section on governance.&lt;/p>
&lt;p>Some will also point out that our fees are themselves a form of discrimination as they can still be an insurmountable barrier to some in the community. We understand this and, without trying to make light of or dismiss the situation, we are also confident that we are constantly looking at ways to lower the barrier-to-entry for joining Crossref. Our fees have gone steadily down since we were founded and we are constantly reviewing them to try and make them more equitable. We have created a category of sponsoring organisations to defray the costs of membership. We collaborate closely with organisations like PKP to try and build tools and services that make participation in Crossref easier and less expensive.&lt;/p>
&lt;h4 id="-transparent-operations">🟢 Transparent operations&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>achieving trust in the selection of representatives to governance groups will be best achieved through transparent processes and operations in general (within the constraints of privacy laws).&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has transparent finances and a transparent governance process. Much of this is simply a byproduct of the regulations governing non-profits with tax exempt status in the US and our specific registration as a non-profit membership association in New York State.&lt;/p>
&lt;p>Until fairly recently, the obvious exception to this was Crossref’s use of pre-picked slates in board elections, but we have since improved this with an open election process.&lt;/p>
&lt;h4 id="-cannot-lobby">🟢 Cannot lobby&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>the community, not infrastructure organisations, should collectively drive regulatory change. An infrastructure organisation’s role is to provide a base for others to work on and should depend on its community to support the creation of a legislative environment that affects it&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has never lobbied. Partly this is a byproduct of our commitment to be business-model neutral as most lobbying efforts in the industry seem to center around promoting the views held by members who share a business model.&lt;/p>
&lt;p>But also, Crossref has never lobbied on its own behalf. We have always relied on our members and the community to point out and promote Crossref if there is any area of legislative policy that the Crossref infrastructure could help with.&lt;/p>
&lt;h4 id="-living-will">🟢 Living will&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>a powerful way to create trust is to publicly describe a plan addressing the condition under which an organisation would be wound down, how this would happen, and how any ongoing assets could be archived and preserved when passed to a successor organisation. Any such organisation would need to honour this same set of principles&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has two relationships that require us to set out plans for an orderly wind-down.&lt;/p>
&lt;p>The first is a condition of our incorporation as a non-profit in the state of New York. This explicitly includes a provision that requires us to hand over our operations and responsibilities to a successor non profit organisation that has a similar constituency and mission. The NY State Attorney General reviews and approves any major changes to ensure this requirement is met.&lt;/p>
&lt;p>The second is a condition of our being members of the DOI Foundation, which includes provisions for us to hand over management of DOIs to another registration agency should Crossref ever wind-down. It is worth noting that we have already seen this clause invoked for other registration agencies that have wound down and who have, as part of the DOI Foundation provisions, handed responsibility for their DOIs to Crossref.&lt;/p>
&lt;p>This is not to say that we are perfect on this score. We do not, for example, have any single place that outlines the steps that would need to be taken in order to execute the requirements laid out by our obligations to the state of New York and the IDF.&lt;/p>
&lt;h4 id="-formal-incentives-to-fulfil-mission--wind-down">🟢 Formal incentives to fulfil mission &amp;amp; wind-down&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>infrastructures exist for a specific purpose and that purpose can be radically simplified or even rendered unnecessary by technological or social change. If it is possible the organisation (and staff) should have direct incentives to deliver on the mission and wind down.”&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has a track record of periodically reviewing our services and decommissioning those that are no longer needed - either because they have fulfilled their specific mission or because there is simply waning interest in them (arguably, the same thing).&lt;/p>
&lt;p>Again, this is not to say we are perfect on this score. We also have, by our last count, about 30 specialised, overlapping APIs- many of which are used by just a handful of users. These have escaped our normal scrutiny because they never had the status of a formal service and had not been through our product management process.&lt;/p>
&lt;p>But still, Crossref has long made it a habit to question its own existence. At virtually every board annual strategy meeting we ask the question “will technology X make Crossref unnecessary?” We need to continue with the attitude that the best thing we could do for our members is to make ourselves unnecessary.&lt;/p>
&lt;h4 id="-stakeholder-governed">🔴 Stakeholder Governed&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>a board-governed organisation drawn from the stakeholder community builds more confidence that the organisation will take decisions driven by community consensus and consideration of different interests.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Overall, Crossref meets most of the Governance requirements with the notable exception of broader stakeholder involvement.&lt;/p>
&lt;p>Of course, the key to this is how you define “stakeholder.”&lt;/p>
&lt;p>Some may dispute this and argue that Crossref “stakeholders” are “publishers” because they are the parties that invested in creating Crossref.&lt;/p>
&lt;p>But this narrow definition of “stakeholder” - focusing solely on those who have “invested”- is not widely held. In fact, common phrases like &amp;ldquo;&lt;a href="https://www.lexico.com/definition/stakeholder_economy" target="_blank">stakeholder economy&lt;/a>&amp;rdquo; and &amp;ldquo;&lt;a href="https://hbr.org/2020/01/making-stakeholder-capitalism-a-reality" target="_blank">stakeholder capitalism&lt;/a>&amp;rdquo; describe the exact opposite- systems that don&amp;rsquo;t just focus on the “investor”, but which instead balance benefits to the investor with benefits to employees, the broader community, society, and the environment.&lt;/p>
&lt;p>It is this latter, broader definition of “stakeholder” that is used in POSI.&lt;/p>
&lt;p>And just in case anybody still thinks that people other than publishers don’t consider themselves “stakeholders’ in the Crossref infrastructure, we simply point to this, recently tweeted by &lt;a href="http://orcid.org/0000-0001-6507-6848" target="_blank">Brea Manuel&lt;/a>, a researcher, in celebration of their publication in Nature Reviews Chemistry (&lt;a href="https://doi.org/10.1038/s41570-020-0214-z" target="_blank">read it, and learn how to recruit and retain a diverse workforce&lt;/a>):&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/doi_tattoo.png" alt="Brea Manuel&amp;rsquo;s DOI tattoo" title="Brea Manuel's DOI tattoo">&lt;/p>
&lt;h3 id="sustainability-1">Sustainability&lt;/h3>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Financial sustainability is a key element of creating trust. “Trust” often elides multiple elements: intentions, resources, and checks and balances. An organisation that is both well meaning and has the right expertise will still not be trusted if it does not have sustainable resources to execute its mission. How do we ensure that an organisation has the resources to meet its obligations?&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>In the area of sustainability, Crossref clearly meets four of the five of the criteria listed and is most of the way to meeting the fifth.&lt;/p>
&lt;h4 id="-time-limited-funds-are-used-only-for-time-limited-activities">🟢 Time-limited funds are used only for time-limited activities&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>day to day operations should be supported by day to day sustainable revenue sources. Grant dependency for funding operations makes them fragile and more easily distracted from building core infrastructure.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has never supported production activities based on grants. Indeed Crossref’s delivery on this point is what inspired the approach taken in this principle. This distinguishes Crossref from many grant-funded infrastructure initiatives which either barely stay afloat or disappear altogether. Even those that survive often do so by pursuing solutions that align with their funder’s interest over their user’s needs.&lt;/p>
&lt;h4 id="-goal-to-generate-surplus">🟢 Goal to generate surplus&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>organisations which define sustainability based merely on recovering costs are brittle and stagnant. It is not enough to merely survive, it has to be able to adapt and change. To weather economic, social and technological volatility, they need financial resources beyond immediate operating costs.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has always attempted to generate a surplus. Crossref has generated surpluses since 2002 - so for 18 years of its 20 year existence.&lt;/p>
&lt;h4 id="-goal-to-create-contingency-fund-to-support-operations-for-12-months">🟡 Goal to create contingency fund to support operations for 12 months&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>a high priority should be generating a contingency fund that can support a complete, orderly wind down (12 months in most cases). This fund should be separate from those allocated to covering operating risk and investment in development.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref currently has a contingency fund that would support operations for 9 months. Although this may be standard for industry, it seems prudent to extend this in the case of infrastructure organisations, particularly when they are membership organisations. First, the very fact that something is infrastructure implies that the systemic effects of its failing ungracefully could have industry-wide repercussions. Second, the decision-making process of a membership organisation whose governance is voluntary is inherently slower. It has taken Crossref Board 9 months, for example, just to discuss the ramifications of adopting POSI.&lt;/p>
&lt;p>Given our recent financial performance, we expect Crossref could comfortably increase the contingency fund to support 12 months of operations within the next 2-3 years.&lt;/p>
&lt;h4 id="-mission-consistent-revenue-generation">🟢 Mission-consistent revenue generation&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>potential revenue sources should be considered for consistency with the organisational mission and not run counter to the aims of the organisation.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has a good track record of periodically reviewing our services and fees and adjusting them to better support Crossref’s mission. The role of the Membership &amp;amp; Fees Committee in advising the Board has been critical. The very first example of this was in the early days of Crossref when we dropped matching fees because they were disincentivising members from linking their references. Crossref was also quick to recognise that, in order to support global research and reach smaller publishers in lower income countries, we had to develop a sponsoring mechanism to help defray the costs and ameliorate the technical complexity of participating in Crossref. Most recently we have taken the decision to drop fees for Crossmark as it was clear they had become a barrier to our members distributing retraction and correction notifications in a machine actionable format.&lt;/p>
&lt;h4 id="-revenue-based-on-services-not-data">🟢 Revenue based on services, not data&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>data related to the running of the research enterprise should be a community property. Appropriate revenue sources might include value-added services, consulting, API Service Level Agreements or membership fees&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref does not charge for or resell its members’ data. Doing so would restrict dissemination and reduce the discoverability of our members’ content. Instead our revenue comes from a combination of membership fees and service fees. The DOI registration is a member service that generates the bulk of our revenue. But our SLA-backed APIs are becoming increasingly popular as members and others seek to integrate Crossref metadata into their production workflows and services.&lt;/p>
&lt;h3 id="insurance-1">Insurance&lt;/h3>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Even with the best possible governance structures, critical infrastructure can still be co opted by a subset of stakeholders or simply drift away from the needs of the community. Long term trust requires the community to believe it retains control. Here we can learn from Open Source practices. To ensure that the community can take control if necessary, the infrastructure must be “forkable.” The community could replicate the entire system if the organisation loses the support of stakeholders, despite all established checks and balances. Each crucial part then must be legally and technically capable of replication, including software systems and data. Forking carries a high cost, and in practice this would always remain challenging. But the ability of the community to recreate the infrastructure will create confidence in the system. The possibility of forking prompts all players to work well together, spurring a virtuous cycle. Acts that reduce the feasibility of forking then are strong signals that concerns should be raised. The following principles should ensure that, as a whole, the organisation in extremis is forkable.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref clearly meets two of the four Insurance requirements. And the remaining two can be met easily with some clarification and time.&lt;/p>
&lt;p>The “governance” section of POSI is designed to ensure that an infrastructure organisation is beholden to the broader stakeholder community and that it can not be co-opted by a particular party or special interest. And the “sustainability” section of POSI is designed to ensure that the infrastructure organisation takes the financial steps to ensure it can weather sudden changes in the financial or technical environment. But the last section, “insurance” is designed to protect stakeholder interests in case either “governance” or “sustainability” fail.&lt;/p>
&lt;p>The term “forkable” comes from the Open Source software community where it is used to indicate when a software community’s interests diverge and they decide to split a project into several projects, with each new project focusing on a particular sub-community&amp;rsquo;s interests.&lt;/p>
&lt;p>One of the immediate worries that people have when they first hear of the concept of “forkability” is that it will encourage the creation many variations of a project based on frivolous criteria. But this simply does not happen. Forking a project is never easy and takes a lot of effort. It is only done successfully when a critical mass of the community becomes unhappy with the direction a project is taking and is willing to take on the substantial burden of running an entirely separate project. Without such a critical mass, the fork just withers and has virtually no effect on the original project.&lt;/p>
&lt;p>And the reason for this is simple, the mere knowledge that a project is “forkable” forces project maintainers to balance the interests of the community so that no sizable subgroup grows dissatisfied enough to fork the project.&lt;/p>
&lt;p>Forkability encourages reponsivness to the community by making sure that the community is not “locked-in.”&lt;/p>
&lt;p>Crossref itself was founded, in part, to prevent lock-in. Use of the DOI in linking citations makes it easier for publishers to move platforms, and for journals and societies to move between publishers.&lt;/p>
&lt;p>And Crossref itself is architected &lt;em>in part&lt;/em> to ensure that lock-in is not possible. Crossref is just one of several DOI registration agencies. Members unhappy with Crossref, can move to another DOI registration agency and their citation links will continue to work. But there are things we could do to make this even easier.&lt;/p>
&lt;h4 id="-available-data-within-constraints-of-privacy-laws">🟢 Available data (within constraints of privacy laws)&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>It is not enough that the data be made “open” if there is not a practical way to actually obtain it. Underlying data should be made easily available via periodic data dumps.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref provides public APIs that allow users to access Crossref metadata. We are planning to eventually release yearly public data files. We already did this once &lt;a href="https://doi.org/10.64000/wsnyw-yap64" target="_blank">when we released a public data file in support of COVID-19 research.&lt;/a> This in no way prevents the provision of data through paid Service Level Agreement tiers that provide guarantees of regularity, availability or reliability for those that need it. Existing Metadata Plus customers primarily use data that is available through the open API or existing dumps, but value additional services that support their use-cases.&lt;/p>
&lt;h4 id="-patent-non-assertion">🟡 Patent non-assertion&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>“The organisation should commit to a patent non-assertion covenant. The organisation may obtain patents to protect its own operations, but not use them to prevent the community from replicating the infrastructure.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Crossref has never registered a patent. But the DOI Foundation, with significant support from Crossref, had to respond to (and then monitored) a set of patent applications that, if successful, the DOI System would infringe on. The applications were filed more than 15 years ago and haven’t been successful so these applications aren’t a current concern. As a result of this, the DOI Foundation adopted a patent policy in 2005 that covers all Registration Agencies and protects the DOI System. We may want to register protective patents in the future in order to enable us to defend ourselves against patent trolls.&lt;/p>
&lt;p>The problem with patents is that they could be used by an organisation to prevent the infrastructure forking. One technique that has been used by major companies to assure communities that they will not be affected by patents, is to make a &lt;a href="http://www.iphandbook.org/handbook/ch07/p06/" target="_blank">patent non-assertion covenant&lt;/a>. For example, &lt;a href="https://www-03.ibm.com/press/us/en/pressrelease/21846.wss" target="_blank">IBM&lt;/a>, &lt;a href="https://docs.microsoft.com/en-us/openspecs/dev_center/ms-devcentlp/1c24c7c8-28b0-4ce1-a47d-95fe1ff504bc" target="_blank">Microsoft&lt;/a> and &lt;a href="https://www.google.com/patents/opnpledge/" target="_blank">Google&lt;/a> have made non-assertion statements in order to assure the open source and standards communities that they participate in that they will not co-opt an open source project or open standard by asserting patents on code or processes they contribute.&lt;/p>
&lt;p>Though Crossref has never registered a patent, issuing a patent non-assertion covenant would help assure stakeholders that we would not use patents in the future to prevent the community from forking the system.&lt;/p>
&lt;h4 id="-open-source">🟡 Open source&lt;/h4>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>All software required to run the infrastructure should be available under an open source license. This does not include other software that may be involved with running the organisation.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>All code for new initiatives since 2007 has been released under an open source MIT license. The legacy Content System code could be open sourced within 12-18 months with no extra effort.&lt;/p>
&lt;p>If some Crossref stakeholders wanted to “fork” Crossref or leave for another DOI registration agency, their biggest hurdle would be trying to recreate the twenty years worth of rules and algorithms we use for processing and matching metadata. Without access to the source code of the system, it would be almost impossible for these to be reverse engineered.&lt;/p>
&lt;p>Similarly, without access to the source code of our system - it is difficult to ensure that Crossref is, indeed, non-discriminatory in the way it works with member content. It would be possible, for example, for Crossref to modify its matching algorithms to deliberately favour or deprecate some members’ content.&lt;/p>
&lt;p>If we want to assure the community that we are managing our member metadata fairly and if we want to provide even better insurance to our members and the broader stakeholders, we should make all of our code open source.&lt;/p>
&lt;p>The legacy so-called “CS” (content system) is in the process of being refactored. The only reason we cannot open source this immediately is that we still need to make some security changes to it. These security changes are being done as part of a current refactoring project and should be completed without any extra effort within 12-18 months. After that, we can open source the code.&lt;/p>
&lt;p>🟡 Open data (within constraints of privacy laws)&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>For an infrastructure to be forked it will be necessary to replicate all relevant data. The CC0 waiver is best practice in making data legally available. Privacy and data protection laws will limit the extent to which this is possible.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; POSI&lt;/cite>&lt;/p>
&lt;/div>
&lt;p>Achieving this simply requires us clarifying copyright and license information and that this will not have any effect on the metadata registered in Crossref by our members.&lt;/p>
&lt;p>First we should outline the current copyright status of a Crossref metadata record.&lt;/p>
&lt;p>The fundamental issue is that what we colloquially call “Crossref metadata” is actually a mix of elements, some of which come from our members, and some of which come from third parties and some of which comes from Crossref itself. These elements, in turn, each have different copyright implications.&lt;/p>
&lt;p>On top of this, Crossref has terms and conditions for its members and terms and conditions for specific services. These grant Crossref the right to do things with some classes of metadata and not do things with other classes of metadata - regardless of copyright.&lt;/p>
&lt;p>Let’s start with the easiest case. Crossref already has two services with CC0 metadata:&lt;/p>
&lt;ul>
&lt;li>The Open Funder Registry&lt;/li>
&lt;li>Event Data&lt;/li>
&lt;/ul>
&lt;p>Obviously, the POSI open data provision would not change anything for either service.&lt;/p>
&lt;p>The next easiest case is private data. Crossref collects PII (usernames, passwords IP addresses, etc.). This would remain private. And we will continue to manage it in conformance with GDPR. It would not be affected by the open data provision of POSI.&lt;/p>
&lt;p>Next let’s look at what most people probably think of as “Crossref metadata”- that is, the basic bibliographic metadata that Crossref has collected from its members since its founding (titles, authors, volumes, issues, etc). For the record- this does &lt;em>not&lt;/em> include abstracts.&lt;/p>
&lt;p>Since 2000 Crossref has stated that it considers this basic bibliographic metadata to be “facts.” And under US law (Crossref is registered in the US) these facts are not subject to copyright at all. If this data is not subject to copyright at all, there is no way Crossref can “waive the copyright” under CC0. This metadata would not be affected at all under the open data provision of POSI.&lt;/p>
&lt;p>More recently, some of our members have been submitting abstracts to Crossref. These are copyrighted. In the case of subscription publishers, the copyright usually belongs to the publisher. In the case of open access publishers, the copyright most often belongs to the authors. In both cases, Crossref cannot waive copyright under CC0 because the copyright is not ours to waive. However, we are allowed to redistribute the abstracts with our metadata because that is part of the terms and conditions we have with our members. We already have &lt;a href="https://www.crossref.org/services/metadata-retrieval/#00360">language that notes the distinct copyright status of the abstracts in our metadata&lt;/a>, but, ideally, we should extend our schema to make that information available in a machine actionable form as well. In short, the copyright status of abstracts would not be affected at all by the open data provision of POSI.&lt;/p>
&lt;p>Crossref also has its &lt;a href="https://www.crossref.org/education/content-registration/descriptive-metadata/references/#00564">Reference Distribution Policy&lt;/a> that the board adopted in 2017 - limited and closed references are not distributed by Crossref and this won’t change.
&lt;em>[EDIT 6th June 2022 - all references are now open by default with the March 2022 board vote to remove any restrictions on reference distribution].&lt;/em>&lt;/p>
&lt;p>And this leaves us with the one thing that &lt;em>would&lt;/em> be affected by the open data provision of POSI- data that is created by Crossref itself as a byproduct of our services. By law, this data is under Crossref’s copyright unless we explicitly waive it. This data includes things like, participation reports, conflict reports, member IDs and Cited-by counts (just the counts, not the references) and any aggregations of our otherwise uncopyrighted data that might, by aggregating it, be subject to &lt;em>sui generis&lt;/em> database rights. At the moment, although we distribute this data freely and without restriction, we have no explicit copyright attached to it. All we would be seeking to do is explicitly say that data generated by Crossref will be distributed CC0. Again, at first it would be enough to just specify this in human readable form, along with our other copyright information. But, eventually, we would want to include this information in machine actionable form in the metadata itself.&lt;/p>
&lt;p>&lt;strong>To summarise:&lt;/strong>&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: left">Metadata type&lt;/th>
&lt;th style="text-align: left">Example&lt;/th>
&lt;th style="text-align: left">Current Copyright&lt;/th>
&lt;th style="text-align: center">Change under POSI&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: left">Already CC0&lt;/td>
&lt;td style="text-align: left">Open Funder Registry, Event Data&lt;/td>
&lt;td style="text-align: left">CC0&lt;/td>
&lt;td style="text-align: center">None&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Private&lt;/td>
&lt;td style="text-align: left">Log files, user IDs&lt;/td>
&lt;td style="text-align: left">Private&lt;/td>
&lt;td style="text-align: center">None&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Bibliographic&lt;/td>
&lt;td style="text-align: left">Title, authors, volume, issue&lt;/td>
&lt;td style="text-align: left">Facts&lt;/td>
&lt;td style="text-align: center">None&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Closed references&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;td style="text-align: left">Facts - but no distribution under the reference distribution board policy from 2017&lt;/td>
&lt;td style="text-align: center">None&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Limited references&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;td style="text-align: left">Facts - but no public distribution under the reference distribution board policy from 2017&lt;/td>
&lt;td style="text-align: center">None&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Open references&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;td style="text-align: left">Facts&lt;/td>
&lt;td style="text-align: center">None&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Crossref-generated data&lt;/td>
&lt;td style="text-align: left">Participation data, reports, extracts&lt;/td>
&lt;td style="text-align: left">Copyright Crossref&lt;/td>
&lt;td style="text-align: center">CC0&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;em>[EDIT 6th June 2022 - all references are now open by default with the March 2022 board vote to remove any restrictions on reference distribution].&lt;/em>&lt;/p>
&lt;br>
&lt;p>No member metadata will be affected by our adopting the open data provision of POSI. The only data that would be affected is data generated by Crossref itself.&lt;/p>
&lt;p>However, the adoption of this principle would likely have an effect on our decisions about future services. For example, under this principle we would not launch any new services where the data was not freely reusable or the copyright of the data was not CC0.&lt;/p>
&lt;h2 id="conclusion-and-next-steps">Conclusion and Next steps&lt;/h2>
&lt;p>So again we face the paradox- We are announcing something that is simultaneously insignificant and important. It is insignificant in that we are simply saying that we will continue to do what we have largely been doing since Crossref was founded. But it is important because, in codifying what we have been doing, we are also confirming that these principles actually worked. That they were essential to building the trust that allowed us to function over the past twenty years, and they will continue to be essential in the future- as we look to work with existing organisations to strengthen current infrastructures, and work with new stakeholders to develop new infrastructures.&lt;/p>
&lt;p>So much of the work in building scholarly infrastructure is about building trust. We would love to see other organisations and services adopt POSI as well. Doing so would help us to collaborate more efficiently by allowing us to confirm from the outset that our fundamental values align. And having a set of verifiable commitments that we can point to will also help build the community&amp;rsquo;s trust in our respective organisations and services.&lt;/p>
&lt;p>And this brings us to an important point. Although POSI might have been inspired by Crossref, POSI is not a “Crossref thang” and it never has been. The movement to create open scholarly infrastructures and to define and clarify the ground rules within which they operate has become a much broader community concern.&lt;/p>
&lt;p>To this end, we’ve worked with some sibling infrastructure organisations—such as Dryad and ROR—as well as the original authors of POSI to create a website where we could host the list of principles independent of the original blog post and independent of any single organisation:&lt;/p>
&lt;div class="shortcode-divwrap darkgrey-highlight">
&lt;span>&lt;a href="http://openscholarlyinfrastructure.org/" target="_blank">openscholarlyinfrastructure.org&lt;/a>&lt;/span>
&lt;/div>
&lt;br>
Minimally, this provides a place for anybody who wants to link to or cite POSI - either because they are endorsing them, or because they are simply discussing them.
&lt;p>If we see enough activity of this type, then the site could evolve to become a register of those organisations and services who have formally adopted POSI and a place where they can link to their self-assessments against the principles.&lt;/p>
&lt;p>The community promoting, discussing and applying POSI has long since grown beyond the original authors of the POSI blog post. And it is also much larger than any single organisation. Our hope is that this website encourages that growth.&lt;/p>
&lt;p>And, of course, in addition to the external outreach and coordination, Crossref still has internal work to do in addressing the outstanding issues that were raised in our own self-assessment above. We need to increase our contingency funds. We need to publish a patent non-assertion covenant. We need to open source our core software. And we need to clarify our metadata license information and make it explicit that Crossref waives copyright (using CC-0) for any metadata generated by Crossref. And, finally, as Crossref expands and starts working with different stakeholders, we will need to adjust our governance and the composition of our board accordingly. We will, of course, post updates here as we make progress on addressing these areas.&lt;/p>
&lt;p>2020 marked Crossref’s 20th birthday. What a grim year to have an anniversary. But we are, at least, ending it on a little bit of a high. We are delighted that the issue of open scholarly infrastructure has become so prominent in the community. And we are eager to help strengthen and extend this infrastructure. The decision by Crossref’s board to adopt POSI is the equivalent of Crossref finally adopting a written constitution. And it is a fitting launch to our next twenty years.&lt;/p></description></item><item><title>Calling all 24-hour (PID) party people!</title><link>https://www.crossref.org/blog/calling-all-24-hour-pid-party-people/</link><pubDate>Tue, 13 Oct 2020 00:00:00 +0000</pubDate><author>Kathleen Luschek</author><guid>https://www.crossref.org/blog/calling-all-24-hour-pid-party-people/</guid><description>&lt;p>While we wish we could be together in person to celebrate the fifth PIDapalooza, there&amp;rsquo;s an upside to &lt;a href="https://pidapalooza.us18.list-manage.com/track/click?u=3be6c9ed55c4e452e710b2d41&amp;amp;id=e88a641bb4&amp;amp;e=8567777e89" target="_blank">moving it online&lt;/a>: now &lt;em>everyone&lt;/em> can participate in the universe&amp;rsquo;s best PID party! With 24 hours of non-stop PID programming, you&amp;rsquo;ll be able to come to the party no matter where you happen to be.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/pid-blog-dance-image.png"
alt="Pidapalooza dancing graphic" width="70%">
&lt;/figure>
&lt;/div>
&lt;h3 id="send-us-your-ideas-for-pidapalooza21">Send us your ideas for #PIDapalooza21&lt;/h3>
&lt;p>Now is your chance to share your work in the #PIDapalooza21 spotlight! We&amp;rsquo;re seeking proposals for short, interactive sessions about what you are doing––or want to do––with persistent identifiers and the communities that love and use them. &lt;/p>
&lt;p>&lt;a href="https://twitter.com/hashtag/PIDapalooza21" target="_blank">#PIDapalooza21&lt;/a> will feature sessions around the broad theme of PIDs and Open Research Infrastructure, focusing on the following areas:&lt;/p>
&lt;h3 id="theme-1-pids-101">Theme 1. PIDs 101&lt;/h3>
&lt;p>For PID beginners! You&amp;rsquo;ve got just 30 minutes to get attendees up to speed on a PID or PIDs. Make it fast! Make it fact-filled! Make it fun!&lt;/p>
&lt;h3 id="theme-2-pid-communities-international">Theme 2. PID Communities International&lt;/h3>
&lt;p>Have you always wanted to host a Spanish-language PID session, or bring together PID people in the humanities? Tell us how you&amp;rsquo;d connect with PID peers around the world!&lt;/p>
&lt;h3 id="theme-3-pid-success-stories">Theme 3. PID Success Stories&lt;/h3>
&lt;p>There&amp;rsquo;s nothing better than hearing about what&amp;rsquo;s working in the PID world––and why! Share your success stories so we can all benefit from them.&lt;/p>
&lt;h3 id="theme-4-pid-party">Theme 4. PID Party!&lt;/h3>
&lt;p>It wouldn&amp;rsquo;t be PIDapalooza without the party sessions, so be creative! Help us make this the best PID party ever!&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="propose-a-session-nowhttpsdocsgooglecomformsde1faipqlsflqyhg_fn6qu-20dzsnfgnmazokn5jsjahcudrylpyvqtp-gviewform">&lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSflQyhg_FN6qU-20dZSnfGnmAZoKn5JsJaHcuDRYlpyvQTp-g/viewform" target="_blank">Propose a session now!&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;br/>
&lt;p>The call for proposals will be open until October 30. Submit your PIDea now!&lt;/p>
&lt;p>&lt;em>*Note: The PIDapalooza submission form uses Google. If you are unable to access Google Forms, &lt;a href="mailto:info@pidapalooza.org">email your session idea&lt;/a>.&lt;/em>&lt;/p>
&lt;p>Get the full low-down on #PIDapalooza21 at the &lt;a href="https://pidapalooza.us18.list-manage.com/track/click?u=3be6c9ed55c4e452e710b2d41&amp;amp;id=07e26525f0&amp;amp;e=8567777e89" target="_blank">PIDapalooza website&lt;/a>.&lt;/p></description></item><item><title>EASE Council Post: Rachael Lammey on the Research Nexus</title><link>https://www.crossref.org/blog/ease-council-post-rachael-lammey-on-the-research-nexus/</link><pubDate>Mon, 12 Oct 2020 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/ease-council-post-rachael-lammey-on-the-research-nexus/</guid><description>&lt;p>This blog was initially posted on the &lt;a href="https://ease.org.uk/" target="_blank">European Association of Science Editors (EASE)&lt;/a> blog: &lt;a href="https://ese-bookshelf.blogspot.com/2020/10/ease-council-post-rachael-lammey-on.html" target="_blank">&amp;ldquo;EASE Council Post: Rachael Lammey on the Research Nexus&amp;rdquo;&lt;/a>. EASE President Duncan Nicholas accurately introduces it as a whole lot of information and insights about metadata and communication standards into one post&amp;hellip;&lt;/p>
&lt;p>I was given a wide brief to decide on the topic of my EASE blog, so I thought I&amp;rsquo;d write one that tries to encompass &lt;em>everything&lt;/em> - I&amp;rsquo;ll explain what I mean by that.&lt;/p>
&lt;p>In the past, Crossref has had the opportunity to talk to EASE members about the importance of registering content whose metadata contains important information related to the article. Richer metadata helps to connect the content to other key information such as who wrote it, who it was funded by, the relevant license, the research it cites, any updates to the work such as corrections and retractions, and &lt;a href="https://doi.org/10.20316/ESE.2019.45.19010" target="_blank">the data that underpin the research&lt;/a>. The use of open persistent identifiers like DOIs, funder IDs, ORCID iDs and ROR IDs are always recommended.&lt;/p>
&lt;p>Such rich and connected metadata also helps discoverability of the published research in a different way than just direct access; if you can find something based on looking at the publications related to a particular funder, author, or institution, then there are more ways to come across what you&amp;rsquo;re looking for. Making links between objects underpinning the research also helps put the research in context and can help further research by making connections to other valuable information that may have been more difficult to make otherwise.&lt;/p>
&lt;p>I&amp;rsquo;ve mentioned the Research Nexus in the title of this post. It&amp;rsquo;s achieved by declaring relationships between publications and other associated research objects, and from those objects to related publications. The metadata that reveals relationships between research objects can be as informative as the objects themselves. These relationships can assert certain facts that may not be otherwise obvious: this is our goal with the Research Nexus. These relationships and assertions need to exist not just on the web pages of the outputs, but also reflected in a standard way in the metadata so that the information is computer-readable and can be used at scale. As Jennifer Lin, who coined the term, explains:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;Researchers are adopting new tools that create consistency and shareability in their experimental methods. Increasingly, these are viewed as key components in driving reproducibility and replicability. They provide transparency in reporting key methodological and analytical information. They are also used for sharing the artefacts which make up a processing trail for the results: data, material, analytical code, and related software on which the conclusions of the paper rely. Where expert feedback was also shared, such reviews further enrich this record.&amp;rdquo;&lt;/p>
&lt;/blockquote>
&lt;p>In &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">her Crossref blog&lt;/a>, Jennifer goes on to give some examples, including:&lt;/p>
&lt;ul>
&lt;li>Linking to an &lt;a href="https://doi.org/10.17504/protocols.io.r89d9z6" target="_blank">entire collection of methods&lt;/a> and &lt;a href="https://doi.org/10.17504/protocols.io.itrcem6" target="_blank">video protocols&lt;/a> via Protocols.io&lt;/li>
&lt;li>Linking to &lt;a href="https://doi.org/10.21105/joss.00384" target="_blank">software and peer reviews&lt;/a> in JOSS&lt;/li>
&lt;li>Linking to &lt;a href="https://doi.org/10.1093/gigascience/gix045" target="_blank">preprint, data, code, source code, peer reviews in Gigascience&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>I&amp;rsquo;d include an additional example of linking research to the grant using the grant identifier and associated metadata from the funding section of &lt;a href="https://doi.org/10.1371/journal.pone.0222922" target="_blank">this PLOS paper&lt;/a> (read more about the example from EuroPMC who &lt;a href="https://blog.europepmc.org/2020/06/global-grant-ids-in-europe-pmc.html" target="_blank">register grants with Crossref for Wellcome)&lt;/a>.&lt;/p>
&lt;p>These links can be established by adding them into the Crossref &lt;a href="https://www.crossref.org/education/content-registration/structural-metadata/relationships/">relationship metadata&lt;/a> schema. The information is then made available to anyone via our open APIs, so that they can easily see and use the information.&lt;/p>
&lt;p>In all of these, publishers and other parties are linking to associated research outputs to support the reproducibility and discoverability of content.&lt;/p>
&lt;p>The reproducibility point is worth reiterating; EASE has always supported projects to maintain high standards around the review of research, publication standards and ethics, and the reduction of research waste. And connecting articles to data, preprints, protocols, and peer reviews, and making the relationships open for analysis will help achieve this.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/DOI-network-diagram_v3_600x560px-1024x956.png"
alt="Visualizing the Reseasrch Nexus image" width="50%">
&lt;/figure>
&lt;/div>
&lt;p>We also know that there are work and cost involved in establishing these links, and we&amp;rsquo;re working on ways to lower the barriers in doing so by:&lt;/p>
&lt;ul>
&lt;li>Revisiting what we charge to encourage best practice. Starting in 2020, we have &lt;a href="https://doi.org/10.64000/h2vh2-35t60" target="_blank">removed fees&lt;/a> for registering vital information on corrections, retractions and other Crossmark metadata. This is timely in light of the updates to the &lt;a href="https://ease.org.uk/publications/ease-statements-resources/ease-standard-retraction-form/" target="_blank">EASE Standardised Retraction form.&lt;/a>&lt;/li>
&lt;li>We&amp;rsquo;re also working to remove fees for translations and versions that are linked together by the appropriate relationship metadata so that publishers posting translations or different versions of an article don&amp;rsquo;t have to pay multiple times for these. Our &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership &amp;amp; Fees Committee&lt;/a> is currently reviewing other ways we can support publishers keen to make these connections.&lt;/li>
&lt;li>Finding ways to make it easier for publishers to collect this information from authors e.g. submission systems integrations with data repositories to collect robust information on article/data links.&lt;/li>
&lt;li>Allowing the registration of peer review metadata for content other than journal articles e.g. books, preprints (coming soon).&lt;/li>
&lt;li>Making it easier for publishers to register this information with us at Crossref via the provision of simple to use tools, interfaces and reporting.&lt;/li>
&lt;/ul>
&lt;p>The outputs of the research process, such as journal articles, don&amp;rsquo;t exist in isolation - you only have to look at the interest in the corpus of COVID-19 publications, preprints and associated data to see this. This thinking is also supported by campaigns like &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a> advocating for &amp;ldquo;richer, connected, and reusable, open metadata will advance scholarly pursuits for the benefit of society.&amp;rdquo; The relationships revealed by the Research Nexus may one day help progress research to realise benefits that help us all, providing we all make efforts to effectively support them. More to come&amp;hellip;&lt;/p></description></item><item><title>2020 Board Election</title><link>https://www.crossref.org/blog/2020-board-election/</link><pubDate>Mon, 28 Sep 2020 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/2020-board-election/</guid><description>&lt;p>This year, Crossref’s Nominating Committee assumed the task of developing a slate of candidates to fill six open board seats. We are grateful that in the midst of a challenging year, we received over 70 expressions of interest from all around the world, a 40% increase from last year’s response. It was an extraordinary pool of applicants and a testament to the strength of our membership community.&lt;/p>
&lt;p>There are six seats open for election (two large, four small), and the &lt;a href="https://www.crossref.org/committees/nominating/">Nominating Committee&lt;/a> is pleased to present the following slate.&lt;/p>
&lt;h2 id="the-2020-slate">The 2020 slate&lt;/h2>
&lt;p>Candidate organisations, in alphabetical order, for the Small category (four seats available):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Beilstein-Institut&lt;/strong>, Wendy Patterson&lt;/li>
&lt;li>&lt;strong>Korean Council of Science Editors&lt;/strong>, Kihong Kim&lt;/li>
&lt;li>&lt;strong>OpenEdition&lt;/strong>, Marin Dacos&lt;/li>
&lt;li>&lt;strong>Scientific Electronic Library Online (SciELO)&lt;/strong>, Abel Packer,&lt;/li>
&lt;li>&lt;strong>The University of Hong Kong&lt;/strong>, Jesse Xiao&lt;/li>
&lt;/ul>
&lt;p>Candidate organisations, in alphabetical order, for the Large category (two seats available):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>AIP Publishing&lt;/strong>, Jason Wilde,&lt;/li>
&lt;li>&lt;strong>Oxford University Press&lt;/strong>, James Phillpotts,&lt;/li>
&lt;li>&lt;strong>Taylor &amp;amp; Francis&lt;/strong>, Liz Allen&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="here-are-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2020-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2020-slate/">Here are the candidates&amp;rsquo; organisational and personal statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process, by voting in the election&lt;/h2>
&lt;p>If your organisation is a voting member in good standing of Crossref as of September 14, 2020, you are eligible to vote when voting opens on September 30, 2020.&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>On September 30, 2020, your organisation&amp;rsquo;s designated voting contact will receive an email with the Formal Notice of Meeting and Proxy Form with concise instructions on how to vote. You will also receive a user name and password with a link to our voting platform.&lt;/p>
&lt;p>The election results will be announced at LIVE20 &lt;strong>virtual&lt;/strong> meeting on November 10, 2020.&lt;/p></description></item><item><title>Open Abstracts: Where are we?</title><link>https://www.crossref.org/blog/open-abstracts-where-are-we/</link><pubDate>Fri, 25 Sep 2020 00:00:00 +0000</pubDate><author>Ludo Waltman</author><guid>https://www.crossref.org/blog/open-abstracts-where-are-we/</guid><description>&lt;p>The &lt;a href="https://i4oa.org" target="_blank">Initiative for Open Abstracts (I4OA)&lt;/a> launched this week. The initiative calls on scholarly publishers to make the abstracts of their publications openly available. More specifically, publishers that work with Crossref to register DOIs for their publications are requested to include abstracts in the metadata they deposit in Crossref. These abstracts will then be made openly available by Crossref. 39 publishers have already agreed to join I4OA and to open their abstracts.&lt;/p>
&lt;p>Where are we at the moment in terms of openness of abstracts? For an individual publisher working with Crossref, the percentage of the publisher’s content for which an abstract is available in Crossref can be found in Crossref’s &lt;a href="https://www.crossref.org/members/prep" target="_blank">Participation Reports&lt;/a>. The chart presented below gives the overall picture (as of September 1, 2020) for medium-sized and large publishers working with Crossref. The vertical axis shows the number of journal articles of a publisher in the period 2018-2020. Because of the large differences between publishers in the number of articles they publish, this axis has a logarithmic scale. The horizontal axis shows the percentage of the articles of a publisher for which an abstract is available in Crossref. The orange dots represent publishers that have agreed to join I4OA. The publishers colored in blue have not yet agreed to join the initiative.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/i4oa-chart.png"
alt="Publishers with abstracts in Crossref" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>A similar chart was published a few months ago in &lt;a href="https://medium.com/@aarontay/why-openly-available-abstracts-are-important-overview-of-the-current-state-of-affairs-bb7bde1ed751" target="_blank">this blog post on the importance of open abstracts&lt;/a>. Comparing the above chart with the one published a few months ago, the first effects of I4OA are already visible. While for most publishers the percentage of abstracts available in Crossref has hardly changed, it has increased from 11% to 95% for the Royal Society, one of the founding publishers of I4OA. This reflects the efforts the Royal Society has made over the past months to improve the availability of abstracts in Crossref for its content, not only for new content but also for existing content. For SAGE, another founding publisher of I4OA, the percentage of abstracts available in Crossref has increased from 38% to 50%. A further increase can be expected to take place in the coming months. The third founding publisher of I4OA, Hindawi, has remained at a stable level, with abstracts being available for 97% of its content.&lt;/p>
&lt;p>The above chart shows that many publishers supporting I4OA are already making abstracts available in Crossref. Other publishers do not yet make abstracts available in Crossref but have nevertheless decided to join I4OA. This is the case for Frontiers, PLOS, and Karger, and also for several smaller publishers not visible in the above chart, such as EMBO and Ubiquity Press. These publishers are currently adjusting their workflows and will start submitting abstracts to Crossref soon.&lt;/p>
&lt;p>Of the publishers that have not yet joined I4OA, some may not yet be aware of I4OA, while others may need more time to decide whether they will join the initiative. As can be seen in the above chart, most publishers that have not yet joined I4OA do not make abstracts available in Crossref at the moment. However, some publishers have not yet joined I4OA even though they do make abstracts available in Crossref. We hope these publishers will join I4OA soon. By joining the initiative, these publishers would formalize their commitment to openness of abstracts.&lt;/p>
&lt;p>None of the publishers in the above chart makes abstracts available in Crossref for 100% of its journal content. Some publishers, such as Copernicus and Hindawi, are close to 100%, but even these publishers have some content for which no abstract is available. Importantly, this does not necessarily mean that publishers have failed to submit abstracts to Crossref for some of their content. Instead, it may simply mean that some of their journal content does not have an abstract. Research articles usually have an abstract, but many other types of content published in journals, such as book reviews, letters, editorials, and corrections, often do not have an abstract. For most publishers, it is therefore impossible to make abstracts available for 100% of their content. Moreover, since Crossref does not distinguish between different types of content published in journals, we cannot provide separate statistics on the availability of abstracts for different types of journal content.&lt;/p>
&lt;p>As an example, let’s consider Brill, a publisher that has joined I4OA and that mainly focuses on the humanities and social sciences. Abstracts are available in Crossref for 57% of Brill’s content in the period 2018-2020. This may suggest that Brill has failed to submit abstracts to Crossref for a significant share of its content. However, when we look up journal publications of Brill in 2018 and 2019 in the Web of Science database, abstracts turn out to be available for only 68% of these publications. Assuming that Web of Science has more or less complete coverage of abstracts, this seems to indicate that Brill has already submitted most of its abstracts to Crossref. In fact, Web of Science shows that about a quarter of the publications of Brill are book reviews and that hardly any of these book reviews has an abstract. This illustrates why some publishers, for instance those that publish many book reviews, cannot be expected to get close to 100% availability of abstracts.&lt;/p>
&lt;p>Despite the above caveats, it is clear that there is still a long way to go in improving the availability of abstracts in Crossref. As of September 1, 2020, abstracts were available for 21% of all journal articles in Crossref in the period 2018-2020. In Web of Science (Science Citation Index Expanded, Social Sciences Citation Index, and Arts &amp;amp; Humanities Citation Index), 86% of all journal publications in 2018 and 2019 that have a DOI also have an abstract.&lt;/p>
&lt;p>Publishers who wish to distribute their abstracts openly through Crossref can include them in the normal content registration process. They can send XML to Crossref (using Crossref’s metadata deposit schema), either directly via HTTPS POST or via the Crossref admin system. For back-content, a resubmission of the full XML is required. In addition, various tools can be used to deposit abstracts. Open Journal Systems (OJS) has a plugin that supports the depositing of abstracts. Metadata Manager also facilitates this, but only for journal articles. Crossref’s web deposit form does not yet support abstracts, but Crossref is working on this.&lt;/p>
&lt;p>To keep track of the progress publishers are making in depositing abstracts in Crossref, we plan to publish regular updates of the chart presented above on the I4OA website. We look forward to witnessing the impact of I4OA in the coming months!&lt;/p>
&lt;hr>
&lt;p>&lt;em>Thank you to guest authors Bianca Kramer and Ludo Waltman, as well as the other founding members of I4OA.&lt;/em>&lt;/p></description></item><item><title>Get involved with Peer Review Week 2020 and register your peer reviews with Crossref</title><link>https://www.crossref.org/blog/get-involved-with-peer-review-week-2020-and-register-your-peer-reviews-with-crossref/</link><pubDate>Mon, 21 Sep 2020 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/get-involved-with-peer-review-week-2020-and-register-your-peer-reviews-with-crossref/</guid><description>&lt;p>Just when you thought 2020 couldn’t go any faster, it’s Peer Review week again! Peer Review is such an important part of the research process and highlighting the role it plays is key to retaining and reinforcing trust in the publishing process.  &lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/prw-colour-no-background.png"
alt="Peer Review Week 2020 logo" width="50%">
&lt;/figure>
&lt;/div>
&lt;br/>
&lt;p>As the &lt;a href="https://peerreviewweek.wordpress.com/" target="_blank">Peer Review Week team&lt;/a> states:&lt;/p>
&lt;p>“Maintaining trust in the peer review decision-making process is paramount if we are to solve the world’s most pressing problems. This includes ensuring that the peer review process is transparent (easily discoverable, accessible, and understandable by anyone writing, reviewing, or reading peer-reviewed content) and that everyone involved in the process receives the training and education needed to play their part in making it reliable and trustworthy.”&lt;/p>
&lt;p>A key way that publishers can make peer reviews easily discoverable and accessible is by registering them with Crossref - creating a persistent identifier for each review, linking them to the relevant article, and providing rich metadata to show what part this item played in the evolution of the content. It also gives a way to acknowledge the incredible work done by academics in this area. &lt;/p>
&lt;p>For Peer Review week last year,  Rosa and Rachael from Crossref created this short video to explain more.&lt;/p>
&lt;center>&lt;iframe width="560" height="315" src="https://www.youtube.com/embed/A_wN3nqP07Q" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen>&lt;/iframe>&lt;/center>
&lt;br/>
&lt;br/>
&lt;p>Fast forward to 2020 and over 75k peer reviews have now been registered with us by a range of members including Wiley, Peer J, eLife, Stichting SciPost, Emerald, IOP Publishing, Publons, The Royal Society and Copernicus. We encourage all members to register peer reviews with us - and you can keep up to date with everyone who is using &lt;a href="https://api.crossref.org/types/peer-review/works?facet=publisher-name:*&amp;amp;rows=0" target="_blank">this API query&lt;/a>. (We recommend installing a JSON viewer for your browser to view these results if you haven’t done so already).&lt;/p>
&lt;h3 id="register-peer-reviews-and-contribute-to-the-research-nexus">Register peer reviews and contribute to the Research Nexus&lt;/h3>
&lt;p>At Crossref, we talk a lot about the research nexus, and it’s a theme that you’re going to hear a lot more about from us in the coming months and years. &lt;/p>
&lt;p>The published article no longer has the supremacy it once did, and other outputs - and inputs - have increasing importance. Linked data and protocols are key for reproducibility, peer reviews increase trust and show the evolution of knowledge, and other research objects help increase the discoverability of content. Registering these objects and stating the relationships between them support the research nexus.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Article_Nexus_Reproducibility.png" width="60%" alt="The Research Nexus" >
&lt;/div>
&lt;p>Peer reviews in particular are key to demonstrating that the scholarly record is not fixed - it’s a living entity that moves and changes over time. Registering peer reviews formally integrates these objects into the scholarly record and makes sure the links between the reviews and the article both exist and persist over time.   It allows analysis or research on peer reviews and highlights richer discussions than those provided by the article alone, showing how discussion and conversation help to evolve knowledge. In particular, post-publication reviews highlight how the article is no longer the endpoint - after publication, research is further validated (or not!) and new ideas emerge and build on each other.  You can see a &lt;a href="https://doi.org/10.64000/gp78m-kkk93" target="_blank">real-life example&lt;/a> of this from F1000 in a blog post written by Jennifer Lin a few years ago.&lt;/p>
&lt;p>As we’ve said before:&lt;/p>
&lt;blockquote>
&lt;p>Article metadata + peer review metadata = a fuller picture of the evolution of knowledge &lt;/p>
&lt;/blockquote>
&lt;p>Registering peer reviews also provides publishing transparency and reviewer accountability, and enables contributors to get credit for their work.  If peer review metadata includes ORCID IDs, our &lt;a href="https://www.crossref.org/community/orcid/">ORCID auto-update service&lt;/a> means that we can automatically update the author’s ORCID record (with their permission), while our &lt;a href="https://doi.org/10.64000/cmxdb-n4v31" target="_blank">forthcoming schema update&lt;/a> will take this even further, making CRediT roles available in our schema.&lt;/p>
&lt;h3 id="how-to-register-peer-reviews-with-crossref">How to register peer reviews with Crossref&lt;/h3>
&lt;p>You need to be a member of Crossref in order to register your peer reviews with us and you can currently register peer reviews by sending us your XML files. Unfortunately, you can’t currently register peer reviews using our helper tools like the OJS plugin, Metadata Manager, or the web deposit form. &lt;/p>
&lt;p>You can find out more about &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/peer-reviews/">registering peer reviews&lt;/a> on our website - we even have a &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-record-types/peer-reviews/" target="_blank">range of markup examples&lt;/a>. &lt;/p>
&lt;p>We know that there’s a range of outputs from the peer review process, and our schema allows you to identify many of them, including referee reports, decision letters, and author responses. You can include outputs from the initial submission only, or cover all subsequent rounds of revisions, giving a really clear picture of the evolution of the article. Members can even register content for discussions after the article was published, such as post-publication reviews.&lt;/p>
&lt;h3 id="get-involved-with-peer-review-week-2020">Get involved with Peer Review Week 2020&lt;/h3>
&lt;p>We’re looking forward to seeing the debate sparked by Peer Review Week and hearing from our members about this important area. You can get involved by checking out the &lt;a href="https://peerreviewweek.wordpress.com/" target="_blank">Peer Review Week 2020 website&lt;/a> or following &lt;a href="https://twitter.com/PeerRevWeek" target="_blank">@PeerRevWeek&lt;/a> and the hashtags #PeerRevWk20 #trustinpeerreview on Twitter.&lt;/p>
&lt;p>We’re excited to see what examples of the evolution of knowledge will be discoverable in registered and linked peer reviews this time next year!&lt;/p></description></item><item><title>Crossref at the Frankfurt Digital Book Fair</title><link>https://www.crossref.org/blog/crossref-at-the-frankfurt-digital-book-fair/</link><pubDate>Thu, 17 Sep 2020 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/crossref-at-the-frankfurt-digital-book-fair/</guid><description>&lt;p>Frankfurt Book Fair (#FBM20) will be online this year since people are really not traveling right now.  This special edition of #FBM20 will have an extensive digital program in which we will be participating. So you can hang out with us from anywhere in the world! &lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/FBF-stacked-combo-logo.png"
alt="Crossref Frankfurt Digital Book Fair event logo" width="75%">
&lt;/figure>
&lt;/div>
&lt;p>Similar to the in-person event of years past, members of our technical support, membership, and outreach teams will be on hand at our online &lt;strong>Crossref Cafe&lt;/strong>.  &lt;/p>
&lt;p>Here are our &lt;strong>Crossref Cafe&lt;/strong> hours: &lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>Support&lt;/th>
&lt;th>Membership&lt;/th>
&lt;th>Community outreach&lt;/th>
&lt;th>Product&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0050/t/page/fm/0" target="_blank">Wed 14 Oct 8:00 - 9:00 UTC&lt;/a>&lt;/td>
&lt;td>Paul&lt;/td>
&lt;td>Sally&lt;/td>
&lt;td>Vanessa&lt;/td>
&lt;td>Bryan&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0050/t/page/fm/0" target="_blank">Wed 14 Oct 14:00 - 15:00 UTC&lt;/a>&lt;/td>
&lt;td>Shayn&lt;/td>
&lt;td>Anna&lt;/td>
&lt;td>Susan&lt;/td>
&lt;td>Sara&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0050/t/page/fm/0" target="_blank">Thu 15 Oct 8:00 - 9:00 UTC&lt;/a>&lt;/td>
&lt;td>Paul&lt;/td>
&lt;td>Laura&lt;/td>
&lt;td>Vanessa&lt;/td>
&lt;td>Martyn&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0050/t/page/fm/0" target="_blank">Thu 15 Oct 14:00 - 15:00 UTC&lt;/a>&lt;/td>
&lt;td>Isaac, Shayn&lt;/td>
&lt;td>Anna, Kathleen&lt;/td>
&lt;td>Susan&lt;/td>
&lt;td>Kirsty&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0050/t/page/fm/0" target="_blank">Fri 16 Oct 8:00 - 9:00 UTC&lt;/a>&lt;/td>
&lt;td>Paul&lt;/td>
&lt;td>Amanda&lt;/td>
&lt;td>Vanessa, Rachael&lt;/td>
&lt;td>Rakesh&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0050/t/page/fm/0" target="_blank">Fri 16 Oct 14:00 - 15:00 UTC&lt;/a>&lt;/td>
&lt;td>Isaac, Shayn&lt;/td>
&lt;td>Anna, Kathleen&lt;/td>
&lt;td>Susan&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;strong>Who will be online:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://www.crossref.org/people/susan-collins">Susan&lt;/a>, &lt;a href="https://www.crossref.org/people/vanessa-fairhurst/">Vanessa&lt;/a>, and &lt;a href="https://www.crossref.org/people/rachael-lammey/">Rachael&lt;/a> can talk to you about our upcoming &lt;a href="https://www.crossref.org/events/">events&lt;/a>.  &lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/kirsty-meddings/">Kirsty&lt;/a> can talk to you about &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/kathleen-luschek/">Kathleen&lt;/a> can explain &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/laura-j-wilkinson">Laura &lt;/a> can show you how to use &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a> for Content Registration.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/isaac-farley">Isaac&lt;/a>, &lt;a href="https://www.crossref.org/people/shayn-smulyan/">Shayn&lt;/a>, and &lt;a href="https://www.crossref.org/people/paul-davis">Paul&lt;/a> can help troubleshoot any metadata, DOI, or reporting needs. &lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/sara-bowman/">Sara&lt;/a> can talk to you about &lt;a href="https://www.crossref.org/services/content-registration/">content registration&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/anna-tolwinska">Anna&lt;/a> will give you a &amp;lsquo;metadata health check&amp;rsquo; including a tour of your &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Report&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/rakesh-masih/">Rakesh&lt;/a> can talk to you about product design.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/sally-jennings/">Sally&lt;/a> and &lt;a href="https://www.crossref.org/people/amanda-bartell/">Amanda&lt;/a> can answer your questions about &lt;a href="https://www.crossref.org/membership/">membership&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/martyn-rittman/">Martyn&lt;/a> can talk to you about &lt;a href="https://www.crossref.org/services/cited-by/">Cited-by&lt;/a>.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/bryan-vickery/">Bryan&lt;/a> can talk to you about recent updates to our &lt;a href="https://www.crossref.org/services/">products and services&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>We are happy to &lt;a href="mailto:feedback@crossref.org?Subject=Frankfurt%20meeting%20&amp;amp;Body=Hello%2C%20I%20would%20like%20to%20schedule%20a%20meeting%20to%20talk%20about%20...%20">schedule one-on-one virtual meetings&lt;/a> as well. &lt;/p>
&lt;p>Please do drop-in to say &lt;em>&lt;strong>&amp;ldquo;Guten Tag&amp;rdquo;&lt;/strong>&lt;/em>. We&amp;rsquo;re looking forward to seeing you online!  &lt;/p></description></item><item><title>Publishers, are you ready to ROR?</title><link>https://www.crossref.org/blog/publishers-are-you-ready-to-ror/</link><pubDate>Tue, 25 Aug 2020 00:00:00 +0000</pubDate><author>Maria Gould</author><guid>https://www.crossref.org/blog/publishers-are-you-ready-to-ror/</guid><description>&lt;p>If you manage a publishing system or workflow, you know how crucial—and how challenging!—it is to have clean, consistent, and comprehensive affiliation metadata. Author affiliations, and the ability to link them to publications and other scholarly outputs, are vital for numerous stakeholders across the research landscape. Institutions need to monitor and measure their research output by the articles their researchers have published. Funders need to be able to discover and track the research and researchers they have supported. Academic librarians need to easily find all of the publications associated with their campus. Journals need to know where authors are affiliated so they can determine eligibility for institutionally sponsored publishing agreements.&lt;/p>
&lt;p>Until recently, an open, unambiguous, and persistent identifier for research organisation affiliations has been a missing layer of the scholarly ecosystem. DOIs could identify articles and datasets and other research outputs, and ORCID IDs could identify researchers, but no equivalent solution was available to identify institutions. With the launch of the &lt;a href="https://ror.org" target="_blank">Research Organisation Registry (ROR)&lt;/a> in 2019 (which Crossref has &lt;a href="https://ror.org/about" target="_blank">helped to develop&lt;/a>), the landscape is changing. ROR IDs are an opportunity to make affiliation details easier for publishers to use and easier for those who rely on this data.&lt;/p>
&lt;p>Affiliations are a key piece of Crossref metadata that has been missing, but will soon be &lt;a href="https://doi.org/10.64000/cmxdb-n4v31" target="_blank">supported in the Crossref metadata schema&lt;/a>. This means that content registered with Crossref can be associated with a ROR IDs to  enable better tracking and discovery of research and other publication outputs by institution.&lt;/p>
&lt;h2 id="what-is-ror">What is ROR?&lt;/h2>
&lt;p>ROR is the Research Organisation Registry––&lt;strong>open, noncommercial, community-led infrastructure&lt;/strong> for research organisation identifiers. The registry currently includes globally unique persistent identifiers and associated metadata for more than &lt;a href="https://ror.org/search" target="_blank">98,000 research organisations&lt;/a> (as of August 2020).&lt;/p>
&lt;p>ROR IDs are specifically designed to be &lt;strong>implemented in any system&lt;/strong> that captures institutional affiliations and to enable connections (via persistent identifiers and networked research infrastructure) between research organisations, research outputs, and researchers.&lt;/p>
&lt;p>ROR IDs are &lt;strong>interoperable with those in other identifier registries&lt;/strong>, including GRID (which provided the seed data that ROR launched with), Crossref Funder Registry, ISNI, and Wikidata. ROR data is available under a CC0 waiver and can be accessed via a public &lt;a href="https://api.ror.org/organisations" target="_blank">API&lt;/a> and &lt;a href="https://doi.org/10.6084/m9.figshare.c.4596503" target="_blank">data dump&lt;/a>.&lt;/p>
&lt;p>ROR is not the first organisation identifier to exist. But ROR is distinct because it is &lt;strong>completely &lt;a href="https://github.com/ror-community" target="_blank">open&lt;/a>, specifically focused on &lt;a href="https://ror.org/scope" target="_blank">identifying affiliations&lt;/a>&lt;/strong>, and &lt;strong>&lt;a href="https://ror.org/supporters" target="_blank">collaboratively developed by, with, and for key stakeholders&lt;/a>&lt;/strong> in scholarly communications. ROR is operated as a joint initiative by Crossref, &lt;a href="https://datacite.org" target="_blank">DataCite&lt;/a>, and &lt;a href="https://cdlib.org" target="_blank">California Digital Library&lt;/a>, and was launched with seed data from GRID in collaboration with Digital Science. These organisations have invested resources into building an open registry of research organisation identifiers that can be embedded in scholarly infrastructure to effectively link research to organisations.&lt;/p>
&lt;h2 id="why-care-about-ror-ids-in-crossref-metadata">Why care about ROR IDs in Crossref metadata?&lt;/h2>
&lt;p>Ed Pentz, Crossref’s Executive Director, explains the key role ROR can play in enriching Crossref metadata:&lt;/p>
&lt;p>&lt;em>“Over the years Crossref has expanded the metadata it collects (for example, ORCID IDs and license URLs) based on the changing needs of our members and the scholarly research community. A key type of metadata that is missing from Crossref is affiliations. We’ve had a lot of feedback from members that adding affiliations should be a priority. At &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2019">Crossref LIVE19 in Amsterdam&lt;/a>, ROR was ranked joint first place for Crossref by the 100 plus attendees at the meeting. For the last few years we’ve been diligently working on the initiative and are very happy that ROR is now coming to fruition.”&lt;/em>&lt;/p>
&lt;p>Crossref metadata does include some affiliations already. But this data is not comprehensive or consistent, and appears as free-text strings only (even if originally sourced from a list of institutions). A search for UC Berkeley, for instance, returns multiple variants of the university’s name:&lt;/p>
&lt;ul>
&lt;li>University of California, Berkeley&lt;/li>
&lt;li>University of California-Berkeley&lt;/li>
&lt;li>University of California Berkeley&lt;/li>
&lt;li>UC Berkeley&lt;/li>
&lt;li>And likely more&amp;hellip;&lt;/li>
&lt;/ul>
&lt;p>While it isn&amp;rsquo;t too difficult for a human to guess that &amp;ldquo;UC Berkeley,&amp;rdquo; &amp;ldquo;University of California, Berkeley,&amp;rdquo; and &amp;ldquo;University of California at Berkeley&amp;rdquo; are all referring to the same university, a machine interpreting this information wouldn&amp;rsquo;t necessarily make the same connections. If you are trying to easily find all of the publications associated with UC Berkeley, you would need to run and reconcile multiple searches at best, or miss data completely at worst. This is where an affiliation identifier comes in: a single, unambiguous, standardized identifier that will always stay the same (for UC Berkeley, that would be &lt;a href="https://ror.org/01an7q238" target="_blank">https://ror.org/01an7q238&lt;/a>).&lt;/p>
&lt;p>ROR IDs for affiliations can transform the usability of Crossref metadata. While it&amp;rsquo;s crucial to have IDs for affiliations, it&amp;rsquo;s equally important that the affiliation data can be easily used. The ROR dataset is CC0, so ROR IDs and associated affiliation data can be freely and openly used and reused without any restrictions.&lt;/p>
&lt;h2 id="what-does-this-mean-for-publishers">What does this mean for publishers?&lt;/h2>
&lt;p>As the Crossref schema update is being cleared for takeoff, this is a good time for publishers and publishing service providers to be thinking about adopting ROR.&lt;/p>
&lt;p>ROR IDs can be useful in publishing workflows in a variety of ways. They can easily be implemented into manuscript tracking systems to identify the affiliations of submitting authors and co-authors. This can be done via a simple institution lookup that connects to the ROR API. Authors choose their affiliation from a dropdown list populated from ROR; they do not have to provide a ROR ID or even know that a ROR ID is being collected.&lt;/p>
&lt;p>&lt;a href="https://gyazo.com/65ef42890287ae978f61add5d36b1d31">&lt;img src="https://i.gyazo.com/65ef42890287ae978f61add5d36b1d31.gif" alt="Image from Gyazo" width="780"/>&lt;/a>&lt;/p>
&lt;p>Upon publication, ROR affiliation data can be included when content is registered with Crossref. ROR IDs are also supported in the JATS XML format that many publishers use. Crossref metadata can be searched and crawled, and the Crossref API will make ROR IDs available so affiliation data can be captured by tools and services and fed into downstream reporting and tracking systems.&lt;/p>
&lt;h2 id="get-ready-to-ror">Get ready to ROR!&lt;/h2>
&lt;p>ROR is already working with a number of publishers and service providers that are planning to integrate ROR in their systems, map their affiliation data to ROR IDs, and/or include ROR IDs in publication metadata.&lt;/p>
&lt;p>For example: &lt;a href="https://rupress.org/" target="_blank">Rockefeller University Press&lt;/a> has already added the collection of ROR IDs to their publication workflow. Upon submission, the author selects an institutional affiliation from a dropdown list of options that comes from ROR. Rockefeller University Press also relies on this affiliation data for billing and licensing purposes to coordinate Gold Open Access publishing agreements.&lt;/p>
&lt;p>In addition to publishers, libraries and repositories and other stakeholders are building in support for ROR. You can also see the list of active and in-progress ROR integrations &lt;a href="https://ror.org/integrations" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>We know decisions about identifier adoption aren&amp;rsquo;t easy or immediate, so &lt;a href="mailto:info@ror.org">get in touch with ROR&lt;/a> if you have questions or want to be more involved in the project. ROR holds regular community meetings and &lt;a href="https://www.youtube.com/watch?v=W61JMsC3Dho" target="_blank">webinars&lt;/a> and supports several community working groups for those interested in implementing ROR IDs and working with ROR data. This is a community-driven effort so we want to hear from you!&lt;/p></description></item><item><title>Evolving our support for text-and-data mining</title><link>https://www.crossref.org/blog/evolving-our-support-for-text-and-data-mining/</link><pubDate>Fri, 21 Aug 2020 00:00:00 +0000</pubDate><author>Bryan Vickery</author><guid>https://www.crossref.org/blog/evolving-our-support-for-text-and-data-mining/</guid><description>&lt;p>Many researchers want to carry out analysis and extraction of information from large sets of data, such as journal articles and other scholarly content. Methods such as screen-scraping are error-prone, place too much strain on content sites and may be unrepeatable or break if site layouts change. Providing researchers with automated access to the full-text content via DOIs and Crossref metadata reduces these problems, allowing for easy deduplication and reproducibility. &lt;a href="https://www.crossref.org/education/retrieve-metadata/rest-api/text-and-data-mining/">Supporting text and data mining&lt;/a> echoes our mission to make research outputs easy to find, cite, link, assess, and reuse.&lt;/p>
&lt;p>In 2013 Crossref embarked on a project to better support Crossref members and researchers with Text and Data Mining requests and access. There were two main parts to the project:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>To collect and make available full-text links and publisher TDM license links in the metadata.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>To provide a service (TDM click-through service) for Crossref members to post their additional TDM terms and conditions and for researchers to access, review and accept these terms.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>&lt;a href="https://www.crossref.org/news/2014-05-29-crossref-text-and-data-mining-services-simplify-researcher-access">The TDM click-through was launched in May 2014&lt;/a>.&lt;/p>
&lt;p>To date, 37.5 million works registered with Crossref have both &lt;a href="http://api.crossref.org/works?filter=has-license:true,has-full-text:true,license.version:tdm&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">full-text links and TDM license information&lt;/a>. We continue to encourage all members to include full-text links and license information in the metadata they register to assist researchers with TDM. You can see how each member is doing via its Participation Report (e.g. &lt;a href="https://www.crossref.org/members/prep/311" target="_blank">Wiley&amp;rsquo;s&lt;/a>).&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/tdm_blog_prep.png"
alt="participation report screenshot for Wiley" width="60%">
&lt;/figure>
&lt;p>Members are also making subscription content available for text mining (temporarily or otherwise) for specific purposes, such as to help the research community with its response to COVID-19. Back in April &lt;a href="https://doi.org/10.64000/2dkpt-h4159" target="_blank">we highlighted how this can be achieved&lt;/a> by including:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>A &amp;ldquo;free to read&amp;rdquo; element in the access indicators section of publisher metadata indicating that the content is being made available free-of-charge (gratis)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>An assertion element indicating that the content being made available is available free-of-charge.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>To access Crossref&amp;rsquo;s &lt;strong>click-through&lt;/strong> tool for text and data mining, users could log in via their ORCID iD. They could then review TDM license agreements posted by Crossref members and accept, reject or postpone their decisions until later. Having agreed to a publisher&amp;rsquo;s terms and conditions this action was logged against the user&amp;rsquo;s API token which they could use when requesting full-text from the publisher.&lt;/p>
&lt;p>Since the pilot in 2014, only 2 publishers have continued with the tool and fewer than 300 API tokens have been issued.&lt;/p>
&lt;p>Publishers have since developed their own mechanisms for managing TDM requests. The introduction of UK (&lt;a href="https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/375954/Research.pdf" target="_blank">2014&lt;/a>) / EU (&lt;a href="https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=uriserv:OJ.L_.2019.130.01.0092.01.ENG" target="_blank">2019&lt;/a>) copyright exceptions for TDM has significantly reduced the number of requests and at the same time, more and more content is published under an open access license.&lt;/p>
&lt;p>Given the low take-up of the click-through by both publishers and researchers, its goals are no longer being met. &lt;strong>Therefore we will retire the TDM click-through in December 2020.&lt;/strong> Until that date, it will still operate for the two publishers and various researchers who use it while they finish implementing their alternative plans.&lt;/p>
&lt;p>Crossref will continue to collect member-supplied TDM licensing information in metadata for individual works, and researchers can continue to find this via the Crossref APIs.&lt;/p></description></item><item><title>Similarity Check news: introducing the next generation iThenticate.</title><link>https://www.crossref.org/blog/similarity-check-news-introducing-the-next-generation-ithenticate./</link><pubDate>Tue, 28 Jul 2020 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/similarity-check-news-introducing-the-next-generation-ithenticate./</guid><description>&lt;p>Crossref’s Similarity Check service is used by our members to detect text overlap with previously published work that may indicate plagiarism of scholarly or professional works. Manuscripts can be checked against millions of publications from other participating Crossref members and general web content using the iThenticate text comparison software from Turnitin.&lt;/p>
&lt;p>The 2000 members who already make use of Similarity Check upload almost 2,000,000 documents each month to look for matching text in other publications.&lt;/p>
&lt;p>We have some great news for those 2000 members –– a completely new version of iThenticate is on its way, and will start to roll out to users in the coming months.&lt;/p>
&lt;p>New functionality has been developed based on your feedback over the past few years and includes:&lt;/p>
&lt;ul>
&lt;li>An improved Document Viewer that makes PDFs searchable and accessible, with responsive design for ease of use on different screen sizes. All of the functionality of the Viewer and the Text-only reports in the previous version have been streamlined into just two views: Sources Overview and All Sources.&lt;/li>
&lt;li>Improved exclusion options to make refining matches even easier. Smarter citation detection now identifies probable citations both inline and in reference sections.&lt;/li>
&lt;li>A new “Content Portal” where you can see what percentage of your own content has been successfully indexed for the iThenticate comparison database, and download reports of indexing errors that need to be fixed.&lt;/li>
&lt;li>A new API for integration with manuscript submission systems allows display of the largest matching word count and the top 5 source matches alongside the Similarity Score.&lt;/li>
&lt;li>The maximum number of pages and file size per document has been doubled to 800 pages/200 MB.&lt;/li>
&lt;/ul>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/new-ithenticate-screen.png" width="80%">&lt;figcaption>
&lt;h4>The new document viewer in iThenticate v2.0&lt;/h4>
&lt;/figcaption>
&lt;/figure>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/exclude-bibliography-ithenticate.png" width="80%">&lt;figcaption>
&lt;h4>Improved reference exclusion&lt;/h4>
&lt;/figcaption>
&lt;/figure>
&lt;p>Crossref members can use Similarity Check directly by logging in, or via an integration with a submission/peer review system. We are working with many system providers to bring v2.0 to you as soon as possible. In the meantime, we are looking for members to help us test the new system directly in the iThenticate user interface. If you are interested and can spare a few hours some time in the next month &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLScaqCunNVfyTe7bk9RwNbtf48KPTetVnCtvd-l194wokQ5NCQ/viewform?usp=sf_link" target="_blank">please let me know&lt;/a>.&lt;/p>
&lt;p>And if your organisation is not yet using Similarity Check to assess the originality of the manuscripts you receive do take a look at the &lt;a href="https://www.crossref.org/services/similarity-check/">many benefits&lt;/a> the service has to offer.&lt;/p></description></item><item><title>Meet the new Crossref Executive Director</title><link>https://www.crossref.org/blog/meet-the-new-crossref-executive-director/</link><pubDate>Thu, 23 Jul 2020 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/meet-the-new-crossref-executive-director/</guid><description>&lt;p>It’s me! Back in January I wrote, &lt;a href="https://doi.org/10.64000/j6sav-qm45" target="_blank">The one constant in Crossref’s 20 years has been change&lt;/a>. This continues to be true, and the latest change is that I’m happy to say that I will be staying on as Executive Director of Crossref. At the recent Crossref board meeting, I rescinded my resignation and the board happily accepted this.&lt;/p>
&lt;p>What happened? Well, a lot has changed since &lt;a href="https://doi.org/10.64000/dkth1-xxz93" target="_blank">I announced that I was leaving&lt;/a> back in February. The pandemic has upended “business as usual” and everyone is rethinking pretty much everything. It’s clear that as a result of the crisis, there will be greater economic pressure on our community. These are difficult times and they are going to continue for the foreseeable future.&lt;/p>
&lt;p>The people at Crossref are amazing and I’ve been impressed and inspired by everyone’s resilience and creativity in responding to these unusual challenges. Crossref has a very special organisational culture and I want to remain a part of it and continue to develop it.&lt;/p>
&lt;p>I’ve also been inspired by the board. In particular, at its July meeting they passed a progressive motion based on a proposal from the leadership team:&lt;/p>
&lt;blockquote>
&lt;p>RESOLVED: Crossref should proactively lead an effort to explore, with other infrastructure organisations and initiatives, how we can improve the scholarly research ecosystem. Crossref is committed to the collaborative development of open scholarly infrastructure for the benefit of our members and the wider research community.&lt;/p>
&lt;/blockquote>
&lt;p>This is the result of a process that started back in 2019. In the &lt;a href="https://doi.org/10.64000/mmdqs-23829" target="_blank">A turning point is a time for reflection&lt;/a> blog post, we took a step back as we approached Crossref’s 20th anniversary. We conducted research into &lt;a href="http://bit.ly/crvalue" target="_blank">the perceived value of Crossref&lt;/a>, reflected on what we had achieved, and what the future holds for Crossref. At our &lt;a href="https://www.crossref.org/crossref-annual-meeting/">annual meeting, &amp;ldquo;the strategy one&amp;rdquo;&lt;/a> and in our &lt;a href="https://doi.org/10.13003/y8ygwm5" target="_blank">annual report fact file&lt;/a>, we reminded people of the organisation’s original founding purpose:&lt;/p>
&lt;blockquote>
&lt;p>To promote the development and cooperative use of new and innovative technologies to speed and facilitate scientific and other scholarly research.&lt;/p>
&lt;/blockquote>
&lt;p>Following on from 2019, as the pandemic hit, we held virtual strategic sessions with the board in March, May and June. These culminated in the motion above, which allows Crossref to fully embrace this simple, but ambitious, vision. This was a game changer for me, and I realized there was nothing else I wanted to do or that better suited my skills and experience than to continue to lead Crossref and work with the community through the next phase of transformation.&lt;/p>
&lt;p>This is not the time for “business as usual”. We live in an interconnected, interdependent world and open infrastructure organisations have to collaborate more deeply and look at doing things differently in order to improve the scholarly research ecosystem. So - more to come!&lt;/p></description></item><item><title>New faces at Crossref</title><link>https://www.crossref.org/blog/new-faces-at-crossref/</link><pubDate>Tue, 30 Jun 2020 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/new-faces-at-crossref/</guid><description>&lt;p>Please help us welcome new faces at Crossref! Martyn, Sara, Laura, and Mark joined us very recently and we are happy they&amp;rsquo;re with us. Both Martyn and Sara have joined the Product team and this has given us the chance to reorganize the team into the following groups: content registration, scholarly stewardship, scholarly impact, metadata retrieval, and UX/UI leadership. Laura joined the Finance and Operations team to help make the billing process simple for our members. Mark joins the Technology team and one of his projects will be improving the Event Data service.&lt;/p>
&lt;p>It is exciting to already see the impact of your contributions and look forward to what’s to come!&lt;/p>
&lt;h2 id="and-now-a-few-words-from-each-of-them">And now a few words from each of them.&lt;/h2>
&lt;h3 id="martyn-rittman">Martyn Rittman&lt;/h3>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/martyn-rittman.jpg"
alt="image of Martyn" width="300px">
&lt;/figure>
&lt;p>I am a former university researcher who worked on interdisciplinary projects around life sciences and analytical chemistry, with positions in the UK and Germany. I spent seven years at open access publisher MDPI doing everything from running journals to handling production, developing services for authors and publishers, and supporting preprints. I’m very excited to be joining Crossref as a Product Manager and developing some great products and services that focus on how Crossref-indexed research creates impact. This includes supporting the use of preprint metadata. I’m also looking forward to getting my teeth into event data, which looks at how those in the research community and beyond reference, use, and reuse research. If you are interested in making use of event data or have examples of event data applications, I would like to hear from you. &lt;br />&lt;br />&lt;/p>
&lt;h3 id="sara-bowman">Sara Bowman&lt;/h3>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/sara-slack.jpeg"
alt="image of Sara" width="300px">
&lt;/figure>
&lt;p>I’m thrilled to have joined Crossref at this exciting time in the organisation. As a member of the Product team, my primary area of focus is content registration, building, and improving tools for our members to deposit rich metadata. I’m particularly interested in how we can create a unified user experience for content registration while supporting the needs of our diverse membership. A scientist by training, I’ve spent the last 6 years working on open source technologies to support scholarly communication, most recently in the role of Product Manager at the Center for Open Science. I’m passionate about open tools and using data to drive product development, building innovative solutions to improve research and scholarly communication.&lt;/p>
&lt;h3 id="laura-cuniff">Laura Cuniff&lt;/h3>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/laura_c.jpg"
alt="image of Laura" width="300px">
&lt;/figure>
&lt;p>I joined Crossref two months ago as a part-time Billing Support Specialist on the Finance and Operations team. With the help of my supportive and knowledgeable colleagues, I took on learning the various systems. My goal is to make the billing process as simple as possible for our members by researching, retrieving, and relaying billing information.  This allows our members to focus on the reason for their engagement with Crossref. With several part-time jobs cobbled together at different times of the day, I have the flexibility to volunteer with a few organisations in my hometown of Ipswich, MA.  If you find yourself at the Ipswich Visitor Center, I may greet you, recommend the most beautiful spots in town, give you a tour of the Ipswich Museum, or send you off with a wonderful Ipswich Humane Group cat or dog! I’m very excited to be here!&lt;/p>
&lt;h3 id="mark-woodhall">Mark Woodhall&lt;/h3>
&lt;figure>&lt;img src="https://www.crossref.org/images/staff/mark2-720px.jpg"
alt="image of Mark" width="300px">
&lt;/figure>
&lt;p>I am an open-source enthusiast who has worked in a range of technology roles at a variety of companies as a polyglot programmer with experience in Clojure(Script), Java, C#, and JavaScript. It’s really exciting to be working at Crossref as a Senior Software Developer on the Technology team and I’m proud to be part of a team with open source at its heart. I’m really looking forward to getting more involved with event data and building a scalable solution to support its future uses.&lt;/p>
&lt;p>&lt;em>Welcome to the Crossref community Martyn, Laura, Sara, and Mark.&lt;/em>&lt;/p></description></item><item><title>Community Outreach in 2020</title><link>https://www.crossref.org/blog/community-outreach-in-2020/</link><pubDate>Mon, 29 Jun 2020 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/community-outreach-in-2020/</guid><description>&lt;p>2020 hasn’t been quite what any of us had imagined. The pandemic has meant big adjustments in terms of working; challenges for parents balancing childcare and professional lives; anxieties and tensions we never had before; the strain of potentially being away from co-workers, friends, and family for a prolonged period of time. Many have suffered job losses and around the world, many have sadly lost their lives to the virus.&lt;/p>
&lt;p>I’ve been very fortunate that my family and friends remain in good health and very grateful to work for a supportive and caring organisation such as Crossref. I don’t usually work from home every day, so adjusting to the ‘new normal’ these last few months has been difficult at times. I certainly miss seeing my colleagues in the Oxford office day-to-day, and now have a new appreciation for the challenges our remote working members of staff face, particularly when it comes to feeling quite isolated at times. I’ve also learnt about the importance of good communication and building in greater flexibility to projects, especially when you are not able to see people face-to-face.&lt;/p>
&lt;p>My role as Outreach Manager is all about people; it often involves organising and attending industry events as well as running our own educational days, which we call our &lt;a href="https://www.crossref.org/events/">Crossref LIVE events&lt;/a>. The global health crisis brought the majority of international travel to an abrupt halt, something the environment may thank us for, but that also requires a dramatic reimagining of how we can effectively and empathetically engage with our members and the wider community.&lt;/p>
&lt;p>As our planned in-person events have been postponed, for now, we converted our LIVE events into an online format, which we have so far run in &lt;a href="https://youtu.be/McDIrEpWph4" target="_blank">Arabic&lt;/a>, &lt;a href="https://community.crossref.org/t/crossref-virtual-live-spanish/1324?u=vanessa" target="_blank">Spanish&lt;/a>, &lt;a href="https://community.crossref.org/t/crossref-live-korea/1351/2?u=vanessa" target="_blank">Korean&lt;/a> and &lt;a href="https://community.crossref.org/t/crossref-virtual-live-brazil/1323/2?u=vanessa" target="_blank">Brazilian Portuguese&lt;/a> with help from &lt;a href="https://www.crossref.org/community/our-ambassadors/">our Ambassadors&lt;/a> and technical support team. We have had better attendance and engagement than we ever dreamed, with lots of thoughtful questions and positive feedback. While an online format has its limitations it also brings new opportunities, particularly by enabling us to reach many members who would not be able to attend a physical event. We have more in the works for the rest of the year, so keep a lookout on our &lt;a href="https://www.crossref.org/webinars/" target="_blank">webinar&lt;/a> and &lt;a href="https://www.crossref.org/events/" target="_blank">events&lt;/a> pages.&lt;/p>
&lt;p>We have all had to adapt to new ways of living and working this year, but vital research continues to be done and new content continues to be published. We embrace new ways of engaging with our international membership so we can continue to support them in their roles and in working with our systems, despite the uncertain circumstances we find ourselves in.&lt;/p>
&lt;h3 id="lessons-learned">Lessons learned:&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Online events need to be much shorter than physical ones. Zoom fatigue is real, no one can stay focused for long periods of time at the screen.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Flexibility is key, running events in multiple languages and time-zones make them more accessible for a geographically diverse audience, but also ensuring recordings and other materials are readily available means people can engage with the content in their own time. And they do. Our Spanish LIVE on May 19 saw 335 people attend, and a further 304 (so far) watch &lt;a href="https://youtu.be/kQNwWzcWeH8" target="_blank">the recording&lt;/a> in their own time.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Don’t forget to build in time for breaks.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Although it’s impossible to replicate the natural human interaction that occurs at a physical event, an online format can still bring hearts as well as minds together. Break-out rooms, polls, and clever use of chat functionality all help to build engagement and turn a passive audience into active participants.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>People love an online quiz.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Partner with others –– an interesting guest speaker can bring a whole new dynamic to your planned content.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Take the opportunity to be a little more experimental. We can’t do business as usual right now, so embrace new ideas and see what works!&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Hoping you all stay safe and healthy, and that we can meet again in person in 2021.&lt;/p></description></item><item><title>Calling all prospective board members</title><link>https://www.crossref.org/blog/calling-all-prospective-board-members/</link><pubDate>Thu, 21 May 2020 00:00:00 +0000</pubDate><author>Lucy Ofiesh</author><guid>https://www.crossref.org/blog/calling-all-prospective-board-members/</guid><description>&lt;p>&lt;em>&lt;strong>English version&lt;/strong> –– &lt;a href="#spanishversion">Información en español&lt;/a> –– &lt;a href="#frenchversion">Version Française&lt;/a>&lt;/em>&lt;/p>
&lt;p>The Crossref Nominating Committee is inviting expressions of interest to join the Board of Directors of Crossref for the term starting in 2021. The committee will gather responses from those interested and create the slate of candidates that our membership will vote on in an election in September. Expressions of interest will be due Friday, June 19, 2020.&lt;/p>
&lt;p>The role of the board at Crossref is to provide strategic and financial oversight of the organisation, as well as guidance to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Setting the strategic direction for the organisation;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Providing financial oversight; and&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Approving new policies and services.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>The board is representative of our membership base and guides the staff leadership team on trends affecting scholarly communications. The board sets strategic directions for the organisation while also providing oversight into policy changes and implementation. Board members have a fiduciary responsibility to ensure sound operations. Board members do this by attending board meetings, as well as joining more specific board committees.&lt;/p>
&lt;p>As an example, in 2019 the board decided to remove fees for the Crossmark service. This involved a strategic review of the service and its alignment with the mission by the Membership &amp;amp; Fees committee; followed by a review of the financial implications of removing the fee; and ultimately, a vote by the full board to remove the fee starting in 2020.&lt;/p>
&lt;p>Crossref&amp;rsquo;s services provide central infrastructure to scholarly communications. Crossref&amp;rsquo;s board helps shape the future of our services, and by extension, impacts the broader scholarly ecosystem. We are looking for board members to contribute their experience and perspective. &lt;/p>
&lt;h2 id="im-interested-but-busy-what-is-expected-of-board-members">I&amp;rsquo;m interested but busy! What is expected of board members?&lt;/h2>
&lt;p>Board members attend three meetings each year that typically take place in March, July, and November. Meetings have taken place in a variety of international locations and travel support is provided when needed.&lt;/p>
&lt;p>Starting in 2020, following travel restrictions as a result of COVID-19, the board introduced a plan to convene at least one of the board meetings virtually each year and all committee meetings take place virtually. Most board members sit on at least one Crossref committee. Care is taken to accommodate the wide range of timezones in which our board members live.&lt;/p>
&lt;p>While the expressions of interest are specific to an individual, the seat that is elected to the board belongs to the member organisation. The primary board member also names an alternate who may attend meetings in the event that the primary board member is unable.&lt;/p>
&lt;p>Board members are expected to be comfortable assuming the responsibilities listed above and to prepare and participate in board meeting discussions.&lt;/p>
&lt;h2 id="about-the-election">About the election&lt;/h2>
&lt;p>The board is elected through the &amp;ldquo;one member, one vote&amp;rdquo; policy wherein every member of Crossref has a single vote to elect representatives to the Crossref board. Board terms are for three years, and this year there are six seats open for election.&lt;/p>
&lt;p>The board maintains a balance of seats, with eight seats for smaller publishers and eight seats for larger publishers, in an effort to ensure that the diversity of experiences and perspectives of the publishing community is represented in decisions made at Crossref. This year we will elect two of the larger publisher seats and four of the smaller publisher seats.&lt;/p>
&lt;p>The election takes place online and voting will open in September. Election results will be shared at the November board meeting and new members will commence their term in 2021.&lt;/p>
&lt;h2 id="about-the-nominating-committee">About the nominating committee&lt;/h2>
&lt;p>The nominating committee will review the expressions of interest and select a slate of candidates for election. The slate put forward will exceed the total number of open seats. The committee considers the statements of interest, organisational size, geography, gender, and experience.&lt;/p>
&lt;p>&lt;strong>2020 Nominating Committee:&lt;/strong>&lt;br>
Melissa Harrison, eLife, Cambridge, UK, committee chair&lt;br>
Scott Delman, ACM, New York, NY&lt;br>
Susan Murray, AJOL, Grahamstown, South Africa&lt;br>
Tanja Niemann, Erudit, Montreal, Canada&lt;br>
Arley Soto, Biteca, Bogotá, Colombia&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h4 id="how-to-submit-an-expression-of-interest">How to submit an expression of interest&lt;/h4>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSeJhKR34FmXVHELDXZjNYy0W4TnEpuYJMHfKAPPYjRIuDuoQg/viewform?usp=sf_link" target="_blank">click here to submit your expression of interest&lt;/a> or contact me with any questions at &lt;a href="mailto:lofiesh@crossref.org">lofiesh [at] crossref.org&lt;/a>.&lt;/p>&lt;/span>
&lt;/div>
&lt;br />
&lt;p>&lt;a id="spanishversion">&lt;/a>
&lt;em>&lt;strong>Versión en español&lt;/strong>&lt;/em>&lt;/p>
&lt;p>El Comité de Nominación de Crossref está invitando a expresiones de interés a unirse a la Junta Directiva de Crossref para el período que comienza en 2021. El comité recopilará las respuestas de los interesados ​​y creará la lista de candidatos que nuestra membresía votará en una elección en septiembre. Las expresiones de interés vencen el viernes 19 de junio de 2020.&lt;/p>
&lt;p>La función de la junta directiva de Crossref es proporcionar supervisión estratégica y financiera de la organización, así como orientación para el Director Ejecutivo y el equipo de liderazgo del personal, con responsabilidades importantes como:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Establecer la dirección estratégica para la organización;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Proporcionar supervisión financiera; y&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Aprobar nuevas políticas y servicios.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>La junta es representativa de nuestra base de miembros y guía al equipo de liderazgo del personal sobre las tendencias que afectan las comunicaciones académicas. La junta establece direcciones estratégicas para la organización mientras supervisa los cambios e implementación de políticas. Los miembros de la junta tienen la responsabilidad fiduciaria de garantizar operaciones sólidas. Los miembros de la junta hacen esto asistiendo a las reuniones de la junta, además de unirse a comités de la junta más específicos.&lt;/p>
&lt;p>Como ejemplo, en 2019 la junta decidió eliminar las tarifas de servicio de Crossmark. Esto implicó una revisión estratégica del servicio y su alineación con la misión del comité de Membresía y Tarifas; seguido de una revisión de las implicaciones financieras de eliminar la tarifa; y, en última instancia, un voto de la junta completa para retirar la tarifa a partir de 2020.&lt;/p>
&lt;p>Los servicios Crossref proporcionan infraestructura central para las comunicaciones académicas. La junta directiva de Crossref ayuda a dar forma al futuro de nuestros servicios y, por extensión, impacta el ecosistema académico más amplio. Estamos buscando miembros de la junta para contribuir con su experiencia y perspectiva.&lt;/p>
&lt;h2 id="estoy-interesado-pero-ocupado-qué-se-espera-de-los-miembros-de-la-junta">¡Estoy interesado pero ocupado! ¿Qué se espera de los miembros de la junta?&lt;/h2>
&lt;p>Los miembros de la junta asisten a tres reuniones cada año que generalmente tienen lugar en marzo, julio y noviembre. Las reuniones se han llevado en una variedad de ubicaciones internacionales y se brinda apoyo para viajes cuando es necesario.&lt;/p>
&lt;p>A partir de 2020, después de las restricciones de viaje como resultado de COVID-19, la junta introdujo un plan para convocar al menos una de las reuniones de la junta virtualmente todos los años, y todas las reuniones del comité tienen lugar virtualmente. La mayoría de los miembros de la junta formen parte del menos un comité Crossref. Se tiene cuidado de acomodar la amplia gama de zonas horarias en las que viven los miembros de nuestra junta.&lt;/p>
&lt;p>Aunque las expresiones de interés son específicas de un individuo, el asiento elegido para la junta pertenece a la organización miembro. El miembro primario de la junta también nombra a un suplente que puede asistir a las reuniones en caso de que el miembro de la junta principal no pueda.&lt;/p>
&lt;p>Se espera que los miembros de la junta se sientan cómodos asumiendo las responsabilidades anteriores y que se preparen y participen en las discusiones de la reunión de la junta.&lt;/p>
&lt;p>Las reuniones de la junta se llevarán a cabo en inglés, por lo que los posibles miembros de la junta deben sentirse cómodos leyendo material en inglés y en inglés conversacional.&lt;/p>
&lt;h2 id="sobre-las-elecciones">Sobre las elecciones&lt;/h2>
&lt;p>La junta se elige mediante la política de &amp;ldquo;un miembro, un voto&amp;rdquo; en la que cada miembro de Crossref tiene un voto para elegir representantes en la junta de Crossref. Los términos de la junta son de tres años, y este año hay seis asientos abiertos para la elección&lt;/p>
&lt;p>La junta mantiene un equilibrio de asientos, con ocho asientos para editoriales más pequeñas y ocho asientos para editoriales más grandes, en un esfuerzo por garantizar que la diversidad de experiencias y perspectivas de la comunidad editorial esté representada en las decisiones tomadas en Crossref. Este año elegiremos dos de los asientos de editor más grandes y cuatro de los asientos de editor más pequeños.&lt;/p>
&lt;p>La elección se realiza en línea y la votación se abrirá en septiembre. Los resultados de las elecciones se compartirán en la reunión de la junta de noviembre y los nuevos miembros comenzarán su mandato en 2021.&lt;/p>
&lt;h2 id="sobre-el-comité-de-nominaciones">Sobre el comité de nominaciones&lt;/h2>
&lt;p>El comité de nominaciones revisará las expresiones de interés y seleccionará una lista de candidatos para la elección. Esta lista presentada excederá el número total de asientos disponibles. El comité considera declaraciones de interés, tamaño organizacional, geografía, género y experiencia.&lt;/p>
&lt;p>&lt;strong>Comité de nominaciones 2020:&lt;/strong>&lt;br>
Melissa Harrison, eLife, Cambridge, UK, committee chair&lt;br>
Scott Delman, ACM, New York, NY&lt;br>
Susan Murray, AJOL, Grahamstown, South Africa&lt;br>
Tanja Niemann, Erudit, Montreal, Canada&lt;br>
Arley Soto, Biteca, Bogotá, Colombia&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h4 id="cómo-presentar-una-expresión-de-interés">Cómo presentar una expresión de interés&lt;/h4>
&lt;p>Por favor &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSeJhKR34FmXVHELDXZjNYy0W4TnEpuYJMHfKAPPYjRIuDuoQg/viewform?usp=sf_link" target="_blank">haga clic aquí para enviar su expresión de interés&lt;/a> o contáctame si tiene alguna pregunta &lt;a href="mailto:lofiesh@crossref.org">lofiesh [at] crossref.org&lt;/a>.&lt;/p>&lt;/span>
&lt;/div>
&lt;br />
&lt;p>&lt;a id="frenchversion">&lt;/a>
&lt;em>&lt;strong>Version Française&lt;/strong>&lt;/em>&lt;/p>
&lt;h2 id="appel-à-tous-les-membres-potentiels-du-conseil-dadministration">Appel à tous les membres potentiels du conseil d&amp;rsquo;administration&lt;/h2>
&lt;p>Le comité de nomination de Crossref invite les personnes qui seraient intéressées à se porter candidates pour l&amp;rsquo;élection au conseil d&amp;rsquo;administration de Crossref, pour le mandat commençant en 2021. Le comité de nomination rassemblera les réponses des personnes candidates et élaborera une liste des candidats, pour lesquels nos membres pourront voter lors des élections au conseil d&amp;rsquo;administration, en septembre. Les candidatures doivent être déposées au plus tard le vendredi 19 juin 2020.&lt;/p>
&lt;p>Le rôle du conseil d&amp;rsquo;administration de Crossref est d&amp;rsquo;opérer une supervision stratégique et financière de l&amp;rsquo;organisation, et de conseiller le directeur exécutif ainsi que l&amp;rsquo;équipe de direction du personnel. Les principales responsabilités du conseil d’administration sont les suivantes :&lt;/p>
&lt;ul>
&lt;li>Fixer l&amp;rsquo;orientation stratégique de l&amp;rsquo;organisation&lt;/li>
&lt;li>Assurer la surveillance financière&lt;/li>
&lt;li>Approuver de nouvelles politiques et de nouveaux services&lt;/li>
&lt;/ul>
&lt;p>Le conseil d&amp;rsquo;administration est représentatif de nos adhérents et guide l&amp;rsquo;équipe de direction du personnel en ce qui concerne les tendances affectant les communications savantes. Le conseil d&amp;rsquo;administration établit des orientations stratégiques pour l&amp;rsquo;organisation, tout en assurant le contrôle des changements et de la mise en œuvre des politiques. Les membres du conseil ont la responsabilité fiduciaire d&amp;rsquo;assurer son bon fonctionnement. Les membres du conseil d&amp;rsquo;administration s’acquittent de cette responsabilité en assistant aux réunions du conseil d&amp;rsquo;administration et en participant à des comités, plus spécifiques, du conseil d&amp;rsquo;administration.&lt;/p>
&lt;p>A titre d’exemple, en 2019, le conseil d&amp;rsquo;administration a décidé de supprimer les frais liés au service Crossmark. Ceci a impliqué un examen stratégique du service et de son alignement avec la mission de Crossref, par le comité des adhésions et frais, puis un examen des implications financières de la suppression des frais, et, finalement, un vote par l&amp;rsquo;ensemble du conseil d&amp;rsquo;administration pour supprimer les frais à partir de 2020.&lt;/p>
&lt;p>Les services de Crossref fournissent une infrastructure centralisée pour les communications savantes. Le conseil d&amp;rsquo;administration de Crossref aide à façonner l&amp;rsquo;avenir de nos services et, par extension, a un impact sur l&amp;rsquo;écosystème universitaire plus large. Les futurs membres du conseil d&amp;rsquo;administration sont recherchés particulièrement pour leur expérience et leur point de vue.&lt;/p>
&lt;h2 id="je-suis-intéressé-mais-très-occupé-quattend-on-des-administrateurs">Je suis intéressé mais très occupé! Qu&amp;rsquo;attend-on des administrateurs?&lt;/h2>
&lt;p>Les membres du conseil d&amp;rsquo;administration assistent à trois réunions par an qui ont généralement lieu en mars, juillet et novembre. Les réunions se déroulent dans des lieux divers, à l&amp;rsquo;échelle internationale, et une assistance financière est octroyée, en cas de besoin, pour le voyage.&lt;/p>
&lt;p>À partir de 2020, à la suite des restrictions de voyage causées par la COVID-19, le conseil a présenté un plan pour convoquer au moins une des réunions du conseil en téléconférence chaque année, et toutes les réunions des comités auront lieu en téléconférence. La plupart des membres du conseil d&amp;rsquo;administration siègent à au moins un comité de Crossref. Nous souhaitons préciser que nous prenons soin de prendre en compte le large éventail de fuseaux horaires dans lesquels vivent les membres de notre conseil d&amp;rsquo;administration.&lt;/p>
&lt;p>Bien que les manifestations d&amp;rsquo;intérêt émanent d’une personne, le siège pourvu au conseil appartient à l&amp;rsquo;organisation membre dans son ensemble. Le membre titulaire du conseil d&amp;rsquo;administration nomme également un suppléant, qui pourra assister aux réunions en cas d&amp;rsquo;empêchement du membre titulaire du siège au conseil d&amp;rsquo;administration.&lt;/p>
&lt;p>Il est attendu que les membres du conseil d’administration puissent dédier aux responsabilités présentées ci-dessus le temps qui leur est raisonnablement dû, ainsi qu&amp;rsquo;à la préparation et à la participation aux discussions des réunions du conseil.&lt;/p>
&lt;h2 id="à-propos-de-lélection">À propos de l&amp;rsquo;élection&lt;/h2>
&lt;p>Le conseil d&amp;rsquo;administration est élu selon une politique de «un membre, une voix» dans laquelle chaque membre de Crossref dispose d&amp;rsquo;une seule voix pour élire les représentants au conseil d&amp;rsquo;administration de Crossref. Le mandat du conseil d&amp;rsquo;administration est de trois ans et, cette année, six sièges sont à pourvoir lors de des élections de septembre prochain.&lt;/p>
&lt;p>Le conseil d&amp;rsquo;administration maintient un équilibre des sièges, avec huit sièges pour les petits éditeurs et huit sièges pour les grands éditeurs, afin de garantir que la diversité des expériences et des perspectives de la communauté de l&amp;rsquo;édition soit représentées dans les décisions prises à Crossref. Cette année, sont à pourvoir deux sièges de grands éditeurs et quatre sièges de petits éditeurs.&lt;/p>
&lt;p>Le vote aura lieu en ligne et s&amp;rsquo;ouvrira en septembre. Les résultats de ce scrutin seront communiqués lors de la réunion du conseil d&amp;rsquo;administration de novembre et les nouveaux membres commenceront leur mandat en 2021.&lt;/p>
&lt;h2 id="à-propos-du-comité-de-nomination">À propos du comité de nomination&lt;/h2>
&lt;p>Le comité des candidatures examinera les candidatures et sélectionnera une liste de candidats aux élections. Le nombre de candidats proposés dépassera le nombre total de sièges à pourvoir. Le comité prend en compte les déclarations d&amp;rsquo;intérêt, la taille de l&amp;rsquo;organisation, la géographie, le sexe et l&amp;rsquo;expérience des personnes pour sa sélection.&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h4 id="comment-exprimer-une-manifestation-dintérêt">Comment exprimer une manifestation d&amp;rsquo;intérêt&lt;/h4>
&lt;p>Veuillez cliquer ici pour envoyer votre candidature ou contactez-moi pour toute question à lofiesh [at] crossref.org.&lt;/p>&lt;/span>
&lt;/div></description></item><item><title>Come for a swim in our new pool of Education materials</title><link>https://www.crossref.org/blog/come-for-a-swim-in-our-new-pool-of-education-materials/</link><pubDate>Wed, 29 Apr 2020 00:00:00 +0000</pubDate><author>Laura J Wilkinson</author><guid>https://www.crossref.org/blog/come-for-a-swim-in-our-new-pool-of-education-materials/</guid><description>&lt;p>After 20 years in operation, and as our system matures from experimental to foundational infrastructure, it’s time to review our documentation.&lt;/p>
&lt;p>Having a solid core of education materials about the &lt;em>why&lt;/em> and the &lt;em>how&lt;/em> of Crossref is essential in making participation possible, easy, and equitable.&lt;/p>
&lt;p>As our system has evolved, our membership has grown and diversified, and so have our tools - both for depositing metadata with Crossref, and for retrieving and making use of it.&lt;/p>
&lt;p>Our new documentation gives the full picture, with each chapter explaining an aspect of Crossref and why it matters, followed by instructions on how to participate. As far as possible, these instructions are given for each of our deposit and retrieval methods.&lt;/p>
&lt;p>The revised documentation has been edited for use of simple English, and consistent terminology. Specialist vocabulary is explained as it is introduced. Understanding what’s involved across the full range of Crossref services can often seem complicated. This makes the documentation easier for readers, and provides a good basis for human and machine translations.&lt;/p>
&lt;p>The chapters and sections are modular, so you can approach and combine them in different ways according to your existing knowledge and what you wish to learn. This &lt;a href="https://en.wikipedia.org/wiki/Choose_Your_Own_Adventure" target="_blank">Choose Your Own Adventure&lt;/a> style means that sections don&amp;rsquo;t overlap, avoiding problems of repetition and versioning, and helping us to keep the information current.&lt;/p>
&lt;p>The revised documentation includes several new topics, including: &lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/education/metadata/">The importance of metadata&lt;/a>, explaining why you might register metadata for different purposes (discoverability, research integrity, reproducibility, and reporting and assessment)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/education/metadata/persistent-identifiers/">Persistent identifiers (PIDs)&lt;/a>, explaining the structure of a DOI, and how you might use DOIs at different levels&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/education/member-setup/choose-content-registration-method/">Choosing which way to register your content&lt;/a>, including suggested DOI registration workflow and suffix generator to make life easier&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/education/content-registration#00116">Introduction to types of metadata&lt;/a>, including descriptive (bibliographic), administrative, and structural &lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/education/crossmark/version-control-corrections-and-retractions/">Version control, corrections, and retractions&lt;/a>, including publication stages and DOIs&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://www.crossref.org/documentation/register-maintain-records/">Metadata stewardship&lt;/a>, including maintaining your metadata, reports, understanding your member obligations, and maintaining your Crossref membership.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>This new documentation is part of our efforts to make Crossref participation possible, easy, and rewarding for our members large and small, all over the world. It provides a concrete basis on which to build further education and outreach projects in the future. New members will start to see our paced member onboarding program, introducing them to parts of the documentation as and when it&amp;rsquo;s useful to them. And like the rest of the Crossref website, it&amp;rsquo;s all &lt;a href="https://creativecommons.org/licenses/by/4.0/" target="_blank">licensed for reuse under CC-BY&lt;/a>.&lt;/p>
&lt;p>I would like to say a big thank you to the members of the Education Task Force, who helped guide the development of the new documentation, representing a diverse range of Crossref members large and small from around the world:&lt;/p>
&lt;ul>
&lt;li>Anjum Sherasiya - India, Editor-in-Chief of Veterinary World, Crossref Ambassador&lt;/li>
&lt;li>Budi Setiawan - Indonesia, Poltekkes Kemenkes Yogyakarta&lt;/li>
&lt;li>Caroline Breul - USA, BioOne&lt;/li>
&lt;li>Isabel Recavarren - Peru, Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica (CONCYTEC), Crossref Ambassador&lt;/li>
&lt;li>Mike Nason - Canada, Public Knowledge Project (PKP) and University of New Brunswick&lt;/li>
&lt;li>Nadine van der Merwe - South Africa, Academy of Science of South Africa (ASSAf)&lt;/li>
&lt;li>Roberto Camargo - Brazil, Associação Brasileira de Editores Científicos (ABEC)&lt;/li>
&lt;li>Sioux Cumming - UK, INASP&lt;/li>
&lt;li>Taeil Kim - South Korea, Korean Association of Medical Journal Editors (KAMJE)&lt;/li>
&lt;li>and from Crossref: Amanda, Esha, Geoffrey, Ginny, Isaac, Kirsty, Patricia, and Susan.&lt;/li>
&lt;/ul>
&lt;p>Please explore the &lt;a href="https://www.crossref.org/education/">new documentation&lt;/a>, give us your feedback using the yellow &amp;ldquo;Docs feedback&amp;rdquo; button at the bottom of each page, and share this update to spread the word!&lt;/p></description></item><item><title>Crossing the Rubicon - The case for making chapters visible</title><link>https://www.crossref.org/blog/crossing-the-rubicon-the-case-for-making-chapters-visible/</link><pubDate>Wed, 29 Apr 2020 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/crossing-the-rubicon-the-case-for-making-chapters-visible/</guid><description>&lt;p>&lt;em>To help better support the discovery, sale and analysis of books, Jennifer Kemp from Crossref and Mike Taylor from Digital Science, present seven reasons why publishers should collect chapter-level metadata.&lt;/em>&lt;/p>
&lt;p>Book publishers should have been in the best possible position to take advantage of the movement of scholarly publishing to the internet. After all, they have behind them an extraordinary legacy of creating and distributing data about books: the metadata that supports discovery, sales and analysis.&lt;/p>
&lt;p>Librarianship, and the management of book catalogs at scale took off in the nineteenth century. The Dewey Decimal Classification, the various initiatives of the Library of Congress and the British Library followed. Innovations from the 1960s gave us MARC records and ISBNs. The late 90s produced ONIX, which gave the book industry a tremendous start in migrating online. &lt;/p>
&lt;p>However, progress in the decades after appears to have been less dramatic. Some might even argue that this tremendous legacy and wealth of metadata experience has acted as a weight, and has slowed progress. Nowhere is this lack of progress clearer than in the discovery and analysis of book chapters: approximately one-quarter of books published per year has chapter-level metadata, and about two-thirds of books don&amp;rsquo;t have a persistent and open identifier, ratios that have not significantly changed over the last ten years.&lt;/p>
&lt;p>&lt;strong>Only one-quarter of scholarly books make chapter level metadata available&lt;/strong>&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/2020/rubicon-blog-piechart.png" alt="pie chart" width="550" class="img-responsive" />&lt;/center>
&lt;p>The proportion of edited books and monographs with chapter-level data is approximately one-quarter of all books published in the last ten years. Calculating this figure is necessarily approximate, using numbers published in Grimme et al (2019), and based on data and observed trends in both Dimensions and Crossref.&lt;/p>
&lt;h3 id="so-why-the-lack-of-progress">So why the lack of progress? &lt;/h3>
&lt;p>For many publishers and their vendor partners, with systems geared up to the efficient delivery of title-level information, the case for moving towards chapter-level metadata can seem daunting (and potentially expensive!).&lt;/p>
&lt;p>Metadata is necessarily detailed and it&amp;rsquo;s not the kind of thing most people will dabble in. Practitioners, as in other technical fields, have expertise that others may find difficult to leverage if they don&amp;rsquo;t know what questions to ask. organisations often find themselves entrenched in outdated approaches to metadata. &lt;/p>
&lt;p>Crossref and Metadata 2020 are collaborating to produce arguments why publishers should move from book-level metadata to chapters. They&amp;rsquo;ve been working with representatives from the scholarly community, including both small and large presses, not-for-profits and university presses. &lt;/p>
&lt;h3 id="here-we-present-7-reasons-why-publishers-should-collect-chapter-level-metadata">Here we present 7 reasons why publishers should collect chapter-level metadata:&lt;/h3>
&lt;p>1. &lt;strong>Increased discoverability&lt;/strong>&lt;br>
Increasingly, we&amp;rsquo;re seeing students and researchers move away from traditional book catalogs and onto more general purpose tools, that are often optimized for journal content, and which may - inadvertently - exclude books and chapters from search results. Making chapter level data and DOIs available places book content into these new channels at no additional cost, and starts to reduce the dependency on specialist vendors. Discovery is simplified, requiring less familiarity or expertise to find relevant book content. &lt;/p>
&lt;p>2. &lt;strong>Increased usage&lt;/strong>&lt;br>
Exposing the contents of books at a more granular level drives more users towards the book content, and increasing usage numbers and (depending on platform and business model) revenue.&lt;/p>
&lt;p>3. &lt;strong>Matching author expectations&lt;/strong>&lt;br>
New generations of authors expect their content to be easily discoverable in the platforms they use. Without chapter level data, this content won&amp;rsquo;t easily be found in Google Scholar, Mendeley or ResearchGate. For younger researchers, for those in certain disciplines or using resources well-suited to it, if the chapter metadata - which in many cases requires either an introductory paragraph or an abstract - is missing, the book may as well not exist.&lt;/p>
&lt;p>4. &lt;strong>Author exposure&lt;/strong>&lt;br>
About half of scholarly book publishing is thought to be in the form of collected works: books where two or three editors get credit at the top level, but dozens of authors contribute to the chapters. Without chapter level metadata, these authors &amp;ndash; the book authors of tomorrow &amp;ndash; get no credit for their efforts.&lt;/p>
&lt;p>5. &lt;strong>Usage and citations reporting&lt;/strong>&lt;br>
Having chapters readily available in the modern platforms means that they start to accumulate evidence of sharing and citations from the moment of being published. Where chapter content is available on its own, the lack of associated metadata inhibits this evidence. After all, the DOI is a citation identifier. Evidence of impact is now critical for research evaluation, funding, tenure and promotion, and without this data, an author&amp;rsquo;s chapter may as well remain unread.&lt;/p>
&lt;p>6. &lt;strong>Supporting your authors with funding compliance and reporting&lt;/strong>&lt;br>
Authors are increasingly being mandated by their funders to report back on the status of their books and chapters. And, in the case of Open Books and Open Chapters, the funders and authors are frequently the ultimate clients, who are looking to record and report evidence of both academic or social impact. Making chapter level information and identifiers available will facilitate this evidence gathering, especially for open chapters within otherwise non-open books, and increasingly common phenomena.&lt;/p>
&lt;p>7. &lt;strong>Understanding the hot topics in your books&lt;/strong>&lt;br>
Whether you use Altmetric, or one of the other data sources that capture book activity, being able to access the social and media metrics of the chapters in your book gives you an immediate insight into the topics that capture interest at a broader level. Vital information when it comes to planning more books in the space, especially if you&amp;rsquo;re on the look out for books with trade crossover potential.&lt;/p>
&lt;p>With chapter-level data, publishers can summarize their programs and compare how many authors they work with, how many book titles they have and where there might be gaps in subject and authors omitted from the metadata. Does the scholarly record fully reflect each book? If not, there may be a good deal of information that is simply unavailable to the machines that read the metadata and use it in systems throughout scholarly communications. &lt;/p>
&lt;p>Fortunately, it&amp;rsquo;s becoming easier to manage this data. Although traditional book metadata systems don&amp;rsquo;t always support chapter-level data, they do often permit publishers to register title-level DOIs, and with Crossref encouraging ISBN information alongside the generation of chapter level DOIs, some of the significant challenges have been reduced.&lt;/p>
&lt;p>Both &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/books-and-chapters/">Crossref&lt;/a> and &lt;a href="http://www.metadata2020.org/blog/2020-03-17-metadata-practices/" target="_blank">Metadata 2020&lt;/a> offer best practices that make clear the need for richer metadata. It&amp;rsquo;s also important to acknowledge the very real barriers to providing robust metadata, whether for book chapters or anything else, which is why having the conversations and being aware of available resources is important. Because, though it may be difficult, the hurdles are often up-front making the decision to invest in better metadata, factoring in associated costs, setting up workflows, etc.&lt;/p>
&lt;p>But as we have seen from the previous decades, book publishers and their suppliers are experts in managing substantial amounts of metadata. Just as no-one would argue to roll-back all those advantages, we believe that - once deployed - industry-wide creation and distribution of chapter data would be an advance from which there is no retreat.&lt;/p>
&lt;h4 id="references">REFERENCES&lt;/h4>
&lt;p>&lt;a href="https://riojournal.com/article/38698/" target="_blank">https://riojournal.com/article/38698/&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.6084/m9.figshare.8197625" target="_blank">The State of Open Monographs Report&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://longleafservices.org/blog/the-sustainable-history-monograph-pilot/" target="_blank">https://longleafservices.org/blog/the-sustainable-history-monograph-pilot/&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://scholarlykitchen.sspnet.org/2017/12/07/enriching-metadata-is-marketing/" target="_blank">https://scholarlykitchen.sspnet.org/2017/12/07/enriching-metadata-is-marketing/&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://www.ingenta.com/blog-article/five-reasons-chapter-level-metadata-increases-value-academic-books/" target="_blank">https://www.ingenta.com/blog-article/five-reasons-chapter-level-metadata-increases-value-academic-books/&lt;/a>&lt;/p></description></item><item><title>Memoirs of a DOI detective...it’s error-mentary dear members</title><link>https://www.crossref.org/blog/memoirs-of-a-doi-detective...its-error-mentary-dear-members/</link><pubDate>Mon, 27 Apr 2020 00:00:00 +0000</pubDate><author>Paul Davis</author><guid>https://www.crossref.org/blog/memoirs-of-a-doi-detective...its-error-mentary-dear-members/</guid><description>&lt;p>Hello, I’m Paul Davis and I’ve been part of the Crossref support team since May 2017. In that time I’ve become more adept as a DOI detective, helping our members work out &lt;em>whodunnit&lt;/em> when it comes to submission errors.&lt;/p>
&lt;p>If you have ever received one of our error messages after you have submitted metadata to us, you may know that some are helpful and others are, well, difficult to decode. I&amp;rsquo;m here to help you to become your own DOI detective.&lt;/p>
&lt;h3 id="motive-ridding-the-world-of-bad-metadata">Motive: ridding the world of bad metadata&lt;/h3>
&lt;p>When depositing xml files to us, there can be a plethora of error messages returned to you in the submission logs. Wait, what are submission logs? If that is the first thing that came to mind, then you’re in the right place; do keep reading.&lt;/p>
&lt;h3 id="means-xml-deposits">Means: XML deposits&lt;/h3>
&lt;p>After each content registration or update is received into our deposit admin system, it is initially placed in the submission queue and later, once its time comes, is processed. Whether that deposit comes from the &lt;a href="https://www.crossref.org/webDeposit/" target="_blank">web deposit form&lt;/a>, &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a>, or a good old fashioned &lt;a href="https://www.crossref.org/documentation/content-registration/metadata-deposit-schema/">XML deposit&lt;/a>, a submission log is created in our system. This log contains important information about the deposit and its success or failures.&lt;/p>
&lt;p>I will go through how you will find and receive this log later on.
At the bottom of the submission log you will see a status message that looks like this:&lt;/p>
&lt;pre tabindex="0">&lt;code> &amp;lt;batch_data&amp;gt;
&amp;lt;record_count&amp;gt;***&amp;lt;/record_count&amp;gt;
&amp;lt;success_count&amp;gt;***&amp;lt;/success_count&amp;gt;
&amp;lt;warning_count&amp;gt;***&amp;lt;/warning_count&amp;gt;
&amp;lt;failure_count&amp;gt;***&amp;lt;/failure_count&amp;gt;
&amp;lt;/batch_data&amp;gt;
&lt;/code>&lt;/pre>&lt;p>To some, this might look a bit like a crime scene. If the status report displays the same number in the &lt;code>&amp;lt;record_count&amp;gt;&lt;/code> and the &lt;code>&amp;lt;success_count&amp;gt;&lt;/code>, then no crime (against deposits) has been committed. Everything you have tried to register or update has been successful and we are all free as DOI detectives to knock off early.&lt;/p>
&lt;p>At some point you will probably come across an error or failure in the submission logs, where the failure count is 1.&lt;/p>
&lt;pre tabindex="0">&lt;code> &amp;lt;batch_data&amp;gt;
&amp;lt;record_count&amp;gt;1&amp;lt;/record_count&amp;gt;
&amp;lt;success_count&amp;gt;0&amp;lt;/success_count&amp;gt;
&amp;lt;warning_count&amp;gt;0&amp;lt;/warning_count&amp;gt;
&amp;lt;failure_count&amp;gt;1&amp;lt;/failure_count&amp;gt;
&amp;lt;/batch_data&amp;gt;
&lt;/code>&lt;/pre>&lt;p>For the purposes of this blog, this type of message means a &lt;em>“crime”&lt;/em> has been committed. The worst kind of crime - a metadata crime. In the real world, outside of this blog, it just means that your deposit has failed and you need to take some action to fix it. You will also receive accompanying error messages (an evidence log) with details about what went wrong with your submission. We’ll deliver these submission details to you as well in the following ways:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>For those submitting via the web deposit form, to the email address used to register your submission&lt;/p>
&lt;/li>
&lt;li>
&lt;p>On screen and within the admin tool using the submission ID for those submitting via Metadata Manager&lt;/p>
&lt;/li>
&lt;li>
&lt;p>For those submitting XML, to the email included in the &lt;code>&amp;lt;email_address&amp;gt;&lt;/code> element of your deposit XML&lt;/p>
&lt;/li>
&lt;li>
&lt;p>You can also find the submission log in the admin system at any point&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>More information on &lt;a href="https://www.crossref.org/education/content-registration/verify-your-registration/submission-queue-and-log/#00143">viewing past deposits&lt;/a> in the admin system can be found on our support site.&lt;/p>
&lt;h3 id="the-usual-suspects">The usual suspects&lt;/h3>
&lt;p>Those serial offenders, when it comes to failed deposits, are:&lt;/p>
&lt;h4 id="timestamps">Timestamps&lt;/h4>
&lt;ul>
&lt;li>&lt;strong>Misdemeanor&lt;/strong> - Every deposit has a &lt;code>&amp;lt;timestamp&amp;gt;&lt;/code> value, and that value needs to be incremented each time the DOI is updated. This is done automatically for you in Metadata Manager, the Web Deposit Form and the OJS plugin. But if you’re updating an existing DOI by sending us the whole XML file again, you need to make sure that you update the timestamp as well as the field you’re trying to update.&lt;/li>
&lt;/ul>
&lt;pre tabindex="0">&lt;code>Error: &amp;lt;msg&amp;gt;Record not processed because submitted version: 201907242206 is less or equal to previously submitted version 201907242206&amp;lt;/msg&amp;gt;
&lt;/code>&lt;/pre>&lt;ul>
&lt;li>&lt;strong>Rehabilitation&lt;/strong> - simply resubmit your XML file, but make sure that you increment the timestamp value to be larger than the current timestamp value.&lt;/li>
&lt;/ul>
&lt;h4 id="titles">Titles&lt;/h4>
&lt;ul>
&lt;li>&lt;strong>Misdemeanor&lt;/strong> - These need to match exactly between what we have on the system against the ISSN/ISBN and what is in the deposit file.&lt;/li>
&lt;/ul>
&lt;pre tabindex="0">&lt;code>Error: &amp;lt;msg&amp;gt;Deposit contains title error: the deposited publication title is different than the already assigned title&amp;lt;/msg&amp;gt;
&lt;/code>&lt;/pre>&lt;p>or&lt;/p>
&lt;pre tabindex="0">&lt;code>Error: &amp;lt;msg&amp;gt;ISSN &amp;#34;123454678&amp;#34; has already been assigned, issn (123454678) is assigned to another title (Journal of Metadata)&amp;lt;/msg&amp;gt;
&lt;/code>&lt;/pre>&lt;ul>
&lt;li>&lt;strong>Rehabilitation&lt;/strong> - you can check the title we have on the system against the ISSN/ISBN on the &lt;a href="https://www.crossref.org/titleList/" target="_blank">title list&lt;/a> and make the necessary changes, or contact &lt;a href="mailto:support@crossref.org">support&lt;/a> for us to check the title in our system and make changes to match the title in the deposit to the one in the system, if known.&lt;/li>
&lt;/ul>
&lt;h4 id="title-level-dois">Title level DOIs&lt;/h4>
&lt;ul>
&lt;li>&lt;strong>Misdemeanor&lt;/strong> - These also need to match up exactly in both system and deposit&lt;/li>
&lt;/ul>
&lt;pre tabindex="0">&lt;code>Error: &amp;lt;msg&amp;gt;Deposit contains title error: The journal has a different DOI assigned; If you want to change the journal&amp;#39;s DOI please contact Crossref support: title=Journal of Metadata; current-doi=10.14393/JoM; deposited-doi=10.14393/JoM.1.1&amp;lt;/msg&amp;gt;
&lt;/code>&lt;/pre>&lt;ul>
&lt;li>&lt;strong>Rehabilitation&lt;/strong> - contact us to change the journal level DOI in the system or change the DOI in the deposit yourself to match the one already registered for the title.&lt;/li>
&lt;/ul>
&lt;h4 id="errors-in-the-xml">Errors in the xml&lt;/h4>
&lt;ul>
&lt;li>&lt;strong>Misdemeanor&lt;/strong> - Poor formatting, self closing tags, invalid values.&lt;/li>
&lt;/ul>
&lt;pre tabindex="0">&lt;code>Error: &amp;lt;msg&amp;gt;Deposited XML is not well-formed or does not validate: Error on line 538&amp;lt;/msg&amp;gt;
&lt;/code>&lt;/pre>&lt;ul>
&lt;li>&lt;strong>Rehabilitation&lt;/strong> - update the xml file that was deposited as it was not well formed against our schema or as an xml file in general. Check you have saved the file correctly (as an .xml file), edited it in an xml editor and not a word processor and if that fails, then contact &lt;a href="mailto:support@crossref.org">support&lt;/a> and we will try to assist. We also have a collection of &lt;a href="https://gitlab.com/crossref/schema/-/tree/master/examples" target="_blank">new xml examples&lt;/a> you may use as a template.&lt;/li>
&lt;/ul>
&lt;h3 id="forensics">Forensics&lt;/h3>
&lt;p>There are a few tools we offer to help with the deciphering of the error messages –– we think of these as our magnifying glass(es).&lt;/p>
&lt;ul>
&lt;li>
&lt;p>The &lt;strong>&lt;a href="https://www.crossref.org/titleList/" target="_blank">Title list&lt;/a>&lt;/strong>: A list of all of the titles in our database, you can check against the ISSN/ISBN to see what the title on our system is and whether it matches the title you have in your deposit.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;strong>&lt;a href="https://www.crossref.org/06members/51depositor.html" target="_blank">Depositor Report&lt;/a>&lt;/strong>: Shows all journals, books, and conference proceedings against each member. The report includes all DOIs for each journal, book, conference; the most recently used timestamps; and citation counts for each DOI.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;strong>&lt;a href="https://doi.crossref.org/servlet/reports" target="_blank">Reports tab&lt;/a>&lt;/strong> in the admin system: You can find out the history behind a DOI by searching against this in the admin console.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Our &lt;strong>&lt;a href="https://www.crossref.org/education/content-registration/verify-your-registration/troubleshooting-submissions/#00152">common error messages&lt;/a>&lt;/strong> are documented within our support documentation. You can always find out more about most of the error messages are system displays at the link above.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>You can find the current &lt;strong>&lt;a href="http://doi.crossref.org/search/doi?pid=support@crossref.org&amp;amp;format=unixsd&amp;amp;doi=10.5555%2F12345678" target="_blank">xml metadata against a DOI&lt;/a>&lt;/strong> by adding the DOI to the end of this link &lt;a href="http://doi.crossref.org/search/doi?pid=support@crossref.org&amp;amp;format=unixsd&amp;amp;doi=" target="_blank">http://doi.crossref.org/search/doi?pid=support@crossref.org&amp;format=unixsd&amp;doi=&lt;/a>
(you might need an xml viewer browser extension to view the xml in a more readable format).&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="calling-for-backup">Calling for backup&lt;/h3>
&lt;p>We’ll also soon be adding more leads to our submission logs and error messages for the best of our detectives. These improvements will point our DOI detectives to better documentation about interpreting error messages and taking the appropriate action to resolve those errors.&lt;/p>
&lt;p>But there are a lot more error messages out there. If you have trouble deciphering any error message you encounter, then please do send the case number (submission ID) over to CSI (Crossref Support Investigations) at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p>
&lt;p>You can also find lots of great information in the pages of our new &lt;a href="https://www.crossref.org/education/">documentation&lt;/a>.&lt;/p></description></item><item><title>Helping researchers identify content they can text mine</title><link>https://www.crossref.org/blog/helping-researchers-identify-content-they-can-text-mine/</link><pubDate>Thu, 16 Apr 2020 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/helping-researchers-identify-content-they-can-text-mine/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>Many organisations are doing what they can to aid in the response to the COVID-19 pandemic. Crossref members can make it easier for researchers to identify, locate, and access content for text mining. In order to do this, members must include elements in their metadata that:&lt;/p>
&lt;ul>
&lt;li>Point to the full text of the content.&lt;/li>
&lt;li>Indicate that the content is available under an open access license or that it is being made available for free (gratis).&lt;/li>
&lt;/ul>
&lt;h2 id="how-to-do-it">How to do it.&lt;/h2>
&lt;h3 id="if-your-content-is-open-access">If your content is open access&lt;/h3>
&lt;p>Make sure the Crossref metadata for all of your open access content includes:&lt;/p>
&lt;ol>
&lt;li>The URL of the open access license the content is under.&lt;/li>
&lt;li>A URL that points to the full text of the content on your site (PDF, XML or HTML).&lt;/li>
&lt;/ol>
&lt;p>&lt;a href="https://www.crossref.org/education/retrieve-metadata/rest-api/text-and-data-mining-for-members/">Instructions for including license and full text URLs in your metadata.&lt;/a>&lt;/p>
&lt;h3 id="if-you-are-making-subscription-content-available-for-text-mining-temporarily-or-otherwise">If you are making subscription content available for text mining (temporarily or otherwise).&lt;/h3>
&lt;p>Make sure the Crossref metadata for the content you are making freely available for text mining includes:&lt;/p>
&lt;ol>
&lt;li>The URL of the publisher license the content is under.&lt;/li>
&lt;li>A URL that points to the full text of the content where it is being made freely available (PDF, XML or HTML). This might not be on your site.&lt;/li>
&lt;/ol>
&lt;p>&lt;a href="https://www.crossref.org/education/retrieve-metadata/rest-api/text-and-data-mining-for-members/">Instructions for including license and full text URLs in your metadata.&lt;/a>&lt;/p>
&lt;p>In addition, you need to flag the content that you are making freely available.&lt;/p>
&lt;ol start="3">
&lt;li>A “free to read” element in the access indicators section of your metadata indicating that the content is being made available free-of-charge (gratis).&lt;/li>
&lt;li>An assertion element indicating that the content being made available is available free-of-charge.&lt;/li>
&lt;/ol>
&lt;p>&lt;a href="https://www.crossref.org/flagging-free-to-read/">Instructions for flagging your content as “free”&lt;/a>&lt;/p>
&lt;p>Note that step #4 is required in order for users to be able to find content marked as “gratis” in Crossref’s REST API.&lt;/p>
&lt;p>And if you decide to revoke the free access in the future, you will need to update the data to reflect that restrictions have been reimposed.&lt;/p>
&lt;h2 id="sounds-great-has-anybody-else-actually-done-this">Sounds great. Has anybody else actually done this?&lt;/h2>
&lt;p>&lt;em>Yes.&lt;/em>&lt;/p>
&lt;p>Over 43 million metadata records already have a license and a full text link. &lt;a href="https://api.crossref.org/works?filter=has-license:true,has-full-text:true&amp;rows=0">&lt;a href="https://api.crossref.org/works?filter=has-license:true,has-full-text:true&amp;amp;rows=0" target="_blank">https://api.crossref.org/works?filter=has-license:true,has-full-text:true&amp;rows=0&lt;/a>&lt;/a>&lt;/p>
&lt;p>Millions of the above items have one of the &lt;a href="https://creativecommons.org/" target="_blank">Creative Commons&lt;/a> licenses or a dedicated text and data mining license provided by the publisher.&lt;/p>
&lt;p>And in the past three weeks (as of the writing of this blog post) over 23,000 articles have been flagged as “free” so they are available for text mining.&lt;/p>
&lt;p>&lt;a href="https://api.crossref.org/v1/works?filter=assertion:free,has-full-text:true" target="_blank">https://api.crossref.org/v1/works?filter=assertion:free,has-full-text:true&lt;/a>&lt;/p></description></item><item><title>Changes to resolution reports</title><link>https://www.crossref.org/blog/changes-to-resolution-reports/</link><pubDate>Fri, 10 Apr 2020 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/changes-to-resolution-reports/</guid><description>&lt;p>This blog is long overdue. My apologies for the delay. I promised you an update in February as a follow up to the &lt;a href="https://doi.org/10.64000/xpe8h-4tt05" target="_blank">resolution reports blog&lt;/a> originally published in December by my colleague Jon Stark and me. Clearly we (I) missed that February projection, but I’m here today to provide said update. We received many great suggestions from our members as a result of the call for comments. For those of you who took time to write: thank you! We took extra time to review and evaluate all of your comments and recommendations. We have reached a decision about the major proposed change - removal of all filters from monthly resolution reports - as well as a couple of suggested improvements from that feedback.&lt;/p>
&lt;h3 id="quick-recap-of-our-original-blog">Quick recap of our original blog&lt;/h3>
&lt;p>Jon wrote the original version of the resolution report in late 2009 in an effort to provide you, our members, with information about the usage of registered Crossref DOIs. At that time, Jon and others at Crossref thought it important to segment human-driven traffic from resolutions by machines (bots). Thus, we decided to filter out well-known machine activity in an attempt to only present you with resolutions by individual humans.&lt;/p>
&lt;p>In the last ten-plus years things changed. We live in a time where most of our work requires both human and machine interaction. Therefore, we have hypothesized that some, or most, of those resolutions from machines today represent legitimate activity and should be reported to you each month. Since we don’t have a reliable method to segment those resolutions, and don’t think we should be making judgments about which resolutions should and should not be included in the reports, we proposed removing all filters and presenting you with all the numbers.&lt;/p>
&lt;h3 id="what-we-heard-from-you">What we heard from you&lt;/h3>
&lt;p>In addition to soliciting comments in the blog, I also reached out to all of our members who had written into our &lt;a href="mailto:support@crossref.org?subject=changing%20resolution%20reports">support desk&lt;/a> in the last year about anything related to resolution reports. We received dozens of responses from the blog and my outreach via email. The most common response was from members expressing their appreciation for and highlighting the utility of the reports. Most everyone told us how they were using the reports - from monitoring failure rates to mitigate issues to identifying trends over time. And a great number of respondents expressed concern that removing the filters might alter how or what we present to you in the reports (more on that soon). And, finally, several of you shared suggestions for improvement.&lt;/p>
&lt;h3 id="where-we-go-from-here">Where we go from here&lt;/h3>
&lt;p>Our existing filters have been removing between 100 and 150 million resolutions from the monthly numbers we report to all members, collectively. Based on those figures, when we remove the filters all resolutions numbers will increase by about 25%. Those increased resolutions will vary from member to member because the numbers are based on actual bots crawling specific content, so some members may see more of an increase than others. We are mindful of how our members might adjust to that new baseline, since these changes will mean a noticeable (and, significant) increase in resolution totals for the majority of our members.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/Total and Filtered_resolutions_18_19_OCT_new.png" alt="Total and filtered resolutions" width="550" class="img-responsive" />&lt;/center>
&lt;p>Outside of the suggested tweaks from members below and that 25% increase I mentioned (due to the retirement of the filters), the reports will remain unchanged. You’ll continue to receive successful resolutions, the report of top 10 DOIs, and the csv file containing failed resolutions. Our most important consideration throughout this process is that these reports continue to serve you.&lt;/p>
&lt;h3 id="the-changes">The changes&lt;/h3>
&lt;p>We liked some of your suggestions, so we’re set to adopt a few of the more straightforward improvements. Those that are more complicated we’re considering for the &lt;a href="https://doi.org/10.64000/1a52b-7pf27" target="_blank">Member Center&lt;/a> (working title, subject to change) project, where we will start to bring together all business and technical information for our members, service providers and metadata users.&lt;/p>
&lt;ol>
&lt;li>As I said, we’re removing the filters. Starting in June, we’ll present all of the resolutions to you. No filters. On average, monthly resolution numbers will therefore increase by about 25%.&lt;/li>
&lt;li>We currently link to the failed DOI.csv near the bottom of the resolution report. For many members with large volumes of content, the resolution report can take some time to load and sift through, so we’re moving the link to the failed DOI.csv file up the page (Note: we know they are other changes we can make to the report itself that will make it easier to work with for members with large volumes of data; we’re exploring those improvements).&lt;/li>
&lt;li>We learned during this process that some members were not receiving resolution reports when they only had failed resolutions. One of the aims of the reports is to help members identify content registration problems, so this was a bug we are keen to repair. We are fixing it. Once it is fixed, all members who have at least one resolution - successful or failed - during the previous month will receive the report.&lt;/li>
&lt;/ol>
&lt;h3 id="what-we-cant-change">What we can&amp;rsquo;t change&lt;/h3>
&lt;p>Many members who responded to the call and who also enquire throughout the year (outside of this call) express interest in receiving more information from the resolution reports. You want resolution numbers for all your DOIs. You want referral information about where the resolutions are coming from (e.g., IP addresses) and breakdowns by machine/human. You want more information about how and why the failure rate is growing over time. We understand.&lt;/p>
&lt;p>In the past, we did try to process more information for IP addresses and user agents but it turns out that generating that volume of extra data and processing monthly is simply impractical. The other issue is one of privacy. IP addresses are considered personally identifiable information (PII), or data that could potentially be used to identify particular people. We are committed to maintaining the privacy of our members and users and therefore cannot provide this level of granularity in our reports.&lt;/p>
&lt;h3 id="next-up">Next up&lt;/h3>
&lt;p>Look for these changes starting in June. If you read this far, you may not need it, but we’ll also include a reminder atop the report itself about the increase in resolution totals as a result of our changes.&lt;/p></description></item><item><title>Free public data file of 112+ million Crossref records</title><link>https://www.crossref.org/blog/free-public-data-file-of-112-million-crossref-records/</link><pubDate>Thu, 09 Apr 2020 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/free-public-data-file-of-112-million-crossref-records/</guid><description>&lt;p>A lot of people have been using our public, open APIs to collect data that might be related to COVID-19. This is great and we encourage it. We also want to make it easier. To that end we have made a free data file of the public elements from Crossref’s 112.5 million metadata records.&lt;/p>
&lt;p>The file (65GB, in JSON format) is available via Academic Torrents here: &lt;a href="https://doi.org/10.13003/83B2GP" target="_blank">https://doi.org/10.13003/83B2GP&lt;/a>&lt;/p>
&lt;p>&lt;strong>&lt;em>It is important to note that &lt;a href="https://www.crossref.org/education/retrieve-metadata/">Crossref metadata&lt;/a> is always openly available.&lt;/em>&lt;/strong> The difference here is that we’ve done the time-saving work of putting all of the records registered through March 2020 into one file for download.&lt;/p>
&lt;p>The sheer number of records means that, though anyone can use these records anytime, downloading them all via our APIs can be quite time-consuming. We hope this saves the research community valuable time during this crisis.&lt;/p>
&lt;h2 id="a-few-important-notes">A few important notes&lt;/h2>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>All records are included.&lt;/strong> In other words, the data file has every DOI ever registered with Crossref through March 31st, 2020. &lt;em>This means it’s a large file, 65GB.&lt;/em>&lt;/p>
&lt;ul>
&lt;li>Metadata is supplied by our members and, as such, not all records have the same completeness (or quality) of metadata. Bibliographic metadata is generally &lt;a href="https://support.crossref.org/hc/en-us/articles/213077846-Required-Recommended-and-Optional-Elements" target="_blank">required&lt;/a>. All other metadata, e.g. license and funding information, ORCIDs, etc. is optional (though very much encouraged).&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/documentation/principles-practices/">References&lt;/a> (i.e. authors’ cited sources) are also optional metadata. Nearly 50 million records include references and, of those, nearly 30 million have open references that are included in the data file. “Limited” and “Closed” references are not included in the data file. &lt;em>[EDIT 6th June 2022 - all references are now open by default with the March 2022 board vote to remove any restrictions on reference distribution].&lt;/em>&lt;/li>
&lt;li>If an error in the metadata is found, please report it directly to the publisher to correct.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>The records are in JSON.&lt;/strong>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>New and updated records can be added incrementally&lt;/strong> using our REST API, which includes a number of date filter options, e.g. index-date.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>No registration is required&lt;/strong> to use our &lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">REST API&lt;/a> but we do strongly encourage being a &lt;a href="https://github.com/CrossRef/rest-api-doc#etiquette" target="_blank">‘polite’&lt;/a> (i.e. identified) user. It makes troubleshooting much easier and reduces the chance of negatively impacting other users.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Questions, comments and feedback are welcome at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p>
&lt;p>We thank AcademicTorrents.com for helping us make this data available.
And we are grateful for the incredible efforts of everyone working to support research everywhere&amp;ndash;stay safe and well.&lt;/p></description></item><item><title>You’ve had your say, now what? Next steps for schema changes</title><link>https://www.crossref.org/blog/youve-had-your-say-now-what-next-steps-for-schema-changes/</link><pubDate>Thu, 02 Apr 2020 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/youve-had-your-say-now-what-next-steps-for-schema-changes/</guid><description>&lt;p>It seems like ages ago, particularly given recent events, but we had our first &lt;a href="https://doi.org/10.64000/2h99q-cm213" target="_blank">public request for feedback&lt;/a> on proposed schema updates in December and January. The feedback we received indicated two big things: we’re on the right track, and you want us to go further. This update has some significant but important changes to contributors, but is otherwise a fairly moderate update. The feedback was mostly supportive, with a fair number of helpful suggestions about details.&lt;/p>
&lt;h2 id="feedback-and-changes">Feedback and changes&lt;/h2>
&lt;p>Many of you are excited about CRediT, and a number of members have indicated that they are ready and waiting to send us CRediT roles. To support this, as in &lt;a href="https://docs.google.com/document/d/1gCRaWqkne_QqNs0BO78KGfjPFMDkpAQ-ky2nVynkuwc/edit#heading=h.xn4d62hlps6o" target="_blank">my initial proposal&lt;/a>, we’re adding a new &lt;code>role&lt;/code> element and &lt;code>role_type&lt;/code> attribute that supports existing Crossref-defined roles and CRediT roles, as well as a required &lt;code>vocab&lt;/code> attribute to specify which vocabulary is being supplied.&lt;/p>
&lt;p>&lt;code>&amp;lt;role role_type=&amp;quot;author&amp;quot; vocab=&amp;quot;crossref&amp;quot;&amp;gt;author&amp;lt;/role&amp;gt; &amp;lt;role role_type=&amp;quot;writing-original_draft&amp;quot; vocab=&amp;quot;credit&amp;quot;/&amp;gt;&lt;/code>&lt;/p>
&lt;p>CRediT as it exists now is an informal standard &lt;a href="https://casrai.org/credit/" target="_blank">coordinated by CASRAI&lt;/a>, but a formal standard is &lt;a href="https://niso.org/niso-io/2019/12/next-steps-toward-using-credit-credit" target="_blank">in the works via NISO&lt;/a>. CRediT is currently a list of well considered and defined roles that are not particularly machine-readable. I’ve created a list for implementation that eliminates spaces and ampersands. CRediT also lacks reliable PIDs or persistent URLs for the role definitions, so that has been omitted from our implementation. We’ll adopt any changes resulting from the NISO standard, but have decided to go forward with it as-is, as many of our members are eager to implement.&lt;/p>
&lt;p>Beyond CRediT, we’ll also be expanding and refining our contributor support in a number of ways:&lt;/p>
&lt;ul>
&lt;li>We’ll be expanding our affiliation metadata beyond a simple string to include organisation identifiers like &lt;a href="https://ror.org" target="_blank">ROR&lt;/a>, and allow markup of organisation names and locations.&lt;/li>
&lt;li>We’re expanding the contributor identifiers as well - in addition to ORCID iDs, members can send us Wikidata, ISNI, and other identifiers.&lt;/li>
&lt;li>We’re adding support for multiple names to support contributors whose names can be expressed in multiple alphabets, or who have aliases or nicknames.&lt;/li>
&lt;li>We’re changing &lt;code>surname&lt;/code> to &lt;code>family_name&lt;/code> and will be relaxing the requirement that all person names have a “surname” - a given name may be supplied on its own to support contributors who do not have family names.&lt;/li>
&lt;li>The current element for corporate/group authors, &lt;code>organisation&lt;/code>, will be replaced by &lt;code>collab&lt;/code> as the term “organisation” was widely confusing (we have a lot of affiliation info registered as group authors!), and the &lt;code>collab&lt;/code> section will also allow organisation identifiers.&lt;/li>
&lt;/ul>
&lt;p>Many of these updates align with how &lt;a href="https://jats.nlm.nih.gov/" target="_blank">JATS&lt;/a> supports contributors - I hope these changes will allow our members to supply robust contributor metadata without the burden of complicated conversions.&lt;/p>
&lt;p>I’m also including the proposed changes to support data citation and typing of citations. Additionally, we’ll be adding support for members who want to:&lt;/p>
&lt;ul>
&lt;li>supply &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/grants/">Grant IDs&lt;/a> in their metadata records&lt;/li>
&lt;li>register &lt;a href="https://www.crossref.org/working-groups/conferences-projects/">identifiers for conferences&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>A draft 5.0 xsd file is available in a branch of our &lt;a href="https://gitlab.com/crossref/schema/-/blob/5.0/5.0.update.md" target="_blank">GitLab schema repository&lt;/a> with the details of the planned updates, and more robust documentation and examples are forthcoming.&lt;/p>
&lt;h2 id="implementation-plans">Implementation plans&lt;/h2>
&lt;p>My house was built in 1890 and there are always surprises whenever we need to fix or renovate anything. Our system is just as old in technology years - it’s been chugging along since the aughts. This means while we don’t think it’s powered by knob-and-tube wiring, we can’t be sure until we open up the walls. We want to implement our plans (in fact we want to do more!) but if we run into any big blockers or crucial issues, we may roll out the changes over several iterations. These updates are fairly conservative and I remain optimistic we’ll be able to implement them as-is. Our update will help us build a foundation for future updates, allowing us to continuously evolve our schema as we move forward.&lt;/p>
&lt;p>Some of you are understandably worried about our implementation schedule and backwards incompatibility. We’re aware that changes are expensive and inconvenient, and making them on our schedule doesn’t always work for your schedule. That’s why we’ve sustained 12+ versions of our schema over the past 12 years. We won’t be mandating a change any time soon, and definitely won’t do so without sufficient warning and community involvement. In the future we’ll need to make a sustained effort to retire older schema, but now isn’t the time for that.&lt;/p>
&lt;p>We intend to commence work in Q2 but won’t have a firm timeline for a few more weeks. I will be providing regular updates as we progress, and will be asking for volunteers to test the updates when we’re ready. I’ll also be sharing more documentation and information about how the changes will be represented in our metadata outputs.&lt;/p>
&lt;h2 id="have-more-to-say">Have more to say?&lt;/h2>
&lt;p>Our feedback period has finished and we do plan to implement the changes as described, but if you have opinions, please &lt;a href="mailto:feedback@crossref.org">share them&lt;/a>.&lt;/p></description></item><item><title>Encouraging even greater reporting of corrections and retractions</title><link>https://www.crossref.org/blog/encouraging-even-greater-reporting-of-corrections-and-retractions/</link><pubDate>Mon, 30 Mar 2020 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/encouraging-even-greater-reporting-of-corrections-and-retractions/</guid><description>&lt;p>TL;DR: We no longer charge fees for members to participate in Crossmark, and we encourage all our members to register metadata about corrections and retractions - even if you can’t yet add the Crossmark button and pop-up box to your landing pages or PDFs.&lt;/p>
&lt;p>&amp;ndash;&lt;/p>
&lt;p>Research doesn’t stand still; even after publication, articles can be updated with supplementary data or corrections. When research outputs are is changed in this way the publisher should report and link it, so that those accessing and citing the content know if it’s been updated, corrected or even retracted. This also emphasizes the member&amp;rsquo;s commitment to the ongoing stewardship of research outputs.&lt;/p>
&lt;p>Many people find and store articles to read later, either as PDFs on their laptop or on one of any number of reference management systems - when they come back to read and cite these articles, possibly many months later, they want to know if the version they have is current or not.&lt;/p>
&lt;h3 id="removing-crossmark-fees">Removing Crossmark fees&lt;/h3>
&lt;p>To encourage even wider adoption of Crossmark, and to promote best practice around better reporting of corrections and retractions, we will no longer be charging additional fees for our &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> service. This change applies to all Crossmark metadata registered from 1 January 2020. All members are now encouraged to add Crossmark metadata and add the Crossmark button and pop-up box to their publications - and you can do so as part of your regular content registration.&lt;/p>
&lt;h3 id="richer-metadata-gives-important-context">Richer metadata gives important context&lt;/h3>
&lt;p>We know that there are many more corrections and retractions that are not yet being registered, and to address this, we are now asking all of our members to start registering metadata for significant updates to your publications, even if you don&amp;rsquo;t implement the Crossmark button and pop-up box on your content. Remember, anyone can access the Crossmark metadata through our public REST API, and start using it straight away - even if you&amp;rsquo;re not ready to implement the Crossmark button.&lt;/p>
&lt;p>Check out &lt;a href="https://support.crossref.org/hc/en-us/articles/115000108983-Getting-started" target="_blank">how to get started&lt;/a>; if you only want to deposit metadata, follow steps one through four. If you also want to add the Crossmark button and pop-up box to your web pages/PDFs so that readers can easily see when content has changed, then also follow the rest of the steps.&lt;/p>
&lt;h3 id="crossmark">Crossmark&lt;/h3>
&lt;p>We launched Crossmark in 2012 to raise awareness of these critical changes, by asking Crossref members to:&lt;/p>
&lt;ul>
&lt;li>record such updates in your metadata, either as part of your regular &lt;a href="https://support.crossref.org/hc/en-us/articles/214169586-Metadata-deposit-schema" target="_blank">Crossref metadata deposit&lt;/a>, or &lt;a href="https://support.crossref.org/hc/en-us/articles/214002366-Adding-metadata-to-an-existing-record-resource-deposits-" target="_blank">deposited as stand-alone data&lt;/a> for back-year records&lt;/li>
&lt;li>help readers find out about the changes by placing a &lt;a href="https://www.crossref.org/get-started/crossmark/">Crossmark button&lt;/a> and pop-up box (which is consistent across all members making it recognizable to readers) on your landing pages and in PDFs&lt;/li>
&lt;/ul>
&lt;p>Members can also use Crossmark to register additional metadata about content, giving further context and background for the reader. These metadata appear in the “More Information” section of the Crossmark box. 7 million DOIs have some additional metadata, the most common being copyright statements, publication history, and peer review methods.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/crossmarkfees_blog_updates.png" width="100%">
&lt;/figure>
&lt;p>Anyone can access the Crossmark metadata through our public REST API, providing a myriad of opportunities for integration with other systems, and analysis of changes to the scholarly record.&lt;/p>
&lt;h3 id="who-has-implemented-crossmark">Who has implemented Crossmark?&lt;/h3>
&lt;p>440 Crossref members have implemented Crossmark to date. 11.4 million DOIs have some Crossmark metadata.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: left">&lt;/th>
&lt;th style="text-align: right">Total DOIs&lt;/th>
&lt;th style="text-align: right">DOIs with Crossmark metadata&lt;/th>
&lt;th style="text-align: right">%&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: left">Journal articles&lt;/td>
&lt;td style="text-align: right">80,862,460&lt;/td>
&lt;td style="text-align: right">10,155,340&lt;/td>
&lt;td style="text-align: right">12.56%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Book chapters&lt;/td>
&lt;td style="text-align: right">14,040,646&lt;/td>
&lt;td style="text-align: right">792,953&lt;/td>
&lt;td style="text-align: right">5.65%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Conference Papers&lt;/td>
&lt;td style="text-align: right">6,175,733&lt;/td>
&lt;td style="text-align: right">457,237&lt;/td>
&lt;td style="text-align: right">7.40%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Datasets&lt;/td>
&lt;td style="text-align: right">1,862,852&lt;/td>
&lt;td style="text-align: right">19,206&lt;/td>
&lt;td style="text-align: right">1.03%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Books&lt;/td>
&lt;td style="text-align: right">753,298&lt;/td>
&lt;td style="text-align: right">239&lt;/td>
&lt;td style="text-align: right">0.03%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Monographs&lt;/td>
&lt;td style="text-align: right">469,333&lt;/td>
&lt;td style="text-align: right">23&lt;/td>
&lt;td style="text-align: right">0.00%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Of those, about 130,000 contain an update:&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/crossmarkfees_blog_graph.png" width="60%">
&lt;/figure>
&lt;br />
You can see which members or journals have implemented Crossmark by viewing the relevant Crossref &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Report&lt;/a>.&lt;/p></description></item><item><title>Events got the better of us</title><link>https://www.crossref.org/blog/events-got-the-better-of-us/</link><pubDate>Fri, 27 Mar 2020 00:00:00 +0000</pubDate><author>Bryan Vickery</author><guid>https://www.crossref.org/blog/events-got-the-better-of-us/</guid><description>&lt;p>Publisher metadata is one side of the story surrounding research outputs, but conversations, connections and activities that build further around scholarly research, takes place all over the web. We built &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> to capture, record and make available these &amp;lsquo;Events&amp;rsquo; –– providing open, transparent, and traceable information about the provenance and context of every Event. Events are comments, links, shares, bookmarks, references, etc.&lt;/p>
&lt;p>In September 2018 we said &lt;a href="https://doi.org/10.64000/q9s4t-vjt21" target="_blank">Event Data&lt;/a> was &amp;lsquo;production ready.&amp;rsquo; What we meant was development of the service had reached a point where we expected no further major changes to the code, and we encouraged you to use it. What normally would have followed was a detailed handover to our operations team, for monitoring and performance management, and for Product Management to expand Event Data by adding new Crossref member domains and evaluating additional event sources.&lt;/p>
&lt;h2 id="why-so-quiet">Why so quiet?&lt;/h2>
&lt;p>But many things changed on the &lt;a href="https://doi.org/10.64000/eqnnm-c0659" target="_blank">staff front&lt;/a>, meaning 2019 was a year of reinvention for the Technical and Product teams and of critical knowledge sharing and learning –– Event Data had to take a back seat as we focused resources on other key projects (more on that later). From a technical perspective, we&amp;rsquo;ve found the Elasticsearch index is not performing well and the approach taken to specifically support data citations through &lt;a href="https://documentation.ardc.edu.au/cpg/scholix" target="_blank">Scholix&lt;/a> has not really scaled.&lt;/p>
&lt;p>When things go wrong, whether in ways you can or can&amp;rsquo;t anticipate, the most important thing is communication –– in dealing with the challenges we forgot to do that. We understand how frustrating that can be and we&amp;rsquo;re extremely sorry to have gone so quiet.&lt;/p>
&lt;h2 id="so-where-are-we-today">So, where are we today?&lt;/h2>
&lt;p>Event Data is important to us and clearly important to you too as you&amp;rsquo;ve contacted us about your use-cases and the reliability of the service. Event Data remains &lt;a href="https://www.eventdata.crossref.org/guide/" target="_blank">available&lt;/a> and you&amp;rsquo;re welcome to use it, but you should expect instability to continue and be aware that it does not find events for &lt;a href="https://www.eventdata.crossref.org/guide/data/ids-and-urls/#dois-for-objects" target="_blank">DOIs/domains of our newer members&lt;/a> (who joined Crossref since 2019) –– so we&amp;rsquo;re conscious it might be hard to say whether it&amp;rsquo;s a good fit for your project at this point.&lt;/p>
&lt;h2 id="what-are-we-doing">What are we doing?&lt;/h2>
&lt;p>We have brought in additional expert Elasticsearch resources to assist with a separate project to migrate our REST API from SOLR to Elasticsearch. We&amp;rsquo;re making fantastic progress on this. As soon as we&amp;rsquo;re confident we can make this switch, we will move those same Elasticsearch resources to shoring up Event Data. The REST API takes priority over Event Data because we need to add support for important new record types (like research grants) that aren&amp;rsquo;t yet available via the API.&lt;/p>
&lt;p>We&amp;rsquo;re also concluding the process of hiring two new Product Managers which means we&amp;rsquo;ll be in a position to assign someone to head up the product management of Event Data. When we do return to Event Data in the coming months, our initial priority will be increased support for data citation and Scholix. If that means radical changes to the rest of the service, we&amp;rsquo;ll let you know. &lt;/p>
&lt;h2 id="opening-up-the-discussion">Opening up the discussion&lt;/h2>
&lt;p>We will have more news on Event Data in mid-2020. We&amp;rsquo;d love you to join the &lt;a href="https://community.crossref.org/c/event-data/17" target="_blank">Crossref Community Forum&lt;/a>; we&amp;rsquo;ve created a new Category for Event Data where you can post details of how you are using, or plan to use Event Data; post questions to the group; suggestions for future development and provide general feedback on the Event Data service.&lt;/p></description></item><item><title>Metadata Manager Update</title><link>https://www.crossref.org/blog/metadata-manager-update/</link><pubDate>Tue, 24 Mar 2020 00:00:00 +0000</pubDate><author>Bryan Vickery</author><guid>https://www.crossref.org/blog/metadata-manager-update/</guid><description>&lt;p>At Crossref, we&amp;rsquo;re committed to providing a simple, usable, efficient and scalable web-based tool for registering content by manually making deposits of, and updates to, metadata records. Last year we launched Metadata Manager in beta for journal deposits to help us explore this further. Since then, many members have used the tool and helped us better understand their needs.&lt;/p>
&lt;p>What we&amp;rsquo;ve learned has made us realize how useful such a tool can be to both large and small publishers, but also that the approach we took with Metadata Manager needs to be changed - it&amp;rsquo;s not flexible enough to easily add other record types, like books/book chapters, or to include any changes we may make to our input schema.&lt;/p>
&lt;p>With that in mind, we&amp;rsquo;re pausing development on Metadata Manager to allow us to properly evaluate what we&amp;rsquo;ve learned. If you&amp;rsquo;re currently using Metadata Manager for journal deposits without any problems, please do continue - you&amp;rsquo;re helping us learn a lot! But if you haven&amp;rsquo;t used Metadata Manager before, or are having problems, please:&lt;/p>
&lt;ul>
&lt;li>use our existing &lt;a href="http://www.crossref.org/webDeposit" target="_blank">Web Deposit Form&lt;/a> instead, or&lt;/li>
&lt;li>upload XML directly through the &lt;a href="https://doi.crossref.org/" target="_blank">deposit system admin interface&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>We won&amp;rsquo;t be fixing bugs in Metadata Manager, except for providing any essential security updates. Of course, if you still need help please read our &lt;a href="https://support.crossref.org/hc/en-us/categories/201752243-Registering-content" target="_blank">Content Registration help pages&lt;/a>, or contact the &lt;a href="mailto:support@crossref.org">Support team&lt;/a>.&lt;/p>
&lt;p>Metadata Manager&amp;rsquo;s features will be reimagined as part of our planned Member Center (working title, subject to change) project, where we will start to bring together all business and technical information for our members, service providers and metadata users. The Member Center will be the heart of our strategy to make it easier for you to work with Crossref to:&lt;/p>
&lt;ul>
&lt;li>register and update metadata&lt;/li>
&lt;li>view, update and transfer titles&lt;/li>
&lt;li>visualize your activity/participation and act on problems with metadata&lt;/li>
&lt;li>understand your bills and invoices&lt;/li>
&lt;li>manage your users and service providers and their access and entitlements&lt;/li>
&lt;li>and more&lt;/li>
&lt;/ul>
&lt;p>We&amp;rsquo;re in the early stages of planning for the Member Center and will be seeking feedback from members, service providers and metadata users in the coming months.&lt;/p></description></item><item><title>Double trouble with DOIs</title><link>https://www.crossref.org/blog/double-trouble-with-dois/</link><pubDate>Tue, 10 Mar 2020 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/double-trouble-with-dois/</guid><description>&lt;p>Detective Matcher stopped abruptly behind the corner of a short building, praying that his loud heartbeat doesn&amp;rsquo;t give up his presence. This missing DOI case was unlike any other before, keeping him awake for many seconds already. It took a great effort and a good amount of help from his clever assistant Fuzzy Comparison to make sense of the sparse clues provided by Miss Unstructured Reference, an elegant young lady with a shy smile, who begged him to take up this case at any cost.&lt;/p>
&lt;p>The final confrontation was about to happen, the detective could feel it, and his intuition rarely misled him in the past. He was observing DOI &lt;a href="https://doi.org/10.2307/257306" target="_blank">&lt;code>10.2307/257306&lt;/code>&lt;/a>, which matched Miss Reference&amp;rsquo;s description very well. So far, there was no indication that DOI had any idea he was being observed. He was leaning on a wall across the street in a seemingly nonchalant way, just about to put out his cigarette. Empty dark streets and slowly falling snow together created an excellent opportunity to capture the fugitive.&lt;/p>
&lt;p>Suddenly, Matcher heard a faint rustling sound. Out of nowhere, another shady figure, looking very much like &lt;a href="https://doi.org/10.5465/amr.1982.4285592" target="_blank">&lt;code>10.5465/amr.1982.4285592&lt;/code>&lt;/a>, appeared in front of the detective, crossed the street and started running away. Matcher couldn&amp;rsquo;t believe his eyes. These two DOIs had identical authors, year and title. They were even wearing identical volume and issue! He quickly noticed minor differences: slight alteration in the journal title and lack of the second page number in one of the DOIs, but this was likely just a random mutation. How could have he missed the other DOI? And more importantly, which of them was the one worried Miss Reference simply couldn&amp;rsquo;t live without?&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/duplicates_cover.jpg">
&lt;/figure>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>Crossref metadata contains duplicates, i.e. items with different DOIs and identical (or almost identical) bibliographic metadata. This often happens when there is more than one DOI pointing to the same object. In some cases, but not all of them, one of the DOIs is explicitly marked as an alias of the other DOI.&lt;/li>
&lt;li>In this blog post, I analyze those duplicates, that are not marked with an alias relation. &lt;strong>The analysis shows that the problem exists, but is not big&lt;/strong>.&lt;/li>
&lt;li>Among 524,496 DOIs tested in the analysis, 4,240 (0.8%) were flagged as having non-aliased duplicates. I divided those duplicates into two categories:
&lt;ul>
&lt;li>&lt;strong>Self-duplicate&lt;/strong> is a duplicate deposited by the same member as the other DOI, there were 3,603 (85%) of them.&lt;/li>
&lt;li>&lt;strong>Other-duplicate&lt;/strong> is a duplicate deposited by a different member than the other DOI&amp;rsquo;s depositor, there were only 637 (15%) of them.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>I used three member-level metrics to estimate the volume of duplicates deposited by a given member:
&lt;ul>
&lt;li>&lt;strong>Self-duplicate index&lt;/strong> is the fraction of self-duplicates in member&amp;rsquo;s DOIs: on average 0.67%.&lt;/li>
&lt;li>&lt;strong>Other-duplicate&lt;/strong> index is the fraction of other-duplicates in a member&amp;rsquo;s DOIs: on average 0.13%.&lt;/li>
&lt;li>&lt;strong>Global other-duplicate index&lt;/strong> is the fraction of globally detected other-duplicates involving a given member: on average 0.34%.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>In an ideal world, the relationship between research outputs and DOIs is one-to-one: every research output has exactly one DOI assigned and each DOI points to exactly one research output.&lt;/p>
&lt;p>As we all know too well, we do not live in a perfect world, and this one-to-one relationship is also sometimes violated. One way to violate it is to assign more than one DOI to the same object. This can cause problems.&lt;/p>
&lt;p>First of all, if there are two DOIs referring to the same object, eventually they both might end up in different systems and datasets. As a result, merging data between data sources becomes an issue, because we no longer can rely on comparing the DOI strings only.&lt;/p>
&lt;p>Reference matching algorithms will also be confused when they encounter more than one DOI matching the input reference. They might end up assigning one DOI from the matching ones at random, or not assigning any DOI at all.&lt;/p>
&lt;p>And finally, more than one DOI assigned to one object is hugely problematic for document-level metrics such as citation counts, and eventually affects h-indexes and impact factors. In practice, metrics are typically calculated per DOI, so when there are two DOIs pointing to one document, the citation count might be split between them, effectively lowering the count, and making every academic author&amp;rsquo;s biggest nightmare come true.&lt;/p>
&lt;p>It seems we shouldn&amp;rsquo;t simply cover our eyes and pretend this problem does not exist. So what are we doing at Crossref to make the situation better?&lt;/p>
&lt;ul>
&lt;li>It is possible for our members to explicitly mark a DOI as an alias of another DOI, if it was deposited by mistake. This does not remove the problem, but at least allows metadata consumers to access and use this information.&lt;/li>
&lt;li>Whenever a DOI is registered or updated in Crossref, we automatically compare its metadata to the metadata of existing DOIs. If the metadata is too similar to the metadata of another DOI, this information is sent to the member and they have a chance to modify the metadata as they see fit.&lt;/li>
&lt;/ul>
&lt;p>Despite these efforts, we still see duplicates that are not explained by anything in the metadata. In this blog post, I will try to understand this problem better and assess how big it is. I also define three member-level metrics that can show how much a given member contributes to duplicates in the system and can flag members with unusually high fractions of duplicates.&lt;/p>
&lt;h2 id="gathering-the-data">Gathering the data&lt;/h2>
&lt;ol>
&lt;li>The data for this analysis was collected in the following way:&lt;/li>
&lt;li>Only journal articles were considered in the analysis.&lt;/li>
&lt;li>Only members with at least 5,000 journal article DOIs were considered in the analysis.&lt;/li>
&lt;li>For each member, a random sample of 1,000 journal article DOIs was selected.&lt;/li>
&lt;li>DOIs with no title, title shorter than 20 characters or shorter than 3 words were removed from each sample. This was done because items with short titles typically result in incorrectly flagged duplicates (false positives).&lt;/li>
&lt;li>For each remaining DOI in the sample, a simple string representation was generated. This representation is a concatenation of the following fields: authors, title, container-title, volume, issue, page, published date.&lt;/li>
&lt;li>This string representation was used as &lt;code>query.bibliographic&lt;/code> in &lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">Crossref&amp;rsquo;s REST API&lt;/a> and the resulting item list was examined.&lt;/li>
&lt;li>If the original DOI came back as the first or the second hit, the relevance score difference between the first two hits is less than 1, they are both journal articles, and there is no relation (alias or otherwise) between them, the other one of the two is considered a duplicate of the original DOI. The score difference threshold was chosen through a manual examination of a number of cases. Most detected duplicates came back scored identically.&lt;/li>
&lt;/ol>
&lt;h2 id="overall-results">Overall results&lt;/h2>
&lt;p>In total, I tested 590 members and 524,496 DOIs. Among them, 4,240 DOIs (0.8%) were flagged as duplicates of other DOIs. This shows the problem exists, but is not huge.&lt;/p>
&lt;p>I also analyzed separately two categories of duplicates:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>self-duplicates&lt;/strong> are two DOIs with (almost) identical metadata, deposited by the same member,&lt;/li>
&lt;li>&lt;strong>other-duplicates&lt;/strong> are two DOIs with (almost) identical metadata, deposited by two different members.&lt;/li>
&lt;/ul>
&lt;p>Self-duplicates are more common: 3,603 (85%) of all detected duplicates are self-duplicates, and only 637 (15%) are other-duplicates. This is also good news: self-duplicates involve one member only, so they are easier to handle.&lt;/p>
&lt;h2 id="self-duplicates">Self-duplicates&lt;/h2>
&lt;p>To explore the levels of self-duplicates among members, I used a custom member-level metric called self-duplicate index. &lt;strong>Self-duplicate index&lt;/strong> is the fraction of self-duplicates among the member&amp;rsquo;s DOIs, in this case calculated over a sample.&lt;/p>
&lt;p>On average, members have a very small self-duplicate index of 0.67%. In addition, in the samples of 44% of analyzed members no self-duplicates were found. The histogram shows the skewness of the distribution:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/duplicates_distr_self.png" width="500px">
&lt;/figure>
&lt;p>As we can see in the distribution, there are only a few members with high self-duplicate index. The table shows all members with the self-duplicate higher than 10%:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Name&lt;/th>
&lt;th style="text-align: right">Total DOIs&lt;/th>
&lt;th style="text-align: right">Sample size&lt;/th>
&lt;th style="text-align: right">Self-duplicate index&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>University of California Press&lt;/td>
&lt;td style="text-align: right">129,741&lt;/td>
&lt;td style="text-align: right">798&lt;/td>
&lt;td style="text-align: right">36%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Inderscience Publishers&lt;/td>
&lt;td style="text-align: right">127,729&lt;/td>
&lt;td style="text-align: right">998&lt;/td>
&lt;td style="text-align: right">29%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Society of Hematology&lt;/td>
&lt;td style="text-align: right">137,124&lt;/td>
&lt;td style="text-align: right">990&lt;/td>
&lt;td style="text-align: right">24%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Pro Reitoria de Pesquisa, Pos Graduacao e Inovacao - UFF&lt;/td>
&lt;td style="text-align: right">7,756&lt;/td>
&lt;td style="text-align: right">919&lt;/td>
&lt;td style="text-align: right">19%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Diabetes Association&lt;/td>
&lt;td style="text-align: right">49,536&lt;/td>
&lt;td style="text-align: right">946&lt;/td>
&lt;td style="text-align: right">18%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;h2 id="other-duplicates">Other-duplicates&lt;/h2>
&lt;p>&lt;strong>Other-duplicate index&lt;/strong> is the fraction of other duplicates among the member&amp;rsquo;s DOIs, in this case calculated from a sample.&lt;/p>
&lt;p>On average, members have a very low other-duplicate index of only 0.13%. What is more, 89% members have no other-duplicates in the sample, and the distribution is even more skewed than in the case of self-duplicates:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/duplicates_distr_other.png" width="500px">
&lt;/figure>
&lt;p>Here is the list of all members with more than 2% of other-duplicates in the sample:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Name&lt;/th>
&lt;th style="text-align: right">Total DOIs&lt;/th>
&lt;th style="text-align: right">Sample size&lt;/th>
&lt;th style="text-align: right">Other-duplicate index&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>American Bryological and Lichenological Society&lt;/td>
&lt;td style="text-align: right">5,593&lt;/td>
&lt;td style="text-align: right">844&lt;/td>
&lt;td style="text-align: right">41%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Maney Publishing&lt;/td>
&lt;td style="text-align: right">15,342&lt;/td>
&lt;td style="text-align: right">832&lt;/td>
&lt;td style="text-align: right">6%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>JSTOR&lt;/td>
&lt;td style="text-align: right">1,612,174&lt;/td>
&lt;td style="text-align: right">864&lt;/td>
&lt;td style="text-align: right">4%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Mathematical Society (AMS)&lt;/td>
&lt;td style="text-align: right">83,015&lt;/td>
&lt;td style="text-align: right">844&lt;/td>
&lt;td style="text-align: right">4%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;em>American Bryological and Lichenological Society&lt;/em> is a clear outlier with 41% of their sample flagged as duplicates. Interestingly, all those duplicates come from one other member only (JSTOR) and JSTOR was the first to deposit them.&lt;/p>
&lt;p>Similarly, all other-duplicates detected in the &lt;em>American Mathematical Society&lt;/em>&amp;rsquo;s sample are shared with JSTOR, and JSTOR was the first to deposit them.&lt;/p>
&lt;p>&lt;em>Maney Publishing&lt;/em>&amp;rsquo;s 51 other-duplicates are all shared with a member not listed in this table: Informa UK Limited.&lt;/p>
&lt;p>&lt;em>JSTOR&lt;/em> is the only member in this table, whose 36 other-duplicates are shared with multiple (8) members.&lt;/p>
&lt;p>Another interesting observation is that the members in this table (apart from JSTOR) are rather small or medium, in terms of total DOIs registered by them. It is also worrying that Informa UK Limited, a member that shares 51 other-duplicates flagged in Maney Publishing&amp;rsquo;s sample, was not flagged by this index. The reason might be differences in the overall number of registered DOIs: two members that deposited the same number of other-duplicates, but have different overall numbers of registered DOIs, will have different other-duplicate indexes.&lt;/p>
&lt;p>To address this issue, I looked at a third index called global other-duplicate index. &lt;strong>Global other-duplicate index&lt;/strong> is the fraction of globally detected other-duplicates involving a given member.&lt;/p>
&lt;p>Global other-duplicate index has a useful interpretation: it tells us how much the overall number of other-duplicates would drop, if the given member resolved all its other-duplicates (for example by setting appropriate relations or correcting the metadata so that it is no longer so similar).&lt;/p>
&lt;p>Here is the list of members with global-duplicate index higher than 2%:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Name&lt;/th>
&lt;th style="text-align: right">Total DOIs&lt;/th>
&lt;th style="text-align: right">Global other-duplicate index&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>JSTOR&lt;/td>
&lt;td style="text-align: right">1,612,174&lt;/td>
&lt;td style="text-align: right">69%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Bryological and Lichenological Society&lt;/td>
&lt;td style="text-align: right">5,593&lt;/td>
&lt;td style="text-align: right">54%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Informa UK Limited&lt;/td>
&lt;td style="text-align: right">4,275,507&lt;/td>
&lt;td style="text-align: right">15%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Maney Publishing&lt;/td>
&lt;td style="text-align: right">15,342&lt;/td>
&lt;td style="text-align: right">8%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>American Mathematical Society (AMS)&lt;/td>
&lt;td style="text-align: right">83,015&lt;/td>
&lt;td style="text-align: right">6%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Project Muse&lt;/td>
&lt;td style="text-align: right">326,300&lt;/td>
&lt;td style="text-align: right">5%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Wiley&lt;/td>
&lt;td style="text-align: right">8,003,815&lt;/td>
&lt;td style="text-align: right">3%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Elsevier BV&lt;/td>
&lt;td style="text-align: right">16,268,943&lt;/td>
&lt;td style="text-align: right">3%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Liverpool University Press&lt;/td>
&lt;td style="text-align: right">31,870&lt;/td>
&lt;td style="text-align: right">3%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Cambridge University Press (CUP)&lt;/td>
&lt;td style="text-align: right">1,621,713&lt;/td>
&lt;td style="text-align: right">2%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Ovid Technologies (Wolters Kluwer Health)&lt;/td>
&lt;td style="text-align: right">2,152,723&lt;/td>
&lt;td style="text-align: right">2%&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>University of Toronto Press Inc. (UTPress)&lt;/td>
&lt;td style="text-align: right">46,778&lt;/td>
&lt;td style="text-align: right">2%&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Note that the values add up to more than 100%. This is because in every other-duplicate there are two members involved, so the involvement adds up to 200%.&lt;/p>
&lt;p>As we can see, all the members from the previous table are in this one as well. Apart from them, however, this index flagged several large members. Among them, Informa UK Limited, that was missing from the previous table.&lt;/p>
&lt;p>All the indexes defined here are useful in identifying members that contribute a lot of duplicates to the Crossref metadata. They can be used to help to clean up the metadata, and also to monitor the situation in the future.&lt;/p>
&lt;h2 id="limitations">Limitations&lt;/h2>
&lt;p>It is important to remember that index values presented here were calculated on a single sample of DOIs drawn for a given member. The values would be different if a different sample was used, and so they shouldn&amp;rsquo;t be treated as exact numbers.&lt;/p>
&lt;p>The tables include members with the index exceeding a certain threshold, chosen arbitrarily, for illustrative purposes. Different runs with different samples could result in different members being included in the tables, especially in their lower parts.&lt;/p>
&lt;p>To obtain more stable values of indexes, multiple samples could be used. Alternatively, in the case of smaller members, exact values could be calculated from all their DOIs.&lt;/p></description></item><item><title>Can you help us to launch Distributed Usage Logging?</title><link>https://www.crossref.org/blog/can-you-help-us-to-launch-distributed-usage-logging/</link><pubDate>Mon, 02 Mar 2020 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/can-you-help-us-to-launch-distributed-usage-logging/</guid><description>&lt;p>Update: Deadline extended to 23:59 (UTC) 13th March 2020.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/community/project-dul/">Distributed Usage Logging&lt;/a> (DUL) allows publishers to capture traditional usage activity related to their content that happens on sites other than their own so they can provide reports of “total usage”, for example to subscribing institutions, regardless of where that usage happens.&lt;/p>
&lt;p>We are looking for a consultant to take the lead with DUL outreach, promoting the service and its benefits in order to solicit participation from publishers (receivers) and content-hosting platforms/scholarly collaboration networks (senders).&lt;/p>
&lt;p>Crossref provides the infrastructure for DUL. The call for participation is being led by COUNTER and the selected consultant will be representing COUNTER, with additional support from Crossref&lt;/p>
&lt;p>If you are interested in this opportunity, please download the &lt;a href="https://www.projectcounter.org/wp-content/uploads/2020/02/FINAL-RFI_-Distributed-Usage-Logging-DUL-Outreach-Consultant-1.pdf" target="_blank">request for information&lt;/a> (RFI).&lt;/p>
&lt;p>The RFI response deadline is 23:59 (UTC) 13 March 2020.&lt;/p></description></item><item><title>Crossref metadata for bibliometrics</title><link>https://www.crossref.org/blog/crossref-metadata-for-bibliometrics/</link><pubDate>Fri, 21 Feb 2020 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/crossref-metadata-for-bibliometrics/</guid><description>&lt;p>Our paper, &lt;a href="https://doi.org/10.1162/qss_a_00022" target="_blank">Crossref: the sustainable source of community-owned scholarly metadata&lt;/a>, was recently published in &lt;a href="https://www.mitpressjournals.org/loi/qss" target="_blank">&lt;em>Quantitative Science Studies&lt;/em> (MIT Press)&lt;/a>. The paper describes the scholarly metadata collected and made available by Crossref, as well as its importance in the scholarly research ecosystem.&lt;/p>
&lt;p>Containing over 106 million records and expanding at an average rate of 11% a year, Crossref&amp;rsquo;s metadata has become one of the major sources of scholarly data for publishers, authors, librarians, funders, and researchers. The metadata set consists of 13 record types, including not only traditional types, such as journals and conference papers, but also data sets, reports, preprints, peer reviews, and grants. The metadata is not limited to basic publication metadata, but can also include abstracts and links to full text, funding and license information, citation links, and the information about corrections, updates, retractions, etc. This scale and breadth make Crossref a valuable source for research in scientometrics, including measuring the growth and impact of science and understanding new trends in scholarly communications. The metadata is available through a number of APIs, including REST API and OAI-PMH.&lt;/p>
&lt;p>In the paper, we describe the kind of metadata that Crossref provides and how it is collected and curated. We also look at Crossref&amp;rsquo;s role in the research ecosystem and trends in metadata curation over the years, including the evolution of its citation data provision. We summarize the research that used Crossref&amp;rsquo;s metadata and describe plans that will improve metadata quality and retrieval in the future.&lt;/p></description></item><item><title>Leaving Crossref</title><link>https://www.crossref.org/blog/leaving-crossref/</link><pubDate>Fri, 14 Feb 2020 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/leaving-crossref/</guid><description>&lt;h2 id="where-does-the-time-go">Where does the time go&amp;hellip;&lt;/h2>
&lt;p>In my &lt;a href="https://doi.org/10.64000/j6sav-qm45" target="_blank">blog post on January 14th&lt;/a> about Crossref’s 20th anniversary I said, “The one constant in Crossref’s 20 years has been change”. It’s true that there has been constant change, but there has been another constant at Crossref –– me (and DOIs, to be fair). I started as Crossref’s first employee and Executive Director on February 1st, 2000, so I just marked my &lt;a href="https://www.crossref.org/news/2000-02-02-journal-reference-linking-service-names-executive-director-board-of-directors-new-members-and-a-go-live-timetable/">20th anniversary with the organisation&lt;/a>.&lt;/p>
&lt;p>This milestone prompted me to reflect on where I am and where I’m heading. After 20 years leading the organisation, I’ve decided to leave Crossref. It’s time for a new challenge. I’m still very committed to the mission and very proud of my time at Crossref, the culture we’ve created and what the organisation has achieved. It’s been an honor serving as Executive Director and a pleasure working with so many great people over the years. And to be clear –– I’m not ill, being pushed or having a midlife crisis (yet).&lt;/p>
&lt;p>It’s a difficult and emotional decision but I think the transition can be positive for me, the staff, the board, and the organisation. I’ll be working with the Crossref board, Chair, Treasurer and staff on the transition –– the plan is for me to be around through September or October to enable the recruitment and handover to a new Executive Director. There will be more information about the transition and recruitment process after the Crossref board meeting March 11-12 in London.&lt;/p>
&lt;p>Crossref has a bright future and many opportunities to do new things. Crossref provides essential, open scholarly infrastructure and services that benefit its members and the wider scholarly research ecosystem –– and we’ve got a lot of interesting things in development and ambitious plans. To anyone who might be interested in being Crossref’s next Executive Director, I can honestly say it is fantastic, challenging, fun, and very fulfilling –– that’s why I’ve done it for 20 years.&lt;/p>
&lt;p>What’s next for me? I don’t know but it’s something I’ll be thinking about over the coming months. I do know that working for a mission driven organisation and staying involved with scholarly communications and research –– a fascinating and worthy field –– will be top of my list.&lt;/p>
&lt;p>Anyway - it’s back to work and full steam ahead for Crossref!&lt;/p></description></item><item><title>Using the Crossref REST API (with Open Ukrainian Citation Index)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api-with-open-ukrainian-citation-index/</link><pubDate>Wed, 05 Feb 2020 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api-with-open-ukrainian-citation-index/</guid><description>&lt;p>Over the past few years, I&amp;rsquo;ve been really interested in seeing the breadth of uses that the research community is finding for the Crossref REST API. When we ran Crossref LIVE Kyiv in March 2019, Serhii Nazarovets joined us to present his plans for the Open Ukrainian Citation Index, an initiative he explains below.&lt;/p>
&lt;p>But first an introduction to Serhii and his colleague Tetiana Borysova.&lt;/p>
&lt;p>Serhii Nazarovets is a Deputy Director for Research at the State Scientific and Technical Library of Ukraine. Serhii has a Ph.D. in Social Communication Science. His research interests lie in the area of scientometrics and library science. Serhii is the Associate Editor for DOAJ (&lt;a href="http://www.doaj.org/" target="_blank">www.doaj.org&lt;/a>) and the Regional Editor for E-LIS (Eprints in Library and Information Science). Serhii has worked in different scientific libraries of Ukraine for more than 10 years. Tetiana Borysova is a Senior Researcher at the State Scientific and Technical Library of Ukraine. Her research interests are focused on topics such as research data management, journal management and scientometrics.&lt;/p>
&lt;h2 id="introducing-ouci">Introducing OUCI&lt;/h2>
&lt;p>OUCI (&lt;a href="http://ouci.dntb.gov.ua/en/" target="_blank">Open Ukrainian Citation Index&lt;/a>) is a new search engine and a citation database based on publication metadata from Crossref members.&lt;/p>
&lt;p>OUCI is intended to simplify the search of scientific publications, to attract the editors&amp;rsquo; attention to the problem of completeness and quality of the metadata of Ukrainian scholarly publications, and will allow bibliometricians to freely study the relations between authors and documents from various disciplines, in particular in the field of social sciences and humanities. OUCI is open for every user in the world without any restrictions.&lt;/p>
&lt;p>OUCI launched in November 2019. The project is being implemented by the &lt;a href="https://dntb.gov.ua/en/science" target="_blank">State Scientific and Technical Library of Ukraine&lt;/a> with the support of the Ministry of Education and Science of Ukraine.&lt;/p>
&lt;p>In Ukraine, we do not have a national citation database, and this significantly impedes the search and analysis of information about Ukrainian publications. According to preliminary estimates, more than 3,000 titles of scientific journals are currently published in Ukraine. At the same time, only around 100 Ukrainian journal titles are indexed in authoritative citation databases, such as Scopus and Web of Science Core Collection. Thus, researchers and managers lack this citation data to understand the impact of Ukrainian journals and their demand in the scientific communication system. Our approach is that OUCI database contains metadata from all publishers that use the Crossref&amp;rsquo;s &lt;a href="https://www.crossref.org/services/cited-by/">Cited-by&lt;/a> service and who support the &lt;a href="https://i4oc.org/" target="_blank">Initiative for Open Citations&lt;/a> by making the reference metadata they publish with Crossref openly available.&lt;/p>
&lt;h2 id="how-is-crossref-metadata-used-in-ouci">How is Crossref metadata used in OUCI?&lt;/h2>
&lt;p>A publication can only be indexed in OUCI if there is a DOI. At first glance, the idea of creating an index of national publications based on this condition may seem too optimistic. However, in January 2018, a new requirement was adopted by the &lt;a href="https://zakon.rada.gov.ua/laws/main/z0148-18" target="_blank">List of scientific publications of Ukraine&lt;/a> (a list of Ukrainian journals recognized by experts as qualitative for publishing their research results for a scientific degree), which listed a DOI as one of the requirements for inclusion. After that, the number of publishers who received the DOI prefix from Crossref has tripled, to 352 in November 2019.&lt;/p>
&lt;p>Another important feature of OUCI is that publishers have to use Crossref&amp;rsquo;s &lt;a href="https://www.crossref.org/services/cited-by/">Cited-by&lt;/a> service and support the &lt;a href="https://i4oc.org/" target="_blank">Initiative for Open Citations.&lt;/a> We are working to build a new fair infrastructure where everyone who is interested in the dissemination of scientific knowledge can present their publications to the community, develop expert judgment skills and access citations to explore the links between documents. The philosophy of the index is to use only open resources to fill it.&lt;/p>
&lt;p>In addition to standard filters from Crossref metadata (such as publisher, publication, type, year), OUCI offers to refine search results by:&lt;/p>
&lt;ul>
&lt;li>indexation in Web of Science and/or Scopus,&lt;/li>
&lt;li>journal category (A or B according to the List of scientific publications of Ukraine),&lt;/li>
&lt;li>the field of knowledge and scientific specialties (according to the Ukrainian legislation) and other aspects important to Ukrainian users characteristics.&lt;/li>
&lt;/ul>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/ouci_blog_filters.png"
alt="Figure 1: OUCI search and filter options" width="75%">&lt;figcaption>
&lt;p>Figure 1: OUCI search and filter options&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Beyond the ability to search articles, OUCI displays profiles for Ukrainian journals (the titles of these journals will include hyperlinks in the search results). Administrators can manage them, add and edit information about their journals: web-site, aims and scope, scientific fields of the journal according to the Ukrainian classification. Also, you can see some quantitative characteristics of journals: number of publications, number of citations, h-index, i10-index etc.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/ouci_blog_profiles.png"
alt="Figure 2: Display of journal information in OUCI" width="75%">&lt;figcaption>
&lt;p>Figure 2: Display of journal information in &lt;a href="http://ouci.dntb.gov.ua/en/editions/xmnGEm0L/" target="_blank">OUCI&lt;/a>&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>In addition, we have implemented an analytics module. Using the data about the number of articles and citations from Crossref, it allows users to analyze Ukrainian journals by field.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2020/ouci_blog_analysis.png"
alt="Figure 3: Publication and citation information" width="75%">&lt;figcaption>
&lt;p>Figure 3: Publication and citation information&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h2 id="what-are-the-future-plans-for-ouci">What are the future plans for OUCI?&lt;/h2>
&lt;p>In the near future, we plan to add:&lt;/p>
&lt;ul>
&lt;li>the ability to export search results for further analysis;&lt;/li>
&lt;li>integration with &lt;a href="https://unpaywall.org/" target="_blank">Unpaywall&lt;/a>;&lt;/li>
&lt;li>alternative metrics from &lt;a href="https://www.crossref.org/services/event-data/terms/">Crossref Event Data&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>In the ideal future for our index, every Ukrainian article will be registered with Crossref and have open references. We plan to promote the importance of reach and quality metadata in Crossref among Ukrainian publishers. We also encourage all publishers to support the &lt;a href="https://i4oc.org/" target="_blank">Initiative for Open Citations&lt;/a>.&lt;/p>
&lt;h2 id="what-else-would-ouci-like-to-see-in-crossref-metadata">What else would OUCI like to see in Crossref metadata?&lt;/h2>
&lt;p>One of the main problems we encountered when creating OUCI was the metadata about the authors. Very few publications contain data about the author&amp;rsquo;s ORCID iD. Focusing publishers on the need to transmit full metadata to Crossref, as well as monitoring their quality is a must for the resources like this. Also we look forward to the growing usage of ROR (&lt;a href="https://ror.org/" target="_blank">Research Organisation Registry&lt;/a>) - identifiers for research organisations, similar to the way that ORCID offers identifiers for researchers. We believe that the ROR will help to obtain reliable data for analyzing the scientific activity of Ukrainian institutions.&lt;/p>
&lt;p>Another issue we&amp;rsquo;ve identified in some Ukrainian journals that some of the small publishers that register content via Crossref Sponsors did not take care getting their own prefix, so it can be difficult to see their publications - this is something that showing the metadata via an index can help them see and therefore fix.&lt;/p>
&lt;h2 id="questions">Questions?&lt;/h2>
&lt;p>We&amp;rsquo;ve had lots of questions about OUCI in the run up to the launch and now that it&amp;rsquo;s live. Here is a selection of our FAQs, &lt;a href="http://ouci.dntb.gov.ua/en/about/faq/" target="_blank">all available on our website&lt;/a>. You can also &lt;a href="mailto:nazarovets@gntb.gov.ua">get in touch&lt;/a> directly if you have another question we haven&amp;rsquo;t answered yet.&lt;/p></description></item><item><title>Crossref is 20</title><link>https://www.crossref.org/blog/crossref-is-20/</link><pubDate>Tue, 14 Jan 2020 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/crossref-is-20/</guid><description>&lt;h2 id="it-seems-like-only-yesterday">It seems like only yesterday&amp;hellip;&lt;/h2>
&lt;p>On January 19th, 2000 a new not-for-profit organisation was registered in New York State. It was called Publishers International Linking Association, Inc but was more commonly referred to as &amp;ldquo;CrossRef&amp;rdquo;. This means that Crossref will be 20 years old on January 19th, 2020 so I wanted to mark the occasion with a short post. We are planning more ways to mark our 20th anniversary later this year so keep a lookout.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2020/crossref_20anniv_logo_RGB.png" alt="20th anniversary logo" width="50% class="img-responsive" />
&lt;/div>
&lt;p>Crossref becoming a legal entity was the result of developments over the previous few years and the DOI-X pilot in 1999. Moving quickly, the fledgling organisation issued its first news release on February 2nd, 2000 - &lt;a href="https://www.crossref.org/news/2000-02-02-journal-reference-linking-service-names-executive-director-board-of-directors-new-members-and-a-go-live-timetable/">Crossref Update Journal Reference Linking Service Names Executive Director, Board of Directors, New Members, and a “Go Live” Timetable&lt;/a> - announcing the appointment of an Executive Director (me!), that there were 22 members, and a plan for launching the system. From these beginnings, Crossref has grown into one of the most successful examples of sustainable scholarly infrastructure. This is due to the hard work and support of many people and organisations, and an organisational structure and governance and sustainability model that has proven very robust.&lt;/p>
&lt;p>Looking back, Crossref has achieved an amazing amount but it certainly wasn&amp;rsquo;t a forgone conclusion that we would be successful. On our tenth anniversary we wrote an overview of Crossref&amp;rsquo;s founding and early years &lt;a href="https://www.crossref.org/pdfs/CrossRef10Years.pdf">The Formation of Crossref: A Short History&lt;/a>, which highlights that vision, collaboration, trust and utility all contributed to Crossref&amp;rsquo;s success. I particularly want to recognize Eric Swanson, from Wiley, and Pieter Bolman, from Academic Press/Harcourt Brace for their critical role in the founding of Crossref and in its early success by providing the vision, bringing everyone together, serving as the first Chair and Treasurer of the organisation, and providing me with support and guidance in Crossref&amp;rsquo;s early start-up phase.&lt;/p>
&lt;p>Our history document notes that Crossref grew more quickly than expected, &amp;ldquo;By the end of 2003, CrossRef had 300 members with 12 million DOIs assigned, compared to the initial projection of 60 participating publishers and 3 million DOIs assigned.&amp;rdquo; Looking at the 2010 annual report at the ten year mark, Crossref had 43 million content items, 943 members and 15 staff. Since then, Crossref has continued to grow faster than expected and, in fact, at the start of of 20th year, growth is increasing. Our latest annual report &lt;a href="https://doi.org/10.13003/y8ygwm5" target="_blank">“Crossref Annual Report &amp;amp; Fact File 2018-19”&lt;/a> highlights that there we have 111 million content items - an average annual increase of 15%; over 11,500 members with over 180 joining per month - an average annual increase of 112%; and 37 staff - an average annual increase of 7%. Crossref is also financially stable, having generated surpluses every year since 2003 and with no fee increases in 15 years - an effective 30%+ decrease for members.&lt;/p>
&lt;p>Some of the most important statistics for me are those around DOI resolutions - humans and machines following persistent DOI links - and metadata dissemination via our open APIs and paid services. In 2010 there were around 470 million DOI resolutions for the entire year - we now see over 400 million resolutions per month. With metadata dissemination in 2010 there were on average about 40 million queries per month and there are now over 600 million per month meaning that huge amounts of metadata are flowing out into the ecosystem and improving persistent linking, discovery, and the research process. Also, very importantly, we are much more global and diverse than we were, with members and users from over 120 countries, representing all disciplines and all types of organisations (societies, commercial publishers, funders, start-ups, universities and other research institutions). And in a big change, the members in the top three fee categories accounted for 36% of revenue in 2019 - down from 56% in 2011, while the bottom three categories accounted for 46% of revenue in 2019 - up from 25% in 2011.&lt;/p>
&lt;p>As we noted in our blog post from November 2019, &lt;a href="https://doi.org/10.64000/mmdqs-23829" target="_blank">A turning point is a time for reflection&lt;/a>, &amp;ldquo;different people have always wanted different things from us and, since our founding, we have brought together diverse organisations to have discussions&amp;mdash;sometimes contentious&amp;mdash;to agree on how to help make scholarly communications better. Being inclusive can mean slow progress, but we’ve been able to advance by being flexible, fair, and forward-thinking.&amp;rdquo; While we&amp;rsquo;ve been very successful, there is a lot we can do better and it is tricky keeping all our stakeholders happy - but that&amp;rsquo;s what we&amp;rsquo;ve always done and we&amp;rsquo;ll continue to do it by being open, inclusive, collaborative, and willing to change and adapt. The one constant in Crossref&amp;rsquo;s 20 years has been change. The staff and board will be reviewing Crossref&amp;rsquo;s strategy in 2020 with the &lt;a href="https://docs.google.com/presentation/d/1RsqtnHssBkaFNphdWoq20_ewruYP04n8j_dYB9wvphM/edit#slide=id.g65af51c04a_1_238" target="_blank">value research report&lt;/a> and LIVE19 Amsterdam workshops as input. I&amp;rsquo;m confident we can continue to play a vital role in the scholarly research ecosystem.&lt;/p>
&lt;p>A huge thank you to everyone over the years who has contributed to Crossref&amp;rsquo;s success - it&amp;rsquo;s a very long list and includes staff, board members, members, users, supporters, partners, consultants, and many others. Personally, I&amp;rsquo;m proud and honored to have played a role in Crossref&amp;rsquo;s success and development over the last 20 years and the best part is that there is more to come.&lt;/p>
&lt;p>Keep an eye out for the publication of the outputs from our LIVE19 meeting and further 20th anniversary activities.&lt;/p></description></item><item><title>Metadata Corrections, Updates, and Additions in Metadata Manager</title><link>https://www.crossref.org/blog/metadata-corrections-updates-and-additions-in-metadata-manager/</link><pubDate>Mon, 13 Jan 2020 00:00:00 +0000</pubDate><author>Shayn Smulyan</author><guid>https://www.crossref.org/blog/metadata-corrections-updates-and-additions-in-metadata-manager/</guid><description>&lt;p>It&amp;rsquo;s been a year since &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">Metadata Manager&lt;/a> was first launched in Beta.  We&amp;rsquo;ve received a lot of helpful feedback from many Crossref members who made the switch from Web Deposit Form to Metadata Manager for their journal article registrations.&lt;/p>
&lt;p>The most common use for Metadata Manager is to register new DOIs for newly published articles. For the most part, this is a one-time process.  You enter the metadata, register your DOI, and success!&lt;/p>
&lt;p>But everything doesn&amp;rsquo;t always go quite as expected. Humans make mistakes, and typos in metadata are bound to happen on occasion, even for the most careful users.&lt;/p>
&lt;p>We always want to make it as easy as possible for our members to find and correct metadata errors, and to add additional metadata when it becomes available.  Our &lt;a href="https://support.crossref.org/hc/en-us/articles/213197406-Schematron-report" target="_blank">Schematron&lt;/a>, &lt;a href="https://support.crossref.org/hc/en-us/articles/213197206-Conflict-report" target="_blank">Conflict&lt;/a>, and &lt;a href="https://doi.org/10.64000/xpe8h-4tt05" target="_blank">Resolution&lt;/a> reports can help you identify existing metadata errors. We never charge content registration fees for metadata updates, additions, or corrections, so cost won&amp;rsquo;t be a barrier to getting the most accurate and thorough metadata possible.  And, now, Metadata Manager can make those corrections easier to do.&lt;/p>
&lt;h2 id="correcting-errors">Correcting Errors&lt;/h2>
&lt;p>Because accurate and comprehensive metadata is so important for the linking and discoverability of your publications, it&amp;rsquo;s important to catch these occasional errors and correct them.&lt;/p>
&lt;p>We send out &lt;a href="https://support.crossref.org/hc/en-us/articles/213197406-Schematron-report" target="_blank">reports that automatically screen for particular types of metadata errors&lt;/a>, and we pass along comments from users who contact us with concerns about metadata quality to our contacts at the relevant publisher. &lt;/p>
&lt;p>The &amp;ldquo;Review all&amp;rdquo; feature in Metadata Manager also allows you to do a final check of all the metadata you entered right before you&amp;rsquo;re about to submit your deposits.  So, we also rely on you to evaluate your own accuracy there as well.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/2020/metadata manager review.png" alt="Metadata Manager Review All" width="550" class="img-responsive" />&lt;/center>
&lt;p>Once you’ve identified an error, you’ll need to correct it. To do that, you must resubmit a whole new metadata deposit for the affected item. The newly deposited metadata will entirely overwrite the previously deposited metadata.&lt;/p>
&lt;p>If you’re used to using the Web Deposit Form, you know that the redeposit can be a little tedious. For example, if you find that you misspelled an author’s last name, you’d have to manually type in or copy-paste not just the corrected last name, but all of the journal-level, issue-level, and article-level metadata that applies to the article.&lt;/p>
&lt;p>Using Metadata Manager, the process is much simpler. The full metadata record is retained or imported and you only need to correct the error itself.&lt;/p>
&lt;h3 id="for-articles-originally-registered-using-metadata-manager">For articles originally registered using Metadata Manager&lt;/h3>
&lt;p>If you find a metadata error in an article which you initially registered in Metadata Manager itself, you can locate the article in one of two ways:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Navigate through the list of Accepted articles within a given journal&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/2020/Metadata Manager Accepted Articles.png" alt="Metadata Manager Accepted Articles" width="550" class="img-responsive" />&lt;/center>
&lt;/li>
&lt;li>
&lt;p>Or, search by article title in the Deposit History&lt;/p>
&lt;/li>
&lt;/ol>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/2020/Metadata Manager Deposit History.png" alt="Metadata Manager Deposit History" width="550" class="img-responsive" />&lt;/center>
&lt;p>Once you’ve located the relevant article, click on the article title to open the article’s metadata record. From there, you can make the necessary corrections. With the corrections complete, click “Continue” and then “Add to deposit.” After that, the process is exactly the same as depositing a new article.&lt;/p>
&lt;h3 id="for-articles-registered-using-the-web-deposit-form-or-any-other-deposit-method">For articles registered using the Web Deposit Form or any other deposit method&lt;/h3>
&lt;p>If you registered an article using the Web Deposit Form, an XML deposit, or the OJS plugin, you can still use Metadata Manager to quickly correct an error. But, first you have to import the article’s metadata into Metadata Manager.&lt;/p>
&lt;p>To do this, click into the relevant journal from your Metadata Manager home page. Then, search for the article title using the “Add existing article” search box. Select “Add” next to the article title in the search results, which will import the article’s metadata record into Metadata Manager.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/2020/metadata manager search.png" alt="Metadata Manager Article Search" width="550" class="img-responsive" />&lt;/center>
&lt;p>From here, make any necessary corrections and click “Continue” and then “Add to deposit.” Navigate to the “To deposit” tab and “Review all” to ensure that your metadata record is accurate. Then select “Deposit” to finalize your submission. You’ll receive immediate feedback as to whether your metadata deposit was successful or not.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/2020/Metadata Manager deposit submission.png" alt="Metadata Manager Deposit Submission" width="550" class="img-responsive" />&lt;/center>
&lt;h2 id="adding-additional-metadata">Adding additional metadata&lt;/h2>
&lt;p>Perhaps there are no problems with your metadata, and everything is completely accurate.  That&amp;rsquo;s great! But, we encourage our members to submit metadata that is not just accurate, but also as thorough as possible.  Check your &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Report&lt;/a> to see if there are any types of metadata that you haven&amp;rsquo;t been submitting yet, or that you haven&amp;rsquo;t been submitting for certain journals.&lt;/p>
&lt;p>Metadata Manager allows you to deposit references, licenses, and relationships between your articles and other DOIs, which weren’t possible to add using the Web Deposit Form. The same process described above for corrections will allow you to import previously registered articles and add in these new metadata elements.&lt;/p>
&lt;p>We also know that many of our members register DOIs for their articles when they’re first published online, but aren’t yet included in an issue. When the articles are published in their final versions, there is important metadata added which wasn’t yet available when the DOI was first registered. This includes things like volume number, issue number, page numbers, and full publication date, all of which are extremely important for linking and discoverability. Sometimes the resolution URL changes when the article is moved from its pre-publication status to its final version.&lt;/p>
&lt;p>So, when each issue is published, you can use Metadata Manager to pull up all the already-registered articles included in that issue and add in the newly relevant metadata like page numbers, issue number, URL, etc. Then add them to a new deposit, review, and submit.&lt;/p>
&lt;p>Please check out the full &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">Metadata Manager help documentation&lt;/a> for more details, or join us on an &lt;a href="https://www.crossref.org/webinars/">upcoming workshop&lt;/a> to test out Metadata Manager in real-time with us.  And, as always, feel free to email us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> with any questions.&lt;/p></description></item><item><title>Resolution reports: a look inside and ahead</title><link>https://www.crossref.org/blog/resolution-reports-a-look-inside-and-ahead/</link><pubDate>Tue, 17 Dec 2019 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/resolution-reports-a-look-inside-and-ahead/</guid><description>&lt;p>Isaac Farley, technical support manager, and Jon Stark, software developer, provide a glimpse into the history and current state of our popular monthly &lt;a href="https://support.crossref.org/hc/en-us/articles/213197246-Resolution-Report" target="_blank">resolution reports&lt;/a>. They invite you, our members, to help us understand how you use these reports. This will help us determine the best next steps for further improvement of these reports, and particularly what we do and don’t filter out of them.&lt;/p>
&lt;p>Isaac joined Crossref in April 2018. Before that, he was with one of our members, a geoscience society in Oklahoma (USA). As a Crossref member, like all of our members, he received the resolution reports to his inbox during the first week of each month. And like many of you, he had questions.&lt;/p>
&lt;ul>
&lt;li>What exactly is this report?&lt;/li>
&lt;li>What are all these numbers?&lt;/li>
&lt;li>Now, what about those 10 top DOIs is making them so popular?&lt;/li>
&lt;li>Why are some of these DOIs failing?&lt;/li>
&lt;li>And, what’s with this filtering of “known search engine crawlers?”&lt;/li>
&lt;/ul>
&lt;p>Now that Isaac is the Crossref Technical Support Manager, instead of asking these questions, he answers many of them.&lt;/p>
&lt;h3 id="whoatoo-fastwhat-exactly-are-resolution-reports">Whoa&amp;hellip;too fast&amp;hellip;what exactly are resolution reports?&lt;/h3>
&lt;p>The resolution report provides an overview of DOI resolution traffic, and can identify problems with your DOI links. The failed DOI.csv linked to your resolution report email contains a list of all DOIs with failed resolution attempts (more on this later). If a user clicks on a DOI with your DOI prefix and the DOI is not registered, it won’t resolve to a web page, and thus will appear on your report.&lt;/p>
&lt;h3 id="what-are-those-numbers">What are those numbers?&lt;/h3>
&lt;p>This is always a good starting point for wrangling statistical information. Resolution statistics are based on the number of DOI resolutions made through the &lt;a href="https://www.doi.org/" target="_blank">DOI proxy server&lt;/a> on a month-by-month basis. These statistics give an indication of the traffic generated by users - both human and machine - clicking (or, resolving) DOIs. CNRI (the organisation that manages the DOI proxy server) sends us resolution logs at the end of every month and we pass the data on to you.&lt;/p>
&lt;p>Resolution reports are sent by default to the business contact on your account, and we can always add or change the recipient(s) as needed. We send a separate report for each DOI prefix you’re responsible for.&lt;/p>
&lt;p>Historically we have done our best to filter out obvious crawlers and machine activity - thus valuing human-driven traffic to traffic generated by machines. That sentence above about those obvious crawlers is the real reason we are here today blogging.&lt;/p>
&lt;h3 id="why-are-some-of-those-dois-failing">Why are some of those DOIs failing?&lt;/h3>
&lt;p>The ideal failure rate is 0%. A failure rate of 0% would mean that every DOI you owned that was clicked in the previous month successfully resolved to the resolution URL you registered with us. But, in reality, a 0% failure rate is rare, because any string of characters that is combined with your prefix (e.g., 10.5555/ThisIsNOTARealDOI) and attempted to be resolved will go through the resolver and result in another single failed count toward your monthly resolution report. If you are new to Crossref, or have only deposited metadata for a small number of content items, you may have a high failure percentage (for example, 2 failures and 8 successes = 20% failure rate).&lt;/p>
&lt;p>Before 2019, the overall resolution failure rate across all publishers held fairly steady each month between 2 and 4%. You may have noticed that that number has been climbing this year. And, as a result, we think a new normal is closer to 10%.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/resolutions_unfiltered_table_new.png" alt="Total resolutions unfiltered" width="550" class="img-responsive" />&lt;/center>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/resolutions_percentage_rate_chart_new.png" alt="Unfiltered resolution failure rate" width="550" class="img-responsive" />&lt;/center>
&lt;p>Given this new norm, if your overall resolution failure rate is higher than 8 to 12%, we advise you to look closely at the failed DOI.csv file that we include in the monthly report we email you. The first step in your analysis of this portion of the report is to make sure the DOIs listed have been registered. Very often failures of legitimate DOIs are the result of content registration errors or workflow inefficiencies (i.e., DOIs are shared with the editorial team and/or contributors before being registered with us, leading to premature clicks). If during your investigation, you find invalid DOIs (like the example above: 10.5555/ThisIsNOTARealDOI) - and you will find invalid DOIs because we all make mistakes when resolving DOIs - you may simply ignore those DOIs within the report.&lt;/p>
&lt;h3 id="whats-with-this-filtering-of-known-search-engine-crawlers">What’s with this filtering of “known search engine crawlers?”&lt;/h3>
&lt;p>You may have recently noticed that we made a few changes to the resolution reports. We merged, rearranged, and in some cases completely rewrote the report you receive to your inboxes, because, well, it needed it. It was confusing. Parts of the report still are. Most specifically, those “known search engine crawlers.” To that point, you may have also noticed that the reports that arrived to your inboxes in early November 2019 were scrubbed of nearly 150 million resolutions across all members.&lt;/p>
&lt;p>Based on Jon’s analysis of these 150 million filtered resolutions, they were from bots. In the past, it was important to filter out bots, as we found our community was most focused on human readers. But should we be filtering out resolutions from bots any more? We live in a time where most of our work (at least in the Crossref community) requires both human and machine interaction; thus, aren’t at least some of these resolutions from machines legitimate?&lt;/p>
&lt;p>Our internal analysis shows that we cannot reliably determine which usage is from:&lt;/p>
&lt;ul>
&lt;li>Individual humans;&lt;/li>
&lt;li>Machines acting as intermediaries between researchers and DOIs;&lt;/li>
&lt;li>Internet service providers with real human users behind them; or,&lt;/li>
&lt;li>Bots that do not result in actual human usage.&lt;/li>
&lt;/ul>
&lt;p>As a result, it is our thinking that we may serve you better by not filtering any traffic, as we cannot guarantee that we’re removing the right things. We feel that it may be better for us to just give you everything we know. And invite you to make your own judgments.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/Total_resolutions_18_19_OCT_new.png" alt="Total resolutions last year" width="550" class="img-responsive" />&lt;/center>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/Filtered_resolutions_18_19_OCT_new.png" alt="Filtered resolutions last year" width="550" class="img-responsive" />&lt;/center>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/Total and Filtered_resolutions_18_19_OCT_new.png" alt="Total and filtered resolutions" width="550" class="img-responsive" />&lt;/center>
&lt;h3 id="howd-we-get-here">How&amp;rsquo;d we get here?&lt;/h3>
&lt;p>Jon joined Crossref in 2004. He wrote the original version of the resolution report in late 2009 in an effort to provide you, our members, with information about the usage of registered Crossref DOIs. At that time, most members were creating DOIs, but then had no real feedback about the traffic that was getting to their content (via the DOI proxy server lookups of their DOIs). These reports filled that gap. The other benefit of the report was the information it provided about failed resolutions. As suggested above, the list of failed resolutions helped members identify potential problems with the content registration process.&lt;/p>
&lt;p>A DOI that appeared on the report as a failed resolution could be cause of concern for the member. But, then again, humans and machines make mistakes when attempting to resolve DOIs (e.g., typos). Thus, not much has changed in the last ten years - the DOIs that appear in the failed resolution reports must be evaluated. Care should especially be taken when a DOI that should have been registered has not and appears as a failed resolution (e.g., data problem, agent behind on deposits, etc.) within this report.&lt;/p>
&lt;p>Like we said, mistakes happen. Users may enter a DOI incorrectly when looking it up. Or, it could be a bot throwing randomly generated traffic that looks like a DOI, but is not. And, sometimes bots are scraping through PDFs for DOIs and simply extract them incorrectly. These are all user errors, and not necessarily a concern for our members. That’s why we provide that list of what failed.&lt;/p>
&lt;p>At the start, there were a few well known crawlers that were resolving large numbers of DOIs regularly. It was our opinion at the time that it would be helpful to filter that usage since we assumed members only cared about human-driven traffic. As the next decade passed, it became clear that the internet had and would continue to change. With bots popping up every day and IP addresses moving or spanning broad address ranges (and IPs we had already filtered with the potential of being repurposed), it was obvious that we would always miss as much as we caught.&lt;/p>
&lt;p>Between the constantly changing landscape and the fact that real usage can be hidden behind IP addresses that appear like bot traffic, we no longer have confidence in our filtering process. It may be best for our users to just get the data as the data exists and know that our metadata world covers a vast range of usages - many as valid and valuable today as that human-driven traffic we prioritized ten years ago. Perhaps there is some other metric we can provide that might be useful for understanding the traffic in better ways, but filtering some of this traffic seems no longer useful.&lt;/p>
&lt;h3 id="your-help-with-next-steps">Your help with next steps&lt;/h3>
&lt;p>There you have it. Our thinking: we’ve been filtering these resolution reports the best we can for ten years. Today, our confidence in the filtering process has waned. We’re proposing a change: we want to give you the raw resolution numbers, for machines and humans alike. We want to make this change soon, but we also want to hear from you.&lt;/p>
&lt;ul>
&lt;li>How are you using the resolution reports?&lt;/li>
&lt;li>What you do you think of this proposed change?&lt;/li>
&lt;li>Will our removal of all filters from monthly resolution reports affect how you use the information within?&lt;/li>
&lt;/ul>
&lt;p>We want to hear from you, and we’re inviting you to help us determine our next steps. We are going to give you until Friday, 31 January to &lt;a href="mailto:support@crossref.org?subject=Filtering%20resolution%20reports">tell us&lt;/a> what you think of this proposed change. Then, Isaac and Jon will be back in early February to share with you what you have helped us decide. Thanks in advance!&lt;/p></description></item><item><title>A Journey of a Crossref Ambassador in Latin America</title><link>https://www.crossref.org/blog/a-journey-of-a-crossref-ambassador-in-latin-america/</link><pubDate>Wed, 11 Dec 2019 00:00:00 +0000</pubDate><author>Arley Soto</author><guid>https://www.crossref.org/blog/a-journey-of-a-crossref-ambassador-in-latin-america/</guid><description>&lt;p>&lt;em>&lt;strong>English version&lt;/strong> –– &lt;a href="#spanishversion">Información en español&lt;/a>&lt;/em>&lt;/p>
&lt;p>In this post, Arley Soto shares some experiences about his work as a Crossref ambassador in Latin America.&lt;/p>
&lt;p>When I joined as a volunteer Crossref ambassador in 2018, I never imagined that in less than two years, I would have the opportunity to travel to three Latin American cities, visit Toronto, organize the first Crossref LIVE in Spanish and hold webinars in Spanish about Crossref&amp;rsquo;s services. After almost two years of continuous learning, I think it is worth sharing my experience with the Crossref community for a better understanding of the ambassadors&amp;rsquo; role in Latin America and to inspire ambassadors from other parts of the world to write and post their experiences.&lt;/p>
&lt;p>Before becoming a Crossref ambassador, I had already been working with Crossref since 2011, when we started to coordinate DOI registration for the Biomédica Journal of the National Health Institute, one of the first journals to implement the DOI in Colombia. During these first years of relations with Crossref, I acquired basic knowledge on membership and the technical aspects of the services the agency offers, including &lt;a href="https://www.crossref.org/services/reference-linking">Reference Linking&lt;/a>, &lt;a href="https://www.crossref.org/services/content-registration">Content Registration&lt;/a> and &lt;a href="https://www.crossref.org/services/crossmark">Crossmark&lt;/a>. This close relationship with Crossref enabled us to hold the PKP-Crossref workshop in 2018 with Juan Pablo Alperín and Susan Collins at the &lt;a href="https://web.archive.org/web/20200316022408/http://congreso.redalyc.org/ocs/public/congresoEditores/index.html" target="_blank">Third International Congress of Redalyc Editors at Universidad César Vallejo, city of Trujillo&lt;/a>.&lt;/p>
&lt;p>In the same year, thanks to the invitation by the State University System (SUE, for the Spanish original) (Bogotá chapter), I had the opportunity to give a presentation on Crossref during the 2018 International Open Access Week held at Universidad Militar Nueva Granada. Around 50 people participated, including members and non-members of Crossref. There, I emphasized the nature of Crossref as a non-profit organisation, based on affiliations and the importance of new members participating in the annual elections organized by Crossref and running to be representatives in the Crossref Board of Directors.&lt;/p>
&lt;p>In November 2018, I had the pleasure of participating in the &lt;a href="https://www.youtube.com/playlist?list=PLe_-TawAqQj2QMxKbOmBs4WFHnIAK4iwn" target="_blank">Crossref Meeting in Toronto&lt;/a>, thanks to an invitation from the organizers. There, I talked to the representatives of other organisations who are members of Crossref around the world and I also met some of the members of the Crossref team in person. This event was essential for me as an ambassador, because I learned about Crossref&amp;rsquo;s vision and different projects firsthand, which increased my capacity to explain Crossref&amp;rsquo;s scope and role in the area of scientific communications. I remember that the booth Crossref provided to answer technical questions was particularly useful. There, Isaac, Shayn and other members of the technical team were always available to resolve specific queries that I had not been able to resolve before myself.&lt;/p>
&lt;p>In my second year as an ambassador, I represented Crossref at the Universidad Central del Ecuador (Quito, Ecuador), in a talk with an average of 40 people from different parts of Ecuador. There, I emphasized the technical aspects of the DOI and good practices for its use in academic publications. This talk was held on April 21, 2019, in collaboration with Crossref and &lt;a href="http://biteca.com/" target="_blank">BITECA S.A.S&lt;/a>., a &lt;a href="https://www.crossref.org/community/sponsors">sponsoring member&lt;/a> of Crossref.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/arley-biteca-blog.jpg"
alt="images of Arley Soto presenting" width="70%">
&lt;/figure>
&lt;/div>
&lt;p>In May 2019, with Susan Collins and Vanessa Fairhurst, we organized &lt;a href="https://www.crossref.org/events">Crossref LIVE Bogotá&lt;/a>, which was not only successful because of the number of attendees from different parts of Colombia and other countries in the region, but also due to the meeting of Latin American ambassadors, where we worked the full morning discussing the priorities and issues of the region with ambassadors from Brazil, Mexico, Chile and Peru. Apart from other issues, at this meeting, it became clear the need to have better resources and support in Spanish for Spanish-speaking members.&lt;/p>
&lt;p>Additionally, we helped to review the Spanish translation of the &amp;ldquo;You are Crossref&amp;rdquo; booklet, which we printed and distributed at Crossref LIVE Bogotá.&lt;/p>
&lt;p>During 2019, I participated in the &lt;a href="https://www.slideshare.net/CrossRef/introduction-to-crossref-and-content-registration-in-spanish" target="_blank">Introduction to Crossref and Content Registration&lt;/a> and &lt;a href="https://www.slideshare.net/CrossRef/reference-linking-and-cited-by-in-spanish" target="_blank">Introduction to Reference Linking and Cited-by webinar&lt;/a> webinars and held the first webinar in Spanish about the new &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">Metadata Manager&lt;/a> tool, always with the ongoing support and assistance of the Crossref team.&lt;/p>
&lt;p>And to end the year with a bang, together with Rachael Lammey, we organized the presentation: Open infrastructure and open data for the global metrics community: what can you build? I presented this at the &lt;a href="https://www.latmetrics.com/" target="_blank">2Latmetrics: Altmetrics and Open Science in Latin America colloquium on November 4 in the city of Cusco (Peru).&lt;/a>&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/arley-blog-3.jpg"
alt="image of people on the panel" width="70%">
&lt;/figure>
&lt;/div>
&lt;p>This account of activities is a demonstration of the commitment of Crossref&amp;rsquo;s ambassadors to transmit the message of the importance of ethically and responsibly sharing, citing and making science visible on the web.&lt;/p>
&lt;p>&lt;a id="spanishversion">&lt;/a>
&lt;em>&lt;strong>Spanish version&lt;/strong>&lt;/em>&lt;/p>
&lt;p>Cuando me vinculé como embajador voluntario de Crossref en 2018, no imaginaba que en menos de dos años tendría la oportunidad de viajar a 3 ciudades en Latinoamérica, conocer Toronto, organizar el primer Crossref LIVE en español y realizar webinars en español sobre los servicios de Crossref. Después de casi dos años de continuo aprendizaje, creo que vale la pena compartir mi experiencia a la comunidad de Crossref para entender mejor el rol de los embajadores en Latinoamérica y para inspirar embajadores de otras regiones del mundo a que escriban y publiquen sus experiencias.&lt;/p>
&lt;p>Antes de convertirme en embajador de Crossref ya había trabajado con Crossref desde el año 2011, año en el que empezamos a gestionar DOI para la revista Biomédica del Instituto Nacional de Salud, una de las primeras revistas en implementar DOI en Colombia. Durante esos primeros años de relaciones con Crossref, adquirí un conocimiento básico sobre las membresías y los aspectos técnicos de los servicios que la agencia ofrece, incluyendo el &lt;a href="https://www.crossref.org/services/reference-linking">Reference Linking&lt;/a>, &lt;a href="https://www.crossref.org/services/content-registration">Content Registration&lt;/a> y &lt;a href="https://www.crossref.org/services/crossmark">Crossmark&lt;/a>, entre otros. Esta relación estrecha con Crossref favoreció para que en 2018 realizáramos el taller de PKP - Crossref entre Juan Pablo Alperín y Susan Collins en el &lt;a href="https://web.archive.org/web/20200316022408/http://congreso.redalyc.org/ocs/public/congresoEditores/index.html" target="_blank">3er Congreso Internacional de Editores Redalyc, en la Universidad César Vallejo, ciudad de Trujillo &lt;/a>&lt;/p>
&lt;p>En ese mismo año, gracias a la invitación realizada por el Sistema Universitario Estatal, SUE (capítulo Bogotá) tuve la oportunidad de hacer una presentación de Crossref en la Semana Internacional de Acceso Abierto 2018, realizado en Universidad Militar Nueva Granada 2018, allí participaron alrededor de 50 personas entre miembros y no miembros de Crossref, aquí hice énfasis en la naturaleza de Crossref como organización sin ánimo de lucro, basada en afiliaciones y la importancia de que los nuevos miembros participen en las votaciones anuales que organiza Crossref y que se postulen para ser representantes en la junta directiva de Crossref.&lt;/p>
&lt;p>En noviembre de 2018 tuve el placer de participar en el &lt;a href="https://www.youtube.com/playlist?list=PLe_-TawAqQj2QMxKbOmBs4WFHnIAK4iwn" target="_blank">Crossref Meeting en la ciudad de Toronto&lt;/a>, gracias a una invitación de los organizadores. Allí conversé con representantes de  otras organizaciones afiliadas a Crossref alrededor del mundo y también conocí en persona a algunos de los integrantes del equipo de Crossref. Este evento fue de vital importancia para mí como embajador ya que conocí de primera mano la visión y los diferentes proyectos que realiza Crossref, lo que aumentó mi capacidad para explicar en mi contexto el alcance y el papel de Crossref en el entorno de la comunicación científica. Recuerdo que fue particularmente útil el kiosco que dispuso Crossref para atender inquietudes técnicas en donde Isaac, Shane y otros miembros del equipo técnico siempre estuvieron dispuestos a solucionar dudas específicas que no había podido resolver antes por mi mismo.&lt;/p>
&lt;p>En el segundo año como embajador representé a Crossref en la Universidad Central del Ecuador (Quito, Ecuador), charla a la que asistieron en promedio 40 personas de diversos lugares del Ecuador, allí hice énfasis en los aspectos técnicos del DOI y buenas prácticas de su utilización en publicaciones académicas.. Esta charla tuvo lugar el 21 de abril de 2019 y la realizamos en colaboración con Crossref y &lt;a href="https://www.biteca.com/" target="_blank">BITECA SAS&lt;/a> miembro &lt;a href="https://www.crossref.org/community/sponsors">patrocinador en Crossref&lt;/a>.&lt;/p>
&lt;p>En mayo de 2019 organizamos junto con Susan Collins y Vanessa Fairshuit el &lt;a href="https://www.crossref.org/events">Crossref LIVE Bogotá&lt;/a>, que no solamente fue exitoso por la cantidad de asistentes de diferentes partes de Colombia y de otros países de la región, sino por la reunión de embajadores de Latinoamérica, donde trabajamos una mañana completa para discutir acerca de las prioridades y temáticas propias de la región con embajadores de Brasil, México, Chile y Perú. Entre otros asuntos, en esta reunión se hizo evidente la necesidad de tener mayores recursos y soporte en Español para los miembros hispanohablantes.&lt;/p>
&lt;p>Así mismo contribuimos con la revisión de la traducción al español de la cartilla &amp;ldquo;Usted es Crossref&amp;rdquo; que imprimimos y repartimos durante el Crossref LIVE Bogotá.&lt;/p>
&lt;p>Durante 2019 participé en los webinars &lt;a href="https://www.slideshare.net/CrossRef/introduction-to-crossref-and-content-registration-in-spanish" target="_blank">Introduction to Crossref and Content Registration&lt;/a> y &lt;a href="https://www.slideshare.net/CrossRef/reference-linking-and-cited-by-in-spanish" target="_blank">Introduction to Reference Linking and Cited-by webinar&lt;/a> y llevé a cabo el primer Webinar en español sobre la nueva herramienta Metadata Manager, siempre con el acompañamiento y el soporte permanente del equipo de Crossref.&lt;/p>
&lt;p>Y para terminar el año de la mejor manera, preparamos junto con Rachael Lammey la ponencia Open infrastructure and open data for the global metrics community: what can you build? Que presenté en el congreso &lt;a href="https://www.latmetrics.com/" target="_blank">2Latmetrics: métricas alternativas y ciencia abierta en américa latina el 04 de noviembre en la ciudad de Cusco (Perú).&lt;/a>&lt;/p>
&lt;p>Este recuento de actividades es una muestra del compromiso de los embajadores de Crossref en transmitir el mensaje de la importancia de compartir, citar y hacer visible la ciencia en la web, de una manera ética y responsable.&lt;/p></description></item><item><title>Introducing our new Director of Finance &amp; Operations</title><link>https://www.crossref.org/blog/introducing-our-new-director-of-finance-operations/</link><pubDate>Mon, 09 Dec 2019 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/introducing-our-new-director-of-finance-operations/</guid><description>&lt;p>I&amp;rsquo;m happy to announce that Lucy Ofiesh has joined Crossref as our new Director of Finance and Operations. Lucy has experience supporting the sustainability and governance of not-for-profit organisations having held roles such as Executive Vice President of the Brooklyn Children&amp;rsquo;s Museum and for the last few years as Chief Operating Officer at Center for Open Science, a Crossref member.&lt;/p>
&lt;p>At Center for Open Science, Lucy built her knowledge of the research communications community; she is knowledgeable about how diverse this community has become and the challenges of planning and scale that this comes with. She knows how to manage the complexities of an expanding global operation, where members, users––and staff––in several locations need fair, timely, and accurate information, whether it’s about how invoices relate to their use of our services or information about our approach to health benefits.&lt;/p>
&lt;p>Finance underpins all that Crossref does and is crucial to long term sustainability while ‘Operations’ is a varied function and it is only becoming more so as Crossref grows. The role encompasses human resources, organisation culture, governance (including serving as secretary of the organisation), and working as part of the senior leadership team. Lucy will bring community focus to our operations, putting member experience first so that it becomes easier to work with us, from implementing systems and processes that work for multiple languages and currencies to providing personable billing support.&lt;/p>
&lt;p>She will also play a vital role on the Crossref leadership team, working with me and the other directors Bryan, Ginny, and Geoffrey to hone the strategies, goals, and metrics that will allow us to track progress and meet our ambitious goals.&lt;/p>
&lt;h2 id="a-word-from-lucy">A word from Lucy…&lt;/h2>
&lt;p>I am excited to be joining Crossref as its next Director of Finance and Operations. I previously worked for an organisation that was a Crossref member and two qualities stood out to me: first, the focus with which Crossref has provided solutions to shared challenges across scholarly publishing; and second, the ways Crossref operates transparently and from a values-driven perspective.&lt;/p>
&lt;p>My past experience has been in helping organisations run as effectively as they can, navigate change and growth, and build and support high functioning teams. Specifically, my work has focused on strategic and sustainability planning, financial forecasting, organisational governance, and staff management. My goal in finance and operations is to ensure that the working experience at Crossref––both for external partners and members and internal staff––is as frictionless as possible so we can have the greatest impact on our community.&lt;/p>
&lt;p>I am only the second person to step into this role. Lisa Hart Martin has led finance and operations for the first twenty years of Crossref&amp;rsquo;s existence. I am fortunate to be overlapping with her for a couple of weeks and grateful for the trust of the Crossref team to help guide us into our third decade. I really want to hear from our members so please &lt;a href="mailto:lofiesh@crossref.org">reach out to me with your thoughts on Crossref&amp;rsquo;s finance and operations&lt;/a>.&lt;/p>
&lt;p>&lt;em>Please join us in welcoming Lucy to the Crossref community!&lt;/em>&lt;/p></description></item><item><title>Proposed schema changes - have your say</title><link>https://www.crossref.org/blog/proposed-schema-changes-have-your-say/</link><pubDate>Wed, 04 Dec 2019 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/proposed-schema-changes-have-your-say/</guid><description>&lt;p>The first version of our metadata input schema (a DTD, to be specific) was created in 1999 to capture basic bibliographic information and facilitate matching DOIs to citations. Over the past 20 years the bibliographic metadata we collect has deepened, and we’ve expanded our schema to include funding information, license, updates, relations, and other metadata. Our schema isn’t as venerable as a MARC record or as comprehensive as JATS, but it’s served us well. It’s not currently positioned to fully support everything we want to do long term - we’d like to support assertions, map cleanly to JATS and schema.org magically at the same time, and maybe even move beyond XML - but for now it’s something we can work with to empower member metadata to help find, cite, and connect scholarly content.&lt;/p>
&lt;p>We’ve maintained backwards compatibility for most things since 2007 but this update will require some moderate changes to how contributors are modeled. The balance between supporting established tagging and addressing the evolution of what we collect and how it is expressed can be tricky. We want to collect good metadata without significantly disrupting the workflow of our membership, who are the source of the metadata. Even so, this is a fairly pragmatic update that will position us well for the future. I look forward to supporting new types of content and metadata in the future, but for now &lt;a href="https://docs.google.com/document/d/1gCRaWqkne_QqNs0BO78KGfjPFMDkpAQ-ky2nVynkuwc/edit?usp=sharing" target="_blank">take a look at what I&amp;rsquo;m proposing&lt;/a>.&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>Leave feedback, ask questions, and make suggestions in the &lt;a href="https://docs.google.com/document/d/1gCRaWqkne_QqNs0BO78KGfjPFMDkpAQ-ky2nVynkuwc/edit?usp=sharing" target="_blank">feedback document&lt;/a> or via email to &lt;a href="mailto:feedback@crossref.working">feedback@crossref.org&lt;/a>.&lt;/span>
&lt;/div>
&lt;h2 id="next-update">Next update&lt;/h2>
&lt;p>I’m proposing some updates and additions to the metadata we collect, and would like your feedback. To fully and elegantly support affiliation identifiers and multiple author roles, we need to break backwards compatibility. Specifically, we want to:&lt;/p>
&lt;h3 id="add-support-for-credit">Add support for CRediT&lt;/h3>
&lt;p>The CASRAI &lt;a href="https://credit.niso.org/" target="_blank">CRediT taxonomy&lt;/a> is increasingly used to represent roles common to contributors to research outputs. Our members are applying CRediT to contributors, so we want to capture them as well. Supporting CRediT allows Crossref and our membership to identify and credit contributors beyond authors and editors.&lt;/p>
&lt;p>As most of you know, a contributor often does more than one thing - they write, they edit, they curate. We currently only allow one contributor role as an attribute, but, to realistically support CRediT and accurately capture evidence about the work, we need to allow multiple contributor roles. This will break backwards compatibility. We can potentially support the old way and the new way, but I’m trying to avoid awkward compromises wherever possible.&lt;/p>
&lt;p>Supporting CRediT doesn’t mean you need to adopt CRediT. We’ll continue to support existing author roles, but they’ll be marked up differently. Details are in our request for feedback document.&lt;/p>
&lt;h3 id="expand-support-for-author-and-organisation-identifiers">Expand support for author and organisation identifiers&lt;/h3>
&lt;p>We collect ORCID iDs in our metadata but do not currently support other types of contributor identifiers. We also don&amp;rsquo;t support affiliation or organisation identifiers beyond those assigned within our funder and clinical trial registries. We’ve had increasing demands from both metadata suppliers and users to expand support for affiliation identifiers because&amp;hellip;identifiers are useful. We also want to expand author identifier support as ORCID IDs may only be registered by researchers who are able to curate their own ORCID record. Adding support for ISNI and Wikidata IDs is a common request, but we anticipate there&amp;rsquo;s a need for other identifiers as well.&lt;/p>
&lt;p>Our plan is to accept identifiers registered with identifiers.org as well as other identifiers upon request. We prefer to remain consistent with the identifiers.org registry as much as possible.&lt;/p>
&lt;p>We’re particularly keen to support open community-led identifiers like ORCID and &lt;a href="https://ror.org" target="_blank">ROR&lt;/a> and will continue to do so, but also want to support the metadata our members want to distribute. organisation identifiers will be particularly useful as they’ll help us populate records with ROR IDs in the future, leading to better quality affiliation metadata.&lt;/p>
&lt;h3 id="expand-support-for-a-range-of-contributor-names">Expand support for a range of contributor names&lt;/h3>
&lt;p>We currently require a surname for all contributors, and don’t provide comprehensive support for contributors whose names are represented by multiple alphabets, or who have nicknames or aliases, or who don’t have a surname. To begin with, we’ll replace surname with the more widely used ‘family name’ and remove the fixed surname requirement, allowing only a given name to be provided where appropriate. We’ll also allow a variety of names to be provided for each contributor.&lt;/p>
&lt;h3 id="expand-affiliation-support">Expand affiliation support&lt;/h3>
&lt;p>We currently collect affiliation as a single string - we’re going to break that up to support affiliation names, and add in support for organisational identifiers like ROR.&lt;/p>
&lt;h3 id="expand-support-for-data-citation">Expand support for data citation&lt;/h3>
&lt;p>For those of you who send us references, we’re adding a few fields to better support data citation. We’re also going to allow you to (optionally) supply a specific publication type for references.&lt;/p>
&lt;h3 id="other-updates">Other updates&lt;/h3>
&lt;p>We’re making some other small updates as well. If you have a small request, we may be able to accommodate it in our next update. Larger changes or additions will probably have to wait for future updates, but we’d love to start collecting suggestions now.&lt;/p>
&lt;h2 id="we-need-your-feedback">We need your feedback!&lt;/h2>
&lt;blockquote>
&lt;p>I&amp;rsquo;ll be giving a webinar on December 19 at 02:00 and 15:00 UTC to go over these changes in detail - please &lt;a href="https://www.crossref.org/webinars/#proposed-schema-changes-have-your-say">visit our webinars page&lt;/a> to register.&lt;/p>
&lt;/blockquote>
&lt;p>Again, please leave feedback, ask questions, and make suggestions in the &lt;a href="https://docs.google.com/document/d/1gCRaWqkne_QqNs0BO78KGfjPFMDkpAQ-ky2nVynkuwc/edit?usp=sharing" target="_blank">feedback document&lt;/a>, or if you prefer send feedback via email to &lt;a href="mailto:feedback@crossref.working">feedback@crossref.org&lt;/a>. We&amp;rsquo;ll be taking feedback through January 15, 2020.&lt;/p></description></item><item><title>A turning point is a time for reflection</title><link>https://www.crossref.org/blog/a-turning-point-is-a-time-for-reflection/</link><pubDate>Sat, 09 Nov 2019 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/a-turning-point-is-a-time-for-reflection/</guid><description>&lt;p>Crossref strives for balance. Different people have always wanted different things from us and, since our founding, we have brought together diverse organisations to have discussions&amp;mdash;sometimes contentious&amp;mdash;to agree on how to help make scholarly communications better. Being inclusive can mean slow progress, but we’ve been able to advance by being flexible, fair, and forward-thinking.&lt;/p>
&lt;p>We have been helped by the fact that Crossref’s founding organisations defined a clear purpose in our original &lt;a href="https://www.crossref.org/board-and-governance/incorporation-certificate">certificate of incorporation&lt;/a>, which reads:&lt;/p>
&lt;blockquote>
&lt;p>“To promote the development and cooperative use of new and innovative technologies to speed and facilitate scientific and other scholarly research.”&lt;/p>
&lt;/blockquote>
&lt;p>As Crossref prepares to turn 20 in January 2020, it’s an opportunity to reflect on achievements and highlights from 2018-19 and also ponder the preceding decades. Change is a constant at Crossref but the organisation has never strayed from its initial defined purpose. Our services and value now extend well beyond persistent identifiers and reference linking, and our connected open infrastructure benefits our 11,000+ membership as well as all those involved in scholarly research. This expansion is exactly what was envisioned to meet the goal of “speeding and facilitating” research.&lt;/p>
&lt;p>This year&amp;rsquo;s &lt;a href="https://doi.org/10.13003/y8ygwm5" target="_blank">annual report&lt;/a> is different from previous years’; it has been expanded into a ‘fact file’ so that we can invite comments on the path ahead, based on transparent access to data about our membership, activities, and finances. As we were pulling together the charts and tables for this annual report we noticed stark differences in where Crossref is today compared to years past.&lt;/p>
&lt;p>The rate of membership growth has accelerated and we now have over 180 new members joining every month, leading to one of the most striking changes we found. The lowest three membership tiers now account for 46% of revenue (up from 25% in 2011) while the highest three tiers account for 36% (down from 56% in 2011).
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/revenue-distribution-by-fee-tier-2011-2019.png"
alt="Revenue distribution by membership fee tier, comparing 2011 with 2019" width="600px">
&lt;/figure>
&lt;br>&lt;/p>
&lt;p>Today, the typical Crossref member has just a few hundred registered content items.
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/percentage-members-by-content-registration-band.png"
alt="Percentage of members by Content Registration band" width="600px">
&lt;/figure>
&lt;br>
One way we have been able to accommodate this growth efficiently is by collaborating with &lt;a href="https://www.crossref.org/community/sponsors">sponsors&lt;/a> in different countries. Very small members can join via a local sponsor that is able to provide technical, financial, language, and administrative support. We now have more members joining via sponsors, who otherwise would largely not be able to join at all. While you’d need to be a millionaire by US standards to join directly from Indonesia in our lowest fee tier (calculated using &lt;a href="https://data.oecd.org/conversion/purchasing-power-parities-ppp.htm" target="_blank">Purchasing Power Parity&lt;/a>), the sponsor program&amp;mdash;supported often by government investment in science and education&amp;mdash;has enabled Indonesian organisations to join Crossref in large numbers, supporting their aim to become one of the fastest-growing nations in open research, and to help that research be discovered.&lt;/p>
&lt;h2 id="crossref-has-repeatedly-stayed-ahead-of-developments-in-the-community">Crossref has repeatedly stayed ahead of developments in the community&lt;/h2>
&lt;p>In 2007, when the Similarity Check working group discussions and pilot started, there was disagreement on the board about whether Crossref should provide such a service and whether it was a strategic priority for members. By the end of the pilot, when the decision came to launch a production service, it was seen as essential and a top priority. This conclusion has been borne out in &lt;a href="https://docs.google.com/presentation/d/1RsqtnHssBkaFNphdWoq20_ewruYP04n8j_dYB9wvphM/edit#slide=id.g65af51c04a_1_238" target="_blank">recent research into the value of Crossref&lt;/a>; Similarity Check is one of the services of most importance to members.&lt;/p>
&lt;p>Adding preprints as a record type was controversial at the time. The board discussed the topic of “duplicative works” for about two years with strong opinions on all sides. The working group delivered a good set of policies and technical specifications and in the July 2015 board meeting there was a majority—but not 100%—agreement on the &lt;a href="https://www.crossref.org/board-and-governance/#july-2015-board-meeting">motion to approve&lt;/a>. We implemented preprints as a record type just in time to accommodate the snowballing of preprint servers emerging from existing and new members.&lt;/p>
&lt;p>Another example of a former&amp;mdash;and current&amp;mdash;area of contention is the approach to metadata. When Crossref first launched, there were lengthy discussions about what metadata we should collect. The initial focus was on the minimal set of metadata to enable reference matching in support of reference linking. In the beginning, neither article titles, lists of authors, references, nor abstracts were included in the minimal metadata set. We supported them as optional but most members opted out. However, the huge set of metadata that Crossref collects and disseminates now is seen as essential, providing a lot of value for members in terms of discoverability.&lt;/p>
&lt;p>Today, Crossref enables metadata retrieval on a large scale—an average of more than 600 million queries per month—through a variety of interfaces, most notably the REST API (Public, Polite, and Plus versions). The metadata is used by thousands of organisations and services—both commercial and not-for-profit—increasing the discoverability of member content. In fact, members of all stripes have long initiated projects to expand the metadata Crossref is able to collect and disseminate: from facilitating text mining (through license and full-text URLs); to enabling better connections with and evidence of contributions (through Funder IDs, ORCID iDs, and soon CRediT roles and ROR IDs).&lt;/p>
&lt;p>These are all examples of where Crossref has successfully “promoted the cooperative use of new and innovative technologies” and where we are meeting our mission to make scholarly communications a little bit better. As ever, we need to thank our brilliant staff for their unfailing resilience, balance, and diligence, in these times of dynamic change.&lt;/p>
&lt;h2 id="considering-the-value-and-future-of-crossref">Considering the value and future of Crossref&lt;/h2>
&lt;p>Research is global, and supporting a diverse global community is a challenge. This year, we conducted our first wide-ranging investigation into what people value from Crossref. This involved telephone interviews with over 40 community members as well as an online survey of 600+ respondents.&lt;/p>
&lt;p>The &lt;a href="https://docs.google.com/presentation/d/1RsqtnHssBkaFNphdWoq20_ewruYP04n8j_dYB9wvphM/edit#slide=id.g65af51c04a_1_238" target="_blank">results of the value research&lt;/a> are referenced throughout the &lt;a href="https://www.crossref.org/operations-and-sustainability/annual-report">annual report/fact file&lt;/a> and are available online publicly. We will be discussing the insights in various forums and posing some questions, such as:&lt;/p>
&lt;ol>
&lt;li>How should Crossref balance the different dynamics in the community?&lt;/li>
&lt;li>Are the right members involved in key decisions?&lt;/li>
&lt;li>Are the sustainability model we have and the fees we charge fair?&lt;/li>
&lt;li>Which initiatives should be top or bottom priorities?&lt;/li>
&lt;/ol>
&lt;p>Director of MIT Press, Amy Brand, recently reflected that &lt;a href="https://scholarlykitchen.sspnet.org/2019/10/22/crossref-at-a-crossroads-all-roads-lead-to-crossref/" target="_blank">Crossref is currently at a crossroads&lt;/a>, envisioning that:&lt;/p>
&lt;blockquote>
&lt;p>“The Crossref of 2040 could be an even more robust, inclusive, and innovative consortium to create and sustain core infrastructures for sharing, preserving, and evaluating research information.”&lt;/p>
&lt;/blockquote>
&lt;p>But only if Crossref is not:&lt;/p>
&lt;blockquote>
&lt;p>“held back, and its remit circumscribed, by legacy priorities and forces within the industry that may perceive open data and infrastructure as a threat to their own evolving business interests.”&lt;/p>
&lt;/blockquote>
&lt;p>We welcome this public commentary and encourage others in the community to respond and report what value Crossref offers as community-owned infrastructure, and how they’d like to see the organisation evolve.&lt;/p>
&lt;p>More than ever, we need to have this discussion with a broad and representative group. So please, read the value research report and the annual report/fact file, and get ready to voice your opinions!&lt;/p></description></item><item><title>What's your (citations') style?</title><link>https://www.crossref.org/blog/whats-your-citations-style/</link><pubDate>Tue, 29 Oct 2019 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/whats-your-citations-style/</guid><description>&lt;p>Bibliographic references in scientific papers are the end result of a process typically composed of: finding the right document to cite, obtaining its metadata, and formatting the metadata using a specific citation style. This end result, however, does not preserve the information about the citation style used to generate it. Can the citation style be somehow guessed from the reference string only?&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>I built an automatic citation style classifier. It classifies a given bibliographic reference string into one of 17 citation styles or &amp;ldquo;unknown&amp;rdquo;.&lt;/li>
&lt;li>The classifier is based on supervised machine learning. It uses TF-IDF feature representation and a simple Logistic Regression model.&lt;/li>
&lt;li>For training and testing, I used datasets generated automatically from Crossref metadata.&lt;/li>
&lt;li>The accuracy of the classifier estimated on the test set is 94.7%.&lt;/li>
&lt;li>The classifier is &lt;a href="https://gitlab.com/crossref/citation_style_classifier" target="_blank">open source&lt;/a> and can be used as a &lt;a href="https://pypi.org/project/styleclass/" target="_blank">Python library&lt;/a> or &lt;a href="http://styleclass.labs.crossref.org/citationstyle" target="_blank">REST API&lt;/a>.&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;pre tabindex="0">&lt;code>Threadgill-Sowder, J. (1983). Question Placement in Mathematical Word Problems. School Science and Mathematics, 83(2), 107-111
&lt;/code>&lt;/pre>&lt;p>This reference is the end result of a process that typically includes: finding the right document, obtaining its metadata, and formatting the metadata using a specific citation style. Sadly, the intermediate reference forms or the details of this process are not preserved in the end result. In general, just by looking at the reference string we cannot be sure which document it originates from, what its metadata is, or which citation style was used.&lt;/p>
&lt;p>Global multi-billion dollar fashion industry proves without a doubt that people care about their fashion style. But why should we care about the citation style used to generate a specific reference? This might seem like an insignificant piece of information, but it can be a powerful clue when we try to solve tasks like:&lt;/p>
&lt;ul>
&lt;li>Reference parsing, i.e., extracting metadata from the reference string. If the style is known, we also know where to expect metadata fields in the string, and it is typically enough to use simple regular expressions instead of complicated (and slow) machine learning-based parsers.&lt;/li>
&lt;li>Discipline/topic classification. Citation styles used in documents correlate with their discipline. As a result, knowing the citation style used in the document could provide a useful clue for a discipline classifier.&lt;/li>
&lt;li>Extracting references from documents. Conforming to a specific style might suggest that the reference string was correctly located within a larger document.&lt;/li>
&lt;/ul>
&lt;p>Even though the style is not directly mentioned in the reference string, the string contains useful clues. Some styles will abbreviate the authors&amp;rsquo; first names, and others won&amp;rsquo;t. Some will place the year in parentheses, others separate it with commas. The presence of such fragments in the reference string can be used as the input for the style classifier.&lt;/p>
&lt;p>I used these clues to build an automatic style classifier. It takes a single reference string on the input and classifies it into one of 17 styles or &amp;ldquo;unknown&amp;rdquo;. You can use it as a &lt;a href="https://pypi.org/project/styleclass/" target="_blank">Python library&lt;/a> or via &lt;a href="http://styleclass.labs.crossref.org/citationstyle" target="_blank">REST API&lt;/a>. The &lt;a href="https://gitlab.com/crossref/citation_style_classifier" target="_blank">source code&lt;/a> is also available. If you find this project useful, I would love to hear about it!&lt;/p>
&lt;p>And if you are interested in more details about the classifier and how it was built, read on.&lt;/p>
&lt;h2 id="data">Data&lt;/h2>
&lt;p>The data for the experiments was generated automatically. The training and the test set were generated in the same way but from two different samples. The process was the following:&lt;/p>
&lt;ul>
&lt;li>5,000 documents were randomly chosen from Crossref collection.&lt;/li>
&lt;li>Each document was formatted into 17 citation styles. This resulted in 85,000 pairs (reference string, citation style).&lt;/li>
&lt;li>Very short reference strings were removed. A short reference string typically results from very incomplete metadata of the document.&lt;/li>
&lt;li>From a number of randomly selected references, I removed fragments like the name of the month. These fragments appear in the automatically generated reference strings because sometimes months are included in the metadata records in Crossref collection. However, they rarely appear in the real-life reference strings, so removing them made the dataset more reliable.&lt;/li>
&lt;li>5,000 strings labelled as &amp;ldquo;unknown&amp;rdquo; were also added. These were generated by randomly swapping the words in the &amp;ldquo;real&amp;rdquo; reference strings.&lt;/li>
&lt;/ul>
&lt;p>This process resulted in two sets: training set containing 87,808 data points and test set containing 87,625 data points. The training set was used to choose various classification parameters and to train the final model. The test set was used to obtain the final estimation of the classifier&amp;rsquo;s accuracy.&lt;/p>
&lt;h2 id="styles">Styles&lt;/h2>
&lt;p>The classifier was trained on the following 17 citation styles (+ &amp;ldquo;unknown&amp;rdquo;):&lt;/p>
&lt;ul>
&lt;li>acm-sig-proceedings&lt;/li>
&lt;li>american-chemical-society&lt;/li>
&lt;li>american-chemical-society-with-titles&lt;/li>
&lt;li>american-institute-of-physics&lt;/li>
&lt;li>american-sociological-association&lt;/li>
&lt;li>apa&lt;/li>
&lt;li>bmc-bioinformatics&lt;/li>
&lt;li>chicago-author-date&lt;/li>
&lt;li>elsevier-without-titles&lt;/li>
&lt;li>elsevier-with-titles&lt;/li>
&lt;li>harvard3&lt;/li>
&lt;li>ieee&lt;/li>
&lt;li>iso690-author-date-en&lt;/li>
&lt;li>modern-language-association&lt;/li>
&lt;li>springer-basic-author-date&lt;/li>
&lt;li>springer-lecture-notes-in-computer-science&lt;/li>
&lt;li>vancouver&lt;/li>
&lt;/ul>
&lt;p>These 17 styles were chosen to cover a vast majority of references that we see in the real-life data, without including too many variants of very similar styles.&lt;/p>
&lt;p>If you need a different style set, fear not. You can use the library to train your own model based on exactly the styles you need.&lt;/p>
&lt;h2 id="features">Features&lt;/h2>
&lt;p>Our learning algorithm cannot work directly with the raw text on the input. It needs numerical features. In the case of text classification (and reference strings are text), one very common feature representation is &lt;a href="https://en.wikipedia.org/wiki/Bag-of-words_model" target="_blank">bag-of-words&lt;/a>. In the simplest variant, each feature represents a single word, and the value of the feature is binary: 1 if the word is present in the text, 0 otherwise.&lt;/p>
&lt;p>There are many variants of this representation, for example:&lt;/p>
&lt;ul>
&lt;li>The input text typically undergoes normalization before the features are extracted. Depending on the use case, this might include lowercasing, removing punctuation, bringing the words to their canonical form by stemming, etc.&lt;/li>
&lt;li>We do not have to use single words as features. In some use cases, it is beneficial to use &lt;a href="https://en.wikipedia.org/wiki/N-gram" target="_blank">n-grams&lt;/a>, which correspond to fixed-length sequences of words.&lt;/li>
&lt;li>Instead of binary values, we might want to use some other feature weight schemes, such as the famous &lt;a href="https://en.wikipedia.org/wiki/Tf%e2%80%93idf" target="_blank">TF-IDF representation&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>Our use case is not a typical case of text classification. We cannot use raw words as features, as words do not carry the information about the citation style. Imagine the same document formatted in different styles –– those reference strings will contain the same words, and the learning algorithm won&amp;rsquo;t be able to distinguish between them.&lt;/p>
&lt;p>As a side note, in some cases, some specific words might be important. For example, if the reference contains the word &amp;ldquo;algorithm&amp;rdquo;, chances are the document is from computer science. If so, then perhaps the citing paper is from computer science as well. And in computer science, some styles are more popular than others. Machine learning algorithms are pretty good at detecting such correlations in the data. In the first version of our classifier, however, we do not take this into account. This keeps things simpler.&lt;/p>
&lt;p>If not words, then what matters in our case? It seems that the information about the style is present in punctuation, capitalization and abbreviations.&lt;/p>
&lt;p>To capture these clues, before extracting the features we first map our reference string into a sequence of &amp;ldquo;word types&amp;rdquo; (or &amp;ldquo;character types&amp;rdquo;). The types are the following: &lt;em>lowercase-word&lt;/em>, &lt;em>lowercase-letter&lt;/em>, &lt;em>uppercase-word&lt;/em>, &lt;em>uppercase-letter&lt;/em>, &lt;em>capitalized-word&lt;/em>, &lt;em>other-word&lt;/em>, &lt;em>year&lt;/em>, &lt;em>number&lt;/em>, &lt;em>dot&lt;/em>, &lt;em>comma&lt;/em>, &lt;em>left-parenthesis&lt;/em>, &lt;em>right-parenthesis&lt;/em>, &lt;em>left-bracket&lt;/em>, &lt;em>right-bracket&lt;/em>, &lt;em>colon&lt;/em>, &lt;em>semicolon&lt;/em>, &lt;em>slash&lt;/em>, &lt;em>dash&lt;/em>, &lt;em>quote&lt;/em>, &lt;em>other&lt;/em>.&lt;/p>
&lt;p>In addition, we mark the beginning and the end of the reference string with special types &lt;em>start&lt;/em> and &lt;em>end&lt;/em>.&lt;/p>
&lt;p>So for example this string:&lt;/p>
&lt;pre tabindex="0">&lt;code>Eberlein, T. J. Yearbook of Surgery 2006, 322–324.
&lt;/code>&lt;/pre>&lt;p>is mapped into this sequence:&lt;/p>
&lt;pre tabindex="0">&lt;code>start capitalized-word comma uppercase-letter dot uppercase-letter dot capitalized-word lowercase-word capitalized-word year comma number dash number dot end
&lt;/code>&lt;/pre>&lt;p>This transformation effectively brings together different words, as long as their form is the same.&lt;/p>
&lt;p>After transforming the reference string we extract 2-grams, 3-grams and 4-grams. The values of the features are TF-IDF weights.&lt;/p>
&lt;p>Some example features in our representation include:&lt;/p>
&lt;ul>
&lt;li>&lt;em>lowercase-word lowercase-word lowercase-word lowercase-word&lt;/em> - a sequence of four lowercase words. It is most likely the part of the article title and won&amp;rsquo;t have a huge impact on the decision about the citation style.&lt;/li>
&lt;li>&lt;em>capitalized-word comma uppercase-letter dot&lt;/em> - typical representation of an author in some styles, where the first name is given as an initial only and follows the last name.&lt;/li>
&lt;li>&lt;em>left-parenthesis year right-parenthesis&lt;/em> - typical for styles that enclose the year in parentheses.&lt;/li>
&lt;li>&lt;em>number dash number&lt;/em> - this sequence is most likely pages range.&lt;/li>
&lt;/ul>
&lt;h2 id="learning-algorithm">Learning algorithm&lt;/h2>
&lt;p>I tested four learning algorithms (&lt;a href="https://en.wikipedia.org/wiki/Naive_Bayes_classifier" target="_blank">naive Bayes&lt;/a>, &lt;a href="https://en.wikipedia.org/wiki/Logistic_regression" target="_blank">logistic regression&lt;/a>, &lt;a href="https://en.wikipedia.org/wiki/Support-vector_machine" target="_blank">linear support vector classification&lt;/a> and &lt;a href="https://en.wikipedia.org/wiki/Random_forest" target="_blank">random forest&lt;/a>) in a 5-fold cross validation on the training set. The plot shows the distribution of accuracies obtained by each algorithm:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/citation_style_classification_algorithms.png"
alt="reference forms" width="600px">
&lt;/figure>
&lt;br/>
&lt;p>Based on these results, logistic regression was chosen as the algorithm with the best mean accuracy and the lowest variance of the results.&lt;/p>
&lt;h2 id="final-accuracy-estimation">Final accuracy estimation&lt;/h2>
&lt;p>The final model was trained on the entire training set and evaluated on the test set. As evaluation metric &lt;a href="https://en.wikipedia.org/wiki/Accuracy_and_precision" target="_blank">accuracy&lt;/a> was used. In this case, accuracy is simply the fraction of the references in the test set correctly classified by the classifier.&lt;/p>
&lt;p>The accuracy on the test set was 94.7%. The confusion matrix shows which styles were most often confused with each other:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/citation_style_classification_confusion_matrix.png"
alt="reference forms" width="800px">
&lt;/figure>
&lt;br/>
&lt;p>The most often confused styles are chicago-author-date and american-sociological-association. Let&amp;rsquo;s see some example strings from these two styles:&lt;/p>
&lt;pre tabindex="0">&lt;code>Legros, F. 2003. &amp;#34;Can Dispersive Pressure Cause Inverse Grading in Grain Flows?: Reply.&amp;#34; Journal of Sedimentary Research 73(2):335–335
Legros, F. 2003. &amp;#34;Can Dispersive Pressure Cause Inverse Grading in Grain Flows?: Reply.&amp;#34; Journal of Sedimentary Research 73 (2) : 335–335
&lt;/code>&lt;/pre>&lt;pre tabindex="0">&lt;code>Clarke, Jennie T. 2011. &amp;#34;Recognizing and Managing Reticular Erythematous Mucinosis.&amp;#34; Archives of Dermatology 147(6):715
Clarke, Jennie T. 2011. &amp;#34;Recognizing and Managing Reticular Erythematous Mucinosis.&amp;#34; Archives of Dermatology 147 (6) : 715
&lt;/code>&lt;/pre>&lt;pre tabindex="0">&lt;code>Chalmers, Alan, and Richard Nicholas. 1983. &amp;#34;Galileo on the Dissipative Effect of a Rotating Earth.&amp;#34; Studies in History and Philosophy of Science Part A 14(4):315–40
Chalmers, Alan, and Richard Nicholas. 1983. &amp;#34;Galileo on the Dissipative Effect of a Rotating Earth.&amp;#34; Studies in History and Philosophy of Science Part A 14 (4) : 315–340
&lt;/code>&lt;/pre>&lt;p>It seems that the styles are indeed very similar. The strings look almost identical, apart from spacing, which is not included in any way in our feature representation. No wonder that the classifier confuses these two styles a lot.&lt;/p>
&lt;p>A more detailed analysis of the classifier can be found &lt;a href="https://gitlab.com/crossref/citation_style_classifier/blob/master/analyses/citation_style_classification.ipynb" target="_blank">here&lt;/a>.&lt;/p></description></item><item><title>Accidental release of internal passwords, &amp; API tokens for the Crossref system</title><link>https://www.crossref.org/blog/accidental-release-of-internal-passwords-api-tokens-for-the-crossref-system/</link><pubDate>Fri, 04 Oct 2019 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/accidental-release-of-internal-passwords-api-tokens-for-the-crossref-system/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>On Wednesday, October 2nd, 2019 we discovered that we had accidentally pushed the main Crossref system as part of a docker image into a developer’s account on Docker Hub. The binaries and configuration files that made up the docker image included embedded passwords and API tokens that could have been used to compromise our systems and infrastructure. When we discovered this, we immediately secured the repo, changed all the passwords and secrets, and redeployed the system code. We have since been scanning all of our logs and systems to see if there has been any unusual activity that could be related to the exposure of the container.&lt;/p>
&lt;p>Please note that no external data e.g. member passwords or personal information were exposed; our source code contains only internal passwords and ‘secrets’ such as API tokens.&lt;/p>
&lt;p>Thankfully, the way in which these secrets were exposed (in compressed, binary files which were, in turn, in a Docker image) means that they were probably overlooked by the automated exploitation tools which focus on scanning source code. And, so far, we have seen nothing that would indicate that these passwords and secrets have been exploited. We will, of course, inform our members directly (and update this blog) if that changes.&lt;/p>
&lt;h2 id="more-than-you-probably-want-to-know">More than you probably want to know&lt;/h2>
&lt;p>If you are continuing to read this, my guess is that you might have questions like:&lt;/p>
&lt;ol>
&lt;li>Why are you doing something as silly as embedding secrets and passwords in your code?&lt;/li>
&lt;li>And wait a minute… I thought Crossref code was open source?&lt;/li>
&lt;li>And why is the director of strategic initiatives announcing this?&lt;/li>
&lt;/ol>
&lt;p>Let me answer these questions in random order.&lt;/p>
&lt;p>In March 2019 I took over Crossref’s technical teams when Chuck Koscher announced that he would be retiring at the end of the year. I’m now the director of technology &amp;amp; research.&lt;/p>
&lt;p>A few months earlier we had already concluded that a major portion of the Crossref system had accumulated 20 years of technical debt and that we were going to spend a significant portion of 2019 and 2020 paying down that debt.&lt;/p>
&lt;p>Specifically, a lot of the code that runs Crossref was inherited from a third party who developed it back in the early 2000s. This means that, even though any new systems that we’ve developed since 2007 have been open-source, the code for the oldest parts of the system has remained closed because it contained potentially proprietary code as well as a lot of deprecated coding practices. Also - the architecture, the tooling, and the development processes behind the Crossref system had not changed much in those twenty years. It was fantastic architecture, tooling, and code for its time. But architectures that scale to millions of records need to change to handle hundreds of millions of records. Processes that work for configuring one service need to change when you are managing dozens of services. And support tools that work for a few hundred members break down when you are dealing with tens of thousands of members.&lt;/p>
&lt;p>These parts of the Crossref system were decidedly not &lt;a href="https://en.wikipedia.org/wiki/Twelve-Factor_App_methodology" target="_blank">12 factor&lt;/a>. We were not using &lt;a href="https://en.wikipedia.org/wiki/DevOps" target="_blank">DevOps&lt;/a> or &lt;a href="https://en.wikipedia.org/wiki/Site_Reliability_Engineering" target="_blank">SRE&lt;/a> working practices to run them. And the bulk of that part of the system is still being run in a traditional data center.&lt;/p>
&lt;p>But since March we have been slowly fixing that. In incremental steps. Some of which are visible as a side effect of the security incident that precipitated this blog post. For example, one of our first moves was to move our development to &lt;a href="https://gitlab.com/crossref" target="_blank">Gitlab&lt;/a>. Even though a big chunk of the base Crossref code is still closed source, we saw moving to Gitlab as a priority because Gitlab offers a fantastic suite of tools to help automate and manage our deployments. Similarly, we have been Dockerizing the Crossref system so that it is easier to scale and run in different environments. And as part of this effort, we have spent a lot of time on the issue of how to best handle secrets. We knew our secrets management in this part of the codebase was horrible. We have been developing some experiments and infrastructure for handling these secrets securely. But we haven’t finished this work yet. And so the system slipped out into a public repo too early. Ironically, this too illustrates a fundamental change in the way we develop things. &lt;a href="https://www.crossref.org/truths/">Our default is to be open and transparent&lt;/a>. This case is currently an exception. An exception we want to eliminate, but one we are not ready to do yet. We have to audit and scrub the code first.&lt;/p>
&lt;p>Yes, this incident has been embarrassing. But not nearly as embarrassing as the fact that Crossref has succumbed to a technology industry cliche. That we spent so much time growing and focusing on new features for our members, that we neglected some of the creaking infrastructure of our infrastructure.&lt;/p>
&lt;p>And I should be clear about two things:&lt;/p>
&lt;p>First, not all of our code is like this. We have, for a long time, been building open source software and using modern best practices for secrets management in our newer subsystems and services. The problems described above are confined to twenty-year-old-code that we didn’t write in the first place and that we had been avoiding refactoring.&lt;/p>
&lt;p>And second, the technology team has been marvelous at responding to the challenge we face. They have adopted new processes and tools. They are learning new techniques. We are steadily chipping away at these problems.&lt;/p>
&lt;p>It is generally considered bad practice to praise or reward technology teams for fire-fighting instead of fire prevention, but this may be the exception that proves the rule.&lt;/p>
&lt;p>I was blown away by how the technology, product, and support teams worked together. When we discovered this problem, I sat at my desk in rural France and watched as staff from the UK, and all three US time zones shut down this problem in just a couple of hours. Obviously, I wish we hadn’t had the problem in the first place, but seeing their response did a great deal to encourage me that we are on the right track.&lt;/p>
&lt;p>In any case, it looks like we’ve been lucky. And we’ll be working even harder to refactor our code, tools, and processes so that this kind of thing doesn’t happen again.&lt;/p></description></item><item><title>Request for feedback: Conference ID implementation</title><link>https://www.crossref.org/blog/request-for-feedback-conference-id-implementation/</link><pubDate>Fri, 13 Sep 2019 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/request-for-feedback-conference-id-implementation/</guid><description>&lt;p>We’ve all been subject to floods of conference invitations, it can be &lt;a href="https://doi.org/10.64000/skv7b-cef25" target="_blank">difficult to sort the relevant from the not-relevant&lt;/a> or (even worse) sketchy conferences competing for our attention. In 2017, DataCite and Crossref started a &lt;a href="https://www.crossref.org/working-groups/conferences-projects/">working group&lt;/a> to investigate creating identifiers for conferences and projects. Identifiers describe and disambiguate, and applying identifiers to conference events will help build clear durable connections between scholarly events and scholarly literature.&lt;/p>
&lt;p>Chaired by Aliaksandr Birukou, the Executive Editor for Computer Science at Springer Nature, the group has met regularly over the past two years, collaborating to create use cases and define metadata to identify and describe conference series and events. We first asked for input on metadata specifications in &lt;a href="https://doi.org/10.64000/sscc6-we508" target="_blank">April 2018&lt;/a>. Technical implementation kicked off in February with a workshop at CERN to discuss the mechanics of making PIDs for conferences a reality.&lt;/p>
&lt;h2 id="weve-reached-another-milestone-and-want-your-feedback">We’ve reached another milestone and want your feedback&lt;/h2>
&lt;p>Crossref has supported a number of conference publication-related PIDs for years - members can currently register PIDs for conference series publications, conference proceedings, and of course individual conference papers - and that won’t change, but we will also be supporting DOI registration for conferences. A crucial step towards this is of course integrating the new identifier into our metadata input schema.&lt;/p>
&lt;h2 id="the-details">The details&lt;/h2>
&lt;p>We currently collect some limited metadata describing the conference itself such as theme, location, and dates as part of the conference series or proceeding metadata, but do not apply a DOI to that information. The new Conference ID records will include expanded metadata as &lt;a href="https://docs.google.com/document/d/1URIvkUpzcfjSd2YFIS-rdRIrOyrKSbFfhkdpGPRTAFI/edit" target="_blank">defined by the working group&lt;/a>. You&amp;rsquo;ll be able to register a distinct metadata record for a single conference. You&amp;rsquo;ll also be able to register a record for a conference series, and connect Conference IDs to conference proceeding metadata records and DOIs.&lt;/p>
&lt;p>Changes to the conference-specific metadata are backwards compatible. Members will be able to register event metadata per usual, or can instead use the new event metadata to register an identifier for their conference event and/or series. This means a member can:&lt;/p>
&lt;ul>
&lt;li>Register conference, conference series, proceedings series, proceedings, and papers in one submission&lt;/li>
&lt;li>Register proceedings or proceedings series and papers without a Conference ID included&lt;/li>
&lt;li>Register Conference IDs only&lt;/li>
&lt;li>Update an existing conference record with a Conference PID&lt;/li>
&lt;/ul>
&lt;p>I’ve written up our proposal &lt;a href="https://docs.google.com/document/d/17hKUa2WHxeUpqEe9H0I022Ggod4ID5bmuDDNmvZQn58/edit#" target="_blank">in this google doc&lt;/a> and we want your feedback before we proceed with implementation. Please comment directly in the Google doc, open a Gitlab issue, or &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>. We’ll keep the document open for comments until September 30.&lt;/p></description></item><item><title>Speaking, Traveling, Listening, Learning</title><link>https://www.crossref.org/blog/speaking-traveling-listening-learning/</link><pubDate>Thu, 29 Aug 2019 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/speaking-traveling-listening-learning/</guid><description>&lt;p>2019 has been busy for the Community Outreach Team; our small sub-team travels far and wide, talking to members around the world to learn how we can better support the work they do. We run one-day &lt;a href="https://www.crossref.org/events/">LIVE local events&lt;/a> alongside multi-language &lt;a href="https://www.crossref.org/webinars/">webinars&lt;/a>, with the addition of a new &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>, to better support and communicate with our global membership.&lt;/p>
&lt;p>This year we held a publisher workshop in London in collaboration with the British Library in February to talk about all things metadata and Open Access, before heading over to speak to members in Kyiv in March at the National Technical University of Ukraine. June saw our first ever non-English LIVE local event in Bogota held in collaboration with Biteca, and in an action-packed week in July, Rachael Lammey and myself jetted across to Kuala Lumpur and Bangkok where we collaborated with Malaysian Ministry of Education, USIM, Chulalongkorn University, iGroup, and ORCID to run two events for our South-East Asian members.&lt;/p>
&lt;p>Despite the varied locations, speakers and audiences at these events, some common themes emerged&amp;hellip;&lt;/p>
&lt;h2 id="language-matters">Language Matters&lt;/h2>
&lt;p>We currently work with member organisations in over 125 countries around the world, spanning an even greater number of languages. Whilst, at the moment at least, it is not possible to provide support across all these languages, we are improving support for non-native English speakers. We now have service &lt;a href="https://www.youtube.com/channel/UCO0pjPM4wCJRnjI6ivFXKGA/playlists?view=50&amp;amp;sort=dd&amp;amp;shelf_id=2" target="_blank">videos&lt;/a>, &lt;a href="https://www.crossref.org/services/">factsheets&lt;/a>, and brochures available in 8 languages including: French, Spanish, Brazilian Portuguese, Arabic, Chinese, Japanese, Korean, and Bahasa Indonesia. As well as expanding our webinars to include a series in Russian, Brazilian Portuguese, Arabic, Spanish and Turkish so far.&lt;/p>
&lt;p>Our global team of 24 &lt;a href="https://www.crossref.org/community/ambassadors/">Ambassadors&lt;/a> have been key in helping us to provide translated documentation, to run multi-lingual webinars and in-person events, and to answer questions from our members across languages and timezones. Our LIVE local event in Bogota, saw us run our first ever Spanish event with support from our Latin American ambassador team.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/2019/ambassadors-bogota.jpg" alt="Ambassadors in Bogota" width="500px" />
&lt;/p>
&lt;p>I know first hand how daunting public speaking can be, particularly in a second language. As a non-native Spanish speaker, the fear of being misunderstood or mis-pronouncing a word can be paralysing. Members come along to our events with a whole host of questions, sometimes preferring to come and speak to us one-on-one at the break or follow up with us after the event. Everyone has their own preferences, however, being able to communicate in the local language helps to break down barriers and boosts audience participation by taking away these added pressures.&lt;/p>
&lt;p>Additionally after running a number of these events, one of the key things we have learnt is how much content to cover in a day. Our LIVE locals are free to attend and open to the whole community. This however can mean that we have a very varied audience in terms of technical know-how and experience of working with our systems. At first we attempted to cover all we could, addressing as many needs, questions and uses of Crossref metadata that we could. However, creating content to please everyone is often a recipe for disaster and information overload. If you start to see your attendee’s eyes glaze over or they start answering emails on their smartphones, you’ve lost them.&lt;/p>
&lt;p>Instead we are now going to tailor our events a little more, asking registrants questions in advance, and selecting specific topics to cover. Having a good range of distinct topics and presenters, including local guest speakers, also helps to maintain momentum and avoid audience fatigue. Wider information and conversations will then continue on our &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a> as well as events being supplemented by &lt;a href="https://www.crossref.org/webinars/">webinars in local languages and timezones&lt;/a>.&lt;/p>
&lt;h2 id="relationship-status-its-complicated">Relationship status: It’s complicated&lt;/h2>
&lt;p>A question we are often asked when talking to members is how to link distinct content items in the metadata - whether this be a data-set to the published results, a preprint with the version of record, or a translated version of an article with the original.
Linking these related research outputs is extremely important; researchers need to be able to cite the correct version of the work they have used in their research. Creating a network of these linkages between scholarly outputs also helps ourselves, our members, and the wider community better track how research is used and developed.&lt;/p>
&lt;p>English is by far the most common language used in international academic journals and often is required for publication, however the article can be published in two or more languages, enabling greater discovery and use of the research. A frequent question we get asked is how to register the two versions, whether they use the same DOI or whether each should be assigned its own identifier. Our &lt;a href="https://support.crossref.org/hc/en-us/articles/214357426-Relationships-between-DOIs-and-other-objects" target="_blank">advice&lt;/a> is that each version of the article should have it’s own DOI for citation reasons, but should be linked in the metadata of the translated version as in the xml example below:&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/2019/relationship-example-xml.png" alt="Relationship example xml" width="600px" />
&lt;/p>
&lt;p>However, our schema covers far more relationship types than purely translations. Another interesting area of discussion which has become increasingly prevalent in the last couple of years is around preprints. We began supporting the registration of preprints in November 2016, using their specific record type and enabling linking in the metadata to the version of record, providing a clear publication history for accurate citation. Today we have almost 150k registered in our system.&lt;/p>
&lt;p>In Kyiv, we had a request to talk more about data citation; the importance of making data available and persistently linked to. Although data is often shared, it is not routinely referenced in the same way as journal articles or other publications, and this is something we want to encourage. When data is cited it provides clarity and context about the research underpinning the published article, as well as enabling greater discovery and re-use of that data in future research and publications. You can do this in two ways at Crossref, either by including data citations your reference lists, or, again, by using the relations section of the schema. If you want to learn more about the ‘how’ of data citation, we have some &lt;a href="https://doi.org/10.64000/prxtc-78q32" target="_blank">useful guidance&lt;/a> you can take a look at.&lt;/p>
&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2019/Otters.png" alt=“Meaningful connections like the otters" height="100px" width="300px" class="img-responsive" />
&lt;/div>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2019/Otters.png" alt=“Meaningful connections like the otters" height="100px" width="300px" class="img-responsive" />
&lt;/div>
&lt;p>As we are always saying Crossref is all about making connections. Linking research objects by capturing and declaring relationships within your metadata helps to map the evolution of research. Making the distinct parts of the research and publication process accessible by both readers and machines, enables wider discovery, re-use, transparency, accuracy of citations and provides greater acknowledgment of contributors.&lt;/p>
&lt;h2 id="finding-solutions-to-resolutions">Finding Solutions to Resolutions&lt;/h2>
&lt;p>Reports are rarely the things that get pulses racing (you should probably take a long, hard look at yourself if so) but they are important and can be very useful to make sure your content and the associated metadata is being registered correctly.&lt;/p>
&lt;p>We often get questions from members who want to better understand their resolution reports. These are reports generated on a monthly basis for each DOI prefix, sent to the business contact for your organisation, which provide statistics on the resolution rates of your content. So what do we mean by a resolution? Well simply, when a reader clicks on a DOI link for an article, that counts as one DOI resolution. No information is captured about the user or where they are coming from. Although we work to filter out computer-generated usage, the numbers are not a precise measure of human click-throughs to a publishers website - cached articles, search engine crawlers, and traffic directed through a library link resolver can be included in these numbers. However, the reports still provide a good idea of traffic to your publications via the DOI.&lt;/p>
&lt;p>Often the part of the report which is of particular interest is the resolution failure rate. Although in an ideal world this would be 0%, realistically 2-3% is the norm. Publishers who are new to Crossref or who have created a small number of DOIs may have a higher failure percentage and this isn’t necessarily a problem (for example, a publisher with 1 failure and 9 successes will have a 10% failure rate). A .csv file containing a list of all failed DOI resolution attempts for the month is attached to each report so that you can review any significant number of failures or any dramatic changes which may indicate a problem that needs to be solved.&lt;/p>
&lt;p>&lt;strong>Possible reasons for DOI failures:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Bad links - check that your DOI is directing readers to the correct location of your full text or landing page.&lt;/li>
&lt;li>Undeposited DOIs - any DOIs that have been distributed or published should be deposited immediately. Simply adding a DOI to your content page will not automatically register this link.&lt;/li>
&lt;li>Similarly, if your DOI was deposited mid-month and distributed earlier, any attempts prior to this date will appear as failures on your report.&lt;/li>
&lt;li>User error - sometimes users can make mistakes when typing or copy-and-pasting DOIs. To minimize the risk of this keep your DOIs simple and short.&lt;/li>
&lt;/ul>
&lt;p>It is also important to make sure you keep the contact details we have on file for your organisation up to date. Otherwise you might miss out on receiving important information about your account. Where it is possible we ask members to submit at least three separate contacts and review this regularly as people often move within and between organisations. We want to keep in touch to give you helpful, essential and interesting information (no spam!)&lt;/p>
&lt;h2 id="get-involved">Get involved&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/2019/live-bangkok.jpg" alt=“LIVE Bangkok" height="125px" width="375px" class="img-responsive" />
&lt;/div>
&lt;p>Our next LIVE local event will be held in Oakland, California on 19 September, &lt;a href="https://crossrefoakland.eventbrite.com" target="_blank">registration is open&lt;/a> and spaces are still available. Alternatively you might want to sign up to one of our interactive &lt;a href="https://www.crossref.org/webinars/">Metadata Manager webinars&lt;/a> to learn how to use our new content registration tool. Our plans for 2020 are still in the inception phase and we welcome any interest in collaboration, you can contact us at &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a> or send us a message on the &lt;a href="https://community.crossref.org/" target="_blank">Community Forum&lt;/a>, where you can also keep up to date with our plans as well as giving us your feedback and suggestions. Speaking of feedback and, we have a survey which is trying to collect just that. Please &lt;a href="https://www.surveygizmo.com/s3/5151355/cabad33fcc9b" target="_blank">let us know what you value about Crossref&lt;/a> (and what you don’t) - we’d love to hear your thoughts.&lt;/p>
&lt;br/>
&lt;br/>
&lt;br/>
&lt;hr></description></item><item><title>2019 election slate</title><link>https://www.crossref.org/blog/2019-election-slate/</link><pubDate>Fri, 23 Aug 2019 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/2019-election-slate/</guid><description>&lt;h2 id="2019-board-election">2019 Board Election&lt;/h2>
&lt;p>The annual board election is a very important event for Crossref and its members. The board of directors, comprising 16 member organisations, governs Crossref, sets its strategic direction and makes sure that we fulfill our mission. Our members elect the board - its &amp;ldquo;one member one vote&amp;rdquo; - and we like to see as many members as possible voting. We are very pleased to announce the 2019 election slate - we have a great set of candidates and an update to the ByLaws addressing the composition of the slate to ensure that the board continues to be representative of our membership.&lt;/p>
&lt;h2 id="2019-election-slate">2019 Election Slate&lt;/h2>
&lt;p>Crossref received 52 expressions of interest this year through the link that was sent out via our blog, and over 100 emails from members interested in serving on our Board. It is very exciting to see that our members want to be involved.&lt;/p>
&lt;p>In March of this year, the Board made a motion per the recommendation of an adhoc Governance Committee. It was resolves to &amp;ldquo;provide the following guidance to the Nominating Committee: To achieve balance between revenue tiers by proposing a 2019 slate consisting of one Revenue Tier 1 seat and four Revenue tier 2 seats, and a 2020 slate consisting of four Revenue Tier 1 seats and two Revenue Tier 2 seats; thereby resulting in, as nearly as practicable, an equal balance between board members representing Revenue Tier 1 and Revenue Tier 2 (as those terms are defined in Crossref&amp;rsquo;s ByLaws below).&amp;rdquo;&lt;/p>
&lt;p>&lt;em>Section 2.&lt;/em>     Nominating Committee. The Board shall appoint a Nominating Committee of five (5) members, each of whom shall be either a Director or the designated representative of a member that is not represented on the Board, whose duty it shall be to nominate candidates for Directors to be elected at the next annual election. The Nominating Committee shall designate a slate of candidates for each election that is at least equal in number to the number of Directors to be elected at such election. Each such slate will be comprised such that, as nearly as practicable, one-half of the resulting Board shall be comprised of Directors designated by Members then representing Revenue Tier 1; and one-half of the resulting Board shall be comprised of Directors designated by Members then representing Revenue Tier 2.  &amp;ldquo;Revenue Tier 1&amp;rdquo; means all consecutive membership dues categories, starting with the lowest dues category, that, when taken together, aggregate, as nearly as possible, to fifty percent (50%) of Crossref&amp;rsquo;s annual revenue. &amp;ldquo;Revenue Tier 2&amp;rdquo; means all membership dues categories above Revenue Tier 1. The Nominating Committee shall notify the Secretary in writing, at least twenty (20) days before the date of the annual meeting, of the names of such candidates, and the Secretary, except as herein otherwise provided, shall transmit a copy thereof to the last recorded address of each member of record simultaneously with the notice of the meeting.&lt;/p>
&lt;p>The Committee and the Board has worked very hard to balance the Board, so you will see two categories on the ballot, large and small.&lt;/p>
&lt;h2 id="the-2019-slate-includes-seven-candidates-for-five-available-seats">The 2019 slate includes: seven candidates for five available seats&lt;/h2>
&lt;p>Candidate organisations, in alphabetical order, for the Small category (1 seat available):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>eLife&lt;/strong>, Melissa Harrison&lt;/li>
&lt;li>&lt;strong>The Royal Society&lt;/strong>, Stuart Taylor&lt;/li>
&lt;/ul>
&lt;p>Candidate organisations, in alphabetical order, for the Large category (4 seats available):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Clarivate Analytics&lt;/strong>, Nandita Quaderi&lt;/li>
&lt;li>&lt;strong>Elsevier&lt;/strong>, Chris Shillum&lt;/li>
&lt;li>&lt;strong>IOP&lt;/strong>, Graham McCann&lt;/li>
&lt;li>&lt;strong>Springer Nature&lt;/strong>, Reshma Shaikh&lt;/li>
&lt;li>&lt;strong>Wiley&lt;/strong>, Todd Toler&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="take-a-look-at-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2019-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2019-slate/">Take a look at the candidates&amp;rsquo; organisational and personal statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process, by voting in the election&lt;/h2>
&lt;p>If your organisation is a voting member in good standing of Crossref as of September 13, 2019, you are eligible to vote when voting opens on September 27, 2019.&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>On September 27, 2019, your organisation&amp;rsquo;s designated voting contact will receive an email with the Formal Notice of Meeting and Proxy Form with concise instructions on how to vote.  You will also receive a user name and password with a link to our voting platform.&lt;/p>
&lt;p>The election results will be announced at &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2019">LIVE19 Amsterdam&lt;/a> on November 13, 2019.&lt;/p></description></item><item><title>Building better metadata with schema releases</title><link>https://www.crossref.org/blog/building-better-metadata-with-schema-releases/</link><pubDate>Wed, 21 Aug 2019 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/building-better-metadata-with-schema-releases/</guid><description>&lt;p>This month we have officially released a new version of our input metadata schema. As well as walking through the latest additions, I&amp;rsquo;ll also describe here how we&amp;rsquo;re starting to develop a new streamlined and open approach to schema development, using GitLab and some of the ideas under discussion going forward.&lt;/p>
&lt;h2 id="whats-included-in-version-442">What&amp;rsquo;s included in version 4.4.2&lt;/h2>
&lt;p>The latest schema as of August 2019 is version 4.4.2 and this release now includes:&lt;/p>
&lt;ul>
&lt;li>Support for &amp;ldquo;&lt;a href="https://www.crossref.org/help/pending-publication/">pending publication&lt;/a>&amp;rdquo;&lt;/li>
&lt;li>Support for JATS 1.2 abstracts&lt;/li>
&lt;li>Abstract support to dissertations, reports, and allow multiple abstracts wherever available&lt;/li>
&lt;li>Support for multiple dissertation authors&lt;/li>
&lt;li>A new &lt;code>acceptance_date&lt;/code> element added to journal article, book, book chapter, and conference paper record types&lt;/li>
&lt;/ul>
&lt;p>&amp;ldquo;Pending publication&amp;rdquo; is the term we&amp;rsquo;ve coined for the phase where a manuscript has been accepted for publication but where the publisher needs to communicate a DOI much earlier than most article metadata is available. Some members asked for the ability to register and assign DOIs prior to online publication, even without a title, so this allows members to register a DOI with minimal metadata, temporarily, before online publication. There is of course no obligation to use this feature.&lt;/p>
&lt;p>It&amp;rsquo;s worth calling out the addition of &lt;code>acceptance_date&lt;/code> too. This is a key attribute that is heavily requested by downstream metadata users like universities. Acceptance dates allow people to report on outputs much more accurately, so we do encourage all members to start including acceptance dates in their metadata. It&amp;rsquo;s highly appreciated!&lt;/p>
&lt;h2 id="schema-files-public-on-gitlab">Schema files public on GitLab&lt;/h2>
&lt;p>I’ve added our latest schema to a new &lt;a href="https://gitlab.com/crossref/schema" target="_blank">GitLab repository&lt;/a>, There you’ll find the schema files, some documentation, and the opportunity to suggest enhancements. The schema has been released as bundle &lt;a href="https://gitlab.com/crossref/schema/-/releases" target="_blank">0.1.1&lt;/a> and also includes our new Grant metadata schema for members that fund research.&lt;/p>
&lt;p>The schema has been available in some form for months but at this point we consider it ‘officially’ released to kick off our new but necessary practice of formal schema releases. Any forthcoming updates will be added to the next version.&lt;/p>
&lt;h2 id="schema-management-process">Schema management process&lt;/h2>
&lt;p>We’ve been adding sets of metadata and new record types over the years, but also need to have a defined process for small but vital pieces of metadata that you need to provide and retrieve from our metadata records. If you’re wondering what our procedure for updating our schema is, you are not alone! We have not had a formal process, instead relying on ad-hoc requests from our membership and working groups. Our release management and schema numbering has also not been consistent.&lt;/p>
&lt;p>Going forward, I will ensure that all forthcoming versions of our metadata schema are be posted as a draft on GitLab for review and comment, and the final version will be officially released via GitLab as well.&lt;/p>
&lt;p>It&amp;rsquo;s important to note that when we talk about &amp;ldquo;the schema&amp;rdquo;, we generally mean the &lt;em>input&lt;/em> schema specifically i.e. what members of Crossref can register about the content they produce. As always, the output for retrieving that metadata is subject to separate development plans for our Metadata APIs. I&amp;rsquo;m working with our technical team so we can develop and introduce an &amp;rsquo;end-to-end&amp;rsquo; approach that doesn&amp;rsquo;t in future treat the input and the output as such separate considerations.&lt;/p>
&lt;h2 id="whats-next">What&amp;rsquo;s next&lt;/h2>
&lt;p>Many of the updates in this latest release have been in the works for some time. Changes to our metadata both large and small are considered carefully, but I’d like to do this in a transparent and cooperative way with our community.&lt;/p>
&lt;p>I recently set up the &amp;ldquo;Metadata Practitioners Interest Group&amp;rdquo; and we&amp;rsquo;ve just had our second call. A big topic was how to best manage the ideas and requests from the community. The ability for public comments on GitLab is a first step.&lt;/p>
&lt;p>This most recent update contains a mix of long term projects and updates to keep our metadata current and useful. Other changes that are under discussion will require more development on our end. But stay tuned for more information about forthcoming changes, as well information about how you can contribute.&lt;/p></description></item><item><title>Introducing our new Director of Product</title><link>https://www.crossref.org/blog/introducing-our-new-director-of-product/</link><pubDate>Mon, 19 Aug 2019 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/introducing-our-new-director-of-product/</guid><description>&lt;p>I&amp;rsquo;m happy to announce that &lt;a href="https://www.crossref.org/people/bryan-vickery">Bryan Vickery&lt;/a> has joined Crossref today as our new Director of Product. Bryan has extensive experience developing products and services at publishers such as Taylor &amp;amp; Francis, where he led the creation of the open-access platform Cogent OA. Most recently he was Managing Director of Research Services at T&amp;amp;F, including Wizdom.ai after it was acquired.&lt;/p>
&lt;p>He previously held a range of roles from Publisher to Chief Operations Officer at BioMedCentral, as well as online community and technology leadership roles at Elsevier.&lt;/p>
&lt;p>Bryan is a great addition to Crossref and we are lucky to have him. The product team is keen to progress the long list of wishes from our community with his guidance. Bryan will bring focus and clarity to our roadmap and our development processes, making it easier for people to adopt and participate in our services, and ensuring that we are working on the issues that are most important to our members.&lt;/p>
&lt;p>He will also be a vital part of the leadership team, working with me and the other directors Geoffrey, Ginny, and Lisa to help us take the organisation forward in a transparent way that serves our mission and empowers our excellent staff.&lt;/p>
&lt;h2 id="and-now-a-few-words-from-bryan">And now a few words from Bryan…&lt;/h2>
&lt;p>I’m thrilled to be joining Crossref as Director of Product at a time of considerable change in scholarly communication. I’ve worked in, and around, scholarly publishing for more than 20 years.&lt;/p>
&lt;p>This is a challenging role. We have many exciting services and collaborations to progress, and also technical debt to address (like everyone else) to upgrade our existing services - it’s essential we balance these. My priority is to stay on top of the issues of the highest value to the scholarly community, now and in the future, and ensure we deliver services that are both useful and usable.&lt;/p>
&lt;p>I will be attending Crossref LIVE19 “The strategy one” along with other staff and look forward to meeting many of our members then. In the meantime, I&amp;rsquo;d love to hear your thoughts on where we’ve been (what it’s like working with us and using our services) and where we&amp;rsquo;re going (what you’d like to see from us). You can &lt;a href="mailto:feedback@crossref.org">reach me via our feedback email&lt;/a>.&lt;/p>
&lt;p>&lt;em>Please join us in welcoming Bryan to the Crossref community.&lt;/em>&lt;/p></description></item><item><title>We'll be rocking your world again at PIDapalooza 2020</title><link>https://www.crossref.org/blog/well-be-rocking-your-world-again-at-pidapalooza-2020/</link><pubDate>Sun, 18 Aug 2019 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/well-be-rocking-your-world-again-at-pidapalooza-2020/</guid><description>&lt;p>The official countdown to PIDapalooza 2020 begins here! It&amp;rsquo;s 163 days to go till our flame-lighting opening ceremony at the fabulous Belem Cultural Center in Lisbon, Portugal. Your friendly neighborhood PIDapalooza Planning Committee&amp;mdash;Helena Cousijn (DataCite), Maria Gould (CDL), Stephanie Harley (ORCID), Alice Meadows (ORCID), and I&amp;mdash;are already hard at work making sure it’s the best one so far!&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;div style="width:195px; text-align:center;" >&lt;iframe src="https://www.eventbrite.com/countdown-widget?eid=60971406117" frameborder="0" height="212" width="195" marginheight="0" marginwidth="0" scrolling="no" allowtransparency="true">&lt;/iframe>&lt;div style="font-family:Helvetica, Arial; font-size:12px; padding:10px 0 5px; margin:2px; width:195px; text-align:center;" >&lt;/div>&lt;/div>&lt;/span>
&lt;/div>
We have a shiny [new website](https://pidapalooza.org), with loads more information than before, including spotify playlists (please add your PID songs to [the 2020 one](https://open.spotify.com/playlist/1oJtbpTzF9I3MewQ1Yasml?si=D0TKdR8BTJSL-GA3X_LwVQ)!), an instagram photo gallery, and of course registration information. Look out for updates there and on [Twitter](https://twitter.com/pidapalooza).
&lt;p>And, led by Helena, the Program Committee is starting its search for sessions that meet PIDapalooza’s goals of being PID-focused, &lt;strong>fun&lt;/strong>, informative, and interactive. If you’ve a PID story to share, a PID practice to recommend, or a PID technology to launch, the Committee wants to hear from you. Please send them your ideas, using &lt;a href="https://forms.gle/oeSeiZEni3cPipKm6" target="_blank">this form&lt;/a>, by September 27. We aim to finalize the program by late October/early November.&lt;/p>
&lt;h2 id="dont-forget-to-tie-your-proposal-into-one-of-the-six-festival-themes">Don’t forget to tie your proposal into one of the six festival themes:&lt;/h2>
&lt;h4 id="theme-1-putting-principles-into-practice">Theme 1: Putting Principles into Practice&lt;/h4>
&lt;p>FAIR, Plan S, the 4 Cs; principles are everywhere. Do you have examples of how PIDs helped you put principles into practice? We’d love to hear your story!&lt;/p>
&lt;h4 id="theme-2-pid-communities">Theme 2: PID Communities&lt;/h4>
&lt;p>We believe PIDs don’t work without community around them. We would like to hear from you about best practice among PID communities so we can learn from each other and spread the word even further!&lt;/p>
&lt;h4 id="theme-3-pid-success-stories">Theme 3: PID Success Stories&lt;/h4>
&lt;p>We already know PIDs are great, but which strategies worked? Share your victories! Which strategies failed? Let’s turn these into success stories together!&lt;/p>
&lt;h4 id="theme-4-achieving-persistence-through-sustainability">Theme 4: Achieving Persistence through Sustainability&lt;/h4>
&lt;p>Persistence is a key part of PIDs, but there can’t be persistence without sustainability. Do you want to share how you sustain your PIDs or how PIDs help you with sustainability?&lt;/p>
&lt;h4 id="theme-5-bridging-worlds---social-and-technical">Theme 5: Bridging Worlds - Social and Technical&lt;/h4>
&lt;p>What would make heterogeneous PID systems &amp;lsquo;interoperate&amp;rsquo; optimally? Would standardized metadata and APIs across PID types solve many of the problems, and if so, how would that be achieved? And what about the social aspects? How do we bridge the gaps between different stakeholder groups and communities?&lt;/p>
&lt;h4 id="theme-6-pid-party">Theme 6: PID Party!&lt;/h4>
&lt;p>You don’t just learn about PIDs through powerpoints. What about games? Interpretive dance? Get creative and let us know what kind of activity you’d like to organize at PIDapalooza this year!&lt;/p>
&lt;h2 id="pidapalooza-the-essentials">PIDapalooza: the essentials&lt;/h2>
&lt;p>&lt;strong>What?&lt;/strong> &lt;a href="https://pidapalooza.org" target="_blank">PIDapalooza 2020&lt;/a> - the open festival of persistent identifiers &lt;br>
&lt;strong>When?&lt;/strong> 29-30 January 2020 (kickoff party the evening of January 28) &lt;br>
&lt;strong>Where?&lt;/strong> Belem Cultural Center, Lisbon, Portugal (&lt;a href="https://goo.gl/maps/HEmmQUjkJcEoqFTZ7" target="_blank">map&lt;/a>) &lt;br>
&lt;strong>Why?&lt;/strong> To think, talk, live persistent identifiers for two whole days with your fellow PID people, experts, and newcomers alike!&lt;/p>
&lt;p>We hope you’re as excited about PIDapalooza 2020 as we are and we look forward to seeing you in Lisbon.&lt;/p></description></item><item><title>LIVE19, the strategy one: have your say</title><link>https://www.crossref.org/blog/live19-the-strategy-one-have-your-say/</link><pubDate>Sun, 11 Aug 2019 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/live19-the-strategy-one-have-your-say/</guid><description>&lt;p>With a smaller group than usual, we&amp;rsquo;re dedicating this year&amp;rsquo;s annual meeting to hear what you value about Crossref. Which initiatives would you put first and/or last? Where would you have us draw the line between mission and ambition? What is “core” for you? How could/should we adapt for the future in order to meet your needs?&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;img src="https://www.crossref.org/images/community-images/crossref-live-19-logo copy.jpg" alt="Crossref LIVE19 logo" width="200px" />&lt;/span>
&lt;/div>
&lt;h2 id="striving-for-balance">Striving for balance&lt;/h2>
&lt;p>Different people want different things from us. As Aristotle said: &lt;em>&amp;ldquo;There is only one way to avoid criticism: do nothing, say nothing, and be nothing.&amp;rdquo;&lt;/em> As we prepare for our 20th year of operation, please join this unique meeting to help shape the future of Crossref.&lt;/p>
&lt;p>There won&amp;rsquo;t be any plenary talks about trends in scholarly communications, but instead workshop-style activities to help hone our strategy, do some scenario planning, and prioritize goals together, as a community.&lt;/p>
&lt;h2 id="have-your-say">Have your say&lt;/h2>
&lt;p>Whether you can make it in person or not, you can still pitch in by giving us your opinion in advance. We&amp;rsquo;re gathering broad input on what you think we&amp;rsquo;re doing well, whether we&amp;rsquo;re on the right track strategically, and how we can improve. There&amp;rsquo;s never been such a comprehensive study of what value we offer so we hope to learn a lot and will adjust plans based on the results.
&lt;br>&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>Please take the &amp;ldquo;&lt;a href="https://www.surveygizmo.com/s3/5151355/blog" target="_blank">Value of Crossref&lt;/a>&amp;rdquo; survey. It&amp;rsquo;ll take 10-12 minutes.&lt;/span>
&lt;/div>
&lt;h2 id="at-the-meeting">At the meeting&lt;/h2>
&lt;p>Please join us at the Tobacco Theater in central Amsterdam on the afternoon of 13th November from 12:30 pm and for the full day of 14th November. The first afternoon will involve some scene-setting talks with key information you&amp;rsquo;ll need for the following day&amp;rsquo;s workshops, including the results of the survey above. There will also be some announcements, including who members have voted onto our board (this year&amp;rsquo;s slate is yet to be communicated), and of course plenty of time for discussion and questions among peers.&lt;/p>
&lt;p>In addition to the results of the survey, during the meeting each participant will be furnished with a &amp;lsquo;fact pack&amp;rsquo; to reference in their discussions and recommendations. It will include answers to questions like &lt;code>who pays to keep Crossref sustainable?&lt;/code>. I&amp;rsquo;m looking forward to busting some myths on that one! Everyone will be pre-assigned to a particular table/topic (like a wedding!) and will stay in those groups for roundtable discussions. There will be a community facilitator and a staff member on each table. You will be able to mingle more widely in the breaks and the evening drinks reception on the 13th.&lt;/p>
&lt;p>Based on this provided data, we&amp;rsquo;ll be asking participants to think about key questions such as:&lt;/p>
&lt;ul>
&lt;li>Who, ultimately, does Crossref serve?&lt;/li>
&lt;li>What should Crossref&amp;rsquo;s product development priorities be?&lt;/li>
&lt;li>What (if anything) would be missed if Crossref went away? (i.e. what&amp;rsquo;s our central value)&lt;/li>
&lt;li>What does &amp;lsquo;community&amp;rsquo; really mean and how should Crossref work to better balance opposing priorities?&lt;/li>
&lt;/ul>
&lt;blockquote>
&lt;p>Research is global, and supporting a diverse global community is a challenge. Come and have your say. &lt;a href="http://crossreflive19.eventbrite.com" target="_blank">Register today&lt;/a>.&lt;/p>
&lt;/blockquote>
&lt;p>I can&amp;rsquo;t wait to see you there and hear your thoughts.&lt;/p></description></item><item><title>Funders and infrastructure: let’s get building</title><link>https://www.crossref.org/blog/funders-and-infrastructure-lets-get-building/</link><pubDate>Mon, 29 Jul 2019 00:00:00 +0000</pubDate><author>Josh Brown</author><guid>https://www.crossref.org/blog/funders-and-infrastructure-lets-get-building/</guid><description>&lt;p>Human intelligence and curiosity are the lifeblood of the scholarly world, but not many people can afford to pursue research out of their own pocket. We all have bills to pay. Also, compute time, buildings, lab equipment, administration, and &lt;a href="https://web.archive.org/web/20190729155623/https://fap-dep.web.cern.ch/rpc/2019-annual-contributions-cern-budget" target="_blank">giant underground thingumatrons do not come cheap&lt;/a>. In 2017, according to statistics from &lt;a href="https://en.unesco.org/" target="_blank">UNESCO&lt;/a>, &lt;a href="http://uis.unesco.org/apps/visualisations/research-and-development-spending/" target="_blank">$1.7 trillion dollars&lt;/a> were invested globally in Research and Development. A lot of this money comes from the public - &lt;a href="http://data.uis.unesco.org/Index.aspx?DataSetCode=SCN_DS&amp;amp;lang=en" target="_blank">22c in every dollar &lt;/a>spent on R&amp;amp;D in the USA comes from government funds, for example. Funders really do support a LOT of research.&lt;/p>
&lt;p>For that research to count, it needs to be communicated. For us to interpret those research communications critically, we need to understand how the research was done and &lt;a href="https://www.ucsusa.org/disguising-corporate-influence-science-about-sugar-and-health" target="_blank">who paid&lt;/a> for it.&lt;/p>
&lt;p>At Crossref, we’ve been &lt;a href="https://www.crossref.org/community/funders">working with funders&lt;/a> for many years. The &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a> was launched (with donated support from Elsevier) in 2012, and provides a taxonomy of funders, each uniquely identified, which has grown to cover 20,000 funders around the world. This resource has helped to connect the organisations that provide research funds to resources, projects, and publications. Some are also members and have been registering content with us. This is a growing trend as more funders start to launch their own &lt;a href="https://amrcopenresearch.org/" target="_blank">open platforms&lt;/a>. Funders also consume metadata from Crossref members, using it to track and report on the published outputs of the researchers they support.&lt;/p>
&lt;p>More recently, we have been exploring the ways that we can do more in partnership with the funding community. As our &lt;a href="https://www.crossref.org/board-and-governance/">board&lt;/a> concluded in 2017,&lt;/p>
&lt;blockquote>
&lt;p>Crossref requires increased emphasis on funders, understanding their needs and requirements and increasingly including funders in the scholarly communication dialogue.&lt;/p>
&lt;/blockquote>
&lt;p>In response, we have explored new services and practical enhancements to our existing portfolio, such as the &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/grants/">new grants registration system&lt;/a>, which will also power search and lookup tools.&lt;/p>
&lt;p>This new initiative will link &lt;a href="https://github.com/CrossRef/grantID-schema" target="_blank">structured information about grants&lt;/a> with DOIs, and enable us to provide open tools to help institutions, publishers, and research supporting organisations to re-use that data and make long-lasting connections between specific funding (and &lt;a href="https://doi.org/10.64000/5cfh1-1wa10" target="_blank">other kinds of research support&lt;/a>) and research activities and outcomes. The value of this was beautifully explained by our friends at &lt;a href="https://wellcome.ac.uk/" target="_blank">Wellcome&lt;/a> (now members) in this &lt;a href="https://doi.org/10.64000/xqr28-ee750" target="_blank">blog post&lt;/a>, and was reinforced by a recent survey undertaken by ORCID in which linking grants to outputs was cited as one of the major challenges facing funders. The Crossref Grant Linking System launched this July with a group of early adopter funders, ably supported by the team at &lt;a href="https://europepmc.org/" target="_blank">Europe PMC&lt;/a>.&lt;/p>
&lt;p>We’re not stopping there though: we are lucky to have a dedicated and engaged &lt;a href="https://www.crossref.org/working-groups/funders">funder advisory group&lt;/a>, and we will continue to work with them to understand how our interactions with funders can benefit the wider ecosystem that we support, and help funders to achieve their goals.&lt;/p>
&lt;p>There are many platforms providing vital intelligence to funders, from &lt;a href="https://www.dimensions.ai/" target="_blank">Dimensions&lt;/a> to &lt;a href="https://www.openaire.eu/" target="_blank">OpenAIRE&lt;/a>, which rely on Crossref data. Last month, I was at the &lt;a href="https://indico.cern.ch/event/786048/" target="_blank">OAI11 workshop&lt;/a> in &lt;a href="https://www.geneve.com/" target="_blank">Geneva&lt;/a>, and it was striking how many presentations included a slide that mentioned using Crossref data. There were 200 people from the open science community there, and they clearly rely on Crossref as a &lt;a href="https://cameronneylon.net/blog/where-are-the-pipes-building-foundational-infrastructures-for-future-services/" target="_blank">foundational infrastructure&lt;/a> to build their ecosystem. That community is also just a subset of the more than 2,500 registered consumers of Crossref metadata. We need to keep asking how this metadata can improve the information available to funders, to their partners and to service providers. Adding grants to the mix will help all of these parties provide an even richer picture of research.&lt;/p>
&lt;p>As we move forward with our engagement with the global funding community, new opportunities are becoming visible, and not just for funders. Better experiences for authors, reduced overhead for publishers and easier benchmarking for institutions are a selection of benefits that this work can help us realize.&lt;/p>
&lt;p>When we really start to get to grips with opening up information about the inputs to research in the way we already have with its outputs, truly exciting things can happen. The really great thing about this is that, quite literally, everyone benefits: from Crossref members to everyone touched by advances in our understanding of the world. Let’s get building!&lt;/p></description></item><item><title>Big things have small beginnings: the growth of the Open Funder Registry</title><link>https://www.crossref.org/blog/big-things-have-small-beginnings-the-growth-of-the-open-funder-registry/</link><pubDate>Sun, 21 Jul 2019 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/big-things-have-small-beginnings-the-growth-of-the-open-funder-registry/</guid><description>&lt;p>The &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a> plays a critical role in making sure that our members correctly identify the funding sources behind the research that they are publishing. It addresses a similar problem to the one that led to the creation of &lt;a href="http://orcid.org" target="_blank">ORCID&lt;/a>: researchers&amp;rsquo; names are hard to disambiguate and are rarely unique; they get abbreviated, have spelling variations and change over time.&lt;/p>
&lt;p>The same is true of organisations. You don’t have to read all that many papers to see authors acknowledge funding from the US National Institutes of Health as NIH, National Institutes for Health, National Institute of Health, etc. And wait, are you sure they didn’t mean National Institute for Health Research? (An entirely separate UK-based funder).&lt;/p>
&lt;p>And a lot of countries have a National Science Foundation…&lt;/p>
&lt;p>If each funder has a unique identifier, our members can &lt;a href="https://www.crossref.org/education/funder-registry/">include it in the metadata&lt;/a> that they register with us, giving a clear and accurate link between the funder of the research and the published outcomes. And we can make that information available to everyone via our &lt;a href="https://www.crossref.org/education/retrieve-metadata/rest-api/">API&lt;/a>, and build &lt;a href="https://search.crossref.org/funding" target="_blank">human interfaces&lt;/a> so that you can look it up.&lt;/p>
&lt;p>Many types of funding bodies are represented in the Funder Registry, from government agencies and large international foundations to small single-mission charities, and everything in between. As well as a unique DOI for each institution, the Registry contains additional metadata that can help to identify the funder such as country, abbreviated or alternate names, translated names, and so on.&lt;/p>
&lt;p>The Registry also supports relationships between different funders. These can be hierarchical parent/child relationships for larger organisations, or connections between archival and current entries in instances where a funder has changed its name or become part of another body (to tell us about these kinds of changes you just need to &lt;a href="mailto:funder.registry@crossref.org">get in touch&lt;/a>).&lt;/p>
&lt;p>The Registry was donated to Crossref by Elsevier when we first introduced funding information as part of our Content Registration schema back in 2012. We started out with a list of just over 4000 funders. Through an ongoing partnership the list has been - and continues to be - updated on a monthly basis by Elsevier, and sent to Crossref as a formatted XML file that we process and release.&lt;/p>
&lt;p>In return, Crossref sends Elsevier a feed of funder names that our members have registered with us that are not present in the Registry, which a team at Elsevier validates and adds to their databases, and then puts those newly-identified funders in to the next iteration of the list they send to us. It’s nice and circular and benefits both parties.&lt;/p>
&lt;h3 id="we-released-v127-of-the-funder-registryhttpsgitlabcomcrossrefopen_funder_registry-last-week-and-it-contains-entries-for-an-impressive--21356-funders">We released &lt;a href="https://gitlab.com/crossref/open_funder_registry" target="_blank">v1.27 of the Funder Registry&lt;/a> last week, and it contains entries for an impressive 21,356 funders.&lt;/h3>
&lt;p>I’ve been involved in this project since its inception, and have enjoyed a productive and cooperative working relationship with the team at Elsevier, headed by Peter Berkvens (Senior Product Manager) and Paul Mostert (Director Product Management). I asked them to explain a little about the process from their side:&lt;/p>
&lt;p>“Our team maintains a workflow in which Acknowledgement and Funding sections from articles are scanned for appearances of funding organisations using Natural Language Processing techniques. External Elsevier vendors then edit the data and add the validated names of the funders to what is called the Funding Bodies Taxonomy. The latter feeds Crossref’s Open Funder Registry.&lt;/p>
&lt;p>Currently, the Taxonomy is nearing 22,000 Funders. It is expected it will grow to 25,000 Funders eventually. When this stage is reached, Elsevier believes that all existing Funders will be covered in the Funder Registry. Elsevier will continue to maintain the list adding new Funders as soon as they appear in scientific papers.&lt;/p>
&lt;p>Elsevier’s Primary Articles production workflow for ScienceDirect uses the Funder Registry during the copyediting process, validating and tagging the Funders that appear in the accepted articles for Elsevier journals hosted by ScienceDirect. We then send the funder names and IDs to Crossref as part of our metadata.”&lt;/p>
&lt;p>Thanks to everyone involved for getting us ever-closer to a truly comprehensive list of funders.&lt;/p>
&lt;p>And if you’re a member who’s not already registering funding information, why not look into &lt;a href="https://www.crossref.org/education/funder-registry/">getting started?&lt;/a> It all leads to richer metadata which means more people can find, cite and re-use research &amp;ndash; and we all know that’s a &lt;a href="https://scholarlykitchen.sspnet.org/2019/06/11/better-metadata-could-help-save-the-world/" target="_blank">good thing&lt;/a>&amp;hellip;&lt;/p></description></item><item><title>What if I told you that bibliographic references can be structured?</title><link>https://www.crossref.org/blog/what-if-i-told-you-that-bibliographic-references-can-be-structured/</link><pubDate>Mon, 08 Jul 2019 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/what-if-i-told-you-that-bibliographic-references-can-be-structured/</guid><description>&lt;p>Last year I spent several weeks studying how to automatically match unstructured references to DOIs (you can read about these experiments in &lt;a href="https://doi.org/10.64000/e6ey2-wce96" target="_blank">my previous blog posts&lt;/a>). But what about references that are not in the form of an unstructured string, but rather a structured collection of metadata fields? Are we matching them, and how? Let&amp;rsquo;s find out.&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>43% of open/limited references deposited with Crossref have no publisher-asserted DOI and no unstructured string. This means they need a matching approach suitable for structured references. &lt;em>[EDIT 6th June 2022 - all references are now open by default].&lt;/em>&lt;/li>
&lt;li>I adapted our new matching algorithms: Search-Based Matching (SBM) and Search-Based Matching with Validation (SMBV) to work with both structured and unstructured references.&lt;/li>
&lt;li>I compared three matching algorithms: Crossref&amp;rsquo;s current (legacy) algorithm, SBM and SBMV, using a dataset of 2,000 structured references randomly chosen from Crossref&amp;rsquo;s references.&lt;/li>
&lt;li>SBMV and the legacy algorithm performed almost the same. SBMV&amp;rsquo;s F1 was slightly better (0.9660 vs. 0.9593).&lt;/li>
&lt;li>Similarly as in the case of unstructured references, SBMV achieved slightly lower precision and better recall than the legacy algorithm.&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>Those of you who often read scientific papers are probably used to bibliographic references in the form of unstructured strings, as they appear in the bibliography, for example:&lt;/p>
&lt;pre tabindex="0">&lt;code>[5] Elizabeth Lundberg, “Humanism on Gallifrey,” Science Fiction Studies, vol. 40, no. 2, p. 382, 2013.
&lt;/code>&lt;/pre>&lt;p>This form, however, is not the only way we can store the information about the referenced paper. An alternative is a structured, more machine-readable form, for example using BibTeX format:&lt;/p>
&lt;pre tabindex="0">&lt;code>@article{Elizabeth_Lundberg_2013,
year = 2013,
publisher = {{SF}-{TH}, Inc.},
volume = {40},
number = {2},
pages = {382},
author = {Elizabeth Lundberg},
title = {Humanism on Gallifrey},
journal = {Science Fiction Studies}
}
&lt;/code>&lt;/pre>&lt;p>Probably the most concise way to provide the information about the referenced document is to use its identifier, for example (🥁drum roll&amp;hellip;) the DOI:&lt;/p>
&lt;pre tabindex="0">&lt;code>&amp;lt;https://doi.org/10.5621/sciefictstud.40.2.0382&amp;gt;
&lt;/code>&lt;/pre>&lt;p>It is important to understand that these three representations (DOI, structured reference and unstructured reference) are not equivalent. The amount of information they carry varies:&lt;/p>
&lt;ul>
&lt;li>The DOI, by definition, provides the full information about the referenced document, because it identifies it without a doubt. Even though the metadata and content are not directly present in the DOI string, they can be easily and deterministically accessed. It is by far the preferred representation of the referenced document.&lt;/li>
&lt;li>The structured reference contains the metadata of the referenced object, but it doesn&amp;rsquo;t identify the referenced object without a doubt. In our example, we know that the paper was published in 2013 by Elizabeth Lundberg, but we might not know exactly which paper it is, especially if there are more than one document with the same or similar metadata.&lt;/li>
&lt;li>The unstructured reference contains the metadata field values, but without the names of the fields. This also doesn&amp;rsquo;t identify the referenced document, and even its metadata is not known without a doubt. In our example, we know that the word “Science” appears somewhere in the metadata, but we don&amp;rsquo;t know for sure whether it is a part of the title, journal title, or maybe the author&amp;rsquo;s (very cool) name.&lt;/li>
&lt;/ul>
&lt;p>The diagram presents the relationships between all these three forms:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/structured_matching_reference_forms.png"
alt="reference forms" width="800px">
&lt;/figure>
&lt;br/>
&lt;p>The arrows show actions that Crossref has to perform to transform one form to another.&lt;/p>
&lt;p>Green transformations are in general easy and can be done without introducing any errors. The reason is that green arrows go from more information to less information. We all know how easy it is to forget important stuff!&lt;/p>
&lt;p>Green transformations are typically performed when the publication is being created. At the beginning the author can access the DOI of the referenced document, because they know exactly which document it is. Then, they can extract the bibliographic metadata (the structured form) of the document based on the DOI, for example by following the DOI to the document&amp;rsquo;s webpage or retrieving the metadata from &lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">Crossref&amp;rsquo;s REST API&lt;/a>. Finally, the structured form can be formatted into an unstructured string using, for example, the &lt;a href="https://en.wikipedia.org/wiki/CiteProc" target="_blank">CiteProc&lt;/a> tool.&lt;/p>
&lt;p>We&amp;rsquo;ve also automated it further and these two green transformation (getting the document&amp;rsquo;s metadata based on the DOI and formatting it into a string) can be done in one go using &lt;a href="https://www.crossref.org/labs/citation-formatting-service/">Crossref&amp;rsquo;s content negotiation&lt;/a>.&lt;/p>
&lt;p>Red transformations are often done in systems that store bibliographic metadata (like our own metadata collection), often at a large scale. In these systems, we typically want to have DOIs (or other unique identifiers) of the referenced documents, but in practise we often have only structured and/or unstructured form. To fix this, we match references. Some systems also perform reference parsing (thankfully, we discovered &lt;a href="https://www.crossref.org/labs/resolving-citations-we-dont-need-no-stinkin-parser/">we do not need to do this in our case&lt;/a>).&lt;/p>
&lt;p>In general, red transformations are difficult, because we have to go from less information to more information, effectively recreating the information that has been lost during paper writing. This requires a bit of reasoning, educated guessing, and juggling probabilities. Data errors, noise, and sparsity make the situation even more dire. As a result, we do not expect any matching or parsing algorithm to be always correct. Instead, we perform evaluations (like in this blog post) to capture how well they perform on average.&lt;/p>
&lt;p>My &lt;a href="https://doi.org/10.64000/e6ey2-wce96" target="_blank">previous blog post&lt;/a> focused on matching unstructured references to DOIs (long red &amp;ldquo;matching&amp;rdquo; arrow). In this one, I analyse how well we can match structured references to DOIs (short red &amp;ldquo;matching&amp;rdquo; arrow).&lt;/p>
&lt;h2 id="references-in-crossref">References in Crossref&lt;/h2>
&lt;p>You might be asking yourself how important it is to have the matching algorithm working for both structured and unstructured references. Let&amp;rsquo;s look more closely at the references our matching algorithm has to deal with.&lt;/p>
&lt;p>29% of open/limited references deposited with Crossref already have the DOI provided by the publisher member. At Crossref, when we come across those references, we start dancing on a rainbow to the tunes of &lt;a href="https://en.wikipedia.org/wiki/Linkin_Park" target="_blank">Linkin Park&lt;/a>, while the references holding their DOIs sprinkle from the sky. Some of us sing along. We live for those moments, so if you care about us, please provide as many DOIs in your references as possible!&lt;/p>
&lt;p>You might be wondering how we are sure these publisher-provided DOIs are correct. The short answer is that we are not. After all, the publisher might have used an automated matcher to insert the DOIs before depositing the metadata. Nevertheless, our current workflow assumes these publisher-provided DOIs are correct and we simply accept them as they are.&lt;/p>
&lt;p>Unfortunately, the remaining 71% of references are deposited without a DOI. Those are the references we try to match ourselves.&lt;/p>
&lt;p>Here is the distribution of all the open/limited references:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/structured_matching_reference_distribution.png"
alt="reference distibution" width="600px">
&lt;/figure>
&lt;p>17% of the references are deposited with no DOI and both structured and unstructured form. 11% have no DOI and only an unstructured form, and 43% have no DOI and only a structured form. These 43% cannot be directly processed by the unstructured matching algorithm.&lt;/p>
&lt;p>This distribution clearly shows that we need a matching algorithm able to process both structured and unstructured references. If our algorithm worked only with one type, we would miss a large percentage of the input references, and the quality of our citation metadata would be questionable.&lt;/p>
&lt;h2 id="the-analysis">The analysis&lt;/h2>
&lt;p>Let&amp;rsquo;s get to the point. I evaluated and compared three matching algorithms, focusing on the structured references.&lt;/p>
&lt;p>The first algorithm is one of the legacy algorithms currently used in Crossref. It uses fuzzy querying in a relational database to find the best matching DOI for the given structured reference. It can be accessed through a &lt;a href="https://support.crossref.org/hc/en-us/articles/214880143-OpenURL%23openurl2" target="_blank">Crossref OpenURL&lt;/a> query.&lt;/p>
&lt;p>The second algorithm is an adaptation of the Search-Based Matching (SBM) algorithm for structured references. In this algorithm, we concatenate all metadata fields of the reference and use it to search in the Crossref&amp;rsquo;s REST API. The first hit is returned as the target DOI if its relevance score exceeds the predefined threshold.&lt;/p>
&lt;p>The third algorithm is an adaptation of the Search-Based Matching with Validation (SBMV) for structured references. Similarly as in the case of SBM, we also concatenate all metadata fields of the input reference and use it to search in the &lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">Crossref&amp;rsquo;s REST API&lt;/a>. Next, a number of top hits are considered as candidates and their similarity score with the input reference is calculated. The candidate with the highest similarity score is returned as the target DOI if its score exceeds the predefined threshold. The similarity score is based on fuzzy comparison of the metadata field values between the candidate and the input reference.&lt;/p>
&lt;p>I compared these three algorithms on a test set composed of 2,000 structured bibliographic references randomly chosen from Crossref&amp;rsquo;s metadata. For each reference, I manually checked the output of all matching algorithms, and in some cases performed additional manual searching. This resulted in the true target DOI (or null) assigned to each reference.&lt;/p>
&lt;p>The metrics are the same as in the previous evaluations: precision, recall and F1 calculated over the set of input references.&lt;/p>
&lt;p>The thresholds for SBM and SBMV algorithms were chosen on a separate validation dataset. The validation dataset also contains 2,000 structured references with manually-verified target DOIs.&lt;/p>
&lt;h2 id="the-results">The results&lt;/h2>
&lt;p>The plot shows the results of the evaluation of all three algorithms:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/structured_matching_results.png"
alt="structured matching evaluation results" width="600px">
&lt;/figure>
&lt;br/>
&lt;p>The vertical black lines on top of the bars represent the confidence intervals.&lt;/p>
&lt;p>As we can see, SBMV and the legacy approach achieved very similar results. SBMV slightly outperforms the legacy approach in F1: 0.9660 vs. 0.9593.&lt;/p>
&lt;p>SBMV is slightly worse that the legacy approach in precision (0.9831 vs. 0.9929) and better in recall (0.9495 vs. 0.9280).&lt;/p>
&lt;p>The SBM algorithm performs the worst, especially in precision. Why is there such a huge difference between SBM and SBMV? The algorithms differ in the post-processing validation stage. SBM relies on the ability of the search engine to select the best target DOI, while SBMV re-scores a number of candidates obtained from the search engine using custom similarity. The results here suggest that in the case of structured references, the right target DOI is usually somewhere close to the top of the search results, but often it is not in the first position. One of the reasons might be missing titles in 76% of the structured references, which can confuse the search engine.&lt;/p>
&lt;p>Let&amp;rsquo;s look more closely at a few interesting cases in our test set:&lt;/p>
&lt;pre tabindex="0">&lt;code>first-page = 1000
article-title = Sequence capture using PCR-generated probes: a cost-effective method of targeted high-throughput sequencing for nonmodel organisms
volume = 14
author = Peñalba
year = 2014
journal-title = Molecular Ecology Resources
&lt;/code>&lt;/pre>&lt;p>The reference above was successfully matched by SBMV to &lt;a href="https://doi.org/10.1111/1755-0998.12249" target="_blank">https://doi.org/10.1111/1755-0998.12249&lt;/a>, even though the document&amp;rsquo;s volume and pages are missing from Crossref&amp;rsquo;s metadata.&lt;/p>
&lt;pre tabindex="0">&lt;code>issue = 2
first-page = 101
volume = 6
author = Abraham
year = 1987
journal-title = Promoter: An Automated Promotion Evaluation System
&lt;/code>&lt;/pre>&lt;p>Here the structure incorrectly labels article title as journal title. Despite this, the reference was correctly matched by our brave SBMV to &lt;a href="https://doi.org/10.1287/mksc.6.2.101" target="_blank">https://doi.org/10.1287/mksc.6.2.101&lt;/a>.&lt;/p>
&lt;pre tabindex="0">&lt;code>author = Marshall Day C.
volume = 39
first-page = 572
year = 1949
journal-title = India. J. A. D. A.
&lt;/code>&lt;/pre>&lt;p>Above we have most likely a parsing error. A part of the article title appears in the journal name, and the main journal name is abbreviated. ‘I see what you did there, my old friend Parsing Algorithm! Only a minor obstacle!&amp;rsquo; said SBMV, and matched the reference to &lt;a href="https://doi.org/10.14219/jada.archive.1949.0114" target="_blank">https://doi.org/10.14219/jada.archive.1949.0114&lt;/a>.&lt;/p>
&lt;pre tabindex="0">&lt;code>volume = 5
year = 2015
article-title = A retrospective analysis of the effect of discussion in teleconference and face-to-face scientific peer-review panels
journal-title = BMJ Open
&lt;/code>&lt;/pre>&lt;p>Here the the page number and author are not in the structure, but our invincible SBMV jumped over the holes left by the missing metadata and gracefully grabbed the right DOI &lt;a href="https://doi.org/10.1136/bmjopen-2015-009138" target="_blank">https://doi.org/10.1136/bmjopen-2015-009138&lt;/a>.&lt;/p>
&lt;pre tabindex="0">&lt;code>issue = 2
first-page = 533
volume = 30
author = Uthman BM
year = 1989
journal-title = Epilepsia
&lt;/code>&lt;/pre>&lt;p>In this case we have a mismatch in the page number (“533” vs. “S33”). But did SBMV give up and burst into tears? I think we already know the answer! Of course, it conquered the nasty typo with the sword made of fuzzy comparisons (yes, it&amp;rsquo;s a thing!) and brought us back the correct DOI &lt;a href="https://doi.org/10.1111/j.1528-1157.1989.tb05823.x" target="_blank">https://doi.org/10.1111/j.1528-1157.1989.tb05823.x&lt;/a>.&lt;/p>
&lt;h2 id="structured-vs-unstructured">Structured vs. unstructured&lt;/h2>
&lt;p>How does matching structured references compare to matching unstructured references?&lt;/p>
&lt;p>The general trends are the same. For both structured and unstructured references, SBMV outperforms the legacy approach in F1, achieving worse precision and better recall. This tells us that our legacy algorithms are more strict and as a result they miss some links.&lt;/p>
&lt;p>Structured reference matching seems easier than unstructured reference matching. The reason is that when we have the structure, we can compare the input reference to the candidate field by field, which is more precise than using the unstructured string.&lt;/p>
&lt;p>Structured matching, however, in practise brings new challenges. One big problem is data sparsity. 15% of structured references without DOIs have fewer than four metadata fields. This is not always enough to identify the DOI. Also, 76% of the structured references without DOIs do not contain the article title, which poses a problem for candidate selection using the search engine.&lt;/p>
&lt;h2 id="whats-next">What&amp;rsquo;s next?&lt;/h2>
&lt;p>So far, I have focused on evaluating SBMV for unstructured and structured references separately. 17% of the open/limited references at Crossref, however, have both unstructured and structured form. In those cases, it might be beneficial to use the information from both forms. I plan to perform some experiments on this soon.&lt;/p>
&lt;p>The data and code for this evaluation can be found at &lt;a href="https://github.com/CrossRef/reference-matching-evaluation" target="_blank">https://github.com/CrossRef/reference-matching-evaluation&lt;/a>. The Java version of SBMV (for both structured and unstructured references) can be found at &lt;a href="https://gitlab.com/crossref/search-based-reference-matcher" target="_blank">https://gitlab.com/crossref/search-based-reference-matcher&lt;/a>.&lt;/p></description></item><item><title>License metadata FTW</title><link>https://www.crossref.org/blog/license-metadata-ftw/</link><pubDate>Fri, 28 Jun 2019 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/license-metadata-ftw/</guid><description>&lt;p>More and better license information is at the top of a lot of Christmas lists from a lot of research institutions and others who regularly use Crossref metadata. I know, I normally just ask for socks too. To help explain what we mean by this, we&amp;rsquo;ve collaborated with &lt;a href="https://www.jisc.ac.uk/" target="_blank">Jisc&lt;/a> to set out some guidance for publishers on registering this license metadata with us.&lt;/p>
&lt;p>At the most basic level, complete and accurate license metadata helps anyone interested in using a research work out how they can do so. Making the information machine-readable helps this to be done easily and at scale by all kinds of tools and services.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/license-information/" target="_blank">In this best practice guide&lt;/a>, we’re specifically focusing on a use case for license metadata that comes from research institutions. They need to know which version of an article (or other content item) may be exposed in an open repository, and from what date, and tell anyone who comes across the piece of content in the repository what they can do with it once they find it there.&lt;/p>
&lt;p>Without this being stated simply and clearly in the Crossref metadata, the institution won’t know which works they can make available and which they cannot, even if you as the publisher know that the item is open access, or is open access after a certain date. This can impact the research community’s capacity to find and use the research you publish.&lt;/p>
&lt;p>The guidance offers advice on:&lt;/p>
&lt;ol>
&lt;li>the kind of license information it’s useful to link out to from the Crossref metadata&lt;/li>
&lt;li>what the Crossref metadata might look like for:&lt;/li>
&lt;/ol>
&lt;ul>
&lt;li>gold open access content&lt;/li>
&lt;li>green open access content with a Creative Commons License&lt;/li>
&lt;li>green open access content with a publisher-defined post-embargo license&lt;/li>
&lt;/ul>
&lt;ol start="3">
&lt;li>how to add this metadata to existing or new Crossref deposits&lt;/li>
&lt;/ol>
&lt;h3 id="take-a-look-at-the-full-guidelinesdocumentationschema-librarymarkup-guide-metadata-segmentslicense-information-here">Take a look at &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/license-information/">the full guidelines&lt;/a> here.&lt;/h3>
&lt;p>Maybe there’s more to the story than this, or more information that you need as a publisher or as a research institution - if so, &lt;a href="mailto:feedback@crossref.org">let us know&lt;/a> and we can adapt this document based on your feedback. Requests for socks may be declined.&lt;/p></description></item><item><title>Rest in peace Christine Hone</title><link>https://www.crossref.org/blog/rest-in-peace-christine-hone/</link><pubDate>Sun, 09 Jun 2019 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/rest-in-peace-christine-hone/</guid><description>&lt;p>&lt;em>Our friend and colleague Christine Hone (née Buske) passed away in May from a short but brutal illness. Here is our attempt at &amp;lsquo;some words&amp;rsquo;, which we wrote for her funeral book and are posting here with her husband Dave&amp;rsquo;s permission.&lt;/em>&lt;/p>
&lt;p>We are devastated to lose Christine as a colleague and friend. It’s hard to put into words the effect she had on our small organisation in such a short time, and how much we’re already missing her. But here it goes.&lt;/p>
&lt;p>It was 2015 when some of us first met Chris, and we immediately saw how much of an asset she could be to our organisation. She was very active in the community and well-known in many academic and publishing circles around the world. And she had an enviable combination of technical skills, a scientific mind, and a natural ability to engage people.&lt;/p>
&lt;p>We tried to recruit her back then but she was in demand by others and it wasn’t until early 2018 that we succeeded. We finally got her! She became the Product Manager for a very advanced and complex system but she took to it perfectly, with real excitement and a complete understanding of how we (and therefore she) could help the research community all over the world see and make connections.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/christine-headshot.jpg"
alt="Christine&amp;rsquo;s official Crossref headshot 😊" width="40%">&lt;figcaption>
&lt;p>Christine&amp;rsquo;s official Crossref headshot 😊&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>With colleagues spread around the world, she joined an organisation that had exciting opportunities and its share of challenges. Chris engaged with all of this head-on. She handled a constant stream of queries from people spread across time zones, whilst at the same time getting to grips with a service that was difficult to pin down. She balanced these tasks which were at very opposite ends of the spectrum. She added so much and with such energy and intelligence to everything she got involved in, always bringing human attention and creativity.&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/winners-chris-uksg-2018.jpg"
alt="Chris was also on the winning team at 2018&amp;rsquo;s UKSG quiz!" width="50%">&lt;figcaption>
&lt;p>Chris was also on the winning team at 2018&amp;rsquo;s UKSG quiz!&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/uksg-chris-2018.jpg"
alt="Ed, Amanda, and Chris: the Crossref contingent of the winning quiz team" width="50%">&lt;figcaption>
&lt;p>Ed, Amanda, and Chris: the Crossref contingent of the winning quiz team&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/p>
&lt;p>In her talk at the 5:AM altmetrics conference she brought together technical detail, big-picture ideas, and her own particular passion. Her opening words were “My name is Christine and I’m a recovering fish scientist”. Never afraid to bring her personal brand of humour into the workplace, her opening slide was a photograph of her covered in rats. That presentation was the first time that much of the audience really understood our service. Having cracked the messaging for us, she was due to give the same talk at our annual meeting in Toronto a few months later…&lt;/p>
&lt;p>&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/5am-rats.png"
alt="Chris&amp;rsquo;s opening slide at her 5:AM talk" width="50%">&lt;figcaption>
&lt;p>Chris&amp;rsquo;s opening slide at her 5:AM talk&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/5am-chris.jpg"
alt="Chris giving her now legendary talk at 5:AM on Event Data" width="50%">&lt;figcaption>
&lt;p>Chris giving her now legendary talk at 5:AM on Event Data&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/p>
&lt;p>Many of us were in Toronto for that meeting; it was two weeks after we’d heard the news of her diagnosis. Some of us were able to visit her in the hospital where she told us of her and Dave’s decision to bring forward their wedding plans. It was a bittersweet announcement but, clearly, they adored each other and were determined to be happy together despite the challenging times ahead.&lt;/p>
&lt;p>Over the last few months, even when she had little energy to spare, Chris popped in (virtually) to chat and update us, share pictures and, selflessly, to see how we were doing. Even people who never met or worked closely with her started to follow her vlog and exchange notes and news directly.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/chris-message-may-9-2018.png"
alt="Always checking in with us, an update from Chris shortly before she passed" width="60%">&lt;figcaption>
&lt;p>Always checking in with us, an update from Chris shortly before she passed&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>We have all been rocked by the news and there is a lot of sadness and grief among the Crossref staff and community. Even in the last moments we shared together Chris always asked about how her projects were going. Her passion for her products was a big part of what animated her when she first joined. Throughout her late-stage illness, this remained constant. She yearned to return to work. This zeal will forever be an inspiration to us all at Crossref.&lt;/p>
&lt;p>&lt;a href="https://web.archive.org/web/20190609161008/https://www.crossref.org/people/christine-buske/" target="_blank">Christine&lt;/a> taught us a lot, through her work, with her attitude to life, and in &lt;a href="https://www.youtube.com/channel/UCeUvw-bWejaHH3bhaB6aEEg" target="_blank">the manner that she dealt with this terrible illness&lt;/a>. We thank her for giving us so many great memories and we will never forget her.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/2019/chris4.png"
alt="A Crossref photoshoot; our Christine ❤️" width="80%">&lt;figcaption>
&lt;p>A Crossref photoshoot; our Christine ❤️&lt;/p>
&lt;/figcaption>
&lt;/figure></description></item><item><title>Similarity Check is changing</title><link>https://www.crossref.org/blog/similarity-check-is-changing/</link><pubDate>Thu, 30 May 2019 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/similarity-check-is-changing/</guid><description>&lt;h2 id="tldr">Tl;dr&lt;/h2>
&lt;p>Crossref is taking over the service management of Similarity Check from Turnitin. That means we&amp;rsquo;re your first port of call for questions and your agreement will be direct with us. This is a very good thing because we have agreed and will continue to agree the best possible set-up for our collective membership. Similarity Check participants need to take action to confirm the new terms with us as soon as possible and before 31st August 2019. Instructions will be circulated early June via email.&lt;/p>
&lt;h2 id="background">Background&lt;/h2>
&lt;p>Many of our members use &lt;a href="https://www.crossref.org/services/similarity-check">Similarity Check&lt;/a> which gives their editors reduced-rate access to Turnitin’s iThenticate system for plagiarism checking. Some use Similarity Check directly and some as part of a submission system.&lt;/p>
&lt;p>The service launched in 2008 when we &lt;a href="https://www.crossref.org/news/2008-06-19-crosscheck-plagiarism-screening-service-launches-today">announced our initial partnership with Turnitin&lt;/a>. Since then it&amp;rsquo;s gone from strength to strength and now has over 60 million full-text documents (from over 87 thousand titles) available for text comparison and almost 1500 members using the service.&lt;/p>
&lt;p>The way that the Similarity Check arrangement works is changing, and it’s important that users know what’s happening. We have worked with Turnitin to set up a process that will transition participants easily and swiftly into the upgraded service with no access interruptions to iThenticate access.&lt;/p>
&lt;h2 id="so-what-is-changing-and-why">So, what is changing and why?&lt;/h2>
&lt;p>We know that Similarity Check is a critical service for our members, and we want to improve people&amp;rsquo;s experience of using it. So, in consultation with members, we’ve strengthened the service by updating our relationship with Turnitin to consolidate all the components of the service under our care and stewardship. From next week, Similarity Check participants will move from having an agreement with Turnitin to one with Crossref. And at Crossref, we have a new agreement with Turnitin as the technology provider for the service.&lt;/p>
&lt;p>The new arrangement puts us in a strong position to improve support and drive future improvements of the system. Representing our collective membership, we’ve agreed better terms than what people have today and what members would get acting individually.&lt;/p>
&lt;p>There are five key changes specifically:&lt;/p>
&lt;ol>
&lt;li>Members&amp;rsquo; Similarity Check service agreement will be with us and not Turnitin.&lt;/li>
&lt;li>Per-document checking fees will be invoiced by us, and not Turnitin. They’ll be included in members&amp;rsquo; regular invoices, reducing international transfer fees for many.&lt;/li>
&lt;li>The first 100 documents checked each year will be free of charge.&lt;/li>
&lt;li>Turnitin will operate as a vendor for Crossref. We’ve already agreed a range of additions to their technology roadmap. Turnitin will remain responsible for fixing any bugs or technical issues with the system, but we&amp;rsquo;re in a stronger position to ensure these are fixed quickly.&lt;/li>
&lt;li>Users will get training and on-boarding support from Crossref. This will cover both how to use the interface and how to interpret the results.&lt;/li>
&lt;/ol>
&lt;h2 id="whats-staying-the-same">What’s staying the same?&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>The system itself and how it&amp;rsquo;s accessed&lt;/strong> - people&amp;rsquo;s logins will stay exactly the same and nothing will change about how participants have their systems set up.&lt;/li>
&lt;li>&lt;strong>The &lt;a href="https://www.crossref.org/fees/#similarity-check-fees">fees&lt;/a>&lt;/strong> - the annual Similarity Check fee and the per-document checking fees will remain at the same level (although under the new arrangement users will get the first 100 documents each year for free - see &amp;ldquo;what&amp;rsquo;s changing&amp;rdquo; above!)&lt;/li>
&lt;li>&lt;strong>Your service obligations&lt;/strong> - members still need to make at least 90% of all their journal article content available for Turnitin to index. This is achieved through the dedicated full-text URLs that members register in their metadata.&lt;/li>
&lt;li>&lt;strong>Licensing and privacy&lt;/strong> - there are no changes to the licensing of members&amp;rsquo; content or the privacy requirements for Turnitin’s use of member content.&lt;/li>
&lt;/ul>
&lt;h2 id="for-existing-users">For existing users&lt;/h2>
&lt;p>We’ve worked closely with Turnitin to ensure an easy transition to the &lt;a href="https://www.crossref.org/services/similarity-check/terms/">new Crossref terms&lt;/a>. You can transition to the new terms at any stage from next week through to 31st August, and Turnitin will end your contract with them in the month you take that action.&lt;/p>
&lt;p>Next week, we’ll email your main Crossref membership contact with a link to a form asking them to click-through accept the new terms. This will confirm and commence the transition process.&lt;/p>
&lt;p>You’ll then need to:&lt;/p>
&lt;ul>
&lt;li>Pay your final Turnitin invoice, which will be sent at the end of the month you complete the form. This will cover your per-document checking fees up to the 25th of that month.&lt;/li>
&lt;li>Continue to use iThenticate as usual.&lt;/li>
&lt;/ul>
&lt;p>Your service agreement will officially move from the Turnitin agreement to the Crossref agreement on the 25th of the month that you complete the transition form. The next Similarity Check invoices you receive will be from Crossref in January 2020 and will include your Similarity Check annual fee and your per-document checking fees for the remainder of 2019.&lt;/p>
&lt;p>If you haven’t transitioned to the new agreement by 31st August, you risk losing access to the iThenticate system as Turnitin will not be able to automatically renew your direct contract.&lt;/p>
&lt;p>If you have any questions about these changes, do contact our &lt;a href="mailto:member@crossref.org">membership specialist&lt;/a>. We’ll be in touch next week with a link to a new form where you’ll be able to check your details and accept the new agreement directly with us.&lt;/p>
&lt;h2 id="for-prospective-users">For prospective users&lt;/h2>
&lt;p>When you apply to participate in Similarity Check you will be accepting terms directly with Crossref and not Turnitin. Eligible members can apply any time from next week.&lt;/p>
&lt;h2 id="any-questions">Any questions?&lt;/h2>
&lt;p>There are many benefits to this new set-up, but we understand these things can be a bit of a hassle. We&amp;rsquo;ve welcomed a new colleague (say hello to &lt;a href="https://www.crossref.org/people/kathleen-luschek">Kathleen&lt;/a>!) to help people transition and get the best from their use of Similarity Check. Please &lt;a href="mailto:support@crossref.org">contact her via Support&lt;/a> with any questions.&lt;/p>
&lt;blockquote>
&lt;p>[Update June 5th: we&amp;rsquo;ve added a new &lt;a href="https://www.crossref.org/faqs/similarity-check-transition/">FAQ page&lt;/a> for members who signed up for Similarity Check prior to June 2019]&lt;/p>
&lt;/blockquote></description></item><item><title>Putting content in context</title><link>https://www.crossref.org/blog/putting-content-in-context/</link><pubDate>Mon, 13 May 2019 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/putting-content-in-context/</guid><description>&lt;p>You can’t go far on this blog without reading about the importance of registering &lt;a href="https://doi.org/10.64000/5bxhj-wws87" target="_blank">rich metadata&lt;/a>. Over the past year we’ve been encouraging all of our members to review the metadata they are sending us and find out which gaps need filling by looking at their &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Report&lt;/a>.&lt;/p>
&lt;p>The metadata elements that are tracked in Participation Reports are mostly beyond the standard bibliographic information that is used to identify a work. They are important because they provide context: they tell the reader how the research was funded, what license it’s published under, and more about its authors via links to their &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a> profiles. And while this metadata is all available through our APIs, we also display much of it to readers through our &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> service.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/crossmark.png" alt=“the crossmark box" height="448px" width="350px" class="img-responsive" />
&lt;/div>
&lt;p>Crossmark is also about providing context. It is a button placed on content, which when clicked on brings up a pop-up box that tells the reader about significant updates such as corrections and retractions, together with other information about the publishing and editorial processes that have been applied to the content ahead of publication.&lt;/p>
&lt;p>The Crossmark box can display information about authors, funders and licenses. In addition, our members can add “More information” and often do in the form of publication history, links to supporting materials, and peer review information. All of this supporting information helps the reader assess how well the content has been - and continues to be - curated by the publisher.&lt;/p>
&lt;h3 id="whos-in">Who’s in?&lt;/h3>
&lt;p>250 Crossref members have signed up to use Crossmark (it’s an add-on service with its own &lt;a href="https://www.crossref.org/fees/#crossmark-fees">fees&lt;/a>). Though optional, some star pupils have even added Crossmark to their back-year content and as a result have Crossmark coverage on 99% of their content (kudos to PLOS, Rockefeller University Press and the societies represented by KAMJE, to name a few).&lt;/p>
&lt;p>At the other extreme, some have applied Crossmark to less than 10% - these tend to be members with back-year records going back many decades, who are just implementing Crossmark for their more recent research outputs. Crossmark coverage is one of the things tracked in Participation Reports - pop over and &lt;a href="https://www.crossref.org/members/prep/" target="_blank">take a look&lt;/a> if you want to see what your organisation is doing.&lt;/p>
&lt;p>So what additional metadata has been registered by members using Crossmark? (data snapshot from our &lt;a href="https://www.crossref.org/education/retrieve-metadata/rest-api/">REST API&lt;/a> April 2019):&lt;/p>
&lt;ul>
&lt;li>8,711,500 content items have some Crossmark metadata&lt;/li>
&lt;li>104,650 updates to content have been registered. Of these&lt;/li>
&lt;li>55,000 are corrections and 28,000 errata&lt;/li>
&lt;li>16,000 are new versions or new editions&lt;/li>
&lt;li>2,700 are retractions and 1,280 are withdrawals&lt;/li>
&lt;li>4,830,510 content items have some custom metadata, which appears in the More Information section of the Crossmark box. The most common metadata provided here is publication history, followed by copyright statements, the peer review method used, and whether the item has been checked for originality using Similarity Check.&lt;/li>
&lt;/ul>
&lt;h3 id="some-news-on-clicks-and-views">Some news on clicks and views&lt;/h3>
&lt;p>We’ve been collecting usage statistics more or less since the Crossmark service launched in 2012, but have lacked a suitable way to share them. This will change later this year! In preparation, I’ve been digging around in the data and uncovered some interesting things.&lt;/p>
&lt;p>I was able to do a degree of comparison between Crossmark usage against overall article views using PLOS articles as they make their usage data openly available. I spot-checked fifteen articles and found that most of them had a monthly number of clicks on the Crossmark button in the low-twenties, regardless of the number of total page views the article had received.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/crossmark-plos-stats.png" alt=“graph of crossmark clicks vs article views" height="267px" width="600px" class="img-responsive" />
&lt;/div>
&lt;p>The highly viewed paper above shows relatively very few clicks on the Crossmark button, whereas on the paper with fewer views, below, clicks on the button follow the overall pattern of usage.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/crossmark-plos-stats-2.png" alt=“graph of crossmark clicks vs article views" height="267px" width="600px" class="img-responsive" />
&lt;/div>
&lt;p>It’s not unreasonable to suppose that a paper with very high usage has a higher proportion of lay readers visiting it, whereas a more niche paper is being visited by those with a research interest. This is encouraging, as it suggests researchers are interested in checking the status of the content and the additional “trust signals” that the Crossmark box can provide.&lt;/p>
&lt;h3 id="web-pages-vs-pdfs">Web pages vs PDFs&lt;/h3>
&lt;p>We track the number of clicks on the Crossmark button in PDFs separately to those that come from web pages. (There are some that we can’t determine, usually because the link behind the button has been incorrectly formatted, but for most members these are minimal.)&lt;/p>
&lt;p>I looked at the 30 members with most Crossmark coverage, and averaged the number of clicks over a six month period in 2018. For two thirds of these members, clicks on the Crossmark button on their web pages exceed those in their PDFs, but there are also definite outliers.&lt;/p>
&lt;div style="text-align:center;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/crossmark-pdf-html.png" alt=“graph of crossmark clicks vs article views" height="370px" width="600px" class="img-responsive" />
&lt;/div>
&lt;p>Some are easily explained: member #6 hasn’t put the Crossmark button in any of their PDFs, while member #21 has &lt;em>only&lt;/em> put it in their PDFs. Member 10 has the button on its article landing pages hidden in a “more information” section that the reader has to click to expand.&lt;/p>
&lt;p>That said, member #20 has the button displayed prominently next to the article title but gets 85% of Crossmark clicks from PDFs. There’s no obvious subject bias - four of the members above are physics publishers - two have many more PDF clicks, two have more HTML.&lt;/p>
&lt;p>None of the findings above contain nearly enough data to draw any definitive conclusions, but I hope they pique your interest to find out more when we make Crossmark usage statistics available to all members later this year. In the meantime if you have any suggestions/questions, or would be interested in helping us when we come to testing the statistics interface, please &lt;a href="mailto:kmeddings@crossref.org">let me know&lt;/a>.&lt;/p></description></item><item><title>A simpler text query form</title><link>https://www.crossref.org/blog/a-simpler-text-query-form/</link><pubDate>Tue, 30 Apr 2019 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/a-simpler-text-query-form/</guid><description>&lt;p>The &lt;a href="https://apps.crossref.org/SimpleTextQuery" target="_blank">Simple Text Query form&lt;/a> (STQ) allows users to retrieve existing DOIs for journal articles, books, and chapters by cutting and pasting a reference or reference list into a simple query box. For years the service has been heavily used by students, editors, researchers, and publishers eager to match and link references.&lt;/p>
&lt;p>We had changes to the service planned for the first half of this year - an upgraded reference matching algorithm, a more modern interface, etc. In the spirit of openness and transparency, part of our project plan was to communicate these pending changes to STQ users well in advance of our 30 April completion date. What would users think? Could they help us improve upon our plans?&lt;/p>
&lt;p>About a month ago, I reached out to the 21,000 plus users we had on record of using STQ since January 2018. We received nearly 85 responses from the messages we sent. Questions ranged from: if we were making changes, would PubMed ID matching be supported? To: What about the reliability of the returned reference links? And: Could we better accommodate larger reference lists?&lt;/p>
&lt;p>Many of the users we heard from told us how STQ was critical to their work. I read all these messages. The concerns raised by users were legitimate and much appreciated. We reassessed our project timeline and plans, and decided to shift course. So, what &lt;em>are&lt;/em> we doing?&lt;/p>
&lt;h3 id="whats-changing">What’s changing?&lt;/h3>
&lt;ul>
&lt;li>The previous hurdle of having to register your email address simply to return reference links was confusing and unnecessary. We removed it.&lt;/li>
&lt;li>We previously limited the number of monthly reference links to 5,000 per email address. Most didn’t reach the limit, but those who did were frustrated by it and/or found ways around it. We want you to match and register as many references as possible, so we removed the monthly limit too.&lt;/li>
&lt;li>Many of you with long reference lists found that you were occasionally reaching our limit of 30,000 characters per submission. Once again, we want you to match and register as many references as possible so we removed the character limit altogether and instead are just looking at the number of references per submission. We now provide space for 1,000 references per submission (We checked. The most references we have ever received via the STQ form in one submission was around 750. Thus, we rounded up.).&lt;/li>
&lt;li>We did make a change to the backend of the service. We updated &lt;a href="https://doi.org/10.64000/pdm9z-20m09" target="_blank">the algorithm&lt;/a> we use to return reference links. We think it’s &lt;a href="https://doi.org/10.64000/e6ey2-wce96" target="_blank">an improvement&lt;/a>. Let us know how you find it.&lt;/li>
&lt;/ul>
&lt;h3 id="whats-remaining-the-same">What’s remaining the same?&lt;/h3>
&lt;ul>
&lt;li>Core functionality. It&amp;rsquo;s all in the name. Retrieve DOIs for journal articles, books, and chapters by cutting and pasting a reference or reference list into a simple query box.&lt;/li>
&lt;li>PubMed ID matching. You use it. You need it. We’re keeping it.&lt;/li>
&lt;li>Deposits. You’ll still need an email address for this, but we won’t ask for it until you’re at the deposit screen.&lt;/li>
&lt;li>The interface. We’re still eager to give the user interface a much-needed refresh, but, as many users pointed out to us, there’s still some core functionality that’s important that we need to retain with any interface update. For instance, you need to be able to easily copy and paste reference links into your reference list. That functionality isn’t going anywhere.&lt;/li>
&lt;li>Resetting reference links. Submit references, match, reset, and repeat. Many users like the reset button. It’s not going anywhere either.&lt;/li>
&lt;/ul>
&lt;h3 id="xml-queries">XML queries&lt;/h3>
&lt;p>The change to the backend of the service that I mentioned above is not confined to reference matching and depositing for STQ users. XML queries for reference matching are also now powered by that new backend. We think it’s a seamless transition, but if you find it is not, please let us know.&lt;/p>
&lt;p>I’m excited for these changes and hope you are too. I invite you to try the simpler and improved STQ form, and &lt;a href="mailto:support@crossref.org">let us know what you think&lt;/a>.&lt;/p></description></item><item><title>Express your interest in serving on the Crossref board</title><link>https://www.crossref.org/blog/express-your-interest-in-serving-on-the-crossref-board/</link><pubDate>Wed, 24 Apr 2019 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/express-your-interest-in-serving-on-the-crossref-board/</guid><description>&lt;p>The Crossref Nominating Committee is inviting expressions of interest to serve on the Board as it begins its consideration of a slate for the November 2019 election.&lt;/p>
&lt;p>The board&amp;rsquo;s purpose is to provide strategic and financial oversight and counsel to the Executive Director and the staff leadership team, with the key responsibilities being:&lt;/p>
&lt;ol>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ol>
&lt;p>The Board tends to review the strategic direction every few years, taking a landscape view of the scholarly communications community and trends that may affect Crossref&amp;rsquo;s mission. In July 2017, the board and staff came up with four strategic themes and these have been developed into an &lt;a href="https://www.crossref.org/strategy">organisation-wide roadmap&lt;/a>.&lt;/p>
&lt;p>The board votes on any new policy or service that staff and committees propose if it is a departure from normal practice for Crossref.Some of the recent things the board has approved include:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Approval of all the new terms of membership; broadening of the membership eligibility criteria to include non-publishers.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Involvement in the ROR.org initiative including community outreach, technical prototyping, and helping to explore governance options.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Approval of a proposal for funders to join at a reduced annual fee; the registration of DOIs for research grants.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Allocating $50,000 USD of the operating budget to research the community&amp;rsquo;s level of interest in a distributed usage service.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Specifying the Board makeup to include equal numbers of small and large members; reframing the election processes.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="what-is-expected-of-a-crossref-board-member">What is expected of a Crossref Board member?&lt;/h3>
&lt;p>Board members should be able to attend all board meetings, which occur three times a year in different parts of the world. If you are unable to attend in person you may send your named alternate as your proxy or be able to attend via telephone.&lt;/p>
&lt;p>Board members must:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>be familiar with the three key responsibilities listed above;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>actively participate and contribute towards discussions; and&lt;/p>
&lt;/li>
&lt;li>
&lt;p>read the board documents and materials provided, prior to attending meetings.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="how-to-submit-an-expression-of-interest-to-serve-on-the-board">How to submit an expression of interest to serve on the Board&lt;/h3>
&lt;p>We are seeking people who know about scholarly communications and would like to be part of our future. If you have experience on a governing board (as opposed to an operational board) and have a vision for the international Crossref community, we are interested in hearing from you.&lt;/p>
&lt;p>If you are a Crossref member, are &lt;a href="https://doi.org/10.64000/wfmdf-hmv37" target="_blank">eligible to vote&lt;/a>, and would like to be considered, you can complete and submit the &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSdwqraD2fjb3eqZgLpTQWsMYPQvvz4LARLq6k8H8mA7xGbZAw/viewform" target="_blank">expression of interest form&lt;/a> with both your organisation&amp;rsquo;s statement and your personal statement before &lt;strong>21 May 2019&lt;/strong>.&lt;/p>
&lt;blockquote>
&lt;p>It is important to note it is your organisation who is the Crossref member&amp;mdash;and therefore the seat will belong to your organisation.&lt;/p>
&lt;/blockquote>
&lt;h3 id="about-the-election-and-our-board">About the election and our Board&lt;/h3>
&lt;p>We have a principle of &lt;a href="https://www.crossref.org/truths/">&amp;ldquo;one member, one vote&amp;rdquo;&lt;/a>; our board comprises a cross-section of members and it doesn&amp;rsquo;t matter how big or small you are, every member gets a single vote. Board terms are three years, and one third of the Board is eligible for election every year. There are five seats up for election in 2019, 4 large and 1 small.&lt;/p>
&lt;p>The board meets in a variety of international locations in March, July, and November each year. &lt;a href="https://www.crossref.org/board-and-governance/">View a list of the current Crossref Board members and a history of the decisions they&amp;rsquo;ve made (motions).&lt;/a>&lt;/p>
&lt;p>The slate will be decided by the Nominating Committee and interested parties will be informed if they have made the slate by July 15, 2019.&lt;/p>
&lt;p>The election opens online in September 2019 and voting is done by proxy online, results will be announced at the annual business meeting during &amp;lsquo;Crossref LIVE19&amp;rsquo; on 13th November 2019 in Amsterdam, Netherlands. Election materials and instructions for voting will be available online to all Crossref members in September 2019.&lt;/p>
&lt;h3 id="the-role-of-the-nominating-committee">The role of the Nominating Committee&lt;/h3>
&lt;p>The Nominating Committee meets to discuss change, process, criteria, and potential candidates, ensuring a fair representation of membership. The Nominating Committee is charged with selecting a slate of candidates for election from those who have expressed an interest.&lt;/p>
&lt;p>The selection of the slate (which might exceed the number of open seats) is based on the quality of the expressions of interest and the nominating committee&amp;rsquo;s review of the candidates in light of the board&amp;rsquo;s directive of maintaining an appropriately balanced and representative board. The nominating committee will prioritize maintaining representation of members having both commercial and non-commercial business models, in addition to continuing to seek balance across factors such as gender, ethnic and racial background, geography, and sector.&lt;/p>
&lt;p>The Board voted in March 2019 that balance according to size (based on revenue tier) will be achieved by a 2019 slate consisting of one revenue tier 1 seat (small) and 4 revenue 2 seats (large), and a 2020 slate consisting of 4 revenue tier 1 seats and 2 revenue tier 2 seats &lt;em>(see &lt;a href="https://www.crossref.org/board-and-governance/bylaws">Crossref&amp;rsquo;s amended Bylaws&lt;/a> on the Crossref website)&lt;/em>.&lt;/p>
&lt;p>The Committee is made up of three board members not up for election, and two non-board members. The current Nominating Committee members are:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Jasper Simons, APA (Chair);&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Scott Delman ACM;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Catherine Mitchell, CDL;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Vincent Cassidy, The Institution of Engineering &amp;amp; Technology (IET); and&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Claire Moulton, The Company of Biologists.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSdwqraD2fjb3eqZgLpTQWsMYPQvvz4LARLq6k8H8mA7xGbZAw/viewform" target="_blank">submit your expression of interest&lt;/a> or reply to me with any questions at &lt;a href="mailto:lhart@crossref.org">lhart@crossref.org&lt;/a>. This is your opportunity to help guide our wonderful organisation!&lt;/p></description></item><item><title>Quarterly deposit invoices: avoiding surprises</title><link>https://www.crossref.org/blog/quarterly-deposit-invoices-avoiding-surprises/</link><pubDate>Wed, 10 Apr 2019 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/quarterly-deposit-invoices-avoiding-surprises/</guid><description>&lt;p>Whenever we send out our quarterly deposit invoices, we receive queries from members who have registered a lot of backlist content, but have been charged at the current year’s rate. As the invoices for the first quarter of 2019 have recently hit your inboxes, I thought I’d provide a timely reminder about this in case you spot this problem on your invoice.&lt;/p>
&lt;p>This problem is usually the result of metadata being registered that makes it look as though the content was current, despite the fact that it was backlist. This post will show you what to do if you spot this problem in your latest invoice - and more importantly, how you can avoid this situation in the future.&lt;/p>
&lt;h3 id="about-current-and-backlist-content-registration-fees">About current and backlist Content Registration fees&lt;/h3>
&lt;p>There are different fees for registering content depending on whether it’s current (this year and the previous two years - 2017, ‘18, and ‘19) or backlist (older than that). As an example, it’s $1 each for a current journal article, and $0.15 for each backlist article. So, if you’ve incorrectly registered your content as published in 2019 when actually it was published in 2012, your quarterly invoice will overcharge you based on the metadata discrepancy.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/quarterly_invoice_test.png" alt="Sample quarterly deposit invoice" width="550" class="img-responsive" />&lt;/center>
&lt;ol>
&lt;li>We send you the quarterly deposit invoice at the end of each quarter. This example is an invoice for all deposits of the first quarter of 2018 for username ‘test’ - months January, February, and March.&lt;/li>
&lt;li>The BY code represents backlist (or, back year) content (journal article, in this example). Backlist content is charged at $0.15 per content item.&lt;/li>
&lt;li>The CY code represents current year content (journal article, in this example, although you can see that this invoice has charges for other content items as well). Current year content is charged at $1 per content item.&lt;/li>
&lt;/ol>
&lt;h3 id="determining-whether-content-is-current-or-backlist">Determining whether content is current or backlist&lt;/h3>
&lt;p>A record is determined to be either a backlist or current year deposit based on the metadata that you deposit with us. If you use our helper tools - &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a> or the &lt;a href="https://apps.crossref.org/webDeposit/" target="_blank">web deposit form&lt;/a> - the system looks at the information you’ve entered into the “publication date” field. If you deposit XML with us, it looks at the date in the &lt;code>&amp;lt;publication_date&amp;gt;&lt;/code> element. And we look at each individual item separately—so even if you’ve put a publication date at journal level, you still need to put it at the journal article level too.&lt;/p>
&lt;p>Additionally, sometimes we find that deposits mistakenly include the deposit date in place of the publication date. These two dates - the deposit date and the publication date - are not necessarily one and the same, especially if you are depositing backlist content. Please take care to double check this before you submit your deposit(s).&lt;/p>
&lt;h3 id="what-to-do-if-you-think-youve-registered-the-wrong-publication-date">What to do if you think you’ve registered the wrong publication date&lt;/h3>
&lt;p>As you can only update a publication date by running a full redeposit, it’s important to get it right the first time. If you’ve registered the wrong publication date and have received an invoice for the wrong amount, please redeposit your content and then &lt;a href="mailto:support@crossref.org">get in contact with us&lt;/a>. If you do this as soon as you spot the error, we’ll be able to send a new invoice for the correct amount.&lt;/p></description></item><item><title>Here’s to year one!</title><link>https://www.crossref.org/blog/heres-to-year-one/</link><pubDate>Fri, 22 Mar 2019 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/heres-to-year-one/</guid><description>&lt;p>Our Ambassador Program is now one year old, and we are thrilled at how the first 12 months have gone. In 2018 we welcomed 16 &lt;a href="https://www.crossref.org/community/our-ambassadors/">ambassadors&lt;/a> to the team, based in Australia, Brazil, Colombia, India, Indonesia, Mexico, Nigeria, Peru, Russia, Singapore, South Korea, UAE, Ukraine, USA, and Venezuela.&lt;/p>
&lt;p>Our ambassadors are volunteers with a good knowledge of Crossref and the wider scholarly community, they are well connected and passionate about the work that we do. Participating in the ambassador program is complementary to people’s existing roles and enables those who already work with Crossref to have a mechanism to feed back to us and to provide support for their communities.&lt;/p>
&lt;p>We reflected on the successes and challenges of the first 12 months and discovered quite a lot has been achieved so far.&lt;/p>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>The Ambassador Program better equips me to support researchers to conduct outreach and collaborate in multidisciplinary discovery!&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Woei Fuh Wong, Research 123, Singapore&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Within the framework of the Ambassador Crossref program, I ran a seminar, webinar, and held several meetings in Ukrainian scientific organisations.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Andrii Zolkover, Internauka, Ukraine&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>In my role as ambassador, I am able to provide a greater level of support in Russian. Alongside translated materials, we have also received over 400 tickets to our Russian electronic support system and made over 300 consultations by phone.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Maxim Mitrofanov, NEICON, Russia&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>Being an ambassador has enabled me to increase knowledge of Crossref within my community.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Edilson Damasio, Department of Mathematics Library of State University of Maringá-UEM, Brazil&lt;/cite>&lt;/p>
&lt;/div>
&lt;div class="quotecite">
&lt;blockquote>
&lt;p>The ambassador program has helped in vastly raising the awareness of Crossref and its services all over the world. Based in the Middle East, I see the need in the Arab region to know more about Crossref in their mother tongue (Arabic). The program has proven success and its positive impact is tangible.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;cite>&amp;ndash; Mohamad Mostafa, Knowledge E, UAE&lt;/cite>&lt;/p>
&lt;/div>
&lt;h2 id="highlights">Highlights&lt;/h2>
&lt;p>Over the course of 2018, there were a number of big achievements which would not have been possible without the help of our ambassadors.&lt;/p>
&lt;p>Due to your feedback, we’re very keen to expand the level of multi-language support we offer our diversifying community. In addition to translating key messages, slide decks, and other educational materials, our ambassadors (and some members - thanks also!) helped us in the production of a series of short videos. We now have videos available for each of the Crossref services in nine languages including English, French, Spanish, Brazilian Portuguese, Chinese, Japanese, Korean, Arabic, and Bahasa Indonesia. You can see in the chart below, that although our English videos have the most views (this is the default language), others have also experienced a lot of visitors, particularly notable are the Chinese and Spanish language videos. This underscores the importance of further support in non-English languages, as our series of multi-lingual webinars also demonstrated.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/Service-video-plays-ambassador-blog.png" alt="Service video plays" width="500px" />
&lt;/p>
&lt;p>In 2018 we ran webinars in Russian, Brazilian Portuguese, Spanish and Arabic. Several ambassadors have taken a lead in running these webinars in their local languages with assistance from Crossref staff on producing materials and answering questions on the day. Spanish language webinars saw record numbers of attendees from a range of different countries, and our Russian webinar recordings have been viewed over 200 times. We will be continuing to offer more webinars in different time zones and languages, and the &lt;a href="https://www.crossref.org/webinars/">recordings&lt;/a> are always available for anyone who can’t attend on the day.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/Arabic-webinar-ambassador blog.png" alt="Arabic webinar" width="500px" />
&lt;/p>
&lt;p>Our ambassadors have also been helping us improve and expand our &lt;a href="https://www.crossref.org/events/">LIVE local events&lt;/a>. Last year we held events in Japan, South Africa, Russia, Germany, Brazil and India. Ambassadors help by providing recommendations on venues, accommodation, guest speakers, or even attending and speaking at the event themselves. Some run their own Crossref events which we can help provide materials and also represent Crossref at related industry events in their region. You may have had the chance to meet some of our ambassadors at our annual event in Toronto last November as well.&lt;/p>
&lt;p>As our ambassadors are our representatives, acting as our eyes and ears in the wider community, it is important that they are kept up to date with new developments and have good opportunities to report back to us. The ambassador team has participated in beta-testing of a number of new initiatives including our new &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">Metadata Manager&lt;/a> and &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> and our upcoming Community Forum. By providing feedback from their own user perspective and from how they anticipate those in their communities will view and use these tools, it enables us get better insights into how an initiative might work before launching it more widely.&lt;/p>
&lt;h2 id="future-plans">Future Plans&lt;/h2>
&lt;p>Initial feedback on the program has been overwhelmingly positive, both from the ambassadors themselves and the wider Crossref community, so we’re looking at what we can do to hone the program over time. In 2019 we will be welcoming some more ambassadors to the team to further support our global community. We want to support our ambassadors, so we don’t foresee the group growing to the point where there are too many ambassadors for us to be able to engage with. You can read about the team and where they are based, as well as all about the new ambassadors we have welcomed so far this year, on &lt;a href="https://www.crossref.org/community/our-ambassadors/">Our Ambassadors&lt;/a> page.&lt;/p>
&lt;p>This year our ambassadors will be involved when we launch our online community forum (more to come on that soon). They’re already helping with the task-force that is advising on our new documentation, and we’ll be providing them with further training on Crossref tools and services. We also have more webinars and LIVE locals in the pipeline. Keep an eye on our &lt;a href="https://www.crossref.org/webinars/">webinars&lt;/a> and &lt;a href="https://www.crossref.org/events/">events&lt;/a> pages for more details as they come.&lt;/p>
&lt;p>So a final thank you to our ambassador team - it has been great to work with you over the last year, and we look forward to how we can continue to work together!&lt;/p></description></item><item><title>Before, during, and after - a journey through title transfers</title><link>https://www.crossref.org/blog/before-during-and-after-a-journey-through-title-transfers/</link><pubDate>Mon, 25 Feb 2019 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/before-during-and-after-a-journey-through-title-transfers/</guid><description>&lt;p>In January, I wrote about how we’ve &lt;a href="https://doi.org/10.64000/pmnxw-5kx52" target="_blank">simplified the journal title transfer process&lt;/a> using our new Metadata Manager tool. For those disposing publishers looking for an easy, do-it-yourself option for transferring ownership of your journal, I suggest you review that blog post. But, whether you choose to process the transfer yourself via Metadata Manager or need some help from &lt;a href="https://www.crossref.org/people/paul-davis/">Paul&lt;/a>, &lt;a href="https://www.crossref.org/people/shayn-smulyan/">Shayn&lt;/a>, or myself, there’s more to a transfer than just the click of a transfer button or the submission of an email to &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>, as I’m sure those of you who have been through a title transfer can attest.&lt;/p>
&lt;h3 id="prepping-your-title-transfer">Prepping your title transfer&lt;/h3>
&lt;p>Sometimes members get on the other side of a &lt;a href="https://doi.org/10.64000/91cyc-vka68" target="_blank">title transfer&lt;/a> and find you’re encountering problems even if you followed the process for transferring titles. You might find you can register new content for the new title against your own prefix without any issues. But you are not able to update the metadata for back-year records after we’ve made the transfer.&lt;/p>
&lt;p>When we investigate, the problem is usually that the DOIs you’re trying to update don’t exist in our system yet. This means the deposit isn’t considered an update to the content, it’s considered a new deposit. And you don’t have permission to do that, since you’re effectively attempting to register new content to a prefix that is not your own.&lt;/p>
&lt;p>This problem is because the former publisher didn’t ever register the DOIs with us - even though they’ve been displaying them on their website. This is bad practice and isn’t in keeping with our membership terms, but it does sometimes happen.&lt;/p>
&lt;p>Before you request a title transfer, do check with the former publisher that they’ve definitely registered all the DOIs that they’ve been displaying and distributing to their readership. You can spot check this yourself by following a few of the DOI links and checking that they resolve to the right place. If you want a full list of DOIs registered to a journal title, our &lt;a href="https://www.crossref.org/06members/51depositor.html" target="_blank">depositor reports&lt;/a> are the place to start. Depositor Reports list all DOIs deposited for a title on a publisher-by-publisher basis. Or, alternatively, if you know the journal cite ID, the unique internal, Crossref identifier for the journal, you can bypass the publisher-by-publisher title list (in my example you’d need to replace my fictional 123456 journal ID with your journal’s cite ID):&lt;/p>
&lt;center>`http://data.crossref.org/depositorreport?pubid=J123456`&lt;/center>
&lt;h3 id="top-tips-for-a-pain-free-title-transfer">Top tips for a pain-free title transfer&lt;/h3>
&lt;p>If your organisation has gained new titles, you’ve checked the depositor report for your new journal and are happy that all the existing DOIs have been registered, then you’re ready to process the transfer. Here are three key steps to ensure a pain-free transfer.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>If you are not acquiring all existing journal articles as part of this transfer, you’ll need to contact us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> to confirm the details. Once we have those details sorted, we&amp;rsquo;ll transfer ownership for the select, specified articles.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Carefully check the existing metadata associated with your new titles - some metadata provided for &lt;a href="https://www.crossref.org/education/retrieve-metadata/rest-api/text-and-data-mining">text and data mining&lt;/a> or &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a> are publisher-specific and must be updated or &lt;a href="https://support.crossref.org/hc/en-us/articles/115003564483-Removing-metadata-from-a-record" target="_blank">removed&lt;/a> when content is acquired by another member.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>If the metadata supplied is fine, you just need to update the URLs to direct DOIs to your content. You can do this by sending us a &lt;a href="https://support.crossref.org/hc/en-us/articles/213022526" target="_blank">URL update file&lt;/a> or by &lt;a href="https://support.crossref.org/hc/en-us/articles/213022486" target="_blank">redepositing the metadata&lt;/a> with the correct URLs.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>If you need to update more than the URLs, you should redeposit the metadata with the correct information plus the correct URLs.&lt;/p>
&lt;ul>
&lt;li>Note: If you, as the disposing publisher, are prepared to transfer your journal to an acquiring publisher, and would like to transfer ownership of the journal and all existing journal articles, please try your new &lt;a href="https://doi.org/10.64000/pmnxw-5kx52" target="_blank">title transfer via Metadata Manager&lt;/a>.&lt;/li>
&lt;/ul>
&lt;h3 id="on-the-other-side">On the other side&lt;/h3>
&lt;p>If you follow the steps I’ve outlined above, you should get to the other side of your title transfer with few problems and are likely to encounter smooth metadata seas ahead. That said, some of our members follow these steps to a tee and still are faced with occasional transfer-related problems.&lt;/p>
&lt;p>Perhaps the previous journal owner used a different scheme to assign timestamps and now you’re receiving &lt;a href="https://support.crossref.org/hc/en-us/articles/215789303-Error-and-warning-messages-" target="_blank">mysterious timestamp errors&lt;/a> when you deposit. Or, that same previous owner made a mistake with a previous deposit and accidentally submitted more than one journal title record. Or, you encounter a strange, new error in Metadata Manager when working with your new titles (yes, we’re still in beta!). If so, please reach out to us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a> and we’ll help solve what are surely confounding problems, since you’ve undoubtedly read this post in its entirety and taken heed of the above advice.&lt;/p>
&lt;p>As always, if you have questions, need guidance as you’re working through this process, or have recommendations on how we can improve title transfers, please contact us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Work through your PID problems on the PID Forum</title><link>https://www.crossref.org/blog/work-through-your-pid-problems-on-the-pid-forum/</link><pubDate>Thu, 21 Feb 2019 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/work-through-your-pid-problems-on-the-pid-forum/</guid><description>&lt;p>As self-confessed PID nerds, we’re big fans of a persistent identifier. However, we’re also conscious that the uptake and use of PIDs isn’t a done deal, and there are things that challenge how broadly these are adopted by the community.&lt;/p>
&lt;p>At &lt;a href="https://pidapalooza.org/" target="_blank">PIDapalooza&lt;/a> (an annual festival of PIDs) in January, ORCID, DataCite and Crossref ran an interactive session to chat about the cool things that PIDs allow us to do, what’s working well and, just as importantly, what isn’t, so that we can find ways to improve and approaches that work.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/yin_yang_board.jpg" alt=“the yin yang board" height="150px" width="400px" class="img-responsive" />
&lt;/div>
&lt;p>We titled the session the Yin &amp;amp; Yang of PIDs and challenged attendees to put down on paper (post-its) their thoughts about what’s good about PIDs (the Yin) and what’s not so good (the Yang). Attendees could also upvote other’s comments by adding a smiley face sticker to the concept(s) they supported.&lt;/p>
&lt;p>So what came out of the session? &lt;a href="https://doi.org/10.5281/zenodo.2572718" target="_blank">Lots of things&lt;/a>!&lt;/p>
&lt;ul>
&lt;li>Limits to PID uptake are often more cultural than technical&lt;/li>
&lt;li>Yay for &lt;a href="https://support.orcid.org/hc/en-us/articles/360006896394-Auto-updates-time-saving-and-trust-building" target="_blank">auto-update&lt;/a>!&lt;/li>
&lt;li>Slow adoption of new PID types&lt;/li>
&lt;li>Trust issues (I don’t want my information in another system)&lt;/li>
&lt;li>&amp;ldquo;I&amp;rsquo;m the only Erik, I don&amp;rsquo;t need an ORCID&amp;rdquo;&lt;/li>
&lt;li>User stories work!&lt;/li>
&lt;/ul>
&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/what_are_pids.jpg" alt=“what are PIDs" height="100px" width="300px" class="img-responsive" />
&lt;/div>
&lt;p>We know we only scratched the surface in the session, but fortunately PIDapalooza also brought a good way to continue the conversation: &lt;a href="https://pidforum.org" target="_blank">pidforum.org&lt;/a>! The PIDforum was &lt;a href="https://doi.org/10.5281/zenodo.2548649" target="_blank">launched at PIDapalooza&lt;/a> and is a global discussion platform for all things PID-related. Many PID providers and PID users are already on there, so help us understand more about the yin and yang of PIDs by sharing your own PID problems!&lt;/p></description></item><item><title>ROR announces the first Org ID prototype</title><link>https://www.crossref.org/blog/ror-announces-the-first-org-id-prototype/</link><pubDate>Sun, 10 Feb 2019 00:00:00 +0000</pubDate><author>Maria Gould</author><guid>https://www.crossref.org/blog/ror-announces-the-first-org-id-prototype/</guid><description>&lt;p>What has hundreds of heads, 91,000 affiliations, and roars like a lion? If you guessed the Research Organisation Registry community, you&amp;rsquo;d be absolutely right!&lt;/p>
&lt;p>Last month was a big and busy one for the ROR project team: we released a working API and search interface for the registry, we held our first ROR community meeting, and we showcased the initial prototypes at PIDapalooza in Dublin.&lt;/p>
&lt;p>We&amp;rsquo;re energized by the positive reception and response we&amp;rsquo;ve received and we wanted to take a moment to share information with the community. Here are the links to our latest work, a recap of everything that happened in Dublin, some of the next steps for the project, and how the community can continue to be involved.&lt;/p>
&lt;h2 id="-ta-da-the-first-ror-prototype">🎉 Ta da! The first ROR prototype&lt;/h2>
&lt;p>The Research Organisation Registry (ROR) is finally here! We&amp;rsquo;re thrilled to officially announce the launch of our ROR MVR (minimum viable registry). The MVR consists of the following components, which are ready for anyone to use right now.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>ROR IDs:&lt;/strong> Starting with seed data from &lt;a href="https://www.grid.ac/" target="_blank">GRID&lt;/a>, ROR has begun assigning unique identifiers to approximately 91,000 organisations in its registry. ROR IDs include a random, unique, and opaque 9-character string and are expressed as URLs that resolve to the organisation&amp;rsquo;s record. For instance, here is the ROR ID for California Digital Library: &lt;a href="https://ror.org/03yrm5c26" target="_blank">https://ror.org/03yrm5c26&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Search:&lt;/strong> We also built a search interface to look up organisations in the registry: &lt;a href="https://ror.org/search" target="_blank">https://ror.org/search&lt;/a>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;img src="https://lh5.googleusercontent.com/UQfE-D0oO6TNUdWPapf3LT-hj6v5l9NdD4LzGDR_A_ZPSKjvTKOlS9LsiTSVEgh_ia--yAbVWBukOHVmucYEymzxPvpAhp15zv1R0bYcQy_OArLAeiasDaIlPXaunVhPbU_Ebrg8" alt="">&lt;/p>
&lt;ul>
&lt;li>&lt;strong>ROR records:&lt;/strong> ROR IDs are stored with additional metadata about the organisation, such as alternate names/abbreviations, external URLs (e.g., an organisation&amp;rsquo;s official website), and other identifiers, such as Wikidata, ISNI, and the Open Funder Registry. This metadata will allow ROR to be interoperable with other identifiers and across different systems. The current schema is based on GRID&amp;rsquo;s dataset and we plan to incorporate other metadata fields over time and according to community needs.&lt;/li>
&lt;/ul>
&lt;p>&lt;img src="https://lh6.googleusercontent.com/0e54ZDo4MMbXFcwFCjFR27ZC7c1EmqAiybwEV12a4wLSvQNbIIyMeIdKyBJNk2SQLYPXNsLXMmDoUozf4fHSF7Qjlhvq1UtnP_poFPPkdavmd9YQaTN5JvJ9zL_9lVPdVyU83l1M" alt="">&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;strong>API:&lt;/strong> The ROR API is now public. You can access the JSON files at &lt;a href="https://api.ror.org/organisations" target="_blank">https://api.ror.org/organisations&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>OpenRefine reconciler:&lt;/strong> We&amp;rsquo;ve released an OpenRefine reconciler that can map your internal identifiers to ROR identifiers: &lt;a href="https://github.com/ror-community/ror-reconciler" target="_blank">https://github.com/ror-community/ror-reconciler&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;strong>Documentation:&lt;/strong> We have begun storing documentation on Github and will be adding more as we go along. Please feel free to follow and contribute:  &lt;a href="https://github.com/ror-community/ror-reconciler" target="_blank">https://github.com/ror-community&lt;/a>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="community-meeting-recap">Community meeting recap&lt;/h2>
&lt;p>On January 22, 60+ representatives from across the research and publishing community gathered in Dublin to see what the ROR project team has been up to, demo the first prototypes in action, and discuss where we want to go next - and, of course, to practice ROR-ing together.&lt;/p>
&lt;p>&amp;lt;img src=&amp;quot;/images/blog/pride-of-lions.jpg&amp;quot; alt=“ROR-ing lions Dublin 2019&amp;quot; height=&amp;ldquo;300px&amp;rdquo; class=&amp;ldquo;img-responsive&amp;rdquo;&amp;gt;&lt;/p>
&lt;p>In the second half of the meeting, attendees split into discussion groups to identify specific aspirations for ROR and brainstorm concrete actions needed to achieve these goals, focusing on the main use case of exposing and capturing all research outputs of a given institution. The proposed ideas covered a spectrum of possibilities for ROR, highlighting the following themes:&lt;/p>
&lt;h3 id="ror-as-seamlessly-integrated-and-sometimes-invisible-infrastructure">ROR as seamlessly-integrated and sometimes invisible infrastructure&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Integration between and within existing systems (and in new ones!)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Auto-detection of ROR IDs for example in manuscript tracking and funding application platforms&lt;/p>
&lt;/li>
&lt;li>
&lt;p>As such, researchers don&amp;rsquo;t ever have to be responsible for knowing what a ROR is and using it appropriately - the systems they use will do this for them.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="ror-as-a-critical-piece-of-funder-workflows-and-infrastructure">ROR as a critical piece of funder workflows and infrastructure&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Demonstrate to funders how ROR can help them analyze impact of research they fund&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Conduct outreach with key international funders, especially those interested in open infrastructure&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Make funders aware of ROR and encourage them to adopt and mandate use of ROR IDs - involve funders at the beginning to collaborate on technology&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Integrate ROR with existing systems and identifiers already in use by funders and other stakeholders&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h3 id="ror-as-a-trusted-registry-collaborative-partner-and-responsible-steward">ROR as a trusted registry, collaborative partner, and responsible steward&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Culturally sensitive, inclusive, and respectful of what countries are already doing with regard to organisational identifiers, partnering with national bodies working on this and mapping ROR IDs to locally used identifiers.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Involve the institutions listed in the registry early on as well as CRIS systems&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Interoperability with existing communities and governance bodies&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Workflows to support trust and responsible management of organisational metadata, with policies and procedures for long-term curation and maintenance of records&lt;/p>
&lt;/li>
&lt;/ul>
&lt;h2 id="what-were-hearing">What we&amp;rsquo;re hearing&lt;/h2>
&lt;p>Now that the ROR MVR is here, we&amp;rsquo;re hearing some really good questions about the data we&amp;rsquo;re capturing, how it can be used, and how we&amp;rsquo;ll be maintaining the registry going forward. We wanted to take a moment to respond to some of these questions.&lt;/p>
&lt;h3 id="what-is-the-criteria-for-being-listed-in-ror-what-is-a-research-organisation">What is the criteria for being listed in ROR? What is a &amp;ldquo;research organisation&amp;rdquo;?&lt;/h3>
&lt;p>We define the notion of &amp;ldquo;research organisation&amp;rdquo; quite broadly as any organisation that conducts, produces, manages, or touches research. This is in line with ROR&amp;rsquo;s stated scope, which is to address the affiliation use case and be able to identify which organisations are associated with which research outputs. We use &amp;ldquo;affiliation&amp;rdquo; to describe any formal relationship between a researcher and an organisation associated with researchers, including but not limited to their employer, educator, funder, or scholarly society.&lt;/p>
&lt;h3 id="will-ror-map-organisational-hierarchies">Will ROR map organisational hierarchies?&lt;/h3>
&lt;p>No - ROR is focused on being a top-level registry of organisations so we can address the fundamental affiliation use case, and provide a critical source of metadata that can interoperate with other institutional identifiers.&lt;/p>
&lt;h3 id="ror-ids-are-cool---what-can-i-do-with-them">ROR IDs are cool - what can I do with them?&lt;/h3>
&lt;p>Now that we have built our MVR, we will be working to incorporate ROR IDs into relevant pieces of the scholarly communication infrastructure. If you are a publisher, funder, metadata provider, research office, or anyone else interested in capturing affiliations, please get in touch with us to discuss how we might coordinate. If you are a developer, you are welcome to start playing around with the API: &lt;a href="https://api.ror.org/organisations" target="_blank">https://api.ror.org/organisations&lt;/a>.&lt;/p>
&lt;h3 id="theres-an-error-in-my-organisations-ror-record-----can-you-fix-it">There&amp;rsquo;s an error in my organisation&amp;rsquo;s ROR record &amp;mdash; can you fix it?&lt;/h3>
&lt;p>For the time being, please email &lt;a href="mailto:info@ror.org">info@ror.org&lt;/a> to request an update to an existing record in ROR or request that a new record be added. We will formalize our data management policies and procedures in the next stage of the project.&lt;/p>
&lt;h3 id="what-is-rors-relationship-to-other-organisational-identifiers">What is ROR&amp;rsquo;s relationship to other organisational identifiers?&lt;/h3>
&lt;p>For ROR to be useful, it needs to augment the current offerings in a way that is open, trusted, complementary, and collaborative, and not intentionally competitive. We are committed to providing a service that the community finds helpful and not duplicative, and enables as many connections as possible between organisation records across systems.&lt;/p>
&lt;h3 id="i-have-my-own-dataset-of-institutional-affiliations-----can-i-give-it-to-ror">I have my own dataset of institutional affiliations &amp;mdash; can I give it to ROR?&lt;/h3>
&lt;p>We are always happy to hear about other efforts to capture affiliation data. Please get in touch with us to discuss how we might coordinate.&lt;/p>
&lt;h3 id="can-ror-support-multiple-languages-and-character-sets">Can ROR support multiple languages and character sets?&lt;/h3>
&lt;p>GRID already supports multiple languages and character sets, so by extension ROR will have this enabled as well. Here is one example: &lt;a href="https://ror.org/01k4yrm29" target="_blank">https://ror.org/01k4yrm29&lt;/a>.&lt;/p>
&lt;h3 id="how-will-ror-handle-curation-ie-updating-records-if-an-organisation-changes-its-name-or-ceases-to-exist">How will ROR handle curation, i.e., updating records if an organisation changes its name or ceases to exist?&lt;/h3>
&lt;p>The curation and long-term management of records will be a cornerstone of our efforts in 2019 and we hope to release a working set of policies and procedures soon.&lt;/p>
&lt;h2 id="whats-next-for-ror">What&amp;rsquo;s next for ROR&lt;/h2>
&lt;p>Now that we have our MVR, what happens next for ROR? We&amp;rsquo;re eager to sustain the momentum from January&amp;rsquo;s stakeholder meeting at the same time we know there are some longer-term plans to put in place, and so we&amp;rsquo;re looking at both some immediate tasks as well as bigger-picture questions.&lt;/p>
&lt;h3 id="product-development">Product development&lt;/h3>
&lt;p>We have a few to-do items on our list following the launch of the MVR to keep everything running smoothly while we develop a comprehensive long-term product roadmap.&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Rewrite some of the code for both the API and the OpenRefine reconciler&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Address a few bugs in our repos&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Provide guidance for troubleshooting issues&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Communicate our processes for users to request changes, report bugs, and suggest features&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>As a reminder, you can access the existing code in Github: &lt;a href="https://github.com/ror-community" target="_blank">https://github.com/ror-community&lt;/a>&lt;/p>
&lt;h3 id="policy-development">Policy development&lt;/h3>
&lt;p>We&amp;rsquo;ve been emphasizing here and in community conversations that our primary focus now turns to formulating policies and procedures to ensure the successful management of ROR data over the long term. This is something we can&amp;rsquo;t (and shouldn&amp;rsquo;t) do on our own &amp;mdash; we want to work with community stakeholders to develop the right solutions and establish the right frameworks. We understand the urgency of firming up these policies, but we are also aware that something this important can take time to complete and is not something to rush into lightly.&lt;/p>
&lt;h3 id="community-development">Community development&lt;/h3>
&lt;p>To help guide the next stages of the project, we are putting out an open call for participation in the ROR community advisory group. Advisory group members will be involved in giving input on data management, testing out new features, giving feedback on the product roadmap, and discussing ideas for events and outreach. We plan to convene this advisory group through bimonthly calls and asynchronous communication channels through the end of the year. We hope you will consider joining us! Please email &lt;a href="mailto:info@ror.org">info@ror.org&lt;/a> if you are interested.&lt;/p>
&lt;p>For those who want to stay informed about the project but not necessarily be part of the advisory group, you have other options!&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Sign up for our mailing list (via the footer at &lt;a href="https://www.ror.org" target="_blank">ror.org&lt;/a>)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Join our community on Slack (&lt;a href="http://www.tinyurl.com/ror-community" target="_blank">www.tinyurl.com/ror-community&lt;/a>),&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Follow us on Twitter (&lt;a href="https://twitter.com/ResearchOrgs" target="_blank">@ResearchOrgs&lt;/a>).&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>You can also always drop us a line at &lt;a href="mailto:info@ror.org">info@ror.org&lt;/a>, and let us know if you&amp;rsquo;d ever like to set up a meeting or conference call to talk about the project in more detail.&lt;/p>
&lt;h2 id="final-thoughts">Final thoughts&lt;/h2>
&lt;p>Community engagement has been vital to ROR&amp;rsquo;s beginnings and will likewise be critically important for the next steps that we take. As both a registry of identifiers and a community of stakeholders involved in building open scholarly infrastructure, ROR depends on guidance and involvement at multiple levels. Thank you for being part of the journey thus far, and for joining us on the road that lies ahead. 🦁&lt;/p></description></item><item><title>Request for feedback on grant identifier metadata</title><link>https://www.crossref.org/blog/request-for-feedback-on-grant-identifier-metadata/</link><pubDate>Thu, 07 Feb 2019 00:00:00 +0000</pubDate><author>Patricia Feeney</author><guid>https://www.crossref.org/blog/request-for-feedback-on-grant-identifier-metadata/</guid><description>&lt;p>We first announced plans to investigate identifiers for grants in 2017 and are almost ready to violate the &lt;a href="https://doi.org/10.64000/5cfh1-1wa10" target="_blank">first rule of grant identifiers&lt;/a> which is “they probably should not be called grant identifiers”. Research support extends beyond monetary grants and awards, but our end goal is to make grants easy to cite, track, and identify, and ‘Grant ID’ resonates in a way other terms do not. The truth is in the metadata, and we intend to collect (and our funder friends are prepared to provide) information about a number of funding types. Hopefully we encompass all of them.&lt;/p>
&lt;p>Our technical &amp;amp; metadata working group (a subset of the broader &lt;a href="https://www.crossref.org/working-groups/funders">Funder Advisory Group&lt;/a>) includes folks from Children&amp;rsquo;s Tumor Foundation, Europe PMC, European Research Council, JST, OSTI (DOE), Smithsonian, Swiss National Science Foundation, UKRI, Wellcome, as well as colleagues at DataCite and ORCID.&lt;/p>
&lt;p>They have provided a wealth of funding data and feedback, and together we’ve come up with a metadata schema that works for us. Just as important - does this set of metadata meet your needs? Did we miss something? Let us know.&lt;/p>
&lt;h3 id="the-details">The details&lt;/h3>
&lt;p>For those of you familiar with Crossref Content Registration, Grant IDs will have their own dedicated schema that differs from our publication schema. The Grant ID schema will follow some of the same conventions as we’ll be using the same system to process the files (which will be XML) but since we are collecting metadata for a new community and moving beyond published content, this is an opportunity to rethink how we handle some basics like person names and dates.&lt;/p>
&lt;p>Each Grant ID can be assigned to multiple projects. The metadata within each project includes basics like titles, descriptions, and investigator information (including affiliations) as well as funding information. Funders will supply funder information (including funder identifiers from the Crossref Funder Registry) as well as information about funding types and amounts.&lt;/p>
&lt;p>A major accomplishment of the group was to develop a simple taxonomy of types of funding. Supported types are:&lt;/p>
&lt;ul>
&lt;li>award&lt;/li>
&lt;li>contract&lt;/li>
&lt;li>grant&lt;/li>
&lt;li>salary-award&lt;/li>
&lt;li>endowment&lt;/li>
&lt;li>secondment&lt;/li>
&lt;li>loan&lt;/li>
&lt;li>facilities&lt;/li>
&lt;li>equipment&lt;/li>
&lt;li>seed-funding&lt;/li>
&lt;li>fellowship&lt;/li>
&lt;li>training-grant&lt;/li>
&lt;li>other&lt;/li>
&lt;/ul>
&lt;p>Funding involves more than monetary grants or awards and we’ve attempted to capture the broad categories of funding types. This list is taken from types of funding as defined by our participating funder organisations. We anticipate this list will evolve over time.&lt;/p>
&lt;p>Ready to dig in? The schema and documentation are &lt;a href="https://github.com/CrossRef/grantID-schema/" target="_blank">available on GitHub&lt;/a>. We will actively take feedback until the end of February 2019. We hope to begin implementation soon after that. Please let us know what you think through GitHub, or feel free to contact me via &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Underreporting of matched references in Crossref metadata</title><link>https://www.crossref.org/blog/underreporting-of-matched-references-in-crossref-metadata/</link><pubDate>Tue, 05 Feb 2019 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/underreporting-of-matched-references-in-crossref-metadata/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>About 11% of available references in records in our OAI-PMH &amp;amp; REST API don&amp;rsquo;t have DOIs when they should. We have deployed a fix, but it is running on billions of records, and so we don’t expect it to be complete until mid-April.&lt;/p>
&lt;p>Note that the Cited-by API that our members use appears to be &lt;em>unaffected&lt;/em> by this problem.&lt;/p>
&lt;h2 id="the-gory-details">The gory details&lt;/h2>
&lt;p>When a Crossref member registers metadata for a publication, they often include references. Sometimes the member will also include DOIs in the references, but often they don’t. When they don’t include a DOI in the reference, Crossref tries to match the reference to metadata in the Crossref system. If we succeed, we add the DOI of the matched record to the reference metadata. If we fail, we append the reference to an ever-growing list which we re-process on an ongoing basis.&lt;/p>
&lt;p>You may have seen that &lt;a href="https://doi.org/10.64000/e6ey2-wce96" target="_blank">the R&amp;amp;D team has been doing work to improve our reference matching system&lt;/a>. We will soon be rolling out a new reference matching process that will increase recall significantly.&lt;/p>
&lt;p>But while testing our new reference matching approach, we started to see inconsistent results with our existing legacy reference matching system. When we implemented new regression tests, we noticed that, even when using our legacy system, we were consistently getting &lt;em>better&lt;/em> results than were reflected in the metadata we exposed via our APIs. For example, we would pick a random Crossref DOI record that included 3 matched references, and when we tried matching all the references in the record again using our existing technology, we would get &lt;em>more&lt;/em> matched references than were reported in the metadata.&lt;/p>
&lt;p>At first, we thought this might have something to do with sequencing issues. For example, that article &lt;em>A&lt;/em> might cite article &lt;em>B&lt;/em>, but somehow article &lt;em>A&lt;/em> would get its DOI registered with Crossref prior to article &lt;em>B&lt;/em>. In this theoretical case, we would initially fail to match the reference, but it would eventually get matched as we continued to reprocess our unmatched references. But this wasn’t the issue. And the problem was not with the matching technology we are using. Instead, we discovered a problem with the way we process references on deposit.&lt;/p>
&lt;p>When a member deposits references with Crossref, each reference has to include a member-defined key that is unique to each reference they are depositing in the DOI record. When we match a reference- we report to the members that we matched the reference with key X to DOI Y. The problem is that sometimes members would deposit references with an empty key. If there was only one such reference, then, technically, it would pass our test for making sure the key was unique within the record. So we would process the reference, and match it, and report it via our Cited-by service, but later in the process, when we went to include the matched DOI in the reference section of our API metadata, we’d skip including DOIs for references that had blank keys. The reference itself would be included in the metadata, it would just appear that we hadn’t matched it to a DOI when we actually had.&lt;/p>
&lt;p>Again, we estimate this to have resulted in about 11% of the references in our metadata to be missing matched DOIs. We are processing our references again and inserting the correctly matched DOIs in the metadata. We expect the process to complete in mid-April. We will keep everybody up-to-date on the progress of this fix.&lt;/p>
&lt;p>We will also be integrating the new matching system that we’ve developed. As mentioned at the start of this post, this matching system will also increase the recall rate of our reference matching and so, the two changes combined, should result in users seeing a significant increase in the number of matched references included in Crossref metadata.&lt;/p>
&lt;p>And finally, as part of the work that we are doing to improve our reference matching, we are putting a comprehensive testing framework that will make it easier for us to detect inconsistencies and/or regressions in our reference matching.&lt;/p>
&lt;p>Please contact &lt;a href="mailto:support@crossref.org">Crossref support&lt;/a> with any questions or concerns.&lt;/p></description></item><item><title>How Crossref metadata is helping bring migration research in Europe under one roof</title><link>https://www.crossref.org/blog/how-crossref-metadata-is-helping-bring-migration-research-in-europe-under-one-roof/</link><pubDate>Tue, 29 Jan 2019 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/how-crossref-metadata-is-helping-bring-migration-research-in-europe-under-one-roof/</guid><description>&lt;p>&lt;em>Conflict, instability and economic conditions are just some of the factors driving new migration into Europe—and European policy makers are in dispute about how to manage and cope with the implications. Everyone agrees that in order to respond to the challenges and opportunities of migration, a better understanding is required of what drives migration towards Europe, what trajectories and infrastructures facilitate migration, and what the key characteristics of different migrant flows are, in order to inform and improve policy making.&lt;/em>&lt;/p>
&lt;p>The abstract above is taken from the successful Horizon 2020[1] project proposal called CrossMigration, an initiative of &lt;a href="https://www.imiscoe.org/" target="_blank">IMISCOE&lt;/a>, Europe’s largest migration research network, in which a consortium of 15 universities, think tanks and international organisations, led by &lt;a href="https://www.eur.nl/en" target="_blank">Erasmus University Rotterdam&lt;/a> is currently designing a Migration Research Hub. The Hub is a web-based platform aimed at helping researchers and policymakers get a quick and comprehensive overview on research in the field of migration studies. This platform will also feature reports on specific fields, methodological briefing papers and other relevant content produced by the consortium.&lt;/p>
&lt;p>The core of this Hub will consist of a database providing access to publications, research projects and datasets on migration drivers, and infrastructures, flows, and policies on current and future migration questions, indicators and scenarios. And that’s where our metadata story starts.&lt;/p>
&lt;p>At the tail end of December I had the pleasure of speaking to the four researchers and developers working on this database; Vienna-based researchers Roland Hosner and Meike Palinkas from the &lt;a href="https://www.icmpd.org/home/" target="_blank">International Centre for Migration Policy and Development&lt;/a> (ICMPD), Bogdan Taut, CEO of &lt;a href="http://www.youngminds.ro/" target="_blank">YoungMinds&lt;/a>, in Bucharest, Romania, and Nathan Levy, currently studying for his PhD at Erasmus University Rotterdam, Department of Public Administration and Sociology, Netherlands.&lt;/p>
&lt;h3 id="there-are-four-of-you-can-each-of-you-give-me-a-very-brief-introduction-to-yourselves-and-how-you-fit-into-project">There are four of you, can each of you give me a very brief introduction to yourselves and how you fit into project?&lt;/h3>
&lt;p>&lt;strong>Bogdan:&lt;/strong> I’m from YoungMinds, based in Bucharest in Romania. We were the last to join the consortium as the technical developer on the project. I am the project manager of the team, coordinating the technical development of the database.&lt;/p>
&lt;p>&lt;strong>Roland:&lt;/strong> I am a research officer with the International Centre for Migration Policy Development (ICMPD) in Vienna, and we are leading a part of this research project which deals with the population and implementation of the research database—which is core to the Migration Research Hub, and to the whole project.&lt;/p>
&lt;p>&lt;strong>Meike:&lt;/strong> I am also a research officer at ICMPD and work together with Roland. I joined the team in September this year.&lt;/p>
&lt;p>&lt;strong>Nathan:&lt;/strong> I’m part of the coordinating team of the overall project of CrossMigration. We are coordinating putting together the Migration Research Hub, the biggest part of which is the migration database. I am based at Erasmus University in Rotterdam and I work for Professor Peter Scholten who is the overall coordinator of the whole project along with Dr. Asya Pisarevskaya.&lt;/p>
&lt;h3 id="how-long-has-the-project-been-in-progress">How long has the project been in progress?&lt;/h3>
&lt;p>&lt;strong>Roland:&lt;/strong> It’s a two-year project than runs from March 2018 to the end of February 2020.&lt;/p>
&lt;h3 id="so-its-a-two-year-project-and-you-are-10-months-inthat-makes-it-nearly-at-the-halfway-mark-have-you-encountered-any-stumbling-blocks-that-have-held-you-back">So it’s a two-year project and you are 10 months in—that makes it nearly at the halfway mark. Have you encountered any stumbling blocks that have held you back?&lt;/h3>
&lt;p>&lt;strong>Bogdan:&lt;/strong> How to put this in a diplomatic way? We are all working around the clock to meet the deadline that we set ourselves and promised to deliver by. We have made the decision to produce the database in stages—very soon we will have the beta version out, so we have something to present. Then we are going to continue populating it with more items from every record type – journal articles, datasets, books, book chapters, reports etc.. At this point the other partners in the consortium can actually use it and work with it to map the fields and find the most recent and relevant literature on their respective subtopics such as migration drivers or migration infrastructures. In the summer when we are confident that it is a sound and attractive tool to be released, we will make it publicly available.&lt;/p>
&lt;p>&lt;strong>Nathan:&lt;/strong> In terms of specific deliverables for the project so far, our team has developed a taxonomy for migration research to give the fields a logical structure, and to structure this research database.&lt;/p>
&lt;h3 id="how-has-crossref-metadata-contributed-to-your-project">How has Crossref metadata contributed to your project?&lt;/h3>
&lt;p>&lt;strong>Bogdan:&lt;/strong> We began by discussing all of the sources that need to be in the database and we put together an inventory of publishers, books and book chapters, etc., that would be relevant. Part of the scope of work for YoungMinds was to find ways of extracting information and relevant content from those sources. Once we started to dig into the content we found out that there are relevant aggregators, such as Scopus, Crossref, Web of Science and so on. We actually found Crossref through a recommendation from Scopus, someone there said ‘OK Crossref might be able to help you more’. Then Crossref became one of our main sources for metadata—in terms of basic metadata related to some types of content we gather for our database, such as journals and journal articles.&lt;/p>
&lt;p>&lt;strong>Roland:&lt;/strong> The more we moved forward, the more we saw how difficult it was to get in touch with each publisher individually, with each journal individually, to try and secure an agreement with them. So, it became very clear to us very quickly that we would not be able to create a properly inclusive database this way and we knew we had to look for partners and make use of existing resources. As we progressed from one conversation to the next we received a lot of advice, and that’s how we found out about Crossref. It soon became clear that Crossref was the ideal source for us because everything that has a DOI can be found in there. We knew if we had an agreement with Crossref then our project is half won, our database is halfway built, perhaps even more. And, then we just need to fill the gaps.&lt;/p>
&lt;p>&lt;strong>Nathan:&lt;/strong> Yes, this is one of Crossref’s key strengths—rather than having individual researchers or individual projects go to each publisher to try to find the appropriate people to talk to and negotiate—you use Crossref.&lt;/p>
&lt;h3 id="which-of-the-metadata-values-are-important-to-you-what-do-you-extract">Which of the metadata values are important to you, what do you extract?&lt;/h3>
&lt;p>&lt;strong>Roland:&lt;/strong> We thought about this a lot at the beginning, what we wanted to include. There are certain key things that are indisputably relevant—such as titles, names of the authors, editors, the year, DOI, dataset and so on, because we always link to the original source—the publisher’s website, or the journal article website. Ideally we would include keywords and abstracts (where they are available) because the richer the information the better. We also wanted to classify the items we have according to the taxonomy the CrossMigration project has established.&lt;/p>
&lt;p>&lt;strong>Nathan:&lt;/strong> In addition, abstracts and keywords have value for us. We want to apply a logical structure into the taxonomy on migration research, but we need content in order to do that. We need something for the algorithms that YoungMinds have developed to read to in order categorize research accordingly. The body of research on migration is so great and we cannot read through every abstract that’s ever been published on migration. That’s where the value of abstracts and keywords comes in for the Taxonomizer (as we fondly refer to it!).&lt;/p>
&lt;h3 id="what-else-would-you-like-to-see-in-the-rest-api-that-isnt-there">What else would you like to see in the REST API that isn’t there?&lt;/h3>
&lt;p>&lt;strong>Roland:&lt;/strong> More abstracts! We love abstracts!&lt;/p>
&lt;p>&lt;strong>Bogdan:&lt;/strong> Our data schema contains more fields, so we need more metadata than we can find from Crossref and other sources. Basically, the publisher’s website would produce the richest data, but it is the hardest to read. We are on a quest to find more sources because our algorithm works better if it has more information.&lt;/p>
&lt;h3 id="once-its-complete-what-are-your-plans-to-roll-it-out-to-the-wider-world">Once it’s complete, what are your plans to roll it out to the wider world?&lt;/h3>
&lt;p>&lt;strong>Bogdan:&lt;/strong> IMISCOE is the leading organisation of this consortium and it is in touch with most of the migration experts in Europe, so we already have all the contacts of the relevant people in the field.&lt;/p>
&lt;p>&lt;strong>Meike:&lt;/strong> It’s a tool for helping the community, so once we have all the relevant content inside it, we believe that word will spread relatively easily.&lt;/p>
&lt;h3 id="have-you-all-actually-met-in-person">Have you all actually met in person?&lt;/h3>
&lt;p>&lt;strong>Roland:&lt;/strong> Yes! Myself and Nathan met at the project kick-off meeting in Rotterdam in March 2018, then we met at a conference in Florence in June that was partly for the consortium but also had other invited experts and scholars. That was where we met face-to-face for the first time—it was just after we signed with YoungMinds for the IT services. And we recently met at another joint conference of IMISCOE and CrossMigration called &lt;a href="https://twitter.com/CrossMigration/status/1067762112485879808" target="_blank">'Towards the IMISCOE Research Infrastructure of the Future'&lt;/a>.&lt;/p>
&lt;p>&lt;em>[1] Horizon 2020, the biggest EU Research and Innovation programme ever with nearly €80 billion of funding available.&lt;/em>&lt;/p>
&lt;hr>
&lt;p>Great speaking to you all and learning a bit about this important project that will help policymakers manage and cope with the implications of migration—and may possibly even help them find ways to influence it.&lt;/p>
&lt;hr>
&lt;p>If you&amp;rsquo;d like to share how you use our Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>Zen and the Art of Platform Migration</title><link>https://www.crossref.org/blog/zen-and-the-art-of-platform-migration/</link><pubDate>Mon, 28 Jan 2019 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/zen-and-the-art-of-platform-migration/</guid><description>&lt;p>Nowadays we’re all trying to eat healthier, get fitter, be more mindful and stay in the now. You think you’re doing a good job — perhaps you’ve started a yoga class or got a book on mindfulness. And then, wham! Someone in your organisation casually mentions they’re planning a platform migration. I can sense the panic from here.&lt;/p>
&lt;p>While the &lt;a href="https://www.stress.org/holmes-rahe-stress-inventory/" target="_blank">Holmes and Rahe Stress Scale&lt;/a> doesn’t include platform migration as one of the top ten most stressful life events, we hear from our members that it should probably be in there somewhere. There’s so much to think about and plan for - how do you know you’re choosing the right platform partners for the future? How can you be sure that your understanding of what they offer really matches what you need? Will it make it easier for your readers to access your content? What about delays? What if it all breaks on changeover day?&lt;/p>
&lt;p>Gaaaaah!&lt;/p>
&lt;p>With all that to think about, worrying about whether your DOIs will resolve and what the migration will mean for the quality of your Crossref metadata just seems like an unnecessary layer of stress. It is, however, very important to consider this - even before you start thinking about who your platform partners will be. The process of working through these things up front could help you make better decisions, and set you up for success with the project and into the future.&lt;/p>
&lt;p>So, to help you plan ahead, we’ve created a &lt;a href="https://www.crossref.org/service-providers/migrating-platforms/">platform migration guide&lt;/a> that offers guidance on things like:&lt;/p>
&lt;ul>
&lt;li>What to consider even before you start selecting a new service provider&lt;/li>
&lt;li>Planning the change over process&lt;/li>
&lt;li>The change over itself (and what that means for your URLs)&lt;/li>
&lt;li>What you should do after the migration is complete&lt;/li>
&lt;/ul>
&lt;p>The guide gives advice on how to plan for what you really need right now, and what you’re going to need in the future. For example, what metadata are you going to want to register with us and share with the thousands of industry organisations that make use of the data? What other Crossref services might benefit you in the future? What different record types are in your publishing plans?&lt;/p>
&lt;p>The guide also has a &lt;a href="https://www.crossref.org/education/member-setup/working-with-a-service-provider/checklist-for-platform-migration/">handy checklist&lt;/a> which you can include in your Request For Proposal documentation, to ensure that you’re asking the right questions of potential suppliers.&lt;/p>
&lt;p>Once you’ve read the &lt;a href="https://www.crossref.org/service-providers/migrating-platforms/">platform migration guide&lt;/a>, &lt;a href="mailto:feedback@crossref.org">let us know&lt;/a> if there’s anything else you think we should add to it - we’re sure many of you have platform migration stories, and it’s good to share!&lt;/p>
&lt;hr></description></item><item><title>What can often change, but always stays the same?</title><link>https://www.crossref.org/blog/what-can-often-change-but-always-stays-the-same/</link><pubDate>Thu, 24 Jan 2019 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/what-can-often-change-but-always-stays-the-same/</guid><description>&lt;p>Hello. &lt;a href="https://www.crossref.org/people/isaac-farley/">Isaac&lt;/a> here again to talk about what you can tell just by looking at the prefix of a DOI. Also, as we get a lot of title transfers at this time of year, I thought I’d clarify the difference between a title transfer and a prefix transfer, and the impact of each.&lt;/p>
&lt;p>When you join Crossref, you are provided with a unique prefix, you then add suffixes of your choice to your prefix and this creates the DOIs for your content.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/DOI-structure.png" alt="Structure of a DOI directory suffix and prefix" width="550" class="img-responsive" />&lt;/center>
&lt;p>It’s a logical step then to assume you can tell just by looking at a DOI prefix who the current publisher is—but that’s not always the case. Things can (and often do) change. Individual journals get purchased by other publishers, and whole organisations get bought and sold.&lt;/p>
&lt;p>What you can tell from looking at a DOI prefix is who originally registered it, but not necessarily who it currently belongs to. That’s because if a journal (or whole organisation) is acquired, DOIs don’t get deleted and re-registered to the new owner. The update will of course be reflected in the relevant metadata, but the prefix itself will stay the same. It never changes—and that’s the whole point, that’s what makes the DOI persistent.&lt;/p>
&lt;p>Here’s a breakdown of how this works internally at Crossref:&lt;/p>
&lt;h3 id="title-transfers">Title transfers&lt;/h3>
&lt;p>Member A acquires a single title from member B. We transfer the title (and all relevant reports) over to member A. Member A must then register new content for that journal on their own prefix. The existing (newly acquired) DOIs maintain the ‘old’ prefix but member A can update metadata against these existing DOIs for that journal. Back-year and current DOIs for that journal may, therefore, have different prefixes—and that’s OK!&lt;/p>
&lt;h3 id="organisation-transfers">Organisation transfers&lt;/h3>
&lt;p>Member C acquires member D. We move the entire prefix (and all relevant reports) over to Member C, and close down Member D’s account with Crossref. Member C can continue to register DOIs on member D’s prefix (the original prefix) if they want to, or they can use their own existing prefix. So again, back-year and current records for that journal may have different prefixes.&lt;/p>
&lt;p>And, if Member C uses a service provider to register metadata on their behalf, we will simply enable their username to work with the prefix.&lt;/p>
&lt;h3 id="its-now-easier-to-transfer-titles">It’s now easier to transfer titles&lt;/h3>
&lt;p>We&amp;rsquo;ve recently made the process of &lt;a href="https://doi.org/10.64000/pmnxw-5kx52" target="_blank">transferring journal titles&lt;/a> a lot easier with our new Content Registration tool, &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a>.&lt;/p>
&lt;hr></description></item><item><title>Myth busting in Mumbai</title><link>https://www.crossref.org/blog/myth-busting-in-mumbai/</link><pubDate>Tue, 22 Jan 2019 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/myth-busting-in-mumbai/</guid><description>&lt;p>In December, Crossref’s Head of Metadata, &lt;a href="https://www.crossref.org/people/patricia-feeney">Patricia Feeney&lt;/a> and I headed to Mumbai for our first ever LIVE local event in India, held in collaboration with &lt;a href="https://www.editage.com/" target="_blank">Editage&lt;/a>.&lt;/p>
&lt;p>Crossref membership in India has escalated in recent years, with a fifth of its 500 members joining in 2017 alone. Around 40% of these new members are smaller organisations who joined through one of the eight &lt;a href="https://www.crossref.org/community/sponsors/">sponsors&lt;/a> we currently have in the country.&lt;/p>
&lt;p>With such a large increase in membership numbers, it seemed timely to visit and meet both our new and longer-standing members face-to-face. Our LIVE local events provide a great opportunity for us to learn what challenges our members in the community face, so we can understand how to best meet their needs. It also gives us a chance to explain in detail how to benefit from the services we offer, as well as keep them informed about any future developments. A special thanks goes to Editage for all their help in organizing, promoting, and running this event with us.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Clarinda Cerejo - LIVE Mumbai.png" alt=“LIVE Mumbai" height="150px" width="400px" class="img-responsive" />
&lt;/div>
&lt;p>The Mumbai event was held at the Sahara Star hotel and attended by participants from a range of organisations, with varying levels of knowledge about Crossref. Patricia talked about how to register your content and the importance of providing us with accurate and comprehensive metadata. She also introduced our new &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">Metadata Manager&lt;/a> tool, which many participants were excited to hear more about. I gave an overview of Crossref services, with a specific focus on &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> and &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a>. The afternoon session was run by Editage, and featured a session on ‘&lt;a href="https://www.slideshare.net/CrossRef/editage-workshop-helping-journals-and-publishers-get-closer-to-authors" target="_blank">Helping journals and publishers to get closer to authors&lt;/a>’, followed by a lively debate on research integrity. The debate brought up a number of interesting talking points, including how to attract more students into a career in research, issues around malpractice and plagiarism, and how to improve India’s research culture.&lt;/p>
&lt;p>The Q&amp;amp;A part of the day highlighted a number of myths about Crossref that I thought would be worth detailing here, as other members may benefit from these explanations.&lt;/p>
&lt;h3 id="myth-1-crossref-is-a-mark-of-publisher-and-content-quality">Myth #1: Crossref is a mark of publisher and content quality&lt;/h3>
&lt;p>We have a membership application process where we ask for different types of information and make it clear what the Crossref member &lt;a href="https://www.crossref.org/membership/terms">obligations&lt;/a> are. Crossref doesn’t assess the quality of its members’ content or verify members’ publication processes and procedures. It’s not our role or part of &lt;a href="https://www.crossref.org/about/">our mission&lt;/a> to do these things.
It’s important to remember that content with a Crossref DOI &lt;a href="https://doi.org/10.64000/3gjb5-tkm69" target="_blank">says nothing about the quality of the content&lt;/a>, or that it is peer-reviewed or authoritative.&lt;/p>
&lt;h3 id="myth-2-crossref-archives-content">Myth #2: Crossref archives content&lt;/h3>
&lt;p>We store the metadata our members provide about a piece of content, not the content itself. Our metadata is openly available across our &lt;a href="https://www.crossref.org/services/metadata-retrieval/">APIs&lt;/a> and &lt;a href="https://search.crossref.org" target="_blank">search interfaces&lt;/a>. The same applies for access to the full-text. A DOI will take you to a landing page for a piece of content, but access to the full-text will depend upon the content owner’s publishing model.&lt;/p>
&lt;h3 id="myth-3-crossref-provides-impact-factors">Myth #3: Crossref provides impact factors&lt;/h3>
&lt;p>On publisher websites, you’ll sometimes see the number of times a paper has been cited in Crossref, Google Scholar, Web of Science, etc. The Crossref citation information is made available to publishers through our &lt;a href="https://www.crossref.org/services/cited-by/">Cited-by&lt;/a> service, but it is not an impact factor. Cited-by counts are based on the subset of Crossref’s members participating in that service, so they’ll probably differ from other sources. Crossref Cited-by counts are meant to complement other services rather than replace them.&lt;/p>
&lt;h3 id="myth-4-crossref-charges-to-make-updates-or-corrections-to-the-metadata-associated-with-a-doi">Myth #4: Crossref charges to make updates or corrections to the metadata associated with a DOI&lt;/h3>
&lt;p>Not true - while you have to pay for your initial registration, any subsequent updates, corrections or additions you make to the metadata of a content item is free of charge (apart from &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> metadata). If you’re a member, we actively encourage you to update your metadata to ensure that your records are as comprehensive and accurate as possible. This helps the scholarly community find and use the content you publish.&lt;/p>
&lt;h3 id="myth-5-crossref-charges-for-failed-deposits">Myth #5: Crossref charges for failed deposits&lt;/h3>
&lt;p>Only deposits which are successful will be counted. You will receive an error message if your metadata deposit has failed, so you are aware of any errors and can resubmit. If you’re not sure what has gone wrong, you can &lt;a href="mailto:support@crossref.org">contact our support team&lt;/a>.&lt;/p>
&lt;h3 id="myth-6-you-need-to-have-separate-prefixes-to-register-different-recordresource-types">Myth #6: You need to have separate prefixes to register different record/resource types&lt;/h3>
&lt;p>You can register all your record types under one prefix (and you don’t need to tell us if you start to do so).&lt;/p>
&lt;h3 id="myth-7-doi-resolutions-are-how-many-dois-you-have-registered">Myth #7: DOI resolutions are how many DOIs you have registered&lt;/h3>
&lt;p>No. When someone clicks on a DOI link for an article, we count that as one DOI resolution. This is different than the number of unique DOIs you have registered with us. We’ll send you a &lt;a href="https://support.crossref.org/hc/en-us/articles/213197246-Resolution-Report" target="_blank">resolution report&lt;/a> once a month which provides details of your total number of resolutions, as well as DOIs which have been most frequently clicked, and any resolution failures. These failures can be an indication that you need to update your metadata with us for that particular article to ensure your DOI is directing readers to the correct webpage.&lt;/p>
&lt;h3 id="myth-8-crossref-own-the-plagiarism-software-used-in-similarity-check">Myth #8: Crossref own the plagiarism software used in Similarity Check&lt;/h3>
&lt;p>The Similarity Check service is provided in collaboration with &lt;a href="https://www.turnitin.com/" target="_blank">Turnitin&lt;/a> who run the &lt;a href="https://www.ithenticate.com/" target="_blank">iThenticate&lt;/a> text-comparison tool. The iThenticate database is the largest comparison database of full-text academic content in the world. Similarity Check participants enjoy cost-effective use of iThenticate because they contribute their own published content into Turnitin’s database. Turnitin also provides our members with access to additional features in iThenticate, such as enhanced text-matches within the document viewer and access to a dedicated Similarity Check support team in order to discuss any technical or billing queries.&lt;/p>
&lt;p>It’s great to have the opportunity to do some myth-busting! You’re bound to have more questions, so we’ll be running more LIVE locals in 2019, as well as virtual events. To keep updated, follow us @CrossrefOrg, or &lt;a href="https://www.crossref.org/subscribe-newsletter/">subscribe to our newsletter&lt;/a>.&lt;/p>
&lt;hr></description></item><item><title>What's that DOI?</title><link>https://www.crossref.org/blog/whats-that-doi/</link><pubDate>Mon, 21 Jan 2019 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/whats-that-doi/</guid><description>&lt;p>This is a long overdue followup to 2016&amp;rsquo;s &amp;ldquo;&lt;a href="https://doi.org/10.64000/jw4t5-5yt89" target="_blank">URLs and DOIs: a complicated relationship&lt;/a>&amp;rdquo;. Like that post, this accompanies my talk at &lt;a href="https://www.pidapalooza.org" target="_blank">PIDapalooza&lt;/a>, the festival of open persistent identifiers). I don&amp;rsquo;t think I need to give a spoiler warning when I tell you that it&amp;rsquo;s still complicated. But this post presents some vocabulary to describe exactly &lt;em>how&lt;/em> complicated it is. Event Data has been up and running and collecting data for a couple of years now, but this post describes changes we made toward the end of 2018.&lt;/p>
&lt;p>If Event Data is new to you, you can read about its development in &lt;a href="https://www.crossref.org/categories/event-data/">other blog posts&lt;/a> and the &lt;a href="https://www.eventdata.crossref.org/guide" target="_blank">User Guide&lt;/a>. Today I&amp;rsquo;ll be describing a specific but important part of the machinery: how we match landing pages to DOIs.&lt;/p>
&lt;h2 id="some-background">Some background&lt;/h2>
&lt;p>Our Event Data service provides you with a live database of links to DOIs, found from across the web and social media. Data comes from a variety of places, and most of it is produced by Agents operated by Crossref. We have Agents monitoring Twitter, Wikipedia, Reddit, Stack Overflow, blogs and more besides. It is a sad truth that the good news of DOIs has not reached all corners of world, let alone the dustiest vertices of the world wide web. And even within scholarly publishing and academia, not everyone has heard of DOIs and other persistent identifiers.&lt;/p>
&lt;p>Of course, this means that when we look for links to content-that-has-DOIs, what we at Crossref call &amp;lsquo;registered content&amp;rsquo;, we can&amp;rsquo;t content ourselves with only looking for DOIs. We also have to look for article landing pages. These are the pages you arrive at when you click on a DOI, the page you&amp;rsquo;re on when you decide to share an article.&lt;/p>
&lt;h2 id="half-full-or-half-empty">Half full or half empty?&lt;/h2>
&lt;p>So we&amp;rsquo;re trying to track down links to these landing pages, rather than just DOIs. You could look at this two ways.&lt;/p>
&lt;p>The glass-half-empty view would be that it&amp;rsquo;s a real shame people don&amp;rsquo;t use DOIs. Don&amp;rsquo;t they know that their links aren&amp;rsquo;t future-proof? Don&amp;rsquo;t they know that DOIs allow you to punch the identifier into other services?&lt;/p>
&lt;p>The glass-half-full view is that it&amp;rsquo;s really exciting that people outside the traditional open identifier crowd are interacting with the literature. We&amp;rsquo;ve been set a challenge to try and track this usage. By collecting this data and processing it into a form that&amp;rsquo;s compatible with other services we can add to its value and better help join the dots in and around the community that we serve. Not everyone tweeting about articles counts as &amp;lsquo;scholarly Twitter&amp;rsquo;, and hopefully we can bridge some divides (the subject of my talk at PIDapalooza last year, &lt;a href="https://doi.org/10.64000/yagrq-cv833" target="_blank">'Bridging Identifiers'&lt;/a>).&lt;/p>
&lt;h2 id="how-do-we-do-it">How do we do it?&lt;/h2>
&lt;p>One of the central tenets of Event Data is transparency. We record as much information as we can about the data we ingest, how we process it, and what we find. Of course, you don&amp;rsquo;t &lt;em>have&lt;/em> to use this data, it&amp;rsquo;s up to you how much depth you want to go into. But it&amp;rsquo;s there if you want it.&lt;/p>
&lt;p>The resulting data set in Event Data is easy to use, but allows you to peek beneath the surface. We do this by linking every Event that our Agents collect through to an Evidence Record. This in turn links to Artifacts, which describe our working data set.&lt;/p>
&lt;p>One such Artifact is the humbly named &lt;code>domain-decision-structure&lt;/code>. This is a big tree that records DOI prefixes, domain names, and how they&amp;rsquo;re connected. It includes information such as &amp;ldquo;some DOIs with the prefix &lt;code>10.31139&lt;/code> redirect to the domain &lt;code>polishorthopaedics.pl&lt;/code>, and we can confirm that pages on that domain correctly represent their DOI&amp;rdquo;. We produce this list by visiting a sample of DOIs from every known prefix. We then ask the following questions:&lt;/p>
&lt;ol>
&lt;li>Which webpage does this DOI redirect to, and what domain name does it have?&lt;/li>
&lt;li>Does the webpage include its correct DOI in the HTML metadata?&lt;/li>
&lt;/ol>
&lt;p>From this we build the Artifact that records &lt;code>prefix → domain&lt;/code> relationships, along with a flag to say whether or not the domain correctly represents its DOI in at least one case. You can put this data to a number of uses, but we use it to help inform our URL to DOI matching.&lt;/p>
&lt;h2 id="what-agents-do">What Agents do&lt;/h2>
&lt;p>The Agents use the domain list to search for links. For example, the Reddit Agent uses it to query for new discussions about websites on each domain. They then pass this data to the Percolator, which is the machinery that produces Events.&lt;/p>
&lt;p>The Percolator takes each input, whether it&amp;rsquo;s a blog post or a Tweet, and extracts links. If it finds a DOI link, that&amp;rsquo;s low hanging fruit. It then looks for links to URLs on one of the domains in the list. All of these are considered to be candidate landing page URLs. Once it has found a set of candidate links in the webpage it then has to find which ones correspond to DOIs, and validate that correspondence.&lt;/p>
&lt;p>For each candidate URL it follows the link and retrieves the webpage. It looks in the HTML metadata, specifically in the &lt;code>&amp;lt;meta name='dc.identifier' content='10.5555/12345678' &amp;gt;&lt;/code>, to see if the article indicates its DOI. It also looks in the webpage to see if it reports its DOI in the body text.&lt;/p>
&lt;h2 id="not-so-fast">Not so fast&lt;/h2>
&lt;p>But can you trust the web page to indicate its own DOI? What about sites that say that they have a DOI belonging to another member? What about those pages that have invalid or incorrect DOIs? These situations can, and do, occur.&lt;/p>
&lt;p>We have the following methods at our disposal, in order of preference.&lt;/p>
&lt;ol>
&lt;li>&lt;code>doi-literal&lt;/code> - This is the most reliable, and it indicates that the URL we found in the webpage was a DOI not a landing page. We didn&amp;rsquo;t even have to visit the article page.&lt;/li>
&lt;li>&lt;code>pii&lt;/code> - The input was a PII (Publisher Item Identifier). We used our own metadata to map this into a DOI.&lt;/li>
&lt;li>&lt;code>landing-page-url&lt;/code> - We thought that the URL was the landing page for an article. Some webpages actually contain the DOI embedded in URL. So we don&amp;rsquo;t even have to visit the page.&lt;/li>
&lt;li>&lt;code>landing-page-meta-tag&lt;/code> - We had to visit the article landing page. We found a meta tag, eg. &lt;code>dc.identifier&lt;/code>, indicating the DOI.&lt;/li>
&lt;li>&lt;code>landing-page-page-text&lt;/code> - We visited the webpage but there was no meta tag. We did find a DOI in the body text and we think this is the DOI for this page. This is the least reliable.&lt;/li>
&lt;/ol>
&lt;p>On top of this, we have a number of steps of validation. Again, these are listed in order of preference.&lt;/p>
&lt;ol>
&lt;li>&lt;code>literal&lt;/code> - We found a DOI literal, so we didn&amp;rsquo;t have to do any extra work. This is the most reliable.&lt;/li>
&lt;li>&lt;code>lookup&lt;/code> - We looked up the PII in our own metadata, and we trust that.&lt;/li>
&lt;li>&lt;code>checked-url-exact&lt;/code> - We visited the landing page and found a DOI. We visited that DOI and confirmed that it does indeed lead back to this landing page. We are therefore confident that this is the correct DOI for the landing page URL.&lt;/li>
&lt;li>&lt;code>checked-url-basic&lt;/code> - We visited the DOI and it led back to &lt;em>almost&lt;/em> the same URL. The protocol (http vs https), query parameters or upper / lower case may be different. This can happen if tracking parameters are automatically added by the website meaning the URLs are no longer identical. We are still quite confident in the match.&lt;/li>
&lt;li>&lt;code>confirmed-domain-prefix&lt;/code> - We were unable to check the link between the DOI and the landing page URL, so we had to fall back to previously observed data. On previous occasions we have seen that DOIs with the given prefix (e.g. &amp;ldquo;10.5555&amp;rdquo;) redirect to webpages with the same domain (e.g. &amp;ldquo;&lt;a href="https://www.example.com" target="_blank">www.example.com&lt;/a>&amp;rdquo;) and those websites correctly report their DOIs in meta tags. Only the domain and DOI prefix are considered. We therefore believe that the domain reliably reports its own DOIs correctly in at least some cases.&lt;/li>
&lt;li>&lt;code>recognised-domain-prefix&lt;/code> - On previous occasions we have seen that DOIs with the given prefix (e.g. &amp;ldquo;10.5555&amp;rdquo;) redirect to webpages with the same domain (e.g. &amp;ldquo;&lt;a href="https://www.example.com" target="_blank">www.example.com&lt;/a>&amp;rdquo;). Those websites do not always correctly report their DOIs in meta tags. This is slightly less reliable.&lt;/li>
&lt;li>&lt;code>recognised-domain&lt;/code> - On previous occasions we have seen that this domain is associated with DOIs in general. This is the least reliable.&lt;/li>
&lt;/ol>
&lt;p>We record the method we used to find the DOI, and the way we verified it, right in the Event. Look in the &lt;code>obj.method&lt;/code> and &lt;code>obj.verification&lt;/code> fields.&lt;/p>
&lt;p>Of course, there&amp;rsquo;s a flowchart.&lt;/p>
&lt;img class="img-responsive" src="https://www.crossref.org/images/blog/2019/whats-that-doi/landing-page-flow.png">
&lt;p>You can take a closer look in the &lt;a href="https://www.eventdata.crossref.org/guide/data/matching-landing-pages/" target="_blank">User Guide&lt;/a>.&lt;/p>
&lt;p>If you think that&amp;rsquo;s a bit long-winded, well, you&amp;rsquo;re right. But it does enable us to capture DOI links without giving a false sense of security.&lt;/p>
&lt;h2 id="so-what-happens">So, what happens?&lt;/h2>
&lt;p>If you &lt;a href="http://api.eventdata.crossref.org/v1/events/distinct?from-collected-date=2019-01-01&amp;amp;until-collected-date=2019-01-20&amp;amp;rows=0&amp;amp;facet=obj.url.domain:10" target="_blank">ask the Event Data Query API&lt;/a> for the top ten domains that we matched to DOIs in the first 20 days of January 2019, it would tell you:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Domain&lt;/th>
&lt;th>Number of Events captured&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;code>doi.org&lt;/code>&lt;/td>
&lt;td>2058433&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>dx.doi.org&lt;/code>&lt;/td>
&lt;td>242707&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>www.nature.com&lt;/code>&lt;/td>
&lt;td>170808&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>adsabs.harvard.edu&lt;/code>&lt;/td>
&lt;td>163387&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>www.sciencedirect.com&lt;/code>&lt;/td>
&lt;td>96849&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>onlinelibrary.wiley.com&lt;/code>&lt;/td>
&lt;td>88760&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>link.springer.com&lt;/code>&lt;/td>
&lt;td>63869&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>www.tandfonline.com&lt;/code>&lt;/td>
&lt;td>41911&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>www.sciencemag.org&lt;/code>&lt;/td>
&lt;td>39489&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;code>academic.oup.com&lt;/code>&lt;/td>
&lt;td>39267&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Here we see a healthy showing for actual DOIs (which you can explain by Wikipedia&amp;rsquo;s excellent use of DOIs) followed by some of the larger publishers. This demonstrates that we&amp;rsquo;re capturing a healthy number of Events from Wikipedia pages, tweets, blog posts etc that reference landing pages.&lt;/p>
&lt;h2 id="awkward-questions">Awkward questions&lt;/h2>
&lt;p>This is not a perfect process. The whole point of PIDs is to unambiguously identify content. When users don&amp;rsquo;t use PIDs, there will inevitably be imperfections. But because we collect and make available all the processing along the way, hopefully we can go back to the old data, or allow any researchers to try and squeeze more information out of the data.&lt;/p>
&lt;h3 id="q-why-bother-with-all-of-this-cant-you-just-use-the-urls">Q: Why bother with all of this? Can&amp;rsquo;t you just use the URLs?&lt;/h3>
&lt;p>We care about persistent identifiers. They are stable identifiers, which means they don&amp;rsquo;t change over time. The same DOI will always refer to the same content. In contrast, publishers&amp;rsquo; landing pages can and do change their URLs over time. If we didn&amp;rsquo;t use the DOIs then our data would suffer from link-rot.&lt;/p>
&lt;p>DOIs are also compatible across different services. You can use the DOI for an article to look it up in metadata and citation databases, and to make connections with other services.&lt;/p>
&lt;p>This is not the only solution to the problem. Other services out there, such as Cobalt Metrics, do record the URLs and store an overlaid data set of identifier mappings. At Crossref we have a specific focus on our members and their content, and we all subscribe to the value of persistent identifiers for their content.&lt;/p>
&lt;p>Of course, we don&amp;rsquo;t throw anything away. The URLs are still included in the Events. Look in the &lt;code>obj.url&lt;/code> field.&lt;/p>
&lt;h3 id="q-if-dois-are-so-amazing-why-keep-urls">Q: If DOIs are so amazing why keep URLs?&lt;/h3>
&lt;p>Event Data is useful to a really wide range of users. Some will need DOIs to work with the data. But others, who may want to research the stuff under the hood, such as the behaviour of social media users, or the processes we employ, may want to know more detail. So we include it all.&lt;/p>
&lt;h3 id="q-cant-you-just-decide-for-me">Q: Can&amp;rsquo;t you just decide for me?&lt;/h3>
&lt;p>In a way, we do. If an Event is included in our data set, we are reasonably confident that it belongs there. All we are doing is providing you with more information.&lt;/p>
&lt;h3 id="q-why-only-dois">Q: Why only DOIs?&lt;/h3>
&lt;p>We specialise in DOIs and believe they are the right solution for unambiguously and persistently identifying content. Furthermore the content registered with Crossref has been done so for the specific benefits that DOIs bring.&lt;/p>
&lt;h3 id="q-what-about-websites-that-require-cookies-andor-javascript-to-execute">Q: What about websites that require cookies and/or JavaScript to execute?&lt;/h3>
&lt;p>Some sites don&amp;rsquo;t work unless you allow your browser to accept cookies. Some sites don&amp;rsquo;t render any content unless you allow their JavaScript to execute. Large crawlers, like Google, emulate web browsers when they scrape content, but it&amp;rsquo;s resource-intensive and not everyone has the resources of Google!&lt;/p>
&lt;p>This is an issue we&amp;rsquo;ve known about for a while. My talk &lt;a href="https://doi.org/10.64000/jw4t5-5yt89" target="_blank">two years ago&lt;/a> was about precisely this topic. We know it&amp;rsquo;s a hurdle we&amp;rsquo;ll have to overcome at some point. We do have plans to look into it, but we haven&amp;rsquo;t found a sufficiently cost-effective and reliable way to do it yet.&lt;/p>
&lt;p>Any sites that do do this will be inherently less reliable, so we recommend everyone to put their Dublin Core Identifiers in the HTML, render your HTML server-side (which is the default way of doing things) and don&amp;rsquo;t require cookies.&lt;/p>
&lt;h3 id="q-whats-the-success-rate">Q: What&amp;rsquo;s the success rate?&lt;/h3>
&lt;p>This is an interesting question. The results aren&amp;rsquo;t black and white. At the low end of the confidence spectrum we do have a cut-off point, at which we don&amp;rsquo;t generate an Event. But when we do create one we qualify it by describing the method we used to match and verify the connection. What level of confidence you want to trust is for you to decide. We just describe the steps we took to verify it.&lt;/p>
&lt;p>It&amp;rsquo;s tricky quantifying false negatives. We have plenty of unmatched links, but not every unmatched link even could be matched to a DOI, for example there are some domains that have some DOI-registered content mixed with non-registered content.&lt;/p>
&lt;p>We therefore err on the side of optimism, and let users choose what level of verification they require.&lt;/p>
&lt;p>So talking of false positives or false negatives is a complicated question. We&amp;rsquo;ve not done any analytical work on this yet, but would welcome any input from the community.&lt;/p>
&lt;h3 id="q-why-isnt-the-domain-decision-structure-artifact-more-detailed">Q: Why isn&amp;rsquo;t the &lt;code>domain-decision-structure&lt;/code> Artifact more detailed?&lt;/h3>
&lt;p>We looked into various ways of constructing this, including more detailed statistics. At the end of the day our processes have to be understandable and easy to re-use. The process already takes a flow-chart to understand, and we felt that we got the balance right. Of course, as a user of this data, you are welcome to further refine and verify it.&lt;/p></description></item><item><title>Improved processes, and more via Metadata Manager</title><link>https://www.crossref.org/blog/improved-processes-and-more-via-metadata-manager/</link><pubDate>Thu, 17 Jan 2019 00:00:00 +0000</pubDate><author>Shayn Smulyan</author><guid>https://www.crossref.org/blog/improved-processes-and-more-via-metadata-manager/</guid><description>&lt;p>Hi, Crossref blog-readers. I’m &lt;a href="https://www.crossref.org/people/shayn-smulyan/">Shayn&lt;/a>, from Crossref’s support team. I’ve been fielding member questions about how to effectively deposit metadata and register content (among other things) for the past three years. In this post, I’ll take you through some of the improvements that Metadata Manager provides to those who currently use the &lt;a href="https://apps.crossref.org/webDeposit/" target="_blank">Web Deposit form&lt;/a>.&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.64000/99444-1qs40" target="_blank">We recently announced the launch of Metadata Manager&lt;/a>, a new tool from Crossref that makes it easier for you to submit robust, accurate, and thorough metadata for the content you register. Metadata Manager already covers journals and articles; more record types will be supported soon. It offers some extra features that will make your experience less stressful, make your metadata better, and ultimately make your content more discoverable.&lt;/p>
&lt;p>Metadata Manager has the potential to improve your metadata registration experience in a number of ways:&lt;/p>
&lt;ul>
&lt;li>by correcting one-off errors in previously registered metadata&lt;/li>
&lt;li>by directly allowing you to add references, license data, funder information, or any other ancillary metadata to items that have previously been registered&lt;/li>
&lt;li>by updating Crossmark data, in the case of a retraction or withdrawal&lt;/li>
&lt;/ul>
&lt;h2 id="login-first-not-last">Login first, not last&lt;/h2>
&lt;p>With the Web Deposit form, you finish entering your metadata for a new issue of your journal, and then get asked for your password, and of course that&amp;rsquo;s when you realize you&amp;rsquo;ve forgotten it (it happens a lot!). With &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a>, the very first step is to log in, so you know your login credentials are accurate before you get down to the task of entering your metadata.&lt;/p>
&lt;h2 id="easily-import-journals-or-add-new-ones">Easily import journals, or add new ones&lt;/h2>
&lt;p>When you switch to Metadata Manager, you can import the journals already associated with your account. Simply go to the search bar on the Home screen, search for your journal by title, then click ’Add’. If you are registering your first article for a journal that you’ve not registered before, you can add the journal information on the Home screen, by clicking “New Publication”.&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/shayn-mm2.png" alt="metadata manager home screen" width="600" class="img-responsive" />&lt;/center>
&lt;h2 id="adding-a-journal-doi">Adding a Journal DOI&lt;/h2>
&lt;p>In the Web Deposit form, the Journal DOI is optional, as long as you include a valid ISSN. However, with Metadata Manager, &lt;strong>a Journal DOI must be created for each journal you register&lt;/strong>. So, you need to enter a Journal DOI and a Journal URL for each of your journals before your deposits can be submitted. The Journal DOI won’t become active until you submit your first successful deposit for an article within that journal.&lt;/p>
&lt;p>If you’ve never registered a Journal DOI before and are unsure what to use for your Journal DOI’s suffix, take a look at our suggested &lt;a href="https://support.crossref.org/hc/en-us/articles/214669823-Constructing-your-identifiers" target="_blank">best practice for constructing DOI suffixes&lt;/a>.&lt;/p>
&lt;h2 id="adding-new-articles">Adding new articles&lt;/h2>
&lt;p>Once your journal is added, the process of adding articles in Metadata Manager should be familiar, as it’s similar to the Web Deposit form process. You type in or paste as plain text (without formatting) all your relevant, accurate, and thorough metadata into the appropriate fields in the form.&lt;/p>
&lt;h2 id="save-your-work-as-you-go">Save your work as you go&lt;/h2>
&lt;p>In Metadata Manager there is no need to complete a full issue’s worth of articles at once. And, you don’t need to worry about losing your progress if you accidentally close your browser window, or your laptop runs out of battery while you’re in the middle of a deposit. You can simply and easily ‘save-as-you-go’, one article at a time, until you’re ready to submit them all. You can even review your saved metadata to make sure there aren’t any errors before the deposit is finalized.&lt;/p>
&lt;h2 id="other-metadata-fields-you-didnt-know-you-needed-but-you-do">Other metadata fields you didn’t know you needed (but you do!)&lt;/h2>
&lt;p>Have you ever wanted to add an abstract to your content’s metadata? How about license information, so that other organisations know what they can and can’t do with the work? Does your journal use article ID numbers instead of page numbers? These are all elements that can be added to Metadata Manager that were not available in the Web Deposit form. Additionally, you can add funding data, Similarity Check links, and &lt;a href="https://support.crossref.org/hc/en-us/articles/214357426-Relationships-between-DOIs-and-other-objects" target="_blank">relationships between your articles and other content&lt;/a>. These types of metadata are hugely valuable for &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">building a robust, interconnected web of scholarly communication&lt;/a>.&lt;/p>
&lt;h2 id="adding-references">Adding references&lt;/h2>
&lt;p>Unlike the Web Deposit form, Metadata Manager allows you to easily add references to your article’s metadata—this is an important requirement for participating in our &lt;a href="https://www.crossref.org/services/cited-by/">Cited-by&lt;/a> service.&lt;/p>
&lt;p>To add references to an article’s metadata, you can copy and paste its reference list into the references field on the same screen as the rest of the article metadata (as per the image below).&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/shayn-mm1.png" alt="metadata manager home screen" width="600" class="img-responsive" />&lt;/center>
&lt;p>Metadata Manager will match DOIs to those references (where available), and include the full list in your record. So, if you’ve been putting off participating in Cited-by because the reference deposit requirement was too much of a hassle, we hope this will help ease the way! The more references everyone registers, the more robust our Cited-by counts and Cited-by data become.&lt;/p>
&lt;h2 id="edit-mistakes-without-having-to-re-enter-all-your-metadata">Edit mistakes without having to re-enter all your metadata&lt;/h2>
&lt;p>Mistakes happen. Sometimes you put an author’s first name in the last name field. Sometimes you copy and paste some stray HTML tags into your abstract. You might break a link by leaving a space in the middle of a URL, or enter the first-page number as 3170 instead of 317.&lt;/p>
&lt;blockquote>
&lt;p>With Metadata Manager you can fix any errors quickly and easily right in the interface, then just click to redeposit the article with its metadata corrected. You won’t need to re-enter all the metadata or worry about editing the XML files directly.&lt;/p>
&lt;/blockquote>
&lt;p>We’ll have another blog post coming soon that will be devoted entirely to updating, correcting, or otherwise editing metadata for already-registered DOIs in Metadata Manager.&lt;/p>
&lt;h2 id="find-out-immediately-if-your-registration-was-successful">Find out immediately if your registration was successful&lt;/h2>
&lt;p>When you have finished adding the metadata for your articles, navigate to the “To deposit” section and click ‘Deposit’ to submit them. Instead of having to wait for your content to go through our processing queue, you’ll get immediate feedback. The number of Accepted and Failed deposits show immediately. Any articles which have failed are clearly marked with a red triangle icon and an explanation for the error. If you don’t understand an error message or how to correct the metadata, please contact us at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>.&lt;/p>
&lt;p>To get started with Metadata Manager take a look at our &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">full help documentation&lt;/a>.&lt;/p>
&lt;hr></description></item><item><title>Resolutions 2019: Journal Title Transfers = Metadata Manager</title><link>https://www.crossref.org/blog/resolutions-2019-journal-title-transfers-metadata-manager/</link><pubDate>Thu, 03 Jan 2019 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/resolutions-2019-journal-title-transfers-metadata-manager/</guid><description>&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>UPDATE, 12 December 2022&lt;br>
&lt;em>Due to the scheduled &lt;a href="https://doi.org/10.64000/30vzx-r5x16" target="_blank">sunsetting of Metadata Manager&lt;/a>, this title transfer process has been deprecated. Please find detailed guidance for transferring titles on our documentation site &lt;a href="https://www.crossref.org/documentation/register-maintain-records/creating-and-managing-dois/transferring-responsibility-for-dois/">here&lt;/a>.&lt;/em>&lt;/span>
&lt;/div>
&lt;p>When you thought about your resolutions for 2019, Crossref probably didn’t cross your mind—but, maybe it should have&amp;hellip;&lt;/p>
&lt;p>Because we know—with a high level of certainty—that &lt;a href="https://www.crossref.org/people/shayn-smulyan/">Shayn&lt;/a>, &lt;a href="https://www.crossref.org/people/paul-davis/">Paul&lt;/a> and I will be spending the first few weeks of the year transferring the ownership of many journal titles. Last year we processed almost 60 journal transfer requests during this time, and we’re heading toward a similar number for 2019. There’s no objection; it’s a just a fact. We’re happy to do it, but there is another way.&lt;/p>
&lt;p>Unlike previous years, we now have a tool that gives you the control to transfer titles without any intervention from the Crossref support team—&lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a>. With just a few clicks, you, as the disposing publisher, can transfer your journal to the acquiring publisher yourself. Here’s how:&lt;/p>
&lt;h3 id="transferring-your-journal-in-five-easy-steps-using-metadata-manager">Transferring your journal in five easy steps using Metadata Manager:&lt;/h3>
&lt;ol>
&lt;li>Log into &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a> using your username and password (the same one you use for the Crossref Web Deposit form).&lt;/li>
&lt;/ol>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/mm-home.png" alt="metadata manager home screen" width="600" class="img-responsive" />&lt;/center>
&lt;ol start="2">
&lt;li>Find the journal you’re transferring on your Metadata Manager workspace using the “search publications” box and click to load the journal’s container (or, dashboard).&lt;/li>
&lt;/ol>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/mm-journal.png" alt="select journal" width="600" class="img-responsive" />&lt;/center>
&lt;ol start="3">
&lt;li>Within the journal container, select &lt;strong>Transfer Title&lt;/strong> from the &lt;strong>Action&lt;/strong> drop-down.&lt;/li>
&lt;/ol>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/mm-action.png" alt="action on drop down menu" width="600" class="img-responsive" />&lt;/center>
&lt;ol start="4">
&lt;li>On the transfer title screen select the acquiring (destination) publisher’s name and DOI prefix of where ownership will be transferred to. Click &lt;strong>Transfer&lt;/strong>.&lt;/li>
&lt;/ol>
&lt;p>(In addition to transferring ownership of the title itself, all existing journal article DOIs previously registered will also be transferred to the new owner using this mechanism. They will persist on their original prefix, but the acquiring publisher will be able to update the metadata associated with these DOIs).&lt;/p>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/mm-transfer.png" alt="transfer to new owner" width="600" class="img-responsive" />&lt;/center>
&lt;ol start="5">
&lt;li>&lt;strong>Confirm&lt;/strong> the title transfer. It may take up to 24 hours for the transfer to be reflected within Metadata Manager, and we’ll send a courtesy email to the acquiring (destination) publisher’s technical contact when the transfer has been completed.&lt;/li>
&lt;/ol>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/mm-confirm.png" alt="confirm transfer" width="600" class="img-responsive" />&lt;/center>
&lt;p>As always, if you have questions, need guidance as you’re working through this process, or have recommendations on how we can improve title transfers—or anything else within Metadata Manager (the tool is in beta)–please let us know at &lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>. There’s also comprehensive &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">support documentation&lt;/a> available for Metadata Manager to help and guide you.&lt;/p></description></item><item><title>LIVE18: Roaring attendees, incomplete zebras, and missing tablecloths</title><link>https://www.crossref.org/blog/live18-roaring-attendees-incomplete-zebras-and-missing-tablecloths/</link><pubDate>Thu, 27 Dec 2018 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/live18-roaring-attendees-incomplete-zebras-and-missing-tablecloths/</guid><description>&lt;p>Running a smooth event is always the goal, but not always the case! No matter how well managed an event is, there is always a chance that things will not go according to plan. And so it was with LIVE18.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/notablecloth.png" alt="image of tables" width="325" >
&lt;/div>
&lt;p>For the first day we were without the tablecloths we had ordered, which actually gave the room quite a nice, but unintentional, ‘rustic’ look. When they finally did arrive the following day, we realized we preferred the rustic look! Some of the merchandise we had prepared ended up sitting in Canadian Customs for a day and a half, which meant they arrived to us halfway through the first day of the event. Luckily attendees were distracted by the very cool ‘I heart metadata’ bags and didn’t seem to notice.&lt;/p>
&lt;p>Unfortunately a significant number of registrants also had problems with Canadian regulations: they were denied visas to enter the country. Despite always trying to choose countries with international airport hubs and a welcoming policy, this was an unforseen blow.&lt;/p>
&lt;p>But from setting up to take down, LIVE18 was truly a team effort. Even though many Crossref staff had traveled far and wide to get there, they all rallied to help the night before—hauling boxes through the streets of Toronto, stuffing attendee bags, hanging signage, and moving furniture around until 11:30 pm.&lt;/p>
&lt;p>Because of these efforts&amp;mdash;and despite the glitches&amp;mdash;Crossref LIVE18 was a great success.&lt;/p>
&lt;h2 id="how-good-is-your-metadata">How good is your metadata?&lt;/h2>
&lt;p>That was the framing question at Crossref LIVE 18 in Toronto which this year focused on all things metadata. Over the course of the two-day event, we heard from guest speakers on the importance of collaboration, the significance of metadata to metrics, and what good metadata looks like. In our usual lively way, Crossref staff introduced a variety of new services, initiatives, and collaborations.&lt;/p>
&lt;blockquote>
&lt;p>&lt;a href="https://www.crossref.org/crossref-live-annual">Crossref LIVE&lt;/a> is helping surface key issues in the cleanup of metadata mismatch, after decades of the industry working in silos. I applaud Crossref for doing this. It’s great that we’re considering how to change the way we work and collaborate as an industry to make sure that we don’t run into metadata issues in this way again.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;em>- Keynote speaker, Kristen Ratan, Co-Founder of the Collaborative Knowledge Foundation (Coko)&lt;/em>&lt;/p>
&lt;p>In her keynote speech, ‘Publishing Infrastructure: The Good, The Bad, and The Expensive’, Coko’s Kristen Ratan challenged the industry to rethink its slow, inefficient, and expensive resignation to infrastructure; and instead consider how a collaborative approach to sharing expertise in developing community-owned infrastructure could be faster, more flexible, and less costly.&lt;/p>
&lt;ul>
&lt;li>View Kristen’s talk, &lt;a href="https://youtu.be/V_Y5uSCL4ec" target="_blank">The Good, The Bad, and The Expensive&lt;/a>&lt;/li>
&lt;/ul>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/cruse-ror.png" alt="image of Patricia Cruse" width="350" >
&lt;/div>
&lt;h2 id="the-collaborations">The collaborations&lt;/h2>
&lt;p>Collaboration was a running theme at LIVE18. Geoffrey Bilder provided an overview of Crossref’s selective collaborations; DataCite’s Patricia Cruse introduced &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a>, the community project to develop an open, sustainable, usable and unique identifier for every research organisation in the world—and she got the crowd really engaged at the beginning of her talk by encouraging us all to ROAR out loud!; Clare Dean and Ravit David sketched out the evolution of &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a>, and Shelley Stall from the AGU introduced the ways they are urging the scientific community to adopt FAIR data principles (using her first data collection as an 11-year-old as an example!)&lt;/p>
&lt;ul>
&lt;li>View Geoffrey’s talk, &lt;a href="https://youtu.be/3_s6M9OKWp0" target="_blank">How Crossref (selectively) collaborates with others&lt;/a>&lt;/li>
&lt;li>View Patricia’s talk, &lt;a href="https://youtu.be/TknM8YaTl8M" target="_blank">ROR: The Research Oragnization Registry&lt;/a> (Roar!) 🦁&lt;/li>
&lt;li>View Clare and Ravit’s talk, &lt;a href="https://youtu.be/QjvpQNwEmA8" target="_blank">Metadata 2020: This talk is sooo meta&lt;/a>&lt;/li>
&lt;li>View Shelley’s talk, &lt;a href="https://youtu.be/VvZpTLjGWxs" target="_blank">My first data collection: Was it FAIR?&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="the-solutions">The solutions&lt;/h2>
&lt;ul>
&lt;li>Patricia Feeney, in the newly-created role of Head of Metadata, used a zebra to illustrate that not all of a publisher’s metadata is deposited with Crossref.&lt;/li>
&lt;li>View Patricia’s talk, &lt;a href="https://youtu.be/RHUCf3p-TUk" target="_blank">I am the boss of your Metadata&lt;/a> (this one has the zebras) and also her talk on &lt;a href="https://youtu.be/DHd6oRJiVE8" target="_blank">New resource/record types in the works at Crossref&lt;/a>.&lt;/li>
&lt;/ul>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/crossref-zebra-unicorn-comic-strip.png" width="80%" /> &lt;/center>
&lt;h2 id="new-tools">New tools&lt;/h2>
&lt;p>Jennifer Lin introduced &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a>, the new API that Crossref and DataCite have built together, enabling organisations to capture what happens to a DOI, including all of the places it is mentioned and links from/to. She also talked about &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a>, the new open dashboard to help members evaluate the completeness of their own metadata deposited with Crossref.&lt;/p>
&lt;ul>
&lt;li>View Jennifer’s talks on &lt;a href="https://youtu.be/IkaNajvRXGY" target="_blank">Event Data&lt;/a>, and &lt;a href="https://youtu.be/c3oo31VLsiA" target="_blank">Simplifying our services&lt;/a>&lt;/li>
&lt;/ul>
&lt;h2 id="the-community">The community&lt;/h2>
&lt;p>We also heard from the community. Paul Dlug from the American Physical Society boldly gave his view on ‘Why Crossref sucks’, and, with a view to helping Crossref improve in key areas, surfaced issues that members struggle with. Ed Pentz, Executive Director, provided an overview of the direction that Crossref is headed towards. Ginny Hendricks, Director of Member &amp;amp; Community Outreach, updated everyone on the expanding Crossref community and all the outreach activities her team conducts to engage them. Isaac Farley, new Technical Support Manager in the community team, told of his vision for moving to a more public, open, support model. Lisa Hart, Director of Finance &amp;amp; Operations announcing the results of our members votes in this year&amp;rsquo;s &lt;a href="https://www.crossref.org/board-and-governance/">board election&lt;/a>.&lt;/p>
&lt;ul>
&lt;li>View Paul’s talk, &lt;a href="https://youtu.be/TrYAsX4vjU0" target="_blank">Crossref sucks and how to cope!&lt;/a>&lt;/li>
&lt;li>View Ed’s talk, &lt;a href="https://youtu.be/z3sZVVvSExg" target="_blank">Our strategic direction&lt;/a>&lt;/li>
&lt;li>View Ginny&amp;rsquo;s talk, &lt;a href="https://youtu.be/RtaJq-NUFJE" target="_blank">Expanding our constituencies&lt;/a>&lt;/li>
&lt;li>View Isaac&amp;rsquo;s talk, &lt;a href="https://youtu.be/4F8Cv9NTaRQ" target="_blank">Open Support: From 1:1 to everyone&lt;/a>&lt;/li>
&lt;/ul>
&lt;h3 id="the-perspectives">The perspectives&lt;/h3>
&lt;p>Guest speakers provided a range of fascinating perspectives from across scholarly communications. Graham Nott, who works with eLife, outlined how they were making their JATS to Crossref schema conversion tool openly available to the community for use. Jodi Schneider, Assistant Professor of Information Sciences at the University of Illinois at Urbana-Champaign, gave us an in-depth look at problem citations, with a focus on retractions. Bianca Kramer from Utrecht University discussed Crossref metadata use in an open scholarly ecosystem. Stephanie Haustein from the University of Ottawa gave a researcher perspective on the problems with traditional journal metrics, and how they are dependent on metadata, which is essentially flawed. She outlined her efforts to increase metrics literacy, putting metrics in context with comprehensive metadata. Geoffrey Bilder talked about Dominika&amp;rsquo;s work to evaluate our reference matching, and finally closed the show discussing the role of metadata in creating a provenance infrastructure, providing trustworthiness which is essential to progress the scholarly research cycle.&lt;/p>
&lt;ul>
&lt;li>View Graham’s talk, &lt;a href="https://youtu.be/W0xaEw4FDjs" target="_blank">JATS at eLife&lt;/a>&lt;/li>
&lt;li>View Jodi’s talk, &lt;a href="https://youtu.be/vCQexoeGqjY" target="_blank">Trouble at The Academy: Problem Citations&lt;/a>&lt;/li>
&lt;li>View Bianca’s talk, &lt;a href="https://youtu.be/IOMn5Brzxzs" target="_blank">DOIs for whom? Crossref metadata in an open scholarly ecosystem&lt;/a>&lt;/li>
&lt;li>View Stephanie’s talk, &lt;a href="https://youtu.be/tlwSt9P4feo" target="_blank">Good metadata + metrics literacy = better academia&lt;/a>&lt;/li>
&lt;li>View Geoffrey’s talks on &lt;a href="https://youtu.be/sq00YZt8TxQ" target="_blank">Reference matching&lt;/a>, and &lt;a href="https://youtu.be/MLCAVbwBL5A" target="_blank">Metadata as a signal of trust&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>As LIVE18 came to a close we took the opportunity to acknowledge and thank everyone once again for helping us reach the milestone of 100 million registered content items this September. Everyone took to the stage and waved their Crossref Bigger Ambitions flags.&lt;/p>
&lt;h2 id="thank-you-to-everyone-who-participated-in-the-event-please-save-the-dates-for-live19-in-europe-on-13-14-november-2019">Thank you to everyone who participated in the event. Please save the dates for LIVE19 in Europe on 13-14 November, 2019!&lt;/h2>
&lt;center>&lt;img src="https://www.crossref.org/images/blog/100milgroup-small.png" alt="group of people holding flags" width="600" class="img-responsive" />&lt;/center></description></item><item><title>Reference matching: for real this time</title><link>https://www.crossref.org/blog/reference-matching-for-real-this-time/</link><pubDate>Tue, 18 Dec 2018 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/reference-matching-for-real-this-time/</guid><description>&lt;p>In my previous blog post, &lt;a href="https://doi.org/10.64000/pdm9z-20m09" target="_blank">Matchmaker, matchmaker, make me a match&lt;/a>, I compared four approaches for reference matching. The comparison was done using a dataset composed of automatically-generated reference strings. Now it&amp;rsquo;s time for the matching algorithms to face the real enemy: the &lt;strong>unstructured reference strings&lt;/strong> deposited with Crossref by some members. Are the matching algorithms ready for this challenge? Which algorithm will prove worthy of becoming the guardian of the mighty citation network? Buckle up and enjoy our second matching battle!&lt;/p>
&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;ul>
&lt;li>I evaluated and compared four reference matching approaches: the legacy approach based on reference parsing, and three variants of search-based matching.&lt;/li>
&lt;li>The dataset comprises 2,000 unstructured reference strings from the Crossref metadata.&lt;/li>
&lt;li>The metrics are &lt;a href="https://en.wikipedia.org/wiki/Precision_and_recall" target="_blank">precision and recall&lt;/a> calculated over the citation links. I also use &lt;a href="https://en.wikipedia.org/wiki/F1_score" target="_blank">F1&lt;/a> as a standard single-number metric that combines precision and recall, weighing them equally.&lt;/li>
&lt;li>The best variant of &lt;strong>search-based matching outperforms the legacy approach in F1 (96.3% vs. 92.5%)&lt;/strong>, with the precision worse by only 0.9% (98.09% vs. 98.95%), and the recall better by 8.9% (94.56% vs. 86.85%).&lt;/li>
&lt;li>Common causes of SBMV&amp;rsquo;s errors are: incomplete/erroneous metadata of the target documents, and noise in the reference strings.&lt;/li>
&lt;li>The results reported here generalize to the subset of references in Crossref that are deposited without the target DOI and are present in the form of unstructured strings.&lt;/li>
&lt;/ul>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>In reference matching, we try to find the DOI of the document referenced by a given input reference. The input reference can have a structured form (a collection of metadata fields) and/or an unstructured form (a string formatted in a certain citation style).&lt;/p>
&lt;p>In my &lt;a href="https://doi.org/10.64000/pdm9z-20m09" target="_blank">previous blog post&lt;/a>, I used reference strings generated automatically to compare four matching algorithms: Crossref&amp;rsquo;s legacy approach based on reference parsing and three variations of search-based matching. The best algorithm turned out to be Search-Based Matching with Validation (SBMV). SBMV uses our &lt;a href="https://search.crossref.org" target="_blank">REST API&amp;rsquo;s bibliographic search function&lt;/a> to select the candidate target documents, and a separate validation-scoring procedure to choose the final target document. The legacy approach and SBMV achieved very similar average precision, and SBMV was much better in average recall.&lt;/p>
&lt;p>This comparison had important limitations, which affect the interpretation of these results.&lt;/p>
&lt;p>First of all, the reference strings in the dataset might be too perfect. Since they were generated automatically from the Crossref metadata records, any piece of information present in the string, such as the title or the name of the author, will exactly match the information in Crossref&amp;rsquo;s metadata. In such a case, a matcher comparing the string against the record can simply apply exact matching and everything should be fine.&lt;/p>
&lt;p>In real life, however, we should expect all sorts of errors and noise in the reference strings. For example, a string might have been manually typed by a human, so it can have typos. The string might have been scraped from the PDF file, in which case it could have unusual unicode characters, &lt;a href="https://en.wikipedia.org/wiki/Typographic_ligature" target="_blank">ligatures&lt;/a> or missing and extra spaces. A string can also have typical OCR errors, if it was extracted from a scan.&lt;/p>
&lt;p>These problems are typical for messy real-life data, and our matching algorithms should be robust enough to handle them. However, when we evaluate and compare approaches using the perfect reference strings, the results won&amp;rsquo;t tell us how well the algorithms handle harder, noisy cases. After all, even if you repeatedly win chess games against your father, it doesn&amp;rsquo;t mean you will likely defeat Garry Kasparov (unless, of course, you are Garry Kasparov&amp;rsquo;s child, in which case, please pass on our regards to your dad!).&lt;/p>
&lt;p>Even though I attempted to make the data more similar to the noisy real-life data by simulating some of the possible errors (typos, missing/extra spaces) in two styles, this might not be enough. We simply don&amp;rsquo;t know the typical distribution of the errors, or even what all the possible errors are, so our data was probably still far from the real, noisy reference strings.&lt;/p>
&lt;p>The differences in the distributions are a second major issue with the previous experiment. To build the dataset, I used a random sample from Crossref metadata, so the distribution of the cited item types (journal paper, conference proceeding, book chapter, etc.) reflects the overall distribution in our collection. However, the distribution in real life might be different if, for example, journal papers are on average cited more often than conference proceedings.&lt;/p>
&lt;p>Similarly, the distribution of the citation styles is most likely different. To generate the reference strings, I used 11 styles distributed uniformly, while the real distribution most likely contains more styles and is skewed.&lt;/p>
&lt;p>All these issues can be summarized as: &lt;strong>the data used in my previous experiment is different from the data our matching algorithms have to deal with in the production system&lt;/strong>. Why is this important? Because in such a case, &lt;strong>the evaluation results do not reflect the real performance in our system&lt;/strong>, just like the child&amp;rsquo;s score on the math exam says nothing about their score on the history test. We can hope my previous results accurately showed the strengths and weaknesses of each algorithm, but the estimations could be far off.&lt;/p>
&lt;blockquote>
&lt;p>So, can we do better? Sure!&lt;/p>
&lt;/blockquote>
&lt;p>This time, instead of automatically-generated reference strings, I will use real reference strings found in the Crossref metadata. This will give us a much better picture of the matching algorithms and their real-life performance.&lt;/p>
&lt;h2 id="evaluation">Evaluation&lt;/h2>
&lt;p>This time the &lt;strong>evaluation dataset is composed of 2,000 unstructured reference strings from the Crossref metadata&lt;/strong>, along with the target true DOIs. The dataset was prepared mostly manually:&lt;/p>
&lt;ol>
&lt;li>First, I drew a random sample of 100,000 metadata records from the system.&lt;/li>
&lt;li>Second, I iterated over all sampled items, and extracted those unstructured reference strings, that do not have the DOI provided by the member.&lt;/li>
&lt;li>Next, I randomly sampled 2,000 reference strings.&lt;/li>
&lt;li>Finally, I assigned a target DOI (or null) to each reference string. This was done by verifying DOIs returned by the algorithms and/or manual searching.&lt;/li>
&lt;/ol>
&lt;p>The metrics this time are based on the citation links. A citation link points from the reference (or the document containing the reference) to the referenced (target) document.&lt;/p>
&lt;p>When we apply a matching algorithm to a set of reference strings in our collection, we get a set of citation links between our documents. I will call those citation links &lt;strong>returned links&lt;/strong>.&lt;/p>
&lt;p>On the other hand, in our collection we have real, &lt;strong>true links&lt;/strong> between the documents. In the best-case scenario, the set of true links and the set of returned links are identical. But we don&amp;rsquo;t live in a perfect world and our matching algorithms make mistakes.&lt;/p>
&lt;p>To measure how close the returned links are to the true links, I used precision, recall and F1. This time they are calculated over all citation links in the dataset. More specifically:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Precision&lt;/strong> is the fraction of the returned links that are correct. Precision answers the question: if I see a citation link A-&amp;gt;B in the output of a matcher, how certain can I be that paper A actually cites paper B?&lt;/li>
&lt;li>&lt;strong>Recall&lt;/strong> is the percentage of true links that were returned by the algorithm. Recall answers the question: if paper A cites paper B and B is in the collection, how certain can I be that the matcher&amp;rsquo;s output contains the citation link A-&amp;gt;B?&lt;/li>
&lt;li>&lt;strong>F1&lt;/strong> is the harmonic mean of precision and recall.&lt;/li>
&lt;/ul>
&lt;p>In the previous experiment, I also used precision, recall and F1, but they were calculated for each target document and then averaged. This time precision, recall and F1 are not averaged but simply calculated over all citation links. This is a more natural approach, since now the dataset comprises isolated reference strings rather than target documents, and in practice each target document has at most one incoming reference.&lt;/p>
&lt;p>I tested the same four approaches as before:&lt;/p>
&lt;ul>
&lt;li>the &lt;strong>legacy approach&lt;/strong>, based on reference parsing&lt;/li>
&lt;li>&lt;strong>SBM with a simple threshold&lt;/strong>, which searches for the reference string in the search engine and returns the first hit, if its relevance score exceeds the predefined threshold&lt;/li>
&lt;li>&lt;strong>SBM with a normalized threshold&lt;/strong>, which searches for the reference string in the search engine and returns the first hit, if its relevance score divided by the string length exceeds the predefined threshold&lt;/li>
&lt;li>&lt;strong>SBMV&lt;/strong>, which first applies SBM with a normalized threshold to select a number of candidate items, and a separate validation procedure is used to select the final target item&lt;/li>
&lt;/ul>
&lt;p>All the thresholds are parameters which have to be set prior to the matching. The thresholds used in the experiments were chosen using a separate dataset, as the values maximizing the F1 of each algorithm.&lt;/p>
&lt;h2 id="results">Results&lt;/h2>
&lt;p>The plot shows the overall results of all tested approaches:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/matching_comparison_real_data.png"
alt="overall comparison of reference matching algorithms on real dataset" width="500px">
&lt;/figure>
&lt;br />
&lt;p>The exact values are also given in the table (the best result for each metric is bolded):&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>precision&lt;/th>
&lt;th>recall&lt;/th>
&lt;th>F1&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>legacy approach&lt;/td>
&lt;td>&lt;strong>0.9895&lt;/strong>&lt;/td>
&lt;td>0.8685&lt;/td>
&lt;td>0.9251&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SBM (simple threshold)&lt;/td>
&lt;td>0.8686&lt;/td>
&lt;td>0.8191&lt;/td>
&lt;td>0.8431&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SBM (normalized threshold)&lt;/td>
&lt;td>0.7712&lt;/td>
&lt;td>0.9121&lt;/td>
&lt;td>0.8358&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SBMV&lt;/td>
&lt;td>0.9809&lt;/td>
&lt;td>&lt;strong>0.9456&lt;/strong>&lt;/td>
&lt;td>&lt;strong>0.9629&lt;/strong>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>As we can see, the legacy approach is the best in precision, slightly outperforming SBMV. In recall, SBMV is clearly the best, which also decided about its victory over the legacy approach in F1.&lt;/p>
&lt;p>How do these results compare to the results from my previous blog post? The overall trends (the legacy approach slightly outperforms SBMV in precision, and SBMV outperforms the legacy approach in recall and F1) are the same. The most important differences are: 1) on the real dataset SBM without validation is worse than the legacy approach, and 2) this time the algorithms achieved much higher recall. These differences are most likely related to the difference in data distributions explained before.&lt;/p>
&lt;h3 id="sbmvs-strengths-and-weaknesses">SBMV&amp;rsquo;s strengths and weaknesses&lt;/h3>
&lt;p>Let&amp;rsquo;s look at a few example cases where SBMV successfully returned the correct DOI, while the legacy approach failed.&lt;/p>
&lt;pre tabindex="0">&lt;code>Lundqvist D, Flykt A, Ohman A: The Karolinska Directed Emotional Faces - KDEF, CD ROM from Department of Clinical Neuroscience, Psychology section, Karolinska Institutet. 1998
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1037/t27732-000" target="_blank">https://doi.org/10.1037/t27732-000&lt;/a>&lt;/p>
&lt;p>The target item is a dataset, which means unusual metadata fields and an unusual reference string.&lt;/p>
&lt;pre tabindex="0">&lt;code>Schminck, A. , ‘The Beginnings and Origins of the “Macedonian” Dynasty’ in J. Burke and R. Scott , eds., Byzantine Macedonia: Identity, Image and History (Melbourne, 2000), 61–8.
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1163/9789004344730_006" target="_blank">https://doi.org/10.1163/9789004344730_006&lt;/a>&lt;/p>
&lt;p>This is an example of a book chapter. The reference string contains special quotes and dash characters.&lt;/p>
&lt;pre tabindex="0">&lt;code>R. Schneider,On the Aleksandrov-Fenchel inequality, inDiscrete Geometry and Convexity (J. E. Goodman, E. Lutwak, J. Malkevitch and R. Pollack, eds.), Annals of the New York Academy of Sciences440 (1985), 132–141.
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1111/j.1749-6632.1985.tb14547.x" target="_blank">https://doi.org/10.1111/j.1749-6632.1985.tb14547.x&lt;/a>&lt;/p>
&lt;p>In this case, spaces are missing in the reference string, which might be problematic for the parsing.&lt;/p>
&lt;pre tabindex="0">&lt;code>R. B. Husar andE. M. Sparrow, Int. J. Heat Mass Transfer11, 1206 (1968).
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1016/0017-9310%2868%2990036-7" target="_blank">https://doi.org/10.1016/0017-9310(68)90036-7&lt;/a>&lt;/p>
&lt;p>This is another example of a reference string with missing spaces.&lt;/p>
&lt;pre tabindex="0">&lt;code>F. Cappello, A. Geist, W. Gropp, S. Kale, B. Kramer, and M. Snir. Toward exascale resilience: 2014 update. Supercomputing frontiers and innovations, 1(1), 2014.
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.14529/jsfi140101" target="_blank">https://doi.org/10.14529/jsfi140101&lt;/a>&lt;/p>
&lt;p>In this case authors are missing in the Crossref metadata.&lt;/p>
&lt;pre tabindex="0">&lt;code>Li KZ, Shen XT, Li HJ, Zhang SY, Feng T, Zhang LL. Ablation of the Carbon/carbon Composite Nozzle-throats in a Small Solid Rocket Motor[J]. Carbon, 2011, 49: 1 208–1 215
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1016/j.carbon.2010.11.037" target="_blank">https://doi.org/10.1016/j.carbon.2010.11.037&lt;/a>&lt;/p>
&lt;p>Here we have unexpected spaces inside page numbers.&lt;/p>
&lt;pre tabindex="0">&lt;code>N. Kaloper, A. Lawrence and L. Sorbo, An Ignoble Approach to Large Field Inflation, JCAP 03 (2011) 023 [ arXiv:1101.0026 ] [ INSPIRE ].
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1088/1475-7516/2011/03/023" target="_blank">https://doi.org/10.1088/1475-7516/2011/03/023&lt;/a>&lt;/p>
&lt;p>In this case we have an acronym of the journal name and additional arXiv id.&lt;/p>
&lt;pre tabindex="0">&lt;code>KrönerE. ?Stress space and strain space continuum mechanics?, Phys. Stat. Sol. (b), 144 (1987) 39?44.
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1002/pssb.2221440104" target="_blank">https://doi.org/10.1002/pssb.2221440104&lt;/a>&lt;/p>
&lt;p>This reference string has a missing space, a missing word in the title, and incorrectly encoded special characters.&lt;/p>
&lt;pre tabindex="0">&lt;code>Suyemoto K. L., (1998) The functions of self-mutilationClinical Psychology Review 18(5): 531–554
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1016/s0272-7358%2897%2900105-0" target="_blank">https://doi.org/10.1016/s0272-7358(97)00105-0&lt;/a>&lt;/p>
&lt;p>In this case the space is missing between the title and the journal name.&lt;/p>
&lt;pre tabindex="0">&lt;code>Ono , N. 2011 Stable and fast update rules for independent vector analysis based on auxiliary function technique Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics 189 192
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1109/aspaa.2011.6082320" target="_blank">https://doi.org/10.1109/aspaa.2011.6082320&lt;/a>&lt;/p>
&lt;p>The parsing can also have problems with missing punctuation, like in this case.&lt;/p>
&lt;pre tabindex="0">&lt;code>Hybertsen M.S., Witzigmann B., Alam M.A., Smith R.K. (2002) 1 113
&lt;/code>&lt;/pre>&lt;p>matched to &lt;a href="https://doi.org/10.1023/a:1020732215449" target="_blank">https://doi.org/10.1023/a:1020732215449&lt;/a>&lt;/p>
&lt;p>In this case both title and journal name are missing from the reference string.&lt;/p>
&lt;p>We can see from these examples that SBMV is fairly robust and able to deal with a small amount of noise in the metadata and reference strings.&lt;/p>
&lt;p>What about the errors SBMV made? From the perspective of citation links, we have two types of errors:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>False positives&lt;/strong>: incorrect links returned by the algorithm.&lt;/li>
&lt;li>&lt;strong>False negatives&lt;/strong>: links that should have been returned but weren&amp;rsquo;t.&lt;/li>
&lt;/ul>
&lt;p>When we apply SBMV instead of the legacy approach, the fraction of false positives within the returned links increases from 1.05% to 1.91%, and the fraction of false negatives within the true links decreases from 13.15% to 5.44%. This means with SBMV:&lt;/p>
&lt;ul>
&lt;li>1.91% of the links in the algorithm&amp;rsquo;s output are incorrect&lt;/li>
&lt;li>5.44% of the true links are not returned by the algorithm&lt;/li>
&lt;/ul>
&lt;p>We can also classify all the references in the dataset into several categories, based on the values of true and returned DOIs:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/matching_references_errors.png"
alt="references errors distribution" width="800px">
&lt;/figure>
&lt;p>We have the following categories:&lt;/p>
&lt;ul>
&lt;li>References matched to correct DOIs (1129 cases, returned and true blue)&lt;/li>
&lt;li>References correctly not matched to anything (791 cases, returned and true white)&lt;/li>
&lt;li>References not matched to anything, when they should be (58 cases, returned white, true grey)&lt;/li>
&lt;li>References matched to wrong DOIs (7 cases, returned red, true yellow)&lt;/li>
&lt;li>References matched to something, when they shouldn&amp;rsquo;t be matched to anything (15 cases, returned black, true white)&lt;/li>
&lt;/ul>
&lt;p>Note that in terms of these categories, precision is equal to:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/matching_precision.png"
alt="precision" width="200px">
&lt;/figure>
&lt;p>And recall is equal to:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/matching_recall.png"
alt="recall" width="200px">
&lt;/figure>
&lt;p>What are the most common causes of SBMV&amp;rsquo;s errors?&lt;/p>
&lt;ul>
&lt;li>Incomplete or incorrect Crossref metadata. Even a perfect reference string formatted in the most popular citation style will not be matched, if the target record in the Crossref collection has many missing or incorrect fields.&lt;/li>
&lt;li>Similarly, missing or incorrect information in the reference string is very problematic for the matchers.&lt;/li>
&lt;li>Errors/noise in the reference string, such as:
&lt;ul>
&lt;li>HTML/XML markup not stripped from the string&lt;/li>
&lt;li>multiple references mixed in one string&lt;/li>
&lt;li>spacing issues and typos&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>In a few cases a document related to the real target was matched, such as the book instead of its chapter, or the conference proceedings paper instead of the thesis.&lt;/li>
&lt;/ul>
&lt;h2 id="limitations">Limitations&lt;/h2>
&lt;p>The most important limitation is the size of the dataset. Every item had to be verified manually, which significantly limited the possibility of creating a large set and also using a lot of independent sets.&lt;/p>
&lt;p>Finally, the numbers reported here still don&amp;rsquo;t reflect the overall precision and recall of the current links in the Crossref metadata. This is because:&lt;/p>
&lt;ol>
&lt;li>we still use the legacy approach for matching,&lt;/li>
&lt;li>some references are deposited along with the target DOIs and are not matched by Crossref, these links are not analyzed here, and&lt;/li>
&lt;li>in Crossref we have both unstructured and structured references, and in this experiment only the unstructured ones were tested.&lt;/li>
&lt;/ol>
&lt;h2 id="whats-next">What&amp;rsquo;s next?&lt;/h2>
&lt;p>The next experiment will be related to the structured references. Similarly as here, I will try to estimate the performance of the search-based matching approach and compare it to the performance of the legacy approach.&lt;/p>
&lt;p>The evaluation framework, evaluation data and experiments related to the reference matching are available in the repository &lt;a href="https://github.com/CrossRef/reference-matching-evaluation" target="_blank">https://github.com/CrossRef/reference-matching-evaluation&lt;/a>. Future experiments will be added there as well.&lt;/p>
&lt;p>&lt;a href="https://github.com/CrossRef/reference-matching-evaluation" target="_blank">https://github.com/CrossRef/reference-matching-evaluation&lt;/a> also contains the Python implementation of the SBMV algorithm. The Java implementation of SBMV is available in the repository &lt;a href="https://gitlab.com/crossref/search_based_reference_matcher" target="_blank">https://gitlab.com/crossref/search_based_reference_matcher&lt;/a>.&lt;/p></description></item><item><title>Phew - its been quite a year</title><link>https://www.crossref.org/blog/phew-its-been-quite-a-year/</link><pubDate>Thu, 13 Dec 2018 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/phew-its-been-quite-a-year/</guid><description>&lt;p>As the end of the year approaches it’s useful to look back and reflect on what we’ve achieved over the last 12 months—a lot! To be honest, there were some things we didn’t get done—or didn’t make as much progress with as we hoped—but that happens when you have an ambitious agenda. However, we also got some things done that we didn’t expect to or that weren’t even on our radar at the end of 2017—this is inevitable as the research and scholarly communications landscape is rapidly changing.&lt;/p>
&lt;p>In my &lt;a href="https://doi.org/10.64000/hnk6j-p5q04" target="_blank">blog post&lt;/a> from the beginning of the year, the key projects I highlighted were &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus&lt;/a>, &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a>, &lt;a href="https://doi.org/10.64000/e84m9-x0652" target="_blank">organisation IDs&lt;/a>, &lt;a href="https://doi.org/10.64000/5cfh1-1wa10" target="_blank">Grant IDs&lt;/a> and &lt;a href="http://www.metadata2020.org" target="_blank">Metadata 2020&lt;/a>, and that richer metadata and more record types were key goals. We did make very good progress on all of these projects as reported below.&lt;/p>
&lt;p>For 2018 we were operating in the framework of the four strategic themes, or areas of focus, developed by the board and staff. These are: 1) Simplifying and enriching our services; 2) Improving our metadata; 3) Expanding constituencies, and 4) Selectively collaborating and partnering. These themes will also be guiding us in 2019.&lt;/p>
&lt;h2 id="simplifying-and-enriching-our-services">Simplifying and enriching our services&lt;/h2>
&lt;h3 id="upgrading-our-tools">Upgrading our tools&lt;/h3>
&lt;p>Over the past year, we’ve been busy streamlining our processes, developing new tools and adding new services. A key new tool is &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a> which supports the &lt;a href="https://www.crossref.org/services/content-registration/">Content Registration service&lt;/a> by offering a simpler, more user-friendly, non-technical way to register and update metadata. It provides lots of context-sensitive help, registers content immediately, in real time, and provides guidance on how to make corrections—thereby ensuring each deposit is successful. Metadata Manager currently supports journal deposits (we would have liked to add more in 2018) but we will be adding other record types in 2019.&lt;/p>
&lt;h3 id="upgrading-our-services">Upgrading our services&lt;/h3>
&lt;p>&lt;a href="https://www.crossref.org/services/metadata-retrieval/">Crossref metadata&lt;/a> has always been open through a number of interfaces without restriction, but this year we introduced an option for extra support and functionality, through &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus&lt;/a>. Metadata Plus provides guaranteed uptime, snapshots of the complete set of metadata and enhanced support for organisations (members or not) that want to use Crossref metadata in their own services and systems.&lt;/p>
&lt;h3 id="improving-the-member-experience-new-membership-terms">Improving the member experience: New membership terms&lt;/h3>
&lt;p>This year we began to redesign the member experience and have made a lot of improvements to the sign-up and onboarding process, the most significant of which is the new click-through membership terms, introduced in July for new members and coming into effect for existing members in March 2019, which is proving to be a huge time saver for both our members and our team.&lt;/p>
&lt;h2 id="improving-our-metadata">Improving our metadata&lt;/h2>
&lt;p>Our objective this year was to better communicate what metadata best practice is, to equip our members with all the data and tools they need to meet this best practice, and to achieve closer cooperation from service providers.&lt;/p>
&lt;h3 id="best-practice-tools-participation-reports">Best practice tools: Participation Reports&lt;/h3>
&lt;p>Released in Beta in August this year, &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> provides a dashboard that gives a clear picture of the metadata that each member provides. This is a useful visualization of metadata that has long been available via our public REST API. Members can see where the gaps in the metadata are and get information on how to fill those gaps.&lt;/p>
&lt;h3 id="communicating-metadata-best-practice-data-citations">Communicating metadata best practice: Data Citations&lt;/h3>
&lt;p>&lt;a href="https://doi.org/10.64000/ae1q9-mtq08" target="_blank">The importance of linking data&lt;/a> with literature can’t be understated. Research integrity and reproducibility depend on it. We&amp;rsquo;re committed to exposing the links between the literature and the data or software that supports it, and earlier this year we partnered with &lt;a href="https://www.datacite.org/" target="_blank">DataCite&lt;/a> to make this a reality. All the data citations coming in from Crossref and DataCite are being pulled into Event Data.&lt;/p>
&lt;h3 id="equipping-members-with-all-the-data-event-data">Equipping members with all the data: Event Data&lt;/h3>
&lt;p>&lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> reached technical readiness. Event Data captures and records “events” such as comments, links, shares, bookmarks, and references. It provides open, transparent, and traceable information about the provenance and context of every event.&lt;/p>
&lt;h2 id="expand-constituencies">Expand constituencies&lt;/h2>
&lt;p>Crossref currently has 15,000 members in 140 countries. With that comes the need to increasingly and proactively work with emerging markets as they start to share research outputs globally.&lt;/p>
&lt;h3 id="ambassador-program">Ambassador program&lt;/h3>
&lt;p>&lt;a href="https://www.crossref.org/community/ambassadors/">The Crossref Ambassador program&lt;/a> launched in January and now has a team of 16 trusted contacts who work within our communities (as librarians, researchers, publishers, and innovators) around the world. They share great enthusiasm and belief in our work. We provide them with training and support, and they help us improve education about global research infrastructure in general and the opportunities that are enabled through richer metadata.&lt;/p>
&lt;h3 id="funders-and-grant-identifiers">Funders and grant identifiers&lt;/h3>
&lt;p>I’m very happy to report that the Crossref board approved grants as a new record/resource type to be rolled out in 2019 - we made faster progress on this than expected. The proposal for grant identifiers was developed by staff in collaboration with the &lt;a href="https://www.crossref.org/working-groups/funders/">Crossref Funder Advisory Group&lt;/a> and the &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees Committee&lt;/a>. This means that funders will be joining Crossref and registering a standard set of metadata and a persistence identifier - a DOI - for their grants.&lt;/p>
&lt;h2 id="collaborate-and-partner">Collaborate and partner&lt;/h2>
&lt;p>So that our alliances with others have the greatest impact, we have aligned our strategic plans for scholarly infrastructure with others. Some of these alliances are led or driven by Crossref and with others we are involved but not leading.&lt;/p>
&lt;h3 id="ror">ROR&lt;/h3>
&lt;p>We are working with the &lt;a href="https://www.cdlib.org/" target="_blank">California Digital Library&lt;/a>, &lt;a href="https://www.datacite.org/" target="_blank">DataCite&lt;/a> and &lt;a href="https://www.digital-science.com/" target="_blank">Digital Science&lt;/a> as the Steering group for &lt;a href="https://ror.org/" target="_blank">ROR&lt;/a> - the Research Organisation Registry - which is a new, community-led project that is developing an open, sustainable, usable, and unique identifier for research organisations based on the work done by the &lt;a href="https://orcid.org/content/organisation-identifier-working-group" target="_blank">organisation Identifier Working Group&lt;/a> in 2017 and 2018.&lt;/p>
&lt;h3 id="metadata-2020">Metadata 2020&lt;/h3>
&lt;p>&lt;a href="http://www.metadata2020.org" target="_blank">Metadata 2020&lt;/a> is a collaboration that advocates richer, connected, and reusable, open metadata for all research outputs, which will advance scholarly pursuits for the benefit of society. Over 140 volunteers—including publishers, librarians, researchers, platforms/tools, and other stakeholders—from 86 organisations, are working in six project groups. The projects are very strategically focused, looking at key issues like researcher communications, incentives, and shared best practices.&lt;/p>
&lt;p>I can’t close off the year without mentioning the incredible milestone we reached this September when &lt;a href="https://doi.org/10.64000/c8tcs-9vm83" target="_blank">the 100th million content item was registered&lt;/a> with Crossref. This was made possible by our members’ and the wider community’s commitment and contribution, so once again, thank you.&lt;/p>
&lt;blockquote>
&lt;p>Roll on 2019!&lt;/p>
&lt;/blockquote>
&lt;hr></description></item><item><title>Newly approved membership terms will replace existing agreement</title><link>https://www.crossref.org/blog/newly-approved-membership-terms-will-replace-existing-agreement/</link><pubDate>Wed, 05 Dec 2018 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/newly-approved-membership-terms-will-replace-existing-agreement/</guid><description>&lt;p>In its July 2018 meeting, the Crossref &lt;a href="https://www.crossref.org/board-and-governance">Board&lt;/a> voted unanimously to approve and introduce a &lt;a href="https://www.crossref.org/membership/terms">new set of membership terms&lt;/a>. At the same meeting, the board also voted to change the description of membership eligibility in our &lt;a href="https://www.crossref.org/board-and-governance/bylaws">Bylaws&lt;/a>, officially broadening our remit beyond publishers, in line with current practice and positioning us for future growth.&lt;/p>
&lt;h2 id="tldr">Tl;dr&lt;/h2>
&lt;p>It’s a very good thing to have clearer terms; we want everyone to understand what Crossref is about and what you’re getting into. It’s a material change so we will be notifying members by direct email in December. Nobody needs to sign anything as the new terms are not signed, but are click-through acceptances on application, and that process is already in effect for new applicants. The new terms come into effect on 1st March 2019 for existing members and no action is needed.&lt;/p>
&lt;ul>
&lt;li>If you&amp;rsquo;re a &lt;strong>sponsored member&lt;/strong> you&amp;rsquo;ll have a slightly adapted message soon as we work with your sponsor.&lt;/li>
&lt;li>If you&amp;rsquo;re an NGO or US State Actor you will receive a slightly adapted message.&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>This post is for background explanation and information. We will email existing members directly, but no acceptance or signature&amp;mdash;nor any action&amp;mdash;will be needed.&lt;/span>
&lt;/div>
&lt;h2 id="why-are-we-updating-the-terms">Why are we updating the terms?&lt;/h2>
&lt;p>Being almost 20 years old the &lt;a href="https://www.crossref.org/membership/2018-agreement">old agreement&lt;/a> is out-of-date with current practice and technology, and has become quite long and confusing, especially for applicants for whom English is not their first language. Specific reasons include:&lt;/p>
&lt;h3 id="1-to-improve-efficiency">1. To improve efficiency&lt;/h3>
&lt;p>Over the years we’ve had feedback that our application process is too long and involved. The membership agreement used to be signed manually by each new Crossref member, often days after they applied. We also now process around 180 new members each month which is too many for a wholly manual process managed by just one person.&lt;/p>
&lt;h3 id="2-to-clarify-the-wording">2. To clarify the wording&lt;/h3>
&lt;p>People would tell us that the agreement is too long and confusing, especially when English is not their first language. There are often questions about the “legalese” style of language that takes up too much time in back-and-forth discussions to ensure everyone has understood. Also, the main structure of the agreement has been in place for over a decade and needs updating to avoid confusion and to align with up-to-date language, services, technologies, and current practices.&lt;/p>
&lt;h3 id="3-to-emphasize-the-community-aspect-and-our-members-obligations">3. To emphasize the community aspect and our members’ obligations&lt;/h3>
&lt;p>It is quite a commitment to participate fully in Crossref, and we want people to understand up-front what their obligations are as part of the collective membership. And also to realize what value they are receiving as well as contributing to other members. We needed clearer terms so that every organisation can understand what they are getting into.&lt;/p>
&lt;p>Additionally, moving from signing contracts to click-through acceptance of standard terms emphasizes that Crossref is not a service provider or vendor. We are a not-for-profit community organisation. We don’t have the resources to negotiate and keep track of individual custom agreements.&lt;/p>
&lt;h2 id="whats-changing-step-by-step">What’s changing, step-by-step&lt;/h2>
&lt;p>We consulted with former and current legal counsel, the &lt;a href="https://www.crossref.org/committees/membership-and-fees">Membership &amp;amp; Fees Committee&lt;/a>, and also with the M&amp;amp;F organisations individually. We have also absorbed a lot of feedback from many other members of all kinds and sizes.&lt;/p>
&lt;h3 id="for-new-members">For new members&lt;/h3>
&lt;p>The manually-signed membership agreement has already&amp;mdash;for new members&amp;mdash; been turned into a set of click-through terms that organisations agree to as part of the initial application process. It is no longer a separate document that needs to be signed or countersigned. This will simplify the application process for both new applicants and our staff.&lt;/p>
&lt;h3 id="for-existing-members">For existing members&lt;/h3>
&lt;p>The new membership terms will come into effect for existing members on March 1st, 2019. Because this is a material change to the terms, we will be emailing members with more information but it’s important to note that no action is necessary from existing members. The new terms will replace the old terms automatically.&lt;/p>
&lt;p>The table below sets out clause-by-clause the precise changes. Here is the &lt;a href="https://www.crossref.org/membership/2018-agreement/">2018 membership agreement&lt;/a> and the &lt;a href="https://www.crossref.org/membership/terms">new terms in full&lt;/a>.&lt;/p>
&lt;h3 id="the-nitty-gritty-details">The nitty-gritty details&lt;/h3>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Topic&lt;/th>
&lt;th>New section&lt;/th>
&lt;th>Old section&lt;/th>
&lt;th>Summary of change(s)&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>Overall&lt;/td>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;td>Eliminates legalese in favor of plain English. Updates defined terms to current usage. Shifts from execution by signature to acceptance by affirmative action.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Introduction&lt;/td>
&lt;td>Background&lt;/td>
&lt;td>1&lt;/td>
&lt;td>Updates description of Crossref’s activities to be current. Provides for a new applicant’s acceptance of Terms upon acceptance of application by Crossref and payment of first annual fee.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Members’ rights&lt;/td>
&lt;td>1&lt;/td>
&lt;td>2(a)&lt;/td>
&lt;td>Streamlines wording; eliminates reference to right to recommend working committee members.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Members’ obligations&lt;/td>
&lt;td>2&lt;/td>
&lt;td>2(b)&lt;/td>
&lt;td>Significant revision. Old 2(b) mentioned only payment of fees and appointment of a contact person. New Sec. 2 aims to capture all of a Member’s operational obligations in one place.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Metadata deposits&lt;/td>
&lt;td>2(a), (b)&lt;/td>
&lt;td>3(a)(i)&lt;/td>
&lt;td>Updates language regarding metadata deposits to current terminology and practice.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Rights to content&lt;/td>
&lt;td>2(c)&lt;/td>
&lt;td>15&lt;/td>
&lt;td>Streamlines wording.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Registering identifiers&lt;/td>
&lt;td>2(d)&lt;/td>
&lt;td>3(a)ii)&lt;/td>
&lt;td>Streamlines the language around registering identifiers.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Linking&lt;/td>
&lt;td>2(e)&lt;/td>
&lt;td>3(a)(iii)&lt;/td>
&lt;td>States, in clearer language, the obligation to embed identifiers.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Reference linking&lt;/td>
&lt;td>2(f)&lt;/td>
&lt;td>3(a)(iv)&lt;/td>
&lt;td>Eliminates outdated provision on Cross-Linking; replaces with a best efforts covenant to engage in Reference Linking.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Display identifiers&lt;/td>
&lt;td>2(g)&lt;/td>
&lt;td>N/A&lt;/td>
&lt;td>Adds an obligation to comply with Crossref’s display guidelines and ensure each identifier is hyperlinked to be citable.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Maintaining and updating metadata&lt;/td>
&lt;td>2(h)&lt;/td>
&lt;td>3(b)&lt;/td>
&lt;td>Streamlines language. Adds obligation to maintain the URL and the accuracy of identifier data. Adds common examples of failure to maintain and update metadata.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Archiving&lt;/td>
&lt;td>2(i)&lt;/td>
&lt;td>3(d)&lt;/td>
&lt;td>Adds link to examples of third-party archive providers. Adds option for Crossref to point to a “defunct DOI” page. Inserts best efforts obligation to contract with a third-party archive.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Content-specific obligations&lt;/td>
&lt;td>2(j)&lt;/td>
&lt;td>N/A&lt;/td>
&lt;td>Adds reference to Crossref’s record type rules and obligation to comply.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Fees&lt;/td>
&lt;td>3&lt;/td>
&lt;td>2(b)&lt;/td>
&lt;td>Old agreement referred generally to “all membership dues and any charges or fees as established by the Board from time to time and set forth on the PILA Site.” New Section 3 aims to summarize the categories of fees associated with membership, including a reference to service fees for optional services if and when elected by the Member. Adds Member obligation to cover wire transfer fees/other payment costs.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>General license&lt;/td>
&lt;td>4(a)&lt;/td>
&lt;td>4&lt;/td>
&lt;td>Clarifies that the license grant covers only metadata and identifiers “corresponding to such Member’s Content.”&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Metadata rights &amp;amp; limitations&lt;/td>
&lt;td>4(b)&lt;/td>
&lt;td>5&lt;/td>
&lt;td>Significantly streamlines wording.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Crossref’s IP&lt;/td>
&lt;td>4(c)&lt;/td>
&lt;td>6&lt;/td>
&lt;td>Significantly streamlines wording.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Distribution of metadata&lt;/td>
&lt;td>5&lt;/td>
&lt;td>9(b)&lt;/td>
&lt;td>Updates language regarding Crossref’s rights to distribute Metadata. Adds an explicit carveout for a Member’s reference distribution preference.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>N/A&lt;/td>
&lt;td>7, 8, 9(a)&lt;/td>
&lt;td>Deletes extensive provision relating to obsolete “Clean-Up” and “Reverse Look-Up” services. Deletes provisions relating to obsolete “caching and transfer” activities, and local hosting.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Use of marks&lt;/td>
&lt;td>6&lt;/td>
&lt;td>10&lt;/td>
&lt;td>Substantially rewritten, including to reflect Crossref’s more permissive approach to use of its logo.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Maintenance of the Crossref Infrastructure&lt;/td>
&lt;td>7&lt;/td>
&lt;td>[No analog.]&lt;/td>
&lt;td>Adds covenant of Crossref to maintain the Crossref Infrastructure.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Term&lt;/td>
&lt;td>8&lt;/td>
&lt;td>11&lt;/td>
&lt;td>Eliminates the concept of automatically renewing 12-month terms. Replaces with a perpetual term that continues until superseded by an amended version.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Termination of membership&lt;/td>
&lt;td>9(a)&lt;/td>
&lt;td>11&lt;/td>
&lt;td>Provides for termination by the member upon written notice, rather than 90 days’ written notice, to align with the Bylaws. Adds a for-cause termination right by the Member, and corresponding right to receive a refund of fees. Sets out certain bases for termination of membership by Crossref, consistent with the Bylaws.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Appeal rights&lt;/td>
&lt;td>9(b)&lt;/td>
&lt;td>13&lt;/td>
&lt;td>No material change.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Effect of termination of membership&lt;/td>
&lt;td>9(c)&lt;/td>
&lt;td>12&lt;/td>
&lt;td>Adds refund right for for-cause terminations.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Enforcement&lt;/td>
&lt;td>10&lt;/td>
&lt;td>13&lt;/td>
&lt;td>Replaces “Crossref has the right but not the obligation to enforce the terms of this Agreement …” with “Crossref shall take reasonable steps to enforce these Terms … .”&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Governing law; venue&lt;/td>
&lt;td>11&lt;/td>
&lt;td>14(a)&lt;/td>
&lt;td>Keeps New York as choice of law, but moves forum to Boston, nearer to Crossref’s US location.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Disputes&lt;/td>
&lt;td>12&lt;/td>
&lt;td>14(b)&lt;/td>
&lt;td>No material change (but note venue provision moved to 11(a)).&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>N/A&lt;/td>
&lt;td>15&lt;/td>
&lt;td>Eliminates mutual “warranty” provision; addresses rights to content and anti-infringement under other provisions.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Indemnification&lt;/td>
&lt;td>13&lt;/td>
&lt;td>16&lt;/td>
&lt;td>Removes concept that Member is indemnifying other Crossref Members. Streamlines and cleans up the indemnity language.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Limitation of Liability&lt;/td>
&lt;td>14&lt;/td>
&lt;td>17&lt;/td>
&lt;td>Adds explicit reference to the Crossref Infrastructure.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Assignment&lt;/td>
&lt;td>16(c)&lt;/td>
&lt;td>22&lt;/td>
&lt;td>Removed language providing that Crossref’s consent to assignment of the Terms shall not be unreasonably delayed or conditioned.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Amendment&lt;/td>
&lt;td>18&lt;/td>
&lt;td>2(c)&lt;/td>
&lt;td>Old: “The Board shall have the power to modify the terms of this Agreement by publishing amended versions that will automatically supersede prior versions … . PILA will use its reasonable discretion in deciding if a modification is material, and if so will provide written notice” to the Member of the material changes. New: “These Terms may be amended by Crossref, via updated Terms posted on the Website and emailed to each Member not less than sixty (60) days prior to effectiveness. By using the Crossref Infrastructure after the effective date of any such amendment hereto, the Member accepts the amended Terms.”&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Data privacy&lt;/td>
&lt;td>19&lt;/td>
&lt;td>N/A&lt;/td>
&lt;td>Adds a GDPR-compliant privacy provision; adds a linked reference to Crossref’s new Privacy Policy.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Compliance&lt;/td>
&lt;td>20&lt;/td>
&lt;td>N/A&lt;/td>
&lt;td>Adds a mutual compliance covenant and an OFAC/sanctions representation.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>Various legal “boilerplate” terms (taxes, waiver, independent contractor&lt;/td>
&lt;td>15-17&lt;/td>
&lt;td>18-28&lt;/td>
&lt;td>Streamlined; replaced with more contemporary formulations; eliminated some excess verbiage.&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;h2 id="thanks-for-reading-this-far">Thanks for reading this far!&lt;/h2>
&lt;p>Please contact our &lt;a href="mailto:member@crossref.org">member experience team&lt;/a> with any questions.&lt;/p></description></item><item><title>Updates to our by-laws</title><link>https://www.crossref.org/blog/updates-to-our-by-laws/</link><pubDate>Thu, 29 Nov 2018 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/updates-to-our-by-laws/</guid><description>&lt;p>Good governance is important and something that Crossref thinks about regularly so the board frequently discusses the topic, and this year even more so. At the November 2017 meeting there was a motion passed to create an ad-hoc Governance Committee to develop a set of governance-related questions/recommendations. The Committee has met regularly this year and the following questions are under deliberation regarding term limits, role of the Nominating Committee, implications of contested elections, and more.&lt;/p>
&lt;p>The full motion to create the committee is:&lt;/p>
&lt;blockquote>
&lt;p>The ad hoc Governance Committee should discuss and make specific recommendations (including where necessary proposing appropriate by-law amendments) about (i) the timing of the annual election of members and whether the newly elected Board can take office a fixed period after the election results are finalized; (ii) the role and responsibilities of the Nominating Committee and its relationship to the Board; (iii) the implications of having contested Board elections; (iv) the election of officers, Executive Committee members, and committee chairs, and v) options and required changes for board members to represent specific constituencies (e.g. based on membership types).&lt;/p>
&lt;/blockquote>
&lt;p>The Governance Committee members are:&lt;/p>
&lt;ul>
&lt;li>Paul Peters (Hindawi and Board Chair)&lt;/li>
&lt;li>Scott Delman (ACM and Board Treasurer)&lt;/li>
&lt;li>Chris Shillum (Elsevier and Executive Committee member)&lt;/li>
&lt;li>Mark Patterson (eLife)&lt;/li>
&lt;li>Ed Pentz (Crossref Executive Director)&lt;/li>
&lt;li>Lisa Hart (Crossref Finance &amp;amp; Operations Director)&lt;/li>
&lt;li>Emily Cooke (Pierce Atwood, legal counsel).&lt;/li>
&lt;/ul>
&lt;p>The committee’s goal was to try to maintain and increase transparency; consider practicality and impact of any changes and ensure continuity and balance.&lt;/p>
&lt;p>At the March meeting the committee provided an overview of the issues they had discussed. There was consensus to accept the committee’s recommendation to address all of the governance matters comprehensively at the July 2018 meeting.&lt;/p>
&lt;p>Discussions resulted in two changes to our &lt;a href="https://www.crossref.org/board-and-governance/bylaws/">by-laws&lt;/a>:&lt;/p>
&lt;h2 id="1-membership-eligibility">1. Membership eligibility&lt;/h2>
&lt;p>To provide clarity around membership qualification, we resolved to amend Article I Section 1 by replacing the text in its entirety with the following text:&lt;/p>
&lt;blockquote>
&lt;p>Membership in Crossref shall be open to any organisation that publishes professional and scholarly materials and content and otherwise meets the terms and conditions of membership established from time to time by the Board of Directors, and to such other entities as the Board of Directors shall determine from time to time.&lt;/p>
&lt;/blockquote>
&lt;h2 id="2-start-date-of-board-terms">2. Start date of board terms&lt;/h2>
&lt;p>We also resolved to amend Article V Section 4 to replace the phrase “on the day after” with the phrase “during the next calendar quarter immediately following”. This allows the Board to meet directly ahead of Crossref’s Annual Meeting and Board election (from 2019) instead of directly after.&lt;/p>
&lt;p>The first change captures the fact that we have a very broad community beyond what is seen as traditional publishers, who themselves do not solely identify as publishers anymore. It reflects how our membership has evolved, and also includes organisations that publish that aren’t publishers (universities, government agencies, etc.)&lt;/p>
&lt;p>The second change was a practical one. As Crossref had its first contested election in 2017, and in 2018 as well, it seemed unreasonable to have a brand new Board meet the day after the election, especially when there is potential for officers to not be re-elected. The old by-laws were very specific about holding the Board meeting the day after the election. With this change, starting with the March meeting, the new Board will have a full calendar year of meetings, which seems more practical, and we will establish a process for the election of officers.&lt;/p>
&lt;p>During that meeting we deliberated the following questions/recommendations raised by the committee:&lt;/p>
&lt;ul>
&lt;li>Development of a policy on canvassing/campaigning by candidates in Board elections;&lt;/li>
&lt;li>Development of policies on nominations to the positions of Chair, Treasurer, Executive Committee members, the Nominating Committee Chair, and the Audit Committee Chair;&lt;/li>
&lt;li>Analysis of how best to achieve balance and representation on the Board going forward (designated seats and/or a binding Board remit to the Nominating Committee);&lt;/li>
&lt;li>Analysis as to whether to impose term limits on board members;&lt;/li>
&lt;li>Analysis as to how best to handle independent nominations to the Board (eliminate the option, or improve the process); and&lt;/li>
&lt;li>Review of our governing documents’ provisions on vacancies, to confirm that the Board follows the required steps on the filling of vacancies.&lt;/li>
&lt;/ul>
&lt;p>At the November 2018 Board meeting&amp;mdash;following Crossref LIVE18&amp;mdash;there were two more amendments to the bylaws:&lt;/p>
&lt;h2 id="3-removal-of-independant-nominations">3. Removal of independant nominations&lt;/h2>
&lt;p>To remove Art. V II Sec. 3 on independent nominations. This change reflects the consensus that there is no need for independent nominations with the introduction of contested elections.&lt;/p>
&lt;h2 id="4-membership-start-date-of-record">4. Membership start date of record&lt;/h2>
&lt;p>To amend Art. I Sec. 2 to amend language dealing with record date of membership. This is a practical change following the July 2018 introduction of &lt;a href="https://www.crossref.org/membership/terms">new membership terms&lt;/a> which are click-through online terms and don&amp;rsquo;t need counter-signatures.&lt;/p>
&lt;p>The new Board will resume the discussion on designated seats at our March 2019 meeting.&lt;/p></description></item><item><title>Data Citation: what and how for publishers</title><link>https://www.crossref.org/blog/data-citation-what-and-how-for-publishers/</link><pubDate>Fri, 23 Nov 2018 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/data-citation-what-and-how-for-publishers/</guid><description>&lt;p>We’ve mentioned &lt;a href="https://doi.org/10.64000/ae1q9-mtq08" target="_blank">why data citation is important to the research community&lt;/a>. Now it’s time to roll up our sleeves and get into the ‘how’. This part is important, as citing data in a standard way helps those citations be recognised, tracked, and used in a host of different services.&lt;/p>
&lt;p>This week &lt;a href="https://doi.org/10.1038/sdata.2018.259" target="_blank">A Data Citation Roadmap for Scientific Publishers&lt;/a> was published in &lt;a href="http://www.nature.com/scientificdata" target="_blank">Scientific Data&lt;/a>. This roadmap is the outcome of a collaboration between different publishers that worked on identifying all steps you need to take as a publisher to implement data citation. If you want to know more about establishing a data policy, capturing data citations at the point of submission, or tagging data citations in your XML, we recommend you take a look at this article!&lt;/p>
&lt;p>In this blog post, we’ll discuss the steps you need to take after you’ve implemented this roadmap. The steps in the roadmap describe how you can track &amp;amp; tag data citation yourself. Here we describe how Crossref can help you make these available to the rest of the community.&lt;/p>
&lt;h2 id="the-what">The &amp;lsquo;what&amp;rsquo;&lt;/h2>
&lt;p>Here’s the recap! From the Crossref perspective, there are two ways to add data citation links into the metadata that you register:&lt;/p>
&lt;h3 id="1-metadata-deposits-using-the-references-section-of-the-schema">1. Metadata deposits using the references section of the schema&lt;/h3>
&lt;p>This is where ‘citations’ are normally recorded. Publishers include the data citation into the deposit of bibliographic references for each publication.&lt;/p>
&lt;p>Publishers can deposit the full data or software citation as a unstructured reference. For guidance here, we recommend that authors cite the dataset or software based on community best practice (&lt;a href="https://www.force11.org/group/joint-declaration-data-citation-principles-final" target="_blank">Joint Declaration of Data Citation Principles&lt;/a>, &lt;a href="https://www.force11.org/node/4771" target="_blank">FORCE11 citation placement&lt;/a>, &lt;a href="https://www.force11.org/software-citation-principles" target="_blank">FORCE11 Software Citation Principles&lt;/a>).&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;citation&lt;/span> &lt;span class="na">key=&lt;/span>&lt;span class="s">&amp;#34;ref=3&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;unstructured_citation&amp;gt;&lt;/span>Morinha F, Dávila JA, Estela B, Cabral JA, Frías Ó, González JL, Travassos P, Carvalho D, Milá B, Blanco G (2017) Data from: Extreme genetic structure in a social bird species despite high dispersal capacity. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.684v0&lt;span class="err">&amp;lt;&lt;/span>/unstructured_citation\&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/citation&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/citation_list&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Or they can employ any number of &lt;a href="https://support.crossref.org/hc/en-us/articles/215578403-Adding-references-to-your-metadata-record" target="_blank">reference tags&lt;/a> currently accepted by Crossref.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;citation&lt;/span> &lt;span class="na">key=&lt;/span>&lt;span class="s">&amp;#34;ref2&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;doi&amp;gt;&lt;/span>10.5061/dryad.684v0&lt;span class="nt">&amp;lt;/doi&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;cYear&amp;gt;&lt;/span>2017&lt;span class="nt">&amp;lt;/cYear&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;author&amp;gt;&lt;/span>Morinha F, Dávila JA, Estela B, Cabral JA, Frías Ó, González JL, Travassos P, Carvalho D, Milá B, Blanco G&lt;span class="nt">&amp;lt;/author&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/citation&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>We are exploring &lt;a href="http://jats4r.org/data-citations" target="_blank">JATS4R recommendations&lt;/a> to expand the current collection and better support these citations - more on this soon. We also encourage additional suggestions from the community.&lt;/p>
&lt;h3 id="2-metadata-deposits-using-the-relations-section-of-the-schema">2. Metadata deposits using the relations section of the schema&lt;/h3>
&lt;p>This is where other relationships can be recorded. Publishers assert the data link in the &lt;a href="https://support.crossref.org/hc/en-us/articles/214357426-Relationships-between-DOIs-and-other-objects" target="_blank">relationship section&lt;/a> of the metadata deposit. Here, publishers can identify data which are direct outputs of the research results if this is known. This level of specificity is optional, but we’d recommend it as it can support scientific validation and research funding management.&lt;/p>
&lt;p>Data and software citations via relation type enables precise tagging of the dataset and its specific relationship to the research results published. To tag the data &amp;amp; software citation in the metadata deposit, we ask for the description of the dataset &amp;amp; software (optional), dataset &amp;amp; software identifier and identifier type (DOI, PMID, PMCID, PURL, ARK, Handle, UUID, ECLI, and URI), and &lt;a href="http://data.crossref.org/reports/help/schema_doc/4.3.5/NO_NAMESPACE.html#inter_work_relation_relationship-type" target="_blank">relationship type&lt;/a>.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-XML" data-lang="XML">&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;program&lt;/span> &lt;span class="na">xmlns=&lt;/span>&lt;span class="s">&amp;#34;http://www.crossref.org/relations.xsd&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;related_item&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;description&amp;gt;&lt;/span>Data from: Extreme genetic structure in a social bird species despite high dispersal capacity&lt;span class="nt">&amp;lt;/description&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;inter_work_relation&lt;/span> &lt;span class="na">relationship-type=&lt;/span>&lt;span class="s">&amp;#34;references&amp;#34;&lt;/span> &lt;span class="na">identifier-type=&lt;/span>&lt;span class="s">&amp;#34;doi&amp;#34;&lt;/span>&lt;span class="nt">&amp;gt;&lt;/span>10.5061/dryad.684v0&lt;span class="nt">&amp;lt;/inter_work_relation&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/related_item&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/program&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nt">&amp;lt;/doi_relations&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;br>
&lt;blockquote>
&lt;p>In general, use the relation type &lt;code>references&lt;/code> for data and software resources.&lt;/p>
&lt;/blockquote>
&lt;p>Publishers who wish to specify that the data or software resource was generated as part of the research results can use the &lt;code>isSupplementedBy&lt;/code> relation type.&lt;/p>
&lt;h2 id="the-how">The &amp;lsquo;how&amp;rsquo;&lt;/h2>
&lt;h3 id="i-create-my-own-xml-and-register-it-with-crossref">I create my own XML and register it with Crossref&lt;/h3>
&lt;p>Add links to datasets into your reference lists, including their DOIs if available as shown above and deposit them with Crossref. We’ll do the rest. If you want to add references to existing metadata records, you don’t need to redeposit the full article metadata, you can send us a &lt;a href="https://support.crossref.org/hc/en-us/articles/215578403" target="_blank">resource-only deposit&lt;/a> that just contains the reference metadata to append that to the existing metadata for the article. You can also use this method if you prefer to deposit references in a separate workflow to registering your content (we know some members prefer to work this way).&lt;/p>
&lt;h3 id="ive-started-using-metadata-manager-for-journal-article-deposits">I’ve started using Metadata Manager for journal article deposits&lt;/h3>
&lt;div style="text-align:center;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/dc.png"
alt="Article&amp;lt;-&amp;gt;Data relationships in Crossref" width="350">&lt;figcaption>
&lt;p>Article&amp;lt;-&amp;gt;Data relationships in Crossref&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;/div>
&lt;p>You can deposit data citations using either method using our new &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">Metadata Manager&lt;/a> tool. When entering journal article metadata, you can use the ‘Related Items’ section to enter the DOI (or other identifier) for the dataset, the type of identifier, a description of the relation type e.g. &amp;lsquo;Data from: Extreme genetic structure in a social bird species despite high dispersal capacity’, and the relation type - ‘references’ or ‘is supplemented by’ depending on the relationship between the data and the article as described above. When you make the deposit, this relationship information will be registered in Crossref along with the rest of the article metadata.&lt;/p>
&lt;p>Metadata Manager also has a section where you can enter and match your references, and then deposit these with Crossref. If you choose this method, enter any data citations into the references section before depositing the article metadata with Crossref.&lt;/p>
&lt;p>If you want to add this information to deposits you have already made using Metadata Manager, you can search for the journals and articles in the interface, bring up the existing metadata and add in the additional information before redepositing.&lt;/p>
&lt;h3 id="i-use-simple-text-query-to-search-for-and-deposit-references">I use &amp;ldquo;simple text query&amp;rdquo; to search for and deposit references&lt;/h3>
&lt;p>Make sure you include any citations to data in the references you add into Simple Text Query. When you use simple text query to deposit these references, they will then be added into the article metadata in the Crossref database.&lt;/p>
&lt;p>If you use OJS, they’re working on functionality (due for release soon) that will make it easier to deposit reference metadata with Crossref, so you can include citations to data in that.&lt;/p>
&lt;p>All of this metadata&amp;mdash;registered with Crossref&amp;mdash;make it possible to build up pictures of data citations, linking, and relationships. Whether the citations come from the authors in the reference list or they are extracted by the publisher and then deposited, Crossref collects them across publishers. We then make the aggregate set freely available via &lt;a href="https://www.crossref.org/services/metadata-retrieval">Crossref’s APIs&lt;/a> in multiple interfaces (REST, OAI-­PMH, OpenURL) and formats (XML and JSON). DataCite does the same for data repositories and so this provides an easy way for publishers and data repositories to exchange information about data citations. As mentioned previously, this all feeds in Event Data. Data is made openly available to a wide host of parties across the extended research ecosystem including funders, research organisations, technology and service providers, indexers, research data frameworks such as &lt;a href="https://documentation.ardc.edu.au/cpg/scholix" target="_blank">Scholix&lt;/a>, etc.&lt;/p>
&lt;p>Do you have questions about how to add these links to your Crossref or DataCite metadata? We’ll be running a series of webinars in early 2019 to give you a chance to join us live and ask any questions you have. Eager to get started in the meantime? &lt;a href="mailto:support@crossref.org">Let us know&lt;/a> and we’ll start to coordinate.&lt;/p></description></item><item><title>Matchmaker, matchmaker, make me a match</title><link>https://www.crossref.org/blog/matchmaker-matchmaker-make-me-a-match/</link><pubDate>Mon, 12 Nov 2018 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/matchmaker-matchmaker-make-me-a-match/</guid><description>&lt;p>Matching (or resolving) bibliographic references to target records in the collection is a crucial algorithm in the Crossref ecosystem. Automatic reference matching lets us discover citation relations in large document collections, calculate citation counts, H-indexes, impact factors, etc. At Crossref, we currently use a matching approach based on reference string parsing. Some time ago we realized there is &lt;a href="https://www.crossref.org/labs/resolving-citations-we-dont-need-no-stinkin-parser/">a much simpler approach&lt;/a>. And now it is finally battle time: which of the two approaches is better?&lt;/p>
&lt;h3 id="tldr">TL;DR&lt;/h3>
&lt;ul>
&lt;li>I evaluated and compared four approaches to reference matching: the legacy approach based on reference parsing, and three variants of the new idea called &lt;strong>search-based matching&lt;/strong>.&lt;/li>
&lt;li>A large &lt;strong>automatically generated dataset&lt;/strong> was used for the experiments. It is composed of 7,374 metadata records from the Crossref collection, each of which was formatted automatically into reference strings using 11 citation styles.&lt;/li>
&lt;li>The main metrics used for the evaluation are &lt;a href="https://en.wikipedia.org/wiki/Precision_and_recall" target="_blank">precision and recall&lt;/a>. I also use &lt;a href="https://en.wikipedia.org/wiki/F1_score" target="_blank">F1&lt;/a> as a standard metric that combines precision and recall into a single number, weighing them equally. All values are calculated for each metadata record separately and averaged over the dataset.&lt;/li>
&lt;li>In general, search-based matching is better than the legacy approach in F1 and recall, but worse in precision.&lt;/li>
&lt;li>The best variant of &lt;strong>search-based matching outperforms the legacy approach in average F1 (84.5% vs. 52.9%)&lt;/strong>, with the average precision worse by only 0.1% (99.2% vs 99.3%), and the average recall better by 88% (79.0% vs. 42.0%).&lt;/li>
&lt;li>The best variant of search-based matching also outperforms the legacy approach in average F1 for each one of the 11 styles.&lt;/li>
&lt;li>A weak spot of the parsing-based approach is degraded/noisy reference strings, which do not appear to use any of the known citation styles.&lt;/li>
&lt;li>A weak spot of search-based approach is short reference strings, and in particular citation styles that do not include the title in the reference string.&lt;/li>
&lt;/ul>
&lt;h3 id="introduction">Introduction&lt;/h3>
&lt;p>In reference matching, on the input we have a bibliographic reference. It can have the form of an unstructured string, such as:&lt;/p>
&lt;p>&lt;em>(1) Adamo, S. H.; Cain, M. S.; Mitroff, S. R. Psychological Science 2013, 24, 2569–2574.&lt;/em>&lt;/p>
&lt;p>The input can also have the form of a structured reference, such as (BibTex format):&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-JSON" data-lang="JSON">&lt;span class="line">&lt;span class="cl"> &lt;span class="err">@article&lt;/span>&lt;span class="p">{&lt;/span>&lt;span class="err">adamo2013,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">author&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="err">{Stephen&lt;/span> &lt;span class="err">H.&lt;/span> &lt;span class="err">Adamo&lt;/span> &lt;span class="err">and&lt;/span> &lt;span class="err">Matthew&lt;/span> &lt;span class="err">S.&lt;/span> &lt;span class="err">Cain&lt;/span> &lt;span class="err">and&lt;/span> &lt;span class="err">Stephen&lt;/span> &lt;span class="err">R.&lt;/span> &lt;span class="err">Mitroff&lt;/span>&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">title&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="p">{&lt;/span>&lt;span class="err">Self-Induced&lt;/span> &lt;span class="err">Attentional&lt;/span> &lt;span class="err">Blink:&lt;/span> &lt;span class="err">A&lt;/span> &lt;span class="err">Cause&lt;/span> &lt;span class="err">of&lt;/span> &lt;span class="err">Errors&lt;/span> &lt;span class="err">in&lt;/span> &lt;span class="err">Multiple-Target&lt;/span> &lt;span class="err">Search&lt;/span>&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">journal&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="p">{&lt;/span>&lt;span class="err">Psychological&lt;/span> &lt;span class="err">Science&lt;/span>&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">volume&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="p">{&lt;/span>&lt;span class="err">24&lt;/span>&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">number&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="p">{&lt;/span>&lt;span class="err">12&lt;/span>&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">pages&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="p">{&lt;/span>&lt;span class="err">2569-2574&lt;/span>&lt;span class="p">}&lt;/span>&lt;span class="err">,&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="err">year&lt;/span> &lt;span class="err">=&lt;/span> &lt;span class="p">{&lt;/span>&lt;span class="err">2013&lt;/span>&lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="err">}&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>The goal of matching is to find the document, which the input reference points to.&lt;/p>
&lt;h3 id="matching-algorithms">Matching algorithms&lt;/h3>
&lt;p>Matching references is not a trivial task even for a human, not to mention the machines, which are still a bit less intelligent than us (or so they want us to believe…). A typical meta-approach to reference matching might be to score the similarity between the input reference and the candidate target documents. The document most similar to the input is then returned as the target.&lt;/p>
&lt;p>Of course, still a lot can go wrong here. We can have more than one potential target record with the same score (which one do we choose?). We can have only documents with low to medium scores (is the actual target even present in our collection?). We can also have errors in the input string (are the similarity scores robust enough?). Life&amp;rsquo;s tough!&lt;/p>
&lt;p>The main difference between various matching algorithms is in fact how the similarity is calculated. For example, one idea might be to compare the records field by field (how similar is the title/author/journal in the input reference to the title/author/journal of our candidate target record?). This is roughly how the matching works currently at Crossref.&lt;/p>
&lt;p>The main problem with this approach is that it requires a structured reference, and in practise, often all we have is a plain reference string. In such a case we need to extract the metadata fields from the reference string (this is called parsing). Parsing introduces errors, since no parser is omniscient. The errors propagate further and affect the scoring… you get the picture.&lt;/p>
&lt;p>Luckily, as &lt;a href="https://www.crossref.org/labs/resolving-citations-we-dont-need-no-stinkin-parser/">we have known for some time now&lt;/a>, this is not the only approach. Instead of comparing structured objects, we could calculate the similarity between them using their unstructured textual form. This effectively eliminates the need for parsing, since the unstructured form is either already available on the input or can be easily generated from the structured form.&lt;/p>
&lt;p>What about the similarity scores? We already know a powerful method for scoring the similarities between texts. Those are (you guessed it!) scoring algorithms used by search engines. Most of them, including &lt;a href="https://search.crossref.org" target="_blank">Crossref&amp;rsquo;s&lt;/a>, do not need a structured representation of the object, they are perfectly happy with just a plain text query.&lt;/p>
&lt;p>So all we need to do is to pass the original reference string (or some concatenation of the reference fields, if only a structured reference is available) to the search engine and let it score the similarity for us. It will also conveniently sort the results so that it is easy to find the top hit.&lt;/p>
&lt;h3 id="evaluation">Evaluation&lt;/h3>
&lt;p>So far so good. But which strategy is better? Is it better to develop an accurate parser, or just rely on the search engine? I don&amp;rsquo;t feel like guessing. Let&amp;rsquo;s try to answer this using (data) science. But first, we need to decompose our question into smaller pieces.&lt;/p>
&lt;h4 id="question-1-how-can-i-measure-the-quality-of-a-reference-matcher">Question 1. How can I measure the quality of a reference matcher?&lt;/h4>
&lt;p>Generally speaking, this can be done by checking the resulting citation links. Simply put, the better the links, the better the matching approach must have been.&lt;/p>
&lt;p>A few standard metrics can be applied here, including &lt;a href="https://en.wikipedia.org/wiki/Accuracy_and_precision" target="_blank">accuracy&lt;/a>, &lt;a href="https://en.wikipedia.org/wiki/Precision_and_recall" target="_blank">precision, recall&lt;/a> and &lt;a href="https://en.wikipedia.org/wiki/F1_score" target="_blank">F1&lt;/a>. We decided to calculate precision, recall and F1 separately for each document in the dataset, and then average those numbers over the entire dataset.&lt;/p>
&lt;p>When I say &amp;ldquo;documents&amp;rdquo;, I really mean &amp;ldquo;target documents&amp;rdquo;:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>precision&lt;/strong> for a document X tells us, what percentage of links to X in the system are correct&lt;/li>
&lt;li>&lt;strong>recall&lt;/strong> for a document X tells us, what percentage of true links to X are present in the system&lt;/li>
&lt;li>&lt;strong>F1&lt;/strong> is the harmonic mean of precision and recall&lt;/li>
&lt;/ul>
&lt;p>F1 is a single-number metric combining precision and recall. In F1 precision and recall are weighted equally. It is also possible to combine precision and recall using different weights, to place more emphasis on one of those metrics.&lt;/p>
&lt;p>We decided to look at links from the target document&amp;rsquo;s perspective, because this is what the academic world cares about (i.e. how accurate the citation counts of academic papers are).&lt;/p>
&lt;p>Calculating separate numbers for individual documents and averaging them within a dataset is the best way to have reliable confidence intervals (which makes the whole analysis look much smarter!).&lt;/p>
&lt;h4 id="question-2-which-approaches-should-be-compared">Question 2. Which approaches should be compared?&lt;/h4>
&lt;p>In total we tested four reference matching approaches.&lt;/p>
&lt;p>The first approach, called the &lt;strong>legacy approach&lt;/strong>, is the approach currently used in Crossref ecosystem. It uses a parser and matches the extracted metadata fields against the records in the collection.&lt;/p>
&lt;p>The second approach is the &lt;strong>search-based matching (SBM)&lt;/strong> with a &lt;strong>simple threshold&lt;/strong>. It queries the search engine using the reference string and returns the top hit from the results, if its relevance score exceeds the threshold.&lt;/p>
&lt;p>The third approach is the &lt;strong>search-based matching (SBM)&lt;/strong> with a &lt;strong>normalized threshold&lt;/strong>. Similarly as in the simplest SBM, in this approach we query the search engine using the reference string. In this case the first hit is returned if its normalized score (the score divided by the reference length) exceeds the threshold.&lt;/p>
&lt;p>Finally, the fourth approach is a variation of the search based matching, called &lt;strong>search-based matching with validation (SBMV)&lt;/strong>. In this algorithm we use additional validation procedure on top of SBM. First, SBM with a normalized threshold is applied and the search results with the scores exceeding the normalized threshold are selected as candidate target documents. Second, we calculate validation similarity between the input string and each of the candidates. This validation similarity is based on the presence of the candidate record&amp;rsquo;s metadata fields (year, volume, issue, pages, the last name of the first author, etc.) in the input reference string, as well as the relevance score returned by the search engine. Finally, the most similar candidate is returned as the final target document, if its validation similarity exceeds the &lt;strong>validation threshold&lt;/strong>.&lt;/p>
&lt;p>By adding the validation stage to the search-based matching we make sure that the same bibliographic numbers (year, volume, etc.) are present in both the input reference and the returned document. We also don&amp;rsquo;t simply take the first result, but rather use this validation similarity to choose from results scored similarly by the search engine.&lt;/p>
&lt;p>All the thresholds are parameters which have to be set prior to the matching. The thresholds used in these experiments were chosen using a separate dataset, as the values maximizing the F1 of each algorithm.&lt;/p>
&lt;h4 id="question-3-how-to-create-the-dataset">Question 3. How to create the dataset?&lt;/h4>
&lt;h3 id="results">Results&lt;/h3>
&lt;p>We could try to calculate our metrics for every single document in the system. Since we currently have &lt;a href="https://doi.org/10.64000/c8tcs-9vm83" target="_blank">over 100M of them&lt;/a>, this would take a while, and we already felt impatient&amp;hellip;&lt;/p>
&lt;p>A faster strategy was to use &lt;a href="https://en.wikipedia.org/wiki/Sampling_%28statistics%29" target="_blank">sampling&lt;/a> with all the tools statistics was so generous to provide. And this is exactly what we did. We used a random sample of 2500 items from our system, which is big enough to give reliable results and, as we will see later, produces quite narrow confidence intervals.&lt;/p>
&lt;p>Apart from the sample, we needed some input reference strings. We generated those automatically by formatting the metadata of the chosen items using various citation styles. (Similarly to what happens when you automatically format the bibliography section for your article. Or at least we hope you don&amp;rsquo;t produce those reference strings manually…)&lt;/p>
&lt;p>For each record in our sample, we generated 11 citation strings, using the following styles:&lt;/p>
&lt;ul>
&lt;li>Well known citation styles from various disciplines:
&lt;ul>
&lt;li>american-chemical-society (acs)&lt;/li>
&lt;li>american-institute-of-physics (aip)&lt;/li>
&lt;li>elsevier-without-titles (ewt)&lt;/li>
&lt;li>apa&lt;/li>
&lt;li>chicago-author-date&lt;/li>
&lt;li>modern-language-association (mla)&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Known styles + random noise. To simulate not-so-clean data, we randomly added noise (additional spaces, deleted spaces, typos) to the generated strings of the following styles:
&lt;ul>
&lt;li>american-institute-of-physics&lt;/li>
&lt;li>apa&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Custom degraded &amp;ldquo;styles&amp;rdquo;:
&lt;ul>
&lt;li>degraded: a simple concatenation of authors&amp;rsquo; names, title, container title, year, volume, issue and pages&lt;/li>
&lt;li>one author: a simple concatenation of the first author&amp;rsquo;s name, title, container title, year, volume, issue and pages&lt;/li>
&lt;li>title scrambled: same as degraded, but with title words randomly shuffled&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>Some styles include the DOI in the reference string. In such cases we stripped the DOI from the string, to make the matching problem non-trivial.&lt;/p>
&lt;p>An ideal matching algorithm will match every generated string to the record it was generated from. In practise, some of the expected matches will be missing, which will lower the recall of the tested matching approach. On the other hand, it is very probable that we will get the precision of 100%. To have the precision lower than 100%, we would have to have some unexpected matches to our sampled documents, which is unlikely. This is obviously not great, because we are missing a very important piece of information.&lt;/p>
&lt;p>What can we do to “encourage” such mismatches to our sampled documents? We could generate additional reference strings of documents that are not in our sample, but are similar to the documents in our sample. Hopefully, we will see some incorrect links from those similar strings to our sampled documents.&lt;/p>
&lt;p>For each sampled document I added up to 2 similar documents (I used, surprise surprise, our search engine to find the most similar documents). I ended up with 7,374 items in total (2,500 originally sampled and 4,874 similar items). For each item, 11 different reference strings were generated. Each reference string was then matched using the tested approaches and I could finally look at some results.&lt;/p>
&lt;h3 id="results-1">Results&lt;/h3>
&lt;p>First, let&amp;rsquo;s compare the overall results averaged over the entire dataset:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/matching_comparison_overall.png" alt="overall comparison of reference matching evaluation" width="500px" />
&lt;p>The small vertical black lines at the top of the boxes show the confidence intervals at the confidence level 95%. The table gives the exact values and the same confidence intervals. The best result for each metric is bolded.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>average precision&lt;/th>
&lt;th>average recall&lt;/th>
&lt;th>average F1&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>legacy approach&lt;/td>
&lt;td>&lt;strong>0.9933&lt;/strong>&lt;br />(0.9910 - 0.9956)&lt;/td>
&lt;td>0.4203&lt;br />(0.4095 - 0.4312)&lt;/td>
&lt;td>0.5289&lt;br /> (0.5164 - 0.5413)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SBM (simple threshold)&lt;/td>
&lt;td>0.9890&lt;br />(0.9863 - 0.9917)&lt;/td>
&lt;td>0.7127&lt;br />(0.7021 - 0.7233)&lt;/td>
&lt;td>0.7866&lt;br />(0.7763 - 0.7968)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SBM (normalized threshold)&lt;/td>
&lt;td>0.9872&lt;br />(0.9844 - 0.9901)&lt;/td>
&lt;td>&lt;strong>0.7905&lt;/strong>&lt;br />(0.7796 - 0.8015)&lt;/td>
&lt;td>0.8354&lt;br />(0.8249 - 0.8458)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>SBMV&lt;/td>
&lt;td>0.9923&lt;br />(0.9902 - 0.9945)&lt;/td>
&lt;td>0.7902&lt;br />(0.7802 - 0.8002)&lt;/td>
&lt;td>&lt;strong>0.8448&lt;/strong>&lt;br />(0.8352 - 0.8544)&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>The confidence intervals given in the table are the ranges, in which it is 95% likely to have the real average precision, recall and F1. For example, we are 95% sure that the real F1 for SBMV in our entire collection is within the range 0.8352 - 0.8544.&lt;/p>
&lt;p>As we can see, each metric has a different winner.&lt;/p>
&lt;p>&lt;strong>The legacy approach is the best in precision&lt;/strong>. This suggests the legacy approach is quite conservative and outputs a match only if it is very sure about it. This might also result in missing a number of true matches (false negatives).&lt;/p>
&lt;p>According to the paired Student&amp;rsquo;s t-test, the difference between the average precision of the legacy approach and the average precision of the second best SBMV is not statistically significant. This means we cannot rule out that this difference is simply the effect of the randomness in sampling, and not the sign of the true difference.&lt;/p>
&lt;p>&lt;strong>SBM with a normalized threshold is the best in recall&lt;/strong>. This suggests that it is fairly tolerant and returns a lot of matches, which might also result in returning more incorrect matches (false positives). Also in this case the difference between the winner and the second best (SBMV) is not statistically significant.&lt;/p>
&lt;p>&lt;strong>SBMV is the best in F1&lt;/strong>. This shows that this approach balances precision and recall the best, despite being only the second best in both of those metrics. According to the paired Student&amp;rsquo;s t-test, the difference between SBMV and the second best approach (SBM with a normalized threshold) is &lt;strong>statistically significant&lt;/strong>.&lt;/p>
&lt;p>&lt;strong>All variants of the search-based matching outperform the parsing-based approach in terms of F1&lt;/strong>, with statistically significant differences. This shows that in search based-matching it is possible to keep precision almost as good as in the legacy approach, and still include many more true positives.&lt;/p>
&lt;p>Let&amp;rsquo;s also look at the same results split by the citation style:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/matching_comparison_by_style.png" alt="comparison of reference matching evaluation by style" width="500px" />
&lt;p>For all styles the precision values are very high, and the legacy approach is slightly better than all variations of the search-based approach.&lt;/p>
&lt;p>In terms of recall and F1 SBM with a simple threshold is better than the legacy approach in 8 out of 11 styles. The three styles for which the legacy approach outperforms SBM with a simple threshold are styles that do not include the title in the reference strings (acs, aip and ewt). The reason for this is that the simple threshold cannot be well calibrated for shorter and longer reference strings at the same time.&lt;/p>
&lt;p>SBM with a normalized threshold and &lt;strong>SBMV is better than the legacy approach in recall and F1 for all 11 styles&lt;/strong>.&lt;/p>
&lt;p>The weak spot of the legacy approach is degraded and noisy reference strings, which do not appear to use any of the known citation styles.&lt;/p>
&lt;p>The weak spot of the search-based matching is short reference strings, and in particular citation styles that do not include the title in the string.&lt;/p>
&lt;h3 id="limitations">Limitations&lt;/h3>
&lt;p>The limitations are related mostly to the method of building the dataset.&lt;/p>
&lt;ul>
&lt;li>All the numbers reported here are estimates, since they were calculated on a sample.&lt;/li>
&lt;li>The numbers show strengths and weaknesses of each approach, but they do not reflect the real precision and recall in the system:
&lt;ul>
&lt;li>Since we included only 2 similar documents for each document in the sample, precision is most likely lower in the real data.&lt;/li>
&lt;li>We used a number of styles distributed uniformly. Of course in the real system the styles and their distribution might be different, which affects all the calculated numbers.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>What does the sample say?</title><link>https://www.crossref.org/blog/what-does-the-sample-say/</link><pubDate>Fri, 09 Nov 2018 00:00:00 +0000</pubDate><author>Dominika Tkaczyk</author><guid>https://www.crossref.org/blog/what-does-the-sample-say/</guid><description>&lt;p>At Crossref Labs, we often come across interesting research questions and try to answer them by analyzing our data. Depending on the nature of the experiment, processing &lt;a href="https://doi.org/10.64000/c8tcs-9vm83" target="_blank">over 100M records&lt;/a> might be time-consuming or even impossible. In those dark moments we turn to sampling and statistical tools. But what can we infer from only a sample of the data?&lt;/p>
&lt;p>Imagine you are cooking soup. You just put some salt in it and now you are wondering if it is salty enough. What do you do next?&lt;/p>
&lt;ul>
&lt;li>Option #1: Since you carefully measured 1/7 of a teaspoon of salt per 0.13 litres of soup (as always), you already know the soup is fine. Everyone else better stop asking silly questions and eat their soup.&lt;/li>
&lt;li>Option #2: You stir everything carefully and taste a tablespoon. If it is not salty enough, you put more salt in the soup and repeat the tasting procedure.&lt;/li>
&lt;li>Option #3: You eat a tablespoon of soup and it tastes fine. But wait, there&amp;rsquo;s more soup in the pot, what if the sip you&amp;rsquo;ve just tasted was somehow different than the rest? You decide it&amp;rsquo;s better to eat another spoon of soup (which tastes fine). Still, a lot of soup left, who knows what that tastes like? It might be safer to eat an entire bowl of soup. Hmm, still not sure, you&amp;rsquo;ve eaten such a small fraction of the soup, who can guarantee the rest tastes the same? You have no choice but to eat another bowl, and then some more… Ooops, now you have eaten the entire pot of soup! At least you can be 100% sure now that the soup was indeed salty enough. The problem is, there is no soup left, and also, you don&amp;rsquo;t feel so good. But people are getting hungry, so you start cooking a new batch…&lt;/li>
&lt;/ul>
&lt;p>If your answer was option #3, read on. Your life is going to get easier!&lt;/p>
&lt;h3 id="tldr">TL;DR&lt;/h3>
&lt;ul>
&lt;li>Sampling and confidence intervals can be used to estimate the mean of a certain feature, or the proportion of items passing a certain test, by calculating it only for a random sample of items, instead of the entire large set of items. Note that estimating =/= guessing.&lt;/li>
&lt;li>Confidence intervals are a way of controlling the amount of uncertainty related to randomness in sampling.&lt;/li>
&lt;li>The confidence interval has a form (estimated value - something, estimated value + something). Confidence interval at the confidence level 95% is interpreted as follows: we are 95% sure that the real value that we are estimating is within our calculated confidence interval.&lt;/li>
&lt;li>The higher the confidence level (i.e. the more certain we want to be about the interval), the wider the interval has to be.&lt;/li>
&lt;li>The larger the sample, the narrower the confidence interval.&lt;/li>
&lt;li>We are never 100% sure that the value we are estimating is actually within our calculated confidence interval. By setting the confidence level high, we only make sure this is a very likely event.&lt;/li>
&lt;/ul>
&lt;h3 id="the-problem">The problem&lt;/h3>
&lt;p>Sampling and estimating drew my attention while I was working on the evaluation of the reference matching algorithms. In Crossref&amp;rsquo;s case, reference matching is the task of finding the target document DOI for the given input reference string, such as:&lt;/p>
&lt;p>&lt;em>(1) Adamo, S. H.; Cain, M. S.; Mitroff, S. R. Psychological Science 2013, 24, 2569–2574.&lt;/em>&lt;/p>
&lt;p>Accurate reference matching is very important for the scientific community. Thanks to automatic reference matching we are able to find citing relations in large document sets, calculate citation counts, H-indexes, impact factors, etc.&lt;/p>
&lt;p>For several weeks now I have been investigating &lt;a href="https://www.crossref.org/labs/resolving-citations-we-dont-need-no-stinkin-parser/">a simple reference matching algorithm based on the search engine&lt;/a>. In this algorithm, we use the input reference string as the query in the search engine, and we return the first item from the results as the target document. Luckily, at Crossref we already have &lt;a href="https://search.crossref.org" target="_blank">a good search engine&lt;/a> in place, so all the pieces are there.&lt;/p>
&lt;p>I was interested in how well this simple algorithm works, i.e. how often the correct target document is found. For example, let&amp;rsquo;s say we have a reference string in APA citation style generated for a specific record in Crossref system. How certain can I be that it will be correctly matched to the record&amp;rsquo;s DOI?&lt;/p>
&lt;p>I could calculate this directly by generating the APA reference string for every record in the system and trying to match those strings to DOIs. Since we already have &lt;a href="https://doi.org/10.64000/c8tcs-9vm83" target="_blank">over 100M records&lt;/a>, this would take a while and I was getting impatient. So instead of eating the whole pot of soup, I decided to stir and taste just a little bit of it, or, academically speaking, use &lt;a href="https://en.wikipedia.org/wiki/Sampling_%28statistics%29" target="_blank">sampling&lt;/a> and &lt;a href="https://en.wikipedia.org/wiki/Confidence_interval" target="_blank">confidence intervals&lt;/a>.&lt;/p>
&lt;p>These statistical tools are useful in situations, where we have a large set of items, and we want to know the average of a certain feature of an item in our set, or the proportion of items passing a certain test, but calculating it directly is impossible or difficult. For example, we might want to know the average height of all women living in USA, the average salary of a Java programmer in London, or the proportion of book records in the Crossref collection. The entire set we are interested in is called a &lt;strong>population&lt;/strong> and the value we are interested in is called a &lt;strong>population average&lt;/strong> or a &lt;strong>population proportion&lt;/strong>. Sampling and confidence intervals let us estimate the population average or proportion using only a sample of items, in a reliable and controlled way.&lt;/p>
&lt;h3 id="experiments">Experiments&lt;/h3>
&lt;p>In general I wanted to see, how well I can estimate the population proportion of records passing a certain test, using only a sample.&lt;/p>
&lt;p>In the following experiments, the population is 1 million metadata records from the Crossref collection. I didn&amp;rsquo;t use the entire collection as the population, because I wanted to be able to calculate the real proportion and compare it to the estimates.&lt;/p>
&lt;p>The test for a single record is: whether the APA reference string generated from said record is correctly matched to the record&amp;rsquo;s original DOI. In other words: if I generate the APA reference string from my record and use it as the query in Crossref&amp;rsquo;s search, will the record be the first element in the result list? Note that this proportion can also be interpreted as the probability that the APA reference string will be correctly matched to the target DOI.&lt;/p>
&lt;h4 id="estimating-from-a-sample">Estimating from a sample&lt;/h4>
&lt;p>I took a random sample of size 100 from my population and calculated the proportion of the records correctly matched - this is called a &lt;strong>sample proportion&lt;/strong>. In my case, the sample proportion is 0.92. This means that in my sample 92 reference strings were successfully matched to the right DOIs. Not too bad.&lt;/p>
&lt;p>I could now treat this number as the estimate and assume that 0.92 is close to the population proportion. On the other hand, this is only a sample, and a rather small one, which raises doubts. What if our 92 correct matches happen to be the only correct matches in the entire 1M population? In such a case, our estimate of 0.92 would be very far from the population proportion. This uncertainty related to sampling randomness can be captured by the confidence interval.&lt;/p>
&lt;h4 id="confidence-interval">Confidence interval&lt;/h4>
&lt;p>The confidence interval for my 100-point sample, at the confidence level 95%, is 0.8668-0.9732. This is interpreted as follows: we are 95% sure that the real population proportion is within the range 0.8668-0.9732. Note that the sample average (0.92) is exactly in the middle of this range.&lt;/p>
&lt;p>100 items is not a big sample. Let&amp;rsquo;s calculate the confidence interval for a sample 10 times larger. From a sample of size 1000 I got the estimate 0.932, and the confidence interval 0.9164-0.9476. Based on this, we can be 95% sure that the real population proportion is within the range 0.9164-0.9476.&lt;/p>
&lt;p>It seems the our interval got smaller when we increased the sample size. Let&amp;rsquo;s plot the intervals for a variety of sample sizes:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_ci_by_size.png" alt="confidence interval vs sample size" width="500px" />
&lt;p>The blue line represents the estimated proportion for samples of different sizes, and the grey vertical lines are confidence intervals. The estimated proportion varies, because for each size a different sample was drawn.&lt;/p>
&lt;p>We can see that increasing the sample size decreases the interval. This should make intuitive sense: if we have more data to estimate from, we can expect our estimate to be more reliable (i.e. closer to the population proportion).&lt;/p>
&lt;p>What about the confidence level? By setting the confidence level we specify, how certain we want to be about our confidence interval. So far I used 95%. What happens if I calculate the confidence intervals for my original sample of 100 records, but with varying confidence level?&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_ci_by_cl.png" alt="confidence interval vs confidence level" width="500px" />
&lt;p>In this case the average is always the same, because only one sample was used.&lt;/p>
&lt;p>As we can see, increasing the confidence level widens the interval. In other words, the more certain we want to be about the interval containing the real population average, the wider the interval has to be.&lt;/p>
&lt;h4 id="sampling-distribution">Sampling distribution&lt;/h4>
&lt;p>So far so good, but where does this magic confidence interval actually come from? It is calculated by the theoretical analysis of the sampling distribution (not to be confused with sample distribution):&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Sample distribution&lt;/strong> is when we collect one sample of size &lt;em>k&lt;/em> and calculate a certain feature for every element in the sample. It is a distribution of &lt;em>k&lt;/em> values of the feature in one sample.&lt;/li>
&lt;li>&lt;strong>Sampling distribution&lt;/strong> is when we independently collect &lt;em>n&lt;/em> samples, each of size &lt;em>k&lt;/em>, and calculate the sample proportion for each sample. It is the distribution of &lt;em>n&lt;/em> sample proportions.&lt;/li>
&lt;/ul>
&lt;p>Imagine I collect all samples of size 100 from my population and I calculate the sample proportion for each sample. This is the sampling distribution. Now I randomly choose one number from this sampling distribution. Note that this is equivalent to what I did before: choosing one random sample of size 100 and calculating its sample proportion.&lt;/p>
&lt;p>According to &lt;a href="https://en.wikipedia.org/wiki/Central_limit_theorem" target="_blank">Central Limit Theorem&lt;/a>, sampling distribution is approximately normal with the mean equal to the population proportion. Here is the visualisation of the sampling distribution:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_sampling_distribution.png" alt="visualization of sampling distribution" width="500px" />
&lt;p>The black vertical line shows the mean of the sampling distribution. This is also the real population proportion. The grey area covers the middle 95% of the distribution mass (within 2 standard deviations from the mean).&lt;/p>
&lt;p>When we choose one sample and calculate the sample proportion, there are two possibilities:&lt;/p>
&lt;ul>
&lt;li>With 95% probability, we were lucky and the sample proportion is within the grey area. In that case, the real population proportion is not further than 2 standard deviations from our estimate.&lt;/li>
&lt;li>With 5% probability, we were unlucky and the sample proportion is outside the grey area. In that case, the real population proportion is further than 2 standard deviations from our estimate.&lt;/li>
&lt;/ul>
&lt;p>So with the confidence of 95% we can say that the real population proportion is within 2 standard deviations from our sample proportion. We can see now that these 2 standard deviations of the sampling distribution define our confidence interval at the confidence level of 95%.&lt;/p>
&lt;p>Smaller confidence level would make the grey area narrower, and the confidence interval would shrink as well. Larger confidence level makes the grey area, and the confidence interval, larger.&lt;/p>
&lt;p>To look more closely at the sampling distribution, I generated sampling distributions for all combinations of &amp;ldquo;&lt;em>n&lt;/em> samples of size &lt;em>k&lt;/em>&amp;rdquo;, where &lt;em>n&lt;/em> and &lt;em>k&lt;/em> are the elements of the set {25, 50, 100, 200, 400, 800, 1600, 3200}. This is only an approximation, since the real sampling distributions would contain many more samples.&lt;/p>
&lt;p>Here is the heatmap showing the mean of each sampling distribution (this should be approximately the same as the real population proportion):&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_sampling_means.png" alt="means of sampling distributions" width="500px" />
&lt;p>We can see that there is some variability in the top left part of the heatmap, which corresponds to small sample sizes and small numbers of samples. The bottom right part of the heatmap shows much less variability. As we increase the sample size and number of samples, the mean of the sampling distribution approaches numbers around 0.933.&lt;/p>
&lt;p>Here is the heatmap showing the standard deviation for each sampling distribution:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_sampling_stdevs.png" alt="standard deviations of sampling distributions" width="500px" />
&lt;p>We can clearly see how the standard deviation decreases when we increase the sample size. This is consistent with the previous observation, that the confidence interval decreases when the sample size is increased.&lt;/p>
&lt;p>Let&amp;rsquo;s also see the histograms of all the sampling distributions:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_sampling_histograms.png" alt="histograms of sampling distributions" width="500px" />
&lt;p>Here we can see the following patterns:&lt;/p>
&lt;ul>
&lt;li>All histograms indeed seem to be centered around approximately the same number.&lt;/li>
&lt;li>The more samples we include, the more normal the sampling distribution appears. This happens because with more samples the real sampling distribution is better approximated.&lt;/li>
&lt;li>The larger the sample size, the narrower the sampling distribution (i.e. smaller standard deviation).&lt;/li>
&lt;/ul>
&lt;h4 id="the-estimation-vs-the-real-value">The estimation vs. the real value&lt;/h4>
&lt;p>Let&amp;rsquo;s go back to my original question. What is the proportion of reference strings in APA style, that are successfully matched to the original DOIs of the records they were generated from? So far we observed the following:&lt;/p>
&lt;ul>
&lt;li>A small sample of 100 gave the estimate 0.92 (confidence interval 0.8668-0.9732)&lt;/li>
&lt;li>A larger samples of 1000 gave the estimate 0.932 (confidence interval 0.9164-0.9476)&lt;/li>
&lt;li>The means of sampling distributions seem to slowly approach 0.933&lt;/li>
&lt;/ul>
&lt;p>So what is the real population proportion in my case? It is 0.933005. As we can see, the estimations were fairly close, and the intervals indeed contain the real value.&lt;/p>
&lt;p>Now I can also calculate the confidence interval for each sample in my sampling distributions, and then the fraction of the intervals that contain the real population proportion (I expect these numbers to be close to the confidence level 95%). Here is the heatmap:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/sampling_sampling_fractions.png" alt="fractions of samples containing the real proportion in confidence interval" width="500px" />
&lt;p>We can see that for larger sample sizes indeed the fractions are high. The fraction is not always above 95%, as we would expect, especially for smaller sample sizes. One of the reasons is that when we calculate the confidence interval, we approximate the standard deviation of the population with the standard deviation of the sample. This is not always a reliable estimate, especially for small samples. This suggests that sample sizes of at least 1000-2000 should be used.&lt;/p>
&lt;h3 id="be-careful">Be careful&lt;/h3>
&lt;p>Some important things to remember:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>Aggregate functions&lt;/strong>. As mentioned before, apart from estimating the proportion, a similar procedure can be applied for estimating the average of a certain numeric feature.&lt;/li>
&lt;li>&lt;strong>(Lack of) certainty&lt;/strong>. Remember that the confidence level &amp;lt; 1. This means that we are never sure that our confidence interval contains the true population proportion. If for any reason you need to be 100% sure, just process the entire dataset.&lt;/li>
&lt;li>&lt;strong>Randomness&lt;/strong>, a.k.a. “stirring before tasting”. The sample has to be chosen randomly. Beware of assuming that the dataset is shuffled and taking the first 1000 rows!&lt;/li>
&lt;li>&lt;strong>Sample size&lt;/strong>. We know already that the larger the sample, the better. As a rule of thumb, using sample sizes &amp;lt; 30 makes the estimates, including the interval, rather unreliable.&lt;/li>
&lt;li>&lt;strong>Skewness&lt;/strong>. In general, the more skewed the original feature distribution, the larger sample we need. In case of the proportion, the sample should contain at least 5 data points of each value of the feature (passes/doesn&amp;rsquo;t pass the test).&lt;/li>
&lt;li>&lt;strong>Generalization&lt;/strong>. The sample average/proportion can be used as an estimate for the population average/proportion, but only the population it was drawn from. This means that if we applied any filters before sampling (which is equivalent to sampling from a subset passing the filter), we can reason only about the filtered subset of the data.&lt;/li>
&lt;li>&lt;strong>Reproducibility&lt;/strong>. This is more of an engineering concern. In short, all the analyses we do should be reproducible. In the context of sampling it means, at the very least, that we should record the samples we use.&lt;/li>
&lt;/ul></description></item><item><title>Why Data Citation matters to publishers and data repositories</title><link>https://www.crossref.org/blog/why-data-citation-matters-to-publishers-and-data-repositories/</link><pubDate>Thu, 08 Nov 2018 00:00:00 +0000</pubDate><author>Helena Cousijn</author><guid>https://www.crossref.org/blog/why-data-citation-matters-to-publishers-and-data-repositories/</guid><description>&lt;p>A couple of weeks ago we shared with you that &lt;a href="https://doi.org/10.64000/y3w79-cfb36" target="_blank">data citation is here&lt;/a>, and that you can start doing data citation today. But why would you want to? There are always so many priorities, why should this be at the top of the list?&lt;/p>
&lt;p>I’m sure you heard this before—data sharing and data citation are important for scientific progress. The three key reasons for this are:&lt;/p>
&lt;h3 id="1-transparency-and-reproducibility">1) Transparency and reproducibility&lt;/h3>
&lt;p>Most scientific results that are shared today are just a summary of what researchers did and found. The underlying data are not available, making it difficult to verify and replicate results. If data would always be made available with publications, transparency of research would be greatly improved.&lt;/p>
&lt;h3 id="2-reuse">2) Reuse&lt;/h3>
&lt;p>The availability of raw data allows other researchers to reuse the data. Not just for replication purposes, but to answer new research questions.&lt;/p>
&lt;h3 id="3-credit">3) Credit&lt;/h3>
&lt;p>When researchers cite the data they used, this forms the basis for a data credit system. Right now researchers are not really incentivized to share their data, because nobody is looking at data metrics and measuring their impact. Data citation is a first step towards changing that.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/dc.png" alt="data article nexus" width="500px" />
&lt;/div>
&lt;p>The benefits described above are all quite long-term, so why, as a publisher or data repository, should you put your resources towards implementing data citation workflows now? During our &lt;a href="https://doi.org/10.5438/qm7p-wy23" target="_blank">pre-conference workshop at FORCE2018&lt;/a> we asked repositories and publishers this question. Below you’ll find some of the answers.&lt;/p>
&lt;h3 id="data-repositories">Data repositories&lt;/h3>
&lt;p>For data repositories, data citation leads to increased visibility of both the repository and the datasets. The workshop revealed that many repositories do a lot of work to establish links between articles and datasets, thereby significantly contributing to transparency in research. Some of the repositories explained that they hire curators that text mine articles to find associations and manually curate datasets to ensure information about links is part of the metadata. This is reflected in Event Data, where 99% of links between articles and datasets comes from data repository metadata. This downstream enrichment of metadata is useful, but it would be more effective if all stakeholders strive to establish these links at a much earlier stage in the research communication process.&lt;/p>
&lt;p>&lt;a href="https://www.icpsr.umich.edu/icpsrweb/" target="_blank">ICPSR&lt;/a>, the Inter-university Consortium for Political and Social Research, shared:&lt;/p>
&lt;p>ICPSR views data citation as vital. As a large social science data archive, ICPSR curates, preserves, and distributes data for the research community to re-use over time. Data citation makes data visible to the research community. Without it, data cannot be accessed for re-use or reproduced for transparency. Its use cannot be tracked and counted to reveal its impact and potential for new uses by investigators in new fields or in combination with new types of data. Data creators cannot receive adequate credit for their intellectual output. And the original investment by funders and scientists to create those data stops producing dividends. Therefore, data citation plays an essential role in the data sharing lifecycle.&lt;/p>
&lt;p>Proper data citation, with a unique identifier, makes it much easier to measure impact. When data use is not cited or cited obliquely, it is rendered virtually invisible. Hence, much data use is still not easily detected. The &lt;a href="https://web.archive.org/web/20181206103836/https://www.icpsr.umich.edu/icpsrweb/ICPSR/citations/" target="_blank">ICPSR Bibliography of Data-related Literature&lt;/a> represents ICPSR’s efforts to identify publications that analyze data distributed at ICPSR and link them directly to the data in the ICPSR catalog. As of 2018, ICPSR has a searchable database that contains nearly 80,000 citations of published and unpublished works resulting from analyses of data held in the archive. ICPSR also makes the case for data citation in its brief new video, &lt;a href="https://www.youtube.com/watch?v=jiCZKV-alC0" target="_blank">“ICPSR 101: Why Should I Cite Data?”&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://www.gbif.org/" target="_blank">GBIF&lt;/a>, the Global Biodiversity Information Facility, explained:&lt;/p>
&lt;p>The work required to collect, clean, compile and publish biodiversity datasets is significant and deserves recognition. Researchers publish studies based on data made available through &lt;a href="https://www.gbif.org/" target="_blank">GBIF.org&lt;/a> at a rate of about 2 papers every single day. It is crucial for GBIF to link these scientific uses to the underlying data as one measure of demonstrating the value and impact of sharing free and open biodiversity data. At the moment, however, only about 10 percent of authors cite or acknowledge the datasets used in research papers properly. As a result, data publishers efforts often risk going unnoticed, and the true impact of sharing data remains invisible. GBIF will continue to work with publishers and researchers to provide guidance and input for how to best cite the use of GBIF-mediated data in scientific journals to ensure proper attribution and reproducible research and to demonstrate the true value of free and open access to biodiversity data.&lt;/p>
&lt;h3 id="publishers">Publishers&lt;/h3>
&lt;p>By ensuring data is cited in a consistent way, publishers help provide transparency and context for the content they publish. Depositing that information as part of the Crossref metadata helps that work go further by uncovering how data is being used across multiple publications and publishers This means patterns can be explored and researchers can gain more comprehensive recognition and credit for the work they have done.&lt;/p>
&lt;p>Melissa Harrison, Head of Production Operations at &lt;a href="https://elifesciences.org/" target="_blank">eLife&lt;/a> says:&lt;/p>
&lt;p>eLife is committed to ensuring researchers get credit for all their outputs, and data is a major component of this. We&amp;rsquo;re working with Crossref and JATS4R to enable publishers to tag their JATS data content consistently and thus create an easy crosswalk to their Crossref deposits. The JATS4R guidance on Data Availability Statements, linked to and incorporating data citations, will be updated soon, please watch that space!&lt;/p>
&lt;p>It will be really interesting to see how much re-use of previously published data is happening, look for patterns in re-use, and see links and hopefully building up of data by different research groups. Ultimately, this will incentivize researchers and publishers to ensure it is correctly accredited at source and in publications, improving the cycle further.’&lt;/p>
&lt;p>Anita de Waard, VP of Research Collaborations at &lt;a href="https://www.elsevier.com/" target="_blank">Elsevier&lt;/a>, says:&lt;/p>
&lt;p>One of the key recommendations of the &lt;a href="https://www.force11.org/about/manifesto" target="_blank">Force11 Manifesto&lt;/a> was to “&lt;a href="https://www.force11.org/about/manifesto#x1-200003.3" target="_blank">3.3&lt;/a> Add data, software, and workflows into the publication as first-class research objects”, which will allow greater reproducibility and rigor to experimental research, and allow the reuse of all digital artefacts in the scholarly lifecycle. By following the data citation principles, we achieve two things: the author presents a richer representation of their work, and the data producer receives credit for the hard work of curating and publishing citable datasets.&lt;/p>
&lt;p>Mendeley Data and Elsevier are active contributors to the &lt;a href="http://www.scholix.org/" target="_blank">Scholix framework&lt;/a> that as a collaborative and open standard, allows the open mining of relationships between articles and datasets. We are also active participants in the new &lt;a href="http://www.copdess.org/enabling-fair-data-project/" target="_blank">Enabling FAIR Data Project&lt;/a>, and next to &lt;a href="https://www.elsevier.com/connect/elsevier-supports-top-guidelines-in-ongoing-efforts-to-ensure-research-quality-and-transparency" target="_blank">supporting the TOP Guidelines&lt;/a> in all domains, require all authors in the earth and space sciences to deposit their data before publication.&lt;/p>
&lt;p>Next week at &lt;a href="https://www.crossref.org/crossref-live-annual/">Crossref LIVE18&lt;/a>, Patricia Cruse from DataCite will talk about Data Citations and why they matter. If you’re in Toronto next week, do not hesitate to ask her or anyone from Crossref anything you want to know about data citation!&lt;/p></description></item><item><title>Ten more days 'til Toronto</title><link>https://www.crossref.org/blog/ten-more-days-til-toronto/</link><pubDate>Fri, 02 Nov 2018 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/ten-more-days-til-toronto/</guid><description>&lt;p>Our LIVE Annual Meeting is back in North America for the first time since 2015, and with just 10 days to go, there’s a lot going on in preparation. As you’d expect with a &lt;code>How good is your metadata?&lt;/code> theme&amp;mdash;the two-days will be entirely devoted to the subject of metadata&amp;mdash;because it touches everything we do, and everything that publishers, hosting platforms, funders, researchers, and librarians do. Oh, and it&amp;rsquo;s actually super awesome too&amp;mdash;and occasionally fun.&lt;/p>
&lt;p>Metadata is what is used to describe the story of research: its origin, its contributors, its attention, and its relationships with other objects. The more machines start to do what humans cannot&amp;mdash;parse millions of files through multiple views&amp;mdash;the more we see what connections are missing, and the more we start to understand the opportunities that better metadata could offer.&lt;/p>
&lt;p>We love metadata so much that we&amp;rsquo;re producing an 8-foot-high depiction of the &amp;lsquo;perfect&amp;rsquo; record, in both XML and JSON, for people to gape at and annotate in person. Sneak preview:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/perfect-record.png"
alt="The perfect metadata record is eight feet tall." width="500">&lt;figcaption>
&lt;p>The perfect metadata record is eight feet tall.
&lt;a href="https://www.crossref.org/people/patricia-feeney">SchemaSchemer&lt;/a>&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>Both days feature plenary-style talks, insights from ourselves and guests who will regale us with tales of metadata woes and wonders.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/people/lisa-hart-martin">Lisa&lt;/a> will be there at the end of Day 1 to update everyone on some recent and potential governance changes, and&amp;mdash;the reason we started these gatherings&amp;mdash;to reveal the results of our &lt;a href="https://www.crossref.org/board-and-governance/elections/2018-slate">2018 board election&lt;/a>, the second contested election we&amp;rsquo;ve held, and already with twice the voters from 2017.&lt;/p>
&lt;p>Our amazing guest speakers are too brilliant and too experienced to highlight in just one blog. But check out the &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2018">LIVE18 schedule&lt;/a> to see what they&amp;rsquo;ll be talking about:&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;ul>
&lt;li>&lt;strong>Patricia Cruse&lt;/strong>, DataCite&lt;/li>
&lt;li>&lt;strong>Ravit David&lt;/strong>, University of Toronto&lt;/li>
&lt;li>&lt;strong>Clare Dean&lt;/strong>, Metadata 2020&lt;/li>
&lt;li>&lt;strong>Paul Dlug&lt;/strong>, American Physical Society&lt;/li>
&lt;li>&lt;strong>Kristen Fisher Ratan&lt;/strong>, CoKo Foundation&lt;/li>
&lt;/ul>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;ul>
&lt;li>&lt;strong>Stefanie Haustein&lt;/strong>, University of Ottawa&lt;/li>
&lt;li>&lt;strong>Bianca Kramer&lt;/strong>, Utrecht University&lt;/li>
&lt;li>&lt;strong>Graham Nott&lt;/strong>, Freelance developer (eLife/JATS)&lt;/li>
&lt;li>&lt;strong>Jodi Schneider&lt;/strong>, University of Urbana-Champaign&lt;/li>
&lt;li>&lt;strong>Shelley Stall&lt;/strong>, American Geophysical Union&lt;/li>
&lt;/ul>
&lt;/div>
&lt;/div>
&lt;p>We’ll be taking over the entire second floor of the Toronto Reference Library, whose three rooms will house a bunch of conversational sessions as well as some more formal talks:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;code>Rally&lt;/code> is the main room where we’ll have the plenary-style talks, a corner for &lt;code>Unscheduled Maintenance&lt;/code> offering live support for your questions about billing or tech for &lt;a href="https://www.crossref.org/people/ryan-mcfall">Ryan&lt;/a>, &lt;a href="https://www.crossref.org/people/shayn-smulyan">Shayn&lt;/a>, &lt;a href="https://www.crossref.org/people/isaac-farley">Isaac&lt;/a>, &lt;a href="https://www.crossref.org/people/jason-hanna">Jason&lt;/a>, &lt;a href="https://www.crossref.org/people/chuck-koscher">Chuck&lt;/a>, &amp;amp; &lt;a href="https://www.crossref.org/people/mike-yalter">Mike&lt;/a>. Running down the whole left side of this room is also the &lt;code>You-are-Crossref&lt;/code> wall where the community will showcase their work with metadata through posters - feel free to bring one along and find &lt;a href="https://www.crossref.org/people/patricia-feeney">Patricia&lt;/a> to get the sticky tack.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The &lt;code>LIVE Lounge&lt;/code> is where you can eat, drink, rest, and chat and where you&amp;rsquo;ll likely find &lt;a href="https://www.crossref.org/people/rosa-morais-clark/">Rosa&lt;/a> as she laises between the caterers, the venue, AV, and all of us. The Lounge is also where we&amp;rsquo;ll gather for much-needed post-election refreshments at the end of Tuesday.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;code>The Bigger Ambitions Room&lt;/code> is where a lot of the &lt;code>Unplugged&lt;/code> sessions will take place. This room will feature three separate stations:&lt;/p>
&lt;ul>
&lt;li>R&amp;amp;D &amp;amp; Product where you can chat with &lt;a href="https://www.crossref.org/people/geoffrey-bilder">Geoffrey&lt;/a>, &lt;a href="https://www.crossref.org/people/esha-datta">Esha&lt;/a>, &lt;a href="https://www.crossref.org/people/jennifer-lin">Jennifer L&lt;/a>, &lt;a href="https://www.crossref.org/people/patrick-polischuk/">Patrick&lt;/a>, and &lt;a href="https://www.crossref.org/people/christine-buske">Christine&lt;/a> about your big ideas for us, and what we&amp;rsquo;re working on already.&lt;/li>
&lt;li>Metadata discussions and annotations of the perfect record (previewed above) with &lt;a href="https://www.crossref.org/people/patricia-feeney">Patricia&lt;/a>, together with space to ideate around metadata principles.&lt;/li>
&lt;li>Uses and users of metadata where &lt;a href="https://www.crossref.org/people/jennifer-kemp">Jennifer K&lt;/a> will help us understand just how far Crossref metadata can reach, and who and what people are doing with it.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>We cannot wait to show you what else we have planned :-)&lt;/p>
&lt;p>For those of you not able to attend, recordings of the presentations will be made available on the &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/">event page&lt;/a> directly soon after.&lt;/p>
&lt;p>Otherwise - see you there!&lt;/p></description></item><item><title>Crossref LIVE Brazil evoked vibrant Q&amp;A session</title><link>https://www.crossref.org/blog/crossref-live-brazil-evoked-vibrant-qa-session/</link><pubDate>Wed, 31 Oct 2018 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/crossref-live-brazil-evoked-vibrant-qa-session/</guid><description>&lt;p>There has been a steady increase in the growth of our membership in Latin America—and in Brazil in particular—over the past few years. We currently have more than 800 Brazil-based members; some as individual members, but most are sponsored by another organisation. As part of our &lt;a href="https://www.crossref.org/events/">LIVE Local program&lt;/a> Chuck Koscher and I traveled to meet some of these members in Goiânia and Fortaleza, where we co-hosted events with Associação Brasileira de Editores Científicos do Brasil (ABEC Brasil)—one of our largest &lt;a href="https://www.crossref.org/community/sponsors/">Sponsors&lt;/a>.&lt;/p>
&lt;p>These events always provide a great opportunity for us to update our members on new and upcoming Crossref developments. They are also an important way for us to discover more about the varied needs of our members’ communities and learn how we can work together better.&lt;/p>
&lt;p>The LIVE Brazil events were attended by more than two hundred members and were held at the Universidade Federal de Goiás and the Universidade de Fortaleza respectively. Chuck and I enthusiastically demonstrated two new tools from Crossref— &lt;a href="https://www.crossref.org/participation/">Participation Reports&lt;/a> and &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a>, we discussed our newest record types—preprints and peer review reports, and continually highlighted the importance (and the uses) of quality metadata.&lt;/p>
&lt;p>We were joined by some fantastic guest speakers; Milton Shintaku from ABEC explained how to register content using the Crossref/OJS deposit plugin and Crossref ambassador, Edilson Damasio, spoke about Similarity Check and gave a demonstration of how to use the iThenticate interface when checking papers for originality.&lt;/p>
&lt;p>The vibrant Q&amp;amp;A sessions reflected the varying needs of the audience. We talked generally about the different Crossref services and went more in-depth with discussions around submitting &lt;a href="https://support.crossref.org/hc/en-us/articles/214357426" target="_blank">relationship&lt;/a> metadata for peer review and preprints. &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark&lt;/a> and its implementation was also a hot topic, as was how to benefit from &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a>—and in particular how to address cases of duplication in submitted manuscripts, and the setting up of plagiarism policies for each journal. There was also a lot of discussion around OJS integrations, and we were able to share that PKP/OJS is currently in the process of enhancing the Crossref/OJS integration, including the ability for publishers to deposit references.&lt;/p>
&lt;p>We were also pleased to see so much interest in supplementing Crossref metadata with references, Similarity Check URLs, license information, etc. To address this we’re running a webinar in Brazilian Portuguese entitled: “Registering content and adding to your Crossref metadata in Portuguese” on 26th November. You can sign up &lt;a href="https://outreach.crossref.org/acton/fs/blocks/showLandingPage/a/16781/p/p-0051/t/page/fm/0" target="_blank">here&lt;/a> if you’d like to attend.&lt;/p>
&lt;p>I’d like to thank Universidade Federal de Goiás and the Universidade de Fortaleza for hosting the events, providing the venues and the translation team, and of course, thanks to everyone who came!&lt;/p>
&lt;p>A special mention of ABEC for their help in organizing and promoting the events. As a &lt;a href="https://www.crossref.org/community/sponsors/">Sponsor&lt;/a>, they relieve our team of an intense amount of technical support, billing, and other administrative burdens, saving us time and expense, while offering a localized service to Brazilian publishers.&lt;/p>
&lt;div style="margin:10px;">
&lt;img src="https://www.crossref.org/images/blog/LIVE-Brazil-ABEC.png" alt=“Brazil LIVE Goiânia" height="150px" width="400px" class="img-responsive" />
&lt;/div>
&lt;p>Crossref staff with co-hosts ABEC and representatives from UFG who helped with the event - thank you!&lt;/p>
&lt;hr></description></item><item><title>It’s not about the money, money, money.</title><link>https://www.crossref.org/blog/its-not-about-the-money-money-money./</link><pubDate>Thu, 18 Oct 2018 00:00:00 +0000</pubDate><author>Amy Bosworth</author><guid>https://www.crossref.org/blog/its-not-about-the-money-money-money./</guid><description>&lt;p>But actually, sometimes it is about the money. As a not-for-profit membership organisation that is obsessed with persistence, we have a duty to remain sustainable and manage our finances in a responsible way. Our annual audit is incredibly thorough, and our outside auditors and Board-based Audit committee consistently report that we’re in good shape.&lt;/p>
&lt;p>Our Membership &amp;amp; Fees committee regularly reviews both membership fees and Content Registration fees for a growing range of research outputs. Together with our staff, the Board regularly reviews financial projections that inform our budgeting process and approve our budget each year.&lt;/p>
&lt;h3 id="financial-sustainability-means-the-persistence-of-our-infrastructure-and-services">Financial sustainability means the persistence of our infrastructure and services&lt;/h3>
&lt;p>We run a tight ship here at Crossref. We have to. So it’s not ideal when we have to chase members and users for late payments, but it’s an important part of keeping the organisation afloat, and keeping our dedicated service to scholarly communications running. And that’s my job at Crossref.&lt;/p>
&lt;p>Working here for over six years now, I’ve seen a lot of development in our finance department. We strive as a team to always improve our communication with members and users to deliver the best ‘customer’ experience. To do this, we are always tweaking our processes to improve efficiency and accuracy, and &lt;a href="mailto:billing@crossref.org">welcome all feedback&lt;/a>.&lt;/p>
&lt;h3 id="how-the-invoice-schedule-works">How the invoice schedule works&lt;/h3>
&lt;p>Our annual membership invoices are sent out each January, and our Content Registration invoices are generated four times a year, each quarter. All invoices are emailed to the billing contact for your organisation (please be sure to update us with any contact changes!) and have a due date of net 45 days. Our invoices now have a “pay now” link in the body of the email. This offers a faster and more convenient way for you to pay, simply by clicking on the link to our payment portal. You can also view invoices as PDFs in the payment portal. An important part of our accounting process is the automated invoice reminder schedule. There are three billing reminders we send by email:&lt;/p>
&lt;ol>
&lt;li>The day immediately after the invoice due date;&lt;/li>
&lt;li>21 days past the invoice due date; and&lt;/li>
&lt;li>45 days past the invoice due date.&lt;/li>
&lt;/ol>
&lt;h3 id="we-dont-want-to-see-you-go">We don’t want to see you go!&lt;/h3>
&lt;p>We understand there are many factors that can make prompt payment a challenge for some people: international transfer delays or fees; funding for your publishing operations may end; change of contacts; problems receiving our emails.&lt;/p>
&lt;p>When an account is 90 days past due, a further email notifies you that your service is at risk of suspension. If an account is then suspended for non-payment it becomes at risk of being ‘terminated’. Once an account has been terminated, you will need to contact our membership specialist to rejoin Crossref. Please note that we send numerous notifications/reminders before suspension or termination takes place (we don’t want to see you go!). We can always be reached at &lt;a href="mailto:billing@crossref.or">billing@crossref.org&lt;/a> for any invoice inquiries you may have.&lt;/p>
&lt;h3 id="tips-that-work-for-other-users">Tips that work for other users&lt;/h3>
&lt;p>There are some things you can do to speed-up or simplify payments:&lt;/p>
&lt;ul>
&lt;li>Pay with a credit card, using our online payment portal. This is fast, convenient, and lower in fees&lt;/li>
&lt;li>Always reference an invoice number on the payment to ensure that it’s applied to your account efficiently&lt;/li>
&lt;li>Be sure to make &lt;a href="mailto:billing@crossref.org">&lt;code>billing@crossref.org&lt;/code>&lt;/a> a ‘safe’ email address, so that you receive our invoices and reminders&lt;/li>
&lt;li>Always keep us up-to-date with any contact changes at your organisation, to ensure that we have accurate information for invoicing and other communication&lt;/li>
&lt;li>We recommend giving us a generic email address for your accounts payable team, such as &lt;code>accounts@publisher.com&lt;/code> so that if somebody leaves that job, invoices can still get through.&lt;/li>
&lt;/ul>
&lt;p>Thanks for working with us! Please let me know in the comments below if you have any feedback or additional tips for your fellow Crossref community members.&lt;/p>
&lt;hr></description></item><item><title>Good, better, best. Never let it rest.</title><link>https://www.crossref.org/blog/good-better-best.-never-let-it-rest./</link><pubDate>Tue, 16 Oct 2018 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/good-better-best.-never-let-it-rest./</guid><description>&lt;p>Best practices seem to be having a moment. In the ten years since the &lt;a href="https://www.crossref.org/working-groups/books">Books Advisory Group&lt;/a> first created a &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/books-and-chapters/">best practice guide for books&lt;/a>, the community beyond Crossref has developed or updated at least 17 best practice resources, as &lt;a href="http://www.metadata2020.org/resources/metadata-best-practices/" target="_blank">collected here&lt;/a> by the &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a> initiative. (Full disclosure: I co-chair its Best Practices group.)&lt;/p>
&lt;p>Books have been one of the fastest growing resource/record types at Crossref for some time, and best practices are just one of the Book Advisory Group&amp;rsquo;s efforts. Over the past ten years, the members of the books group have updated and added to the guide, and it’s now time for it to get some visibility, &lt;a href="https://www.crossref.org/education/content-registration/content-types-intro/books-and-chapters/">so we have added it to our website&lt;/a> for easy reference.&lt;/p>
&lt;p align="center">&lt;img src="https://www.crossref.org/images/blog/bookcontent.png" alt="bookscontent" width="75%" />
&lt;/p>
&lt;p>These best practices are not documented for the sake of it. They have real value and can help guide internal conversations to evaluate current practices, for example. They can also play a role in making or changing policies, training staff and providing instructions to authors on citation formatting.&lt;/p>
&lt;p>Here are a few recent changes I’d like to highlight:&lt;/p>
&lt;ul>
&lt;li>A new section has been added that addresses books hosted on multiple platforms&lt;/li>
&lt;li>The section on versions, (including books in multiple formats) has been expanded and clarified&lt;/li>
&lt;li>A section on the use of DOIs in citations has been added&lt;/li>
&lt;/ul>
&lt;p>It is neither final nor comprehensive, and never will be. Best practices by their very nature must evolve over time—and those with such a broad scope as books will inevitably lack some detail—but that’s all the more reason for the community to stay engaged. Looking ahead to future work from the group, chapter-level metadata is likely to get more attention.&lt;/p>
&lt;p>Over the past few years the Books Advisory Group, chaired with aplomb by Emily Ayubi of the American Psychological Association (APA), has spent a lot of time on Crossref initiatives, like &lt;a href="https://www.crossref.org/get-started/multiple-resolution/">Multiple Resolution&lt;/a> and &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">DOI display changes&lt;/a> but also on broader industry topics like ORCID iDs for book authors, and the Books Citation Index.&lt;/p>
&lt;p>As Emily’s term as chair comes to an end this year, we welcome Charles Watkinson of the University of Michigan as chair starting in 2019. The group meets next on 12 December when we will hear from &lt;a href="https://coko.foundation/" target="_blank">Coko&lt;/a> about Editoria and have a discussion about developing our new &lt;a href="https://doi.org/10.64000/99444-1qs40" target="_blank">Metadata Manager&lt;/a> Content Registration tool for books, and more.&lt;/p>
&lt;p>If you want to share your thoughts on best practices or if you have other topics you’d like us to consider, &lt;a href="mailto:feedback@crossref.org">please get in touch&lt;/a>.&lt;/p>
&lt;hr></description></item><item><title>Metadata Manager: Members, represent!</title><link>https://www.crossref.org/blog/metadata-manager-members-represent/</link><pubDate>Mon, 15 Oct 2018 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/metadata-manager-members-represent/</guid><description>&lt;p>&lt;a href="https://doi.org/10.64000/c8tcs-9vm83" target="_blank">Over 100 Million unique scholarly works&lt;/a> are distributed into systems across the research enterprise 24/7 via our APIs at a rate of around 633 Million queries a month. Crossref is broadcasting descriptions of these works (metadata) to all corners of the digital universe.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/broadcastmetadata.png" alt="broadcastmetadata" width="150px" />
&lt;/div>
&lt;p>Whether you’re a publisher, institution, governmental agency, data repository, standards body, etc.: when you register and update your metadata with Crossref, you’re relaying it to the entire research enterprise. So make sure your publications are fully and accurately represented.&lt;/p>
&lt;h2 id="metadata-manager-is-here-to-help">Metadata Manager is here to help&lt;/h2>
&lt;p>This year, we’ve released a new tool aimed to make this easier and give you, members, full control over your metadata. Presenting: &lt;strong>&lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a>&lt;/strong>. It helps to:&lt;/p>
&lt;ul>
&lt;li>Simplify and streamline the &lt;a href="https://www.crossref.org/services/content-registration">Content Registration&lt;/a> service, with a user-friendly interface&lt;/li>
&lt;li>Give you greater flexibility and control of metadata deposits&lt;/li>
&lt;li>Support users who are less familiar with XML&lt;/li>
&lt;li>Boost metadata quality, encourage cleaner and more complete metadata records&lt;/li>
&lt;/ul>
&lt;p>Metadata Manager is available to all our members and the service providers they work with, providing assistance with a wide range of metadata-related tasks:&lt;/p>
&lt;ul>
&lt;li>Regular Content Registration conducted by journal staff, editors and service providers&lt;/li>
&lt;li>Registering corrections, retractions, or other editorial expressions of concern&lt;/li>
&lt;li>Matching references to their DOIs and registering them with the publication&lt;/li>
&lt;li>Adding metadata to existing records such as license and funding information, abstracts, or data citations&lt;/li>
&lt;li>Late-arriving editorial updates/corrections after initial publication&lt;/li>
&lt;li>Unexpected corrections to production hiccups&lt;/li>
&lt;li>Emergency editorial changes that affect publication record&lt;/li>
&lt;li>Accelerated registration for special pieces published outside of regular workflow&lt;/li>
&lt;li>Securely and efficiently transfer titles to another publisher as the authorized owner&lt;/li>
&lt;/ul>
&lt;p>Issues arise all the time in the dynamic and challenging work of scholarly communications. Metadata Manager provides a fast and easy way to meet these head-on when broadcasting new content or updating existing content. Submissions through this tool are processed immediately upon submission (i.e., no queues!).&lt;/p>
&lt;p>This new tool empowers our members to “represent” in the exhilarating thrum of data reaching our API users. At this moment in time, it only supports journals, but our development team is currently working hard to include the remaining record types.&lt;/p>
&lt;h2 id="features">Features&lt;/h2>
&lt;p>Here’s a smattering of highlights from the Metadata Manager feature list:&lt;/p>
&lt;ul>
&lt;li>All metadata: easily adds any and all metadata, allowing publishers to add richness and depth to their records.&lt;/li>
&lt;li>Prevents rejected submissions: it ensures you have satisfied all the basic Content Registration requirements and points out any input errors.&lt;/li>
&lt;li>Expedited deposit: the Content Registration system processes each submission immediately, bypassing the deposit queue.&lt;/li>
&lt;li>Historic log: easy to read archive of all previous submissions.&lt;/li>
&lt;li>Effortless review: provides a clean, condensed view of metadata (invariably complicated and lengthy) to support human review of the content before submission.&lt;/li>
&lt;li>Aids members to follow best practices: checks for completeness and reminds users of the full breadth of metadata available for the article, volume/issue, and the journal itself.&lt;/li>
&lt;li>Full control over title transfers: no need to make these requests through our support channels. Complete the transfer at your convenience, directly through the system.&lt;/li>
&lt;/ul>
&lt;p>For those of you that have looked at your own metadata contribution with the use of our new &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a>, you’ll find using Metadata Manager a quick and useful way to help you level-up your records.&lt;/p>
&lt;h2 id="members-represent">Members, represent!&lt;/h2>
&lt;p>We invite you to register and update your publications with Metadata Manager, relay the metadata fully and accurately to the entire research enterprise. Check out the comprehensive &lt;a href="https://www.crossref.org/education/member-setup/metadata-manager/">help documentation&lt;/a> to find out how to set up your workspace and get started right away with your usual Content Registration login details.&lt;/p>
&lt;p>As mentioned, we are continuing development, adding support for all remaining record types as well as enhancing existing features. The webDeposit form will remain available throughout this time. For journal publishers, give us a whirl and &lt;a href="mailto:support@crossref.org">let us know&lt;/a> if you see something missing or there’s a function that would improve your Content Registration experience!&lt;/p></description></item><item><title>Using the Crossref REST API. Part 12 (with Europe PMC)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-12-with-europe-pmc/</link><pubDate>Wed, 10 Oct 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-12-with-europe-pmc/</guid><description>&lt;p>As part of our blog series highlighting &lt;a href="https://www.crossref.org/categories/api-case-study">some of the tools and services that use our API&lt;/a>, we asked Michael Parkin&amp;mdash;Data Scientist at the European Bioinformatics Institute&amp;mdash;a few questions about how Europe PMC uses our metadata where preprints are concerned.&lt;/p>
&lt;h3 id="tell-us-a-bit-about-europe-pmc">Tell us a bit about Europe PMC&lt;/h3>
&lt;p>&lt;a href="https://europepmc.org/" target="_blank">Europe PMC&lt;/a> is a knowledgebase for life science research literature and a platform for innovation based on the content, such as text mining. It contains 34.6 million abstracts and 5 million full-text articles. At Europe PMC we support the research community by developing tools for knowledge discovery, linking publications with underlying research data, and building infrastructure to support text and data mining. Our goal is to create a supportive environment around open access content and data, to maximise its reuse.&lt;/p>
&lt;h3 id="what-problem-is-your-service-trying-to-solve">What problem is your service trying to solve?&lt;/h3>
&lt;p>Recent years have seen a dramatic increase in &lt;a href="https://www.crossref.org/categories/preprints">the popularity of preprints&lt;/a> within life sciences literature. Preprints have been supported by Crossref since November 2016. In response to the rise in popularity, we have started indexing preprints alongside traditional journal publishing within Europe PMC. We expect this will:&lt;/p>
&lt;ol>
&lt;li>provide another means to access and discover this emergent form of scholarly content&lt;/li>
&lt;li>help explore more transparently the role of preprints in the publishing ecosystem&lt;/li>
&lt;li>support their inclusion in processes such as grant reporting and credit attribution systems&lt;/li>
&lt;/ol>
&lt;p align="center">&lt;img src="https://www.crossref.org/images/blog/epmc1.png" alt="context" width="75%" />
&lt;/p>
&lt;h3 id="how-do-you-use-crossref-metadata">How do you use Crossref metadata?&lt;/h3>
&lt;p>Europe PMC operates an open citation network that uses reference lists from our full-text content, supplemented with metadata supplied by the Crossref OAI-PMH API. The number of citations we retrieve from Crossref increased significantly in 2017 thanks to the efforts of the &lt;a href="https://i4oc.org/" target="_blank">Initiative for Open Citations&lt;/a> (I4OC) in improving awareness about sharing citation data.&lt;/p>
&lt;p>Our work to ingest preprints into Europe PMC, however, represents our first use of the Crossref REST API. We make a series of queries for each preprint provider, making use of the “posted-content”, “prefix” and (optionally) “has-abstract” filters. We intend to migrate to using the REST API for the majority of retrievals of Crossref content in due course.&lt;/p>
&lt;h3 id="what-metadata-values-do-you-make-use-of">What metadata values do you make use of?&lt;/h3>
&lt;p>Currently we make use of the following fields:&lt;/p>
&lt;ul>
&lt;li>&lt;code>posted&lt;/code> as a publication date&lt;/li>
&lt;li>&lt;code>abstract&lt;/code>&lt;/li>
&lt;li>&lt;code>DOI&lt;/code>&lt;/li>
&lt;li>&lt;code>author&lt;/code> for author given names and surnames&lt;/li>
&lt;li>&lt;code>title&lt;/code> as the preprint title&lt;/li>
&lt;li>&lt;code>is-preprint-of&lt;/code> to establish preprint –&amp;gt; article links&lt;/li>
&lt;/ul>
&lt;h3 id="how-often-do-you-extractquery-metadata">How often do you extract/query metadata?&lt;/h3>
&lt;p>We query the REST API daily making use of the &lt;code>from-index-date&lt;/code> filter and cursor pagination to insert new or modify existing records. This means that preprints will be available in Europe PMC within 24 hours of the metadata being sent to Crossref. We store the full REST response in MongoDB, a document-based database. Here are some examples of Crossref API queries used to preprint provider &lt;em>PeerJ Preprints&lt;/em>:&lt;/p>
&lt;pre tabindex="0">&lt;code>calling `https://api.crossref.org/works?filter=type:posted-content,has-abstract:true,from-index-date:2018-07-29,prefix:10.7287&amp;amp;sort=updated&amp;amp;rows=1000&amp;amp;cursor=*`
calling `https://api.crossref.org/works?filter=type:posted-content,has-abstract:true,from-index-date:2018-07-29,prefix:10.7287&amp;amp;sort=updated&amp;amp;rows=1000&amp;amp;cursor=AoN4ldf88uQCe6e1g%2FPkAj8SaHR0cDovL2R4LmRvaS5vcmcvMTAuNzI4Ny9wZWVyai5wcmVwcmludHMuMjcwNjJ2MQ%3D%3D`
Done importing PeerJ Preprints
modified: 2
inserted: 10
&lt;/code>&lt;/pre>&lt;h3 id="what-do-you-do-with-the-metadata">What do you do with the metadata?&lt;/h3>
&lt;p>From the database we parse out the relevant fields and pass them to our main relational database prior to indexing. This avails the preprint abstracts to all of the value-added services we offer for peer-reviewed abstracts, such as citations, grants, ORCID claiming, text mining, etc. We assign a unique persistent identifier comprising “PPR” followed by a number (1) to each preprint record.&lt;/p>
&lt;p>This is displayed on the Europe PMC site as an abstract record, analogous to PubMed records, but with an obvious banner (2) indicating to readers the preprint designation; a tooltip provides further explanation of what a preprint is in comparison to a peer-reviewed article.&lt;/p>
&lt;p>Once available on the Europe PMC platform, we then apply downstream processes including:&lt;/p>
&lt;ul>
&lt;li>providing an Unpaywall link directly to the full-text (3);&lt;/li>
&lt;li>adding a hyperlink to the final published version (if there is one that we can detect) (4);&lt;/li>
&lt;li>incorporating the preprint into our citation network (5);&lt;/li>
&lt;li>adding useful links to e.g. alternative metrics, scientific comments and peer reviews, underlying research data in life science databases (6);&lt;/li>
&lt;li>providing text mined annotations via SciLite (7);&lt;/li>
&lt;li>including funding information (8);&lt;/li>
&lt;li>displaying ORCID claims in the author list (9).&lt;/li>
&lt;/ul>
&lt;p align="center">&lt;img src="https://www.crossref.org/images/blog/epmc2.png" alt="context" width="75%" />
&lt;/p>
&lt;h3 id="what-are-the-future-plans-for-europe-pmc-and-preprints">What are the future plans for Europe PMC and preprints?&lt;/h3>
&lt;p>The inclusion of preprints within Europe PMC is of immediate benefit to researchers who want to explore the very latest research. Moreover we see this as an opportunity for both ourselves and the community to explore how preprints fit into the wider publishing ecosystem; for example to answer questions such as: How often will they be cited? How will they be linked to grant funding and other credit systems? How will they be reused?&lt;/p>
&lt;h3 id="what-else-would-you-like-our-api-to-do">What else would you like our API to do?&lt;/h3>
&lt;p>The REST API and rich metadata model provided by Crossref around preprints are both excellent, but the population of the metadata fields by preprint providers can be limited and/or heterogeneous. The key challenge we see is in encouraging providers to populate the Crossref metadata fields more fully and in a uniform manner.&lt;/p>
&lt;hr>
&lt;p>Thanks to Michael.&lt;/p>
&lt;p>If you&amp;rsquo;d like to share how you use our Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>A wrap up of the Crossref blog series for SciELO</title><link>https://www.crossref.org/blog/a-wrap-up-of-the-crossref-blog-series-for-scielo/</link><pubDate>Fri, 05 Oct 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/a-wrap-up-of-the-crossref-blog-series-for-scielo/</guid><description>&lt;p>Crossref member SciELO (Scientific Electronic Library Online), based in Brazil, celebrated two decades of operation last week with a three-day event &lt;a href="https://www.scielo20.org/en/" target="_blank">The SciELO 20 Years Conference&lt;/a>.&lt;/p>
&lt;p>The celebration constituted an important landmark in SciELO’s evolution, and an exceptional moment for them to promote the advancement of an inclusive, global approach to scholarly communication and to the open access movement.&lt;/p>
&lt;p>As part of the anniversary activities SciELO asked us to write a series of five blogs that would help the organisations of Brazil to better understand the following:&lt;/p>
&lt;ul>
&lt;li>Why all articles should have a DOI&lt;/li>
&lt;li>The critical role of the DOI&lt;/li>
&lt;li>The basics of record types, translations, preprints, Crossmark, and more&lt;/li>
&lt;li>The basics of Crossref sponsorship, and&lt;/li>
&lt;li>How to make the most of your Crossref membership&lt;/li>
&lt;/ul>
&lt;p>Below you’ll find an abstract of each of these blog posts as well as a link to the published posts in Brazilian Portuguese, Spanish and English.&lt;/p>
&lt;p>&lt;strong>Why all articles should have a DOI&lt;/strong> &lt;br>
In today’s world, an author’s work needs a Digital Object Identifier (DOI) for it to become discoverable, citable, and linkable. This unique alphanumeric string identifies the content of a research work, and remains associated with it irrespective of changes to its web location. Discover the origins of the DOI, how Crossref was founded, and why they continue to exist and persist.&lt;/p>
&lt;p>Read the full blog in &lt;a href="https://blog.scielo.org/blog/2018/07/17/as-razoes-porque-o-crossref-existe-e-persiste/#.W7XScBNKhQI" target="_blank">Brazilian Portuguese&lt;/a>, &lt;a href="https://blog.scielo.org/es/2018/07/17/por-que-crossref-existe-y-persiste/#.W7XSYRNKhQI" target="_blank">Spanish&lt;/a>, or &lt;a href="https://blog.scielo.org/en/2018/07/17/why-crossref-exists-and-persists/#.W3QO7ZNKg0o" target="_blank">English&lt;/a>&lt;/p>
&lt;p>&lt;strong>The critical role of the DOI&lt;/strong> &lt;br>
Find out why URL links to research articles are fragile, and how DOIs are essential in building stable, persistent links between research objects. This is achieved through the metadata that members deposit with Crossref, as part of their obligations. Learn how we can all contribute to creating a global, robust research record.&lt;/p>
&lt;p>Read the full blog in &lt;a href="https://blog.scielo.org/es/2018/08/02/el-papel-critico-del-doi/#.W7db8hNKhQI" target="_blank">Spanish&lt;/a> or &lt;a href="https://blog.scielo.org/en/2018/08/02/the-critical-role-of-the-doi/#.W7dcARNKhQI" target="_blank">English&lt;/a>&lt;/p>
&lt;p>&lt;strong>The basics of record types: Preprints, Crossmark, translations, and more&lt;/strong> &lt;br>
What’s the difference between preprints and ahead of print? When should you use each; and, what are the DOI requirements? This article answers those questions and provides a basic overview of how to connect the metadata records of related record types, like translations.&lt;/p>
&lt;p>Read the full blog in &lt;a href="https://blog.scielo.org/blog/2018/08/22/os-fundamentos-sobre-os-tipos-de-conteudo-preprints-crossmark-traducoes-e-muito-mais/#.W7dcDhNKhQI" target="_blank">Brazilian Portuguese&lt;/a>, &lt;a href="https://blog.scielo.org/es/2018/08/22/conceptos-basicos-de-los-tipos-de-contenido-preprints-crossmark-traducciones-y-mas/" target="_blank">Spanish&lt;/a>, or &lt;a href="https://blog.scielo.org/en/2018/08/22/the-basics-of-content-types-preprints-crossmark-translations-and-more/#.W7dcLBNKhQI" target="_blank">English&lt;/a>&lt;/p>
&lt;p>&lt;strong>The basics of Crossref sponsorship&lt;/strong> &lt;br>
There are many organisations that want to register content and benefit from the services Crossref provides, but may not be able to do so alone. These organisations use sponsors. Sponsors are organisations who publish on behalf of groups of smaller organisations. Nearly 650 of our 800 Brazilian members are represented by such a sponsor.&lt;/p>
&lt;p>Read the full blog in &lt;a href="https://blog.scielo.org/blog/2018/08/31/os-fundamentos-do-patrocinio-no-crossref/#.W7dcQRNKhQI" target="_blank">Brazilian Portuguese&lt;/a>, &lt;a href="https://blog.scielo.org/es/2018/08/31/los-fundamentos-del-patrocinio-en-crossref/" target="_blank">Spanish&lt;/a>, or &lt;a href="https://blog.scielo.org/en/2018/08/31/the-basics-of-sponsorship-at-crossref/#.W7dcWhNKhQI" target="_blank">English&lt;/a>&lt;/p>
&lt;p>&lt;strong>How to make the most of your Crossref membership&lt;/strong> &lt;br>
Since Crossref was founded in 2000, its member organisations have registered metadata and persistent identifiers (DOIs) for over 100 million content items. This information is used extensively by the research community—individuals and organisations—who need to find, cite, link and assess research outputs. As a SciELO member, the metadata you provide to Crossref when you register content is key to the discoverability of your journal content.&lt;/p>
&lt;p>Read the full blog in &lt;a href="https://blog.scielo.org/blog/2018/10/03/como-os-periodicos-podem-aproveitar-ao-maximo-sua-associacao-ao-crossref/#.W7dcaBNKhQK" target="_blank">Brazilian Portuguese&lt;/a>, &lt;a href="https://blog.scielo.org/es/2018/10/03/como-las-revistas-pueden-aprovechar-al-maximo-la-membresia-de-crossref/#.W7XRsRNKhQI" target="_blank">Spanish&lt;/a>, or &lt;a href="https://blog.scielo.org/en/2018/10/03/how-journals-can-make-the-most-of-crossref-membership/#.W7UYkGhKiUk" target="_blank">English&lt;/a>&lt;/p>
&lt;hr></description></item><item><title>Data citation: let’s do this</title><link>https://www.crossref.org/blog/data-citation-lets-do-this/</link><pubDate>Thu, 04 Oct 2018 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/data-citation-lets-do-this/</guid><description>&lt;p>Data citation is seen as one of the most important ways to establish data as a first-class scientific output. At Crossref and DataCite we are seeing growth in journal articles and other record types citing data, and datasets making the link the other way. Our organisations are committed to working together to help realize the data citation community’s ambition, so we’re embarking on a dedicated effort to get things moving.&lt;/p>
&lt;p>Efforts regarding data citation are not a new thing. One of the first large-scale initiatives to establish data citation as a standard academic practice was the FORCE11 &lt;a href="https://www.force11.org/datacitationprinciples" target="_blank">Joint Declaration of Data Citation Principles&lt;/a> (JDDCP) in 2014. This declaration was endorsed by over 100 organisations in the scholarly community as well as many individuals.&lt;/p>
&lt;p>Following this agreement on how data citation should be done, many projects followed. Within FORCE11, the &lt;a href="https://force11.org/group/data-citation-implementation-pilot-dcip/" target="_blank">Data Citation Implementation Pilot&lt;/a> brought together publishers and repositories to put data citation into practice and work on the implementation of the JDDCP. Within the context of the &lt;a href="https://www.rd-alliance.org/" target="_blank">Research Data Alliance&lt;/a>,
a data-literature linking group started under the name of &lt;a href="https://documentation.ardc.edu.au/cpg/scholix" target="_blank">Scholix&lt;/a> to establish a framework for exchanging information about the relationships between articles and datasets. The infrastructure building blocks now feed into projects such as &lt;a href="https://makedatacount.org/" target="_blank">Make Data Count&lt;/a> and &lt;a href="https://copdess.org/enabling-fair-data-project/" target="_blank">Enabling FAIR Data&lt;/a>.&lt;/p>
&lt;p>Projects aside, if datasets are cited consistently and in a standard way, it will make it much easier for the research community to see links between different research outputs and work with these outputs. It also makes it much easier to count these citations, so that researchers can get credit for their data and the sharing of that data.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/data_article_nexus_short.png" alt="An exemplary image" width="500px" />
&lt;/div>
&lt;p>The underlying work has been done to create an infrastructure that will effectively support and disseminate information on data citation. Data citation is here today!&lt;/p>
&lt;p>Different organisations know how to handle data citations, and are starting to count these and make that information available in turn. This means that the only thing that’s needed is for people to actually cite data, and this information be captured and passed on. Some Crossref and DataCite members have already made great progress on this already (see Melissa Harrison’s &lt;a href="https://doi.org/10.64000/vbfmx-mt44" target="_blank">blog on what eLife is doing&lt;/a>).&lt;/p>
&lt;p>The goals of all the data citation projects can only be realized if you start doing data citation, and we know you’ll have questions about it…&lt;/p>
&lt;p>In the coming months, we’ll be posting several blogs and organizing sessions to tell you how you can start doing data citation - if you’re attending FORCE2018 you can catch our &lt;a href="https://force2018.sched.com/event/Fs0A/contributing-and-consuming-data-metrics-to-make-your-data-count" target="_blank">joint workshop&lt;/a> there. So stay tuned and please &lt;a href="mailto:rlammey@crossref.org">get in touch&lt;/a> if you can’t wait, we’d love to help you get started!&lt;/p></description></item><item><title>100,000,000 records - thank you!</title><link>https://www.crossref.org/blog/100000000-records-thank-you/</link><pubDate>Wed, 26 Sep 2018 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/100000000-records-thank-you/</guid><description>&lt;p>100,000,000. Yes, it’s a really big number—and you helped make it happen. We’d like to say thank you to all our members, without your commitment and contribution we would not be celebrating this significant milestone. It really is no small feat.&lt;/p>
&lt;p>To help put this number into context; the National Museum of China has just over 1 million artifacts, the British Library has around 25 million books, Napster has 40 million tracks, and Wikidata currently contains 50 million+ items.&lt;/p>
&lt;p align="center">&lt;img src="https://www.crossref.org/images/blog/100-mill-1.png" alt="context" width="75%" />
&lt;/p>
&lt;h3 id="digging-into-the-100-million">Digging into the 100 Million&lt;/h3>
&lt;p>Within these 100 Million registered content records there are &lt;a href="https://www.crossref.org/services/content-registration/">many different record types&lt;/a>.&lt;/p>
&lt;p align="center">&lt;img src="https://www.crossref.org/images/blog/100-mill-2.png" alt="record types" width="75%" />
&lt;/p>
&lt;p>And within these record types, more than 69 million records have full-text links, 31 million+ have license information and 3 million+ contain some kind of funding information. An overview of these and other &lt;a href="https://www.crossref.org/dashboard/">Crossref vital statistics&lt;/a> is available on our dashboard.&lt;/p>
&lt;h3 id="100-millionwhat-does-your-contribution-look-like">100 Million—what does your contribution look like?&lt;/h3>
&lt;p>Our recently-launched &lt;a href="https://www.crossref.org/members/prep/" target="_blank">participation reports&lt;/a> allow anyone to see the metadata Crossref has. It’s a valuable education tool for publishers, institutions and other service providers looking to understand the availability of the metadata they have registered with us.&lt;/p>
&lt;p>Through an itemized dashboard Participation Reports allows you to monitor the metadata you are registering, even if this work is done by a third party or another department. You can see for yourself where your gaps are, and what you could improve upon. Next to each metadata element, there’s a short definition, letting you know more about it, and—crucially—what practical steps you can take to improve the score.&lt;/p>
&lt;p>The dashboard provides the percentage counts across ten key metadata elements: References, ORCID iDs, Funder Registry IDs, Funding award numbers, Crossmark metadata, License URLs, Text-mining links, Similarity Check URLs, and Abstracts.&lt;/p>
&lt;p>And not only can you see your own metadata—the dashboard enables you to view the registered metadata of all our 11,076 members.&lt;/p>
&lt;h3 id="how-are-these-100-million-content-records-being-used">How are these 100 Million content records being used?&lt;/h3>
&lt;p>Every service we provide is based on our metadata, and our APIs expose all of that metadata. Over the past year or so we have been collecting use cases from members that actively utilize the Metadata APIs and have turned these into a &lt;a href="https://www.crossref.org/categories/api-case-study">Metadata APIs blog series&lt;/a> so that we can share these stories of how our metadata is used with the wider community.&lt;/p>
&lt;h3 id="a-big-number-even-bigger-ambitions">A big number. Even bigger ambitions.&lt;/h3>
&lt;p>Gaps or errors in metadata are passed on to thousands of other services, which causes problems downstream and means we all suffer. So it makes sense for the metadata you deposit to be as accurate and complete as possible. The more elements there are to the metadata, the higher the chance of others finding and using the content. We aim to continually find effective ways to communicate this wider story around the importance of open infrastructure and metadata.&lt;/p>
&lt;p>Over the years we’ve made great progress in connecting information about researchers, their affiliations, grants, and research outputs. Imagine how much more powerful this information would be if supplemented by more comprehensive, accurate, and up-to-date metadata.&lt;/p>
&lt;hr>
&lt;p>&lt;em>Sources - all data as of Sept 26, 2018&lt;/em>&lt;br>
&lt;a href="https://en.wikipedia.org/wiki/National_Museum_of_China" target="_blank">National Museum of China&lt;/a> has 1,050,000 artifacts&lt;br>
&lt;a href="https://simple.wikipedia.org/wiki/British_Library" target="_blank">The British Library&lt;/a> has around 25 million books, more than any other library&lt;br>
&lt;a href="https://www.wikidata.org/wiki/Wikidata:Statistics" target="_blank">Wikidata&lt;/a> currently contains 50,290,632 items&lt;br>
&lt;a href="https://help.napster.com/hc/en-us/articles/115001406007-Napster-Subscription-Plans" target="_blank">NAPSTER&lt;/a> currently has 40 million tracks (Napster is known as Rhapsody in the US)&lt;/p></description></item><item><title>Join us in Toronto this November for LIVE18</title><link>https://www.crossref.org/blog/join-us-in-toronto-this-november-for-live18/</link><pubDate>Tue, 25 Sep 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/join-us-in-toronto-this-november-for-live18/</guid><description>&lt;p>LIVE18, your Crossref annual meeting, is fast approaching! We’re looking forward to welcoming everyone in Toronto, November 13-14.&lt;/p>
&lt;p>This year’s theme “How good is your metadata?” centers around the definition and benefits of metadata completeness, and each half day will cover some element of the theme:&lt;/p>
&lt;ul>
&lt;li>Day one, AM &lt;em>Defining good metadata&lt;/em>&lt;/li>
&lt;li>Day one, PM &lt;em>Improving metadata quality and completeness&lt;/em>&lt;/li>
&lt;li>Day two, AM &lt;em>What does good metadata enable?&lt;/em>&lt;/li>
&lt;li>Day two, PM &lt;em>Who is using our metadata and what are they doing with it?&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>Both days will be packed with a mixture of plenary and interactive sessions. Speakers include:&lt;/p>
&lt;ul>
&lt;li>Patricia Cruse, DataCite&lt;/li>
&lt;li>Kristen Fisher Ratan, CoKo Foundation&lt;/li>
&lt;li>Stefanie Haustein, University of Ottawa&lt;/li>
&lt;li>Bianca Kramer, Utrecht University&lt;/li>
&lt;li>Shelley Stall, American Geophysical Union&lt;/li>
&lt;li>Ravit David, University of Toronto Libraries&lt;/li>
&lt;li>Graham Nott, Freelance developer of an eLife JATS conversion tool&lt;/li>
&lt;li>Paul Dlug, American Physical Society&lt;/li>
&lt;/ul>
&lt;p>A ‘meet and mingle’ drinks reception will be held directly after the election results on day one.&lt;/p>
&lt;h2 id="about-the-themehow-good-is-your-metadata">About the theme—how good is your metadata?&lt;/h2>
&lt;p>The reach and usefulness of research outputs are only as good as how well they are described. Metadata is what is used to describe the story of research: its origin, its contributors, its attention, and its relationship with other objects.&lt;/p>
&lt;p>The more machines start to do what humans cannot—parse millions of files through multiple views—the more we see what connections are missing, the more we start to understand the opportunities that better metadata can offer.&lt;/p>
&lt;p>LIVE18 will focus this year entirely on the subject of metadata. It touches everything we do, and everything that publishers, hosting platforms, funders, researchers, and libraries do.&lt;/p>
&lt;h2 id="come-and-join-the-discussions">Come and join the discussions&lt;/h2>
&lt;p>&lt;a href="https://www.eventbrite.com/e/crossref-live18-toronto-nov-13-14-crlive18-registration-46284552342" target="_blank">Register to join&lt;/a> us this 13 and 14 November, at the &lt;a href="https://www.torontopubliclibrary.ca/torontoreferencelibrary/" target="_blank">Toronto Reference Library&lt;/a>, 789 Yonge Street, Toronto, Canada—we look forward to seeing you there.&lt;/p>
&lt;hr>
&lt;p>&lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/">Read more about our annual events&lt;/a>&lt;/p></description></item><item><title>Using the Crossref REST API. Part 11 (with MDPI/Scilit)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-11-with-mdpi/scilit/</link><pubDate>Tue, 18 Sep 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-11-with-mdpi/scilit/</guid><description>&lt;p>Continuing our blog series highlighting &lt;a href="https://www.crossref.org/categories/api-case-study">the uses of Crossref metadata&lt;/a>, we talked to Martyn Rittman and Bastien Latard who tell us about themselves, MDPI and Scilit, and how they use Crossref metadata.&lt;/p>
&lt;h2 id="can-you-give-us-a-brief-introduction-yourselves-and-to-mdpiscilit">Can you give us a brief introduction yourselves, and to MDPI/Scilit&lt;/h2>
&lt;p>Martyn is Publishing Services Manager at MDPI. He joined five years ago as an editor and has worked on editorial, production, and software projects. Prior to joining MDPI, he completed a PhD and worked as a postdoc. His research covered physical chemistry, biochemistry and instrument development.
Bastien Latard is the project leader of Scilit. He created Scilit as part of his Master’s degree in 2013. He is now completing a PhD on the subject of semantically linking research articles, using data from Scilit.&lt;/p>
&lt;p>Scilit was developed in 2014 by open access (OA) publisher MDPI with the goal of having a backup of metadata for all OA articles. Soon, Scilit became more general and embraced all articles with a digital object identifier (DOI) from Crossref and those with a Pubmed ID (PMID). After seeing the potential of the database and how it could be used in a number of different contexts, we decided to make it public. Recently, other article types, including preprints have been integrated. Our main goal now is to provide useful services to the research and academic publishing communities.&lt;/p>
&lt;h2 id="what-problem-is-your-service-trying-to-solve">What problem is your service trying to solve?&lt;/h2>
&lt;p>Other indexing databases offer paid access, are highly selective, or host documents apart from research articles. We want to offer a comprehensive database, but also one that clearly identifies open access material. The last part is still a work in progress, but we have made good progress recently.&lt;/p>
&lt;p>To make the access as direct as possible, we have recently integrated several OA aggregators that pick up or host free versions of full-text articles, including CORE, Unpaywall, and PubMed Central.&lt;/p>
&lt;h2 id="can-you-tell-us-how-you-are-using-the-crossref-metadata-api-at-mdpiscilit">Can you tell us how you are using the Crossref Metadata API at MDPI/Scilit?&lt;/h2>
&lt;p>Scilit queries Crossref’s API in order to index metadata for single articles. DOIs are a key part of the system; because they are standards, we can use them to merge new sources into Scilit while avoiding duplicates. We cross-check the data from Crossref against other sources and update it as necessary. Citation data is also really appreciated and opens doors to further developments.&lt;/p>
&lt;p>As a publisher, MDPI makes daily deposits to Crossref, to register journal articles on &lt;a href="http://www.mdpi.com/" target="_blank">mdpi.com&lt;/a>, conference papers from &lt;a href="https://sciforum.net" target="_blank">sciforum.net&lt;/a>, and preprints from &lt;a href="https://www.preprints.org/" target="_blank">Preprints.org&lt;/a>. We also use the data collected at Scilit to find suitable reviewers and let authors know when their work has been cited.&lt;/p>
&lt;h2 id="what-metadata-values-do-you-pull-from-the-api">What metadata values do you pull from the API?&lt;/h2>
&lt;p>As much as we can! Scilit crawls the latest indexed articles every few hours to ensure it is as up-to-date as possible. This is the most important function of our system because it provides metadata for the very latest published articles, including a link to the publisher version. Scilit parses Crossref metadata and saves them. They are then indexed into our solr search engine for fast, real-time usage.&lt;/p>
&lt;h2 id="have-you-built-your-own-interface-to-extract-this-data">Have you built your own interface to extract this data?&lt;/h2>
&lt;p>We wrote our own code to get the data, but the API interface made this very straightforward. Scilit has been developed completely in-house by MDPI and the lead developer, Bastien Latard, is currently completing a PhD looking at how to make the most of the data using semantic data extraction.&lt;/p>
&lt;h2 id="what-are-the-future-plans-for-mdpiscilit">What are the future plans for MDPI/Scilit?&lt;/h2>
&lt;p>Scilit is and will be highly used in MDPI current and future projects. We have a few ideas about how to improve Scilit. We are, for example, implementing a scientific profile networking service, which will allow scholars to build their own (scientific) network with lots of functionalities. We think that it will be a really good place to search, comment, exchange around articles… maybe even more!&lt;/p>
&lt;h2 id="what-else-would-you-like-to-see-the-rest-api-offer">What else would you like to see the REST API offer?&lt;/h2>
&lt;p>Crossref is already doing a great job, especially with its integrated citation data. Maybe further analysis and mapping of data about organisations and institutions would be an improvement.&lt;/p>
&lt;hr>
&lt;p>Thank you Martin and Bastien. If you&amp;rsquo;d like to share how you use the Crossref Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>Where does publisher metadata go and how is it used?</title><link>https://www.crossref.org/blog/where-does-publisher-metadata-go-and-how-is-it-used/</link><pubDate>Mon, 17 Sep 2018 00:00:00 +0000</pubDate><author>Laura J Wilkinson</author><guid>https://www.crossref.org/blog/where-does-publisher-metadata-go-and-how-is-it-used/</guid><description>&lt;p>Earlier this week, colleagues from Crossref, ScienceOpen, and OPERAS/OpenEdition joined forces to run a webinar on “Where does publisher metadata go and how is it used?”.&lt;/p>
&lt;p>Stephanie Dawson explained how ScienceOpen’s freely-accessible, interactive search and discovery platform works by connecting and exposing metadata from Crossref. Her case study showed that articles with additional metadata had much higher average views than those without - depositing richer metadata helps you get the best value from your DOIs!&lt;/p>
&lt;p>Pierre Mounier of OPERAS/OpenEdition showed us how a variety of persistent identifiers (PIDs) including DOIs, ORCID iDs, and Funder Registry IDs have been used on OA book platforms to improve citations, author attribution, and tracking of funding. He described a forthcoming annotations project with Hypothes.is, and explained how Crossref metadata is being used in both usage and alternative metrics.&lt;/p>
&lt;h2 id="five-ways-to-register-content-with-crossref">Five ways to register content with Crossref&lt;/h2>
&lt;p>My overview of Content Registration outlined the five ways to register content with Crossref:&lt;/p>
&lt;ul>
&lt;li>Via the manual &lt;a href="https://apps.crossref.org/webDeposit/" target="_blank">web deposit form&lt;/a>&lt;/li>
&lt;li>Through Crossref’s new &lt;a href="https://www.crossref.org/metadatamanager/" target="_blank">Metadata Manager&lt;/a> tool (beta)&lt;/li>
&lt;li>With OJS’s Crossref plugin - &lt;a href="https://docs.pkp.sfu.ca/crossref-ojs-manual/en/config" target="_blank">more information here&lt;/a> (&lt;a href="https://pkp.sfu.ca/ojs/ojs_download/" target="_blank">see OJS downloads&lt;/a> Version 3.1.0 and above is the best option for supporting the fullest Crossref metadata)&lt;/li>
&lt;li>With a &lt;a href="https://doi.crossref.org" target="_blank">manual XML upload file&lt;/a>&lt;/li>
&lt;li>Or, using HTTPS to POST XML&lt;/li>
&lt;/ul>
&lt;p>I also emphasized the importance of depositing, adding, and updating your metadata, and spoke about:&lt;/p>
&lt;ul>
&lt;li>Basic citation metadata: titles, author names, author affiliations, funding data, publication dates, issue numbers, page numbers, ISSNs, ISBNs&amp;hellip;&lt;/li>
&lt;li>Non-bibliographic metadata: reference lists, ORCID iDs, license data, clinical trial information, abstracts, relationships&amp;hellip;&lt;/li>
&lt;li>Crossmark: errata, retractions, updates, and more&lt;/li>
&lt;li>How important it is to have accurate, clean, and complete metadata&lt;/li>
&lt;li>The importance of registering your back-year records&lt;/li>
&lt;/ul>
&lt;h2 id="how-to-see-the-metadata-you-have">How to see the metadata you have&lt;/h2>
&lt;p>Anna Tolwinksa, Crossref’s Member Experience Manager, gave us an overview of the new &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> tool. She explained how Participation Reports allows anyone to see the metadata Crossref members have registered with us, and how you can see for yourself where the gaps in your metadata are, and—importantly—how you can improve your coverage.&lt;/p>
&lt;h2 id="what-we-learnt">What we learnt&lt;/h2>
&lt;ul>
&lt;li>There are &lt;a href="https://www.crossref.org/participation/">10 key metadata elements or checks&lt;/a> in Participation Reports that aid in Crossref members’ content discoverability, reproducibility and research integrity:
&lt;ul>
&lt;li>References&lt;/li>
&lt;li>&lt;del>Open References&lt;/del> &lt;em>[EDIT 6th June 2022 - all references are now open by default].&lt;/em>&lt;/li>
&lt;li>ORCID iDs&lt;/li>
&lt;li>Funder Registry IDs&lt;/li>
&lt;li>Funding award numbers&lt;/li>
&lt;li>Text mining URLs&lt;/li>
&lt;li>License URLs&lt;/li>
&lt;li>Similarity Check URLs&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Every day, research organisations around the world rely on metadata from Crossref, and use it in a variety of systems. Here are &lt;a href="https://www.crossref.org/categories/api-case-study/">a few examples&lt;/a>. Many organisations that enable research depend on Crossref’s metadata; we received over 650 million queries just last month&lt;/li>
&lt;li>Crossref members should check Participation Reports to see what percentage of their content includes rich metadata
If the percentages are low, Crossref is happy to work with you to help understand and improve your coverage&lt;/li>
&lt;li>Richer metadata helps research to be found, cited, linked to, assessed, and reused&lt;/li>
&lt;li>To make sure your work can be found!&lt;/li>
&lt;/ul>
&lt;p>Catch up with the &lt;a href="https://www.youtube.com/watch?v=RJhDHWhFFAs&amp;amp;feature=youtu.be" target="_blank">webinar recording&lt;/a>, and slides from &lt;a href="https://www.crossref.org/pdfs/crossref-webinar-laura-wilkinson-where-does-publisher-metadata-go-and-how-is-it-used-sep11-2018.pdf" target="_blank">Laura&lt;/a>, &lt;a href="https://www.slideshare.net/slideshow/crossref-webinar-stephanie-dawson-sciencopen-metadata-091118/114165046" target="_blank">Stephanie&lt;/a>, &lt;a href="https://www.crossref.org/pdfs/crossref-webinar-pierre-mounier-where-does-publisher-metadata-go-and-how-is-it-used-sep11-2018.pdf" target="_blank">Pierre&lt;/a>, and &lt;a href="https://www.slideshare.net/slideshow/crossref-webinar-anna-tolwinska-crossref-participation-reports-metadata-091118/114163162" target="_blank">Anna’s&lt;/a> presentations, and please &lt;a href="mailto:feedback@crossref.org">contact us&lt;/a> if you have any questions.&lt;/p>
&lt;hr></description></item><item><title>Event Data is production ready</title><link>https://www.crossref.org/blog/event-data-is-production-ready/</link><pubDate>Wed, 12 Sep 2018 00:00:00 +0000</pubDate><author>Christine Buske</author><guid>https://www.crossref.org/blog/event-data-is-production-ready/</guid><description>&lt;p>We’ve been working on &lt;a href="https://www.crossref.org/services/event-data">Event Data&lt;/a> for some time now, and in the spirit of openness, much of that story has already been &lt;a href="https://www.crossref.org/categories/event-data">shared&lt;/a> with the community. In fact, when I recently joined as Crossref’s &lt;a href="https://doi.org/10.64000/dc6xp-ejp53" target="_blank">Product Manager for Event Data&lt;/a>, I jumped onto an already fast moving train—headed for a bright horizon.&lt;/p>
&lt;p>What’s on the horizon? Well, the reality is you never really reach the horizon. Good product development—in my opinion—is like that train. You keep aiming for the horizon and passing all the stations (milestones) along the way, but the horizon keeps moving as you add features, improve the service, and maybe even review where you are headed. However, for Event Data we are pleased to say we have now arrived at a rather important station.&lt;/p>
&lt;h3 id="technical-readiness">Technical readiness&lt;/h3>
&lt;p>Thank you to all the beta testers who have journeyed with us this far—we’ve listened and learned, refined and rebuilt with the help of your feedback. We are now thrilled to say that we are service production ready. We’ve reached the station called ‘technical readiness’, and are eager to see more users board our train!&lt;/p>
&lt;p>During this time of building and refining, Event Data has grown to include at least 66,7 million events from sources like (in order of magnitude): Wikipedia, Cambia Lens, Twitter, Datacite, F1000, Newfeeds, Reddit links, Wordpress.com, Crossref, Reddit, Hypothesis, and Stackexchange. Wikipedia alone accounts for 50 million events (and counting).&lt;/p>
&lt;h3 id="what-does-this-mean">What does this mean?&lt;/h3>
&lt;p>Event Data is production ready.&lt;/p>
&lt;p>Being production ready means we are not going to make any breaking changes to the code, and we are excited to see more people &lt;a href="https://www.eventdata.crossref.org/guide/" target="_blank">jump on board&lt;/a> to explore where you can go with Event Data, and what product or service you might want to build with it.&lt;/p>
&lt;h3 id="getting-started">Getting started&lt;/h3>
&lt;p>Having a look at Event Data, and using it, is easy. While the &lt;a href="https://www.eventdata.crossref.org/guide/" target="_blank">user guide&lt;/a> outlines everything you need to know to get fully engrossed, you can get your feet wet with a few sample queries:&lt;/p>
&lt;p>Above I mentioned Event Data has about 50 million Wikipedia events, you can check if that has grown by looking at a query that lists all distinct events by source (your browser will need a &lt;a href="https://chrome.google.com/webstore/search/json?hl=en&amp;amp;_category=extensions" target="_blank">JSON viewer&lt;/a> extension):&lt;/p>
&lt;p>&lt;a href="https://api.eventdata.crossref.org/v1/events/distinct?facet=source%3A*&amp;amp;rows=0" target="_blank">&lt;code>https://api.eventdata.crossref.org/v1/events/distinct?facet=source/:*&amp;amp;rows=0&lt;/code>&lt;/a>&lt;/p>
&lt;p>You can also see a &lt;a href="http://live.eventdata.crossref.org/live.html" target="_blank">live stream of events&lt;/a> going through Event Data.&lt;/p>
&lt;p>For all events registered for a specific content item, you simply query &lt;code>http://api.eventdata.crossref.org/v1/events?obj-id=https://doi.org/XXX&lt;/code>, where XXX is replaced with the DOI.&lt;/p>
&lt;h3 id="what-next">What next?&lt;/h3>
&lt;p>We are now focusing on the final stretch towards the official roll-out. Beyond this, we will continue to add sources and features and have a healthy roadmap to keep us on track. We value any feedback you have for us about your own journey with Event Data. Your feedback may help shape the direction we take in the future. Most of all, we are all excited to see what people build with it!&lt;/p>
&lt;p>We look forward to continuing on our Event Data journey and we welcome you all aboard the train! Please &lt;a href="mailto:eventdata@crossref.org">contact me&lt;/a> with your ideas.&lt;/p>
&lt;hr></description></item><item><title>Crossref at the Frankfurt Book Fair 2018</title><link>https://www.crossref.org/blog/crossref-at-the-frankfurt-book-fair-2018/</link><pubDate>Tue, 11 Sep 2018 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/crossref-at-the-frankfurt-book-fair-2018/</guid><description>&lt;h3 id="how-good-is-your-metadata-find-out-at-the-frankfurt-book-fair">How good is your metadata? Find out at the Frankfurt Book Fair&amp;hellip;&lt;/h3>
&lt;p>At the Frankfurt Book Fair this year (Hall 4.2, Stand M82), the Crossref team will be on hand to give you a personal tour of our new &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> tool. Or join us at The Education Stage to hear about how this new tool can help you view, evaluate and improve your metadata participation.&lt;/p>
&lt;center>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;strong>How good is your metadata?&lt;/strong>
Join us Thursday 11th October at 15.30
at the Education Stage in Hall 4.2 to find out&lt;/span>
&lt;/div>
&lt;/center>
&lt;h3 id="lots-of-reasons-to-visit-our-stand">Lots of reasons to visit our stand&lt;/h3>
&lt;p>We’ll be located in the same place as last year, Hall 4.2, Stand M82, and there are lots of reasons to visit us:&lt;/p>
&lt;p>Get your metadata participation evaluated - &lt;a href="https://www.crossref.org/people/anna-tolwinska">Anna Tolwinska&lt;/a> and &lt;a href="https://www.crossref.org/people/amanda-bartell">Amanda Bartell&lt;/a> will walk you through your own Participation Report and provide guidance on how to improve your results. Discover how complete your metadata is, where the gaps are, and how other publishers compare.&lt;/p>
&lt;p>Discuss a technical issue that’s hindering your metadata participation (or any other technical issue) with &lt;a href="https://www.crossref.org/people/isaac-farley">Isaac Farley&lt;/a> and &lt;a href="https://www.crossref.org/people/paul-davis">Paul Davis&lt;/a> from our Technical Support team.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/people/jennifer-kemp">Jennifer Kemp&lt;/a> will also be around to answer all your metadata use and reuse questions. She’s looking forward to chatting with all kinds of service providers and toolmakers.&lt;/p>
&lt;p>On the strategy side, &lt;a href="https://www.crossref.org/people/ginny-hendricks/">Ginny Hendricks&lt;/a> will be there on Wednesday 10th if you’d like to discuss any policy stuff, new ideas, or find out what Crossref is planning next.&lt;/p>
&lt;h3 id="ask-us-anything">Ask us anything&lt;/h3>
&lt;p>Not just Participation Reports—you can ask us about anything. Perhaps about our newer record types such as &lt;a href="https://doi.org/10.64000/j5z8g-wdw85" target="_blank">preprints&lt;/a>, pending publications (i.e. DOIs on acceptance), or &lt;a href="https://doi.org/10.64000/hdj5p-8vy92" target="_blank">data citations&lt;/a>. Or, ask us how you can:&lt;/p>
&lt;ul>
&lt;li>Advance scholarly pursuits for the benefit of society, through &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a>&lt;/li>
&lt;li>Check papers for originality, with our service for editorial rigour, through &lt;a href="https://www.crossref.org/services/similarity-check/">Similarity Check&lt;/a>&lt;/li>
&lt;li>Discover where and how research is being discovered, through &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a>&lt;/li>
&lt;li>Reveal who is citing your published papers and how platforms can display this information, with our &lt;a href="https://www.crossref.org/services/cited-by/">Cited-by service&lt;/a>&lt;/li>
&lt;li>Provide evidence of trust in published outputs, revealing updates, corrections and retractions, through our &lt;a href="https://www.crossref.org/services/crossmark/">Crossmark service&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>&lt;a href="mailto:feedback@crossref.org">Let us know&lt;/a> if you’d like to book in a meeting with one of us, or do just stop by the stand to say “Guten Tag”.&lt;/p>
&lt;p>We look forward to seeing you there - bis dann!&lt;/p></description></item><item><title>Presenting PIDapalooza 2019</title><link>https://www.crossref.org/blog/presenting-pidapalooza-2019/</link><pubDate>Tue, 28 Aug 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/presenting-pidapalooza-2019/</guid><description>&lt;p>PIDapalooza, the open festival of persistent identifiers is back and it’s better than ever. Mark your calendar for Dublin, Ireland, January 23-24, 2019 and send us your session ideas by September 21.&lt;/p>
&lt;p>Yes, it’s back and &amp;ndash; with your support &amp;ndash; it’s going to be better than ever! The third annual &lt;a href="https://pidapalooza.org" target="_blank">PIDapalooza&lt;/a> open festival of persistent identifiers will take place at the &lt;a href="https://www.griffith.ie/conference-centre" target="_blank">Griffith Conference Centre&lt;/a>, Dublin, Ireland on January 23-24, 2019 - and we hope you’ll &lt;a href="https://www.eventbrite.com/e/pidapalooza-2019-registration-49295286529" target="_blank">join us&lt;/a> there!&lt;/p>
&lt;p>Hosted, once again, by California Digital Library, Crossref, DataCite, and ORCID, PIDapalooza will follow the same format as past events &amp;ndash; rapid-fire, interactive, 30-60 minute sessions (presentations, discussions, debates, brainstorms, etc.) presented on three stages &amp;ndash; plus main stage attractions, which will be announced shortly. New for this year is an unconference track, as suggested by several attendees last time.&lt;/p>
&lt;p>In the meantime, get those creative juices flowing and send us your session PIDeas! What would you like to talk about? Hear about? Learn about? What’s important for your organisation and your community and why? What’s working and what’s not? What’s needed and what’s missing? We want to hear from as many PID people as possible! Please use &lt;a href="https://goo.gl/forms/EddXcg7TWTCy6Lgk2" target="_blank">this form&lt;/a> to send us your suggestions. The PIDapalooza Festival Committee will review all forms submitted by September 21, 2018 and decide on the lineup by mid-October.&lt;/p>
&lt;p>As a reminder, the regular themes are:&lt;/p>
&lt;ul>
&lt;li>PID myths: Are PIDs better in our minds than in reality? PID stands for Persistent IDentifier, but what does that mean and does such a thing exist?&lt;br>&lt;/li>
&lt;li>PIDs forever - achieving persistence: So many factors affect persistence: mission, oversight, funding, succession, redundancy, governance. Is open infrastructure for scholarly communication the key to achieving persistence?&lt;br>&lt;/li>
&lt;li>PIDs for emerging uses: Long-term identifiers are no longer just for digital objects. We have use cases for people, organisations, vocabulary terms, and more. What additional use cases are you working on?&lt;br>&lt;/li>
&lt;li>Legacy PIDs: There are of thousands of venerable old identifier systems that people want to continue using and bring into the modern data citation ecosystem. How can we manage this effectively?&lt;br>&lt;/li>
&lt;li>Bridging worlds: What would make heterogeneous PID systems &amp;lsquo;interoperate&amp;rsquo; optimally? Would standardized metadata and APIs across PID types solve many of the problems, and if so, how would that be achieved? What about standardized link/relation types?&lt;br>&lt;/li>
&lt;li>PIDagogy: It’s a challenge for those who provide PID services and tools to engage the wider community. How do you teach, learn, persuade, discuss, and improve adoption? What&amp;rsquo;s it mean to build a pedagogy for PIDs?&lt;br>&lt;/li>
&lt;li>PID stories: Which strategies worked? Which strategies failed? Tell us your horror stories! Share your victories!&lt;br>&lt;/li>
&lt;li>Kinds of persistence: What are the frontiers of &amp;lsquo;persistence&amp;rsquo;? We hear lots about fraud prevention with identifiers for scientific reproducibility, but what about data papers promoting PIDs for long-term access to reliably improving objects (software, pre-prints, datasets) or live data feeds?&lt;/li>
&lt;/ul>
&lt;p>We’ll be posting more information on the &lt;a href="https://pidapalooza.org" target="_blank">PIDapalooza website&lt;/a> over the coming months, as well as keeping you updated on Twitter (@pidaplooza).&lt;/p>
&lt;p>In the meantime, what are you waiting for!? &lt;a href="https://www.eventbrite.com/e/pidapalooza-2019-registration-49295286529" target="_blank">Book your place now&lt;/a> &amp;ndash; and we also strongly recommend that you book your accommodation early as there are other big conferences in Dublin that week.&lt;/p>
&lt;p>PIDapalooza, Dublin, Ireland, January 23-24, 2019 - it’s a date!&lt;/p>
&lt;hr></description></item><item><title>Leaving the house - where preprints go</title><link>https://www.crossref.org/blog/leaving-the-house-where-preprints-go/</link><pubDate>Tue, 21 Aug 2018 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/leaving-the-house-where-preprints-go/</guid><description>&lt;p>“Pre-prints” are sometimes neither Pre nor Print (c.f. &lt;a href="https://doi.org/10.12688/f1000research.11408.1" target="_blank">https://doi.org/10.12688/f1000research.11408.1&lt;/a>, but they do go on and get published in journals. While researchers may have different motivations for posting a preprint, such as establishing a record of priority or seeking rapid feedback, the primary motivation appears to be timely sharing of results prior to journal publication.&lt;/p>
&lt;h3 id="so-where-in-fact-do-preprints-get-published">So where in fact do preprints get published?&lt;/h3>
&lt;p>Although this is a simple question, we have not had an easy way to answer how this varies across disciplines, preprint repositories and journals. Until now. Crossref metadata provides not only an open and easy way to do so, but up-to-date data to get the latest results.&lt;/p>
&lt;!--more-->
&lt;h3 id="ropensci-makin-it-sweet--easy">rOpenSci makin&amp;rsquo; it sweet &amp;amp; easy&lt;/h3>
&lt;p>Crossref asks preprint repositories to update their metadata once a preprint has been published by adding the article link into its record via the “is-preprint-of” relation. As the record is processed, we make the link available going both directions, while preserving the provenance of the statement in the metadata output (&amp;ldquo;asserted-by&amp;rdquo;: &amp;ldquo;subject&amp;rdquo; or &amp;ldquo;asserted-by&amp;rdquo;: &amp;ldquo;object&amp;rdquo;). This results in bidirectional assertions in the Crossref REST API where search engines, analytics providers, indexes, etc. can get from the preprint to the article (“is-preprint-of”) as well as vice versa (“has-preprint”), making it easier to find, cite, link, assess, and reuse.&lt;/p>
&lt;p>Using &lt;a href="https://ropensci.org/" target="_blank">rOpenSci’s&lt;/a> R library for the Crossref REST API (rcrossref), we pulled all articles connected to a previous preprint (&lt;a href="https://api.crossref.org/works?filter=relation.type:has-preprint&amp;amp;facet=publisher-name" target="_blank">https://api.crossref.org/works?filter=relation.type:has-preprint&amp;facet=publisher-name&lt;/a>:&lt;em>&amp;amp;rows=0) and then aggregated them based on journal via their ISSNs (&lt;a href="https://api.crossref.org/works?filter=relation.type:has-preprint&amp;amp;facet=issn" target="_blank">https://api.crossref.org/works?filter=relation.type:has-preprint&amp;facet=issn&lt;/a>:&lt;/em>), tallying the results in a tidy table with the journal name (ex: PLOS Biology (&lt;a href="https://api.crossref.org/journals/2167-8359%29%29" target="_blank">https://api.crossref.org/journals/2167-8359))&lt;/a>.&lt;/p>
&lt;h3 id="the-big-reveal">The big reveal&lt;/h3>
&lt;p>So without further delay, let’s look at the results of the 20 journals with the highest number of preprints associated with its articles (data from August 21, 2018):&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: left">Publisher&lt;/th>
&lt;th style="text-align: left">Journal&lt;/th>
&lt;th style="text-align: left">Count&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: left">PeerJ&lt;/td>
&lt;td style="text-align: left">PeerJ&lt;/td>
&lt;td style="text-align: left">1184&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Springer Nature&lt;/td>
&lt;td style="text-align: left">Scientific Reports&lt;/td>
&lt;td style="text-align: left">394&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">eLife&lt;/td>
&lt;td style="text-align: left">eLife&lt;/td>
&lt;td style="text-align: left">375&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">PLOS&lt;/td>
&lt;td style="text-align: left">PLOS ONE&lt;/td>
&lt;td style="text-align: left">338&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Proceedings of the National Academy of Sciences&lt;/td>
&lt;td style="text-align: left">PNAS&lt;/td>
&lt;td style="text-align: left">205&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">PLOS&lt;/td>
&lt;td style="text-align: left">PLOS Computational Biology&lt;/td>
&lt;td style="text-align: left">196&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Springer Nature&lt;/td>
&lt;td style="text-align: left">Nature Communications&lt;/td>
&lt;td style="text-align: left">187&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">PLOS&lt;/td>
&lt;td style="text-align: left">PLOS Genetics&lt;/td>
&lt;td style="text-align: left">169&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">The Genetics Society of America&lt;/td>
&lt;td style="text-align: left">Genetics&lt;/td>
&lt;td style="text-align: left">168&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Oxford University Press&lt;/td>
&lt;td style="text-align: left">Nucleic Acids Research&lt;/td>
&lt;td style="text-align: left">148&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Oxford University Press&lt;/td>
&lt;td style="text-align: left">Bioinformatics&lt;/td>
&lt;td style="text-align: left">138&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">The Genetics Society of America&lt;/td>
&lt;td style="text-align: left">Genetics&lt;/td>
&lt;td style="text-align: left">120&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">The Genetics Society of America&lt;/td>
&lt;td style="text-align: left">G3: Genes, Genomes, Genetics&lt;/td>
&lt;td style="text-align: left">104&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Cold Spring Harbor Laboratory&lt;/td>
&lt;td style="text-align: left">Genome Research&lt;/td>
&lt;td style="text-align: left">104&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Oxford University Press&lt;/td>
&lt;td style="text-align: left">Molecular Biology and Evolution&lt;/td>
&lt;td style="text-align: left">100&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">MDPI AG&lt;/td>
&lt;td style="text-align: left">Energies&lt;/td>
&lt;td style="text-align: left">98&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">MDPI AG&lt;/td>
&lt;td style="text-align: left">Sensors&lt;/td>
&lt;td style="text-align: left">96&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Springer Nature&lt;/td>
&lt;td style="text-align: left">BMC Genomics&lt;/td>
&lt;td style="text-align: left">92&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">MDPI AG&lt;/td>
&lt;td style="text-align: left">International Journal of Molecular Sciences&lt;/td>
&lt;td style="text-align: left">86&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">JMIR Publications&lt;/td>
&lt;td style="text-align: left">Journal of Medical Internet Research&lt;/td>
&lt;td style="text-align: left">83&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;br>&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">This list has not been normalized or weighted based on the size of the journal. The following observations are informed speculations, as we can only infer so much from the raw data:&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;td style="text-align: left">&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;ul>
&lt;li>&lt;b>Disciplinary practice:&lt;/b> This phenomenon where preprints are a part of disciplinary practice accounts for about half of the journals represented on the list. Certain communities such as genetics and computational fields have been early adopters of preprints. As such, we see higher rates of preprint-to-article publication in journals that publish their work.&lt;/li>
&lt;li>&lt;b>Partnerships:&lt;/b> Partnerships that facilitate submission from the preprint repository directly to a publisher or peer review service (ex: BioRxiv B2J program) make it easier for researchers to move from preprint-sharing seamlessly to submitting their journal article manuscript.&lt;/li>
&lt;li>&lt;b>Tie-ins:&lt;/b> A quarter of the journals on the list are run by publishers with a preprint service, and have been able to tie together both arms of publishing. This removes barriers to journal article submission in the same manner as integrations between repositories and publishers, but does so as a single party.&lt;/li>
&lt;li>&lt;b>Publisher support and treatment:&lt;/b> We also see that strong proponents and early partners of preprint repositories tend to have higher counts. Some publishers have been more outspoken in their welcome of preprints, such as &lt;a href="https://web.archive.org/web/20180829235413/http://www.pnas.org/content/114/48/12630" target="_blank">PNAS&lt;/a>. Sometimes this support also comes in the form of special treatment. In the process of crafting editorial policy on publishing results previously posted in a preprint, some journals have carved out particular affordances in their publication workflow and content delivery streams that may contribute to the higher counts of articles. For example, Nature Research displays the preprints of submitted articles under consideration: &lt;a href="https://nature-research-under-consideration.nature.com/" target="_blank">https://nature-research-under-consideration.nature.com/&lt;/a>.&lt;/li>
&lt;li>&lt;b>Mega-journals:&lt;/b> Mega-journals such as Scientific Reports and PLOS ONE have not discouraged preprints. As such, and due to the size of their publication output, they have easily found a place among the higher counts on the list.&lt;/li>
&lt;/ul>
&lt;h3 id="taking-a-closer-look">Taking a closer look&lt;/h3>
&lt;p>One major consideration in these results, concerns what’s missing in the data. These fall into two camps: incomplete member data, and incomplete membership coverage.&lt;/p>
&lt;p>We have been working with our members to deposit preprints using the proper record type, and to provide links to published articles in their metadata. However, not all have yet done so (ex: SSRN), leading to holes in our &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">research nexus graph&lt;/a>, which subsequently detracts from the completeness of the data.&lt;/p>
&lt;p>We celebrate the preprint repositories who are required to update their metadata when an article is published from a preprint, thereby populating the map with critical bridges between preprints and articles. Crossref participation benefits not only the content owner, but the membership at large and all the systems across the research ecosystem powered by Crossref metadata.&lt;/p>
&lt;p>Lastly, this data is dependent on the coverage of preprint repositories who register content with us. We are thrilled that &lt;a href="https://cos.io/" target="_blank">Center for Open Science&lt;/a>, our &lt;a href="https://cos.io/blog/we-are-now-registering-preprint-dois-crossref/" target="_blank">newest preprints addition&lt;/a> who represents 21 community repositories, has recently filled in swaths of the map. But there remain dead zones in the research graph from repositories who are not Crossref members (ex: ArXiv). Their disciplines, as a result, are under represented in these results.&lt;/p>
&lt;h3 id="everyone-dive-in">Everyone dive in!&lt;/h3>
&lt;p>As to the question of “where do preprints get published?”, anyone in fact can answer this question based on the metadata Crossref collects and provides to the community as an open infrastructure provider. We encourage the community to explore and analyze the data further with other available datasets to glean more insights on how scholarly communications is changing with the increasing growth of preprints. For example, the effective results across all journals represented can be weighted based on the number of articles published by each journal.&lt;/p>
&lt;p>Crossref data is open for all to examine and reuse through our &lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">REST API&lt;/a>. Please dive in and share your findings with us!&lt;/p></description></item><item><title>2018 election slate</title><link>https://www.crossref.org/blog/2018-election-slate/</link><pubDate>Fri, 17 Aug 2018 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/2018-election-slate/</guid><description>&lt;p>With Crossref developing and extending its services for members and other constituents at a rapid pace, it’s an exciting time to be on our board. We recieved 26 expressions of interest this year, so it seems our members are also excited about what they could help us achieve.&lt;/p>
&lt;p>From these 26, the &lt;a href="https://www.crossref.org/committees/nominating">Nominating Committee&lt;/a> has put forward the following slate.&lt;/p>
&lt;h2 id="the-2018-slate-seven-candidates-for-five-available-seats">The 2018 slate: seven candidates for five available seats&lt;/h2>
&lt;ul>
&lt;li>&lt;strong>African Journals OnLine (AJOL),&lt;/strong> Susan Murray, South Africa&lt;/li>
&lt;li>&lt;strong>American Psychological Association (APA),&lt;/strong> Jasper Simons, USA&lt;/li>
&lt;li>&lt;strong>Association for Computing Machinery (ACM),&lt;/strong> Scott Delman, USA&lt;/li>
&lt;li>&lt;strong>California Digital Library (CDL),&lt;/strong> Catherine Mitchell, USA&lt;/li>
&lt;li>&lt;strong>Hindawi,&lt;/strong> Paul Peters, UK&lt;/li>
&lt;li>&lt;strong>Sage,&lt;/strong> Richard Fidczuk, USA&lt;/li>
&lt;li>&lt;strong>Wiley,&lt;/strong> Duncan Campbell, USA&lt;/li>
&lt;/ul>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>&lt;h3 id="read-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2018-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2018-slate">Read the candidates’ organisational and personal statements&lt;/a>&lt;/h3>
&lt;/span>
&lt;/div>
&lt;p>Candidates were chosen based on the following criteria:&lt;/p>
&lt;ul>
&lt;li>Follow the guidance from the Board to provide a slate or seven or fewer.&lt;/li>
&lt;li>Maintain the current balance of the board with respect to size of organisations.&lt;/li>
&lt;li>Improve balance in other areas, with respect to gender and geography.&lt;/li>
&lt;li>Also consider types of organisations and sector, as well as engagement with Crossref and its services.&lt;/li>
&lt;/ul>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process, by voting in the election&lt;/h2>
&lt;p>If your organisation is a member of Crossref on September 14, 2018 you are eligible to vote when voting opens on September 28, 2018 (affiliates, however, are not eligible to vote).&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>On September 28, 2018, your organisation’s designated voting contact will receive an email with a link to the formal Notice of Meeting and Proxy Form with concise instructions on how to vote. An additional email will be sent with a username and password along with a link to our online voting platform. It is important to make sure your voting contact is up-to-date.&lt;/p>
&lt;h2 id="want-to-add-your-voice">Want to add your voice?&lt;/h2>
&lt;p>We are accepting independent nominations until November 7, 2018. organisations interested in standing as an independent candidate should contact me by this date with a list of ten other Crossref members that endorse their candidacy.&lt;/p>
&lt;p>The election itself will be held at &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2018">LIVE18 Toronto&lt;/a>, our annual meeting, on 13 November 2018 in Canada. We hope you’ll be there to hear the results.&lt;/p></description></item><item><title>Using the Crossref REST API. Part 10 (with Kudos)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-10-with-kudos/</link><pubDate>Mon, 13 Aug 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-10-with-kudos/</guid><description>&lt;p>Continuing our blog series highlighting &lt;a href="https://www.crossref.org/categories/api-case-study">the uses of Crossref metadata&lt;/a>, we talked to David Sommer, co-founder and Product Director at the research dissemination management service, &lt;a href="http://www.growkudos.com/" target="_blank">Kudos&lt;/a>. David tells us how Kudos is collaborating with Crossref, and how they use the REST API as part of our &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus&lt;/a> service.&lt;/p>
&lt;h3 id="introducing-kudos">Introducing Kudos&lt;/h3>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/kudos-logo.png" alt=“Kudos logo" height="150px" width="250px" class="img-responsive" />
&lt;/div>
&lt;p>At Kudos we know that effective dissemination is the starting point for impact. Kudos is a platform that allows researchers and research groups to plan, manage, measure, and report on dissemination activities to help maximize the visibility and impact of their work.&lt;/p>
&lt;p>We launched the service in 2015 and now work with almost 100 publishers and institutions around the world, and have nearly 250,000 researchers using the platform.&lt;/p>
&lt;p>We provide guidance to researchers on writing a plain language summary about their work so it can be found and understood by a broad range of audiences, and then we support researchers in disseminating across multiple channels and measuring which dissemination activities are most effective for them.&lt;/p>
&lt;p>As part of this, we developed the &lt;a href="https://blog.growkudos.com/2017/11/15/kudos-solution-illegal-sharing-copyright-content/" target="_blank">Sharable-PDF&lt;/a> to allow researchers to legitimately share publication profiles across a range of sites and networks, and track the impact of their work centrally. This also allows publishers to prevent copyright infringement, and reclaim lost usage from sharing of research articles on scholarly collaboration networks.&lt;/p>
&lt;center>&lt;figure>&lt;a href="https://www.growkudos.com/publications/10.12688%25252Ff1000research.8013.1/reader">&lt;img src="https://www.crossref.org/images/blog/kudos-page.png"
alt="Kudos publication page" width="75%">&lt;/a>&lt;figcaption>
&lt;h4>An example of a Kudos publication page showing the plain language summary&lt;/h4>
&lt;/figcaption>
&lt;/figure>
&lt;/center>
&lt;h3 id="how-is-crossref-metadata-used-in-kudos">How is Crossref metadata used in Kudos?&lt;/h3>
&lt;p>Since our launch, Crossref has been our metadata foundation. When we receive notification from our publishing partners that an article, book or book chapter has been published, we query using the Crossref REST API to retrieve the metadata for that publication. That data allows us to populate the Kudos publication page.&lt;/p>
&lt;p>We also integrate earlier in the researcher workflow, interfacing with all of the major &lt;a href="https://blog.growkudos.com/2018/03/28/extended-integrations-with-manuscript-submission-systems/" target="_blank">Manuscript Submission Systems&lt;/a> to support authors who want to build impact from the point of submission.&lt;/p>
&lt;p>More recently, we started using the Crossref REST API to retrieve citation counts for a DOI. This enables us to include the number of times content is cited as part of the ‘basket of metrics’ we provide to our researchers. They can then understand the performance of their publications in context, and see the correlation between actions and results.&lt;/p>
&lt;p align="center">&lt;img src="https://www.crossref.org/images/blog/kudos-metrics.png" alt="Kudos metrics page" width="75%" />
&lt;/p>
&lt;p align="center">A Kudos metrics page, showing the basket of metrics and the correlation between actions and results&lt;/p>
&lt;h3 id="what-are-the-future-plans-for-kudos">What are the future plans for Kudos?&lt;/h3>
&lt;p>We have exciting plans for the future! We are developing Kudos for Research Groups to support the planning, managing, measuring and reporting of dissemination activities for research groups, labs and departments. We are adding a range of new features and dissemination channels to support this, and to help researchers to better understand how their research is being used, and by whom.&lt;/p>
&lt;h3 id="what-else-would-kudos-like-to-see-in-crossref-metadata">What else would Kudos like to see in Crossref metadata?&lt;/h3>
&lt;p>We have always found Crossref to be very responsive and open to new ideas, so we look forward to continuing to work together. We are keen to see an industry standard article-level subject classification system developed, and it would seem that Crossref is the natural home for this.&lt;/p>
&lt;p>We are also continuing to monitor &lt;a href="https://www.crossref.org/services/event-data/">Crossref Event Data&lt;/a> which has the potential to provide a rich source of events that could be used to help demonstrate dissemination and impact.&lt;/p>
&lt;p>Finally, we are pleased to see the work Crossref are doing to help improve the quality of the metadata and supporting publishers in auditing their data. If we could have anything we wanted, our dream would be to prevent “funny characters” in DOIs that cause us all kinds of escape character headaches!&lt;/p>
&lt;hr>
&lt;p>Thank you David. If you would like to contribute a case study on the uses of Crossref Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>Peer review publications</title><link>https://www.crossref.org/blog/peer-review-publications/</link><pubDate>Sun, 12 Aug 2018 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/peer-review-publications/</guid><description>&lt;h2 id="peer-review-publications---not-peer-reviewed-publications-but-peer-reviews-as-publications">Peer review publications&amp;mdash;not peer-reviewed publications, but peer reviews as publications&lt;/h2>
&lt;p>Our newest dedicated record type&amp;mdash;peer review&amp;mdash;has received a warm welcome from our members since rollout last November. We are pleased to formally integrate them into the scholarly record, giving the scholars who participated credit for their work, ensuring readers and systems dependably get from the reviews to the article (and vice versa), and making sure that links to these works persist over time.&lt;/p>
&lt;p>Many of our members make the peer review history available to researchers in different ways. Their extra effort to post review materials alongside the article will now go further once they are registered with us and linked to the journal article. They spoke of publishing peer reviews as a standard part of their publishing operation. The scholarly contributions of their editors and referees are validated, stewarded, and published in the manner of the articles: as per general practice. To fully realize this, they are ensuring that these publications are discoverable, citable, and part of the formal scholarly record—for all the thousands of systems which draw on Crossref metadata.&lt;/p>
&lt;blockquote>
&lt;p>Article metadata + peer review metadata = a fuller picture of the evolution of knowledge&lt;/p>
&lt;/blockquote>
&lt;h2 id="the-growing-collection">The growing collection&lt;/h2>
&lt;p>As of August 12, 2018 three publishers have registered &lt;a href="https://api.crossref.org/types/peer-review/works?facet=publisher-name:*&amp;amp;rows=0" target="_blank">12446 peer reviews&lt;/a> in the dedicated resource type schema we rolled out last November. PeerJ (10.7287) with &lt;a href="https://api.crossref.org/prefixes/10.7287/works?filter=type:peer-review" target="_blank">12015&lt;/a> at time of writing and Stichting SciPost (10.21468) with &lt;a href="https://api.crossref.org/prefixes/10.21468/works?filter=type:peer-review" target="_blank">297 works&lt;/a>. ScienceOpen (10.14293) has registered &lt;a href="https://api.crossref.org/prefixes/10.14293/works?filter=type:peer-review" target="_blank">126 reviews&lt;/a> of papers on their post-publication platform.&lt;/p>
&lt;p>The peer review metadata collected is partly similar, though otherwise unique to other content. In the former, general metadata that we accept for the articles, as well as the reviews, include an ORCID iD to identify the reviewer, editor, and/or author &lt;a href="https://api.crossref.org/types/peer-review/works?filter=has-orcid:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">0&lt;/a>; license &lt;a href="https://api.crossref.org/types/peer-review/works?filter=has-license:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">0&lt;/a>.
This metadata is quite distinct from the article and is important to collect, not only as a discrete publication in its own right, but also to provide richer context for the actual results shared in the associated article. They are authored by different people than the paper’s contributors (author response/rebuttal excepting). They need not have the same license.&lt;/p>
&lt;p>Currently, none of this data has been registered. (From the publishers we’ve talked to, this is largely due to factors related to limitations in their technology systems.) And like other record types, we link up scholarly materials in the metadata and fill in the &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">research nexus graph&lt;/a> through relations.&lt;/p>
&lt;p>There’s no better way to understand peer review metadata than to look at real examples from our members:&lt;/p>
&lt;ul>
&lt;li>PeerJ review (&lt;a href="https://doi.org/10.7287/peerj.2707v0.1/reviews/1" target="_blank">https://doi.org/10.7287/peerj.2707v0.1/reviews/1&lt;/a>) and its metadata (&lt;a href="https://api.crossref.org/works/10.7287/peerj.2707v0.1/reviews/1" target="_blank">https://api.crossref.org/works/10.7287/peerj.2707v0.1/reviews/1&lt;/a>)&lt;/li>
&lt;li>ScienceOpen review (&lt;a href="https://doi.org/10.14293/s2199-1006.1.sor-uncat.a5995373.v1.rhrmgu" target="_blank">https://doi.org/10.14293/s2199-1006.1.sor-uncat.a5995373.v1.rhrmgu&lt;/a>) and its metadata (&lt;a href="https://api.crossref.org/works/10.14293/s2199-1006.1.sor-uncat.a5995373.v1.rhrmgu" target="_blank">https://api.crossref.org/works/10.14293/s2199-1006.1.sor-uncat.a5995373.v1.rhrmgu&lt;/a>)&lt;/li>
&lt;li>SciPost review (&lt;a href="https://doi.org/10.21468/scipost.report.10" target="_blank">https://doi.org/10.21468/scipost.report.10&lt;/a>) and its metadata (&lt;a href="https://api.crossref.org/works/10.21468/scipost.report.10" target="_blank">https://api.crossref.org/works/10.21468/scipost.report.10&lt;/a>)&lt;/li>
&lt;/ul>
&lt;p>Review-specific metadata is also critical to capturing the shape of the scholarly discussion. These include:&lt;/p>
&lt;ul>
&lt;li>Review date (required)&lt;/li>
&lt;li>Scholarly work reviewed (required)&lt;/li>
&lt;li>Recommendation&lt;/li>
&lt;li>Revision stage&lt;/li>
&lt;li>Review round&lt;/li>
&lt;li>Contributor name&lt;/li>
&lt;/ul>
&lt;p>PeerJ, SciPost, and ScienceOpen have registered this whole set where applicable (review round not applicable to post-publication reviews), with the exception of the recommendation.&lt;/p>
&lt;h3 id="scholarly-contributions-captured-in-time">Scholarly contributions captured in time&lt;/h3>
&lt;p>Published peer reviews uniquely highlight the nature of research ideas evolving over time, spotlighting the nature of this as a collective effort involving multiple individuals. The more metadata, the bolder the story. We have created a set of reference metadata (fictitious) to illustrate this phenomenon. Josiah Carberry submits a manuscript to the Journal of Psychoceramics, entitled “Dog: A Methodology for the Development of Simulated Annealing.” It undergoes two rounds of review with two referees each round. The article &lt;a href="https://doi.org/10.5555/12345681" target="_blank">https://doi.org/10.5555/12345681&lt;/a> is published and registered on May 6, 2012 along with the history of peer review materials on the same day:&lt;/p>
&lt;p>First submission&lt;/p>
&lt;ul>
&lt;li>Referee report 1 - &lt;a href="https://doi.org/10.5555/12345681.9879" target="_blank">https://doi.org/10.5555/12345681.9879&lt;/a>&lt;/li>
&lt;li>Referee report 2 - &lt;a href="https://doi.org/10.5555/12345681.9880" target="_blank">https://doi.org/10.5555/12345681.9880&lt;/a>&lt;/li>
&lt;li>Editor decision - &lt;a href="https://doi.org/10.5555/12345681.9881" target="_blank">https://doi.org/10.5555/12345681.9881&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Revision round 1&lt;/p>
&lt;ul>
&lt;li>Author rebuttal - &lt;a href="https://doi.org/10.5555/12345681.9882" target="_blank">https://doi.org/10.5555/12345681.9882&lt;/a>&lt;/li>
&lt;li>Referee report 1 - &lt;a href="https://doi.org/10.5555/12345681.9883" target="_blank">https://doi.org/10.5555/12345681.9883&lt;/a>&lt;/li>
&lt;li>Referee report 2 - &lt;a href="https://doi.org/10.5555/12345681.9884" target="_blank">https://doi.org/10.5555/12345681.9884&lt;/a>&lt;/li>
&lt;li>Editor decision - &lt;a href="https://doi.org/10.5555/12345681.9885" target="_blank">https://doi.org/10.5555/12345681.9885&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Published reviews can show peer feedback in progress; the progress of scholarly discussion unfolding, as expert ideas build upon each other. Many of us have traditionally located the article’s publication as the climactic event, but the story in fact doesn’t end there. Pre-publication becomes post-publication. Throughout this time, research is validated and sprouts into new ideas.&lt;/p>
&lt;p>Peer review platform &lt;a href="https://publons.com/home/" target="_blank">Publons&lt;/a> is working on getting reviews authored on its platform registered with us. Doing so will mean that PeerJ article, “Transformative optimisation of agricultural land use to meet future food demands” by Lian Pin Koh, Thomas Koellner, and Jaboury Ghazoul &lt;a href="https://doi.org/10.7717/peerj.188" target="_blank">https://doi.org/10.7717/peerj.188&lt;/a> with three scholarly discussions published over the course of peer review, would also be accompanied by a fourth that occurred after publication from Gene A. Bunin &lt;a href="https://publons.com/publon/3374/" target="_blank">https://publons.com/publon/3374/&lt;/a>, not yet registered.&lt;/p>
&lt;h3 id="research-begets-research">Research begets research&lt;/h3>
&lt;p>In my investigation of review publications registered, two examples cropped up, highlighting the richness of the research process not only as it shows a set of research results evolve through scholarly discussion, but as it is then folded into new research outputs.&lt;/p>
&lt;ol>
&lt;li>A PeerJ article “Software citation principles” &lt;a href="https://doi.org/10.7717/peerj-cs.86" target="_blank">https://doi.org/10.7717/peerj-cs.86&lt;/a> has had a very rich life: &lt;a href="https://api.crossref.org/works/10.7717/peerj-cs.86" target="_blank">https://api.crossref.org/works/10.7717/peerj-cs.86&lt;/a>. It was originally submitted as a preprint and underwent multiple iterations of improvement (&lt;a href="https://doi.org/10.7287/peerj.preprints.2169" target="_blank">https://doi.org/10.7287/peerj.preprints.2169&lt;/a>, &lt;a href="https://doi.org/10.7287/peerj.preprints.2169v1" target="_blank">https://doi.org/10.7287/peerj.preprints.2169v1&lt;/a>, &lt;a href="https://doi.org/10.7287/peerj.preprints.2169v2" target="_blank">https://doi.org/10.7287/peerj.preprints.2169v2&lt;/a>, etc.). It then was subjected to peer review. And three referee reports are published alongside the final publication:&lt;/li>
&lt;/ol>
&lt;ul>
&lt;li>&lt;a href="https://doi.org/10.7287/peerj-cs.86v0.1/reviews/1" target="_blank">https://doi.org/10.7287/peerj-cs.86v0.1/reviews/1&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.7287/peerj-cs.86v0.1/reviews/2" target="_blank">https://doi.org/10.7287/peerj-cs.86v0.1/reviews/2&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://doi.org/10.7287/peerj-cs.86v0.2/reviews/1" target="_blank">https://doi.org/10.7287/peerj-cs.86v0.2/reviews/1&lt;/a>.&lt;/li>
&lt;/ul>
&lt;p>We glimpse a view of time unfolding here:&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/Peer-reviews-registered-PeerJ-graph1.png" alt="peer review PeerJ graph" height="325px"/>&lt;/p>
NB: in the review metadata, all the dates provided reference September 19, 2016 when they were published with the accompanying research article. To really make the metadata useful, we recommend providing the date the review was received, rather than published (for publishers who are publishing pre-publication review materials).
&lt;p>The reviews were then cited in three versions of the F1000Research article, “A multi-disciplinary perspective on emergent and future innovations in peer review” (&lt;a href="https://doi.org/10.12688/f1000research.12037.1" target="_blank">https://doi.org/10.12688/f1000research.12037.1&lt;/a>, &lt;a href="https://doi.org/10.12688/f1000research.12037.2" target="_blank">https://doi.org/10.12688/f1000research.12037.2&lt;/a>, and &lt;a href="https://doi.org/10.12688/f1000research.12037.3%29" target="_blank">https://doi.org/10.12688/f1000research.12037.3)&lt;/a>. These three all link up on the Crossref metadata map. The visualization below is only an entrypoint into this picture of research dissemination and the spread of ideas.&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/Peer-reviews-registered-PeerJ-graph2.png" alt="peer review PeerJ graph2" height="325px"/>&lt;/p>
&lt;ol start="2">
&lt;li>András Láng served as a reviewer for a paper by Danilo Garcia and Fernando R. González Moraga published as “The Dark Cube: dark character profiles and OCEAN” (&lt;a href="https://doi.org/10.7717/peerj.3845%29" target="_blank">https://doi.org/10.7717/peerj.3845)&lt;/a>. As of the blog release date, this paper has been cited by two sources:&lt;/li>
&lt;/ol>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/Peer-reviews-registered-citations.png" alt="PeerJ citations list" height="250px"/>&lt;/p>
&lt;p align="left">Source: https://doi.org/10.7717/peerj.3845, CC-BY 4.0&lt;/p>
&lt;p>What this view of the paper does not reveal is that Láng’s review (&lt;a href="https://doi.org/10.7287/peerj.3845v0.1/reviews/2" target="_blank">https://doi.org/10.7287/peerj.3845v0.1/reviews/2&lt;/a>) provided such insight to the original researchers that the first author (Garcia) incorporates the discussion in his subquent work. This evidence is documented in the citation list of that new publication, “Encyclopedia of Personality and Individual Differences” &lt;a href="https://doi.org/10.1007/978-3-319-28099-8_2302-1" target="_blank">https://doi.org/10.1007/978-3-319-28099-8_2302-1&lt;/a>. What a wonderful illustration of the ways in which peer reviews can operate like other publications, and how far is it from being unique. But up to now, we have not yet programmatically captured them in a formal way as we do now with these materials registered properly as a review.&lt;/p>
&lt;h3 id="the-evolution-of-crossrefs-piece">The evolution of Crossref’s piece&lt;/h3>
&lt;p>In the same spirit of ever evolving knowledge, we also continue to update our schemas based upon community feedback. Are references important? Tell us! What new metadata on peer reviews are important to answer your questions or help you do what you need? Members, if you are interested in registering your peer review content with us, please &lt;a href="mailto:support@crossref.org">get in touch&lt;/a>.&lt;/p></description></item><item><title>Org ID: a recap and a hint of things to come</title><link>https://www.crossref.org/blog/org-id-a-recap-and-a-hint-of-things-to-come/</link><pubDate>Thu, 02 Aug 2018 00:00:00 +0000</pubDate><author>John Chodacki</author><guid>https://www.crossref.org/blog/org-id-a-recap-and-a-hint-of-things-to-come/</guid><description>&lt;p>&lt;em>Cross-posted on the blogs of University of California (UC3), ORCID, and DataCite: &lt;a href="https://doi.org/10.5438/67sj-4y05" target="_blank">https://doi.org/10.5438/67sj-4y05&lt;/a>&lt;/em>.&lt;/p>
&lt;p>Over the past couple of years, a group of organisations with a shared purpose&amp;mdash;California Digital Library, Crossref, DataCite, and ORCID&amp;mdash;invested our time and energy into launching the Org ID initiative, with the goal of defining requirements for an open, community-led organisation identifier registry.  The goal of our initiative has been to offer a transparent, accessible process that builds a better system for all of our communities. As the working group chair, I wanted to provide an update on this initiative and let you know where our efforts are headed.&lt;/p>
&lt;h2 id="community-led-effort">Community-led effort&lt;/h2>
&lt;p>First, I would like to summarize all of the work that has gone into this project, a truly community-driven initiative, over the last two years:&lt;/p>
&lt;ul>
&lt;li>A series of collaborative workshops were held at the Coalition for Networked Information (CNI) meeting in San Antonio TX (2016), the FORCE11 conference in Portland OR (2016), and at PIDapalooza in Reykjavik (2016).&lt;/li>
&lt;li>Findings from these workshops were summarized in three documents, which we made openly available to the community for public comment:&lt;/li>
&lt;li>organisation Identifier Project: A Way Forward (&lt;a href="https://doi.org/10.5438/2906" target="_blank">PDF&lt;/a>)&lt;/li>
&lt;li>organisation Identifier Provider Landscape (&lt;a href="https://doi.org/10.5438/4716" target="_blank">PDF&lt;/a>)&lt;/li>
&lt;li>Technical Considerations for an organisation Identifier Registry (&lt;a href="https://doi.org/10.5438/7885" target="_blank">PDF&lt;/a>)&lt;/li>
&lt;li>A &lt;a href="https://orcid.org/content/organisation-identifier-working-group" target="_blank">Working Group&lt;/a> worked throughout 2017 and voted to approve a set of recommendations and principles for &amp;lsquo;governance&amp;rsquo; and &amp;lsquo;product&amp;rsquo;:&lt;/li>
&lt;li>&lt;a href="https://figshare.com/articles/ORG_ID_WG_Governance_Principles_and_Recommendations/5402002/1" target="_blank">Governance Recommendations&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://figshare.com/articles/ORG_ID_WG_Product_Principles_and_Recommendations/5402047/1" target="_blank">Product Principles and Recommendations&lt;/a>&lt;/li>
&lt;li>We then put out a &lt;a href="https://doi.org/10.23640/07243.5458162.v1" target="_blank">Request for Information&lt;/a> that sought expressions of interest from organisations to be involved in implementing and running an organisation identifier registry.&lt;/li>
&lt;li>There was a really good response to the RFI; reviewing the responses and thinking about next steps led to our most recent &lt;a href="https://orcid.org/content/2018-org-id-meeting" target="_blank">stakeholder meeting in Girona&lt;/a> in January 2018, where ORCID, DataCite, and Crossref were tasked with drafting a proposal that meets the Working Group&amp;rsquo;s requirements for a community-led, organisational identifier registry.&lt;/li>
&lt;/ul>
&lt;h2 id="thank-you">Thank you&lt;/h2>
&lt;p>I want to take this opportunity to thank everyone who has contributed to this effort so far.  We&amp;rsquo;ve been able to make good progress with the initiative because of the time and expertise many of you have volunteered. We have truly benefited from the support of the community, with representatives from Alfred P. Sloan Foundation; American Physical Society, California Digital Library, Cornell University, Crossref, DataCite, Digital Science, Editeur, Elsevier, Foundation for Earth Sciences, Hindawi, Jisc, ORCID, Ringgold, Springer Nature, The IP Registry, and U.S. Geological Survey involved throughout this initiative.  And we couldn&amp;rsquo;t have done any of it without the help and guidance of our consultants, Helen Szigeti and Kristen Ratan.&lt;/p>
&lt;h2 id="the-way-forward">The way forward&lt;/h2>
&lt;p>The recommendations from our initiative have been converted into a concrete plan for building a registry for research organisations.  This plan will be posted in the coming weeks.&lt;/p>
&lt;p>The initiative&amp;rsquo;s leadership group has already secured start-up resourcing and is getting ready to announce the launch plan&amp;mdash;more details coming soon.  &lt;/p>
&lt;p>We hope that all stakeholders will continue to support the next phase of our work &amp;ndash; look for announcements in the coming weeks about how to get involved.  &lt;/p>
&lt;p>As always, we welcome your feedback and involvement as this effort continues. Please contact me directly with any questions or comments at &lt;a href="mailto:john.chodacki@ucop.edu">john.chodacki@ucop.edu&lt;/a>. And thanks again for your help bringing an open organisation identifier registry to fruition!&lt;/p>
&lt;hr>
&lt;h3 id="references">References&lt;/h3>
&lt;p>Bilder, G., Brown, J., &amp;amp; Demeranville, T. (2016). Organisation identifiers: current provider survey. ORCID. &lt;a href="https://doi.org/10.5438/4716" target="_blank">https://doi.org/10.5438/4716&lt;/a>&lt;/p>
&lt;p>Cruse, P., Haak, L., &amp;amp; Pentz, E. (2016). organisation Identifier Project: A Way Forward. ORCID. &lt;a href="https://doi.org/10.5438/2906" target="_blank">https://doi.org/10.5438/2906&lt;/a>&lt;/p>
&lt;p>Fenner, M., Paglione, L., Demeranville, T., &amp;amp; Bilder, G. (2016). Technical Considerations for an organisation Identifier Registry. &lt;a href="https://doi.org/10.5438/7885" target="_blank">https://doi.org/10.5438/7885&lt;/a>&lt;/p>
&lt;p>Laurel, H., Bilder, G., Brown, C., Cruse, P., Devenport, T., Fenner, M., … Smith, A. (2017). ORG ID WG Product Principles and Recommendations. &lt;a href="https://doi.org/10.23640/07243.5402047" target="_blank">https://doi.org/10.23640/07243.5402047&lt;/a>&lt;/p>
&lt;p>Laurel, H., Pentz, E., Cruse, P., &amp;amp; Chodacki, J. (2017). organisation Identifier Project: Request for Information. &lt;a href="https://doi.org/10.23640/07243.5458162" target="_blank">https://doi.org/10.23640/07243.5458162&lt;/a>&lt;/p>
&lt;p>Pentz, E., Cruse, P., Laurel, H., &amp;amp; Warner, S. (2017). ORG ID WG Governance Principles and Recommendations. &lt;a href="https://doi.org/10.23640/07243.5402002" target="_blank">https://doi.org/10.23640/07243.5402002&lt;/a>&lt;/p></description></item><item><title>3,2,1… it’s ‘lift-off’ for Participation Reports</title><link>https://www.crossref.org/blog/321-its-lift-off-for-participation-reports/</link><pubDate>Wed, 01 Aug 2018 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/321-its-lift-off-for-participation-reports/</guid><description>&lt;p>Metadata is at the heart of all our services. With a growing range of members participating in our community—often compiling or depositing metadata on behalf of each other—the need to educate and express obligations and best practice has increased. In addition, we’ve seen more and more researchers and tools making use of our APIs to &lt;a href="https://doi.org/10.64000/gw54x-dpg59" target="_blank">harvest&lt;/a>, &lt;a href="https://doi.org/10.64000/5mndr-eyy53" target="_blank">analyze&lt;/a> and &lt;a href="https://doi.org/10.64000/cza9e-gfq89" target="_blank">re-purpose&lt;/a> the metadata our members register, so we’ve been very aware of the need to be more explicit about what this metadata enables, why, how, and for whom.&lt;/p>
&lt;p>This week we take an important step towards this goal with a much-anticipated announcement: Participation reports are in beta release—so come along and &lt;a href="https://www.crossref.org/members/prep/" target="_blank">take a look&lt;/a>!&lt;/p>
&lt;h3 id="what-does-this-mean">What does this mean?&lt;/h3>
&lt;p>Participation Reports gives—for the first time—a clear visualization of the metadata that Crossref has. &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Search for any member&lt;/a> to find out what percentage of their content includes &lt;a href="https://doi.org/10.64000/4tzvr-w1k74" target="_blank">10 key elements&lt;/a> of information, above and beyond the basic bibliographic metadata that all members are obliged to provide. This includes metadata such as ORCID iDs for contributors, funding acknowledgements, reference lists, and abstracts—richer metadata that makes content more discoverable, and much more useful to the scholarly community as a whole, including among members themselves.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/Prep.png" alt="participation reports dashboard" height="600px" />
&lt;/p>
&lt;p>You can filter by content such as journal articles, book chapters, datasets, and preprints, and compare current content (past two calendar years and year-to-date) to back file content (older than that). And within the journal articles view, you can drill down to view the metadata completeness for each individual journal. We hear that editorial boards are keen to see that aspect!&lt;/p>
&lt;p>We’re delighted that participation reports are now available in beta. That means that while we are confident that the data shown is accurate, there could be the odd glitch as we monitor use.&lt;/p>
&lt;p>Thank you to everyone who has helped us to test the reports and provided so much valuable feedback. We plan to expand and improve participation reports to include additional metadata elements, metadata quality checks, and adherence to Crossref best practice such as DOI display. We’re still listening so do &lt;a href="mailto:feedback@crossref.org">get in touch&lt;/a> if you have questions or suggestions, or would like a more detailed walk through. There is also a feedback button right in-situ in the tool.&lt;/p>
&lt;hr></description></item><item><title>Crossref LIVE and local (to you)</title><link>https://www.crossref.org/blog/crossref-live-and-local-to-you/</link><pubDate>Wed, 18 Jul 2018 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/crossref-live-and-local-to-you/</guid><description>&lt;p>The last few months have been busy for the Crossref community outreach team. We’ve been out and about from Cape Town to Ulyanovsk—and many places in between—talking at ‘LIVE locals’ to members about all things metadata.
Our &lt;a href="https://www.crossref.org/events/">LIVE locals&lt;/a> are one-day events, held around the world—but local to you—that provide both deeper insight into Crossref, and information on our services and how to benefit from them. These events are always free to attend, and whether you are a long-established member, totally new, or not even a member at all, we welcome you all to join us.&lt;/p>
&lt;p>At our most recent events we collaborated with some fantastic organisations and welcomed attendees from a variety of backgrounds including editors, publishers, service providers, researchers and other metadata users.&lt;/p>
&lt;h2 id="south-africa">South Africa&lt;/h2>
&lt;p>In April &lt;a href="https://www.crossref.org/people/chuck-koscher/">Chuck Koscher&lt;/a>, Director of Technology, and I travelled to South Africa for two LIVE locals, one in Pretoria and the other in Cape Town—and both in collaboration with the &lt;a href="https://www.assaf.org.za/" target="_blank">Academy of Science of South Africa (ASSAf)&lt;/a>. ASSAf also provided two excellent speakers, Nadine Wubbeling (ASSAf) and Pierre de Villiers (&lt;a href="https://aosis.co.za/" target="_blank">AOSIS&lt;/a>), who shared their experiences with Crossref and presented valuable insights into the work that they do.&lt;/p>
&lt;p>Delivering events for a varied audience like this means there are often differing levels of knowledge and experience. So, to make sure everyone benefited from our sessions, we covered the different ways you can work with the Crossref deposit system as an XML pro, or an absolute beginner. This included a live demonstration of our new deposit tool Metadata Manager (currently in beta) which should help those less technically-minded people (like myself), and be a big improvement upon our current web deposit form.&lt;/p>
&lt;div style="float:right;margin:10px">
|&lt;img src="https://www.crossref.org/images/blog/dr-pierre.jpg" alt="Dr. Pierre de Villiers" height="250px" width="300px"/>|&lt;img src="https://www.crossref.org/images/blog/table-mountain2.jpg" alt="Table Mountain" height="250px" width="300px"/>|
&lt;/div>
&lt;p>The day ended with a technical session, where attendees discussed specific issues they needed help with, which mainly focussed on retrieving metadata in the Crossref system, interpreting reports, and support with XML.&lt;/p>
&lt;p>&lt;em>Images left to right: Dr. Pierre de Villiers talks about the Crossref Experience at AOSIS, and the stunning scenery of Table Mountain provided a nice backdrop to our Cape Town event.&lt;/em>&lt;/p>
&lt;h2 id="russia">Russia&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;blockquote class="twitter-tweet" data-lang="en">&lt;p lang="en" dir="ltr">Just back from a few days in Russia 🇷🇺. We ran a &lt;a href="https://twitter.com/CrossrefOrg?ref_src=twsrc%5Etfw">@CrossrefOrg&lt;/a> LIVE local in Ulyanovsk for 60 editors, made plans to do more education and outreach in the region and caught a &lt;a href="https://twitter.com/hashtag/FifaWorldCup2018?src=hash&amp;amp;ref_src=twsrc%5Etfw">#FifaWorldCup2018&lt;/a> game... &lt;a href="https://t.co/GSdNEujJXa">pic.twitter.com/GSdNEujJXa&lt;/a>&lt;/p>&amp;mdash; Rachael Lammey (@rachaellammey) &lt;a href="https://twitter.com/rachaellammey/status/1010040188406587393?ref_src=twsrc%5Etfw">June 22, 2018&lt;/a>&lt;/blockquote> &lt;script async src="https://platform.twitter.com/widgets.js" charset="utf-8">&lt;/script>
&lt;/div>
&lt;p>The World Cup wasn’t the only big event in Russia last month. That’s right, we were there too—with our very first Russian LIVE local! On the 19th June, 60 attendees from a range of academic and publishing institutions joined us at &lt;a href="http://www.ulspu.ru/" target="_blank">The Ulyanovsk State Pedagogical University&lt;/a>.
&lt;a href="https://www.crossref.org/people/rachael-lammey/">Rachael Lammey&lt;/a> and I introduced Crossref, the role of identifiers, and how to register different resource types with us. We also discussed the use and importance of providing accurate and comprehensive metadata, and shared some interesting use cases.&lt;/p>
&lt;p>Guest speaker Professor Zinaida Kuznetsova talked about her experiences of working with Crossref and the benefits of being a member. This was complimented by a talk by fellow guest speaker Maxim Mitrofanov from Crossref sponsoring organisation, &lt;a href="https://neicon.ru/" target="_blank">NEICON&lt;/a>. Maxim explained how NEICON works with Crossref, and provide services for the smaller members they support. Maxim is also one of our &lt;a href="https://www.crossref.org/community/our-ambassadors/">Crossref Ambassadors&lt;/a> - and he will be running more Russian webinars on our services in the near future, so look out for those listed on our &lt;a href="https://www.crossref.org/webinars/">webinar page&lt;/a>!&lt;/p>
&lt;p>We’d like to say a big thank you to the team at Ulyanovsk State Pedagogical University for their support and help with the event. Also thanks to our fantastic interpreters who helped us immensely by relaying the information to the audience in Russian, as well as helping to translate and answer questions.&lt;/p>
&lt;h2 id="germany">Germany&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;center>&lt;blockquote class="twitter-tweet" data-lang="en">&lt;p lang="en" dir="ltr">Najko Jahn from Göttingen State and University Library talks about how he uses &lt;a href="https://twitter.com/CrossrefOrg?ref_src=twsrc%5Etfw">@CrossrefOrg&lt;/a> metadata in his work &lt;a href="https://twitter.com/hashtag/CRLIVEGermany?src=hash&amp;amp;ref_src=twsrc%5Etfw">#CRLIVEGermany&lt;/a> &lt;a href="https://t.co/Y89ZkBMoSh">pic.twitter.com/Y89ZkBMoSh&lt;/a>&lt;/p>&amp;mdash; Vanessa Fairhurst (@NessaFairhurst) &lt;a href="https://twitter.com/NessaFairhurst/status/1011902317828993024?ref_src=twsrc%5Etfw">June 27, 2018&lt;/a>&lt;/blockquote> &lt;script async src="https://platform.twitter.com/widgets.js" charset="utf-8">&lt;/script>&lt;/center>
&lt;/div>
&lt;p>One week later and we were in Hannover, Germany. Crossref’s &lt;a href="https://www.crossref.org/people/laura-j-wilkinson/">Laura Wilkinson&lt;/a>, &lt;a href="https://www.crossref.org/people/joe-wass/">Joe Wass&lt;/a> and &lt;a href="https://www.crossref.org/people/jennifer-kemp/">Jennifer Kemp&lt;/a> joined me for this event, which was held in collaboration with the German National Library of Science and Technology (&lt;a href="https://www.tib.eu/en/service/news/details/metadaten-unverzichtbarer-rohstoff-im-digitalen-zeitalter/" target="_blank">Technische Informationsbibliothek - TIB&lt;/a> at their impressive venue in on the 27th June. ￼&lt;/p>
&lt;p>The day focused on all things metadata - how it can be used and why good metadata is important. This included taking a look at our new &lt;a href="https://www.crossref.org/members/prep/" target="_blank">Participation Reports&lt;/a> tool and a fascinating talk from guest speaker Najko Jahn from &lt;a href="https://www.sub.uni-goettingen.de/sub-aktuell/" target="_blank">Göttingen State and University Library&lt;/a> on the benefits of using Crossref metadata for libraries and scientists.&lt;/p>
&lt;p>&lt;a href="https://www.datacite.org/" target="_blank">Datacite’s&lt;/a> Britta Dreyer also spoke about how DataCite and Crossref support research data sharing, before Joe Wass and I presented updates to the collaborative &lt;a href="https://www.crossref.org/categories/organisation-identifier/">Org ID project&lt;/a> and &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> service. The day concluded with us sharing more ways to participate in Crossref and other community initiatives.&lt;/p>
&lt;h2 id="questions-вопросов-fragen">Questions? Вопросов? Fragen?&lt;/h2>
&lt;p>Over the course of these events we were asked many questions—and here are some of the more interesting/common ones posed to the team: &lt;br>&lt;/p>
&lt;p>Q. Do I have to join Crossref directly, or can I join as part of a group of smaller organisations? &lt;br>
A. You don’t have to be a direct member, you can join via a Sponsor. See our &lt;a href="https://www.crossref.org/community/sponsors/">sponsors page&lt;/a> for a list of Sponsors in your area, and for more information on becoming a Sponsor.&lt;br>&lt;/p>
&lt;p>Q. Can I link translations of works together? &lt;br>
A. Yes, a journal article published in two languages can each be assigned its own DOI, and then linked in the metadata using the &lt;a href="https://support.crossref.org/hc/en-us/articles/214357426" target="_blank">relationship type&lt;/a> TranslationOf from our schema.&lt;br>&lt;/p>
&lt;p>Q. Does the web deposit form support depositing abstracts and references?&lt;br>
A. No, it doesn’t. However, our new Metadata Manager tool does and if you are in interested in trying it out in beta, &lt;a href="mailto:feedback@crossref.org">let us know&lt;/a>.&lt;br>&lt;/p>
&lt;p>Q. Can I share your new Participation Report tool with my colleagues?&lt;br>
A. Yes you can! It’s open and available for use, just come along and &lt;a href="https://www.crossref.org/members/prep/" target="_blank">search for a member&lt;/a>.&lt;br>&lt;/p>
&lt;p>Q. Can I also register book chapters, dissertations and other record types under the same prefix?&lt;br>
A. Yes you can. You can register any of the different &lt;a href="https://support.crossref.org/hc/en-us/articles/213123586-Metadata-and-content-type-overview" target="_blank">resource types we support&lt;/a> under one prefix.&lt;br>&lt;/p>
&lt;p>Q. Will you be doing more events in this region in future?&lt;br>
A. We hope so, and we are always happy to hear from those who wish to collaborate on future events, so just &lt;a href="mailto:feedback@crossref.org">contact us&lt;/a> to get involved.&lt;/p>
&lt;hr></description></item><item><title>Status, I am new</title><link>https://www.crossref.org/blog/status-i-am-new/</link><pubDate>Mon, 02 Jul 2018 00:00:00 +0000</pubDate><author>Isaac Farley</author><guid>https://www.crossref.org/blog/status-i-am-new/</guid><description>&lt;p>Hi, I’m Isaac. I’m new here. What better way to get to know me than through a blog post? Well, maybe a cocktail party, but this will have to do. In addition to giving you some details about myself in this post, I’ll be introducing our &lt;a href="http://status.crossref.org/" target="_blank">status page&lt;/a>, too.&lt;/p>
&lt;h3 id="a-little-about-me">A little about me&lt;/h3>
&lt;p>In mid-April, I began as the new Support Manager. My goal is to fill the very large shoes left by Patricia Feeney moving into the Head of Metadata role. I know Patricia knows Crossref and the rich community of members (and metadata!) inside and out. I’ll get there too. For now, I have immersed myself in tackling as many of your support questions as possible, so I may have already met some of you on a support ticket. If so, thanks for your patience; you likely have already taught me a thing or two!&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/isaac.jpg" alt="Isaac, on the lookout to provide you excellent support" height="250px" width="250px" />
&lt;/p>
&lt;p align="center">Isaac, on the lookout to provide you excellent support&lt;/p>
&lt;p>I came to this position from one of our members – the Society of Exploration Geophysicists, where I served as the Digital Publications Manager for the last five years. Like many of you, I was always impressed, intrigued, and excited by the work underway at Crossref and wanted to be a part of the team. So, here I am, very much looking forward to the challenge ahead.&lt;/p>
&lt;p>I work remotely from Tulsa, Oklahoma, where I live with my wife and two daughters. Tulsa doesn’t have as many members as D.C., London, or Jakarta, but I hope to meet some of you during outreach trips, LIVE events, online in a webinar, or in our support community.&lt;/p>
&lt;p>One of the things that attracts me to being a part of this community are our &lt;a href="https://www.crossref.org/truths/">truths&lt;/a>. As a quick reminder, the truths are:&lt;/p>
&lt;ul>
&lt;li>Come one, come all&lt;/li>
&lt;li>One member, one vote&lt;/li>
&lt;li>Smart alone, brilliant together&lt;/li>
&lt;li>Love metadata, love technology&lt;/li>
&lt;li>What you see, what you get&lt;/li>
&lt;li>Here today, here tomorrow&lt;/li>
&lt;/ul>
&lt;p>I am drawn to forward-thinking, action-oriented communities that value collaboration and openness. These truths, and the ten weeks I have been at Crossref, have confirmed that this is one of those communities. As your new support manager, I want to emphasize our commitment to transparency: Ask me anything; I’ll tell you what I know. In that spirit, I have the privilege of introducing our new status page—a key piece in furthering our own transparency and openness.&lt;/p>
&lt;p>&lt;a href="http://status.crossref.org/" target="_blank">status.crossref.org&lt;/a>&lt;/p>
&lt;p>Our new status page provides critical, real-time information about our services—it helps us tell our overall story. If you are looking for metrics on the performance of our APIs, websites, the deposit system, or new beta services, bookmark this page. The system metrics provide daily, weekly, and monthly overviews of each of our services’ response time (in milliseconds) and uptime, or percentage of time that service has been operational during your selected time span (daily, weekly, or monthly).&lt;/p>
&lt;p>From this page, we’ll announce planned maintenance and keep you regularly updated when we have an incident. And, we’ll provide regular status updates for these incidents when in progress, updated, and completed.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/support .jpg" alt="Our new status page" height="750px" width="550px" />
&lt;/p>
&lt;p align="center">Our new status page – status.crossref.org&lt;/p>
&lt;p>I encourage you to subscribe to the updates from the top-right corner of the page. While we’ll update this page with any service-related outages, subscribing for notifications will allow you to stay current on the latest. We’ll describe maintenance and incidents clearly, simply, and timely when we have them. And, if we don’t, call us on it.&lt;/p>
&lt;p>If you have questions about the performance of our services, the status page is a great starting place. If you still have questions, ask us, we’ll tell you what we know.&lt;/p>
&lt;hr></description></item><item><title>Using the Crossref REST API. Part 9 (with Dimensions)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-9-with-dimensions/</link><pubDate>Wed, 27 Jun 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-9-with-dimensions/</guid><description>&lt;p>Continuing our &lt;a href="https://www.crossref.org/categories/api-case-study/">blog series&lt;/a> highlighting the uses of Crossref metadata, we talked to the team behind new search and discovery tool &lt;a href="https://www.dimensions.ai/" target="_blank">Dimensions&lt;/a>: Daniel Hook, Digital Science CEO; Christian Herzog, ÜberResearch CEO; and Simon Porter, Director of Innovation. They talk about the work they’re doing, the collaborative approach, and how Dimensions uses the Crossref REST API as part of our &lt;a href="https://www.crossref.org/services/metadata-retrieval/metadata-plus/">Metadata Plus service&lt;/a>, to augment other data and their workflow.&lt;/p>
&lt;h3 id="introducing-dimensions">Introducing Dimensions&lt;/h3>
&lt;p>&lt;a href="https://app.dimensions.ai/discover/publication" target="_blank">Dimensions&lt;/a> is a next-generation approach to discovering, connecting with and contextualising research. Modern academics need data about the research ecosystem in which they exist as much as the administrators who develop institutional research strategies. All academics are now required to think long-range about their research projects, contextualise their research, and demonstrate the impact of their program. Additionally, they need to find funding, ensure that students go on to good positions, and hire talented colleagues whose skills fit well with ongoing projects. Dimensions gives the first fully-linked view of publications, grants, patents and clinical trials in an analytically-centred user experience.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/dimensions-1-1.jpg" alt="Dimensions sample screen" width="100%" />
&lt;h3 id="how-is-crossref-data-used-within-dimensions">How is Crossref data used within Dimensions?&lt;/h3>
&lt;p>For an article to appear in Dimensions it must have a Crossref DOI, so it would not be possible to create Dimensions’ Publication index without Crossref’s data. Dimensions is built on several principles that we’ve talked about before. Here the most relevant of those principles are:&lt;/p>
&lt;ul>
&lt;li>unique identifiers should underlie everything that we do;&lt;/li>
&lt;li>data should not be inclusive and the tool should allow the user to select what they want to see;&lt;/li>
&lt;li>data should be more available to our community;&lt;/li>
&lt;li>data should be presented with as much contextual information as possible;&lt;/li>
&lt;li>the community should have enough data available to be able to create and experiment with their own metrics and indicators.&lt;/li>
&lt;/ul>
&lt;p>In the context of these principles, Crossref makes a perfect starting place to create a tool like Dimensions. We use the Crossref data to know about our possible “universe” of articles. We then enhance the Crossref core with data from several different places: open access publications in the DOAJ, PubMed, BioArXiv, and through relationships with publishers. In all, 60 million of the 95 million articles in the Dimensions index have a full text version that we can text and data mine for additional information.&lt;/p>
&lt;p>In Dimensions’ enhancement stage we can extract address information (where not included in the original Crossref record) and map it to &lt;a href="https://grid.ac/" target="_blank">GRID&lt;/a> funding information and the list of funders in Crossref’s Funder Registry as well as to our database of grants in Dimensions.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/dimensions-2-1.jpg" alt="Extracting information with Dimensions" width="100%" />
&lt;h3 id="how-have-you-incorporated-citation-data">How have you incorporated citation data?&lt;/h3>
&lt;p>Access to citations has historically been a thorny issue for citations databases. However, &lt;a href="https://i4oc.org/" target="_blank">I4OC&lt;/a> celebrated its first anniversary in April this year and this project has been a key driver in helping us to build Dimensions with the level of citation coverage that we managed –– it is a fantastic enabling initiative and should be warmly welcomed by the sector. Crossref is not the only source we were able to use to gather citation data; some text mining was needed to get a full graph. Dimensions goes beyond inter-article citations and includes links between patents and publications, links between clinical trials and publications, and Altmetric mentions of publications.&lt;/p>
&lt;h3 id="is-dimensions-openly-available">Is Dimensions openly available?&lt;/h3>
&lt;p>Given that there is so much open data in Dimensions, it was always our intention to give a free version to the community. If you visit &lt;a href="https://app.dimensions.ai/discover/publication" target="_blank">http://app.dimensions.ai&lt;/a> then you’ll be able to play with the system and use it for your research. While only the publications index is fully open, when you see a link to a grant, patent or clinical trial in an article detail page, you’ll be able to navigate to that record so that you can see the full context of the data.&lt;/p>
&lt;p>Beyond the ability to link the publications, Dimensions also displays the CV information which the researcher made visible publicly.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/dimensions-4-1.jpg" alt="orcid record" width="80%" class="img-responsive" />
&lt;p>Most recently, we’ve integrated ORCID into Dimensions. This means that you can push data from Dimensions into ORCID if you connect your ORCID account to your Dimensions account.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/dimensions-3-1.jpg" alt="CV information" width="80%" lass="img-responsive" />
&lt;h3 id="what-are-the-future-plans-for-dimensions">What are the future plans for Dimensions?&lt;/h3>
&lt;p>Dimensions is still moving quickly and adding more functionality. Our aim is to release more data facets very soon. We plan to add a Policy Document archive and a Research Data archive. We’ve already found some fascinating insights from joining the existing data together and these two new archives should add even more interesting data.&lt;/p>
&lt;h3 id="what-else-would-dimensions-like-to-see-in-crossref-metadata">What else would Dimensions like to see in Crossref metadata?&lt;/h3>
&lt;p>Open access information is something that we work with &lt;a href="https://unpaywall.org/" target="_blank">Unpaywall&lt;/a> to source for Dimensions right now. It would be great if Crossref and Unpaywall could work together to make this data higher quality and more ubiquitous.&lt;/p>
&lt;hr>
&lt;p>Thank you Daniel, Christian and Simon.&lt;/p>
&lt;p>If you would like to contribute a case study on the uses of Crossref Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>Meet the members, Part 3 (with INASP)</title><link>https://www.crossref.org/blog/meet-the-members-part-3-with-inasp/</link><pubDate>Wed, 20 Jun 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/meet-the-members-part-3-with-inasp/</guid><description>&lt;p>Next in our &lt;a href="https://www.crossref.org/categories/meet-the-members/">Meet the members&lt;/a> blog series is INASP, who isn’t a direct member, but acts as a &lt;a href="https://www.crossref.org/community/sponsors">Sponsor&lt;/a> for hundreds of members. Sioux Cumming, Programme Specialist at &lt;a href="https://www.inasp.info/home" target="_blank">INASP&lt;/a> tells us a bit about the work they’re doing, how they use Crossref and what the future plans for INASP are.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/INASP.jpg" alt=“INASP logo" height="150px" width="250px" class="img-responsive" />
&lt;/div>
&lt;h3 id="can-you-tell-us-a-little-bit-about-inasp">Can you tell us a little bit about INASP?&lt;/h3>
&lt;p>&lt;a href="https://www.inasp.info/home" target="_blank">INASP&lt;/a> is an international development organisation working with a global network of partners in Africa, Latin America and Asia. We have a vision of research and knowledge at the heart of development, so are working to support individuals and institutions to produce, share and use research and knowledge, which can transform lives.&lt;/p>
&lt;p>Our work includes strengthening research communication, which we do via AuthorAID (supporting researchers, especially early-career researchers, in getting their research published); improving information access (supporting library consortia with access to international journals and other online resources); supporting evidence use in policy making; working with higher-education institutions to improve critical thinking skills; improving gender equity in research systems; and my area, which I’ll talk more about below, supporting academic publishing in the Global South.&lt;/p>
&lt;p>INASP’s approaches are based on the core pillars of capacity development, convening, influencing and working in partnership. INASP promotes equity by actively addressing the needs of both men and women across all our work and addressing issues of power within the research and knowledge system.&lt;/p>
&lt;h3 id="whats-your-role-within-inasp">What’s your role within INASP?&lt;/h3>
&lt;p>I’m a Programme Specialist and since I started at INASP 15 years ago I’ve been responsible for our &lt;a href="https://web.archive.org/web/20220629190912/https://www.inasp.info/theme/academic-publishing" target="_blank">academic publishing work&lt;/a>. This work supports increased visibility, accessibility and quality of peer-reviewed journals published in developing countries so that the research outputs that are produced in these countries can be found, shared and used more effectively.&lt;/p>
&lt;p>We recognize two big challenges for Southern journals in playing their part in global research systems. The first is awareness of Southern journals, many of which were until recently only available in print. Supporting editors and national organisations to put their journals online on central platforms (the Journals Online platforms) has helped increase their visibility.&lt;/p>
&lt;p>We have also provided support to the local management teams in communicating about the platforms, the journals and the research they publish, and we recently published a Handbook for Journal Editors - &lt;a href="https://www.inasp.info/sites/default/files/2018-04/INASP%20-%20Editors%20Toolkit%20-%20DIGITAL.pdf" target="_blank">www.inasp.info/editorshandbook&lt;/a>. This is intended to be a free resource for editors worldwide that can be used as a stand-alone handbook or as an accompaniment to the journal quality online course that we are currently developing.&lt;/p>
&lt;p>The second challenge is supporting publishing quality and enabling Southern journals to demonstrate their quality so they will be regarded as credible. In the early days, this, for me, was largely about providing training and mentoring for journal editors and Journals Online platform managers about standard publishing practices.&lt;/p>
&lt;p>More recently, as local handover progressed, our role shifted towards helping journals to demonstrate their credibility. Last September INASP and African Journals Online launched our Journal Publishing Practices and Standards (JPPS) &lt;a href="https://www.journalquality.info/en/" target="_blank">framework&lt;/a> for assessing the quality of Southern publishing processes. This has been really well received by the international publishing sector and by the journal editors we work with.&lt;/p>
&lt;h3 id="tell-us-a-bit-about-who-you-support-and-how-you-support-them">Tell us a bit about who you support, and how you support them&lt;/h3>
&lt;p>We support others communicating their research and finding out about the research of others.&lt;/p>
&lt;blockquote>
&lt;p>The five Journals Online platforms that were handed over to local management at the end of March collectively host 397 journals from Bangladesh, El Salvador, Honduras, Mongolia, Nepal, Nicaragua and Sri Lanka.&lt;/p>
&lt;/blockquote>
&lt;p>These platforms help the research from these countries to become even more integrated in the global research community. Some fascinating and valuable research is published in the journals on these platforms. You can see some &lt;a href="https://www.inasp.info/publications/helping-southern-research-reach-global-audience" target="_blank">examples&lt;/a> of this research in this article about a small piece of work we did with these platforms to commission and disseminate press releases of some of the research.&lt;/p>
&lt;h3 id="whats-your-participation-level-with-crossref">What’s your participation level with Crossref?&lt;/h3>
&lt;p>INASP has been a Crossref Sponsor for the Journals Online platforms since 2008 and all articles on the sites have DOIs assigned to them (approximately 50,000 articles). All the in-country training sessions for journal editors publishing via the JOL platforms have included sessions to explain how DOIs work and why they are important. We have also trained editors on how to find and include the DOIs for the references of their articles. More recently, in 2015, we provided access to the Crossref Similarity Check service to editors, which enabled them to improve the quality of their submissions by identifying instances of plagiarism before the articles were published.&lt;/p>
&lt;h3 id="what-trends-are-you-seeing-in-your-part-of-the-scholarly-communications-community">What trends are you seeing in your part of the scholarly communications community?&lt;/h3>
&lt;p>Demonstrating credibility of journals is an important part of journal publishing today. There are so many journals worldwide and it is a tough challenge for authors and readers to navigate this sector – a challenge that we often see through the discussions in our AuthorAID network. But it is important that researchers don’t simply turn to the handful of well-known publishers in the Global North that have dominated scholarly discourse to date.&lt;/p>
&lt;p>To really tackle global issues and increase equality in global research we need to work towards levelling the playing field and including all voices – and this challenge needs to be embraced across the global research and knowledge system. We have seen encouraging signs over the past couple of years of magazines, blogs, conference organizers and industry groups in the Global North approaching us to help bring in more global perspectives to scholarly discussions. However, there is plenty more to be done and we are particularly focusing on equity in our new areas of work.&lt;/p>
&lt;h3 id="how-would-you-describe-the-value-of-being-a-crossref-sponsor">How would you describe the value of being a Crossref Sponsor?&lt;/h3>
&lt;blockquote>
&lt;p>Collaboration with Crossref over the past few years has been one of a number of ways that we have been able to connect small, scholar-led titles in the Global South with the latest global standards and approaches in scholarly publishing. This is important as it all helps to level the playing field.&lt;/p>
&lt;/blockquote>
&lt;p>Including DOIs in papers is one of the criteria for being awarded a JPPS star and thus the journals are incentivized to understand and use them more.&lt;/p>
&lt;h3 id="what-are-inasps-plans-for-the-future">What are INASP’s plans for the future?&lt;/h3>
&lt;p>INASP has recently completed a major five-year programme of work with a significant focus on strengthening organisations in the countries we have been working in and handing over responsibility for managing things like the Journals Online platforms. We are now in a new phase of work, building on what has gone before but with a particular emphasis on improving equity both within and between research systems.&lt;/p>
&lt;p>Many challenges remain – the global research system still tends to be biased towards the Global North. From an academic publishing perspective this is apparent both in terms of awareness of journals and also in terms of impressions of credibility. JPPS is intended to tackle the latter challenge but it is still early days – we only announced the first badges awarded a few months ago. Over the next few years we will be building on and strengthening this work and ensuring that it is an important part of the processes for journal editors and for authors and readers.&lt;/p>
&lt;p>Thank you Sioux for your participation in our &lt;a href="https://www.crossref.org/categories/meet-the-members/">Meet the members&lt;/a> series. If your organisation would like to feature in this series, &lt;a href="mailto:feedback@crossref.org">please get in touch&lt;/a>.
&lt;br>&lt;/p></description></item><item><title>Preprints growth rate ten times higher than journal articles</title><link>https://www.crossref.org/blog/preprints-growth-rate-ten-times-higher-than-journal-articles/</link><pubDate>Thu, 31 May 2018 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/preprints-growth-rate-ten-times-higher-than-journal-articles/</guid><description>&lt;p>The Crossref graph of the research enterprise is growing at an impressive rate of 2.5 million records a month - scholarly communications of all stripes and sizes. Preprints are one of the fastest growing types of content. While preprints may not be new, the growth may well be: ~30% for the past 2 years (compared to article growth of 2-3% for the same period). We began supporting preprints in November 2016 at the behest of our members. When members register them, we ensure that: links to these publications persist over time; they are connected to the full history of the shared research results; and the citation record is clear and up-to-date.&lt;/p>
&lt;h3 id="summary">Summary&lt;/h3>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Fig1-preprints-growth-chart.png" alt="number of preprints registered" width="80%" class="img-responsive" />
&lt;/div>
&lt;p>As of May 24, 2018 we have 44,388 works (see API query &lt;a href="https://api.crossref.org/types/posted-content/works" target="_blank">https://api.crossref.org/types/posted-content/works&lt;/a> with a json viewer) registered as posted content. Today that number is over 150k. Preprints are part of this record type category, which is meant to house scholarly outputs that have been posted online and intended for publication in the future.&lt;/p>
&lt;p>For a more granular view, see the monthly stats captured by Jordan Anaya in &lt;a href="http://www.prepubmed.org/monthly_stats/" target="_blank">PrePubMed&lt;/a>. This data is based on a slightly different set of preprint repositories, though both show the same trends.&lt;/p>
&lt;p>The figure below shows the preprints registered with Crossref, broken down by repository.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Fig2-preprints-count-by-repo.png" alt="number of preprints by publisher" width="100%" class="img-responsive" />
&lt;/div>
&lt;p>We eagerly await our newest preprints member, Center for Open Science, who will soon be registering the preprints from their 18 community archives with us (~9k preprints total to date).&lt;/p>
&lt;h3 id="metadata-coverage">Metadata coverage&lt;/h3>
&lt;p>We accept a range of metadata for the preprints registered with us, including:&lt;/p>
&lt;ul>
&lt;li>Repository name &amp;amp; hosting platform&lt;/li>
&lt;li>Contributor names &amp;amp; ORCID iDs&lt;/li>
&lt;li>Title&lt;/li>
&lt;li>Dates (posted, accepted)&lt;/li>
&lt;li>License&lt;/li>
&lt;li>Funding&lt;/li>
&lt;li>Abstract&lt;/li>
&lt;li>Relations&lt;/li>
&lt;li>References&lt;/li>
&lt;/ul>
&lt;p>As with all resource/record types, certain metadata is required, though others are optional. We encourage full coverage of metadata in the record where applicable and possible. So what are publishers including in their posted content records? The summary view is as follows:&lt;/p>
&lt;ul>
&lt;li>License: &lt;a href="https://api.crossref.org/types/posted-content/works?filter=has-license:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">9926 (json)&lt;/a>, 22% (PeerJ Preprints, ChemRxiv)&lt;/li>
&lt;li>Funder: &lt;a href="https://api.crossref.org/types/posted-content/works?filter=has-funder:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">0 (json)&lt;/a>, 0%&lt;/li>
&lt;li>ORCID: &lt;a href="https://api.crossref.org/types/posted-content/works?filter=has-orcid:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">19309 (json)&lt;/a>, 44% (bioRxiv, PeerJ Preprints, Preprints.org, ChemRxiv)&lt;/li>
&lt;li>Abstracts: &lt;a href="https://api.crossref.org/types/posted-content/works?filter=has-abstract:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">35874 (json)&lt;/a>, 81% (bioRxiv, PeerJ Preprints, ChemRxiv)&lt;/li>
&lt;li>References: &lt;a href="https://api.crossref.org/types/posted-content/works?filter=has-references:true&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">1921 (json)&lt;/a>:, 4% (JMIR)&lt;/li>
&lt;/ul>
&lt;p>Compared to all the published content registered with us over time, preprints have above average coverage of ORCID iDs deposited and show well above average with abstract metadata. However, they are significantly lagging behind with depositing references, license, and funding metadata. (See a summary of the full corpus stats taken two months ago in the blog post, &lt;a href="https://doi.org/10.64000/k4j1j-66z41" target="_blank">A Lustrum over the Weekend&lt;/a>).&lt;/p>
&lt;h3 id="preprint-article-pairs">Preprint-article pairs&lt;/h3>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Fig3-preprint-articles.png" alt="number of citations for preprints" width="80%" class="img-responsive"/>
&lt;/div>
&lt;p>Members registering preprints have an obligation to update the metadata record when a journal article is subsequently published, to clearly identify this work. This pairing is passed on to our metadata users: indexing platforms; recommendations engines; platforms; tools, etc. which pull from our APIs. (The preprint landing page also must link to the article.) As such, the preprint-article pairings are amassing as each week passes. We currently have a total of &lt;a href="https://api.crossref.org/works?filter=relation.type:is-preprint-of&amp;amp;facet=publisher-name:*&amp;amp;rows=0" target="_blank">12983 (json)&lt;/a> preprints connected to articles. The figure below provides the counts based on repository.&lt;/p>
&lt;h3 id="citations">Citations&lt;/h3>
&lt;p>We can see from preprint Cited-by counts that researchers are indeed citing preprints in their articles. This practice is an extension of the common citation behavior to provide evidence for and credit to previous work, a natural consequence of work shared with their peers. The &lt;a href="https://api.crossref.org/types/posted-content/works?sort=is-referenced-by-count&amp;amp;order=desc" target="_blank">most highly cited preprint papers (json)&lt;/a> as of May 24, 2018 are as follows. In some cases, a subsequent paper was published from the results shared in the preprint. These have also accrued citations in their own right and these are also indicated in the table below.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>No.&lt;/th>
&lt;th>Cited-by&lt;/th>
&lt;th>Preprint DOI&lt;/th>
&lt;th>Preprint title&lt;/th>
&lt;th>Date&lt;/th>
&lt;th>Subsequent journal article&lt;/th>
&lt;th style="text-align: center">Citations of journal article&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>1&lt;/td>
&lt;td>Cited-by 72&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/005165" target="_blank">https://doi.org/10.1101/005165&lt;/a>&lt;/td>
&lt;td>qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots&lt;/td>
&lt;td>May 14, 2014.&lt;/td>
&lt;td>n/a&lt;/td>
&lt;td style="text-align: center">n/a&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>2&lt;/td>
&lt;td>Cited-by 63&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/002824" target="_blank">https://doi.org/10.1101/002824&lt;/a>&lt;/td>
&lt;td>HTSeq - A Python framework to work with high-throughput sequencing data&lt;/td>
&lt;td>August 19, 2014&lt;/td>
&lt;td>Bioinformatics, &lt;a href="https://doi.org/10.1093/bioinformatics/btu638" target="_blank">https://doi.org/10.1093/bioinformatics/btu638&lt;/a>&lt;/td>
&lt;td style="text-align: center">2372&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>3&lt;/td>
&lt;td>Cited-by 43&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/030338" target="_blank">https://doi.org/10.1101/030338&lt;/a>&lt;/td>
&lt;td>Analysis of protein-coding genetic variation in 60,706 humans&lt;/td>
&lt;td>May 10, 2016&lt;/td>
&lt;td>Nature, &lt;a href="https://doi.org/10.1038/nature19057" target="_blank">https://doi.org/10.1038/nature19057&lt;/a>&lt;/td>
&lt;td style="text-align: center">1598&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>4&lt;/td>
&lt;td>Cited-by 38&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/002832" target="_blank">https://doi.org/10.1101/002832&lt;/a>&lt;/td>
&lt;td>Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2&lt;/td>
&lt;td>November 17, 2014&lt;/td>
&lt;td>Genome Biology, &lt;a href="https://doi.org/10.1186/s13059-014-0550-8" target="_blank">https://doi.org/10.1186/s13059-014-0550-8&lt;/a>&lt;/td>
&lt;td style="text-align: center">3284&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>5&lt;/td>
&lt;td>Cited-by 32&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/021592" target="_blank">https://doi.org/10.1101/021592&lt;/a>&lt;/td>
&lt;td>Salmon provides accurate, fast, and bias-aware transcript expression estimates using dual-phase inference&lt;/td>
&lt;td>August 30, 2016&lt;/td>
&lt;td>Nature Methods, &lt;a href="https://doi.org/10.1038/nmeth.4197" target="_blank">https://doi.org/10.1038/nmeth.4197&lt;/a>&lt;/td>
&lt;td style="text-align: center">112&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>6&lt;/td>
&lt;td>Cited-by 22&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/012401" target="_blank">https://doi.org/10.1101/012401&lt;/a>&lt;/td>
&lt;td>DensiTree 2: Seeing Trees Through the Forest&lt;/td>
&lt;td>December 8, 2014&lt;/td>
&lt;td>n/a&lt;/td>
&lt;td style="text-align: center">n/a&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>7&lt;/td>
&lt;td>Cited-by 21&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/011650" target="_blank">https://doi.org/10.1101/011650&lt;/a>&lt;/td>
&lt;td>FusionCatcher - a tool for finding somatic fusion genes in paired-end RNA-sequencing data&lt;/td>
&lt;td>November 19, 2014&lt;/td>
&lt;td>n/a&lt;/td>
&lt;td style="text-align: center">n/a&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>8&lt;/td>
&lt;td>Cited-by 19&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/048991" target="_blank">https://doi.org/10.1101/048991&lt;/a>&lt;/td>
&lt;td>Analysis of shared heritability in common disorders of the brain&lt;/td>
&lt;td>September 6, 2017&lt;/td>
&lt;td>n/a&lt;/td>
&lt;td style="text-align: center">n/a&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>9&lt;/td>
&lt;td>Cited-by 18&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/006395" target="_blank">https://doi.org/10.1101/006395&lt;/a>&lt;/td>
&lt;td>Error correction and assembly complexity of single molecule sequencing reads&lt;/td>
&lt;td>June 18, 2014&lt;/td>
&lt;td>n/a&lt;/td>
&lt;td style="text-align: center">n/a&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10&lt;/td>
&lt;td>Cited-by 18&lt;/td>
&lt;td>&lt;a href="https://doi.org/10.1101/032839" target="_blank">https://doi.org/10.1101/032839&lt;/a>&lt;/td>
&lt;td>Spread of the pandemic Zika virus lineage is associated with NS1 codon usage adaptation in humans&lt;/td>
&lt;td>November 25, 2015&lt;/td>
&lt;td>n/a&lt;/td>
&lt;td style="text-align: center">n/a&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;br>
The relationship between preprints and the proceeding publication is an interesting area that is not yet well understood. We invite the community to analyze the Crossref metadata using the REST API in concert with other datasets. For example, the citation lifecycle for these two research products has been one of speculation so far without a systematic investigation into patterns and timeframes of preprint citations and those of its succeeding article across the corpus. Here, submission dates would be critical data to this research question as publication windows vary significantly by publisher and by paper.</description></item><item><title>Linking references is different from registering references</title><link>https://www.crossref.org/blog/linking-references-is-different-from-registering-references/</link><pubDate>Wed, 30 May 2018 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/linking-references-is-different-from-registering-references/</guid><description>&lt;p>From time to time we get questions from members asking what the difference is between reference linking and registering references as part the Content Registration process.&lt;/p>
&lt;p>Here&amp;rsquo;s the distinction:&lt;/p>
&lt;blockquote>
&lt;p>Linking out to other articles from your reference lists is a key part of being a Crossref members - it&amp;rsquo;s an obligation in the membership agreement and it levels the playing field when all members link their references to one another.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Registering references when you register your content is completely different. It&amp;rsquo;s enriching the metadata record that describes your content, and it allows Crossref and others&amp;mdash;including non-members&amp;mdash;to use them.&lt;/p>
&lt;/blockquote>
&lt;h3 id="reference-linking">Reference Linking&lt;/h3>
&lt;p>A research article usually includes a reference list of citations to other works that helped inform it. The original function of Crossref was to provide a central service for publishers that enabled them to link to each others&amp;rsquo; content from these reference lists&amp;mdash;using a DOI as a persistent link. This meant that members of all sizes and in all disciplines could easily link to one another without having to sign hundreds of bilateral agreements.&lt;/p>
&lt;p>We made Reference Linking &lt;a href="https://www.crossref.org/membership/terms">obligatory&lt;/a> for Crossref members because it&amp;rsquo;s fundamental to making content discoverable, and because when everyone links their references, research travels further and benefits everyone.&lt;/p>
&lt;h3 id="registering-references">Registering references&lt;/h3>
&lt;p>Every single day hundreds of members register and update their metadata with us&amp;mdash;and every single day hundreds of organisations search for, extract and use it. To make sure your content is discovered in this process, it&amp;rsquo;s important to make the metadata you register with us as rich as possible. Rich metadata includes information such as journal title, article author, publication date, page numbers, ISSN, abstracts, ORCID iDs, funding information, clinical trials numbers, license information, and of course&amp;mdash;references.&lt;/p>
&lt;p>Additionally, registering references is &lt;s> a prerequisite &lt;/s> recommended for participating in our &lt;a href="https://www.crossref.org/documentation/cited-by">Cited-by&lt;/a> service&amp;mdash;which provides citation counts and lists, and ultimately makes your content more discoverable. &lt;em>[EDIT 7th February 2024 - it is no longer required but highly recommended.]&lt;/em>&lt;/p>
&lt;p>We know it&amp;rsquo;s not easy for smaller publishers to deposit references. Read more on how to &lt;a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/references">here&lt;/a>. &lt;s> Our upcoming Metadata Manager tool will allow you to register your references at the same time as the rest of your content. This service is currently in development but &lt;a href="mailto:support@crossref.org">let us know if you want to try it out&lt;/a>. &lt;/s> &lt;em>[EDIT 7th February 2024 - Metadata Manager has been deprecated. More info about it &lt;a href="https://www.crossref.org/documentation/register-maintain-records/metadata-manager/">here&lt;/a>.]&lt;/em>&lt;/p>
&lt;div class='shortcode-row '>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;h3 id="reference-linking">Reference Linking&lt;/h3>
&lt;p>Reference Linking means adding Crossref DOI links to the reference list for journal articles on your article pages as per this example: &lt;a href="https://doi.org/10.1088/1367-2630/1/1/006" target="_blank">https://doi.org/10.1088/1367-2630/1/1/006&lt;/a>.&lt;/p>
&lt;h4 id="how-it-works">How it works&lt;/h4>
&lt;p>First retrieve DOIs for all available references either through our &lt;a href="https://search.crossref.org" target="_blank">human&lt;/a> or &lt;a href="https://api.crossref.org" target="_blank">machine&lt;/a> interfaces. Then make sure you use the DOI link in your references and on your article landing page using the &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">Crossref DOI display guidelines&lt;/a>.&lt;/p>
&lt;h4 id="why-its-useful">Why it’s useful&lt;/h4>
&lt;p>Reference Linking:&lt;/p>
&lt;ul>
&lt;li>Enables you to link to more than 10,000 publishers without having to sign multiple agreements&lt;/li>
&lt;li>Helps with discoverability, because DOIs don’t break if implemented correctly&lt;/li>
&lt;li>Displays your DOIs as URLs so that anyone can copy and share them&lt;/li>
&lt;li>Makes your content more useful to readers&lt;/li>
&lt;li>Drives traffic to your website from other publishers.&lt;/li>
&lt;/ul>
&lt;h4 id="is-it-obligatory">Is it obligatory?&lt;/h4>
&lt;p>Yes, within a short time after becoming a member you should be including references.&lt;/p>
&lt;/div>
&lt;div class="col-md-6 col-sm-12 no-first-para-highlight">&lt;h3 id="registering-references">Registering References&lt;/h3>
&lt;p>Registering references means submitting them as part of your Crossref metadata deposit as per this example:
&lt;a href="https://www.crossref.org/xml-samples/article_with_references.xml" target="_blank">https://www.crossref.org/xml-samples/article_with_references.xml&lt;/a>.&lt;/p>
&lt;h4 id="how-it-works">How it works&lt;/h4>
&lt;p>Whenever you register content with us, make sure you include your references in the submission. You can also add references to your existing content via a &lt;a href="https://support.crossref.org/hc/en-us/articles/213022486-Updating-your-metadata" target="_blank">metadata redeposit&lt;/a>, or our &lt;a href="https://www.crossref.org/documentation/register-maintain-records/maintaining-your-metadata/resource-only-deposit/">resource-only deposit&lt;/a>, or our &lt;a href="https://support.crossref.org/hc/en-us/articles/214236226" target="_blank">Simple Text Query form&lt;/a>.&lt;/p>
&lt;h4 id="why-its-useful">Why it’s useful&lt;/h4>
&lt;p>References registered as part of your metadata:&lt;/p>
&lt;ul>
&lt;li>Make your content more discoverable&lt;/li>
&lt;li>Make your content richer and more useful&lt;/li>
&lt;li>Are required to participate in our Cited-by service (this service shows what articles cite your article)&lt;/li>
&lt;li>Enables discovery of research&lt;/li>
&lt;li>Enables evaluation of research&lt;/li>
&lt;li>Highlights your contents’ provenance&lt;/li>
&lt;li>Helps with citation counts.&lt;/li>
&lt;/ul>
&lt;h4 id="is-it-obligatory">Is it obligatory?&lt;/h4>
&lt;p>No, it’s optional, but strongly encouraged. It is &lt;s> required &lt;/s> recommended if you are participating in our Cited-by service. &lt;em>[EDIT 7th February 2024 - it is no longer required but highly recommended].&lt;/em>&lt;/p>
&lt;/div>
&lt;/div>
&lt;hr>
&lt;p>If you have any questions about reference linking or registering your references please &lt;a href="mailto:support@crossref.org">get in touch&lt;/a>.&lt;/p></description></item><item><title>SSP roadtrip for the Crossref team</title><link>https://www.crossref.org/blog/ssp-roadtrip-for-the-crossref-team/</link><pubDate>Wed, 30 May 2018 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/ssp-roadtrip-for-the-crossref-team/</guid><description>&lt;p>What do you think of when you think of Chicago? Deep dish pizza? Art Deco architecture?&lt;/p>
&lt;p>Well for one week only this year you can add scholarly publishing to the list as the #SSP2018 Conference comes to town. Some Crossref people are excited to be heading out for the conference, and we&amp;rsquo;re looking forward to meeting as many of our members as possible.&lt;/p>
&lt;p>Come along to &lt;strong>stand 212A&lt;/strong> and talk to &lt;a href="https://www.crossref.org/people/anna-tolwinska/">Anna Tolwinska&lt;/a> about Participation Reports. Although this new tool is still in beta, she&amp;rsquo;s giving SSP attendees a sneak peek and the chance to get an early look at whether they (and over 10 000 other members) are registering the ten key elements that add context and richness to the basic required metadata. You&amp;rsquo;ll get real insight into what metadata you&amp;rsquo;re registering, even if this work is done by a third party or other department.&lt;/p>
&lt;p>Thinking about registering preprints or including data citations? Want to find out more about our forthcoming Event Data service? Our product director &lt;a href="https://www.crossref.org/people/jennifer-lin/">Jennifer Lin&lt;/a> will be able to give you the ins and outs of all our latest services so do keep an eye out for her at the conference.&lt;/p>
&lt;p>Speaking of third parties, &lt;a href="https://www.crossref.org/people/amanda-bartell/">I&amp;rsquo;ll&lt;/a> will be popping along to the &amp;ldquo;Thinking the Unthinkable, or How to Prepare for a Platform Migration&amp;rdquo; pre meeting seminar on Wednesday with copies of our new &lt;a href="https://www.crossref.org/education/member-setup/working-with-a-service-provider/checklist-for-platform-migration/">Platform Migration Checklist&lt;/a> and lots of hints and tips to help form a new platform migration guide which will help members have a smooth transition when thinking of moving providers.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/people/shayn-smulyan/">Shayn Smulyan&lt;/a> will be attending the ORCID breakfast meeting on Thursday morning, so come and say hello if you have any questions about how ORCID and Crossref work together. Shayn is one of our support specialists, so he&amp;rsquo;ll be able to help you with any other technical queries you may have.&lt;/p>
&lt;p>Our tech director &lt;a href="https://www.crossref.org/people/chuck-koscher/">Chuck Koscher&lt;/a> will be keen to hone in on members&amp;rsquo; advanced questions about Content Registration, citation matching, and any and all schema deets. So seek him out if you have deep technical questions.&lt;/p>
&lt;p>Want to find out more about &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a>, the new campaign to improve metadata for research? &lt;a href="https://www.crossref.org/people/rosa-morais-clark/">Rosa Morais Clark&lt;/a> will be able to give you the lowdown, and even better - she has stickers!&lt;/p>
&lt;p>And don&amp;rsquo;t feel left out if you aren&amp;rsquo;t a member but work closely with Crossref. &lt;a href="https://www.crossref.org/people/jennifer-kemp/">Jennifer Kemp&lt;/a> will be on hand to answer all your metadata use and reuse questions, she&amp;rsquo;ll be looking forward to chatting with all kinds of service providers, platforms, and tools.&lt;/p>
&lt;p>We&amp;rsquo;re looking forward to seeing you there!&lt;/p></description></item><item><title>How good is your metadata?</title><link>https://www.crossref.org/blog/how-good-is-your-metadata/</link><pubDate>Thu, 26 Apr 2018 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/how-good-is-your-metadata/</guid><description>&lt;p>Exciting news! We are getting very close to the beta release of a new tool to publicly show metadata coverage. As members register their content with us they also add additional information which gives context for other members and for services that help e.g. discovery or analytics.&lt;/p>
&lt;p>Richer metadata makes content useful. Participation reports will give&amp;mdash;for the first time&amp;mdash;a clear picture for anyone to see the metadata Crossref has. This is data that&amp;rsquo;s long been available via our Public REST API, now visualized.&lt;/p>
&lt;h3 id="who-are-participation-reports-for-everyone">Who are participation reports for? Everyone!&lt;/h3>
&lt;p>It&amp;rsquo;s an opportunity to evaluate and educate. See for yourself where the gaps are, and what our members could improve upon. Understand best practice through seeing what others are doing, and learn how to level-up.&lt;/p>
&lt;p>Monitor what metadata is being registered, even if this work is done by a third party or another department. And see what other organisations in scholarly communications see when they use Crossref metadata in their research, tools, and services.&lt;/p>
&lt;p>The beta release—expected after acceptance testing some time late May—will let anyone look up any of our 15,000+ members and see whether they are registering ten key elements that add context and richness to the basic required bibliographic metadata.&lt;/p>
&lt;h3 id="what-do-we-mean-by-richer-metadata">What do we mean by ‘richer metadata’?&lt;/h3>
&lt;p>The ten checks for Beta, will be:&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/checklist.png" alt=“checklist" height="250px" width="200px" class="img-responsive" />
&lt;/div>
&lt;ul>
&lt;li>References&lt;/li>
&lt;li>&lt;del>Open references&lt;/del> &lt;em>[EDIT 6th June 2022 - all references are now open by default].&lt;/em>&lt;/li>
&lt;li>ORCID iDs&lt;/li>
&lt;li>Funder IDs&lt;/li>
&lt;li>Funding award numbers&lt;/li>
&lt;li>Crossmark metadata&lt;/li>
&lt;li>License information&lt;/li>
&lt;li>Full text links&lt;/li>
&lt;li>Similarity Check URLs&lt;/li>
&lt;li>Abstracts&lt;/li>
&lt;/ul>
&lt;p>Each of these additional metadata elements helps increase discovery and wider and more varied use&amp;mdash;and usefulness&amp;mdash;of research outputs.&lt;/p>
&lt;h3 id="why-are-we-doing-this-and-what-do-we-mean-by-participation">Why are we doing this and what do we mean by ‘participation’?&lt;/h3>
&lt;p>Over the years when we’ve talked with our members about their metadata, we learned that many just can’t be certain exactly how they’re performing. It could be that they’ve outsourced &lt;a href="https://www.crossref.org/services/content-registration">Content Registration&lt;/a> to another service provider or larger publisher, or it could be they just weren’t previously aware they could collect and share authors’ ORCID iDs, Funder IDs, and so on. So our primary aim is to give our members the information they need in order to make a case for improving their metadata records. Each check will come with information about why it is important and guidance on how to improve. Additionally, with the growing use of Crossref as a central source of metadata for the research community, it’s in everyone’s interest to be as transparent as possible about what metadata we have - and encourage greater understanding of what’s possible.&lt;/p>
&lt;p>Member ‘participation’ is an important concept. Crossref &lt;a href="https://www.crossref.org/membership/benefits">distinguishes itself from other DOI registration agencies&lt;/a> by providing this richer infrastructure which allows for things like funding information, license information, links between data and preprints, and so on—all contributing to the &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">research nexus&lt;/a> for everyone’s benefit.&lt;/p>
&lt;p>Membership of Crossref is not just about getting a persistent identifier for your content, it’s about placing your content in context by providing as much metadata as possible and looking after it long-term.&lt;/p>
&lt;p>Here’s a sneak preview of what the report will look like:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/springer-nature-prep.jpg" alt="Crossref participation report - Springer Nature" width="100%" />
&lt;p>So whether you’re a member who wants to run a “health check” on your own metadata, or a consumer of metadata interested in what’s available and from whom, watch this space for Participation Reports!&lt;/p>
&lt;h3 id="would-you-like-a-heads-up-on-your-report-pre-beta">Would you like a heads-up on your report, pre-beta?&lt;/h3>
&lt;p>Beta will be released some time in May or June this year, following acceptance testing with members and others. Then we’re looking for about 20 members to have a half-hour phone call with a walk-through ‘health check’. Please &lt;a href="mailto:annat@crossref.org">contact Anna if you’d like to schedule one&lt;/a>.&lt;/p></description></item><item><title>Redirecting redirection</title><link>https://www.crossref.org/blog/redirecting-redirection/</link><pubDate>Tue, 24 Apr 2018 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/redirecting-redirection/</guid><description>&lt;p>Crossref has decided to change the HTTP redirect code used by our DOIs from &lt;code>303&lt;/code> back to the more commonly used &lt;code>302&lt;/code>. Our implementation of 303 redirects back in 2010 was based on recommended best practice for supporting linked data identifiers. Unfortunately, very few other parties have adopted this practice.&lt;/p>
&lt;p>What’s more, because using a 303 redirect is still unusual, it tends to throw &lt;a href="https://en.wikipedia.org/wiki/Search_engine_optimization" target="_blank">SEO&lt;/a> tools into a &lt;a href="http://www.dictionary.com/browse/tizzy?s=t" target="_blank">tizzy&lt;/a>- and we spend a lot of time fielding SEO questions from our members about our use of 303s.&lt;/p>
&lt;h2 id="a-nametldratldra">&lt;a name="tldr">&lt;/a>TL;DR&lt;/a>&lt;/h2>
&lt;p>At this point, we need to emphasise that we have never seen our use of 303s actually affect page rankings. But at the same time, use of 303 redirects has not had wider uptake. Maintaining this quixotic behaviour just isn’t worth the effort. We hope that, in the future, we can use other techniques (e.g. &lt;a href="https://signposting.org/" target="_blank">signposting&lt;/a> &amp;amp; &lt;a href="https://datatracker.ietf.org/doc/draft-vandesompel-citeas/" target="_blank">cite-as&lt;/a>) to achieve some of the things that 303 was supposed to do.&lt;/p>
&lt;p>Note that these changes &lt;strong>will not affect users or machines using DOIs&lt;/strong>. The change should be entirely transparent.&lt;/p>
&lt;p>Below we provide some background to our decision and after that we provide some detailed technical notes from &lt;a href="https://orcid.org/0000-0001-7694-8250" target="_blank">Jonathan Rees&lt;/a> and &lt;a href="https://orcid.org/0000-0001-5490-1347" target="_blank">Henry Thompson&lt;/a> who have been very kind in helping to provide Crossref technical guidance on how we can help DOIs best support linked open data and adhere to HTTP best practice.&lt;/p>
&lt;h2 id="a-namebackgroundbackgrounda">&lt;a name="background">Background&lt;/a>&lt;/h2>
&lt;p>Back in 2010, Crossref, DataCite (and later, several other RAs) responded to &lt;a href="https://doi.org/10.64000/x2spb-3d247" target="_blank">concerns that DOIs were not &amp;ldquo;linked-data friendly.&amp;rdquo;&lt;/a> There were three problems with DOIs at that time:&lt;/p>
&lt;ol>
&lt;li>It was not clear that DOIs could be used and expressed as HTTP URIs.&lt;/li>
&lt;li>There was no standard way to ask a DOI to return a machine-readable representation of the data.&lt;/li>
&lt;li>It wasn’t always clear if the DOI resolved to &amp;ldquo;the thing&amp;rdquo; (e.g. an article) or “something about the thing” (e.g. a landing page).&lt;/li>
&lt;/ol>
&lt;p>On the advice of several people in the linked data community, &lt;a href="https://doi.org/10.64000/8f0n4-64m15" target="_blank">we proposed some options for fixing this&lt;/a>. And we finally settled on:&lt;/p>
&lt;ol>
&lt;li>Recommending that Crossref DOIs be expressed and displayed as HTTP (&lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">now HTTPS&lt;/a>) URIs. This made it clear that DOIs could be used with HTTP applications.&lt;/li>
&lt;li>Enabling DOI registration agencies to support content negotiation. This allowed RAs to support providing machine-readable representations of the data associated with a DOI.&lt;/li>
&lt;li>Changing the underlying redirect code from the normal 302 to 303. This was designed to clarify what, at the time, was true- that most DOIs resolved to a landing page, not the article itself.&lt;/li>
&lt;/ol>
&lt;p>By any practical measure, machine use of DOIs has exploded since we made these decisions back in 2010. Crossref’s APIs and content negotiation handle over 800 million requests for machine readable data a month. Our sibling organisation, &lt;a href="https://www.datacite.org" target="_blank">DataCite&lt;/a>, has also seen a huge growth in machine use of DOIs. Many applications, from bibliographic management tools, to authoring systems and CRIS systems, make use of machine actionable DOIs all the time. So clearly our work to promote DOIs as machine actionable identifiers is working, but we are certain that our current use of 303 redirects has nothing to do with this growth.&lt;/p>
&lt;p>First of all, as we said, very few parties have actually subscribed to the notion of using 303s to help distinguish &amp;ldquo;the thing&amp;rdquo; from “something about the thing”.&lt;/p>
&lt;p>Secondly, even if they did try to rely on 303s to make this distinction, they would quickly get confused because the DOI is so often just the first in a chain of redirects which do not implement the same semantic distinction. At this point we should be clear - Crossref thinks these kinds of long redirect chains are a bad idea for two main reasons:&lt;/p>
&lt;ul>
&lt;li>They slow down resolution.&lt;/li>
&lt;li>They increase the number of potential failure points between the DOI and the item it resolves to.&lt;/li>
&lt;/ul>
&lt;p>But we also cannot legislate them away. They exist. And in the real world you will find plenty of DOIs that do a 303 redirect to a system that, in turn, does a 302 redirect to a system that does a 301 redirect and…eventually ends up someplace returning a 200. You get the picture. How on earth is a machine supposed to interpret a 303-&amp;gt;302-&amp;gt;301-&amp;gt;302 redirect chain?&lt;/p>
&lt;p>Furthermore - nowadays, after following this chain of redirects, you will often find yourself on a &amp;ldquo;page&amp;rdquo; that is &lt;em>both&lt;/em> a landing page &lt;em>and&lt;/em> the article itself. Dynamic, one-page applications can simply morph the one into the other without the use of additional HTTP requests.&lt;/p>
&lt;p>In other words, using 303s is not helping machines interpret what the DOI is pointing at. And yet, people seem to be making good use of machine actionable DOIs and they are not complaining much about it.&lt;/p>
&lt;p>Personally, I’d might have just been happy to switch back to using 302s &lt;em>simply&lt;/em> so that I could cut down on my conversations with SEO hacks. But that wouldn’t be a principled approach. In 2010 we spent a lot of time considering the initial switch to 303s- we needed to consult with the LOD community on a potential switch back to 302s. At the January 2018 &lt;a href="https://pidapalooza.org/" target="_blank">PIDapalooza&lt;/a> I had a chance to talk to Henry Thomson about the 302/303 dilemma we faced, and he along with Jonathan Rees very generously provided the following feedback.&lt;/p>
&lt;h2 id="a-namedetailsbest-practices-for-http-redirection-by-persistent-identifier-resolvers-302-vs-303a">&lt;a name="details">Best practices for HTTP redirection by persistent identifier resolvers: 302 vs. 303&lt;/a>&lt;/h2>
&lt;ul>
&lt;li>Jonathan Rees (MIT CSAIL, &lt;a href="https://orcid.org/0000-0001-7694-8250" target="_blank">https://orcid.org/0000-0001-7694-8250&lt;/a>)&lt;/li>
&lt;li>Henry Thompson (University of Edinburgh, School of Informatics, &lt;a href="https://orcid.org/0000-0001-5490-1347" target="_blank">https://orcid.org/0000-0001-5490-1347&lt;/a>)&lt;/li>
&lt;/ul>
&lt;p>If one goes to the trouble to organize an identifier system, then the desire that such a system should last as long as possible leads one to aspirationally say it’s a &lt;em>persistent&lt;/em> identifier (PID) system. The unwillingness of the major browser suppliers to implement new URI schemes for PIDs initially hindered their use on the Web and this in turn inhibited widespread adoption. More recently a number of PID approaches have enjoyed very rapid growth as a result of a compromise: these PIDs participate in the World Wide Web by defining simple conversion rules mapping identifiers to &lt;em>actionable&lt;/em> (&amp;lsquo;http:&amp;rsquo; and/or &amp;lsquo;https:&amp;rsquo;) forms and providing resolution servers that redirect requests for such forms to the appropriate destination.This approach has been widely adopted and is very successful, because it is so useful. An identifier’s actionable form leads, via the HTTP protocol and one or more redirections, to a web page that bears on the ground identity of the associated entity – or perhaps even directly to the entity itself, if the system is one for document entities that are naturally provided as web pages. The nature of the retrieved web page varies from one system to the next.&lt;/p>
&lt;p>A confusion arose, however, over claims in various technical specifications (&lt;a href="https://tools.ietf.org/html/rfc3986" target="_blank">URIs&lt;/a>, &lt;a href="https://tools.ietf.org/html/rfc2616" target="_blank">HTTP&lt;/a>, &lt;a href="https://www.w3.org/TR/webarch/" target="_blank">Web Architecture&lt;/a>) that the normal case is for the protocol to yield a &amp;ldquo;representation&amp;rdquo; of the “resource” “identified” by the URI. None of these terms is adequately defined by the specifications, and initially the language was not taken as normative. Those deploying identifier systems took the HTTP “resource” to be the entity associated with an identifier, and understood the “resource” as being “identified” by the URI, but it was never clear what was, or wasn’t, a “representation” of a given entity/resource: a description of the resource, the resource itself, a version of the resource, instructions on how to find the resource, etc. Sixteen years ago, in an attempt to clarify the intent of this part of the theory of URIs, and to allow applications to usefully and uniformly exploit the idea that an HTTP 200 response must deliver a “representation” of the “resource”, Tim Berners-Lee &lt;a href="https://lists.w3.org/Archives/Public/www-tag/2002Mar/0092" target="_blank">asked&lt;/a> the &lt;a href="https://www.w3.org/2001/tag/" target="_blank">W3C Technical Architecture Group&lt;/a> to consider what came to be known as the &lt;a href="https://www.w3.org/2001/tag/group/track/issues/14" target="_blank">httpRange-14&lt;/a> issue. It’s now 13 years after the TAG gave &lt;a href="https://lists.w3.org/Archives/Public/www-tag/2005Jun/0039.html" target="_blank">advice&lt;/a> which almost no one was happy with, and 5 years after work on issue &lt;a href="https://www.w3.org/2001/tag/group/track/issues/57" target="_blank">httpRedirections-57&lt;/a> (which superseded httpRange-14) ground to a halt. There’s still no consensus on whether it’s OK to return landing pages with a 200 status in response to requests for pictures or publications, but the Web seems to be working nonetheless, and no one seems to be bothered much anymore.&lt;/p>
&lt;p>The provision of HTTP-based resolution services has stimulated widespread support for the use of identifier systems with Web resolution, particularly in the scholarly journal publication context. Those setting up HTTP resolvers responsible for identifier systems must decide which HTTP response code should be used. The TAG’s advice sows doubt on the use of the 200 response code when the response would have been a landing page, and many resolvers avoid 200 regardless and use redirection for administrative purposes, for example&lt;/p>
&lt;p>‘&lt;a href="https://dx.doi.org/10.1109/5.771073" target="_blank">https://dx.doi.org/10.1109/5.771073&lt;/a>’ to&lt;/p>
&lt;p>‘&lt;a href="http://ieeexplore.ieee.org/document/771073/?reload=true" target="_blank">http://ieeexplore.ieee.org/document/771073/?reload=true&lt;/a>’ for the DOI&lt;/p>
&lt;p>‘10.1109/5.771073’, or ‘&lt;a href="https://identifiers.org/uniprot/A0A022YWF9" target="_blank">https://identifiers.org/uniprot/A0A022YWF9&lt;/a>’ to&lt;/p>
&lt;p>‘&lt;a href="http://www.uniprot.org/uniprot/A0A022YWF9" target="_blank">http://www.uniprot.org/uniprot/A0A022YWF9&lt;/a>’ for the Uniprot identifier&lt;/p>
&lt;p>‘A0A022YWF9’.&lt;/p>
&lt;p>So the response should be a redirection, but what kind, 301, 302, or 303? (Or 307, which is almost the same as 302.) A 301 redirect seems to say that the URI is not persistent (since its target is deemed &amp;ldquo;more persistent&amp;rdquo;). A 302 redirect seems to say that the response could have come via a 200, and so suffers the same fate as 200. That leaves 303, as hinted at in the TAG’s advice. This idea got some traction: Ten years ago a Semantic Web interest group promoted the TAG’s advice in &lt;a href="https://www.w3.org/TR/cooluris/" target="_blank">a published note&lt;/a>, and seven years ago one of us wrote a &lt;a href="https://odontomachus.wordpress.com/2011/05/04/crossrefs-gift-of-metadata/" target="_blank">blog post&lt;/a> giving the same advice for resolvers for PIDs in publishing.&lt;/p>
&lt;p>However, not only is there neither consensus nor general utility around this strict understanding of the use of the various response codes – that is, that resolution to a landing page is inconsistent with a 200 (and &lt;em>a posteriori&lt;/em> therefore with a 302) – but also the range of usage patterns for redirection of HTTP requests has grown and ramified over time as the Web has grown and become more complex. It’s on the face of it unlikely that a mere three response codes can capture all the resulting complexity or cover the space of outcomes (in terms of e.g. what ends up in the browser address bar or what search engines index a page under) that a page owner might like to signal.&lt;/p>
&lt;p>We find in practice that some PID redirections &lt;em>are&lt;/em> ending up (usually after further publisher-local redirects) at the &amp;ldquo;identified&amp;rdquo; document, some at landing pages, and some at one &lt;em>or&lt;/em> the other depending on the requesting site, for example in the case of paywalled material.&lt;/p>
&lt;p>In the absence of a rethinking of the whole 3xx space, it seems to us that only the 301 vs. 302 distinct ion (roughly, 301 = permanent = please fix the link, and 302 = temporary = don’t change the link) is well understood and more or less consistently treated, whereas for 303, web servers are not very consistent and both &lt;a href="http://sharkseo.com/nohat/303-redirects-seo/" target="_blank">search engine&lt;/a> and citation crawler behaviours are at best inconsistent and at worst downright unhelpful.&lt;/p>
&lt;p>So, we believe it is in both users’ and publishers’ interests for resolvers of actionable-form PIDs to use 302 redirects, not 303.&lt;/p>
&lt;p>If we want to help machines better understand the resource that a DOI points at, we have to explore using more nuanced mechanisms.&lt;/p>
&lt;p>Just using 302 for the first redirect doesn&amp;rsquo;t do everything necessary to effectively support the emerging PID+redirection architecture. It&amp;rsquo;s at the &lt;em>end&lt;/em> of the redirect chains that we need more: a standardised way to find the PID back at the start of the chain. The &lt;a href="https://datatracker.ietf.org/doc/draft-vandesompel-citeas/" target="_blank">&amp;lsquo;cite-as&amp;rsquo; proposal&lt;/a> does exactly this, and we hope it&amp;rsquo;s quickly approved and widely adopted. Once &lt;em>that&lt;/em> happens a proposal for augmenting browser (and API) behaviour to prefer, or at least offer, the &amp;lsquo;cite-as&amp;rsquo; link for bookmarking and copying will be needed.&lt;/p></description></item><item><title>Using the Crossref REST API. Part 8 (with Researchfish)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-8-with-researchfish/</link><pubDate>Mon, 23 Apr 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-8-with-researchfish/</guid><description>&lt;p>Continuing our &lt;a href="https://www.crossref.org/categories/api-case-study/">blog series&lt;/a> highlighting the uses of Crossref metadata, we talked to Gavin Reddick, Chief Analyst at &lt;a href="https://www.researchfish.net/" target="_blank">Researchfish&lt;/a> about the work they’re doing, and how they’re using our REST API as part of their workflow.&lt;/p>
&lt;h3 id="introducing-researchfish">Introducing Researchfish&lt;/h3>
&lt;p>&lt;a href="https://www.researchfish.net/" target="_blank">Researchfish&lt;/a> is the world’s leading platform for the reporting of the outputs, outcomes and impacts of funded research. It is used by over 100 funding organisations in Europe, North America and Australasia and currently tracks around €50 billion of funding, across 125,000 grants. Researchers have reported around 2.5 million attributed outcomes in Researchfish and roughly half of these are publications with the other half being collaborations, further funding, data sets, policy influences, engagement activities etc.&lt;/p>
&lt;p>Funders use Researchfish to ask grantees to report on the outcomes of their grant and Researchfish makes it easy for researchers to do this in a structured way. Researchfish seeks to improve the quality and robustness of the evidence base available for evaluation. It works with funders, research organisations and researchers to present, explain and evaluate the impact of research across all disciplines and a wide range of output types.&lt;/p>
&lt;h3 id="how-is-the-crossref-rest-api-used-in-researchfish">How is the Crossref REST API used in Researchfish?&lt;/h3>
&lt;ol>
&lt;li>
&lt;p>Search&lt;br>
As publications are a major output of research it is important to make the reporting of those publications be as easy as possible and quality of the information on those publications as high as possible. Researchfish integrates with a number of publication APIs, including Crossref, which enables users to enter a number of DOIs or search by author, title, etc. to find their publication.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Direct Harvest&lt;br>
Researchfish uses funding acknowledgements in the Crossref metadata to add publications to researchers’ portfolios and report the publications as arising from the grant. If the acknowledgement exists it’s important to use it instead of asking researchers to report the same thing twice.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Interoperability&lt;br>
Research organisations can upload publications to Researchfish on behalf of researchers, re-using information from their local systems. We use the Crossref REST API to validate the data provided by universities before uploading.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Metadata Enrichment – Open Access&lt;br>
We use the license and embargo period information in the Crossref metadata to help understand the open access status of publications and whether they meet any policy requirements, without researchers having to take any steps to report in this complex area.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Metadata Enrichment – Normalisation/deduplication&lt;br>
As Researchfish allows users to add information from lots of different sources it is very important to normalise the data and prevent the same publication being reported multiple times in different ways. We use the Crossref REST API as part of this process.&lt;/p>
&lt;/li>
&lt;/ol>
&lt;h3 id="what-are-the-future-plans-for-researchfish">What are the future plans for Researchfish?&lt;/h3>
&lt;p>We are looking to expand the range of integrations to support non-publication outputs and allow some of the same functionality that we have built for publications. We already have integrations to support the reporting of patents, collaborations, further funding and next destinations but are looking to enhance these, along with expanding links to data sets, clinical trials, software and spin out companies.&lt;/p>
&lt;h3 id="what-else-would-researchfish-like-to-see-in-crossref">What else would Researchfish like to see in Crossref?&lt;/h3>
&lt;p>Crossref is an excellent resource and most of our wish list would be to see more uptake of existing fields e.g. retractions and the ability to use them more flexibly in the REST API. We would also like to see a little more consistency in some of the metadata – publication type is the area that seems to cause the most confusion, particularly around conference proceedings and clinical trials.&lt;/p>
&lt;hr>
&lt;p>Thank you Researchfish! If you would like to contribute a case study on the uses of Crossref Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>PIDs for conferences - your comments are welcome!</title><link>https://www.crossref.org/blog/pids-for-conferences-your-comments-are-welcome/</link><pubDate>Thu, 19 Apr 2018 00:00:00 +0000</pubDate><author>Aliaksandr Birukou</author><guid>https://www.crossref.org/blog/pids-for-conferences-your-comments-are-welcome/</guid><description>&lt;p>&lt;em>Aliaksandr Birukou is the Executive Editor for Computer Science at Springer Nature and is chair of the &lt;a href="https://www.crossref.org/working-groups/conferences-projects/">Group&lt;/a> that has been working to establish a persistent identifier system and registry for scholarly conferences. Here Alex provides some background to the work and asks for input from the community:&lt;/em>&lt;/p>
&lt;p>Roughly one year ago, Crossref and DataCite &lt;a href="https://doi.org/10.64000/skv7b-cef25" target="_blank">started&lt;/a> a working group on conference and project identifiers. With this blog post, we would like to share the &lt;a href="https://docs.google.com/document/d/1URIvkUpzcfjSd2YFIS-rdRIrOyrKSbFfhkdpGPRTAFI/edit" target="_blank">specification&lt;/a> of conference metadata and Crossmark for proceedings and are inviting the broader community to comment.&lt;/p>
&lt;h3 id="why-are-conferences-important">Why are conferences important?&lt;/h3>
&lt;p>One common misbelief is that most published research appears in journals. However, next to new ways of communication research results (blogs, presentations,…) and journals there are also other publication options, like books, very important in humanities, or conference proceedings, which are very important in computer science and a couple of related disciplines. Conference proceedings are collections of journal-like papers, often undergoing a more competitive peer review process than in journals. For instance, looking at original research in computer science in Scopus published in CS in 2012-2016, 63% of articles appeared in proceedings, while only 37% were published in journals. &lt;a href="http://dblp.uni-trier.de/statistics/distributionofpublicationtype" target="_blank">DBLP&lt;/a>, one of the most important indexing services in CS, lists more than two million conference papers organized in ~5,400 conference series.&lt;/p>
&lt;p>So, while it is true that CS has a significant share of conference proceedings, conferences are also relevant in many other disciplines which do not publish formal proceedings. For instance, &lt;a href="http://inspirehep.net/" target="_blank">inSPIRE&lt;/a> contains ~23,000 conferences in high-energy physics, the American Society of Mechanical Engineers (ASME) publishes roughly 100 &lt;a href="https://web.archive.org/web/20180203164329/http://proceedings.asmedigitalcollection.asme.org/conferenceproceedings.aspx" target="_blank">proceedings&lt;/a> volumes annually.&lt;/p>
&lt;h3 id="why-do-we-need-an-open-persistent-id-for-a-conference-or-a-conference-series">Why do we need an open persistent ID for a conference or a conference series?&lt;/h3>
&lt;p>With publishers, learned societies, indexing services, libraries, conference management systems, research evaluation and funding agencies using conferences directly or indirectly in their daily work, a common vocabulary would simplify data processing, reporting and minimize errors. Right now, a publisher assigns a unique conference ID to the conference to be published, then an indexing service does it, then it is assigned in a library. Wouldn&amp;rsquo;t it be easier to do this at the very beginning of the process, when the conference planning starts, and keep the same identifier through the whole conference lifecycle?&lt;/p>
&lt;p>The joint Crossref and DataCite group on conference and project identifiers has discussed this topic at half a dozen calls and various PID community meetings (PIDapalooza, FORCE conferences, AAHEP Information Provider Summit). The result of those discussions is a draft of the specification of conference metadata and Crossmark for proceedings.&lt;/p>
&lt;p>The document first defines the concepts of a conference, conference series, joint and co-located conferences. It then introduces the information we want to store about those entities, e.g., the ID, name, acronym, other IDs, URL and the maintainer of the conference series, or the ID, conf series ID, number, dates, location, and URL for conferences. Such metadata can be submitted to Crossref and DataCite by conference organizers or publishers on their behalf and linked to the existing proceedings metadata, where appropriate. It can be then used for linking research outputs from a conference (beyond formal proceedings), recognizing reviewers via services such as ORCID and Publons, computing metrics of a conference series, conference disambiguation in indexing services and ratings (CORE, QUALIS, CCF), and so on.&lt;/p>
&lt;p>The second part of the document introduces Crossmark for conference proceedings. Its goal is to structure and preserve the information about the peer review process of a conference as declared by the general or program chairs. Depending on how much information is available from the conference organizers, one can use the basic or extended versions of Crossmark.&lt;/p>
&lt;p>In order to comment, please open the &lt;a href="https://docs.google.com/document/d/1URIvkUpzcfjSd2YFIS-rdRIrOyrKSbFfhkdpGPRTAFI/edit" target="_blank">specification&lt;/a> and leave comments using “comment” feature of Google Docs. The draft remains open for comments till the &lt;strong>31st of May 2018&lt;/strong>.&lt;/p>
&lt;h3 id="next-steps">Next steps&lt;/h3>
&lt;p>After hearing from YOU, we will update the document to reflect the community comments. In parallel, we start a subgroup discussing the governance models, looking into whether we need a new membership category at Crossref, what fees should be covered, etc.&lt;/p>
&lt;hr></description></item><item><title>Do you want to be on our Board?</title><link>https://www.crossref.org/blog/do-you-want-to-be-on-our-board/</link><pubDate>Wed, 18 Apr 2018 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/do-you-want-to-be-on-our-board/</guid><description>&lt;blockquote>
&lt;p>Do you want to effect change for the scholarly community?&lt;/p>
&lt;/blockquote>
&lt;p>The Crossref Nominating Committee is inviting expressions of interest to serve on the Board as it begins its consideration of a slate for the November 2018 election.&lt;/p>
&lt;p>The key responsibilities of the Board are:&lt;/p>
&lt;ol>
&lt;li>Setting the strategic direction for the organisation;&lt;/li>
&lt;li>Providing financial oversight; and&lt;/li>
&lt;li>Approving new policies and services.&lt;/li>
&lt;/ol>
&lt;h3 id="some-of-the-decisions-the-board-has-made-in-recent-years-include">Some of the decisions the board has made in recent years include:&lt;/h3>
&lt;ul>
&lt;li>Introduction of the Metadata APIs Plus service (to provide a paid-for premium service for machine access to metadata);&lt;/li>
&lt;li>Updating the policy on open references (to increase links so that more readers can access content);&lt;/li>
&lt;li>Establishing &lt;a href="https://doi.org/10.64000/g720f-z9z14" target="_blank">the OI Project&lt;/a> (to create a persistent organisation Identifier);&lt;/li>
&lt;li>Inclusion of &lt;a href="https://www.crossref.org/news/2016-11-02-crossref-now-accepts-preprints/">preprints in the Crossref metadata&lt;/a>; and&lt;/li>
&lt;li>Approval to develop &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> (which will track online activity from multiple sources).&lt;/li>
&lt;/ul>
&lt;h3 id="what-is-expected-of-a-crossref-board-member">What is expected of a Crossref Board member?&lt;/h3>
&lt;p>Board members should be able to attend all board meetings, which occur three times a year in different parts of the world. If you are unable to attend in person you must be able to attend via telephone.&lt;/p>
&lt;p>Board members must:&lt;/p>
&lt;ul>
&lt;li>be familiar with the three key responsibilities listed above,&lt;/li>
&lt;li>actively participate and contribute towards discussions, and&lt;/li>
&lt;li>read the board documents and materials provided, prior to attending meetings.&lt;/li>
&lt;/ul>
&lt;h3 id="how-to-submit-an-expression-of-interest-to-serve-on-the-board">How to submit an expression of interest to serve on the Board&lt;/h3>
&lt;p>We are seeking people who know about scholarly communications and would like to be part of our future. If you have a vision for the international Crossref community, we are interested in hearing from you.&lt;/p>
&lt;p>If you are a Crossref member, are &lt;a href="https://doi.org/10.64000/wfmdf-hmv37" target="_blank">eligible to vote&lt;/a>, and would like to be considered, you should complete and submit the &lt;a href="https://docs.google.com/forms/d/1AaPqLz4jBUeZ-VggkvRHBSYfwadrwfT2FP6YGcbyb48/edit" target="_blank">expression of interest&lt;/a> form with both your organisation&amp;rsquo;s statement and your personal statement before 18 May 2018.&lt;/p>
&lt;p>It is important to note it is your organisation who is the Crossref member—and therefore the seat will belong to your organisation.&lt;/p>
&lt;h3 id="about-the-election-and-our-board">About the election and our Board&lt;/h3>
&lt;p>We have a principle of &lt;a href="https://www.crossref.org/truths/">“one member, one vote”&lt;/a>; our board comprises a cross-section of members and it doesn’t matter how big or small you are, every member gets a single vote. Board terms are three years, and one third of the Board is eligible for election every year. There are five seats up for election in 2018.&lt;/p>
&lt;p>The board meets in a variety of international locations in March, July, and November each year. &lt;a href="https://www.crossref.org/board-and-governance/">View a list of the current Crossref Board members and a history of the decisions they’ve made (motions).&lt;/a>&lt;/p>
&lt;p>The election opens online in September 2018 and voting is done by proxy online, or in person, at the annual business meeting during ‘Crossref LIVE18’ on 13th November 2018 in Toronto, Canada. Election materials and instructions for voting will be available to all Crossref members online in September 2018.&lt;/p>
&lt;h3 id="the-role-of-the-nominating-committee">The role of the Nominating Committee&lt;/h3>
&lt;p>The Nominating Committee meets to discuss change, process, criteria, and potential candidates, ensuring a fair representation of membership. The Nominating Committee is charged with selecting a slate of candidates for election from those who have expressed an interest.&lt;/p>
&lt;p>The selection of the slate (which is likely to exceed the number of open seats) is based on the quality of the expressions of interest and maintaining the balance and diversity of the board—especially in areas of organisational size, gender, geography and sector.&lt;/p>
&lt;p>The Committee is made up of three board members not up for election, and two non-board members. The current Nominating Committee members are:&lt;/p>
&lt;ul>
&lt;li>Mark Patterson, eLife (Chair);&lt;/li>
&lt;li>Chris Shillum, Elsevier;&lt;/li>
&lt;li>Amy Brand, MIT Press;&lt;/li>
&lt;li>Vincent Cassidy, The Institution of Engineering &amp;amp; Technology (IET); and&lt;/li>
&lt;li>Claire Moulton, The Company of Biologists.&lt;/li>
&lt;/ul>
&lt;p>Our board needs to be stay truly representative of Crossref’s global and diverse membership of organisations who publish. Please &lt;a href="https://docs.google.com/forms/d/1AaPqLz4jBUeZ-VggkvRHBSYfwadrwfT2FP6YGcbyb48/edit" target="_blank">submit your statements of interest&lt;/a> or reply to me with any questions to me at &lt;a href="mailto:lhart@crossref.org">lhart@crossref.org&lt;/a>.&lt;/p></description></item><item><title>Hear this, real insight into the inner workings of Crossref</title><link>https://www.crossref.org/blog/hear-this-real-insight-into-the-inner-workings-of-crossref/</link><pubDate>Sun, 01 Apr 2018 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/hear-this-real-insight-into-the-inner-workings-of-crossref/</guid><description>&lt;h2 id="you-want-to-hear-more-from-us-we-hear-you">You want to hear more from us. We hear you.&lt;/h2>
&lt;p>We’ve spent the past year building Crossref Event Data, and hope to launch very soon. Building a new piece of infrastructure from scratch has been an exciting project, and we’ve taken the opportunity to incorporate as much feedback from the community as possible. We’d like to take a moment to share some of the suggestions we had, and how we’ve acted on them.&lt;/p>
&lt;p>We asked a focus group “&lt;strong>What one thing would you change?&lt;/strong>”. In hindsight, we could have done a better job with the question. We did get some enlightening answers but&amp;mdash;for legal and practical reasons&amp;mdash;we are unable to end either world hunger or global conflict, or do any of the other things we were invited to do. So we went back to our focus group and asked “What one thing would you change &lt;em>about Crossref&lt;/em>?”.&lt;/p>
&lt;p>The answers were illuminating. Some of you wanted mundane things like more data dumps. A disappointing number of people wanted us to put the capital ‘R’ back in our name. But two things we heard consistently, loud and clear, were:&lt;/p>
&lt;ol>
&lt;li>“I want to hear more from Crossref”&lt;/li>
&lt;li>“I want to know more about what’s going on inside Crossref”&lt;/li>
&lt;/ol>
&lt;p>One respondent said:&lt;/p>
&lt;blockquote>
&lt;p>I like the newsletters, and the Twitter visuals are nice enough, but I want to hear, you know, &lt;em>more&lt;/em> from them.&lt;/p>
&lt;/blockquote>
&lt;p>Another:&lt;/p>
&lt;blockquote>
&lt;p>Crossref is your typical quiet DOI Registration Agency. They make a big thing about being the background infrastructure you don’t notice. But infrastructure doesn’t have to be quiet. I live next to the M25, and I can tell you, that’s the sound of success. I mean, it’s loud.&lt;/p>
&lt;/blockquote>
&lt;p>One final quote which clinched it for us:&lt;/p>
&lt;blockquote>
&lt;p>The outreach team is doing a great job with their multilingual videos. But you can never cover every world language. In today’s connected world, you should be thinking about the &lt;em>universal language&lt;/em>.&lt;/p>
&lt;/blockquote>
&lt;p>She clarified:&lt;/p>
&lt;blockquote>
&lt;p>No, I don’t mean XML.&lt;/p>
&lt;/blockquote>
&lt;p>We took this advice to heart. When we were building Crossref Event Data, we baked these features right in. Now you can hear what’s going on inside Crossref, any time, day or night.&lt;/p>
&lt;h2 id="introducing-the-crossref-thing-action-service">Introducing the Crossref Thing Action Service!&lt;/h2>
&lt;p>Turn up your speakers (about half-way, it would be foolhardy to turn them too high) and visit:&lt;/p>
&lt;h3 id="liveeventdatacrossreforgthing-action-servicehtmlhttpsliveeventdatacrossreforgthing-action-servicehtml">&lt;a href="https://live.eventdata.crossref.org/thing-action-service.html" target="_blank">live.eventdata.crossref.org/thing-action-service.html&lt;/a>&lt;/h3>
&lt;p>It’s optimized for Google Chrome, but we’ve tested it in Firefox and Safari.&lt;/p>
&lt;p>The &lt;strong>Thing Action Service&lt;/strong> shows you, in excruciating sonorous detail, every single action that happens inside the Crossref Event Data system. Every time we receive live data from Twitter or Wikipedia. Every time we check a DOI. Every time we check an RSS feed. Every time we find a link to our Registered Content on the web.&lt;/p>
&lt;p>In a pioneering move within the scholarly publishing space, you can hear the data as it’s being processed, live. Furthermore, we think we are the first DOI Registration Agency to offer our services in stereo.&lt;/p>
&lt;p>John Chodacki, Professional Working Group Chair, said:&lt;/p>
&lt;blockquote>
&lt;p>We welcome this innovation. From my experience Chairing, well, everything, I’m certain that hearing-impaired users will like it especially.&lt;/p>
&lt;/blockquote>
&lt;p>So sit back, put the Thing Action Service on the speakers, and relax. You may find it difficult at first, but as you let the sound waves wash over you, think of all that data in flight. That beep could be someone criticizing the article you wrote on Twitter. But don’t worry, the next one might be someone defending it.&lt;/p>
&lt;p>Think of it as &lt;em>musique concrète&lt;/em>. That’s the Art of Persistence.&lt;/p></description></item><item><title>Hello, meet Event Data Version 1, and new Product Manager</title><link>https://www.crossref.org/blog/hello-meet-event-data-version-1-and-new-product-manager/</link><pubDate>Thu, 29 Mar 2018 00:00:00 +0000</pubDate><author>Christine Buske</author><guid>https://www.crossref.org/blog/hello-meet-event-data-version-1-and-new-product-manager/</guid><description>&lt;p>I joined Crossref only a few weeks ago, and have happily thrown myself into the world of Event Data as the service’s new product manager. In my first week, a lot of time was spent discussing the ins and outs of Event Data. This learning process made me very much feel like you might when you’ve just bought a house, and you’re studying the blueprints while also planning the house-warming party.&lt;/p>
&lt;p>If Event Data is like a house, it’s been built and we’ve recently been putting on a last coat of paint. We’re very happy to announce version 1 of the API today. This is bringing us closer to the launch (house warming party), which will officially present Event Data to the world. Further to that analogy, while I bought into the house, I wasn’t around to see it being built. That’s both incredibly exciting and a little daunting.&lt;/p>
&lt;p>Version 1 contains fixes for some challenges we came up against. Like scalability, data modeling for Wikipedia, and polishing. Version 1 is a new release of the data, but it is the same data set you already know and love. It should solve some of the recent stability issues, for which we apologize.&lt;/p>
&lt;p>Moving forward, we expect the data model in V1 to persist and are not planning to make further large scale, fundamental changes to the Event Data API. As such, the version 1 release of the API is exceptional and a big step forward. It is important that we address these fixes before we go into production as it affects everyone who uses the service.&lt;/p>
&lt;h2 id="same-event-data-new-address">Same Event Data, new address&lt;/h2>
&lt;p>In setting up for the upcoming production service rollout, we have updated the Event Data API domain so that it is in line with Crossref’s suite of APIs. The Query API can now be found at a new URL. Here is an example query: &lt;a href="https://api.eventdata.crossref.org/v1/events?rows=1" target="_blank">https://api.eventdata.crossref.org/v1/events?rows=1&lt;/a>&lt;/p>
&lt;p>We have also simplified the standard query parameters in favor of a cleaner filter syntax.&lt;/p>
&lt;p>Lastly, we have added a new “Mailto” parameter, &lt;a href="https://github.com/CrossRef/rest-api-doc#etiquette" target="_blank">just like in our REST API&lt;/a>. It is encouraged but optional, so you are not obliged to supply it. We&amp;rsquo;ll only use it to contact you if there&amp;rsquo;s a problem.&lt;/p>
&lt;h2 id="changes-to-the-wikipedia-data-structure">Changes to the Wikipedia data structure&lt;/h2>
&lt;p>We’ve done a lot of work to use the &lt;a href="https://www.eventdata.crossref.org/guide/data/ids-and-urls/" target="_blank">canonical URLs&lt;/a> for web pages to represent content as consistently as possible. This has entailed updating previously collected Events across data sources. As such, we’ve updated our Wikipedia data model to align with this. Because this update has impacted every Wikipedia Event in the system, we recommend those who have used or saved existing data from the deprecated Query API version to pull a new copy of the data. Read more about &lt;a href="https://groups.google.com/forum/#!topic/crossref-event-data-beta-testers/-RAzhr7SIHY" target="_blank">the rationale for changing the Wikipedia data model&lt;/a>.&lt;/p>
&lt;h2 id="updated-data">Updated data&lt;/h2>
&lt;p>This then brings me to how we now handle updated data. Sometimes we edit Events to add new features, or we may edit Events if there is an issue processing and/or representing the data when we provision it to the community. And sometimes we must remove Events to comply with a particular data source’s terms and conditions (ex: deleted Tweets). You can read about how updates work in &lt;a href="https://www.eventdata.crossref.org/guide/data/updates/" target="_blank">the user guide&lt;/a>.&lt;/p>
&lt;p>To make life easier moving forward, we’ve split updated Events into two API endpoints.
If you are already using Event Data, you will need to make some small updates to your client(s) to align with this. The new endpoints are further described &lt;a href="https://www.eventdata.crossref.org/guide/service/query-api/" target="_blank">in the documentation&lt;/a>.&lt;/p>
&lt;h2 id="event-data-beta-group">Event Data beta group&lt;/h2>
&lt;p>With the version 1 release we are making solid progress towards an official launch (the house-warming party!), we are quite excited to &lt;a href="mailto:eventdata@crossref.org">hear how you are using Event Data&lt;/a>. Please consider [joining our beta group] (&lt;a href="https://groups.google.com/forum/#!forum/crossref-event-data-beta-testers%29" target="_blank">https://groups.google.com/forum/#!forum/crossref-event-data-beta-testers)&lt;/a>, if you are using the Event Data API or want to hear about updates.&lt;/p>
&lt;p>This is also where you can &lt;a href="https://groups.google.com/forum/#!topic/crossref-event-data-beta-testers/2fak5d1UMag" target="_blank">read about these updates in more detail&lt;/a>.&lt;/p>
&lt;p>For more information and to get started with Crossref Event Data, please refer to &lt;a href="https://www.eventdata.crossref.org/guide/index.html" target="_blank">the user guide&lt;/a>.&lt;/p>
&lt;p>I am looking forward to seeing how Event Data is being used, and working with the community to continuously improve what we can offer through this service. Feedback is always welcome, feel free to get in touch with me at &lt;a href="mailto:eventdata@crossref.org">eventdata@crossref.org&lt;/a>.&lt;/p></description></item><item><title>A Lustrum over the weekend</title><link>https://www.crossref.org/blog/a-lustrum-over-the-weekend/</link><pubDate>Mon, 26 Mar 2018 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/a-lustrum-over-the-weekend/</guid><description>&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/lustrum2.png" alt="image and meaning of lustrum" width="350px" />
&lt;/p>
&lt;/div>
&lt;p>The ancient Romans performed a purification rite (“lustration”) after taking a census every five years. The term &lt;a href="https://en.wikipedia.org/wiki/Lustrum" target="_blank">“lustrum”&lt;/a> designated not only the animal sacrifice (“suovetaurilia”) but was also applied to the period of time itself. At Crossref, we’re not exactly in the business of sacrificial rituals. But over the weekend I thought it would be fun to dive into the metadata and look at very high level changes during this period of time.&lt;/p>
&lt;p>Crossref provides the &lt;a href="https://www.crossref.org/dashboard">latest cumulative stats online&lt;/a>. We share news about the work we do along the way in the &lt;a href="https://www.crossref.org/blog">Crossref blog&lt;/a>, including periodic summaries such as the &lt;a href="https://doi.org/10.64000/hnk6j-p5q04" target="_blank">Executive Director’s 2017 end-of-year highlights&lt;/a> and the &lt;a href="https://www.crossref.org/operations-and-sustainability/annual-report">annual review&lt;/a>. But what follows is a brief and very informal survey of the population of inhabitants in the Crossref metadata-land for the current lustrum.&lt;/p>
&lt;h3 id="works-published">Works published&lt;/h3>
&lt;p>The first thing a census typically asks is population size. We know there are new records arriving each month with 95.7mil to date. And they do so at variable rates. But when the data is visualized, a rough yearly pattern emerges into view. (Data were collected on Mar 25, 2018; results are partial for this month.)&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/Jen blog chart.png" alt="works published by month" height="400px" width="650px" />
&lt;/p>
&lt;p>Each year brings with it a significant spike, an influx of new entrants, perhaps reflecting an increase in submissions at the end of the previous year. After January, volume drops down dramatically and gradually rises once more over the course of the year. We see smaller spikes at the March, June, and September mark. (Since this was a brief exercise, I did not dive into any formal research conducted on the nature of publishing cycles.)&lt;/p>
&lt;h3 id="metadata-coverage">Metadata Coverage&lt;/h3>
&lt;p>The next question is a look at how the population is broken up into different demographics. For this, I analyzed four key sub-populations of ORCID, funding information, license, abstract metadata. The following graph shows the percentage of new parties (i.e., works registered at Crossref containing these metadata) across four specific segments.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/metadata coverage.png" alt="Crossref metadata coverage" height="750px" width="650px" />
&lt;/p>
&lt;p>I ran &lt;a href="https://gist.github.com/karthik/7e7875af0ecaa4327d3d61f550de94e0" target="_blank">Karthik Ram’s script&lt;/a> which employed &lt;a href="https://github.com/ropensci/rcrossref" target="_blank">rOpenSci’s r client&lt;/a> for the &lt;a href="https://github.com/CrossRef/rest-api-doc/" target="_blank">Crossref REST API&lt;/a>. Data are based on publication date rather than deposit date and represent all updates to the metadata record for the baseline view.&lt;/p>
&lt;p>The census graph shows extensive empty space on the top half, indicating there is ample room for continual growth in these communities. The ORCID population is expanding the fastest, followed by license and funding. Abstracts are a minority group and quite visibly needs a population boost here in Crossref-land.&lt;/p>
&lt;p>This view does not capture the percentages across record types nor does it take into account the differential rate of growth between record types (e.g., journal article, book, report, conference proceeding, dissertation, dataset, component, posted content, peer review) as the Crossref corpus has grown. While ORCID, funding, and license information are available for all full record types (viz., excludes components), this matters for abstracts. Abstracts are part of the metadata schema of all relevant record types. This excludes those which do not apply: dataset, component, and peer reviews. All things considered though, the relative impact on the total percentage of metadata deposited (or not deposited) is miniscule given the small sums for these works.&lt;/p>
&lt;h3 id="calling-the-real-demographers--cartographers">Calling the real demographers &amp;amp; cartographers&lt;/h3>
&lt;p>This mini-pseudo-lustrum was the result of a few hours of play. The graphs have raised more questions than answers. We welcome more serious and earnest efforts to dive into the metadata and conduct a more detailed, reliable investigation on the size, distribution and composition of the population through our &lt;a href="http://github.com/CrossRef/rest-api-doc" target="_blank">REST API&lt;/a>. Next month, we will roll out reports on metadata coverage based on individual members.&lt;/p>
&lt;p>This “play” census came out of a session with Karthik Ram, one of the founders of &lt;a href="https://ropensci.org/" target="_blank">rOpenSci&lt;/a>, as we talked about struggle to build better tools for researchers. (rOpenSci is an exciting and influential non-profit that builds open source software for research with a community of users and developers and educates scientists about transparent research practices.) With each round of cocktails, it became clear that a critical subset of the issues boiled down to the problem of limited information about research publications. Why, that is what Crossref does! Indeed. Publishers register their content with Crossref and provide the metadata about the works they publish.&lt;/p>
&lt;p>Over the past few years, we have been working with our members to broaden the coverage of the metadata as well as improve their metadata quality. This issue is not exclusive to Crossref - &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a> rallies stakeholders across the research enterprise to push for change together.&lt;/p>
&lt;p>To represent the full breadth and depth of the scholarly communications enterprise, Crossref aims to capture the richness of what our members publish through the content they register. So publishers, powerfully represent your services and make sure &lt;a href="https://www.crossref.org/services/content-registration/">your metadata is complete and correct&lt;/a> for discovery systems, indexing platforms, research evaluation systems, analytics tools, and the great number of Crossref metadata consumers far and wide.&lt;/p></description></item><item><title>How we use Crossref metadata</title><link>https://www.crossref.org/blog/how-we-use-crossref-metadata/</link><pubDate>Mon, 26 Mar 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/how-we-use-crossref-metadata/</guid><description>&lt;p>Bruce Rosenblum, CEO, Inera Incorporated talks about the work they are doing at Inera, and how they’re using our metadata as part of their workflow.&lt;/p>
&lt;h3 id="can-you-tell-us-little-bit-about-inera-and-yourself">Can you tell us little bit about Inera, and yourself&lt;/h3>
&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Bruce.jpg" alt=“Bruce Rosenblum CEO Inera" height="150px" width="150px" class="img-responsive" />
&lt;/div>
&lt;p>I’ve always been fascinated by the intersection of publishing and technology. At Inera I help scholarly and technical publishers improve their workflows through technology, and build editorial and XML software solutions to improve the publication workflow. I lead the development teams for our eXtyles and Edifix products, and I also participate in community projects: co-authoring the original NLM DTD suite, developing the Crossref Metadata Deposit Schema in 2001, and serving for 8 years on the NISO board. I continue to work on JATS and BITS development, and I co-chair the NISO STS Working Group. Before joining Inera, I developed publishing technology such as an Apple II Word processor for Chinese in 1981, and early micro-computer desktop publishing systems in the late 1980s.&lt;/p>
&lt;p>At Inera, we develop and license the eXtyles family of Word-based editorial and XML software tools (including eXtyles, eXtyles NLM, eXtyles STS, and eXtyles SI) as well as Edifix, an online bibliographic reference solution. eXtyles and Edifix allow users to automate the most time-consuming aspects of document publication. Publishers of scholarly journals and books, standards, and government documents around the world rely on our software solutions to drive efficient, effective publishing workflows.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Inera.png" alt=“Inera logo" height="200px" width="250px" class="img-responsive" />
&lt;/div>
&lt;p>Inera and Crossref have collaborated since 2001, and we jointly won the &lt;a href="https://web.archive.org/web/20161105110057/https://www.nepcoawards.com/2014-winner-videos.html" target="_blank">2014 NEPCo Award&lt;/a> for the ongoing symbiotic relationship between our organisations.&lt;/p>
&lt;h3 id="what-problem-is-your-software-and-service-trying-to-solve">What problem is your software and service trying to solve?&lt;/h3>
&lt;p>Publishers receive manuscripts from authors who have deep knowledge of their subject matter but are sometimes not expert writers and rarely expert users of Microsoft Word. Our eXtyles and Edifix solutions are designed to help publishers rapidly and accurately prepare these manuscripts for publication by automating a lot of technical and editorial cleanup, then producing high-quality JATS and BITS XML.&lt;/p>
&lt;p>Within eXtyles and Edifix, we have sophisticated algorithms that heuristically parse bibliographic references, copyedit them automatically to a publisher’s editorial style, and then link them to Crossref and PubMed. These features eliminate a lot of repetitive detail copy editing work so that human editors can focus on higher-level editing tasks, and they produce more accurate bibliographies that include online links, with a fraction of the work that it would take to look up, check, and correct each reference manually.&lt;/p>
&lt;h3 id="how-are-you-using-crossref-metadata-at-inera">How are you using Crossref Metadata at Inera?&lt;/h3>
&lt;p>Simply stated, we use Crossref metadata in our products to ensure that bibliographic reference lists are as complete, correct, and up to date as possible at the point of publication.&lt;/p>
&lt;p>Both eXtyles and Edifix use Crossref metadata to improve reference lists. Our reference processing module pulls apart references to journal articles, books, book chapters, conference papers, and standards, applies elements based on the JATS reference model, and then reconstructs them according to the editorial style chosen by the user (e.g., AMA, APA, MLA, or a custom-configured style to meet customers’ requirements).&lt;/p>
&lt;p>Crossref metadata is used for two primary purposes. First, we query the Crossref database to obtain DOI links for journal articles, books, conferences, and other types of references. This link lookup helps our customers fulfill their Crossref membership obligations and helps ensure that researchers get appropriate credit for citations of their work. Second, we use the metadata obtained from Crossref to improve the accuracy of author-supplied reference entries.&lt;/p>
&lt;h3 id="what-values-do-you-pull-from-our-apis">What values do you pull from our APIs?&lt;/h3>
&lt;p>The most important metadata value we retrieve is the DOI itself. Because the majority of bibliographic references in author manuscripts do not include DOIs, a key feature of our service is DOI retrieval. However, we use metadata well beyond the DOIs once we’ve matched a record. Even if a reference already has a DOI, we still do a traditional query, using the other available reference elements, to retrieve a DOI and compare the results to flag discrepancies. We’ve found that ~20% of author-supplied DOIs are incorrect, so correcting these discrepancies is one of myriad ways that our software uses Crossref metadata to improve references before publication.&lt;/p>
&lt;p>We also pull all of the other fields that are used to build a bibliographic reference—complete author list, title of the item, publication date, volume, pages, and so on—and use these elements to correct and improve the reference. By filling in missing data (e.g., volume, issue, and page numbers) and flagging discrepancies between author-supplied entries and Crossref metadata (e.g., author names in a different order, words missing or misspelled in an article or chapter title), our software assures publishers of a high-quality bibliography with minimal manual effort.&lt;/p>
&lt;p>Finally, we use Crossmark metadata to flag references that may have been corrected—or retracted—to inform editors when an item may need further attention from an author. Did the author knowingly cite a retracted article? If not, does that change the science of the paper citing that retracted item?&lt;/p>
&lt;h3 id="have-you-built-your-own-interface-to-extract-this-data">Have you built your own interface to extract this data?&lt;/h3>
&lt;p>Yes, we’ve built our own tools to query Crossref’s APIs. In 2002, we used the old “piped-query” API to submit elements of journal references, but we outgrew this API because it returned too many false positive results and missed other DOIs that were correct, and because we wanted to query Crossref for DOIs to other reference types (e.g., books, conference papers, reports) as well as journals. We switched to XML queries in 2006, and the result was a huge improvement in the quality and quantity of DOI links for our customers.&lt;/p>
&lt;p>But just moving to XML queries still wasn’t good enough. Eight years ago, we wanted to improve DOI retrieval of non-journal items like reports, and we found that the existing Crossref APIs didn’t provide what we needed. So we collaborated with Crossref CTO Chuck Koscher to create the author–title query as an extension to regular XML queries. The result was a dramatic improvement in our ability to retrieve DOIs to non-journal items. The author–title query was a precursor to Crossref’s current metadata APIs, and it continues to serve us well.&lt;/p>
&lt;h3 id="how-often-do-you-extract-or-query-data">How often do you extract or query data?&lt;/h3>
&lt;p>All the time! Our customers are located all around the world in more than 25 countries on six continents, so Crossref metadata queries from our software are happening continually, at any time of the day or night, seven days a week, and even on holidays.&lt;/p>
&lt;p>There are two other important ways that our software interacts with Crossref APIs every day. First, Crossref’s &lt;a href="https://apps.crossref.org/simpleTextQuery" target="_blank">Simple Text Query&lt;/a> (STQ) service, which is used by smaller publishers to meet their Crossref requirement to add DOIs to their reference lists, was built using Inera’s reference parsing engine. In this case, our software runs on Crossref’s servers and is an integral part of the Crossref ecosystem.&lt;/p>
&lt;p>Second, to test our products, we run a comprehensive automated quality assurance process every night that tests all aspects of our software and ensures day-over-day stability. When we added Crossref linking functionality in 2003, we began running several thousand Crossref queries per night, looking for consistency in our software’s results. A few months later, we noted an unexpected change: a reference that had previously returned a DOI failed to link! We contacted Crossref about the “lost” DOI, and upon investigation, Crossref discovered that in the process of redepositing 20,000 DOIs, the publisher had accidentally inverted author surnames and given names in all of those records.&lt;/p>
&lt;p>Crossref immediately recognized the value of Inera’s automated testing, and its ability to unearth such errors, to Crossref and its members. Over time, the number of DOIs we test nightly has grown to tens of thousands, so we’ve worked with Crossref to develop an automated reporting and analysis process that makes detecting and resolving the issues highlighted by our internal testing more efficient.&lt;/p>
&lt;p>The co-development of the author–title query API and the sharing of our nightly test suite results are just two examples that highlight the nature of the Inera–Crossref relationship: it’s characterized by technology integration, bidirectional information exchange, and innovative problem solving.&lt;/p>
&lt;h3 id="what-are-the-future-plans-for-inera">What are the future plans for Inera?&lt;/h3>
&lt;p>We’re constantly working to improve eXtyles and Edifix and to develop new and innovative ways to help our customers. Here are a few examples:&lt;/p>
&lt;p>Two years ago, at the peak of the Zika outbreak, we received an urgent request from the World Health organisation to help them create DOIs for articles that had been submitted but not yet peer reviewed (&lt;a href="http://www.who.int/bulletin/online_first/zika_open/en/" target="_blank">see Zika Open&lt;/a>). Within 16 hours of their request, we developed, tested, and deployed updated software that allowed WHO to publish information vital to researchers, including DOIs, within hours of receipt.&lt;/p>
&lt;p>With respect to Crossref APIs, we plan to integrate the Crossref query features to retrieve DOIs for standards that are deposited by organisations like IEEE, ASTM, and BSI. We also plan to expand our linking and verification capabilities to incorporate newer reference types such as preprints and data citations.&lt;/p>
&lt;p>More broadly, we’re very excited about the eXtyles Metadata Extraction technology we released last year. This technology can be used by online submission systems and preprint servers to automatically extract key metadata elements (title, abstract, authors, affiliations, keywords) from author-submitted manuscripts, no matter what “style” the author may have used to format the manuscript. This technology is already in-use at Aries Systems to simplify the submission process. We’re looking forward, soon, to seeing this technology used by preprint servers and institutional repositories to automate the collection and deposit of preprint metadata to Crossref.&lt;/p></description></item><item><title>New Board Chair Paul Peters shares our mission</title><link>https://www.crossref.org/blog/new-board-chair-paul-peters-shares-our-mission/</link><pubDate>Thu, 22 Mar 2018 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/new-board-chair-paul-peters-shares-our-mission/</guid><description>&lt;p>At the end of last year, Paul Peters&amp;mdash;CEO of our member &lt;em>Hindawi&lt;/em>&amp;mdash;became the new Chair of the Crossref Board. The announcement was made in Singapore at our first LIVE Annual ever held in Asia. I caught up with Paul back in London, UK, where he answered a few questions about what he hopes to bring to the Board, and to the Crossref community as a whole.&lt;/p>
&lt;h3 id="1-congratulations-paul-how-delighted-were-you-to-be-voted-in-by-your-fellow-board-members-old-and-new">1. Congratulations, Paul. How delighted were you to be voted in by your fellow board members, old and new?&lt;/h3>
&lt;p>That’s a rather leading question ;-)&lt;/p>
&lt;p>Seriously though, I am incredibly honored to have been chosen to lead Crossref’s board at such an important point in the organisation’s development. The current composition of the board is as diverse as it has ever been, which is essential if the board is to represent Crossref’s global membership, as well as the wide range of business and publication models that our members use. This diversity on the board will help to support Crossref’s aim of encouraging innovation in scholarly communication by providing open infrastructure that benefits all researchers.&lt;/p>
&lt;h3 id="2-youve-been-on-our-board-for-nine-years-how-has-it-changed-in-that-time-and-what-should-the-board-be-most-proud-of">2. You’ve been on our board for nine years. How has it changed in that time and what should the board be most proud of?&lt;/h3>
&lt;p>When I first joined the board, Crossref was at the stage where you had successfully established persistent reference linking as a standard practice among scholarly journal publishers. And, although this was the original purpose of Crossref, it was by no means an easy task, as it required a diverse group of competing publishers to work together in building shared infrastructure for the common good.&lt;/p>
&lt;p>In the nine years since then, I’ve seen Crossref continue to build on this core foundation of technological expertise, the trust and goodwill of its membership, and the diverse skills of its small staff. The result has been the development of important new services (such as Similarity Check) that have become an essential component of the scholarly communications system, support new record types (including both preprints and peer review reports) that are becoming increasingly important in the move towards an Open Science future, and the expansion of Crossref’s membership to include almost 10,000 members of all shapes and sizes from 114 countries around the world.&lt;/p>
&lt;p>With regard to the board itself, I have been pleased to see Crossref undergo important changes that have provided greater transparency in the organisation&amp;rsquo;s governance, as well as more active participation from its members. Last year Crossref put out an open call to invite members to put themselves forward for consideration on the board. As a result of holding its first contested election, Crossref saw a dramatic increase in the engagement of members in the election process. Not only is this important for ensuring that the board is truly representative of the diverse membership, but it will also help to actively engage a larger pool of members in the important work that lies ahead.&lt;/p>
&lt;h3 id="3-what-do-you-see-as-crossrefs-strengths-and-role">3. What do you see as Crossref’s strengths and role?&lt;/h3>
&lt;p>I believe that Crossref’s past and future success relies on two key strengths. The first is its ability to bring together a large and disparate community of organisations and individuals to create tools and services that no single organisation could develop alone. People sometimes overlook how successful Crossref has been in building the trust and support of a diverse group of stakeholders, however I believe this has been an essential ingredient in the organisation’s success and will be essential as Crossref develops new tools and services in the years to come.&lt;/p>
&lt;p>Crossref’s other core strength has been the expertise, passion, and ambitious vision of its staff, many of whom I have had the pleasure of knowing since my first days on the board. The ability to develop and maintain real-time infrastructure serving millions of end-users, while simultaneously developing new products and services, requires an incredible range of skills from technology and product development, to marketing, community outreach, and customer support. Moreover, as a growing non-profit organisation with thousands of members around the world, and an international staff working across national boundaries, Crossref’s legal, financial, and administrative support team have also been an essential ingredient in the organisation’s success.&lt;/p>
&lt;h3 id="4-weve-grown-beyond-just-the-publisher-constituency-to-libraries-scholars-and-platforms-and-tools-which-constituencies-do-you-see-us-involving-next">4. We’ve grown beyond just the publisher constituency to libraries, scholars, and platforms and tools, which constituencies do you see us involving next?&lt;/h3>
&lt;p>Over time I believe that Crossref’s constituency will grow to cover all organisations that contribute to the creation and dissemination of scholarly research, although I recognize this may take several years to achieve.&lt;/p>
&lt;p>In the short-term, I believe that research funders are the most important stakeholder group for Crossref to focus on, for the following reasons:&lt;/p>
&lt;ul>
&lt;li>First, with the development of the open Funder Registry and the addition of structured funding data to the Crossref registry, Crossref has already become an important provider of open infrastructure for research funders.&lt;/li>
&lt;li>Second, as the result of several key initiatives within the Open Science movement I believe that research funders will play an increasingly important role in determining how scholarly research outputs are created, shared, evaluated, and re-used. Therefore, the active involvement of research funders in Crossref’s membership and governance is essential.&lt;/li>
&lt;li>Finally, I believe that there is an important opportunity for Crossref to enable a range of new services across the research lifecycle by providing persistent identifiers and structured metadata research grants. Given how critical grants are within the research process, I’m amazed by the lack of infrastructure to monitor, evaluate, and build upon grants as first-class research objects. In many cases there is minimal, if any, public information about the grants that have been awarded by a particular funder. Even in cases where such data is available, it is rarely structured in a way that enables it to be searched or analyzed across multiple funding agencies.&lt;/li>
&lt;/ul>
&lt;p>In the absence of a community-driven, non-profit organisation like Crossref to provide this infrastructure on an open basis, there is a risk that funders will be forced to rely on proprietary alternatives that limit how this information is used and by whom. Fortunately there are already efforts underway within Crossref to develop both the tools and the community of funders that will be required to create persistent identifiers and structured metadata for grants and other forms of research funding.&lt;/p>
&lt;h3 id="5-what-are-the-biggest-challenges-facing-crossref">5. What are the biggest challenges facing Crossref?&lt;/h3>
&lt;p>I believe that Crossref’s greatest challenge will be to continue to bring together a diverse group of stakeholders, some of whom are regularly at odds with each other, in order to collaborate in developing tools and services for the benefit of the research community.&lt;/p>
&lt;p>As challenging as it has been for Crossref to bring together competing publishers to build the shared services that we have all come to depend on, I believe that keeping the community focused towards a common goal will become even more challenging as that community expands to include funders, universities, and the many other organisations involved in the scholarly communications ecosystem. However, I think that Ed and his team have as good of a chance of succeeding as anyone could hope for, which is why I am so excited about Crossref’s future in the years ahead.&lt;/p>
&lt;h3 id="6-how-will-things-change-with-you-as-chair-youll-be-busier-i-guess-but-enough-about-you-already-what-can-we-expect-as-staff-and-board">6. How will things change with you as Chair? You’ll be busier I guess. But enough about you already, what can we expect as staff and Board?&lt;/h3>
&lt;p>As my first order of business I’ll be getting rid of Crossref’s corporate jet, lavish office spaces, and executive chef. &lt;code>&amp;lt;/sarcasm&amp;gt;&lt;/code>.&lt;/p>
&lt;p>On a more serious note, my hope is that as Chair I will be able to work with the other members of the board in supporting Crossref’s staff as they work to achieve the ambitious goals we have set out during the past year. I believe that Crossref’s board members and staff are aligned in the desire to significantly expand the range of services Crossref provides, as well as the communities it serves.&lt;/p>
&lt;p>The board still has an important role to play in shaping the organisation’s strategic vision, while giving staff ample space to execute on this vision. Said another way, I hope to enable some lively strategic conversations among the board while making sure that we don’t get in the way of Ed and his team once it’s time to put ideas into action.&lt;/p>
&lt;p>On a more personal note, I hope to be a good sounding board for Ed on any issues that he faces, either internally or externally, on the road ahead. Given my own experience in leading a growing organisation through a period of significant change, I know how important it can be to have someone to talk to when difficult challenges arise, which they inevitably will. I hope that I can be a good advisor&amp;mdash;and also a good friend&amp;mdash;to Ed as he leads Crossref into the exciting future that lies ahead.&lt;/p>
&lt;h3 id="ginny-thanks-paul-i-know-ed-will-miss-his-personal-chef-but-we-look-forward-to-working-with-you-too">Ginny: Thanks, Paul. I know Ed will miss his personal chef&amp;hellip; but we look forward to working with you too!&lt;/h3></description></item><item><title>Crossref LIVE in Tokyo</title><link>https://www.crossref.org/blog/crossref-live-in-tokyo/</link><pubDate>Thu, 08 Mar 2018 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/crossref-live-in-tokyo/</guid><description>&lt;p>What better way to start our program of LIVE locals in 2018 than with a trip to Japan? With the added advantage of it being Valentine’s Day, it seemed a good excuse to share our love of metadata with a group who feel the same way!&lt;/p>
&lt;p>We’ve worked closely with the Japan Science and Technology Agency (JST) since 2002, and were delighted when they agreed to collaborate with us on a LIVE event at their offices in Tokyo.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/val-day.png" alt=“Valentines Day message" height="150px" width="400px" class="img-responsive" />
&lt;/div>
&lt;p>With help from the team at JST, we welcomed around 80 attendees—a mix of editors, publishers and enthusiastic metadata users—who all enjoyed the talks from our guest speakers, Nobuko Miyari from ORCID, Ritsuko Nakajima from JST and Tatsuji Tomioka from Kyoto University Library (who talked about the use of DOIs and metadata in their research information repository, named KURENAI).&lt;/p>
&lt;p>Vanessa Fairhurst and I also took part in the days program and talked about the different services that Crossref offers. With many of our members in Japan already well-versed in DOIs, we placed the focus of our sessions around the importance of accurate, complete metadata, and new record types (such as peer reviews and preprints). We also discussed our new community initiatives such as the &lt;a href="https://blog.datacite.org/next-steps/" target="_blank">OI project&lt;/a>, &lt;a href="https://doi.org/10.64000/5cfh1-1wa10" target="_blank">identifiers for grants&lt;/a> and &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata2020&lt;/a>.&lt;/p>
&lt;p>We’d like to say a big thank you to Kentaro Kinoshita from JST for his help with organizing the event. We’d also like to thank the excellent team of translators who assisted us greatly by relaying the content to the audience in Japanese—being able to offer information and take questions in English and Japanese was an invaluable part of the day.&lt;/p>
&lt;h3 id="any-questionsbr">Any questions?&lt;br>&lt;/h3>
&lt;p>One day is never quite enough to cover all things Crossref, so we were happy to answer questions from the enthusiastic audience:&lt;/p>
&lt;p>&lt;strong>What metadata is required to register peer review reports with Crossref?&lt;/strong>&lt;br>
To answer this we pointed them to this informative blog on &lt;a href="https://support.crossref.org/hc/en-us/articles/115005255706-Peer-Reviews" target="_blank">peer reviews&lt;/a>.&lt;/p>
&lt;p>&lt;strong>How can I find information on using your REST API?&lt;/strong>&lt;br>
This is a great starting point, and most information can be found here &lt;a href="https://api.crossref.org" target="_blank">https://api.crossref.org&lt;/a>&lt;/p>
&lt;p>&lt;strong>Is the forthcoming Metadata Manager tool something I can use?&lt;/strong>&lt;br>
Yes! We hope it will make it much easier for you to deposit good metadata—and if you are in interested in participating in our open beta, &lt;a href="mailto:feedback@crossref.org">let us know&lt;/a>.&lt;/p>
&lt;p>We’re looking forward to continuing to collaborate with JST, and are really grateful for their help in working with us to make the event go so smoothly. Thank you to those who joined us, and we hope to see you again soon.&lt;/p>
&lt;br></description></item><item><title>Are you having an identity crisis?</title><link>https://www.crossref.org/blog/are-you-having-an-identity-crisis/</link><pubDate>Fri, 23 Feb 2018 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/are-you-having-an-identity-crisis/</guid><description>&lt;p>We work with a huge range of organisations in the scholarly communications world—publishers, libraries, universities, government agencies, funders, publishing service providers, and researcher services providers—and you each have different relationships with us.&lt;/p>
&lt;p>Some of you are members who create and disseminate your own content, register it with us by depositing metadata, and help steer our future by voting in our annual board elections. Some of you don&amp;rsquo;t vote in our board elections but do play a vital role by registering content on members&amp;rsquo; behalf.&lt;/p>
&lt;p>And some of you make use of the metadata provided by our members and so perform a key service by getting their published works out into the world, but don&amp;rsquo;t vote in our board elections.&lt;/p>
&lt;p>After a recent review we realized our Member Types weren&amp;rsquo;t completely clear, and may in fact have led to a bit of confusion. With this in mind, we put some thought into their revision and have now given them the clarity they were missing. Over the course of this year we&amp;rsquo;ll be checking that everyone is in the right group and getting the appropriate support based on your Member Type.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: left">Former Member Type name&lt;/th>
&lt;th style="text-align: left">New Member Type name&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: left">Publisher&lt;/td>
&lt;td style="text-align: left">Member&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Sponsoring Publisher&lt;/td>
&lt;td style="text-align: left">Sponsoring Member&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Represented Member&lt;/td>
&lt;td style="text-align: left">Sponsored Member&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Sponsoring Entity&lt;/td>
&lt;td style="text-align: left">Sponsoring organisation&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Sponsored Member&lt;/td>
&lt;td style="text-align: left">Sponsored organisation&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Affiliate&lt;/td>
&lt;td style="text-align: left">Metadata User&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">Service Provider&lt;/td>
&lt;td style="text-align: left">(No change to Member Type name)&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;br>&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;blockquote>
&lt;p>So, what&amp;rsquo;s different?&lt;/p>
&lt;/blockquote>
&lt;p>The changes we&amp;rsquo;ve made help to differentiate if you&amp;rsquo;re a voting member (and therefore have a say in our future direction), or not. If you are a voting member, you&amp;rsquo;ll now have the word &amp;ldquo;Member&amp;rdquo; in your title—and if you&amp;rsquo;re not—you won&amp;rsquo;t, as the diagram below indicates.&lt;br>
&lt;img src="https://www.crossref.org/images/blog/Sugar-labels-2.png" alt="membership map" width="800px" />&lt;br>
Where there are two organisations with a sponsorship arrangement in place (with a sponsoring party and a sponsored party), one of you will always be the voting party, and the other will be non-voting. These partnerships will therefore always contain one &amp;ldquo;Member&amp;rdquo; and one &amp;ldquo;organisation&amp;rdquo;.&lt;/p>
&lt;p>We&amp;rsquo;ve also stopped using the word &amp;ldquo;Publisher&amp;rdquo; in our Member Types as not all our members consider themselves to be publishers — sometimes you&amp;rsquo;re libraries, funders, scholars, repositories, etc. As it says in one of our &lt;a href="https://www.crossref.org/truths">truths&lt;/a> &amp;ldquo;Come one, come all: we define publishing broadly. If you communicate research and care about preserving the scholarly record, join us.&amp;rdquo;&lt;/p>
&lt;h2 id="how-do-you-know-if-you-are-a-voting-member">How do you know if you are a voting member?&lt;/h2>
&lt;p>&lt;br>Voting members fall into three Member Types: Members, Sponsoring Members and Sponsored Members.&lt;/p>
&lt;p>This means you are organisations who create and disseminate content, and therefore contribute to the scholarly record. Some of you register your content directly with us and some via a third party, but the key thing is that you&amp;rsquo;re adding to our metadata records, and as such can have a say in the future direction of Crossref. Voting members can also take metadata out of our system — and many of you do — however, your key relationship with us is as a member who is contributing to the scholarly record.&lt;/p>
&lt;p>It also means you have &lt;a href="https://www.crossref.org/membership/terms">obligations&lt;/a> to keep your records up-to-date, and maximize links with other Crossref members.&lt;/p>
&lt;h2 id="whats-the-difference-between-the-voting-categories">What&amp;rsquo;s the difference between the voting categories?&lt;/h2>
&lt;p>&lt;strong>Members&lt;/strong>&lt;br>
As a Member (formerly known as Publishers), you create and disseminate content, register your own content with us (usually under a single prefix), and are able to vote in our board elections. You pay an annual fee based on your publishing revenue, plus Content Registration fees for all new DOIs.&lt;/p>
&lt;p>&lt;strong>Sponsoring Members&lt;/strong>&lt;br>
As a Sponsoring Member (formerly known as a Sponsoring Publisher), you do everything a standard member does, but as well as registering your own content under your own DOI prefix, you also register content on behalf of other, smaller publishers (ideally using separate DOI prefixes so the metadata is accurate and can be reported on separately and relied upon downstream).&lt;/p>
&lt;p>When you vote, you vote on behalf of the organisations that you sponsor. You pay an annual fee based on your publishing revenue/expenses plus the publishing revenue of your sponsored organisations, and you also pay Content Registration fees for all new metadata records registered. You look after deposit billing for the organisations you sponsor, and provide technical and language support for them.&lt;/p>
&lt;p>Some of our larger members may be thinking that you should be in this Member Type - and you&amp;rsquo;re probably right! During the course of 2018 we&amp;rsquo;ll be working with you to transition you over to Sponsoring Membership. If you are a Member who is thinking of becoming a Sponsoring Member, &lt;a href="mailto:member@crossref.org">please get in touch&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Sponsored Members&lt;/strong>&lt;br>
As a Sponsored Member (formerly known as a Represented Member), you create and disseminate content, but you don&amp;rsquo;t register your content directly with us—this is done by your Sponsoring organisation.  Because of this it&amp;rsquo;s you, the one who creates and disseminates the content and thus contributes to the scholarly record, who can vote.&lt;/p>
&lt;h2 id="how-do-you-know-if-you-are-a-non-voting-member">How do you know if you are a non-voting member?&lt;/h2>
&lt;p>If you haven&amp;rsquo;t spotted yourself yet, you may be one of the non-voting organisations we work with — these fall into four Member Types: Sponsoring organisations, Sponsored organisations, Service Providers and Metadata Users.&lt;/p>
&lt;p>As a non-voting organisation, you may still register content with us, but you either don&amp;rsquo;t create and disseminate the content yourselves, or you&amp;rsquo;re already represented by a voting organisation. Non-voting organisations also include those whose only relationship with us is to make use of our metadata.  &lt;/p>
&lt;h2 id="whats-the-difference-between-the-non-voting-categories">What&amp;rsquo;s the difference between the non-voting categories?&lt;/h2>
&lt;p>&lt;strong>Sponsoring organisations&lt;/strong>&lt;br>
As a Sponsoring organisation (formerly known as a Sponsoring Affiliate), you don&amp;rsquo;t create and disseminate content yourself, but you do register content with us on behalf of your Sponsored Members — preferably using distinct DOI prefixes for each member. You also often look after their administrative, technical, billing and language support needs. You&amp;rsquo;ll pay us an annual fee based on the publishing revenue of all your members, and Content Registration fees for all new DOIs. You might charge the members you work with for this service. You also provide support and promotion of our services and activities.&lt;/p>
&lt;p>&lt;strong>Sponsored organisations&lt;/strong>&lt;br>
As a Sponsored organisation (formerly known as a Sponsored Member), you do create and disseminate content yourself, but you don&amp;rsquo;t register your own content. This is done by a Sponsoring Member, and as they have the member vote, you can&amp;rsquo;t have one too. For this reason, we&amp;rsquo;ve removed the word &amp;ldquo;Member&amp;rdquo; from your title, to make your voting position clearer. Of course, your Sponsoring Member needs to represent your needs too when voting, so make sure you make them known!&lt;/p>
&lt;p>&lt;strong>Service Providers&lt;/strong>&lt;br>
As a Service Provider you work closely with our members to collect and/or host and/or deposit metadata on their behalf. Unlike a Sponsoring organisation however you don&amp;rsquo;t get involved with administrative, technical, billing or language support for the members you work with, but you&amp;rsquo;re a key partner in helping them deposit quality metadata and contribute effectively to the scholarly record. During 2018 we&amp;rsquo;ll be working more closely with you to help you collaborate with us more effectively.&lt;/p>
&lt;p>&lt;strong>Metadata Users&lt;/strong>&lt;br>
Metadata Users (formerly known as Affiliates), you are the organisations who don&amp;rsquo;t register content with us, but you do make use of it through our free and open APIs and search interfaces, or our paid-for Metadata Plus service, giving you access to a premium version of both the REST API and OAI-PMH. Of course all members can get metadata out of our systems as well, but if the only thing you do with us is get metadata out, then you&amp;rsquo;re a Metadata User.&lt;/p>
&lt;h3 id="dont-know-which-member-type-you-are">Don&amp;rsquo;t know which Member Type you are?&lt;/h3>
&lt;p>We&amp;rsquo;re hoping these new names make it clearer, but if you&amp;rsquo;re still confused, please get in touch with our &lt;a href="mailto:member@crossref.org">membership specialist&lt;/a>&lt;/p></description></item><item><title>Wellcome explains the benefits of developing an open and global grant identifier</title><link>https://www.crossref.org/blog/wellcome-explains-the-benefits-of-developing-an-open-and-global-grant-identifier/</link><pubDate>Fri, 16 Feb 2018 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/wellcome-explains-the-benefits-of-developing-an-open-and-global-grant-identifier/</guid><description>&lt;p>Wellcome, in partnership with Crossref and several research funders including the NIH and the MRC, are looking to pilot an initiative in which new grants would be assigned an open, global and interoperable grant identifier. Robert Kiley (Open Research) and Nina Frentrop (Grants Operations) from the Wellcome explain the potential benefits this would deliver and how it might work.&lt;/p>
&lt;h3 id="introduction">Introduction&lt;/h3>
&lt;p>As a funder we want to be able to track the outputs that arise from research we have funded. Currently, this is not as straightforward as it should be as researchers do not always cite their funder correctly, let alone their specific grant number. And, even when they do this accurately, because every funder users its own set of grant IDs, these numbers are not unique. For example, we can use EuropePMC to look up outputs from &lt;a href="http://europepmc.org/grantfinder/results?gid=207467&amp;amp;page=1" target="_blank">grants with ID 207467&lt;/a>, and see that there is one Wellcome grant with this number, and one from the European Research Council.&lt;/p>
&lt;p>To resolve such issues, we need a system in which every grant awarded is giving a unique, global ID. Global IDs are already assigned to articles &lt;a href="https://search.crossref.org/" target="_blank">DOIs&lt;/a>, people &lt;a href="https://orcid.org/" target="_blank">ORCIDs&lt;/a> and even biological materials &lt;a href="https://scicrunch.org/resources" target="_blank">RRIDs&lt;/a>. It is time for the funder community to follow suit.&lt;/p>
&lt;h3 id="benefits-of-an-open--global-grant-identifier-system">Benefits of an open &amp;amp; global grant identifier system&lt;/h3>
&lt;p>Once implemented, it would make the identification of grant-specific research outputs more accurate, whilst simultaneously reducing the burden on the researcher.&lt;/p>
&lt;p>Currently, researchers are typically asked to manually disclose what outputs have arisen from their funding. In the future, such disclosures would be fully automated. We are already seeing how publishers&amp;mdash;who collect ORCIDs through their manuscript submission system&amp;mdash;automatically update the author’s ORCID record with details of new publications. If a global ID system for grants was developed, publishers and repositories could also require these to be disclosed on submission, and this data could then programmatically be passed to researcher assessment platforms, like &lt;a href="https://www.researchfish.net/" target="_blank">ResearchFish&lt;/a>.&lt;/p>
&lt;h3 id="how-would-it-work">How would it work?&lt;/h3>
&lt;p>For a global grant ID system to work, two things need to happen. First, when a new grant is awarded, that grant must be assigned a unique ID. For the pilot project we plan to contract with Crossref who will register a unique ID, (a DOI) for every grant we register.&lt;/p>
&lt;p>Second, every DOI must resolve to a publicly accessible web site, where information about that grant is disclosed. Again, for this pilot we will almost certainly use the Europe PMC &lt;a href="http://europepmc.org/grantfinder" target="_blank">Grants Finder Repository&lt;/a>, as we already make grant data available from this resource.&lt;/p>
&lt;p>A &lt;a href="https://www.crossref.org/working-groups/funders/">working group&lt;/a> has been established to determine precisely what metadata we should make available, but it is likely to include the name of the grant holder, title and value of the award, a short abstract, along with the name of the funder and the unique ID.
Mindful that funders already assign IDs to the grants they award and that any changes to this process may be problematic (and certainly time consuming), the plan is to register a DOI which still makes use of the existing grant ID. To make it unique however, the ID will be prefixed with a funder identifier, most likely the &lt;a href="https://www.crossref.org/services/funder-registry/">Funder Registry ID&lt;/a>.&lt;/p>
&lt;h3 id="next-steps">Next steps&lt;/h3>
&lt;p>Whilst the metadata working group is focusing on the technical aspects of the pilot, a separate “governance group” is examining how a funder might become a member of Crossref and what the business model for registering grant DOIs should be.&lt;/p>
&lt;p>In parallel with this, a pilot “proof of concept” initiative is under way, and we anticipate that by autumn 2018 we will have registered DOIs for a defined cohort of grants.&lt;/p>
&lt;p>Ultimately we want to get to a situation where every grant has a unique ID, which can then be unambiguously linked to the all outputs – articles, data, code, materials, patents etc. – which arise from it.&lt;/p>
&lt;p>And, if every funder were to adopt such a system and expose their grant metadata in a consistent, machine-readable way, it would facilitate the development of applications to help funders get a greatly enhanced picture of the global funding landscape, which in turn would inform strategic planning and resource allocation.&lt;/p>
&lt;h2 id="thanks-to-guest-authors">Thanks to guest authors:&lt;/h2>
&lt;p>Robert Kiley, Head of Open Research, Wellcome [&lt;a href="http://orcid.org/0000-0003-4733-2558" target="_blank">ORCID: 0000-0003-4733-2558&lt;/a>]
Nina Frentrop, Grants Information &amp;amp; Systems Manager, Wellcome&lt;/p>
&lt;hr>
&lt;p>Please read &lt;a href="https://www.crossref.org/community/funders">Crossref for funders&lt;/a> for context, and contact &lt;a href="mailto:feedback@crossref.org">Ginny Hendricks&lt;/a> at Crossref with any questions.&lt;/p></description></item><item><title>Meet the members, Part 2 (with protocols.io)</title><link>https://www.crossref.org/blog/meet-the-members-part-2-with-protocols.io/</link><pubDate>Wed, 31 Jan 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/meet-the-members-part-2-with-protocols.io/</guid><description>&lt;p>Second in our &lt;em>Meet the members&lt;/em> blog series is Lenny Teytelman, co-founder and CEO of &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a>, who gives us a bit of insight into his background and why he started protocols.io, what the future plans for protocols.io are, and how they use and benefit from being a Crossref member.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/large-logo.png" alt=“protocols.io logo" height="150px" width="250px" class="img-responsive" />
&lt;/div>
&lt;h2 id="can-you-tell-us-a-little-bit-about-yourself-and-why-you-started-protocolsio">Can you tell us a little bit about yourself, and why you started protocols.io?&lt;/h2>
&lt;p>I am a computational and experimental biologist, and it was my struggle with correcting a published research method as a postdoctoral researcher at MIT that led me to co-found &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a>. I spent a year and a half correcting a single step of a research recipe. Instead of 1ul of a chemical, it needed 5, instead of a 15-minute incubation, it needed an hour. But this was a correction of something previously published, not a new method, so absurdly, it was not a result that I could publish. That means I got no credit for this year and a half, and more importantly, every other scientist using this recipe is either getting misleading results or has to waste 1-2 years rediscovering what I know—rediscovering something that I’d love to share, but have no easy way of doing so.&lt;/p>
&lt;p>So, I became obsessed with creating a central place where scientists can easily share and discover detailed research recipes. We’re open access, free-to-read and free-to-publish, with web &amp;amp; mobile apps that make these protocols dynamic and interactive.&lt;/p>
&lt;h2 id="what-problem-is-your-service-trying-to-solve">What problem is your service trying to solve?&lt;/h2>
&lt;p>Currently, methods sections of research papers are full of things like &amp;ldquo;we used a slightly modified version of the method reported in paperX&amp;rdquo;. Here are two examples:&lt;/p>
&lt;p>&lt;a href="https://twitter.com/dgonzales1990/status/953737802205794304" target="_blank">Tweet&lt;/a>&lt;/p>
&lt;p>We are working to increase reproducibility, by encouraging precise detailing of methods and then making it easy to keep these methods up-to-date, long after the paper is published. More broadly, our mission is to accelerate science by getting the detailed knowledge out of paper notebooks, and getting it out in months, instead of years.&lt;/p>
&lt;h2 id="tell-us-a-little-bit-about-what-you-publish-and-for-whom">Tell us a little bit about what you publish and for whom.&lt;/h2>
&lt;p>Both the content, and the audience for it, has been expanding recently. When we launched in 2014, the protocols were almost exclusively wetlab biology recipes. In 2015, we added support for computational workflows and began to see bioinformatics methods. More recently, thanks to the referrals from &lt;a href="http://journals.plos.org/plosone" target="_blank">PLOS ONE,&lt;/a> we&amp;rsquo;ve started to see protocols for human trials, medical devices, psychology, and more. About half a year ago, we changed our landing page form &amp;ldquo;Open Access Repository of Life Science Methods&amp;rdquo; to the more general &amp;ldquo;Open Access Repository of Research Methods&amp;rdquo;.&lt;/p>
&lt;p>The readership is also broadening, it’s no longer just professional researchers—we now have protocols and guidelines for undergraduate and high school students, instructions for citizen science projects, and even standard operating procedures for lab management. We&amp;rsquo;ve also been seeing more off-the-shelf use, with people sharing actual cooking recipes, and we recently began asking authors to classify whether they are sharing &amp;ldquo;research&amp;rdquo; or &amp;ldquo;non-research&amp;rdquo; instructions.&lt;/p>
&lt;h2 id="how-would-you-describe-the-value-of-being-a-crossref-member">How would you describe the value of being a Crossref member?&lt;/h2>
&lt;p>Without a doubt, we would be nowhere close to the adoption and sharing that we have now if we were not members of Crossref, registering DOIs for all public protocols. This is an absolute prerequisite for being included in author guidelines of journals, and we wouldn&amp;rsquo;t have grown in 2017 from two to over 200 journals that encourage authors to detail their recipes on &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a>.&lt;/p>
&lt;p>In addition to the benefit to &lt;a href="https://www.protocols.io" target="_blank">protocols.io,&lt;/a> there is a benefit to the scientists in terms of the quality control that Crossref ensures among the members. Much of this is behind the scenes and invisible to the researchers visiting &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a>.&lt;/p>
&lt;p>For example, in the beginning, we used to simply delete spam protocols. However, once we started issuing DOIs, we realized that we would be violating the Crossref requirements for minted DOIs if we simply trashed these. As a result, we had to build &amp;ldquo;retraction&amp;rdquo; functionality that allows us to take down content, put up a notice explaining the reason for removal, and keep the record so that the respective DOI continues to resolve. This is the correct way to handle removals of scientific content and it is Crossref that made us mature and improve the platform. (We&amp;rsquo;ve since had to use the retraction functionality at the request of scientists, and we&amp;rsquo;re glad we implemented it to comply with the Crossref requirements.)&lt;/p>
&lt;p>Another example is the resolution report that we routinely get from Crossref, showing us which DOIs are broken. It highlights errors for us and helps us to investigate, identify, and prevent problems with the journal partners.&lt;/p>
&lt;h2 id="what-do-you-see-as-the-value-of-crossref-beyond-protocolsiohttpswwwprotocolsio">What do you see as the value of Crossref, beyond &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a>?&lt;/h2>
&lt;p>As I argued &lt;a href="https://www.youtube.com/watch?v=Ta2M_gkgeKI&amp;amp;list=PLe_-TawAqQj16f8DRwCADugYIaaXN_fZO&amp;amp;index=9" target="_blank">in my talk&lt;/a> at the annual Crossref conference, we are finally in a position to connect scientists with the knowledge they need, automatically. Almost every scientist uses a reference manager such as Mendeley, Zotero, Paperpile, etc. to manage their literature bibliography. In turn, that means that in theory, when something happens to a paper or research objects connected to the paper (retraction, correction, update to the dataset accompanying the manuscript), the reference management platforms could notify every scientist who has that paper in their library.&lt;/p>
&lt;p>The problem is that it isn&amp;rsquo;t feasible for every service like Mendeley to connect to every repository and publisher to track events connected to every paper. This is where Crossref is positioned so powerfully. By collecting the metadata linking papers to the research objects, Crossref can be the single source that the platforms need to query to see if there is news for their users related to any specific published paper. (More of this from my talk was captured really nicely in &lt;a href="http://musingsaboutlibrarianship.blogspot.co.uk/2017/11/scholarly-maps-recommenders-reference.html" target="_blank">this&lt;/a> blog post by the SMU librarian Aaron Tay.&lt;/p>
&lt;h2 id="what-are-the-future-plans-for-protocolsiohttpswwwprotocolsio">What are the future plans for &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a>?&lt;/h2>
&lt;p>Expanding &lt;a href="https://www.protocols.io" target="_blank">protocols.io&lt;/a> content to include chemistry workflows is an important goal for 2018-19.&lt;/p>
&lt;p>We are also eager to start on connecting the protocols directly to the devices that the scientists use. Imagine you need to spin your cells for 30 seconds, but the centrifuge is accidentally set for 3 minutes. Our app should be able to connect to the equipment and alert the researcher to the wrong setting, asking if they are sure they want to proceed.&lt;/p>
&lt;br></description></item><item><title>No longer lost in translation</title><link>https://www.crossref.org/blog/no-longer-lost-in-translation/</link><pubDate>Tue, 30 Jan 2018 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/no-longer-lost-in-translation/</guid><description>&lt;p>More than 80% of the record breaking 1,939 new members we welcomed in 2017 were from non-English speaking countries, and as our member base grows in its diversity, so does the need for us to share information about Crossref and its services in languages appropriate to our changing audience.&lt;/p>
&lt;p>So, early last year we started translating our service videos into six other languages: French, Spanish, Brazilian Portuguese, Chinese, Japanese, and Korean. However, the process of translating from one language to another is not always straightforward—but it is super important—as some things can get seriously lost in translation&amp;hellip;&lt;/p>
&lt;p>|&lt;img src="https://www.crossref.org/images/blog/dog.png" height="250px" width="300px"/>|&lt;img src="https://www.crossref.org/images/blog/foot.png" height="250px" width="300px"/>|&lt;img src="https://www.crossref.org/images/blog/luggage.png" height="250px" width="300px"/>|&lt;/p>
&lt;p>In order to avoid such translation tragedies we created a foolproof process to get the text of the service videos translated and ready for production. (I am, I realize, exposing myself here—see what I did there? —by using a word like foolproof.)&lt;/p>
&lt;p>First we produced the videos in English, setting the content to animation and sound (AKA audio visual or A/V to us marketing types), then we brought in a translation company to turn the English content into the six other languages. So far so good. However, as the above examples demonstrate, the &lt;em>meaning&lt;/em> of words can get lost in translation. Also, what Crossref does isn’t the easiest thing in the world to translate (&lt;em>are&lt;/em> there words for &lt;em>metadata delivery&lt;/em> and &lt;em>full-text XML&lt;/em> in Japanese?), so we added another stage to the process.&lt;/p>
&lt;p>Next, we sent the translated scripts and their English counterparts to some very helpful international members who, as part of the scholarly research community, understand the complexities of our work and are therefore qualified to check that the text had remained &lt;em>in context.&lt;/em>&lt;/p>
&lt;p>Unfortunately, it hadn’t, as the text came back from them heavily edited. After round two of the editing process, the revised text was applied to the videos—but just to be 100% sure, we sent the completed videos back to our helpful international members for a final run through.&lt;/p>
&lt;p>Multiply this painstaking process by 48 videos, throw numerous time zones into the mix and you can see why it took us nearly 12 months to complete them.&lt;/p>
&lt;p>And so, it is with great pleasure that today we launch all eight of our service videos in six languages, just click the links below, and enjoy! Découvrez-les!​ ¡Que los disfrutes! Aproveite! 请欣赏! どうぞお楽しみください！ 즐거운 시간 되세요!&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: center">&lt;/th>
&lt;th style="text-align: center">&lt;/th>
&lt;th style="text-align: center">&lt;/th>
&lt;th style="text-align: center">View videos by language&lt;/th>
&lt;th style="text-align: center">&lt;/th>
&lt;th style="text-align: center">&lt;/th>
&lt;th style="text-align: center">&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/playlist?list=PLe_-TawAqQj2f2I-TevZcFchyhEAhkQ0g" target="_blank">English&lt;/a>&lt;/td>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/watch?v=hK3LAAfm1-U&amp;amp;list=PLe_-TawAqQj22lY2dikyWA3XCvmDaZcEV" target="_blank">French&lt;/a>&lt;/td>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/watch?v=G309-3KW7ok&amp;amp;list=PLe_-TawAqQj02nIuITrQdds9Vt8A2jKvm" target="_blank">Spanish&lt;/a>&lt;/td>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/watch?v=wI1peEvLINU&amp;amp;list=PLe_-TawAqQj37hN_S8Qice7DDB6cu1TPZ" target="_blank">Brazilian Portuguese&lt;/a>&lt;/td>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/watch?v=VXPCYulcEHs&amp;amp;list=PLe_-TawAqQj0zVsT6A3ym6HLMHAXMWORd" target="_blank">Simplified Chinese&lt;/a>&lt;/td>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/watch?v=IPvf4Zl2qLY&amp;amp;list=PLe_-TawAqQj05sOlOtYsV1uiBAydpvxKr" target="_blank">Japanese&lt;/a>&lt;/td>
&lt;td style="text-align: center">&lt;a href="https://www.youtube.com/watch?v=Q_yXjiinHG0&amp;amp;list=PLe_-TawAqQj2pHiy0XZRWctA-ac_hUcVx" target="_blank">Korean&lt;/a>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: center">English&lt;/td>
&lt;td style="text-align: center">français&lt;/td>
&lt;td style="text-align: center">español&lt;/td>
&lt;td style="text-align: center">português do Brasil&lt;/td>
&lt;td style="text-align: center">简体中文&lt;/td>
&lt;td style="text-align: center">日本語&lt;/td>
&lt;td style="text-align: center">한국어로&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;br>&lt;br>
&lt;em>We&amp;rsquo;d like to thank the following for their help in checking the video translations: Fabienne Meyers from IUCAP for the French versions, our very own resident translator Vanessa Fairhurst for the Spanish versions, Edilson Damasio from the University Library of Maringá for the Brazilian Portuguese versions, Guo Xiaofeng from Wanfang Data for the Chinese versions, Nobuko Miyairi from ORCID for the Japanese versions and Junghyo from Nurimedia and Jae Hwa Chang at infoLumi for the Korean versions.&lt;/em>&lt;/p>
&lt;hr></description></item><item><title>A year in the life of Crossref</title><link>https://www.crossref.org/blog/a-year-in-the-life-of-crossref/</link><pubDate>Tue, 23 Jan 2018 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/a-year-in-the-life-of-crossref/</guid><description>&lt;p>We are delighted to report that last year Crossref welcomed a record-breaking 1,939 new members and, because our member base is growing so rapidly in both headcount and geography&amp;mdash;with the highest number of new members joining from Asia&amp;mdash;we thought it was a good time to reiterate what Crossref is all about, as well as show off a little about the things we are proud to have achieved in 2017.&lt;/p>
&lt;blockquote>
&lt;p>What is Crossref?&lt;/p>
&lt;/blockquote>
&lt;p>We are an organisation that runs a registry of metadata and DOIs of course, but we are much more than that&amp;mdash;staff, board, working groups, and committees as well as a broad range of collaborators, users, and supporters in the wider scholarly communications community. Increasingly, our community includes new contributors like scholars, funders, and universities. Together, we are all working toward the same goal&amp;mdash;to enhance scholarly communications. Everything we do is designed to put scholarly content in context so that the content our members publish can be found, cited, used, and re-used.&lt;/p>
&lt;p>Here&amp;rsquo;s how we did that over the past year:&lt;/p>
&lt;h2 id="we-rallied-the-community">We rallied the community&lt;/h2>
&lt;p>Rallying the community is all about working together to forge new relationships and pave the way for future generations of researchers&amp;mdash;in 2017 we were closely involved with the launch of &lt;a href="http://www.metadata2020.org/" target="_blank">Metadata 2020&lt;/a>; a collaboration that advocates richer, connected, and reusable metadata for all research outputs.&lt;/p>
&lt;h2 id="we-tagged-and-shared-metadata">We tagged and shared metadata&lt;/h2>
&lt;p>To make sure that our APIs continue to have real, genuine utility, we introduced a new service called &lt;a href="https://www.crossref.org/news/2017-11-15-new-metadata-plus-service-launching/">Metadata Plus&lt;/a> in 2017 so that platforms and tools can leverage the power of our rich, immense database to increase the value and discoverability of content.&lt;/p>
&lt;h2 id="we-played-with-new-technology">We played with new technology&lt;/h2>
&lt;p>To keep pace with changes in the industry and stay true to &lt;a href="https://www.crossref.org/about/">our mission&lt;/a>, we often play with new technology with the goal of offering a bigger and better infrastructure. In 2017 we formed a working group and an advisory group for two new identifiers that will see this infrastructure increase; &lt;a href="https://doi.org/10.64000/xyp08-prx66" target="_blank">organisation IDs&lt;/a> which became ROR, and &lt;a href="https://doi.org/10.64000/5cfh1-1wa10" target="_blank">Grant IDs&lt;/a> which became the Crossref Grant Linking System.&lt;/p>
&lt;h2 id="we-made-new-tools-and-services">We made new tools and services&lt;/h2>
&lt;p>Combining our own knowledge and experience with input from the wider community, in 2017 we were able to launch in Beta a new and exciting tool called &lt;a href="https://doi.org/10.64000/cbcne-j1d05" target="_blank">Event Data&lt;/a>. Event Data provides a record of where research has been bookmarked, linked, recommended,  shared, referenced, commented on etc, beyond publisher platforms&amp;mdash;which is a great example of putting scholarly research in a wider context.&lt;/p>
&lt;hr>
&lt;p>So, while richer metadata (including more record and resource types) remains our focus 2018 and beyond, we also hope that as we become a bigger and more global community we can move beyond the basics and work together to make sure that DOIs, are not the be-all and end-all when they are, in fact, just the beginning.&lt;/p></description></item><item><title>Bridging Identifiers at PIDapalooza</title><link>https://www.crossref.org/blog/bridging-identifiers-at-pidapalooza/</link><pubDate>Mon, 22 Jan 2018 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/bridging-identifiers-at-pidapalooza/</guid><description>&lt;p>Hello from sunny Girona! I&amp;rsquo;m heading to &lt;a href="https://pidapalooza.org/" target="_blank">PIDapalooza&lt;/a>, the Persistent Identifier festival, as it returns for its second year. It&amp;rsquo;s all about to kick off.&lt;/p>
&lt;p>One of the themes this year is &amp;ldquo;bridging worlds&amp;rdquo;: how to bring together different communities and the identifiers they use. Something I really enjoyed about PIDapalooza last year was the variety of people who came. We heard about some &amp;ldquo;traditional&amp;rdquo; identifier systems (at least, it seems that way to us): DOIs for publications, DOIs for datasets, ORCIDs for researchers. But, gathered in Reykjavik, under dark Icelandic skies, I met oceanographic surveyors assigning DOIs to drilling equipment, heard stories of identifiers in Chinese milk production and consoled librarians trying navigate the identifier landscape.&lt;/p>
&lt;p>In addition to the usual scholarly publishing and science communication crowd, it was encouraging to see a real diversity of people from different walks of life encounter the same problems and work on them them collaboratively. The thing that brought everyone together was the understanding that if we&amp;rsquo;re going to reliably reference things &amp;ndash; be they researchers, articles they write, or ships they sail &amp;ndash; we need to give them identifiers. And those identifiers should be as good as possible: persistent, resolvable, interoperable.&lt;/p>
&lt;h2 id="who-cares-about-pids">Who cares about PIDs?&lt;/h2>
&lt;p>At the turn of the century, a handful of publishers came together to create Crossref (or &lt;em>CrossRef&lt;/em> as it was in those days). It was becoming increasingly important to be able to store references in machine-readable format, but publishers were faced with a problem. If an author wants to cite an article, they&amp;rsquo;ll do so without worrying who published it. This means they needed an identifier system that worked across all publishers. Thus the Crossref DOI was born.&lt;/p>
&lt;p>Today we&amp;rsquo;re heading toward 10,000 members, and the thing that they have in common is that they all produce scholarly content and care about how it&amp;rsquo;s referenced. As a trade association, we effectively act on behalf of all of our members, allowing them to register their content, share metadata and links, and assign an identifier.&lt;/p>
&lt;p>But there&amp;rsquo;s a whole world out there. Publications have never been the be-all and end-all of scholarship, but they have been a backbone. But more and more scholarship, especially science, is done outside journal publishing. Sometimes it&amp;rsquo;s done on platforms that care about the scholarly record as much as publishers. And sometimes it isn&amp;rsquo;t.&lt;/p>
&lt;h2 id="the-twitterverse">The Twitterverse&lt;/h2>
&lt;p>Lots of people use Twitter to talk about science. Some are scientists, some aren&amp;rsquo;t. Scientific articles are linked from news reports and discussed on blogs. Gone are the days of scholarly articles being cited only by other scholarly articles. We see links coming in from all over the place. And, although not all of this can be counted as the &amp;ldquo;scholarly record&amp;rdquo;, some of it &lt;em>could&lt;/em> be.&lt;/p>
&lt;p>The barrier-to-entry for journals publishing means that science journals contain only science articles. The barrier-to-entry for Twitter means that anyone can, and does, publish there. My Twitter feed is finely balanced between bibliometrics research, marine biology and pictures of snow leopards with Japanese captions. I don&amp;rsquo;t understand all of it, but I like looking at the pictures.&lt;/p>
&lt;p>Back in the days when the only references to scholarly publications were from other scholarly publications, it was easy to keep track of those references. When an article was published, its references went into a citation database. This happened because the publisher considered this important.&lt;/p>
&lt;p>But Twitter, the publisher of tweets, doesn&amp;rsquo;t care. It is used for a huge variety of communications and although some people choose to use it to engage in scholarship, we&amp;rsquo;re just a blip on their radar. The same goes for Reddit, a platform that describes itself as &amp;ldquo;the front page of the Internet&amp;rdquo;. There are communities engaged in scientific discussions, but Reddit doesn&amp;rsquo;t feel the need to publish its bibliographic references.&lt;/p>
&lt;p>Nor should it.&lt;/p>
&lt;h2 id="bridging-those-who-care-with-those-who-dont">Bridging those who care with those who don&amp;rsquo;t&lt;/h2>
&lt;p>The barrier-to-entry for contributing to scientific discussions has lowered, meaning that the role of more non-specialist platforms has increased.&lt;/p>
&lt;p>I imagine that there are other communities out there who have their own concerns about the web. Maybe there are model train enthusiasts who want to keep track of every reference to a particular model. Or political commentators who want to keep track of how certain politicians and policies are discussed. As the scholarly community embraces new platforms for communicating, we should recognise that we are part of a broader universe of people using those platforms for more diverse reasons.&lt;/p>
&lt;p>Gone are the days when the only way to reply to an article was by writing a letter to the editor. But also gone are the days when you could guarantee that your letter wouldn&amp;rsquo;t appear next to cat pictures (assuming you weren&amp;rsquo;t writing to the &lt;a href="https://journals.sagepub.com/home/jfm" target="_blank">Journal of Feline Medicine &amp;amp; Surgery&lt;/a>). As a specialist community cohabiting online spaces with non-specialists, it falls to us to do whatever we need to adapt that space and make it our own. In our case, this means recording bibliographic references as and where they occur.&lt;/p>
&lt;p>Something like this happened once before. As traditional publishers went online, they created Crossref to build and maintain the necessary infrastructure. We&amp;rsquo;re acting on behalf of the community again to collect links from non-traditional sources. Because we can&amp;rsquo;t go to platforms like Twitter and say &amp;ldquo;please deposit your references&amp;rdquo;, we&amp;rsquo;re doing the opposite. We identify a platform, then work out how to scrape its content and extract links.&lt;/p>
&lt;h2 id="working-at-scale">Working at scale&lt;/h2>
&lt;p>So we&amp;rsquo;re broadening out the universe of references that we would like to track from &amp;ldquo;traditional scholarly publishing&amp;rdquo; to &amp;ldquo;the entire web&amp;rdquo;. There are four broad challenges inherent in this, and we think that Crossref infrastructure is the right way to meet them.&lt;/p>
&lt;p>The first challenge is physically finding the links. Because social media platforms aren&amp;rsquo;t specialised for scholarly publishing, they don&amp;rsquo;t have the same mechanisms in place for capturing bibliographic references. This means that we have to do it ourselves by scraping webpages for references. As the standard-bearer for scholarly PIDs, we think we can do a good job of this.&lt;/p>
&lt;p>The second challenge is doing this at the scale of the web. Because we might, in theory, find a link on any webpage, there is a literally infinite number of publishing platforms. From big websites like BBC News down to tiny blogs run out of a bedroom. It would be impossible to partner with each of these individually. The way to solve this is to run a centralised service which goes out and contacts as many sources as possible. This role is a collaborative one. Our system is open to inspection, suggestions and contributions from the community.&lt;/p>
&lt;p>The third challenge is the sheer number of publishers. Because they all register content with us, we are in good position to track their DOIs. In addition to that, every member of Crossref publishes content on their own platform, and has their own set of websites to track. We monitor our members&amp;rsquo; websites and create a central list of domains that we look for. If this wasn&amp;rsquo;t done centrally, each publisher would have to run its own web crawlers and perform the same work, only to filter out their own links.&lt;/p>
&lt;p>The fourth challenge is how to get all that data to the public. Even if every publisher were able to run their own infrastructure, it would make it very difficult to consume. Through Crossref metadata services, publishers have built a system where you can look up metadata and link to articles without worrying who published them. We think that the same approach should apply to this new link data.&lt;/p>
&lt;p>For these reasons, we&amp;rsquo;re building Crossref Event Data: a system that monitors as many platforms as we can think of, and brings them into one place, and serves the whole community.&lt;/p>
&lt;h2 id="building-bridges">Building bridges&lt;/h2>
&lt;p>If you&amp;rsquo;ve been &lt;a href="https://www.crossref.org/authors/joe-wass/">following along&lt;/a> you&amp;rsquo;ll know that &lt;a href="https://doi.org/10.64000/3jrqv-85z62" target="_blank">my last metaphor was the process of refining crude oil&lt;/a>. I like metaphors, and mixing them. After all, you can&amp;rsquo;t mix a good metaphor without breaking a few eggs into the mixing bowl. Today&amp;rsquo;s metaphors are bridges. And not just one.&lt;/p>
&lt;h2 id="bridge-1-pids-and-urls">Bridge 1: PIDs and URLs&lt;/h2>
&lt;p>In the world of Persistent Identifiers, we&amp;rsquo;re quite good at linking. organisations like Crossref, DataCite and ORCID run separate systems but we work together to record and exchange links. But the web is different. There&amp;rsquo;s no single organisation in control and there are many organisations working to catalogue it. Event Data is our offering: bridging the web with our identifiers.&lt;/p>
&lt;h2 id="bridge-2-scholarly-link-providers">Bridge 2: Scholarly link providers&lt;/h2>
&lt;p>Of course, some platforms and systems &lt;em>do&lt;/em> care about persistence and Persistent Identifiers. Event Data is an open platform, and we&amp;rsquo;re collaborating with a few providers to publish links.&lt;/p>
&lt;p>We&amp;rsquo;ve partnered with &lt;a href="https://www.lens.org/lens/" target="_blank">The Lens&lt;/a> to include Patent to DOI references. We&amp;rsquo;re working with F1000 to include links between reviews and articles. Hopefully we&amp;rsquo;ll see more organisations use Event Data to publish their links.&lt;/p>
&lt;h2 id="bridge-3-crossref--datacite">Bridge 3: Crossref / DataCite&lt;/h2>
&lt;p>Event Data is a collaborative project between DataCite and Crossref. When Crossref Registered Content contains a reference to a DataCite DOI we put it into Event Data. DataCite do the same in reverse. This means that Event Data contains a huge number of article - dataset links.&lt;/p>
&lt;h2 id="bridge-4-traditional-discussions-vs-new-ones">Bridge 4: Traditional discussions vs new ones&lt;/h2>
&lt;p>At each moment, scholarly discussions are happening in the literature, on various social media platforms and on the web at large. They are all talking about the same thing, but are spread out. Event Data collects links wherever we find them and brings them into one place. By doing this we hope we can help bring those conversations together.&lt;/p>
&lt;h2 id="bridge-5-bridging-bibliometricians-and-altmetricians-to-data-sources">Bridge 5: Bridging bibliometricians and altmetricians to data sources&lt;/h2>
&lt;p>Capturing links from social media to published literature underpins the field of altmetrics. By collecting this data and making it available under open licenses, we bring it to altmetrics researchers. We don&amp;rsquo;t provide metrics, but we do provide the data points that can form the basis for research.&lt;/p>
&lt;p>Without infrastructure for collecting data, researchers would have to perform the same work over and over again. Because the data is all open, we allow datasets to be republished, reworked and replicated.&lt;/p>
&lt;h2 id="bridge-6-bridging-the-evidence-gap">Bridge 6: Bridging the Evidence Gap&lt;/h2>
&lt;p>Running Event Data involves collecting a lot of data - gigabytes per day - and boiling it down into hundreds of thousands of individual Events per day. People consuming the data may want to do further boiling down. At every point of the process we record the input data that we were working from, the internal thought process of the system, and the Events that were produced. A researcher can use the Evidence Logs to trace through the entire process that led to an Event.&lt;/p>
&lt;p>We&amp;rsquo;re a bridge from websites and social media to data consumers. But we take the role very seriously, and there&amp;rsquo;s nothing hidden. A &lt;a href="https://en.wikipedia.org/wiki/Zhangjiajie_Glass_Bridge" target="_blank">glass bridge&lt;/a>, you could say.&lt;/p>
&lt;h2 id="interesting-challenges">Interesting challenges&lt;/h2>
&lt;p>It&amp;rsquo;s not all plain sailing. There are a few challenges along the way to collecting this data which anyone who wanted to collect this kind of information would face. By collecting it in a central place and running an open platform we can solve each problem once, and improve our process as a community.&lt;/p>
&lt;p>One problem is choosing what to include. We include any link that we find from a non-publisher website. That means that invariably some of the links are from spam. This problem isn&amp;rsquo;t new: we see low-quality articles being published in traditional journals from time to time. We try to include all of the data we can find and pass it onto consumers. They might want to whitelist certain sources, or they may want all of the data because they&amp;rsquo;re trying to study scholarly spam. We have decided to provide data as Events, which strike the balance between atomicity and usefulness.&lt;/p>
&lt;p>Another, which I talked about at last year&amp;rsquo;s PIDapalooza, is how we track article landing pages. Read &lt;a href="https://doi.org/10.64000/jw4t5-5yt89" target="_blank">the blog post&lt;/a>, the &lt;a href="https://www.eventdata.crossref.org/guide/data/ids-and-urls/" target="_blank">user guide&lt;/a> or hop in a time machine if you&amp;rsquo;re interested.&lt;/p>
&lt;h2 id="the-thing-about-bridges">The thing about bridges&amp;hellip;&lt;/h2>
&lt;p>&amp;hellip; is that they help people get where they&amp;rsquo;re going. With a few notable exceptions, they&amp;rsquo;re not the main attraction. We play a humble part in scholarly publishing, helping collect and distribute metadata. Most of what we do goes unseen, and helps people create tools, platforms and research. Event Data is an API, and whilst we hope people will build all kinds of things with it, including altmetrics tools, we&amp;rsquo;re not making another metric.&lt;/p>
&lt;h2 id="pidapalooza">PIDapalooza&lt;/h2>
&lt;p>All of which brings me to my talk, which I&amp;rsquo;m giving on Wednesday: &lt;a href="https://pidapalooza18.sched.com/event/Cwmw/event-data-bridging-persistent-and-not-so-persistent-identifiers" target="_blank">Bridging persistent and not-so-persistent identifiers&lt;/a>. I would tell you about it, but there isn&amp;rsquo;t much more left to say.&lt;/p>
&lt;p>If you want to find out more, we&amp;rsquo;re currently in Beta, and open for business. Head over to the &lt;a href="https://www.eventdata.crossref.org/guide/index.html" target="_blank">User Guide&lt;/a> to get started!&lt;/p></description></item><item><title>Crossref ambassador program</title><link>https://www.crossref.org/blog/crossref-ambassador-program/</link><pubDate>Thu, 04 Jan 2018 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/crossref-ambassador-program/</guid><description>&lt;p>We have listened to the feedback from you, our members, and you&amp;rsquo;ve told us of a need for local experts to provide support in your timezone and language, and to act as liaisons with the Crossref team. You&amp;rsquo;ve also asked for an increased number of training events both online and in person close to you, and for more representatives from Crossref at regional industry events.&lt;/p>
&lt;p>We want to make sure we can reach members around the globe, and as such, a wide team of people is required who are knowledgeable in the languages, cultures, and member needs in a variety of countries. This is why we&amp;rsquo;re launching our Ambassador Program.&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/crossref-ambassadors-logo-rgb.jpg" alt="image of Crossref Ambassadors Logo" width="500px" />
&lt;/p>
&lt;blockquote>
&lt;p>What are Crossref Ambassadors?&lt;/p>
&lt;/blockquote>
&lt;p>Crossref Ambassadors are volunteers who work within the international scholarly research community in a variety of different roles such as librarians, researchers or editors to name but a few. They are individuals who are well connected, value the work that Crossref does and are passionate about improving scholarly communication and the role Crossref plays within this system.&lt;/p>
&lt;p>Some of the activities our ambassadors will undertake:&lt;/p>
&lt;ul>
&lt;li>Staying up-to-speed with Crossref developments, for example, by attending webinars and maintaining regular check-ins with the Crossref team.&lt;/li>
&lt;li>Engaging in the online community platform; providing feedback, joining in discussions and helping other members to resolve issues posted to the group.&lt;/li>
&lt;li>Writing blog posts, or contributing to newsletters.&lt;/li>
&lt;li>Participating in beta-testing of new products and services.&lt;/li>
&lt;li>Helping with local LIVE events; for example, providing recommendations on speakers or venues, helping with logistics and presenting at the event.&lt;/li>
&lt;li>Helping with the translation of Crossref material and content into local languages.&lt;/li>
&lt;li>Running webinars on different Crossref services in local languages.&lt;/li>
&lt;li>Running training sessions locally with Crossref members&lt;/li>
&lt;li>Representing Crossref at relevant industry events&lt;/li>
&lt;/ul>
&lt;p>It is important that our ambassadors enjoy the work they are doing with Crossref by contributing in ways in which they feel comfortable, according to their interests, skills and the time they feel they want to contribute. For this reason, the role comes with a high degree of flexibility.&lt;/p>
&lt;p>We see our ambassadors as valued members of the Crossref network and will provide them with:&lt;/p>
&lt;ul>
&lt;li>A dedicated contact for any upcoming news, or to share ideas, queries or concerns.&lt;/li>
&lt;li>Help with content for proposal calls, presentations, training and written articles.&lt;/li>
&lt;li>Crossref materials and giveaways (plus ambassador-branded materials).&lt;/li>
&lt;li>Personal endorsement via Crossref&lt;/li>
&lt;li>Training on Crossref services and on wider relevant skills as necessary.&lt;/li>
&lt;li>First look at new Crossref developments&lt;/li>
&lt;li>Certification from Crossref on ambassador and training status.&lt;/li>
&lt;li>Personal ambassador logo or badge for use on email, website and profile on the Crossref online community forum (launching later this year).&lt;/li>
&lt;/ul>
&lt;p>Crossref Ambassadors will become an increasingly key part of the Crossref community - the first port of call for updates or to test out new products or services, and the eyes and ears within the local academic community - working closely with Crossref to make scholarly communications better for all.&lt;/p>
&lt;h3 id="meet-our-first-ambassadors">Meet our first ambassadors!&lt;/h3>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/jae-hwa-chang-sq.jpg" alt="image of Jae Hwa Chang" width="300px" />
&lt;/div>
&lt;p>&lt;strong>Jae Hwa Chang&lt;/strong> has been working at infoLumi as a manuscript editor in academic journals since 2010. Prior to joining infoLumi, she was a medical librarian at International Vaccine Institute and was engaged in medical information management and service. Her interests in information control and management started when she was doing work indexing newspaper articles at JoonAng Ilbo. She was fascinated by Crossref’s persistent efforts and contribution in developing new services to “make content easy to find, cite, link, and assess” and has been introducing them to Korean scholarly publishing communities. Jae earned her MA in Library and Information Science from Ewha Womans University, Korea. She serves as a vice chair of the Committee on Planning and Administration at the Korean Council of Science Editors. In her spare time, she enjoys traveling and experiencing new cultures.&lt;/p>
&lt;p>장재화는 2010년부터 인포루미에서 의학학술지 원고편집을 담당하고 있다. 그전에는 국제백신연구소 도서관에서 사서로 일하면서 의학정보와 학술지논문 유통에 관심을 가졌으며, 그에 앞서서는 중앙일보에서 신문기사 DB 색인을 하면서 정보관리와 활용에 대해 연구하였다. 정보의 검색, 평가, 활용을 위해 꾸준히 새로운 서비스를 개발하는 Crossref에 매력을 느꼈고, 그 서비스들을 한국의 학술지 출판 관계자들에게 소개해왔다. 이화여자대학교에서 문헌정보학을 전공하였고, 한국과학학술지편집인협의회 기획운영위원회 부위원장을 맡고 있다. 여행과 다양한 문화 체험을 즐긴다.&lt;/p>
&lt;br>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/edilson-demasio-sq.jpg" alt="image of Edilson Demasio" width="300px" />
&lt;/div>
&lt;p>&lt;strong>Edilson Demasio&lt;/strong> has been a librarian since 1995, with PhD. in Information Science at Federal University of Rio de Janeiro-UFRJ/IBICT. He works in the Department of Mathematics Library of State University of Maringá-UEM, Brazil. With 20 years&amp;rsquo; experience in scientific metadata and publishing. His expertise is various including knowledge in  scientific communication, Crossref services, research integrity, misconduct prevention in science, publishing on Latin America, biomedical information, OJS-Open Journal Systems, Open Access journals, scientific journals quality and indexing, and scientific bibliographical databases. He is enthusiastic about presenting and disseminating information about Crossref services to his community in Brazil and working within the community, exchanging ideas and experience.&lt;/p>
&lt;p>Eu sou bibliotecário desde 1995, Doutor em Ciência da Informação pela Universidade Federal do Rio de Janeiro-UFRJ/convênio IBICT. Eu trabalho na Biblioteca do Departamento de Matemática da Universidade Estadual de Maringá-UEM. Com 20 anos de experiência em metadados científicos e editoração, entre outros. Meus conhecimentos são diversos sobre comunicação científica, cientometria, metadados XML, serviços Crossref, integridade em pesquisa, prevenção de más condutas na ciência, editoração, editoração na América Latina, informação biomédica, OJS-Open Journal Systems, revistas de Acesso Aberto, qualidade de periódicos científicos e indexação, bases de dados bibliográficas. Gosto de disseminar meu conhecimento a outras regiões e pessoas e de trabalhar em comunidade junto as instituições e outros países, de planejar novas apresentações, de trocar experiências como palestrante ou convidado e trabalhar na disseminação do conhecimento para todos.&lt;/p>
&lt;br>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/lauren-lissaris-sq.jpg" alt="image of Lauren Lissaris" width="300px" />
&lt;/div>
&lt;p>&lt;strong>Lauren Lissaris&lt;/strong> has dedicated much of her career to the dissemination of valuable content on a robust platform. She takes pride in her achievements as the Digital Content Manager at JSTOR. &lt;a href="http://www.jstor.org/" target="_blank">JSTOR&lt;/a> provides access to more than 10 million academic journal articles, books, and primary sources in 75 disciplines. JSTOR is part of &lt;a href="http://www.ithaka.org/" target="_blank">ITHAKA&lt;/a>, a not-for-profit organisation helping the academic community use digital technologies to preserve the scholarly record and to advance research and teaching in sustainable ways.&lt;/p>
&lt;p>Lauren successfully works with all aspects of journal content to effectively assist publishers with their digital content. This includes everything from XML markup, Content Registration/multiple resolution, and HTML website updates. Lauren has been involved in hosting current content on JSTOR since the program&amp;rsquo;s launch in 2010. She continues to collaborate with organisations to successfully contribute to the evolution of digital content. The natural spread from journals to books has set Lauren up for developing and planning the book Content Registration program for JSTOR. She is a member of the Crossref Books Advisory Group and she helped successfully pilot Crossref’s new Co-access book deposit feature.&lt;/p>
&lt;p>If you want to find out more information on the Ambassador Program, or you would like to express your interest in being an ambassador, you can either contact us at [feedback@crossref.org](mailto:feedback@crossref.org?subject=Ambassador Program) or complete our &lt;a href="https://www.crossref.org/community/ambassadors/">online form&lt;/a>.&lt;/p></description></item><item><title>Metadata and integrity: the unlikely bedfellows of scholarly research</title><link>https://www.crossref.org/blog/metadata-and-integrity-the-unlikely-bedfellows-of-scholarly-research/</link><pubDate>Thu, 14 Dec 2017 00:00:00 +0000</pubDate><author>Damian Pattinson</author><guid>https://www.crossref.org/blog/metadata-and-integrity-the-unlikely-bedfellows-of-scholarly-research/</guid><description>&lt;p>I was invited recently to present parliamentary evidence to the House of Commons Science and Technology Select Committee on the subject of Research Integrity. For those not familiar with the arcane workings of the British Parliamentary system, a Select Committee is essentially the place where governments, and government bodies, are held to account. So it was refreshing to be invited to a hearing that wasn’t about Brexit.&lt;/p>
&lt;p>The interest of the British Parliament in the integrity of scientific research confirms just how far science’s ongoing “reproducibility crisis” has reached. The fact that a large proportion of the published literature cannot be reproduced is clearly problematic, and this call to action from MPs is very welcome. And why would the government not be interested? At stake is the process of how new knowledge is created, and how reliable that purported knowledge is.&lt;/p>
&lt;p>The other issue driving this overview of research practices are the cases of deliberate fraud and wrongdoing that have recently created headlines (e.g., the &lt;a href="https://www.nature.com/news/stap-1.15332" target="_blank">STAP papers&lt;/a> concerning the reprogramming of stem cells). While these cases are clearly dramatic outliers, they nevertheless serve to diminish public confidence in scholarly research and the findings that come out of this enterprise.&lt;/p>
&lt;p>As with most inquiries, the question quickly boiled down to: who is to blame? As Bill Grant MP asked me directly, “Where does the responsibility lie?”&lt;/p>
&lt;p>My answer was lifted from an article by Ginny Barbour and colleagues in &lt;em>F1000Research&lt;/em> this November (&lt;a href="https://doi.org/10.12688/f1000research.13060.1" target="_blank">https://doi.org/10.12688/f1000research.13060.1&lt;/a>): publishers are responsible for the integrity of the published literature, while institutions and employers are ultimately responsible for the conduct of their staff. Misconduct entails intent, usually to deceive the reader into believing a conclusion that the researcher wishes them to believe. But journal editors can never know, and are not in a position to investigate, whether a researcher has &lt;em>deliberately&lt;/em> falsified their data.&lt;/p>
&lt;p>However, there are things that publishers can do to ensure high standards of integrity. Much of this involves making a study’s authors publish as much information about what they have done as possible - the more the reader can see of how data were generated, the more that reader can trust the findings communicated in the published article.&lt;/p>
&lt;p>Article metadata directly supports this function. It provides structure and transparency to information pertaining to ethics and integrity. And because metadata is independent of the main article, it can be readable even if the article itself is locked behind a paywall.&lt;/p>
&lt;p>Crossref already provides metadata that can demonstrate the integrity of published articles. The metadata collected on 91+ million scholarly works across publishers and disciplines is open and freely accessible to all. Bibliographic information, for example, allows readers to see who the authors of the article are, where they are from, and what else they have published. Similarly, funding data allows readers to identify potential conflicts of interest, for example if the funder has commercial or political affiliations. Even if the reader cannot see the conflict of interest statement (or if the journal has not provided one), they can use the funding statement to surface potential conflicts.&lt;/p>
&lt;p>And if they wanted, publishers could provide additional metadata to add still more transparency to the research process. Ethical approval by institutional review boards, for example, could be captured, and any protocol numbers traced back to the original ethics committee approval. At present the process of ethical approval varies from country to country, and from institution to institution. Encouraging authors and journals to deposit information on the approval process would both demonstrate the high ethical standards the author is working to, and also improve the standards themselves, since institutions would have to encode their approval processes in a way that is understandable to others. This could pave the way to significantly higher international ethical standards, all through a simple addition to the indexed metadata underlying the scholarly literature.&lt;/p>
&lt;blockquote>
&lt;p>One key recommendation that I and many others made to the Committee was, in short, &amp;ldquo;show your work&amp;rdquo;. As a researcher, that means showing your data. As a publisher, that means showing what checks you have done. In both cases, metadata can help.&lt;/p>
&lt;/blockquote>
&lt;p>A major issue that publishers and researchers can – and should – address is the provision of actual scientific data. Most papers, today, present only the end results of the authors’ (often quite extensive) analyses. The case for sharing data is an obvious one - many recent cases of misconduct could have been identified earlier, or even avoided altogether, if editors and readers had had access to underlying datasets.&lt;/p>
&lt;p>With images, a requirement to submit raw images alongside the edited figures would dramatically reduce the cases of manipulation that are rife in the literature (studies suggest up to 20% of papers have some kind of inappropriate figure manipulation, with around 1 in 40 papers showing manipulation beyond that which can be expected to be a result of error). Similarly, providing the numbers that a paper’s analyses are based upon would allow readers to fully assess if datasets are distributed as would be expected through random sampling, and, if they choose, to determine if the data are sufficient to support the statistical inferences made in the paper. The Crossref schema – by providing unique identifiers to data citations - makes this link between data and paper possible. (See the recent blog post on the &lt;a href="https://doi.org/10.64000/k2hez-ysv45" target="_blank">Research Nexus&lt;/a> for more information.)&lt;/p>
&lt;p>For publishers, showing your work also means being transparent to your readers about the editorial checks that a manuscript has undergone. Crossref has a tool that enables this editorial transparency: it’s called &lt;a href="https://www.crossref.org/services/crossmark">Crossmark&lt;/a>. Crossmark allows readers to see the most up-to-date information about an article, even on downloaded PDFs. In most cases it is used to show whether the version of an article is most recent one, or whether any corrigenda or retractions have been subsequently added. But it can also be used to provide whatever information a publisher wishes to share about the paper. Some journals have experimented with using Crossmark to ‘thread’ publications together, for example, by linking all the outputs generated from a single clinical trial registration number (&lt;a href="https://doi.org/10.64000/t2fmq-vdb52" target="_blank">blog post here&lt;/a>). But publishers could go further and display metadata pertaining to the editorial checks they have performed on a paper. So Crossmark could tell readers that the paper has been checked for plagiarism, or figure manipulation, or reporting standards such as CONSORT or ARRIVE guidelines. Here at Research Square we have been addressing this with a series of &lt;a href="https://www.researchsquare.com/researchers/badges" target="_blank">Badges&lt;/a> that researchers can apply to their papers to demonstrate what checks have been performed.&lt;/p>
&lt;p>Together, these implementations would provide value to the reader, who can see exactly what has been checked, and to the publisher, who can show how rigorous their editorial processes are. It would also serve to highlight the integrity of the authors who have passed all of these checks.&lt;/p>
&lt;p>Research integrity is not something that can be easily measured but, unlike wit or charm, it is something that people generally know that they have.* This means that they just need to be transparent in their output to demonstrate this to the world. Metadata provides a simple way of doing this, so researchers and publishers should make sure they provide it as openly as they can.&lt;/p>
&lt;p>*&lt;em>with apologies to &lt;a href="https://en.wikipedia.org/wiki/Laurie_Lee" target="_blank">Laurie Lee&lt;/a> for the mangled quote&lt;/em>&lt;/p></description></item><item><title>Dr. Livingstone, I presume…a two month expedition deep into the heart of research publishing</title><link>https://www.crossref.org/blog/dr.-livingstone-i-presumea-two-month-expedition-deep-into-the-heart-of-research-publishing/</link><pubDate>Wed, 13 Dec 2017 00:00:00 +0000</pubDate><author>Amanda Bartell</author><guid>https://www.crossref.org/blog/dr.-livingstone-i-presumea-two-month-expedition-deep-into-the-heart-of-research-publishing/</guid><description>&lt;p>Hello there. I&amp;rsquo;m &lt;a href="https://www.crossref.org/people/amanda-bartell/">Amanda Bartell&lt;/a>, and I joined the &lt;a href="https://www.crossref.org/people/org-chart/">Crossref team&lt;/a> in mid-October as the new Head of Member Experience. My new Member Experience team will be responsible for metadata users as well as members, onboarding new accounts, supporting existing ones, and making sure that everyone can make the most of Crossref services - an an easy and efficient way. I have spent the last couple of months exploring the world of academic publishing and what our members need - and it&amp;rsquo;s been fascinating!&lt;/p>
&lt;h2 id="expedition-members">Expedition members&lt;/h2>
&lt;p>The new Member Experience team is made up of some people who are new to Crossref and Scholarly Publishing and some whose names you&amp;rsquo;ll probably recognize!&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://www.crossref.org/people/anna-tolwinska/">Anna Tolwinska&lt;/a> (Member Experience Manager) will support existing members in understanding the quality of metadata they deposit with us, and how they can best make use of our other products and services.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/paul-davis/">Paul Davis&lt;/a> (Product Support Specialist) and &lt;a href="https://www.crossref.org/people/shayn-smulyan/">Shayn Smulyan&lt;/a> (Product Support Associate) will continue to provide excellent technical support to all creators and consumers of our metadata.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/people/gurjit-bhullar/">Gurjit Bhullar&lt;/a> (Membership Coordinator) will help new applicants who want to join Crossref understand the member obligations and have a smooth induction journey.&lt;/li>
&lt;/ul>
&lt;p>We&amp;rsquo;ll be expanding the team in 2018 to support you further - &lt;a href="https://doi.org/10.64000/fcv4b-h5q84" target="_blank">watch this space!&lt;/a>&lt;/p>
&lt;h2 id="what-a-diverse-ecosystem">What a diverse ecosystem&lt;/h2>
&lt;p>My background is educational publishing, so this has been my first foray into the world of scholarly publishing. In my first few months I&amp;rsquo;ve been lucky enough to attend three very different events with Crossref - Frankfurt Book Fair, our annual meeting (&lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2017">LIVE17&lt;/a>) in Singapore, and an OpenCon event in Oxford. Each one has given me the chance to talk to our members and other constituents, and I&amp;rsquo;ve been really struck by what a diverse bunch you are:  from small volunteer-led society journals through universities to commercial behemoths; from Albania to Zambia (and 125 countries in between); covering everything from Ancient History to X-Ray Spectrometry.&lt;/p>
&lt;h2 id="expedition-equipment-to-suit-the-climate">Expedition equipment to suit the climate&lt;/h2>
&lt;p>This diversity gives my team a huge responsibility. We need to make sure that the support we provide to you can meet the needs of everyone -  whether you&amp;rsquo;re a multinational publisher with a large team of xml specialists, or a small team of enthusiastic academics. Everyone should be able to clearly understand and take advantage of what Crossref offers both to you as an organisation and to the wider scholarly community.&lt;/p>
&lt;p>With this in mind, we&amp;rsquo;re going to be making a few changes to the support materials we provide over the next 12 months&amp;mdash;rewriting them so they&amp;rsquo;re clearer for everyone, re-structuring our support center so there&amp;rsquo;s a separate route through depending on your level of technical expertise and closer links with our main website, plus providing support in different languages and different formats.&lt;/p>
&lt;h2 id="sticking-together-in-a-harsh-environment">Sticking together in a harsh environment&lt;/h2>
&lt;p>As someone who has previously worked in commercial publishing, something else that has struck me about working in a member organisation is the difference between members and traditional &amp;ldquo;customers&amp;rdquo;. It&amp;rsquo;s been fantastic to see how involved many of you are in Crossref. From taking part in our various committees and working groups, to helping to organize &lt;a href="https://www.crossref.org/events/">LIVE Local events&lt;/a>, to attending webinars and training, it&amp;rsquo;s obvious that you feel a real sense of ownership over Crossref and our shared mission.&lt;/p>
&lt;p>We&amp;rsquo;re hoping to make use of that great sense of community in 2018 by improving our member center, giving you more access to see the level of metadata you&amp;rsquo;re sharing with the community (and that others are sharing) and providing more options for you to communicate with, and support each other. We&amp;rsquo;re also going to be improving the education we offer for new members, to make sure that everyone is aware of the joint mission we all have to improve research communications. Most long time members know it&amp;rsquo;s so much more than just having a DOI, and we need to make sure that our new members are aware of this too and share our vision.&lt;/p>
&lt;h2 id="leaving-no-one-behind">Leaving no-one behind&lt;/h2>
&lt;p>We have a lot of plans for the Member Experience team in 2018, but it&amp;rsquo;s key that everything we do meets the needs of all our members. If you have any suggestions for how we can improve your member experience, [do let me know](mailto:feedback@crossref.org?subject=Member Experience suggestion).&lt;/p></description></item><item><title>Global Persistent Identifiers for grants, awards, and facilities</title><link>https://www.crossref.org/blog/global-persistent-identifiers-for-grants-awards-and-facilities/</link><pubDate>Wed, 13 Dec 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/global-persistent-identifiers-for-grants-awards-and-facilities/</guid><description>&lt;p>Crossref&amp;rsquo;s &lt;a href="https://gitlab.com/crossref/open_funder_registry" target="_blank">Open Funder Registry&lt;/a> (neé FundRef) now includes over 15 thousand entries. Crossref has over 2 million metadata records that include funding information - 1.7 million of which include an Open Funder Identifier. The uptake of funder identifiers is already making it easier and more efficient for the scholarly community to directly link funding to research outputs, but lately we&amp;rsquo;ve been hearing from a number of people that the time is ripe for a global grant identifier as well.&lt;/p>
&lt;p>To that end, Crossref convened its &lt;a href="https://www.crossref.org/working-groups/funders/">funder advisory group&lt;/a> along with representatives from our collaborator organisations, ORCID and DataCite, to explore the creation of a global grant identifier system.&lt;/p>
&lt;p>We thought you might like to know about what we&amp;rsquo;ve been discussing&amp;hellip;&lt;/p>
&lt;h3 id="the-first-rule-of-grant-identifiers">The First Rule of Grant Identifiers&lt;/h3>
&lt;p>The first rule of grant identifiers is that they probably should not be called &amp;ldquo;grant identifiers&amp;rdquo;. Research is supported in a variety of ways&amp;mdash;through grants, endowments, secondments, loans, use of facilities/equipment and even crowd-funding. In any of these cases, it is important to be able to link researchers and research outputs to details about the sources of support. This is true for prosaic reasons&amp;mdash;to understand ROI, to map the competitive landscape, to ensure that mandates are fulfilled, to avoid double payment. But it is also true for epistemic reasons; understanding how research was funded can help contextualise that research, and help expose potential conflicts of interest or specific agendas.&lt;/p>
&lt;p>The &lt;a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry&lt;/a> which provides a coarse mapping between research outputs and funders, but it is becoming clear that we need more fine-grained mapping directly to information about the kind of support that was provided.&lt;/p>
&lt;p>Awkwardly, none of us had any great ideas about alternative nomenclature, so we&amp;rsquo;ve made the eminently practical decision to continue to use the term &amp;ldquo;grant identifier&amp;rdquo; whilst being aware that our aim is to define a system that applies more broadly to any form of funding or support of research. So &lt;code>+1&lt;/code> for practicality.&lt;/p>
&lt;h3 id="why-do-we-need-an-open-global-grant-identifier">Why do we need an open, global, grant identifier?&lt;/h3>
&lt;p>With the steady increase in research outputs, and the growing number of active researchers from both academia and industry, research stakeholders find they need to be able to automate workflows in order to scale their systems efficiently. Funders want to be able to track the outputs that arise from research they have funded. As a result, institutions find themselves having to regularly analyse and summarise the research their faculty produces. Faculty, in turn, face increasing accounting bureaucracy in order to meet all the reporting requirements that are cascading through the system. And finally, publishers are seeking to make the manuscript submission and evaluation process more efficient as well as to increase the discoverability and contextual richness of their publications.&lt;/p>
&lt;p>Most funders already have local, internal grant identifiers. But there are over 15K funders currently listed in the aforementioned Open Funder Registry. The problem is that each funder has its own identifier scheme and (sometimes) API. It is very difficult for third parties to integrate with so many different systems. Open, global, persistent and machine-actionable identifiers are key to scaling these activities.&lt;/p>
&lt;p>We already have a sophisticated open, global, interoperable infrastructure of persistent identifier systems for some key elements of scholarly communications. We have persistent identifiers for researchers and contributors (ORCID iDs), for data and software (DataCite DOIs), for journal articles, preprints, conference proceedings, peer reviews, monographs and standards (Crossref DOIs), and for Funders (Open Funder Registry IDs).&lt;/p>
&lt;p>And there are similar systems under active development for &lt;a href="https://www.crossref.org/categories/organisation-identifier/">research organisations&lt;/a>, &lt;a href="https://doi.org/10.64000/skv7b-cef25" target="_blank">conferences, projects&lt;/a> and &lt;a href="https://scicrunch.org/resources" target="_blank">resources&lt;/a> reported in the biomedical literature (e.g. antibodies, model organisms). At a minimum, open, persistent identifiers address the inherent difficulty in disambiguating entities based on textual strings (structured or otherwise). This precision, in turn, allows automated cross-walking of linked identifiers through APIs and metadata which enable advanced applications.&lt;/p>
&lt;p>For example, the use of identifiers can simplify user interfaces and save users time. Almost everybody in scholarly communications spends a frustrating portion of their lives copying information from one system to another. This process is not just tedious, it is also error-prone. But we are increasingly seeing systems make use of identifiers to eliminate the need for a lot of this manual copying. For example, researchers using an ORCID iD when they submit a manuscript can start to expect that their relevant ORCID biographical data will simply be imported into the manuscript tracking system so that it doesn&amp;rsquo;t have to be manually copied over. And if said researcher has their manuscript accepted, they can also expect that their ORCID record will automatically be updated with the publication information and that their institution and/or their funder can be automatically notified of the impending publication so that relevant repositories and &lt;a href="https://en.wikipedia.org/wiki/Current_research_information_system" target="_blank">CRIS&lt;/a> systems can be populated automatically.&lt;/p>
&lt;p>Additionally, there is a growing list of services that have been built on top of these standard identifiers. Profile systems (e.g. VIVO, Impact Story, Kudos) can automatically retrieve the latest information from a researcher&amp;rsquo;s ORCID record. Bibliographic management tools (EasyBib, Zotero, Papers) allow researchers to cite content with the latest metadata. And similarity checking services can harvest and index the latest scholarly literature for inclusion in the tools they have developed for detecting plagiarism and fraud. Funder identifiers are already playing an important role in this metadata workflow. As of November 2017, there are 1.7 million Crossref publication DOIs that are explicitly linked to an Open Funder Registry ID. These linkages serve as a foundation for initiatives like SHARE, CHORUS, and the Jisc Publications Router.  But there are another 1+ million records that have funding information without an associated ID and, of course, 90+ million records that have no funding information at all.&lt;/p>
&lt;blockquote>
&lt;p>So If we have global funder identifiers and they are already working, why do we need global grant identifiers as well? Don&amp;rsquo;t we just need to increase uptake of funder identifiers? How will grant identifiers help?&lt;/p>
&lt;/blockquote>
&lt;p>First, global grant identifiers could greatly reduce the UX complexity of gathering funder information. This, in turn, would boost the collection of funding information from researchers and ensure that the information that they provide to publishers, institutions and other funders is accurate and complete.&lt;/p>
&lt;p>Second, the introduction of global grant identifiers would further increase the utility of links between research outputs and funding information. A grant identifier provides more granular information about the funding. Instead of just linking to information about the funder, a grant identifier would allow linking research outputs to particular research programs along with the information relating to those programs, such as grant durations, award amounts, etc. It would also allow analysis of relationships between multiple co-funding bodies.&lt;/p>
&lt;h3 id="to-doi-or-not-to-doi">To DOI or not to DOI?&lt;/h3>
&lt;p>Clearly, we think DOIs are pretty good things. But we also aren&amp;rsquo;t zealots. Sometimes DOIs are appropriate and sometimes they are not. For example, we were instrumental in &lt;a href="https://docs.google.com/document/d/1awd6PPguRAdZsC6CKpFSSSu1dulliT8E3kHwIJ3tD5o/edit?usp=sharing" target="_blank">defining the structure of the ORCID identifier&lt;/a> and, in that case, we decided that DOIs were not appropriate.&lt;/p>
&lt;p>But in the case of a global grant identifier system, we think there are a number of reasons adopting DOIs would be useful:&lt;/p>
&lt;ol>
&lt;li>It is easy to &amp;ldquo;overlay&amp;rdquo; the global DOI system onto existing local identifier systems. An organisation does not need to abandon their internal identifier scheme in order to use DOIs. They can instead incorporate their local scheme into the DOI structure via the simple mechanism of prepending their existing identifiers with an assigned DOI prefix and registering relevant metadata with a DOI registration agency like Crossref or DataCite.&lt;/li>
&lt;li>DOI links are &amp;ldquo;persist-able&amp;rdquo;. That is they can resolve to different online locations even if domain names change and/or the DNS system itself is replaced. This characteristic is important for a grant identifier because funding agencies - particularly government funding agencies - tend to undergo frequent reorganisations (e.g. splitting, merging, restructuring) and renaming. An indirectly resolvable identifier like a DOI (or ARK, Handle, etc.) is critical to ensure the long-term integrity of identifiers in these situations.&lt;/li>
&lt;li>There are 15K+ funders currently listed in the Open funder Registry. Each has their own grant identifier scheme and different levels of technical support for them (APIs, etc.). This makes it very difficult for 3rd parties to build tools that work &amp;ldquo;generically&amp;rdquo; with grant identifiers.  But once a local identifier scheme had been &amp;ldquo;globalised&amp;rdquo; by making it a DOI, third parties can build tools without having to worry about the differences between individual funder systems.&lt;/li>
&lt;li>Crossref and DataCite DOIs are deeply embedded in the tools and workflows of scholarly communications. Manuscript tracking systems, bibliographic management systems, metrics systems, CRIS systems, profile systems, etc. often have built-in mechanisms for consuming and making use of DOIs and their associated metadata.&lt;/li>
&lt;li>Crossref and DataCite DOIs are cross-disciplinary. They are used in the humanities, social sciences, sciences and in a host of communities that frequently interact with the scholarly literature for example- NGOs, IGOs, patent systems, and standards bodies.&lt;/li>
&lt;li>Crossref and DataCite provide a variety of APIs (e.g. REST, OAI-PMH) and services (e.g. search, Crossmark, Similarity Check, Scholix) built around DOIs.&lt;/li>
&lt;li>DOI&amp;rsquo;s have a useful characteristic, which is that the &amp;ldquo;prefix&amp;rdquo; of a DOI can be used to determine who originally created the record with which the DOI is associated. In the case of grant identifiers, this means that the prefix of a DOI-based grant identifier could be used to automatically determine the correct funder responsible for the initial grant. This means that the UIs for entering funder/grant information could be both simplified and made more robust&amp;mdash;which would likely increase the number of parties that collect and propagate id-based funder information.&lt;/li>
&lt;/ol>
&lt;p>But the use of DOIs as the basis for grant identifiers also introduces some potential barriers to adopting a standard funding identifier. For example:&lt;/p>
&lt;ul>
&lt;li>Funders would need to be able to join a suitable DOI registration agency (e.g. Crossref, DataCite). Some funders (e.g. government agencies) may be restricted in their ability to &amp;ldquo;join&amp;rdquo; external organisations.&lt;/li>
&lt;li>Funders would need to be able to create new DOIs and register associated metadata with their chosen registration agency in a timely manner. Some funders may be unable to generate metadata or may not have the technical capacity to automatically register metadata.&lt;/li>
&lt;li>Funders would need to be able to provide an openly available (e.g. not behind access control) online resource to which the DOI would resolve. For example, a landing page describing the grant or a digital copy of the grant itself. Again, some funders may face technical barriers to providing an online resource to resolve to. In other cases there may be privacy or security reasons for not providing an open resource to which a DOI can resolve.&lt;/li>
&lt;/ul>
&lt;p>Still, the advisory group consensus has been that these barriers are generally surmountable. Most of the questions they had revolved around understanding what a DOI-based workflow would look like from the funder&amp;rsquo;s perspective, and so we outlined the steps a funder would need to take in order to adopt DOI-based global identifiers.&lt;/p>
&lt;h3 id="the-doi-based-grant-identifiers-workflow">The DOI-based grant identifiers workflow&lt;/h3>
&lt;p>A funder registering metadata and creating DOIs for grants would need to support the following workflow:&lt;/p>
&lt;ol>
&lt;li>When a grant is submitted, the funder would assign their own internal identifier for tracking, etc. For example &lt;code>00-00-05-67-89&lt;/code>.&lt;/li>
&lt;li>If the grant is accepted, the funder would:&lt;/li>
&lt;/ol>
&lt;ul>
&lt;li>generate a global public identifier for the grant based on the DOI. For example, assuming their prefix was &lt;code>10.4440&lt;/code>, then the global public identifier might become &lt;code>https://doi.org/10.4440/00-00-05-67-89&lt;/code>.&lt;/li>
&lt;li>create a &amp;ldquo;landing page&amp;rdquo; on their website (or wherever they make their grants available online) to which the global public identifier will resolve. The landing page would display a TBD set of metadata describing the grant, as well as a link to the grant itself.&lt;/li>
&lt;li>register the generated DOI and a TBD set of metadata with their registration agency (RA) (e.g. Crossref or DataCite). This metadata would include the URL of the landing page defined above.&lt;/li>
&lt;/ul>
&lt;ol start="3">
&lt;li>Once metadata and DOIs are registered with an RA, the funder would have a series of ongoing obligations:&lt;/li>
&lt;/ol>
&lt;ul>
&lt;li>Update locations: If the location of the landing page changes (for example, because of a site restructuring, merger of split of the funding organisation, etc.), the funder would need to update their metadata records to point the DOI to the new location.&lt;/li>
&lt;li>Update metadata: If metadata becomes out-of-date (e.g. the status of a grant changes, additional grant-related metadata is added, etc.), the funder would update the relevant records.&lt;/li>
&lt;li>Promote the use of the the DOI as the preferred global, public identifier for the grant. That is - the one that people should use when referring to or citing the grant (the funder can continue to use the original local identifier for their internal systems, etc.).  &lt;/li>
&lt;/ul>
&lt;p>Again, the advisory group thought that this workflow seemed tractable and agreed that the best way to ensure that would be to proceed to creating a working pilot of a global grant identifier system based on the DOI.&lt;/p>
&lt;h3 id="next-steps">Next steps&lt;/h3>
&lt;p>Crossref is starting a grant identifier pilot. We will create two sub-groups of the funder advisory group.&lt;/p>
&lt;h4 id="group-for-governance-membership-and-fees">Group for &amp;ldquo;Governance, membership, and fees&amp;rdquo;&lt;/h4>
&lt;p>This group will look at governance and financial issues raised by the introduction of grant identifiers. For example, it will look at whether Crossref&amp;rsquo;s membership model works as is or might need to be adjusted in order to accommodate a new constituency. We know, for example, that some funders find it hard to become &amp;ldquo;members&amp;rdquo; of organisations. We might need to create other participation categories in order to accommodate these restrictions. Similarly the group will look design a pricing model of DOIs for grants in order to make sure that they cover the costs of modifying and sustaining the system for them, as well as to ensure that the pricing incentivises funders to participate. This sub-group will work closely with Crossref&amp;rsquo;s membership and fees committee.&lt;/p>
&lt;h4 id="group-for-technical-and-metadata">Group for &amp;ldquo;Technical and metadata&amp;rdquo;&lt;/h4>
&lt;p>This group will look at any technical changes that need to be made to registration process in order to accommodate the new participants. If there are, they are likely to center around specific metadata requirements for grants. As such, the group will likely spend most of its time agreeing to a practical metadata schema for capturing relevant information about the myriad of ways in which organisations &lt;em>support&lt;/em> research. This group will also liaise with other relevant technical working groups, such as those who are looking at organisational identifiers and conference identifiers.&lt;/p>
&lt;p>The two sub-groups will first meet in January and, after a few meetings, will report back the advisory group with recommendations. Using these recommendations, we will develop an implementation plan which will include testing the infrastructure, testing metadata deposits, fee modelling, etc, with a small group of participants.&lt;/p>
&lt;p>If you are a funder, and you would like to have somebody from your origanization participate in one of these working groups, please &lt;a href="mailto:ginny@crossref.org">contact Ginny Hendricks&lt;/a>. Note that joining the above groups does not commit you to anything other than engaging in the discussion. We want to make sure we create a system that works for a range of funders, not just those who can start testing something right away.&lt;/p></description></item><item><title>And our survey says...</title><link>https://www.crossref.org/blog/and-our-survey-says.../</link><pubDate>Mon, 11 Dec 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/and-our-survey-says.../</guid><description>&lt;p>Earlier this year we sent out a short survey inviting members to rate our performance. We asked what you think we do well, what we don’t do so well, and one thing we could do to improve our rating.&lt;/p>
&lt;p>We were delighted to receive 313 responses and relieved that 93% of those were positive (phew!). It was very useful to hear your thoughts and to get such a variety of comments covering Product, Outreach, Marketing and Member Experience. There were a few recurring themes, three of which we’d like to address here:&lt;/p>
&lt;h2 id="1-providing-information-in-different-languages">1. Providing information in different languages&lt;/h2>
&lt;p>Not surprisingly, given the growing diversity of our member base, some respondents asked us to share information in languages other than English. We have been aware of this growing need for some time and have been working on a few developments in this area:&lt;/p>
&lt;ul>
&lt;li>In January 2018 we will be launching a series of seven service videos in six different languages—French, Spanish, Brazilian Portuguese, Chinese, Korean, and Japanese.&lt;/li>
&lt;li>January also sees the launch of a new initiative called the Ambassador Program. Ambassadors will work closely with Crossref to help spread the word about our services, and support our global members in their own languages.&lt;/li>
&lt;li>During 2017 we hosted two webinars in Brazilian Portuguese and one in Turkish, and aim to increase this in 2018.&lt;/li>
&lt;/ul>
&lt;h2 id="2-member-to-member-discussion-forum">2. Member-to-member discussion forum&lt;/h2>
&lt;p>Some respondents asked for a facility to enable members to reach out to each other, giving direct opportunity for discussions and/or sharing experiences online (and in their own languages). We have been working for a few months now to provide a member-to-member discussion area, which is planned for 2018. Following a soft launch covering a few areas/topics, we’ll broaden the scope to include technical support, too.&lt;/p>
&lt;h2 id="3-registering-metadata-more-easily-using-the-web-deposit-form">3. Registering metadata more easily using the web deposit form&lt;/h2>
&lt;p>Many respondents requested a more user-friendly process for registering metadata through our webform. Our Product and DevOps teams have been working on this for some time and have created a new interface called the Metadata Manager, which is currently in Beta but scheduled to launch in Q1 of 2018.&lt;/p>
&lt;p>Finally, we’d like to thank you for participating in our survey. Your valuable feedback and suggestions help us understand your experience, improve our service, shape the course of particular projects and even direct our future strategy.&lt;/p>
&lt;p>&lt;em>As this survey was anonymous, we are unable to respond to anyone on an individual basis, however, if you’d like to have your particular comments addressed, &lt;a href="mailto:feedback@crossref.org">we would love to hear from you directly.&lt;/a>&lt;/em>&lt;/p></description></item><item><title>Working with universities at Crossref LIVE Yogyakarta</title><link>https://www.crossref.org/blog/working-with-universities-at-crossref-live-yogyakarta/</link><pubDate>Mon, 11 Dec 2017 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/working-with-universities-at-crossref-live-yogyakarta/</guid><description>&lt;p>Following on from our LIVE Annual Meeting in Singapore, my colleague, Susan Collins, and I held a local LIVE event in Yogyakarta thanks to support from Universitas Ahmad Dahlan (UAD), Universitas Muhammadiyah Sidoarjo and one of Crossref&amp;rsquo;s new Sponsoring Affiliates, Relawan Jurnal Indonesia.&lt;/p>
&lt;p>Over the past two years, we&amp;rsquo;ve seen accelerated growth in our membership in Asia Pacific (making up a quarter of all new members in the last two years). A lot of those new members have come from Indonesia, so it was great to have the opportunity to meet up, answer questions and to share knowledge between all our different organisations.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;img src="https://www.crossref.org/images/blog/2017/yogyakarta-blog.jpg" alt="graph of number of new members per region" width="250px"/>&lt;/span>
&lt;/div>
&lt;p>We welcomed speakers such as Dr. Muhammad Dimyati, from the Directorate General of Strengthening for Research and Development, Ministry of Research, Technology and Higher Education. Dr. Dimyati talked about the importance of Indonesian research and presented statistics on its growth, but also its coverage in different databases like Scopus and DOAJ.&lt;/p>
&lt;p>Dr. Lukman from LIPI, the Indonesian Institute of Sciences also joined us to explain the importance of identifiers within the research ecosystem. As any identifier buff will know, we&amp;rsquo;re keen to talk more about how organisations are using Crossref metadata and identifiers, and the importance of providing good, complete metadata (&lt;a href="http://www.metadata2020.org/" target="_blank">Metadata2020&lt;/a>) so this, plus a remote presentation from Nobuko Miyari from &lt;a href="https://orcid.org/" target="_blank">ORCID&lt;/a> helped provide great context for the day.&lt;/p>
&lt;p>Metadata and identifiers are of course just one part of the process, and Mr. Tole Sutikno from UAD gave an overview of good practice publishing by looking  at some of the wider issues that journal editors (and researchers) need to know.&lt;/p>
&lt;p>We had time in the afternoon to talk to our audience about Crossref - our different services, OJS integrations, funding data and our APIs, and thanks to our moderators we were able to take lots of questions from members who had specific questions about Crossmark, Cited-by and depositing references.&lt;/p>
&lt;div class="shortcode-divwrap align-right">
&lt;span>&lt;img src="https://www.crossref.org/images/blog/2017/yogyakarta2-blog.jpg" alt="image of stage" width="250px" />&lt;/span>
&lt;/div>
&lt;p>A few weeks later, and I&amp;rsquo;m still absorbing all of the things that happened on our (too) quick trip to Yogyakarta.&lt;/p>
&lt;p>Thanks again to our members and hosts for attending the event and sharing their questions, ideas and plans with us, and we plan to come back to continue to build on these in future.&lt;/p></description></item><item><title>The PIDapalooza lineup is out; come rock out with us at the open festival of persistent identifiers</title><link>https://www.crossref.org/blog/the-pidapalooza-lineup-is-out-come-rock-out-with-us-at-the-open-festival-of-persistent-identifiers/</link><pubDate>Fri, 01 Dec 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/the-pidapalooza-lineup-is-out-come-rock-out-with-us-at-the-open-festival-of-persistent-identifiers/</guid><description>&lt;p>PIDs&amp;rsquo;R&amp;rsquo;Us and if they&amp;rsquo;re you, too, please join us for the second &lt;a href="https://pidapalooza.org/" target="_blank">PIDapalooza&lt;/a>, in Girona, Spain on January 23-24, for a two-day celebration of persistent identifiers.&lt;/p>
&lt;p>Together, we will achieve the incredible - make a meeting about persistent identifiers and networked research fun! Brought to you by California Digital Library, Crossref, DataCite, and ORCID, this year&amp;rsquo;s &lt;a href="https://pidapalooza18.sched.com/" target="_blank">sessions&lt;/a> are organized around eight themes:&lt;/p>
&lt;ul>
&lt;li>PID myths&lt;/li>
&lt;li>Achieving persistence&lt;/li>
&lt;li>PIDs for emerging uses&lt;/li>
&lt;li>Legacy PIDs&lt;/li>
&lt;li>Bridging worlds&lt;/li>
&lt;li>PIDagogy&lt;/li>
&lt;li>PID stories&lt;/li>
&lt;li>Kinds of persistence&lt;/li>
&lt;/ul>
&lt;h3 id="the-programhttpspidapalooza18schedcom-is-now-final-and-there-really-is-something-for-everyone-well-every-pid-geek">The &lt;a href="https://pidapalooza18.sched.com/" target="_blank">program&lt;/a> is now final and there really is something for everyone (well, every PID geek)&lt;/h3>
&lt;ul>
&lt;li>Hmm, &lt;a href="https://pidapalooza18.sched.com/event/Cwmj/do-researchers-need-to-care-about-pid-systems" target="_blank">Do Researchers Need to Care about PID Systems?&lt;/a> Excellent question.&lt;/li>
&lt;li>We&amp;rsquo;ll hear &lt;a href="https://pidapalooza18.sched.com/event/Cwml/stories-from-the-pid-roadies-scholix" target="_blank">Stories from the PID Roadies: Scholix&lt;/a>.&lt;/li>
&lt;li>Nevermind the &lt;a href="https://pidapalooza18.sched.com/event/CwnA/the-bollockschain-and-other-pid-hallucinations" target="_blank">The Bollockschain and other PID Hallucinations&lt;/a>.&lt;/li>
&lt;li>An intriguing session on &lt;a href="https://pidapalooza18.sched.com/event/Cwmk/resinfocitizenshipis#" target="_blank">#ResInfoCitizenshipIs?&lt;/a>.&lt;/li>
&lt;li>There will be a plenary by &lt;a href="http://orcid.org/0000-0002-1611-6935" target="_blank">Johanna McEntyre&lt;/a> on &lt;a href="https://pidapalooza18.sched.com/event/CwnI/as-a-biologist-i-want-to-reuse-and-remix-data-so-that-i-can-do-my-research" target="_blank">As a &lt;code>biologist&lt;/code> I want to &lt;code>reuse and remix data&lt;/code> so that I can &lt;code>do my research&lt;/code>&lt;/a>.&lt;/li>
&lt;li>And we&amp;rsquo;ll enjoy another plenary from &lt;a href="https://orcid.org/0000-0001-9114-8737" target="_blank">Melissa Haendel&lt;/a> (title to be confirmed).&lt;/li>
&lt;/ul>
&lt;p>With half the places already booked, now&amp;rsquo;s the time to &lt;a href="https://www.eventbrite.com/e/pidapalooza-2018-registration-35176831851" target="_blank">register&lt;/a> and plan your trip. We hope to see fellow festival-goers there for some PIDtastic party time (and actually some epic serious conversations).&lt;/p>
&lt;p>Contact me via the steering committee at &lt;a href="mailto:pidapalooza@datacite.org">PIDapalooza@datacite.org&lt;/a> with any questions, music requests, or backstage passes.&lt;/p>
&lt;h3 id="full-lineup">Full lineup&lt;/h3>
&lt;p>&lt;a id="sched-embed" href="http://pidapalooza18.sched.com/">View the Crossref LIVE17 agenda.&lt;/a>&lt;script type="text/javascript" src="//pidapalooza18.sched.com/js/embed.js">&lt;/script>&lt;/p></description></item><item><title>Using the Crossref REST API. Part 7 (with CHORUS)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-7-with-chorus/</link><pubDate>Mon, 27 Nov 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-7-with-chorus/</guid><description>&lt;p>Continuing our &lt;a href="https://www.crossref.org/categories/api-case-study/">blog series&lt;/a> highlighting the uses of Crossref metadata, we talked to Sara Girard and Howard Ratner at &lt;a href="http://www.chorusaccess.org" target="_blank">CHORUS&lt;/a> about the work they’re doing, and how they’re using our REST API as part of their workflow.&lt;/p>
&lt;h3 id="introducing-chorus">Introducing CHORUS&lt;/h3>
&lt;p>CHORUS (&lt;a href="http://www.chorusaccess.org" target="_blank">www.chorusaccess.org&lt;/a>) is an innovative non-profit organisation that supports funders, publishers, authors and institutions to deliver public access to articles reporting on funded research. Our vision is to create a future where the output flowing from funded research is easily and permanently discoverable, accessible and verifiable by anyone in the world.&lt;/p>
&lt;p>CHORUS currently monitors over 400,000 articles for more than 20 US federal and two international funding agencies, and has partnerships with Department of Defense, Department of Energy, National Science Foundation, National Institute of Standards and Technology, Office of the Director National of Intelligence: Intelligence Advanced Research Projects Activity, Smithsonian Institution, US Department of Agriculture, US Geological Survey, Japan Science and Technology Agency, and the Australian Research Council. CHORUS is supported by over 50 publisher and affiliate members who represent the majority of funded published research.&lt;/p>
&lt;p>&amp;lt;img align=right&amp;quot; src=&amp;quot;/images/blog/chorus-blog.png&amp;quot; width=&amp;ldquo;700&amp;rdquo; alt=&amp;ldquo;mage of interaction of platforms&amp;rdquo; class=&amp;ldquo;img-responsive&amp;rdquo;/&amp;gt;&lt;/p>
&lt;h3 id="what-problem-is-your-service-trying-to-solve">What problem is your service trying to solve?&lt;/h3>
&lt;p>CHORUS is the first service of CHOR Inc., founded in 2013 in response to the directive of the US Office of Science and Technology Policy (OSTP) for all US federal research agencies to develop and implement plans to widen public access to publications and data associated with federally funded research.&lt;/p>
&lt;p>CHORUS aims to minimize public access compliance burdens and ensure the long-term preservation and accessibility of articles reporting on funded research. We provide the necessary metadata infrastructure and governance to enable a smooth, low-friction interface between funders, authors, institutions and publishers in a distributed network environment. CHORUS’ services track public accessibility of articles regardless of whether they are published Gold OA or made open by the publisher.&lt;/p>
&lt;h3 id="can-you-tell-us-how-you-are-using-the-crossref-rest-api-at-chorus">Can you tell us how you are using the Crossref REST API at CHORUS?&lt;/h3>
&lt;p>The Crossref REST API is a key source for the metadata database that powers the CHORUS Dashboard, Search and Reporting services for Funders, Institutions and Publishers.&lt;/p>
&lt;h3 id="what-metadata-values-do-you-pull-from-the-api">What metadata values do you pull from the API?&lt;/h3>
&lt;p>We pull the basic bibliographic information such as publisher, journal title, article title, authors and publication date. Perhaps even more important to our area of focus are the funder, grant and license information.&lt;/p>
&lt;h3 id="how-often-do-you-extractquery-data">How often do you extract/query data?&lt;/h3>
&lt;p>CHORUS uses the Crossref REST API every day.&lt;/p>
&lt;h3 id="can-you-describe-your-workflow-using-crossref-metadata">Can you describe your workflow using Crossref metadata?&lt;/h3>
&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/chorus2-blog.png" width="600" alt="mage of interaction of platforms" class="img-responsive"/>
&lt;/div>
&lt;p>Every night we query the Crossref API to send us metadata for all article or conference proceeding records for our member publishers that have funder metadata matching the funders monitored by CHORUS.&lt;/p>
&lt;p>CHORUS monitors these DOIs for public accessibility on publisher websites; inclusion in agency search tools; deposit in a growing list of funder repositories (e.g.,&lt;a href="https://www.osti.gov/pages/" target="_blank">US DOE PAGES&lt;/a>,&lt;a href="https://par.nsf.gov/" target="_blank">NSF PAR&lt;/a>, and &lt;a href="https://pubs.er.usgs.gov/" target="_blank">USGS Publications Warehouse&lt;/a> and &lt;a href="https://www.ncbi.nlm.nih.gov/pmc/" target="_blank">NIH PubMed Central&lt;/a>); and for associated ORCID researcher records. CHORUS also uses the reuse license metadata to identify when an article is expected to be made publicly accessible.&lt;/p>
&lt;p>Finally, we check for ingestion in &lt;a href="http://www.clockss.org" target="_blank">CLOCKSS&lt;/a> and/or &lt;a href="http://www.portico.org" target="_blank">Portico&lt;/a> to ensure long-term preservation and accessibility of research findings reported in journal and proceedings articles. Our preservation partners keep the full text in their dark archives, only making it available when the content may no longer be made publicly accessible by the publisher.&lt;/p>
&lt;p>The collected and enhanced metadata is presented in our dashboard, search and reporting services all including links back to the publisher sites via the Crossref DOI.&lt;/p>
&lt;h3 id="what-are-the-future-plans-for-chorus">What are the future plans for CHORUS?&lt;/h3>
&lt;p>Following the success of our Funder and Publisher Dashboards, CHORUS is expanding the services we provide to international funders, non-governmental funders, and institutions. Our first funder partnership outside of the United States is with the Japan Science and Technology Agency (JST). CHORUS announced its new Institution Dashboard service this Autumn after successfully concluding pilots with the University of Florida and University of Denver. CHORUS will also be adding links to relevant datasets and other metadata utilizing forthcoming identifiers and metadata standards.&lt;/p>
&lt;h3 id="what-else-would-you-like-to-see-the-rest-api-offer">What else would you like to see the REST API offer&lt;/h3>
&lt;p>It would be great to see more identification of funders from Crossref members. While we have seen great leaps since 2013, we all have a long way to go. We are also eager to see Crossref incorporate the organisation Identifiers that they have begun with ORCID, DataCite and others.&lt;/p>
&lt;hr>
&lt;p>Thanks, CHORUS! If you would like to contribute a case study on the uses of Crossref Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>The research nexus - better research through better metadata</title><link>https://www.crossref.org/blog/the-research-nexus-better-research-through-better-metadata/</link><pubDate>Tue, 14 Nov 2017 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/the-research-nexus-better-research-through-better-metadata/</guid><description>&lt;p>Researchers are adopting new tools that create consistency and shareability in their experimental methods. Increasingly, these are viewed as key components in driving reproducibility and replicability. They provide transparency in reporting key methodological and analytical information. They are also used for sharing the artifacts which make up a processing trail for the results: data, material, analytical code, and related software on which the conclusions of the paper rely. Where expert feedback was also shared, such reviews further enrich this record. We capture these ideas and build on the notion of the &lt;a href="https://doi.org/10.64000/n0zjv-z6c66" target="_blank">“article nexus” blogpost&lt;/a> with a new variation: &amp;ldquo;the research nexus.&amp;rdquo;&lt;/p>
&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Article_Nexus_Reproducibility.png" width="400" alt="article nexus for reproducibility" class="img-responsive"/>
&lt;/div>
&lt;p>Some of Crossref’s publishing community are encouraging the scholarly communication practices surrounding these tools in a variety of ways: incorporating them into the publishing workflow, integrations between the tools and publishing systems, as well as linking and exposing the artifacts in the publications for readers to access. A special set of publishers have gone all the way and included these links into their Crossref metadata record. They insert them directly into the metadata deposit when they register the content (&lt;a href="https://support.crossref.org/hc/en-us/articles/214357426-Relationships-between-DOIs-and-other-objects" target="_blank">technical documentation&lt;/a>). Doing so, these connections reach further than the publisher platform and propagate to systems across the research ecosystem including places like indexers, research information management systems, sharing platforms (oh, the list goes on!). We highlight a small set of examples to illustrate how these outstanding publishing practices are supporting good research.&lt;/p>
&lt;h3 id="1-linking-to-an-entire-collection-of-methods">1. Linking to an entire collection of methods&lt;/h3>
&lt;p>Crossref member, Protocols.io, is supporting transparency and methods reproducibility with their open access repository of science methods. Leitão-Goncalves R, Carvalho-Santos Z,
Francisco AP, et al. investigated the concerted action of the commensal bacteria Acetobacter pomorum and Lactobacilli in Drosophila melanogaster, demonstrating how the interaction of specific nutrients within the microbiome can shape behavioral decisions and life history traits. Findings were published in PLOS Biology earlier this year: &lt;a href="https://doi.org/10.1371/journal.pbio.2000862" target="_blank">https://doi.org/10.1371/journal.pbio.2000862&lt;/a>. Authors deposited detailed methods and protocols used in the project (Drosophila rearing, media preparations, and microbial manipulations) as a collection in Protocols.io: &lt;a href="https://doi.org/10.17504/protocols.io.hdtb26n" target="_blank">https://doi.org/10.17504/protocols.io.hdtb26n&lt;/a>. So Protocols.io registered their content with us, linking the protocol to the paper. This creates the crosswalk between both so that users can get from one to the other through the metadata. The full metadata record can be found &lt;a href="https://api.crossref.org/works/10.17504/protocols.io.hdtb26n" target="_blank">here&lt;/a>.&lt;/p>
&lt;h3 id="2-linking-to-video-protocol">2. Linking to video protocol&lt;/h3>
&lt;p>If a picture is worth a thousand words, the truism might apply to moving pictures many times over. Fasel B, Spörri J, Schütz P, et al. proposed a set of calibration movements optimized for alpine skiing and validated the 3D joint angles of the knee, hip, and trunk during alpine skiing in a PLOS ONE paper: &lt;a href="https://doi.org/10.1371/journal.pone.0181446" target="_blank">https://doi.org/10.1371/journal.pone.0181446&lt;/a>. These movements consisted of squats, trunk rotations, hip ad/abductions, and upright standing. The specific team responsible for designing them (Fasel B, Spörri J, Kröll J, and Aminian K) described the set of calibration movements performed but found videos to be a far more effective way to communicate the technical movements used in their study. They made the visuals available too: &lt;a href="https://doi.org/10.17504/protocols.io.itrcem6" target="_blank">https://doi.org/10.17504/protocols.io.itrcem6&lt;/a>. So Protocols.io deposited the link between video protocol and paper to the Crossref metadata record (&lt;a href="https://api.crossref.org/works/10.17504/protocols.io.itrcem6" target="_blank">full metadata record&lt;/a>).&lt;/p>
&lt;h3 id="3-linking-to-software-and-peer-reviews">3. Linking to software and peer reviews&lt;/h3>
&lt;p>The Journal of Open Source Software (JOSS) is an academic journal about high quality research software across broadly diverse disciplines. Sara Mahar works on the effectiveness of organisations funded by the US Department of Housing and Urban Development to combat homelessness. She collaborated with computational physicist Matthew Bellis to create a python tool for researchers to visualize and analyze data from the Homeless Management Information System:&lt;a href="https://doi.org/10.21105/joss.00384" target="_blank">https://doi.org/10.21105/joss.00384&lt;/a>. The software was archived in Zenodo: &lt;a href="https://doi.org/10.5281/zenodo.13750" target="_blank">https://doi.org/10.5281/zenodo.13750&lt;/a> and the &lt;a href="https://github.com/openjournals/joss-reviews/issues/384" target="_blank">peer review artifacts&lt;/a> were also published. JOSS deposited all these links in the metadata record (&lt;a href="https://api.crossref.org/works/https://doi.org/10.21105/joss.00384" target="_blank">found here&lt;/a>).&lt;/p>
&lt;h3 id="4-linking-to-preprint-data-code-source-code-peer-reviews">4. Linking to preprint, data, code, source code, peer reviews&lt;/h3>
&lt;p>Gigascience, published by Oxford University Press, is experimenting with a number of new tools in their mission to promote reproducibility of analyses and data dissemination, organisation, understanding, and use. In a recent paper Luo R, Schatz M, and Salzberg S shared the results of the firstly publicly available implementation of variant calling using a 16-genotype probabilistic model for germline variant detection: &lt;a href="https://doi.org/10.1093/gigascience/gix045" target="_blank">https://doi.org/10.1093/gigascience/gix045&lt;/a>. Prior to formal peer review, the group posted the preprint in bioRxiv: &lt;a href="https://doi.org/10.1101/111393" target="_blank">https://doi.org/10.1101/111393&lt;/a>. When the paper was published, the authors made the supporting data available, including snapshots of the test and result data, in a public repository: &lt;a href="http://dx.doi.org/10.5524/100316" target="_blank">http://dx.doi.org/10.5524/100316&lt;/a>. OUP included this data citation in their Crossref metadata record via the routes recommended in our previous blog post about &lt;a href="https://doi.org/10.64000/hdj5p-8vy92" target="_blank">depositing data citations&lt;/a>. The researchers made the &lt;a href="https://github.com/aquaskyline/16GT" target="_blank">code available in Github&lt;/a>, and the algorithm is ready for researchers to run on Code Ocean, a cloud-based computational reproducibility platform that allows researchers to wrap and encapsulate the data, code, and computation environment linked to an article: &lt;a href="https://doi.org/10.24433/CO.0a812d9b-0ff3-4eb7-825f-76d3cd049a43" target="_blank">https://doi.org/10.24433/CO.0a812d9b-0ff3-4eb7-825f-76d3cd049a43&lt;/a>. For further transparency, expert reviews of the manuscript from the peer review history were published in Publons: &lt;a href="http://dx.doi.org/10.5524/review.100737" target="_blank">http://dx.doi.org/10.5524/review.100737&lt;/a> and &lt;a href="http://dx.doi.org/10.5524/review.100738" target="_blank">http://dx.doi.org/10.5524/review.100738&lt;/a>. (As of last month, publishers can &lt;a href="https://doi.org/10.64000/akwmm-8b769" target="_blank">register peer reviews at Crossref&lt;/a>). The &lt;a href="https://api.crossref.org/works/10.1093/gigascience/gix045" target="_blank">full metadata record&lt;/a> contains links to the entire set of materials listed above.&lt;/p>
&lt;h3 id="5-linking-to-preprint-code-docker-hub-video-reviews">5. Linking to preprint, Code, Docker hub, video, reviews&lt;/h3>
&lt;p>Narechania A, Baker R, DeSalle R, et al. used bird flocking behavior to design an algorithm, Clusterflock, for optimizing distance-based clusters in orthologous gene families that share an evolutionary history. Their paper was published in Gigascience last year: &lt;a href="https://doi.org/10.1186/s13742-016-0152-3" target="_blank">https://doi.org/10.1186/s13742-016-0152-3&lt;/a>. Supporting data, code snapshots and video were published in GigaDB: &lt;a href="http://dx.doi.org/10.5524/100247" target="_blank">http://dx.doi.org/10.5524/100247&lt;/a>. Code was maintained in &lt;a href="https://github.com/narechan/clusterflock" target="_blank">GitHub&lt;/a>. And authors also created a Docker application for Clusterflock, a lightweight, stand-alone, executable package of the software which includes everything needed to run it: code, runtime, system tools, system libraries, settings (&lt;a href="https://hub.docker.com/r/narechan/clusterflock-0.1/" target="_blank">Docker Hub link here&lt;/a>). They created a &lt;a href="https://youtu.be/ELZTVOiqKn8" target="_blank">video demo&lt;/a> of the algorithm. Publons reviews were published &lt;a href="http://dx.doi.org/10.5524/review.100507" target="_blank">http://dx.doi.org/10.5524/review.100507&lt;/a> and &lt;a href="http://dx.doi.org/10.5524/review.100508" target="_blank">http://dx.doi.org/10.5524/review.100508&lt;/a>.
Gigascience shared all these assets in their publication, including the link to the original bioRxiv preprint: &lt;a href="https://www.biorxiv.org/content/early/2016/03/25/045773" target="_blank">https://www.biorxiv.org/content/early/2016/03/25/045773&lt;/a>). The full metadata record containing these links can be found &lt;a href="https://api.crossref.org/works/10.1186/s13742-016-0152-3" target="_blank">here&lt;/a>.&lt;/p>
&lt;h3 id="the-research-nexus-better-research-through-better-metadata">The Research Nexus: better research through better metadata&lt;/h3>
&lt;p>These five are just a few exemplary cases showing how publishers are declaring the relationships between their publications and other associated artifacts to support reproducibility and discoverability of their content. We welcome you to check out our &lt;a href="https://support.crossref.org/hc/en-us/articles/214357426-Relationships-between-DOIs-and-other-objects" target="_blank">overview of relationships between DOIs and other materials&lt;/a> for more information. Members who are enriching your publishing pipeline in similar ways, please register these links to make your reach go further. We also welcome everyone to retrieve these relations in our REST API (&lt;a href="https://github.com/CrossRef/rest-api-doc" target="_blank">technical documentation&lt;/a>).&lt;/p></description></item><item><title>A transparent record of life after publication</title><link>https://www.crossref.org/blog/a-transparent-record-of-life-after-publication/</link><pubDate>Wed, 01 Nov 2017 00:00:00 +0000</pubDate><author>Madeleine Watson</author><guid>https://www.crossref.org/blog/a-transparent-record-of-life-after-publication/</guid><description>&lt;h2 id="crossref-event-data-and-the-importance-of-understanding-what-lies-beneath-the-data">Crossref Event Data and the importance of understanding what lies beneath the data.&lt;/h2>
&lt;p>Some things in life are better left a mystery. There is an argument for opaqueness when the act of full disclosure only limits your level of enjoyment: in my case, I need a complete lack of transparency to enjoy both chicken nuggets and David Lynch films. And that works for me. But metrics are not nuggets. Because in order to consume them, you really need to know how they’re made. Knowing the provenance of data, along with the context with which it was derived, provides everyone with the best chance of creating indicators which are fit for purpose. This is just one of the reasons why we built the Event Data infrastructure with transparency in mind.&lt;/p>
&lt;h3 id="the-transparency-problem">The transparency problem&lt;/h3>
&lt;p>For the scholarly community, alternative metrics to citation count (‘altmetrics’) are becoming increasingly popular as they can offer rich and expedited insight into today’s diverse and dynamic research environment. Research artifacts undergo an extended life online as they’re linked, shared, saved and discussed in forums both within and beyond the traditional academic ecosystem. Data on these interactions are initially fragmented and buried within platforms like social media, blogs and news sites. Downstream, there are several value-add services that collate and present that data as a single, aggregated count. We see individual data points like ‘paper X was tweeted 22 times’, and ‘paper X is referenced 16 times on Wikipedia’ being combined, homogenised, weighted and expressed as a single figure, a calculated number serving as a proxy for value. But altmetrics alone don&amp;rsquo;t tell the whole story, and how they are calculated is not without idiosyncrasy or politics. As we each have our own unique voice and perspective, we need to ensure we understand the lenses through which these metrics are made in order to consume them effectively.&lt;/p>
&lt;p>The 2015 &lt;a href="https://doi.org/10.13140/RG.2.1.4929.1363" target="_blank">Metric Tide report&lt;/a> highlighted transparency as one of the five dimensions of responsible metrics. Having access to the context used to create a metric — the provenance of the original data as well as full transparency around its extraction, processing and aggregation — helps consumers to use the data meaningfully and allows for comparison across third-party vendors. But transparency is difficult to achieve when, as the report notes, the systems and infrastructure for collecting and curating altmetrics-style data are fragmented and have limited interoperability.&lt;/p>
&lt;p>In the academic community, underlying centralised systems include ORCIDs to identify people and DOIs to identify items. But we’re missing a transparent, centralised infrastructure for describing and recording the relationships between objects and resources&lt;sup>1&lt;/sup>. These relationships, or links, occur outside publisher platforms and can provide valuable information about the interconnectivity and dissemination of research. Dedicated infrastructure for collecting these relationships would provide a data source for those interested in altmetrics to build upon.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/Fig1.1_EventDiagram.png" alt="Event diagram" class="img-responsive"/>
&lt;p>&lt;em>Figure 1.1 Example of some relationships between articles and activity on the web&lt;/em>&lt;/p>
&lt;p>At Crossref, we call these relationships Events. An Event is the record of a claim made about the existence of a relationship between a registered content item (i.e. a DOI) and a specific activity on the web. Events include:&lt;/p>
&lt;ul>
&lt;li>a DataCite dataset DOI contains a link to a Crossref article DOI&lt;/li>
&lt;li>an article was referenced in Wikipedia&lt;/li>
&lt;li>an article was mentioned on Twitter&lt;/li>
&lt;li>an article has a Hypothes.is annotation&lt;/li>
&lt;li>a blog contains a link to an article&lt;/li>
&lt;/ul>
&lt;p>In collaboration with &lt;a href="https://www.datacite.org/" target="_blank">DataCite&lt;/a>, we are collecting Events for the DOIs registered with our organisations and are making that data available for others in the community to use. This is the Event Data infrastructure, with which we’re plugging the gap in open scholarly relationships infrastructure.
&lt;a href="https://doi.org/10.64000/kattr-5k219" target="_blank">https://doi.org/10.64000/kattr-5k219&lt;/a>&lt;/p>
&lt;h3 id="the-event-data-infrastructure">The Event Data infrastructure&lt;/h3>
&lt;p>Crossref and DataCite have for many years provided a centralised location for bibliographic metadata and links, and a facility to help our members register Persistent Identifiers (DOIs) for their content. With nearly 100 million DOIs registered with Crossref, we know where research lives. Which got us thinking — could we use these links to find out more about the journey research undertakes after publication? Could we express these interactions as links without any aggregation or counts so it could be maximally reused? And if so, could we then provide this data in an open, centralised, structured format? The answer was yes, subject to some challenges:&lt;/p>
&lt;ul>
&lt;li>Querying for individual DOIs wasn’t scalable for our full corpus of 100 million items, so we had to find something else.&lt;/li>
&lt;li>Not everyone uses the DOI link (not a surprise!). Most people will link directly to the publisher’s site. This means we need to look for links using both the DOI and article landing page URLs.&lt;/li>
&lt;li>When we find people referring to registered content using its landing page, we find the DOI for that content item so that the link can be referenced in our data set in a stable, link-rot-proof way.&lt;/li>
&lt;li>We don’t always know the article landing page URL for every DOI upfront because like many relationships, the one &lt;a href="https://doi.org/10.64000/jw4t5-5yt89" target="_blank">between DOIs and URLs&lt;/a> is complicated.&lt;/li>
&lt;/ul>
&lt;p>We began by asking the wrong questions and as a result we got the wrong type of data back: instead of returning a record of individual actions, we were returning aggregated counts. Aside from not meeting our use case, aggregation requires the curation of an ever-churning dataset in order to keep totals updated, which is not scalable for the number of DOIs in our corpus.&lt;/p>
&lt;p>We soon learnt to ask the right questions. One pivotal change in approach was that instead of counts, we asked instead ‘what activity is happening on Twitter for this article?’. Our data went from ‘DOI X was mentioned 20 times on Twitter as of this date’ to ‘tweet X mentions DOI X on this date’. The data are now represented as a subject-verb-object triple:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/Fig1.2_TripleTable.png" alt="image table of data presented as triples" class="img-responsive"/>
&lt;p>&lt;em>Figure 1.2 Triple table.&lt;/em>&lt;/p>
&lt;p>Ultimately this has allowed us to represent actions like Wikipedia page edits as individual atomic actions (i.e an Event) rather than as a dataset that changes over time.
Being open about the provenance of altmetrics with Event Data
Crossref Event Data (the Crossref-specific service powered by the shared Event Data infrastructure) has evolved beyond a link store to become a continual stream of Events; each Event tells a new part of the story. Rather than constantly updating an Event whenever a new action takes place, we add a new one instead:&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/Fig1.3_WikipediaEvent.png" alt="Wikipedia Event example" class="img-responsive"/>
&lt;p>&lt;em>Figure 1.3 A Wikipedia Event.&lt;/em>&lt;/p>
&lt;p>Events answer a whole range of questions, such as:&lt;/p>
&lt;ul>
&lt;li>What links to what?&lt;/li>
&lt;li>How was the link made?&lt;/li>
&lt;li>Which Agent collected the Event?&lt;/li>
&lt;li>Which data source?&lt;/li>
&lt;li>When was the link observed?&lt;/li>
&lt;li>When do we think the link actually happened?&lt;/li>
&lt;li>What algorithms were used to collect it?&lt;/li>
&lt;li>Where’s the evidence?&lt;/li>
&lt;/ul>
&lt;p>We’re collecting data from a diverse range of platforms including Twitter, Wikipedia, blogs and news sites, Reddit, StackExchange, Wordpress.com and Hypothes.is. This means that when we observe a link in these platforms to what we think is a DOI, we create an &lt;a href="https://www.eventdata.crossref.org/guide/data/events/" target="_blank">Event&lt;/a> and a corresponding &lt;a href="https://www.eventdata.crossref.org/guide/data/evidence-records/" target="_blank">Evidence Record&lt;/a> to represent our observation. We also have Events to represent the links between research items registered with Crossref and DataCite - for example, when a Crossref DOI cites a DataCite DOI and vice versa.&lt;/p>
&lt;p>The provenance of the data is fully transparent and is made available to everyone via an open API. We call this the evidence trail. The record of each link (‘Events’) as well as the corresponding evidence can then be used to feed into tools for impact measurement, discoverability, collaboration and network analysis.&lt;/p>
&lt;p>Therefore, one application of Event Data is as an underlying, transparent data source for altmetrics calculations. For example, you might want to know the total number of times your paper has been mentioned on Twitter to date. If I told you that the number was 22, what does that actually mean? Do you know whether I counted both tweets and retweets? Do you consider both of these actions as equal? Is the sentiment of the tweet important to you? Was it a human or a bot that initiated a tweet? Are you interested in tweets containing links to multiple representations of your paper or do you only want to track mentions of your version of record (the final published copy)? With Event Data as your underlying data source, you can answer these questions.&lt;/p>
&lt;h3 id="not-only-transparent-in-data-transparent-by-design">Not only transparent in data, transparent by design&lt;/h3>
&lt;p>The &lt;a href="http://www.niso.org/" target="_blank">National Information Standards Organisation&lt;/a> (NISO), a US organisation responsible for technical standards for publishing, bibliographic and library applications, has developed a set of recommendations for transparency in their &lt;a href="https://groups.niso.org/apps/group_public/download.php/17091/NISO%20RP-25-2016%20Outputs%20of%20the%20NISO%20Alternative%20Assessment%20Project.pdf" target="_blank">Alternative Assessment Metrics Project report&lt;/a>, as well as a Code of Conduct for both altmetric practitioners and aggregators that aims to help improve the quality of altmetrics data. The working groups recognised that without transparency and conforming to a recognised standard, altmetric indicators &amp;ldquo;are difficult to assess, and thus may be seen as less reliable for purposes of measuring influence or evaluation&amp;rdquo;&lt;sup>1&lt;/sup>.&lt;/p>
&lt;p>Crossref Event Data is one of the example altmetric data providers listed in the NISO recommendations. My colleague Joe Wass participated in the development and specification of the NISO &lt;a href="http://www.niso.org/press-releases/2016/05/niso-releases-draft-altmetrics-recommended-practices-data-metrics" target="_blank">&amp;ldquo;Altmetrics Recommended Practices on Data Metrics, Alternative Outputs, and Persistent Identifiers&amp;rdquo;&lt;/a> at the same time as we were working with DataCite on Event Data, so they have mutually informed one another.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/Fig1.4_photo_MartinFenner_JoeWass.JPG" alt="image Martin Fenner and Joe Wass drawing plans on a whiteboard" width="600px" height="250" class="img-responsive"/>
&lt;p>&lt;em>Figure 1.4 Martin Fenner (DataCite) and Joe Wass (Crossref) drawing plans for the Event Data infrastructure.&lt;/em>&lt;/p>
&lt;p>The outcome of our involvement in the NISO recommendations is that Crossref Event Data is a service that is transparent by design. We have opened up our entire extraction and processing workflow so that we can clearly demonstrate the context and environment that was used to generate an Event. This evidence is a core component of our transparency-first principle.&lt;/p>
&lt;h3 id="building-services-on-event-data">Building services on Event Data&lt;/h3>
&lt;p>There are some really exciting ways that people are already using Event Data, and we’re still only in beta. Our aim has always been to create an open, portable, transparent data set that can be used by our diverse community including researchers, application developers, publishers, funders and third-party service providers. We have already seen data from our service used in recent research studies, impact reports and even a front-end tool. Launched recently as a prototype, ImpactStory’s &lt;a href="http://paperbuzz.org/" target="_blank">Paperbuzz.org&lt;/a> uses Event Data as one of its data sources for tracking the online buzz around scholarly articles. Jason Priem, cofounder of &lt;a href="https://impactstory.org/" target="_blank">ImpactStory&lt;/a>, notes:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;Because Crossref Event Data is completely open data, we believe it&amp;rsquo;s a game-changer for altmetrics. Our latest project, Paperbuzz.org, is just the first of a whole constellation of upcoming tools that will add value on top of Crossref&amp;rsquo;s open data.&amp;rdquo;&lt;/p>
&lt;/blockquote>
&lt;p>We are working towards launching Crossref Event Data as a production service. In the meantime though, please do take a look at our comprehensive &lt;a href="https://www.eventdata.crossref.org/guide/" target="_blank">User Guide&lt;/a>. Hopefully you’ll be inspired to go make something cool using the data! Events are being collected constantly; take a look below as they stream in from our data sources or visit our &lt;a href="http://live.eventdata.crossref.org/live.html" target="_blank">live stream demo&lt;/a> site to watch in real time.&lt;/p>
&lt;div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
&lt;iframe allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen="allowfullscreen" loading="eager" referrerpolicy="strict-origin-when-cross-origin" src="https://www.youtube.com/embed/CI93UgbFPuk?autoplay=0&amp;amp;controls=1&amp;amp;end=0&amp;amp;loop=0&amp;amp;mute=0&amp;amp;start=0" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" title="YouTube video">&lt;/iframe>
&lt;/div>
&lt;p>&lt;em>Figure 1.5 Screen capture of Crossref Event Data live stream demo.&lt;/em>&lt;/p>
&lt;p>As the service matures, we’ll continue to add new platforms to track and I also encourage anyone with article link data to get in touch to discuss how we can share it with the community via Event Data.&lt;/p>
&lt;p>For researchers in particular, I’m really keen to hear your thoughts on our data model and about the things we could additionally provide you with from an infrastructure perspective that would best support your research needs.&lt;/p>
&lt;p>And if you’re a publisher, take a look at our &lt;a href="https://doi.org/10.64000/pxdkr-gzg62" target="_blank">Event Data best practice guidelines&lt;/a> — there’s some really important information in there about how you can help give us the best chance possible of collecting Events for your registered content.&lt;/p>
&lt;p>And finally, if you’re a consumer of altmetrics data, I encourage you to ask questions. Ask your altmetrics vendors about how they gather their data and what context they apply to the aggregation of the metrics they supply. Ask yourself what behaviours you are interested in tracking and equally those you are not. Think about the endgame; about the type of impact you’re truly trying to measure and the story you want to tell. Because it’s these questions that will help you choose indicators that are the best fit for your own unique narrative.&lt;/p>
&lt;hr>
&lt;p>This content is cross-posted on &lt;a href=" https://elifesciences.org/labs/995b64e4/a-transparent-record-of-life-after-publication" target="_blank">eLife Labs&lt;/a>.&lt;/p>
&lt;p>&lt;em>References&lt;/em>&lt;/p>
&lt;p>&lt;sup>1&lt;/sup> Bilder, Geoffrey; Lin, Jennifer; Neylon, Cameron (2015): What exactly is infrastructure? Seeing the leopard&amp;rsquo;s spots.
Retrieved: Oct 16, 2017; &lt;a href="https://doi.org/10.6084/m9.figshare.1520432.v1" target="_blank">https://doi.org/10.6084/m9.figshare.1520432.v1&lt;/a>&lt;/p>
&lt;p>&lt;sup>2&lt;/sup> NISO, &lt;em>Outputs of the NISO Alternative Assessment Metrics Project&lt;/em>. Retrieved: 6th October 2017; &lt;a href="https://www.niso.org/publications/rp-25-2016-altmetrics" target="_blank">https://www.niso.org/publications/rp-25-2016-altmetrics&lt;/a> , p.2.&lt;/p></description></item><item><title>Meet the members, Part 1 (with Oxfam)</title><link>https://www.crossref.org/blog/meet-the-members-part-1-with-oxfam/</link><pubDate>Mon, 30 Oct 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/meet-the-members-part-1-with-oxfam/</guid><description>&lt;p>Introducing our new blog series &lt;em>Meet the members;&lt;/em> where we talk to some of our members and find out a little bit more about them, ask them to share how they use our services, and discuss what their plans for the future are. To start the series we talk to Liam Finnis of Oxfam.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/oxfam.jpg" alt=“Oxfam logo" height="250px" width="250px" class="img-responsive" />
&lt;/div>
&lt;h2 id="can-you-tell-us-a-little-bit-about-oxfam">Can you tell us a little bit about Oxfam?&lt;/h2>
&lt;p>Around the globe, Oxfam works to find practical, innovative ways for people to lift themselves out of poverty and thrive. We save lives and help rebuild livelihoods when crisis strikes. And we campaign so that the voices of the poor influence the local and global decisions that affect them.&lt;/p>
&lt;p>Oxfam’s Policy &amp;amp; Practice platform is the gateway to Oxfam’s knowledge, experience, and thinking. Policy &amp;amp; Practice aims to influence, enable and learn from others by sharing and collaborating online with professionals and practitioners.&lt;/p>
&lt;h2 id="whats-your-role-within-oxfam">What’s your role within Oxfam?&lt;/h2>
&lt;p>My name is Liam Finnis and I am the Website Manager for Oxfam GB’s &lt;a href="http://policy-practice.oxfam.org.uk/" target="_blank">Policy &amp;amp; Practice&lt;/a> site and the &lt;a href="http://oxfamilibrary.openrepository.com/oxfam/" target="_blank">Oxfam Digital Repository&lt;/a>. In addition to maintenance and development of our platforms, my role focuses on raising the visibility of our programme work including approach and methodology, while also ensuring the availability and accessibility of our publications and resources.&lt;/p>
&lt;h2 id="whats-your-participation-level">What’s your participation level?&lt;/h2>
&lt;p>We joined Crossref in 2016, but only really began fully implementing DOIs this year. We have registered 139 content items as of October, with the majority assigned in 2017. While this only constitutes a small number of our total publications (roughly 6%), we’ve focused on current and future publications rather than retroactive application (with a handful of exceptions).&lt;/p>
&lt;h2 id="tell-us-a-bit-about-what-you-publish-and-for-whom">Tell us a bit about what you publish and for whom&lt;/h2>
&lt;p>We produce roughly 220 publications each year, with a library of 4,450 spanning 40 years. Roughly half of this would be considered grey literature and includes: research reports; evaluations; briefing papers; technical briefings; case studies; guidelines and toolkits. We also publish the &lt;em>Gender &amp;amp; Development&lt;/em> &lt;em>Journal&lt;/em> with Routledge/Taylor &amp;amp; Francis.&lt;/p>
&lt;p>While our organisational focus is on inequalities and the eradication of poverty, this isn’t something we can achieve by looking solely at economic models. Our publications span a range of subject areas including: climate change; food and livelihoods; economics; gender; conflicts and disasters; land rights; and Water, Sanitation and Hygiene (WASH).&lt;/p>
&lt;p>Our audience ranges from humanitarian and development practitioners to policy makers to researchers and academics. We publish the research that underpins our campaigns advocacy work; the evaluations of our emergency response efforts; reports outlining the methodologies we’ve applied; briefings on policy and recommendations; and, toolkits and guidelines for research, programme quality and responsible data management.&lt;/p>
&lt;h2 id="what-do-you-think-makes-your-publications-unique">What do you think makes your publications unique?&lt;/h2>
&lt;p>Oxfam is one of the only NGOs that is actively sharing an extensive body of knowledge and experience. With 75 years of experience working on a global scale, our publications help to share learning and encourage best practice. Further to that, they showcase the changes (gradual or sudden) that we’ve seen in how development and humanitarian aid is defined and approached through the decades.&lt;/p>
&lt;p>&lt;em>&lt;a href="https://oxfamilibrary.openrepository.com/oxfam/handle/10546/141359" target="_blank">The Oxfam Gender Training Manual&lt;/a>&lt;/em>, published in 1991, remains one of our most frequently accessed resources; still widely regarded as a relevant, unique and valuable resource within the sector. Another of our key publications, &lt;em>&lt;a href="https://oxfamilibrary.openrepository.com/oxfam/handle/10546/338125" target="_blank">Wealth: Having it all and wanting more&lt;/a>&lt;/em>, was published in 2015, outlining the methodology and data sources for Oxfam’s frequently cited fact ‘85 billionaires have the same wealth as the bottom half of the world’s population’.&lt;/p>
&lt;p>The diversity in subject and format of our publications isn’t necessarily unique, but I’m reasonably confident that there is something in our publications that will relate to everyone.&lt;/p>
&lt;h2 id="what-trends-are-you-seeing-in-your-part-of-the-scholarly-publishing-community">What trends are you seeing in your part of the scholarly publishing community?&lt;/h2>
&lt;p>This is difficult to quantify as, while we have been lightly engaged with the scholarly publishing community in the past, we’ve been significantly more active in the past year. In addition to more actively applying DOIs, early in 2017 we were included in EBSCO Discovery and in March we made efforts to improve the visibility of our Digital Repository. Previously, the key route was through the Policy &amp;amp; Practice website, which brought together publications with blogs and pages focused on programmes, projects, approaches and methodology.&lt;/p>
&lt;p>Since making these two changes we’ve seen a significant increase in access of our resources directly from the repository. This has come in addition to the general usage through Policy &amp;amp; Practice. We are also working with Research4Life, INASP and TEEAL to improve visibility and accessibility of our publications more widely.&lt;/p>
&lt;h2 id="how-would-you-describe-the-value-of-being-a-crossref-member">How would you describe the value of being a Crossref member?&lt;/h2>
&lt;p>In the past two years, we’ve been looking into how we can ensure that our publications are visible and accessible to a wider audience. Becoming a member of Crossref and registering content with Crossref is a big part of that. It helps to give us a place in the discussions and events as well as enabling us to better understand and meeting scholarly publishing standards and implement best practice.&lt;/p>
&lt;h2 id="what-are-oxfams-plans-for-the-future">What are Oxfam&amp;rsquo;s plans for the future?&lt;/h2>
&lt;p>In terms of our work with Crossref and an active role in the scholarly publishing community, we’re still fairly new to it and we’re starting to see some of the benefits of our efforts. In the future, we’re looking to get a better idea of the opportunities available and build on our recent work.&lt;/p>
&lt;p>Personally, I’m really interested in exploring Crossref Event Data in greater detail and seeing how it can help us map the impact of our work more effectively.&lt;/p>
&lt;br>
Thanks, Liam!</description></item><item><title>Peer reviews are open for registering at Crossref</title><link>https://www.crossref.org/blog/peer-reviews-are-open-for-registering-at-crossref/</link><pubDate>Tue, 24 Oct 2017 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/peer-reviews-are-open-for-registering-at-crossref/</guid><description>&lt;p>&lt;a href="https://authorservices.taylorandfrancis.com/peer-review-global-view/" target="_blank">About 13-20 billion researcher-hours&lt;/a> were spent in 2015 doing peer reviews. What valuable work! Let&amp;rsquo;s get more mileage out of these labors and make these expert discussions citable, persistent, and linked up to the scholarly record. As we &lt;a href="https://doi.org/10.64000/1b7rc-rmj34" target="_blank">previously shared&lt;/a> during Peer Review week, Crossref is lauintroducing support for a new record type to support the registration of peer reviews. We’re one step closer to changing that. Today, we are excited to announce that we’re open for deposits.&lt;/p>
&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/televisionset.png" alt="tv set" width="60px" class="img-responsive" />
&lt;/div>
&lt;p>If you missed the first episode, here’s a recap:&lt;/p>
&lt;p>Publishers have been registering reviews with us for a while (ex: &lt;a href="https://doi.org/10.1016/j.engfracmech.2015.01.019" target="_blank">Example 1&lt;/a>, &lt;a href="https://doi.org/10.5194/wes-1-177-2016" target="_blank">Example 2&lt;/a>, and &lt;a href="https://doi.org/10.14322/PUBLONS.R518142" target="_blank">Example 3&lt;/a>). But these have been shoehorned into other content: article, dataset, or component. So we are extending Crossref’s infrastructure to properly treat this special scholarly artifact. This includes a range of outputs made publicly available from the peer review history (referee reports, decision letters, author responses, community comments) across any and all review rounds. We welcome scholarly discussions of journal articles before or after publication (e.g. “post-publication reviews”).&lt;/p>
&lt;p>We collect metadata that characterizes the peer review asset (for example: recommendation, type, license, contributor info, competing interests). We also collect metadata, which offers a view into the review process (e.g. pre/post-publication, revision round, review date).&lt;/p>
&lt;p>This special set will support the discovery and investigation of peer reviews as it is linked up to the article discussed. It will also enable the following:&lt;/p>
&lt;ul>
&lt;li>Enable tracking of the evolution of scholarly claims through the lineage of expert discussion&lt;/li>
&lt;li>Support enrichment of scholarly discussion&lt;/li>
&lt;li>Enable reviewer accountability&lt;/li>
&lt;li>Credit reviewers and editors for their scholarly contribution&lt;/li>
&lt;li>Support publisher transparency&lt;/li>
&lt;li>Connect reviews to the full history of the published results&lt;/li>
&lt;li>Provide data for analysis and research on peer review&lt;/li>
&lt;/ul>
&lt;p>Please come check out our &lt;a href="https://support.crossref.org/hc/en-us/articles/115005255706" target="_blank">documentation &lt;/a>for more information.&lt;/p>
&lt;p>As publishers are implementing this, we are finishing up the delivery of this metadata for machine and human access, across all the Crossref interfaces (&lt;a href="https://api.crossref.org/" target="_blank">REST API&lt;/a>, &lt;a href="https://support.crossref.org/hc/en-us/articles/213679866-OAI-PMH-subscriber-only-" target="_blank">OAI-PMH&lt;/a>, &lt;a href="https://search.crossref.org/" target="_blank">Crossref Metadata Search&lt;/a>) to enable discoverability across the research ecosystem. We are also working to make it possible for members to get Cited-by data for the peer reviews they register.&lt;/p>
&lt;p>If you are interested in registering your peer review content with us, please &lt;a href="mailto:support@crossref.org">get in touch&lt;/a>.&lt;/p></description></item><item><title>What happened at last month's LIVE local in London</title><link>https://www.crossref.org/blog/what-happened-at-last-months-live-local-in-london/</link><pubDate>Sun, 22 Oct 2017 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/what-happened-at-last-months-live-local-in-london/</guid><description>&lt;p>So much has happened since we held LIVE16 (our annual meeting) in London last year that we wanted to check-in with our UK community and share the year’s developments around our tools, teams and services ahead of &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2017">LIVE17&lt;/a> next month in Singapore.&lt;/p>
&lt;p>And so, on 26th September we held a half-day &amp;lsquo;&lt;a href="https://www.crossref.org/events">LIVE local&lt;/a>&amp;rsquo;, covering a wide range of strategic topics, well-attended by a diverse representation of our UK community of publishers, funders, researchers, and tool-makers.&lt;/p>
&lt;p>What we discussed on the day:&lt;/p>
&lt;ul>
&lt;li>Ed Pentz, Crossref&amp;rsquo;s Executive Director, kicked the day off with &lt;a href="https://www.slideshare.net/CrossRef/whats-new-at-crossref-ed-pentz-london-live-2017" target="_blank">&amp;lsquo;What’s new at Crossref&amp;rsquo;&lt;/a>&lt;/li>
&lt;li>Geoffrey Bilder, Strategic Director, talked us through &lt;a href="https://www.slideshare.net/CrossRef/new-initiatives-geoffrey-bilder-london-live-2017" target="_blank">&amp;lsquo;Crossref&amp;rsquo;s Strategic Initiatives&amp;rsquo;&lt;/a>&lt;/li>
&lt;li>Ginny Hendricks, Director of Member and Community Outreach introduced &lt;a href="https://www.slideshare.net/CrossRef/metadata-2020-ginny-hendricks-london-live-2017" target="_blank">&amp;lsquo;Metadata 2020&amp;rsquo;&lt;/a>&lt;/li>
&lt;li>Rachael Lammey, Head of International Outreach discussed the &lt;a href="https://www.slideshare.net/CrossRef/global-reach-of-crossref-metadata-rachael-lammey-london-live-2017" target="_blank">&amp;lsquo;Global reach of Crossref metadata&amp;rsquo;&lt;/a>&lt;/li>
&lt;li>Jure Triglav from Coko Foundation presented some interesting &lt;a href="http://slides.com/jure/metadata-collaboration" target="_blank">&amp;lsquo;Metadata Use Case Studies&amp;rsquo;&lt;/a>&lt;/li>
&lt;li>Jennifer Lin, Director of Product Management, spoke about Crossref&amp;rsquo;s &lt;a href="https://www.slideshare.net/CrossRef/new-product-developments-jennifer-lin-london-live-2017" target="_blank">&amp;lsquo;New Product Developments&amp;rsquo;&lt;/a>&lt;/li>
&lt;li>Ed Pentz concluded the day leading a discussion on &lt;a href="https://www.slideshare.net/CrossRef/crossref-future-direction-ed-pentz-london-live-2017" target="_blank">&amp;lsquo;Crossref&amp;rsquo;s Future Direction&amp;rsquo;&lt;/a>&lt;/li>
&lt;/ul>
&lt;iframe src="https://www.crossref.org/pdfs/crossref-london-live.pdf" width="760" height="500" frameborder="0" marginheight="0" marginwidth="0">&lt;/iframe>
&lt;p>This event was one in a series of smaller, regional events which aim to better cater to our global membership and provide a tailored program of activities. You can read more about this series of events on our &lt;a href="https://www.crossref.org/events">LIVE locals&lt;/a> page, and if you are interested in hosting an event near you or have suggestions for one in your region then please &lt;a href="mailto:feedback@crossref.org">contact me&lt;/a> to get involved.&lt;/p></description></item><item><title>Celebrating ORCID at five</title><link>https://www.crossref.org/blog/celebrating-orcid-at-five/</link><pubDate>Mon, 16 Oct 2017 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/celebrating-orcid-at-five/</guid><description>&lt;p>Happy birthday, ORCID! It&amp;rsquo;s their fifth birthday today and it&amp;rsquo;s gratifying to me&amp;mdash;as a founding board member and former Chair of the board&amp;mdash;to see how successful it has become. ORCID has a great staff, over 700 members from 41 countries and is quickly approaching 4 million ORCID iDs. Crossref&amp;mdash;it&amp;rsquo;s board, staff, and members&amp;mdash;has been an ORCID supporter from the start. One example of this support is that we seconded Geoffrey Bilder to be ORCID&amp;rsquo;s interim CTO for about eight months.&lt;/p>
&lt;p>Actually, Crossref has been involved with ORCID even before the start.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/orcid-at-5.jpg" alt="ORCID turns five" width="300px" class="img-responsive" />
&lt;/div>
&lt;p>ORCID&amp;rsquo;s birthday recognizes when the registry went live in 2012 but the origins of what became ORCID stretch back to &lt;a href="https://doi.org/10.64000/mz7md-r1m43" target="_blank">a meeting that Crossref organized back in February 2007 on &amp;ldquo;Author IDs&amp;rdquo;&lt;/a>. After this meeting there were many follow on discussions but it was clear that as an association of scholarly publishers Crossref didn&amp;rsquo;t have suitable governance for an researcher identifier registry which needed support from a broader group of stakeholders.&lt;/p>
&lt;p>Subsequent discussions between Nature and Thomson Reuters (represented by Howard Ratner Dave Kochalko) led&amp;mdash;after many more meetings&amp;mdash;to ORCID being set up as a new organisation. ORCID was incorporated in September 2010 and the first meeting of the board of directors of ORCID was on October 8th, 2010.&lt;/p>
&lt;p>A lot of people and organisations have contributed to getting ORCID to where it is today and it&amp;rsquo;s been great to be a part of it and continue to contribute to their future.&lt;/p>
&lt;blockquote>
&lt;p>Reflecting on the creation of ORCID: it has shown the power of collaboration in improving scholarly research, and in making life easier and better for researchers.&lt;/p>
&lt;/blockquote>
&lt;p>Today they &lt;a href="https://orcid.org/blog/2017/10/13/orcid5-coming" target="_blank">celebrate in a number of fun ways&lt;/a> and, in particular, mark the occasion with the release of &lt;a href="https://orcid.org/blog/2017/10/16/celebrating-orcid5-launch-new-resources" target="_blank">a new set of educational resources&lt;/a>.&lt;/p>
&lt;p>From everyone in the Crossref community, here&amp;rsquo;s to ORCID&amp;rsquo;s continuing success!&lt;/p></description></item><item><title>Changes to the 2018 membership agreement for better metadata distribution</title><link>https://www.crossref.org/blog/changes-to-the-2018-membership-agreement-for-better-metadata-distribution/</link><pubDate>Mon, 09 Oct 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/changes-to-the-2018-membership-agreement-for-better-metadata-distribution/</guid><description>&lt;p>We are making a change to section 9b of the standard Crossref membership agreement which will come into effect on January 1, 2018. This will not change how members register content, nor will it affect membership fees in any way. &lt;a href="https://www.crossref.org/membership/2018-agreement/">The new 2018 agreement is on our website&lt;/a>, and the exact wording changes are highlighted below. The new membership agreement will automatically replace the previous version from January 1, 2018 and members will not need to sign a new agreement.&lt;/p>
&lt;h2 id="whats-changing">What’s changing?&lt;/h2>
&lt;p>At its July meeting the &lt;a href="https://www.crossref.org/board-and-governance/">Crossref board&lt;/a> unanimously approved recommendations from the &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees Committee&lt;/a> to update Crossref’s metadata delivery offerings. One of the recommendations was to remove the option for case-by-case opt outs of metadata delivery through the OAI-PMH channel used for Enhanced Crossref Metadata Services.&lt;/p>
&lt;p>This opt-out was only used by a small number of our members (around 40 of nearly 9,000), who have been contacted directly. This means that for the vast majority of members there is no change in how Crossref makes their metadata available but we wanted to make everyone aware of the change to the membership agreement.&lt;/p>
&lt;p>So, as is currently the case, all metadata registered with Crossref is available via all the Metadata APIs under an appropriate agreement with the user or terms and conditions for the service. The one exception to this is how references are distributed - we will contact members next week about the options for references.&lt;/p>
&lt;h2 id="why-are-we-making-this-change">Why are we making this change?&lt;/h2>
&lt;ol>
&lt;li>Our &lt;a href="https://www.crossref.org/services/metadata-retrieval/">metadata services&lt;/a> have become very popular with users of all kinds throughout scholarly communications&amp;ndash;including search and discovery platforms, libraries, other publishers, reference managers, sharing services, and analytics providers. More and better metadata means more and better discoverability of publisher content.&lt;/li>
&lt;li>The change also brings this service into line with our mission to improve scholarly communications through quality metadata and related infrastructure services, removing the need for bilateral agreements between publishers and third parties.&lt;/li>
&lt;li>Many members complained when we contacted them about opt-outs whenever a new OAI-PMH user came on board. It is better for our members and for our staff if there is a common standard across the board.&lt;/li>
&lt;/ol>
&lt;h2 id="changes-to-2018-membership-agreement">Changes to 2018 membership agreement&lt;/h2>
&lt;p>&lt;strong>9) Sharing of Metadata by PILA&lt;/strong>&lt;/p>
&lt;p>a) &lt;em>Local Hosting&lt;/em>. [no change]&lt;/p>
&lt;p>b) &lt;em>Other Metadata Services&lt;/em>. Subject to compliance &lt;strong>by the entity receiving the Metadata and Digital Identifiers&lt;/strong> with the terms and conditions &lt;del>set forth in a separate agreement between&lt;/del> &lt;strong>established by&lt;/strong> PILA &lt;strong>for the particular service through which access is provided,&lt;/strong> and &lt;del>the entity receiving the Metadata and Digital Identifiers&lt;/del>, PILA may &lt;del>license&lt;/del> &lt;strong>authorize&lt;/strong> third parties to receive and use &lt;del>bulk deliveries of&lt;/del> Metadata and Digital Identifiers from &lt;del>the&lt;/del> PILA &lt;del>System from members who have chosen to participate in Metadata Services,&lt;/del> which PILA shall provide directly to such third parties. &lt;del>At least thirty (30) days prior to making such Metadata delivery PILA will notify each PILA Member whose Metadata and Digital Identifiers are intended to be included in such delivery of the anticipated delivery date, the identity of the third party and the purpose for which the delivery is being made. Metadata and Digital Identifiers belonging to any PILA Member who notifies PILA in writing prior to the specified delivery date of its desire to be excluded from such delivery will be excluded or removed from such delivery.&lt;/del>&lt;/p>
&lt;hr>
&lt;p>Please &lt;a href="mailto:member@crossref.org">contact our membership specialist&lt;/a> if you have any feedback or questions.&lt;/p></description></item><item><title>Using the Crossref REST API. Part 6 (with NLS)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-6-with-nls/</link><pubDate>Fri, 06 Oct 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-6-with-nls/</guid><description>&lt;p>Continuing our &lt;a href="https://www.crossref.org/categories/api-case-study/">blog series&lt;/a> highlighting the uses of Crossref metadata, we talked to Ulf Kronman, Bibliometric Analyst at the &lt;a href="http://www.kb.se/english/" target="_blank">National Library of Sweden&lt;/a> about the work they’re doing, and how they’re using our REST API as part of their workflow.&lt;/p>
&lt;h3 id="introducing-the-national-library-of-sweden-nls">Introducing the National Library of Sweden (NLS)&lt;/h3>
&lt;p>The NLS is a state agency, has a staff of about 320, and its main offices in Stockholm. Its primary duty is to preserve the Swedish cultural heritage by collecting everything printed in Sweden, and has been doing so since 1661. Nowadays the library also collects Swedish TV and radio programs, movies, videos, music, and computer games.&lt;/p>
&lt;p>The National Library coordinates services and programs for all publicly funded libraries in Sweden and runs the national library catalogue system Libris and the national database for Swedish scholarly output, SwePub. The library also runs the Bibsam consortium, negotiating national subscription licenses and open access publishing agreements with publishers.&lt;/p>
&lt;p>&lt;em>Images left to right: External and internal view of the National Library of Sweden, and Ulf Kronman, Bibliometric Analyst at NLS.&lt;/em>&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/nls-blog-image.png" alt="diptic image view NLS and Ulf Kronman Bibliometric Analyst" class="img-responsive" />
&lt;h3 id="what-problem-is-your-service-trying-to-solve">What problem is your service trying to solve?&lt;/h3>
&lt;p>The metadata in the national scholarly publication database &lt;a href="http://info.swepub.kb.se/bibliometri" target="_blank">SwePub&lt;/a> is harvested from the Swedish universities&amp;rsquo; local publication systems, where data often is entered manually by librarians and researchers. This means that the metadata can contain a lot of omissions, synonyms, spelling variants and errors. Using Crossref, we can enhance and correct the metadata delivered to us, if we just have a correct DOI.&lt;/p>
&lt;h3 id="can-you-tell-us-how-you-are-using-crossref-metadata-at-the-national-library-of-sweden">Can you tell us how you are using Crossref metadata at the National Library of Sweden?&lt;/h3>
&lt;p>The Crossref metadata is presently used in two projects; &lt;em>Open APC Sweden&lt;/em> and in our &lt;em>local analysis database&lt;/em> for publication statistics used in negotiations with publishers.&lt;/p>
&lt;p>Open APC Sweden is a pilot project to gather data on open access publication costs (APC&amp;rsquo;s – Article Processing Charges) from Swedish universities. The project is modelled from the German Bielefeld University Open APC initiative, which is a part of the &lt;a href="https://www.intact-project.org/openapc/" target="_blank">INTACT&lt;/a> project. After APC data has been delivered to the APC system, scripts are run against the Crossref API to fetch information about publishers and journals. &lt;a href="https://github.com/Kungbib/openapc-se/blob/master/README.md" target="_blank">A description of Open APC Sweden can be found here.&lt;/a>&lt;/p>
&lt;p>When building our local analysis database for publisher statistics, we download data from the SwePub database, use the Crossref DOIs for API lookup against Crossref to add correct ISSN and publisher data to the records and then match the records against a list of publisher serials. In this way, we can get information about how much Swedish researchers have been publishing with a certain publisher and use this data when negotiating conditions for open access publishing with the publisher in question.&lt;/p>
&lt;h3 id="what-metadata-values-do-you-pull-from-the-api">What metadata values do you pull from the API?&lt;/h3>
&lt;p>In Open APC Sweden, a Python script supplied by staff at the Bielefeld University is used to pull metadata about publisher and journal names and ISSN&amp;rsquo;s from the Crossref API. The result is entered into an enriched version of the APC data files delivered by the universities and then statistics can be calculated on the result using an R script. &lt;a href="https://github.com/Kungbib/openapc-se/blob/master/statistics.md" target="_blank">The result can be seen here&lt;/a>.&lt;/p>
&lt;p>In the local analysis database, a modified copy of the Bielefeld Python script is used to add the same metadata to the records before matching them against publisher serial ISSNs.&lt;/p>
&lt;h3 id="have-you-built-your-own-interface-to-extract-this-data">Have you built your own interface to extract this data?&lt;/h3>
&lt;p>In Open APC Sweden, the Python script is developed and maintained at the Bielefeld University and an exact copy is being run in the Swedish project.&lt;/p>
&lt;p>In the local analysis system, the Python script is somewhat modified to suit the special demands of this system.&lt;/p>
&lt;p>But sometimes it is very convenient just to use the main &lt;a href="https://www.doi.org/" target="_blank">DOI lookup&lt;/a> to do a manual check-up of problematic records.&lt;/p>
&lt;h3 id="how-often-do-you-extractquery-data">How often do you extract/query data?&lt;/h3>
&lt;p>In Open APC Sweden, usually about two-three times a month, when new datasets are delivered from the universities. In the local analysis database, usually lookups are being done on a daily basis as development of the database continues.&lt;/p>
&lt;h3 id="what-do-you-do-with-the-metadata-once-its-pulled-from-the-api">What do you do with the metadata once it’s pulled from the API?&lt;/h3>
&lt;p>In Open APC Sweden, the metadata is going into the APC data files for processing of statistics. In the local analysis database, the metadata is used to match against publisher journal ISSN&amp;rsquo;s.&lt;/p>
&lt;h3 id="what-plans-do-you-have-for-the-future">What plans do you have for the future?&lt;/h3>
&lt;p>For the Open APC Sweden I would like to build a database system to make the system more scalable than just working with flat data files.&lt;/p>
&lt;p>With both the SwePub system and the local analysis system, we are now using the new service oaDOI and their API to look up metadata about the open access status of the publications to enrich our local systems.&lt;/p>
&lt;h3 id="what-else-would-you-like-to-see-the-rest-api-offer">What else would you like to see the REST API offer?&lt;/h3>
&lt;p>In the process of normalising the publishers&amp;rsquo; names, the names returned are sometimes at a &amp;ldquo;too high&amp;rdquo; or on a too generic level to be used to generate good statistics. For instance, Springer Nature are sometimes returned as &lt;em>Springer Nature&lt;/em>, sometimes as &lt;em>Springer Science + Business Media&lt;/em> and sometimes as &lt;em>Nature Publishing Group&lt;/em>. A similar thing is valid for &lt;em>Taylor &amp;amp; Francis&lt;/em>, where the mother company &lt;em>Informa UK Limited&lt;/em> is returned instead of the publishing subsidiary of the company. One thing to wish for here is that we could agree on some kind of normalisation of the publishers&amp;rsquo; names and that Crossref could return this as a supplement to the present metadata.&lt;/p>
&lt;hr>
&lt;p>Thanks Ulf! If you would like to contribute a case study on the uses of Crossref Metadata APIs please contact the &lt;a href="mailto:feedback@crossref.org">Community team&lt;/a>.&lt;/p></description></item><item><title>Publishers, help us capture Events for your content</title><link>https://www.crossref.org/blog/publishers-help-us-capture-events-for-your-content/</link><pubDate>Mon, 02 Oct 2017 00:00:00 +0000</pubDate><author>Madeleine Watson</author><guid>https://www.crossref.org/blog/publishers-help-us-capture-events-for-your-content/</guid><description>&lt;p>The day I received my learner driver permit, I remember being handed three things: a plastic thermosealed reminder that age sixteen was not a good look on me; a yellow L-plate sign as flimsy as my driving ability; and a weighty ‘how to drive’ guide listing all the things that I absolutely must not, under any circumstances, even-if-it-seems-like-a-really-swell-idea-at-the-time, never, ever do.&lt;/p>
&lt;p>The margin space dedicated to finger-wagging left little room for championing any driving-do’s. And as each page delivered a fresh new warning, my enthusiasm for hitting the road sunk to levels usually reserved for activities like trigonometry and visits to my orthodontist.&lt;/p>
&lt;p>Many years (and an excellent driving record) later, I’m reminded of this again now when thinking about our own Event Data User Guide. Because it contains a chapter with some really important don&amp;rsquo;ts for our members. Really good, we’d-love-you-to-consider-not-doing-these-things type of advice. But despite our intent to encourage, I feel the ghost of finger-waggers past. So in the spirit of championing enthusiasm over ennui, I thought I’d attempt to contextualise our &lt;a href="https://www.eventdata.crossref.org/guide/best-practice/publishers-best-practice/" target="_blank">Event Data Best Practices Guide for Publishers&lt;/a> and show you why there’s a lot of good reasons for publishers to be enthusiastic about these rules.&lt;/p>
&lt;p>So if you’re a publisher, I encourage you to read on to learn more about how you can help us have the best chance possible of capturing Events for your content.&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>What&amp;rsquo;s in it for you? Well, collecting this data helps to give everyone (Crossref, yourself, and others) a better picture of how your content is being used, including for altmetrics.&lt;/span>
&lt;/div>
&lt;h3 id="1-please-let-us-in">1. Please let us in&lt;/h3>
&lt;p>Please do open the door when we come knocking, we promise not to stay long. You can do this by allowing the User Agent &lt;code>CrossrefEventDataBot&lt;/code> to visit your site, and whitelisting it if necessary. The bot is how we visit URLs to confirm if they are for an item of content registered with us. The reason why we’re visiting your site could include:&lt;/p>
&lt;ul>
&lt;li>someone tweeted an article landing page&lt;/li>
&lt;li>someone discussed it on Reddit&lt;/li>
&lt;li>it was linked to from a blog post&lt;/li>
&lt;/ul>
&lt;p>The Bot has only one job: to work out the DOI. No information beyond this is stored. Whenever we become aware of a link that we think points to a DOI or an Article Landing Page, we follow it so we can collect the required metadata. Everything in Crossref Event Data is linked via its DOI, so it&amp;rsquo;s important that we can collect this information.&lt;/p>
&lt;p>The bot will identify itself using the standard method. It sets two headers:&lt;/p>
&lt;ul>
&lt;li>Referer: &lt;a href="https://eventdata.crossref.org" target="_blank">https://eventdata.crossref.org&lt;/a>&lt;/li>
&lt;li>User-Agent: CrossrefEventDataBot (&lt;a href="mailto:eventdata@crossref.org">eventdata@crossref.org&lt;/a>)&lt;/li>
&lt;/ul>
&lt;p>Once we confirm that a link points to registered content, we then log an Event for the DOI. You should expect our bot to visit no more than once or twice per second, although if there is a period of activity around your articles, you may see higher rates. The bot also takes a sample of DOIs and visits them to work out which domain names belong to our members, so it can maintain a list. This can happen every few weeks. You may see a small number of requests from the bot, but limited to one per second.&lt;/p>
&lt;p>If we can’t enter your site to look for metadata though, then we won’t be able to collect Events for your DOIs. So by allowing our bot, you will be helping us to collect Event Data for your registered content.&lt;/p>
&lt;p>If you’re worried about traffic on your site, consider sending us your mapping of article landing pages to DOIs. Because &lt;a href="https://doi.org/10.64000/jw4t5-5yt89" target="_blank">Resource URLs aren&amp;rsquo;t the same as article landing pages&lt;/a>, we need more information than the DOI Resource URLs that you already send us.&lt;/p>
&lt;p>If you’re running a blog or website (and you’re not a member of Crossref), you may also see our bot visiting, to look for links that comprise Events. Please allow us to visit, so we can record in our Event Data service the fact that your website links to registered content.&lt;/p>
&lt;h3 id="2-we--robotstxt">2. We ❤️ robots.txt&lt;/h3>
&lt;p>Robots.txt files are important and we ensure our Event Data Bot respects yours. If we are instructed not to visit a site, we won&amp;rsquo;t. So if you want us to visit your site in order to check the metadata of your article landing page, please ensure you provide an exception for our Bot, or make sure that you’re not blocking it. Check the restrictions in your file to see if we’re allowed to visit. This is just another way you can help us work for you.&lt;/p>
&lt;h3 id="3-include-the-dc-identifier">3. Include the DC Identifier&lt;/h3>
&lt;p>Including good metadata is general best practice for scholarly publishing. When we visit a publisher’s site, we look for metadata embedded in the HTML document (such as DC.Identifier tags that, amongst other things, enable Crossmark to work).&lt;/p>
&lt;p>By ensuring you include a Dublin Core identifier meta tag in each of your articles pages, our system can match your landing pages back to DOIs.&lt;/p>
&lt;p>Here’s an example:&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/ced-blog-code.png" alt="example of code" width="550px"
class="img-responsive" />&lt;/p>
&lt;h3 id="4-let-us-in-even-if-we-dont-bring-cookies">4. Let us in, even if we don’t bring cookies&lt;/h3>
&lt;p>We’re like that friend who turns up for dinner without bringing a bottle of wine. And we hope that you’ll be ok with that. Some Publisher sites don&amp;rsquo;t allow browsers to visit unless cookies are enabled and they block visitors that don&amp;rsquo;t accept them. If your site does this, we will be unable to collect Events for your DOIs. Allowing your site to be accessed without cookies will help give us the best chance of successfully reading your metadata.&lt;/p>
&lt;h3 id="5-we-may-not-speak-your-language">5. We may not speak your language&lt;/h3>
&lt;p>Sometimes we come across a publisher’s site that won’t render unless JavaScript is enabled. This means that the site won’t show any content to browsers that don&amp;rsquo;t execute JavaScript. The Event Data Bot does not execute JavaScript when looking for a DOI. This means that if your site requires JavaScript, then we will be unable to collect DOIs for your Events. Consider allowing your site to be accessed without JavaScript. And if this is not possible, then if you ensure you include the &lt;meta name="dc.identifier"> tag in the HTML header, then we’ll do our best to collect Events for your registered content.&lt;/p>
&lt;p>If you want to pass this on to your friendly system administrator, the best practice is documented in full here: &lt;a href="https://www.eventdata.crossref.org/guide/best-practice/publishers-best-practice/" target="_blank">https://www.eventdata.crossref.org/guide/best-practice/publishers-best-practice/&lt;/a>. And sorry about all the don’ts you’ll find on that page…. don’t let them curb your enthusiasm for taking Event Data out for a spin!&lt;/p></description></item><item><title>BestBlogsRead</title><link>https://www.crossref.org/blog/bestblogsread/</link><pubDate>Tue, 26 Sep 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/bestblogsread/</guid><description>&lt;p>We know that &lt;strong>research communication happens everywhere&lt;/strong>, and we want your help in finding it!&lt;/p>
&lt;p>From October 9th we will be collecting links sent in by you through a social campaign across Twitter and Facebook called &lt;strong>#BestBlogsRead.&lt;/strong>&lt;/p>
&lt;p>&lt;strong>Simply send us links to the blogs YOU like to read&lt;/strong>
It’s easy to participate, all you have to do is watch out for the daily tweets and facebook posts and then send us links to the blogs (and news sites) you read.&lt;/p>
&lt;p>From gardening to gaming, recipes to rock climbing, tennis to taxidermy - whatever blogs you read, we want to hear about them!&lt;/p>
&lt;p>&lt;strong>Because research happens everywhere!&lt;/strong>
And you’ll be surprised where it &lt;strong>is&lt;/strong> mentioned - for example:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>We found &lt;a href="http://onlinelibrary.wiley.com/doi/10.1111/j.1600-0498.1969.tb00136.x/abstract?systemMessage=Wiley&amp;#43;Online&amp;#43;Library&amp;#43;will&amp;#43;be&amp;#43;unavailable&amp;#43;on&amp;#43;Saturday&amp;#43;7th&amp;#43;Oct&amp;#43;from&amp;#43;03.00&amp;#43;EDT&amp;#43;%2F&amp;#43;08%3A00&amp;#43;BST&amp;#43;%2F&amp;#43;12%3A30&amp;#43;IST&amp;#43;%2F&amp;#43;15.00&amp;#43;SGT&amp;#43;to&amp;#43;08.00&amp;#43;EDT&amp;#43;%2F&amp;#43;13.00&amp;#43;BST&amp;#43;%2F&amp;#43;17%3A30&amp;#43;IST&amp;#43;%2F&amp;#43;20.00&amp;#43;SGT&amp;#43;and&amp;#43;Sunday&amp;#43;8th&amp;#43;Oct&amp;#43;from&amp;#43;03.00&amp;#43;EDT&amp;#43;%2F&amp;#43;08%3A00&amp;#43;BST&amp;#43;%2F&amp;#43;12%3A30&amp;#43;IST&amp;#43;%2F&amp;#43;15.00&amp;#43;SGT&amp;#43;to&amp;#43;06.00&amp;#43;EDT&amp;#43;%2F&amp;#43;11.00&amp;#43;BST&amp;#43;%2F&amp;#43;15%3A30&amp;#43;IST&amp;#43;%2F&amp;#43;18.00&amp;#43;SGT&amp;#43;for&amp;#43;essential&amp;#43;maintenance.&amp;#43;Apologies&amp;#43;for&amp;#43;the&amp;#43;inconvenience&amp;#43;caused&amp;#43;." target="_blank">a Wiley&lt;/a> article mentioned in a blog about &lt;a href="http://www.atlasobscura.com/articles/thales-predicts-eclipse-mystery-ancient-greece?utm_source=Atlas&amp;#43;Obscura&amp;#43;Daily&amp;#43;Newsletter&amp;amp;utm_campaign=810eff404b-EMAIL_CAMPAIGN_2017_08_09&amp;amp;utm_medium=email&amp;amp;utm_term=0_f36db9c480-810eff404b-66765933&amp;amp;ct=t%28Newsletter_8_9_2017%29&amp;amp;mc_cid=810eff404b&amp;amp;mc_eid=4e0067d656" target="_blank">the eclipse&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>An &lt;a href="http://pubs.acs.org/doi/abs/10.1021/tx9002726" target="_blank">American Chemical Society&lt;/a> article in a blog about &lt;a href="http://www.allergy-insight.com/free-from-at-bellavita/" target="_blank">food allergies &lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>A blog about Neanderthals on the &lt;a href="https://www.theatlantic.com/science/archive/2017/09/neanderthals-lost-history/540507/" target="_blank">Atlantic&lt;/a> links to and article from the &lt;a href="http://doi.org/10.1126/science.1174462" target="_blank">American Association for the Advancement of Science&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>So, watch out for the campaign on Twitter and Facebook, and tell us about your #BestBlogsRead.&lt;/p></description></item><item><title>Crossref at the Frankfurt Book Fair</title><link>https://www.crossref.org/blog/crossref-at-the-frankfurt-book-fair/</link><pubDate>Tue, 26 Sep 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/crossref-at-the-frankfurt-book-fair/</guid><description>&lt;p>We’ll be at booth M82 in the Hotspot area of Hall 4.2 and would love to meet with you. Let us know if you’re interested in chatting with one of us - about anything at all.&lt;/p>
&lt;p>&lt;a href="mailto:feedback@crossref.org">Kirsty Meddings&lt;/a>, &lt;strong>Product Manager&lt;/strong>: Here to help with Crossref services such as Crossmark and funding data, and happy to talk about your metadata and how you can deposit more.&lt;/p>
&lt;p>&lt;a href="mailto:pdavis@crossref.org">Paul Davis&lt;/a>, &lt;strong>Support Specialist&lt;/strong>: Any issues with metadata deposit, or anything technical, I’m your man.&lt;/p>
&lt;p>&lt;a href="mailto:scollins@crossref.org">Susan Collins&lt;/a>, &lt;strong>Publisher Outreach Manager:&lt;/strong> If you’re a member and have questions about how things are going, or try out additional services, I can help.&lt;/p>
&lt;p>&lt;a href="mailto:feedback@crossref.org">Jennifer Kemp&lt;/a>, &lt;strong>Affiliate Outreach Manager&lt;/strong>: Come to me if you want to get Metadata from Crossref, or discuss our imminent new service for social mentions and data links: &lt;a href="https://doi.org/10.64000/cbcne-j1d05" target="_blank">Event Data (in Beta)&lt;/a>.&lt;/p>
&lt;p>&lt;a href="mailto:ghendricks@crossref.org">Ginny Hendricks&lt;/a>, &lt;strong>Member &amp;amp; Community Outreach Director&lt;/strong>: I’d love to talk to publishers and platforms about the new &lt;a href="http://www.metadata2020.org" target="_blank">Metadata 2020&lt;/a> initiative.&lt;/p>
&lt;p>&lt;a href="mailto:abartell@crossref.org">Amanda Bartell&lt;/a>, &lt;strong>Head of Member Experience&lt;/strong>: This will be my first day at Crossref! If there is something you’d like the Membership team to do or change, please let me know.&lt;/p>
&lt;p>&lt;a href="mailto:feedback@crossref.org">Chrissie Cormack-Wood&lt;/a>, &lt;strong>Head of Marketing Communications&lt;/strong>: I’ll be acting as &amp;ldquo;host&amp;rdquo; so ask me anything about our booth and activities at the Fair. Ideas for joint campaigns or co-promotion are welcome too.&lt;/p>
&lt;p>If some of these topics are on your agenda, or if you’re not sure who to contact, &lt;a href="mailto:feedback@crossref.org">please let me know&lt;/a> and I’ll set up a 30-minute meeting at our booth, M82 in Hall 4.2.&lt;/p>
&lt;div class="shortcode-divwrap blue-highlight">
&lt;span>And, if you don’t get a chance to visit us at our stand, make sure you don’t miss Ginny’s &lt;a href="http://www.metadata2020.org" target="_blank">Metadata 20/20&lt;/a> talk at 2.30pm on Wednesday 11th, at the Hot Spot stage in the corner of Hall 4.2, area N99.&lt;/span>
&lt;/div>
&lt;p>We hope you have a great Book Fair!&lt;/p></description></item><item><title>Event Data as Underlying Altmetrics Infrastructure at the 4:AM Altmetrics Conference</title><link>https://www.crossref.org/blog/event-data-as-underlying-altmetrics-infrastructure-at-the-4am-altmetrics-conference/</link><pubDate>Mon, 25 Sep 2017 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/event-data-as-underlying-altmetrics-infrastructure-at-the-4am-altmetrics-conference/</guid><description>&lt;p>I&amp;rsquo;m here in Toronto and looking forward to a busy week. Maddy Watson and I are in town for the &lt;a href="https://www.altmetric.com/events/" target="_blank">4:AM Altmetrics Conference&lt;/a>, as well as the altmetrics17 workshop and Hack-day. I&amp;rsquo;ll be speaking at each, and for those of you who aren&amp;rsquo;t able to make it, I&amp;rsquo;ve combined both presentations into a handy blog post, which follows on from &lt;a href="https://doi.org/10.64000/3jrqv-85z62" target="_blank">my last one&lt;/a>.&lt;/p>
&lt;p>But first, nothing beats a good demo. &lt;a href="https://live.eventdata.crossref.org/live.html" target="_blank">Take a look at our live stream&lt;/a>. This shows the Events passing through Crossref Event Data, live, as they happen. You may need to wait a few seconds before you see anything.&lt;/p>
&lt;h2 id="crossref-and-scholarly-links">Crossref and scholarly links&lt;/h2>
&lt;p>You may know about Crossref. If you don&amp;rsquo;t, we are a non-profit organisation that works with Publishers (getting on for nine thousand) to register scholarly publications, issue Persistent Identifiers (DOIs) and maintain the infrastructure required to keep them working. If you don&amp;rsquo;t know what a DOI is, it&amp;rsquo;s a link that looks like this:&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.5555/12345678" target="_blank">https://doi.org/10.5555/12345678&lt;/a>&lt;/p>
&lt;p>When you click on that, you&amp;rsquo;ll be taken to the landing page for that article. If the landing page moves, the DOI can be updated so you&amp;rsquo;re taken to the right place. This is why Crossref was created in the first place: to register Persistent Identifiers to combat link rot and to allow Publishers to work together and cite each other&amp;rsquo;s content. A DOI is a single, canonical identifier that can be used to refer to scholarly content.&lt;/p>
&lt;p>Not only that, we combine that with metadata and links. Links to authors via ORCIDs, references and citations via DOIs, funding bodies and grant numbers, clinical trials&amp;hellip; the list goes on. All of this data is provided by our members and most of it is made available via our free API.&lt;/p>
&lt;p>Because we are the central place that publishers register their content, and we&amp;rsquo;ve got approaching 100 million items of Registered Content, we thought that we could also curate and collect altmetrics type data for our corpus of publications. After all, a reference from a Tweet to an article is a link, just like a citation between two articles is a link.&lt;/p>
&lt;h2 id="an-experiment">An Experiment&lt;/h2>
&lt;p>So, a few years back we thought we would try and track altmetrics for DOIs. This was done as a Crossref Labs experiment. We grabbed a copy of PLOS ALM (since renamed Lagotto), loaded a sample of DOIs into it and watched as it struggled to keep up.&lt;/p>
&lt;p>It was a good experiment, as it showed that we weren&amp;rsquo;t asking exactly the right questions. There were a few things that didn&amp;rsquo;t quite fit. Firstly, it required every DOI to be loaded into it up-front, and, in some cases, for the article landing page for every DOI to be known. This doesn&amp;rsquo;t scale to tens of millions. Secondly, it had to scan over every DOI on a regular schedule and make an API query for each one. That doesn&amp;rsquo;t scale either. Thirdly, the kind of data it was requesting was usually in the form of a count. It asked the question:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;How many tweets are there for this article as of today?&amp;rdquo;&lt;/p>
&lt;/blockquote>
&lt;p>This fulfilled the original use case for PLOS ALM at PLOS. But when running it at Crossref, on behalf of every publisher out there, the results raised more questions than they answered. Which was good, because it was a Labs Experiment.&lt;/p>
&lt;h2 id="asking-the-right-question">Asking the right question&lt;/h2>
&lt;p>The whole journey to Crossref Event Data has been a process of working out how to ask the right question. There are a number of ways in which &amp;ldquo;How many tweets are there for this article as of today?&amp;rdquo; isn&amp;rsquo;t the right question. It doesn&amp;rsquo;t answer:&lt;/p>
&lt;ul>
&lt;li>Tweeted by who? What about bots?&lt;/li>
&lt;li>Tweeted how? Original Tweets? Retweets?&lt;/li>
&lt;li>What was tweeted? The DOI? The article landing page? Was there extra text?&lt;/li>
&lt;li>When did the tweet occur?&lt;/li>
&lt;/ul>
&lt;p>We took one step closer toward the right question. Instead of asking &amp;ldquo;how many tweets for this article are there as of today&amp;rdquo; we asked:&lt;/p>
&lt;blockquote>
&lt;p>&amp;ldquo;What activity is happening on Twitter concerning this article?&amp;rdquo;&lt;/p>
&lt;/blockquote>
&lt;p>If we record each activity we can include information that answers all of the above questions. So instead of collecting data like this:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Registered Content&lt;/th>
&lt;th>Source&lt;/th>
&lt;th>Count&lt;/th>
&lt;th>Date&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>10.5555/12345678&lt;/td>
&lt;td>twitter&lt;/td>
&lt;td>20&lt;/td>
&lt;td>2017-01-01&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.5555/87654321&lt;/td>
&lt;td>twitter&lt;/td>
&lt;td>5&lt;/td>
&lt;td>2017-01-15&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>10.5555/12345678&lt;/td>
&lt;td>twitter&lt;/td>
&lt;td>23&lt;/td>
&lt;td>2017-02-01&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>We&amp;rsquo;re collecting data like this:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Subject&lt;/th>
&lt;th>Relation&lt;/th>
&lt;th>Object&lt;/th>
&lt;th>Source&lt;/th>
&lt;th>Date&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>twitter.com/tweet/1234&lt;/td>
&lt;td>references&lt;/td>
&lt;td>10.5555/12345678&lt;/td>
&lt;td>twitter&lt;/td>
&lt;td>2017-01-01&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>twitter.com/tweet/5678&lt;/td>
&lt;td>references&lt;/td>
&lt;td>10.5555/987654321&lt;/td>
&lt;td>twitter&lt;/td>
&lt;td>2017-01-11&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>twitter.com/tweet/9123&lt;/td>
&lt;td>references&lt;/td>
&lt;td>10.5555/12345678&lt;/td>
&lt;td>twitter&lt;/td>
&lt;td>2017-02-06&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Now we&amp;rsquo;re collecting individual links between tweets and DOIs, we&amp;rsquo;re closer to all the other kinds of links that we store. It&amp;rsquo;s like the &amp;ldquo;traditional&amp;rdquo; links that we already curate except:&lt;/p>
&lt;ol>
&lt;li>It&amp;rsquo;s not provided by publishers, we have to go and collect it ourselves.&lt;/li>
&lt;li>It comes from a very diverse range of places, e.g. Twitter, Wikipedia, Blogs, Reddit, random web pages&lt;/li>
&lt;li>The places that the Events do come from don&amp;rsquo;t play by the normal rules. &lt;strong>Web pages work differently to articles.&lt;/strong>&lt;/li>
&lt;/ol>
&lt;h2 id="non-traditional-publishing-is-untraditional">Non-traditional Publishing is Untraditional&lt;/h2>
&lt;p>This last point caused us to scratch our heads for a bit. We used to collect links within the &amp;rsquo;traditional&amp;rsquo; scholarly literature. Generally, journal articles:&lt;/p>
&lt;ul>
&lt;li>get published once&lt;/li>
&lt;li>have a publisher looking after them, who can produce structured metadata&lt;/li>
&lt;li>are subject to a formal process of retractions or updates&lt;/li>
&lt;/ul>
&lt;p>Now we&amp;rsquo;re collecting links between things that aren&amp;rsquo;t seen as &amp;rsquo;traditional&amp;rsquo; scholarship and don&amp;rsquo;t play by the rules.&lt;/p>
&lt;p>The first thing we found is that blog authors don&amp;rsquo;t reference the literature using DOIs. Instead they use article landing pages. This meant that we had to put in the work to collect links to article landing pages and turn them back into DOIs so that they can be referenced in a stable, link-rot-proof way.&lt;/p>
&lt;p>When we looked at Wikipedia we noticed that, as pages are edited, references are added and removed all the time. If our data set reflected this, it would have to evolve over time, with items popping into existence and then vanishing again. This isn&amp;rsquo;t good.&lt;/p>
&lt;p>Our position in the scholarly community is to provide data and infrastructure that others can use to create services, enrich and build things. Curating an ever changing data set, where things can disappear, is not a great idea and is hard to work with.&lt;/p>
&lt;p>We realised that a plain old link store (also known as an assertion store, triple store, etc.) wasn&amp;rsquo;t the right approach as it didn&amp;rsquo;t capture the nuance in the data with sufficient transparency. At least, it didn&amp;rsquo;t tell the whole picture.&lt;/p>
&lt;p>We settled on a new architecture, and Crossref Event Data as we now know it was born. Instead of a dataset that changes over time, we have a continual stream of Events, where each Event tells a new part of the story. An Event is true at the time it is published, but if we find new information we don&amp;rsquo;t edit Events, we add new ones.&lt;/p>
&lt;p>An Event is the way that we tell you that we observed a link. It includes the link, in &amp;ldquo;subject - relation type - object&amp;rdquo; format, but so much more. We realised that one question won&amp;rsquo;t do, so Events now answer the following questions:&lt;/p>
&lt;ul>
&lt;li>What links to what?&lt;/li>
&lt;li>How was the link made? Was it with a article&amp;rsquo;s DOI or straight to an Article landing page?&lt;/li>
&lt;li>Which Agent collected it?&lt;/li>
&lt;li>Which data source were they looking at?&lt;/li>
&lt;li>When was the link observed?&lt;/li>
&lt;li>When do we think the link actually happened?&lt;/li>
&lt;li>What algorithms were used to collect it?&lt;/li>
&lt;li>How do you know?&lt;/li>
&lt;/ul>
&lt;p>I&amp;rsquo;ll come back to the &amp;ldquo;how do you know&amp;rdquo; a bit later.&lt;/p>
&lt;h2 id="what-is-an-altmetrics-event">What is an altmetrics Event?&lt;/h2>
&lt;p>So, an Event is a package that contains a link plus lots of extra information required to interpret and make sense of it. But how do we choose what comprises an Event?&lt;/p>
&lt;p>An Event is created every time we notice an interaction between something we can observe out on the web and a piece of registered content. This simple description gives rise to some interesting quirks.&lt;/p>
&lt;p>It means that every time we see a tweet that mentions an article, for example, we create an Event. If a tweet mentions two articles, there are two events. That means that &amp;ldquo;the number of Twitter events&amp;rdquo; is not the same as &amp;ldquo;the number of tweets&amp;rdquo;.&lt;/p>
&lt;p>It means that every time we see a link to a piece of registered content in a webpage, we create an Event. The Event Data system currently tries to visit each webpage once, but we reserve the right to visit a webpage more than once. This means that the number of Events for a particular webpage doesn&amp;rsquo;t mean there are that many references.&lt;/p>
&lt;p>We might go back and check a webpage in future to see if it still has the same links. If it does, we might generate a new set of Events to indicate that.&lt;/p>
&lt;p>Because of the evolving nature of Wikipedia, we attempt to visit every page revision and document the links we find. This means that if an article has a very active edit history, and therefore a large number of edits, we will see repeated Events to the literature, once for every version of the page that makes references. So the number of Events in Wikipedia doesn&amp;rsquo;t mean the number of references.&lt;/p>
&lt;p>An Event is created every time we notice an interaction. Each source (Reddit, Wikipedia, Twitter, blogs, the web at large) has different quirks, and you need to understand the underlying source in order to understand the Events.&lt;/p>
&lt;h2 id="we-put-the-choice-into-your-hands">We put the choice into your hands.&lt;/h2>
&lt;p>If you want to create a metric based on counting things, you have a lot of decisions to make. Do you care about bots? Do you care about citation rings? Do you care about retweets? Do you care about whether people use DOIs or article landing pages? Do you care what text people included in their tweet? The answer to each of these questions means that you&amp;rsquo;ll have to look at each data point and decide to put a weighting or score on it.&lt;/p>
&lt;p>If you wanted to measure how blogged about a particular article was, you would have to look at the blogs to work out if they all had unique content. For example, Google&amp;rsquo;s Blogger platform can publish the same blog post under multiple domain names.&lt;/p>
&lt;p>A blog full of link spam is still a blog. You may be doing a study into reputable blogs, so you may want to whitelist the set of domain names to exclude less reputable blogs. Or you may be doing a study into blog spam, so lower quality blogs is precisely what you&amp;rsquo;re interested in,&lt;/p>
&lt;p>If you wanted to measure how discussed an article was on Reddit, you might want to go to the conversation and see if people were actually talking about it, or whether it was an empty discussion. You might want to look at the author of the post to see if they were a regular poster, whether they were a bot or an active member of the community.&lt;/p>
&lt;p>If you wanted to measure how referenced an article was in Wikipedia, you might want to look at the history of each reference to see if it was deleted immediately. Or if it existed for 50% of the time, and to give a weighting.&lt;/p>
&lt;p>We don&amp;rsquo;t do any scoring, we just record everything we observe. We know that everyone will have different needs, be producing different outcomes and use different methodologies. So it&amp;rsquo;s important that we tell you everything we know.&lt;/p>
&lt;p>So that&amp;rsquo;s an Event. It&amp;rsquo;s not just a link, it&amp;rsquo;s the observation of a link, coupled with extra information to help you understand it.&lt;/p>
&lt;h2 id="how-do-you-know">How do you know?&lt;/h2>
&lt;p>But what if the Event isn&amp;rsquo;t enough? To come back to the earlier question, &amp;ldquo;how do you know?&amp;rdquo;&lt;/p>
&lt;p>Events don&amp;rsquo;t exist in isolation. Data must be collected and processed. Each Agent in Crossref Event Data monitors a particular data source and feeds data into the system, which goes and retrieves webpages so it can make observations. Things can go wrong.&lt;/p>
&lt;p>Any one of these things might prevent an Event from being collected:&lt;/p>
&lt;ul>
&lt;li>We might not know about a particular DOI prefix immediately after it&amp;rsquo;s registered.&lt;/li>
&lt;li>We might not know about a particular landing page domain for a new member immediately.&lt;/li>
&lt;li>Article landing pages might not have the right metadata, so we can&amp;rsquo;t match them to DOIs.&lt;/li>
&lt;li>Article landing pages might block the Crossref bot, so we can&amp;rsquo;t match DOIs.&lt;/li>
&lt;li>Article landing pages might require cookies, or convoluted JavaScript, so the bot can&amp;rsquo;t get the content.&lt;/li>
&lt;li>Blogs and webpages might require cookies or JavaScript to execute.&lt;/li>
&lt;li>Blogs might block the Event Data bot.&lt;/li>
&lt;li>A particular API might have been unavailable for a period of time.&lt;/li>
&lt;li>We didn&amp;rsquo;t know about a particular blog newsfeed at the time.&lt;/li>
&lt;/ul>
&lt;p>This is a fact of life, and we can only operate on a best-effort basis. If we don&amp;rsquo;t have an Event, it doesn&amp;rsquo;t mean it didn&amp;rsquo;t happen.&lt;/p>
&lt;p>This doesn&amp;rsquo;t mean that we just give up. Our system generates copious logs. It details every API call it made, the response it got, every scan it made, every URL it looked at. This amounts to about a gigabyte of data per day. If you want to find out why there was no Wikipedia data at a given point in time, you can go back to the log data and see what happened. If you want to see why there was no Event for an article by publisher X, you can look at the logs and see, for example, that Publisher X prevented the bot from visiting.&lt;/p>
&lt;p>Every Event that does exist has a link to an Evidence Record, which corresponds with the logs. The Evidence Record tells you:&lt;/p>
&lt;ul>
&lt;li>which version of the Agent was running&lt;/li>
&lt;li>which Artifacts and versions it was working from&lt;/li>
&lt;li>which API requests were made&lt;/li>
&lt;li>which inputs looked like possible links&lt;/li>
&lt;li>which matched or failed&lt;/li>
&lt;li>which Events were generated&lt;/li>
&lt;/ul>
&lt;p>Artifacts are versioned files that contain information that Agents use. For example, there&amp;rsquo;s a list of domain names, a list of DOI prefixes, a list of blog feed urls, and so on. By indicating which version of these Artifacts were used, we can explain why we visited a certain domain and not another.&lt;/p>
&lt;p>All the code is open source. The Evidence Record says which version of each Agent was running so you can see precisely which algorithms were used to generate the data.&lt;/p>
&lt;p>Between the Events, Evidence Records, Evidence Logs, Artifacts and Open Source software, we can pinpoint precisely how the system behaved and why. If you have any questions about how a given Event was (or wasn&amp;rsquo;t) generated, every byte of explanation is freely available.&lt;/p>
&lt;p>This forms our &amp;ldquo;Transparency first&amp;rdquo; idea. We start the whole process with an open Artifact Registry. Open source software then produces open Evidence Records. The Evidence Record is then consulted and turned into Events. All the while, copious logs are being generated. We&amp;rsquo;ve designed the system to be transparent, and for each step to be open to inspection.&lt;/p>
&lt;p>We&amp;rsquo;re currently in Beta. We have over thirty million Events in our API, and they&amp;rsquo;re just waiting for you to use them!&lt;/p>
&lt;p>Head over to the &lt;a href="https://www.eventdata.crossref.org/guide/" target="_blank">User Guide&lt;/a> and get stuck in!&lt;/p>
&lt;p>If you are in Toronto, come and say hi to Maddy or me.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/people/joe-wass/">&lt;img src="https://www.crossref.org/images/staff/joe_720px.jpg" width="200px">&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/people/madeleine-watson/">&lt;img src="https://www.crossref.org/images/staff/maddy17-720px.png" width="200px">&lt;/a>&lt;/p></description></item><item><title>Organisation Identifier Working Group Update</title><link>https://www.crossref.org/blog/organisation-identifier-working-group-update/</link><pubDate>Mon, 18 Sep 2017 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/organisation-identifier-working-group-update/</guid><description>&lt;p>About 1 year ago, Crossref, DataCite and ORCID [announced a joint initiative] (&lt;a href="https://orcid.org/blog/2016/10/31/organisation-identifier-project-way-forward" target="_blank">https://orcid.org/blog/2016/10/31/organisation-identifier-project-way-forward&lt;/a>) to launch and sustain an open, independent, non-profit organisation identifier registry to facilitate the disambiguation of researcher affiliations. Today we publish governance recommendations and product principles and requirements for the creation of an open, independent organisation identifier registry and invite community feedback.&lt;/p>
&lt;p>The &lt;a href="https://orcid.org/content/organisation-identifier-working-group" target="_blank">organisation Identifier (OrgID) Working Group&lt;/a> was established as &lt;a href="https://orcid.org/blog/2016/10/31/organisation-identifier-project-way-forward" target="_blank">a joint effort by Crossref, DataCite and ORCID&lt;/a> in January 2017. The members of the group bring a broad range of experience and perspectives, including expertise in research data discovery, data management, persistent identifiers, economics research, funding, archiving, non-profit membership organisations, academia, publishing, and metadata development.&lt;/p>
&lt;p>The Working Group was charged with refining the structure, principles, and technology specifications for an open, independent, non-profit organisation identifier registry to facilitate the disambiguation of researcher affiliations.&lt;/p>
&lt;p>The group has been working in three interdependent areas: Governance, Registry Product Definition, and Business Model &amp;amp; Funding, and today releases for public comment its findings and recommendations for governance and product requirements.&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://figshare.com/articles/ORG_ID_WG_Governance_Principles_and_Recommendations/5402002/1" target="_blank">Governance Recommendations&lt;/a> - &lt;a href="https://doi.org/10.23640/07243.5402002.v1" target="_blank">https://doi.org/10.23640/07243.5402002.v1&lt;/a>&lt;br>&lt;/li>
&lt;li>&lt;a href="https://figshare.com/articles/ORG_ID_WG_Product_Principles_and_Recommendations/5402047/1" target="_blank">Product Principles and Recommendations&lt;/a> - &lt;a href="https://doi.org/10.23640/07243.5402047.v1" target="_blank">https://doi.org/10.23640/07243.5402047.v1&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>We invite your feedback!&lt;/p>
&lt;p>Please &lt;a href="mailto:oi-project@orcid.org">send comments&lt;/a> by October 15th, 2017.&lt;/p></description></item><item><title>PIDapalooza is back and wants your PID stories</title><link>https://www.crossref.org/blog/pidapalooza-is-back-and-wants-your-pid-stories/</link><pubDate>Thu, 14 Sep 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/pidapalooza-is-back-and-wants-your-pid-stories/</guid><description>&lt;p>Now in its second year, this “open festival of persistent identifiers” brings together people from all walks of life who have something to say about PIDs. If you work with them, develop with them, measure or manage them, let us know your PID adventures, pitfalls, and plans by submitting a talk by September 18. It&amp;rsquo;ll be in Girona, Spain, January 23-24, 2018.&lt;/p>
&lt;p>One of the great strengths of last year’s PIDapalooza was the number of people who spoke and all the conversations that were kindled. &lt;strong>So if you&amp;rsquo;re thinking of going, we encourage you to &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSdR7TGVGMRUVVgMejMqJhgKa8xdL-GDGyv97g_RSRumBAjgTg/viewform" target="_blank">propose a talk&lt;/a>, so we can hear what you&amp;rsquo;re working on and you can get some feedback&lt;/strong>.&lt;/p>
&lt;p>At the inaugural PIDapalooza event Crossref took to the stage twice, with Ed Pentz covering Org IDs and Joe Wass talking about Event Data.&lt;/p>
&lt;p>Here we have Joe’s memories of the event and Ed’s update on the Org ID status.&lt;/p>
&lt;h3 id="joe-wass-reflects">Joe Wass reflects:&lt;/h3>
&lt;p>At Crossref, the subject of Persistent Identifiers is something we care deeply about, and linking between DOIs, ORCID iDs and other identifiers is the reason we get up in the morning. But a whole conference dedicated to them? If I&amp;rsquo;m honest, the first time I heard about PIDapalooza I thought the subject was rather niche.&lt;/p>
&lt;p>How wrong I was. It turns out there are people from all walks of life who care about &amp;ldquo;things&amp;rdquo; using persistent identifiers to link, describe and reference them. There was a great balance between presenters and attendees, and the programme meant that lots of people had a chance to speak. We heard about identifiers for research vessels, pieces of scientific equipment, individual bottles of milk, plus the usual subjects like scholarly publishing, datasets, organisations and funders, and how to cite them.&lt;/p>
&lt;p>Between sessions we chatted over a wide range of subjects, noted similarities between subject areas, offered advice and exchanged ideas. Who knew this stuff was all related?&lt;/p>
&lt;h3 id="ed-pentz-on-plans-for-the-new-organisation-ids">Ed Pentz on plans for the new organisation IDs&lt;/h3>
&lt;p>An important presentation at the 2016 PIDapalooza meeting was on organisation identifiers. A week before the conference Crossref, DataCite and ORCID released three documents for public comment outlining a proposed way forward. The goal is launch and sustain an open, independent, non-profit organisation identifier registry to facilitate the disambiguation of researcher affiliations. At the packed PIDapalooza session Crossref, DataCite and ORCID gave an update on their work over the previous year and their proposals going forward.&lt;/p>
&lt;p>There was a lively discussion and debate about the issues. Following the meeting the three organisations set up the OI Project Working Group with a broad group of stakeholders. The group has been meeting over the last year and will release two documents next week - a set of Governance Recommendations and Product Principles and Recommendations for community feedback. So watch this space.&lt;/p>
&lt;p>The PIDapalooza conference really helped galvanize the work in this area by bringing together a broad range of people interested in persistent identifiers. If you have an idea about PIDs, please come and tell us about it.&lt;/p>
&lt;hr>
&lt;p>Check out the &lt;a href="https://pidapalooza.figshare.com/" target="_blank">decks from last year's talks&lt;/a>, the &lt;a href="https://www.pidapalooza.org/" target="_blank">PIDapalooza website&lt;/a> with all the info, and &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSdR7TGVGMRUVVgMejMqJhgKa8xdL-GDGyv97g_RSRumBAjgTg/viewform" target="_blank">sumbit a proposal for your talk before September 18&lt;/a>.&lt;/p></description></item><item><title>Making peer reviews citable, discoverable, and creditable</title><link>https://www.crossref.org/blog/making-peer-reviews-citable-discoverable-and-creditable/</link><pubDate>Mon, 11 Sep 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/making-peer-reviews-citable-discoverable-and-creditable/</guid><description>&lt;p>A number of our members have asked if they can register their peer reviews with us. They believe that discussions around scholarly works should have DOIs and be citable to provide further context and provenance for researchers reading the article. To that end, we can announce some pertinent news as we enter &lt;a href="https://peerreviewweek.wordpress.com" target="_blank">Peer Review Week 2017 &lt;/a>: Crossref infrastructure is soon to be extended to manage DOIs for peer reviews. Launching next month will be support for this new resource/record type, with schema specifically dedicated to the reviews and discussions of scholarly content.&lt;/p>
&lt;p>Not disimilar to other registered resources (datasets, working papers, preprints, translations, etc.) publication peer reviews are important scholarly contributions in their own right and form a part of the scholarly record. In addition to the members who have been registering them, many more are looking to better handle these contributions and give recognition to this process which is so critical to maintaining scientific quality.&lt;/p>
&lt;p>Here are a few examples of existing Crossref DOIs for peer reviews: &lt;a href="https://doi.org/10.1016/j.engfracmech.2015.01.019" target="_blank">https://doi.org/10.1016/j.engfracmech.2015.01.019&lt;/a> and &lt;a href="https://doi.org/10.5194/wes-1-177-2016" target="_blank">https://doi.org/10.5194/wes-1-177-2016&lt;/a> and &lt;a href="https://doi.org/10.14322/PUBLONS.R518142" target="_blank">https://doi.org/10.14322/PUBLONS.R518142&lt;/a>.&lt;/p>
&lt;p>We are extending our infrastructure to support all members who make these scholarly discussions available to readers. To accommodate a wide range of publisher practices, this will include a range of outputs made publicly available from the peer review history, across any and all review rounds, including referee reports, decision letters, and author responses. Members will be able to include not only scholarly discussions of journal articles before but also after publication (e.g. “post-publication reviews”).&lt;/p>
&lt;p>Central to this new feature of the Crossref &lt;a href="https://www.crossref.org/services/content-registration">Content Registration&lt;/a> service is the special set of metadata dedicated to supporting the discovery and investigation of peer reviews as it is linked up to the article discussed. The peer review schema will provide a characterization of the peer review asset (for example: recommendation, type, license, contributor info, competing interests) as well as offer a view into the review process (e.g. pre/post-publication, revision round, review date).&lt;/p>
&lt;h3 id="our-custom-support-for-peer-reviews-will-ensure-that">Our custom support for peer reviews will ensure that:&lt;/h3>
&lt;ul>
&lt;li>Readers can see provenance and get context of a work&lt;/li>
&lt;li>Links to this content persist over time&lt;/li>
&lt;li>The metadata is useful&lt;/li>
&lt;li>They are connected to the full history of the published results&lt;/li>
&lt;li>Contributors are given credit for their work (we will ask for ORCID iDs)&lt;/li>
&lt;li>The citation record is clear and up-to-date.&lt;/li>
&lt;/ul>
&lt;p>As with all the content registered with Crossref, we will make peer review metadata available for machine and human access, across multiple interfaces (e.g. &lt;a href="https://api.crossref.org/" target="_blank">REST API&lt;/a>, &lt;a href="https://support.crossref.org/hc/en-us/articles/213679866-OAI-PMH-subscriber-only-" target="_blank">OAI-PMH&lt;/a>, &lt;a href="https://search.crossref.org/" target="_blank">Crossref Metadata Search&lt;/a>) to enable discoverability across the research ecosystem. This metadata may also support enrichment of scholarly discussion, reviewer accountability, publishing transparency, analysis or research on peer reviews, and so on.&lt;/p>
&lt;p>To reflect the nature of this special content, we will bundle the fees for peer review content fees into the cost of registering the article for members who publish the journal article and its peer reviews. No matter how many reviews are associated with a paper, there will be a fixed fee for the full set.&lt;/p>
&lt;p>Peer review infrastructure will arrive at Crossref in one month, and we are excited to engage our members who want to assign DOIs to peer reviews or migrate previously registered review content to the new schema. A special thanks to the members so far who have given feedback and advice to develop the schema: BMC, The BMJ, Copernicus, eLife, PeerJ, and Publons.&lt;/p>
&lt;p>Please contact our &lt;a href="mailto:feedback@crossref.org">membership specialist&lt;/a> if you&amp;rsquo;d like to know more.&lt;/p></description></item><item><title>More metadata for machines-citations, relations, and preprints arrive in the REST API</title><link>https://www.crossref.org/blog/more-metadata-for-machines-citations-relations-and-preprints-arrive-in-the-rest-api/</link><pubDate>Mon, 11 Sep 2017 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/more-metadata-for-machines-citations-relations-and-preprints-arrive-in-the-rest-api/</guid><description>&lt;p>Over the past few months we have been adding to the metadata and functionality of our &lt;a href="https://api.crossref.org" target="_blank">REST API&lt;/a>, Crossref’s public machine interface for the metadata of all 90 million+ registered content items. Much of the work focused on a review and upgrade of the API’s code and architecture in order to better support its rapidly growing usage. But we have also extended the &lt;a href="https://github.com/CrossRef/rest-api-doc/blob/master/api_format.md" target="_blank">types of metadata&lt;/a> that the API can deliver.&lt;/p>
&lt;p>One of the biggest changes is that &lt;a href="https://www.crossref.org/services/reference-linking/">references&lt;/a> are now available if the publisher has made them public (a simple &lt;a href="mailto:support@crossref.org">email instruction&lt;/a> to us). Currently 45% of all publications with deposited references are now accessible. For example:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://doi.org/10.1073/pnas.1402289111" target="_blank">This article&lt;/a> studying fluid ejection from animals has 55 references and they are all in the &lt;a href="https://api.crossref.org/v1/works/10.1073/pnas.1402289111" target="_blank">metadata here&lt;/a>. You can also see that the article has an &lt;code>is-referenced-by&lt;/code> count of 6.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://doi.org/10.1371/journal.pone.0070585" target="_blank">This article&lt;/a> exploring whether people bitten by their cat are more likely to develop depression has 142 references and is &lt;a href="https://api.crossref.org/v1/works/10.1371/journal.pone.0070585" target="_blank">referenced by 12&lt;/a>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>We recently announced that we would be &lt;a href="https://doi.org/10.64000/5tcfp-vf140" target="_blank">accepting preprints&lt;/a>, and the metadata for 15,000 preprints &lt;a href="https://api.crossref.org/v1/works?facet=type-name:*&amp;amp;rows=0" target="_blank">registered to date&lt;/a> is now in the API, labelled as &lt;code>posted-content&lt;/code>. Over 4,000 have been subsequently published in a journal, and the Crossref metadata now links these preprints to their respective articles (and vice versa). For example &lt;a href="https://doi.org/10.1101/098947" target="_blank">this article&lt;/a> in Biorxiv has since been &lt;a href="http://dx.doi.org/10.1093/molbev/msx056" target="_blank">published in a journal&lt;/a>, and this relationship is recorded in its &lt;a href="https://api.crossref.org/v1/works/10.1101/098947" target="_blank">metadata&lt;/a> as &lt;code>is-preprint-of&lt;/code>.&lt;/p>
&lt;h3 id="also-new-to-the-api">Also new to the API:&lt;/h3>
&lt;ul>
&lt;li>
&lt;p>Cited-by counts - the number of times each work has been referenced by other content registered with us. Look for &lt;a href="https://api.crossref.org/v1/works/10.1063/1.4870777" target="_blank">&lt;code>is-referenced-by-count&lt;/code>&lt;/a> within a record.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://doi.org/10.1038/171737a0" target="_blank">This article&lt;/a> from 1953 about a fairly notable discovery has been &lt;a href="https://api.crossref.org/v1/works/10.1038/171737a0" target="_blank">cited 4832 times&lt;/a>, but the two &lt;a href="https://api.crossref.org/v1/works/10.1038/227680a0" target="_blank">most&lt;/a> &lt;a href="https://api.crossref.org/v1/works/10.1016/0003-2697%2876%2990527-3" target="_blank">cited&lt;/a> articles both have over 100,000 citations and thousands have been cited more than Watson and Crick.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Abstracts for over &lt;a href="https://api.crossref.org/v1/works?query=has-abstract:true&amp;amp;rows=0" target="_blank">1 million works&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Similarity Check URLs&amp;ndash;the ones that Turnitin crawl to add content to the database&amp;ndash;are now showing so that participating publishers can check that they are including them in their &lt;a href="https://api.crossref.org/v1/works/10.5740/jaoacint.10-223" target="_blank">metadata deposits&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Subject categories have been added for an additional 7000 journal titles, taking the total number of classified titles to ~45,000.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Are you already using our Metadata APIs for your system or project? We’re always keen to &lt;a href="mailto:feedback@crossref.org">hear new use cases and happy to answer any questions&lt;/a>.&lt;/p>
&lt;p>&lt;em>You may need to install a JSON viewer extension in your browser to render API queries in a human-friendly way.&lt;/em>&lt;/p></description></item><item><title>Using the Crossref REST API. Part 5 (with OpenCitations)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-5-with-opencitations/</link><pubDate>Sun, 10 Sep 2017 00:00:00 +0000</pubDate><author>Christine Cormack Wood</author><discourseUsername>ccormackwood</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-5-with-opencitations/</guid><description>&lt;p>As part of our &lt;a href="https://www.crossref.org/categories/api-case-study/">blog post series on the Crossref REST API&lt;/a>, we talked to Silvio Peroni and David Shotton of OpenCitations (OC) about the work they’re doing, and how they’re using the Crossref REST API as part of their workflow.&lt;/p>
&lt;p>&lt;strong>Introducing OpenCitations&lt;/strong>&lt;/p>
&lt;p>OpenCitations employs Semantic Web technologies to create an open repository of the citation data that publishers have made available. This repository, called the OpenCitations Corpus (OCC), contains RDF-based scholarly citation data that are made freely available so that others may use and build upon them. All the resources published by OC – namely the data within the OCC, the ontologies describing the data, and the software developed to build the OCC – are available to the public with open licenses.&lt;/p>
&lt;p>&lt;strong>What problem is your service trying to solve?&lt;/strong>&lt;/p>
&lt;p>OC was started to address the lack of RDF-based open citation data. To our knowledge, when the project formally started with Jisc funding in 2010 the prototype OCC was the first RDF-based dataset of open citation data.&lt;/p>
&lt;p>We collect accurate scholarly citation data derived from bibliographic references harvested from the scholarly literature, so as to make them available under a Creative Commons public domain dedication (CC0) by means of Semantic Web technologies, thus making them findable, accessible, interoperable, and re-usable, as well as structured, separable, and open.&lt;/p>
&lt;p>OCC citation data are described using standard and/or well-known vocabularies, including the&lt;a href="http://www.sparontologies.net/" target="_blank"> SPAR Ontologies&lt;/a> ,&lt;a href="https://www.w3.org/TR/prov-o/" target="_blank"> PROV-O&lt;/a>, the&lt;a href="https://www.w3.org/TR/vocab-dcat" target="_blank"> Data Catalog Vocabulary,&lt;/a> and&lt;a href="https://www.w3.org/TR/void" target="_blank"> VoID&lt;/a>. The use of such vocabulary is described in the&lt;a href="https://dx.doi.org/10.6084/m9.figshare.3443876" target="_blank"> OCC metadata document&lt;/a>, and is implemented by means of the&lt;a href="https://w3id.org/oc/ontology" target="_blank"> OpenCitations Ontology&lt;/a> (OCO).&lt;/p>
&lt;p>The OCC resources are made available and accessible in different ways, so as to facilitate their reuse in different contexts:&lt;a href="http://opencitations.net/download" target="_blank"> as monthly dumps&lt;/a>, via the&lt;a href="https://w3id.org/oc/sparql" target="_blank"> SPARQL&lt;/a> endpoint, and by accessing them directly by means of the HTTP URIs of the stored resources (via content negotiation;&lt;a href="https://w3id.org/oc/corpus/br/1" target="_blank"> example&lt;/a>)&lt;/p>
&lt;p>&lt;strong>Can you tell us how you are using the Crossref Metadata API at OpenCitations?&lt;/strong>&lt;/p>
&lt;p>At present, basic citation information is retrieved from PubMed Central, and the Crossref API is then used to retrieve additional metadata describing the citing and cited articles, and to disambiguate bibliographic resources and agents by means of the identifiers retrieved (e.g., DOI, ISSN, ISBN, URL, and Crossref member URL). In future, we will retrieve full citation data direct from Crossref.&lt;/p>
&lt;p>&lt;strong>What metadata values do you pull from the API?&lt;/strong>&lt;/p>
&lt;p>We pull the titles, subtitles, identifiers (e.g. DOI, ISSN, ISBN, URL, and Crossref member URL), author list, publisher, container resources (issue, volume, journal, book, etc.), publication year and pages.&lt;/p>
&lt;p>&lt;strong>Have you built your own interface to extract this data?&lt;/strong>&lt;/p>
&lt;p>The SPAR Citation Indexer, a.k.a.&lt;a href="https://w3id.org/oc/paper/spacin-demo-ekaw2016.html" target="_blank"> SPACIN&lt;/a>, is a script and a series of Python classes that allow one to process particular JSON files containing the bibliographic reference lists of papers, produced from the PubMed Central API by another script included in the &lt;a href="https://github.com/essepuntato/opencitations" target="_blank">OpenCitations GitHub repository.&lt;/a>&lt;/p>
&lt;p>SPACIN processes such JSON files and retrieves additional metadata information about all the citing and cited articles by querying the Crossref API, among others. Once SPACIN has retrieved all these metadata, RDF resources are created (or reused, if they have been already added in the past) and stored in the file system in JSON-LD format. In addition, they are also uploaded to the OCC triplestore (via the SPARQL UPDATE protocol).&lt;/p>
&lt;p>&lt;strong>How often do you extract/query data?&lt;/strong>&lt;/p>
&lt;p>The entire OpenCitations ingestion workflow is running continuously, processing about half a million citations per month.&lt;/p>
&lt;p>&lt;strong>What do you do with the metadata once it’s pulled from the API?&lt;/strong>&lt;/p>
&lt;p>All the metadata relevant to bibliographic entities are stored by using the&lt;a href="https://dx.doi.org/10.6084/m9.figshare.3443876" target="_blank"> OCC metadata model&lt;/a>. The ontological terms of such metadata model are collected within an ontology called the OpenCitations Ontology (OCO), which includes several terms from the SPAR Ontologies and other vocabularies. In particular, the following six bibliographic entity types occur in the datasets created by SPACIN:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>bibliographic resources (br), class fabio:Expression – resources that either cite or are cited by other bibliographic resources (e.g. journal articles), or that contain such citing/cited resources (e.g. journals);&lt;/p>
&lt;/li>
&lt;li>
&lt;p>resource embodiments (re), class fabio:Manifestation – details of the physical or digital forms in which the bibliographic resources are made available by their publishers;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>bibliographic entries (be), class biro:BibliographicReference – literal textual bibliographic entries occurring in the reference lists of bibliographic resources;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>responsible agents (ra), class foaf:Agent – names of agents having certain roles with respect to the bibliographic resources (i.e. names of authors, editors, publishers, etc.);&lt;/p>
&lt;/li>
&lt;li>
&lt;p>agent roles (ar), class pro:RoleInTime – roles held by agents with respect to the bibliographic resources (e.g. author, editor, publisher);&lt;/p>
&lt;/li>
&lt;li>
&lt;p>identifiers (id), class datacite:Identifier – external identifiers (e.g. DOI, ORCID, PubMedID) associated to bibliographic resources and agents.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Do you have plans to enhance your metadata input?&lt;/strong>&lt;/p>
&lt;p>We already handle additional information, such as ORCIDs, that are extracted by means of the ORCID API applied to the citing and cited articles included in the OCC. In addition, we are developing scripts in order to use all the new citation data Crossref now makes available as consequence of the Initiative for Open Citations (I4OC).&lt;/p>
&lt;p>&lt;strong>What are the future plans for OpenCitations?&lt;/strong>&lt;/p>
&lt;p>With funding received from the Alfred P. Sloan Foundation, we will shortly extend the current infrastructure and the rate of data ingest. Our immediate goal is to increment the daily ingestion of citation data from about half a million citations per month to about half a million citations per day. In addition, we plan to analyse the OCC so as to understand the quality of its current data, and to develop new user interfaces, including graph visualizations of citation networks, that will expand the means whereby users can interact with the OpenCitations data.&lt;/p>
&lt;p>&lt;strong>What else would you like to see our REST API offer?&lt;/strong>&lt;/p>
&lt;p>Categorising articles/journals/any bibliographic resources according to their main discipline (Computer Science, Biology, etc.) and, eventually, by means of subject terms and/or keywords. Additionally, provision of authors&amp;rsquo; institutional affiliations and funder information would be extremely valuable.&lt;/p>
&lt;p>Thank you Silvio and David!&lt;/p>
&lt;p>If you are keen to share what you’re doing with the our Metadata APIs, contact &lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a> and share your story.&lt;/p></description></item><item><title>LIVE17 in Singapore is taking shape!</title><link>https://www.crossref.org/blog/live17-in-singapore-is-taking-shape/</link><pubDate>Tue, 29 Aug 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/live17-in-singapore-is-taking-shape/</guid><description>&lt;p>Our annual meeting on 14th and 15th November, &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2017">LIVE17&lt;/a> is shaping up nicely with an exciting line-up of respected speakers talking around the theme of “Metadata + Infrastructure + Relations = Context”, with each half day covering some element of the main theme.&lt;/p>
&lt;ul>
&lt;li>Day one, AM: &lt;em>Metadata enables connections&lt;/em>&lt;/li>
&lt;li>Day one, PM: &lt;em>How research and infrastructure is changing&lt;/em>&lt;/li>
&lt;li>Day two, AM: &lt;em>Social challenges in the scholarly community&lt;/em>&lt;/li>
&lt;li>Day two, PM: &lt;em>Who is using your metadata and what are they doing with it?&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>This years updated format means both days will be packed with a mixture of plenary and breakout sessions and interactive activities. A cocktail reception with entertainment will be held in the Grand Marquee on the first evening.&lt;/p>
&lt;p>A comprehensive agenda of the two-day event will be available shortly, but in the meantime we’ve provided a few talk teasers from six of our plenary speakers to whet your appetite:&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th style="text-align: left">Speaker&lt;/th>
&lt;th style="text-align: left">Title and organisation&lt;/th>
&lt;th style="text-align: left">Talk title&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td style="text-align: left">&lt;a href="#TB">Theodora Bloom&lt;/a>&lt;/td>
&lt;td style="text-align: left">Executive Editor, The BMJ&lt;/td>
&lt;td style="text-align: left">Preparing to handle dynamic scholarly content: Are we ready?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;a href="#CG">Casey Green&lt;/a>&lt;/td>
&lt;td style="text-align: left">Assistant Professor, Perelman School of Medicine, University of Pennsylvania&lt;/td>
&lt;td style="text-align: left">Research and literature parasites in a culture of sharing.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;a href="#LT">Leonid Teytelman&lt;/a>&lt;/td>
&lt;td style="text-align: left">Co-founder and CEO, Protocols.io&lt;/td>
&lt;td style="text-align: left">A call to reduce random collisions with information; we can automatically connect scientists to the knowledge that they need.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;a href="#NB">Nicholas Bailey&lt;/a>&lt;/td>
&lt;td style="text-align: left">Data Science Team, Royal Society of Chemistry&lt;/td>
&lt;td style="text-align: left">What does data science tell us about social challenges in scholarly publishing?&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;a href="#MV">Miguel Escobar Varela&lt;/a>&lt;/td>
&lt;td style="text-align: left">Assistant Professor of Theatre Studies, National University of Singapore&lt;/td>
&lt;td style="text-align: left">Digital Humanities in Singapore: some thoughts for the future.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;a href="#KW">Kuansan Wang&lt;/a>&lt;/td>
&lt;td style="text-align: left">Managing Director, Microsoft Research Outreach&lt;/td>
&lt;td style="text-align: left">Democratize access to scholarly knowledge with AI.&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td style="text-align: left">&lt;br>&lt;br>&lt;/td>
&lt;td>&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>&lt;a id="TB">&lt;/a>&lt;/p>
&lt;h3 id="theodora-bloom---preparing-to-handle-dynamic-scholarly-content-are-we-ready">Theodora Bloom - Preparing to handle dynamic scholarly content: Are we ready?&lt;/h3>
&lt;p>Historically, journals might expect a few &amp;lsquo;Letters to the Editor&amp;quot; to discuss &amp;lsquo;matters arising&amp;rsquo; after an article was published. But scholarly communications are becoming much more dynamic, with versions posted as &amp;lsquo;preprints&amp;rsquo; before publication, corrections after publication, and potentially multiple versions of the same study appearing at different times. How should we handle this changing landscape for the benefits of researchers and consumers of the literature?&lt;/p>
&lt;h3 id="about-theodora-bloom">About Theodora Bloom&lt;/h3>
&lt;p>Theodora Bloom has been executive editor of The BMJ since June 2014. She has a PhD in developmental cell biology from the University of Cambridge and worked as a postdoctoral fellow at Harvard Medical School. She moved into publishing as an editor on the biology team at Nature, and in 1992 joined the fledgling journal Current Biology. After a number of years helping to develop Current Biology and its siblings Structure and Chemistry &amp;amp; Biology, Theo joined the beginnings of the open access movement. As the founding editor of Genome Biology she was closely involved in the birth of the commercial open access publisher BioMed Central. She joined the non-profit open access publisher Public Library of Science (PLOS) in 2008, first as chief editor of PLOS Biology and later as biology editorial director. She took the lead for PLOS on issues around data access and availability and launched PLOS&amp;rsquo;s data sharing policy. At The BMJ she is responsible for operations, delivering the journal online and in print.&lt;/p>
&lt;hr>
&lt;p>&lt;a id="CG">&lt;/a>&lt;/p>
&lt;h3 id="casey-greene---research-and-literature-parasites-in-a-culture-of-sharing">Casey Greene - Research and literature parasites in a culture of sharing.&lt;/h3>
&lt;p>Casey has been a strong champion of preprints and will discuss his efforts in this area including resources that he has shared to help advance the spread of preprints not only amongst researchers but publishers. These include letters to respond to journals that invite reviews but have unclear preprint policies. His lab members have also analyzed the licensing of preprints and the coverage of literature provided by the pirate repository, Sci-Hub. His talk will touch on each of these areas, and also a discussion of the Research Parasite and Symbiont Awards, which aim to advance recognition for data sharing and reuse.&lt;/p>
&lt;h3 id="about-casey-greene">About Casey Greene&lt;/h3>
&lt;p>Casey is an assistant professor in the Department of Systems Pharmacology and Translational Therapeutics in the Perelman School of Medicine at the University of Pennsylvania and the director of the Childhood Cancer Data Lab for Alex&amp;rsquo;s Lemonade Stand Foundation. His lab develops deep learning methods that integrate distinct large-scale datasets to extract the rich and intrinsic information embedded in such integrated data. Before starting the Integrative Genomics Lab in 2012, Casey earned his PhD for his study of gene-gene interactions in the field of computational genetics from Dartmouth College in 2009 and moved to the Lewis-Sigler Institute for Integrative Genomics at Princeton University where he worked as a postdoctoral fellow from 2009-2012. The overarching theme of his work has been the development and evaluation of methods that acknowledge the emergent complexity of biological systems.&lt;/p>
&lt;hr>
&lt;p>&lt;a id="LT">&lt;/a>&lt;/p>
&lt;h3 id="leonid-teytelman---call-to-reduce-random-collisions-with-information-we-can-automatically-connect-scientists-to-the-knowledge-that-they-need">Leonid Teytelman - Call to reduce random collisions with information; we can automatically connect scientists to the knowledge that they need.&lt;/h3>
&lt;p>Every scientist knows that virtually all papers, including their own, contain mistakes. A key motivation for creating protocols.io was to make it possible to share corrections and optimizations of published research protocols and to have this information automatically reach the scientists using these methods. While pushing relevant knowledge to the users is built into all aspects of protocols.io, we can do a lot more. If publishers, Crossref, and reference management platforms collaborate, we can move beyond the search towards a point where important information automatically reaches the appropriate researchers.&lt;/p>
&lt;h3 id="about-leonid-lenny-teytelman">About Leonid (Lenny) Teytelman&lt;/h3>
&lt;p>Lenny is the Co-founder and CEO of protocols.io, an open access platform to share and discover research protocols. It enables scientists to make, exchange, improve and discuss protocols and it is poised to dramatically accelerate and to increase reproducibility of scientific research. Lenny did his graduate studies at UC Berkeley and finished his postdoctoral research at MIT. Lenny has a strong passion for sharing science and improving research efficiency through technology.&lt;/p>
&lt;hr>
&lt;p>&lt;a id="NB">&lt;/a>&lt;/p>
&lt;h3 id="nicholas-bailey---what-does-data-science-tell-us-about-social-challenges-in-scholarly-publishing">Nicholas Bailey - What does data science tell us about social challenges in scholarly publishing?&lt;/h3>
&lt;p>How can we facilitate the fair advancement and dissemination of knowledge? The risks and shortcomings within scholarly publishing are always under scrutiny, but some problems don’t seem to be going away. What should we do about obvious gender inequality within some disciplines, or the weight given to Impact Factor as a measure of quality? The Royal Society of Chemistry has a royal charter to publish scientific content in a way that serves the public interest, and as such its Data Science team devotes part of its time to analysing the social challenges facing scholarly publishing. In this talk, Nicholas Bailey will share some examples.&lt;/p>
&lt;h3 id="about-nicholas-bailey">About Nicholas Bailey&lt;/h3>
&lt;p>Nicholas Bailey is a web analytics expert, a swimmer, a father, and a data geek. After spending several years in the Marketing team at the Royal Society of Chemistry, ultimately managing the database marketing team, he moved out of Marketing and into the Data Science team in order to work more closely with agile teams of developers and strengthen his data analysis and coding skills. Nicholas has a lot to say about measuring digital products, machine learning, and the potential of data science to contribute to positive social outcomes.&lt;/p>
&lt;hr>
&lt;p>&lt;a id="MV">&lt;/a>&lt;/p>
&lt;h3 id="miguel-escobar-varela---digital-humanities-in-singapore-some-thoughts-for-the-future">Miguel Escobar Varela - Digital Humanities in Singapore: some thoughts for the future.&lt;/h3>
&lt;p>Singapore-based researchers from a variety of disciplines are currently using digital tools to study the humanities, in areas as diverse as history and dance studies. This talk will present an overview of current projects and suggest a path for the growth of this field in Singapore. It argues that the future of DH requires better inter-institutional infrastructure for long-term data storage, clearer protocols for interoperability and more freely available and reusable datasets. This is easier said than done, but looking at the examples of other countries can provide some sources for inspiration.&lt;/p>
&lt;h3 id="about-miguel-escobar-varela">About Miguel Escobar Varela&lt;/h3>
&lt;p>Miguel Escobar Varela is an assistant professor in the University Scholars Programme (USP) at the National University of Singapore. At the USP, Dr. Varela teaches in the domain of Humanities and Social Sciences. He is a theatre researcher and software programmer. His interests are in teaching theatre through interactive websites and applying computational methods to study performances in Singapore and Indonesia.&lt;/p>
&lt;hr>
&lt;p>&lt;a id="KW">&lt;/a>&lt;/p>
&lt;h3 id="kuansan-wang---democratize-access-to-scholarly-knowledge-with-ai">Kuansan Wang - Democratize access to scholarly knowledge with AI.&lt;/h3>
&lt;p>With the advent of big data and cloud computing, artificial intelligence has made tremendous strides in recent years. Not only has machine surpassed humans in playing the chess game Go and Jeopardy game shows, reports of superhuman performance in other highly cognitive tasks, ranging from image classification to speech recognition, also abound. Have we reached a stage where the advancements in AI can help tackle a problem in scientific pursuits, namely, the access and the dissemination of scholarly knowledge? This talk describes Microsoft Academic, a project inside Microsoft Research that uses the state-of-the-art AI in natural language understanding and knowledge acquisition to harvest knowledge from scholarly communications and make it available on the web. The talk will describe the technical challenges that have been overcome, the world-wide research collaborations that have since been enabled, and discuss the potentials of making knowledge more readily available to the mass.&lt;/p>
&lt;h3 id="about-kuansan-wang">About Kuansan Wang&lt;/h3>
&lt;p>Kuansan Wang is the Managing Director at Microsoft Research Outreach (MSR), where he started in March 1998 as a Researcher in the speech technology group working. In 2004, he moved to the speech product group and became a software architect where he helped create and ship the product Microsoft Speech Server, which is still powering the corporate call center for Microsoft. Since September 2007, he has been back at MSR, joining the newly founded Internet Service Research Center with a mission to revolutionize online services and make Web more intelligent. In March 2016, he took on an additional role as a Managing Director of MSR Outreach, an organisation with the mission to serve the research community.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/">Read more about our annual events&lt;/a>&lt;br>
&lt;a href="https://www.eventbrite.com/e/crossref-live17-singapore-november-14-15-crlive17-registration-34604951341?ref=ebtnebregn" target="_blank">Register now for LIVE17&lt;/a>&lt;/p></description></item><item><title>Scenario planning for our future</title><link>https://www.crossref.org/blog/scenario-planning-for-our-future/</link><pubDate>Mon, 28 Aug 2017 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/scenario-planning-for-our-future/</guid><description>&lt;p>Crossref is governed by a &lt;a href="https://www.crossref.org/board-and-governance/">board of directors&lt;/a> that meets in person three times a year in March, July and November. At the July meeting the board typically spends a significant amount of time on strategic planning in addition to its usual activities such as financial oversight, approving investment in new services based on staff and committee recommendations, reviewing and approving policies and fees for new and existing services and generally making sure Crossref is healthy and well run.&lt;/p>
&lt;p>This year we worked with a facilitator to look farther into the future than normal using a technique called &lt;a href="https://en.wikipedia.org/wiki/Scenario_planning" target="_blank">scenario planning&lt;/a> to map out “strategic agendas” for the next five years. Scenario-based strategic planning doesn’t try to predict the future but allows us to be flexible in planning by looking at a range of different possible eventualities. This is particularly useful for Crossref because scholarly research and communications is changing rapidly and we operate in a very complex environment.&lt;/p>
&lt;p>To prepare for the meeting our facilitator, Susan Stickely, prepared 12 “critical uncertainties” - impactful issues that could go either way and that will affect how Crossref works, its mission and even whether it needs to exist. To develop the critical uncertainties Susan interviewed Crossref staff, board members, general members and scholarly communications community influencers and we held a preparatory group exercise at the March board meeting. The critical uncertainties are:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Scholarly Communication Landscape&lt;/strong>: Increasing diversity? Or publishing disintermediated?&lt;/li>
&lt;li>&lt;strong>Machine Learning / Artificial Intelligence&lt;/strong>: Supporting? Or obsoleting the researcher and publishers?&lt;/li>
&lt;li>&lt;strong>Policy and Regulation&lt;/strong>: Limiting? Or visionary?&lt;/li>
&lt;li>&lt;strong>Financing of Scholarly Communication&lt;/strong>: Shrinking Pool? Or Expanding Pool?&lt;/li>
&lt;li>&lt;strong>Rise of Pre-print, New Content Sources&lt;/strong>: New, non-traditional? Or De-formalizing?&lt;/li>
&lt;li>&lt;strong>Tracking and Privacy&lt;/strong>: Increased Privacy? Or Loss of Privacy?&lt;/li>
&lt;li>&lt;strong>Cybersecurity&lt;/strong>: Secure? Or Vulnerable, Insecure?&lt;/li>
&lt;li>&lt;strong>Publisher Sustainability&lt;/strong>: Slow Progress? Or Fast Progress?&lt;/li>
&lt;li>&lt;strong>Impact of Open&lt;/strong>: Open or Closed? Or Slow to Change?&lt;/li>
&lt;li>&lt;strong>Source of Prestige and Recognition&lt;/strong>: New Source? Or Publisher, Institution?&lt;/li>
&lt;li>&lt;strong>Quality and Accuracy of Content&lt;/strong>: High? Or Low?&lt;/li>
&lt;li>&lt;strong>Geopolitical Stability and Stance&lt;/strong>: Stable, Unified? Or Unstable, Fragmented&lt;/li>
&lt;/ol>
&lt;p>In addition, from the interviews Susan was able to summarize Crossref’s distinctive competencies as:&lt;/p>
&lt;ul>
&lt;li>Having a reputation as a trusted, neutral one-stop source of metadata and services&lt;/li>
&lt;li>Managing scholarly infrastructure with technical knowledge and innovation&lt;/li>
&lt;li>Convening and facilitating scholarly communications community collaboration&lt;/li>
&lt;/ul>
&lt;p>To be successful Crossref will need to continue to invest in, apply, and evolve these distinctive competencies and strategic dilemmas and challenges.&lt;/p>
&lt;p>Over a day and half of discussions and breakout sessions the board and staff drew up a number of scenarios and created a draft strategic agenda for Crossref. Over the next couple of months we’ll be working on refining the strategic agenda and will be presenting the results to members in the next couple of months.&lt;/p>
&lt;p>One theme that emerged is for Crossref to engage more with funders and build on the work with done with them in creating the Crossref Funder Registry. We have started a new &lt;a href="https://www.crossref.org/working-groups/funders">Funder Advisory Group&lt;/a> and, among other things, are working with them on a prototype for a new registry of grant identifiers.&lt;/p>
&lt;p>In the regular board session the board approved three recommendations from the &lt;a href="https://www.crossref.org/committees/membership-and-fees/">Membership and Fees Committee&lt;/a>:&lt;/p>
&lt;ol>
&lt;li>To approve the recommendations with respect to volume discounts for current deposits of posted content (i.e. preprints).&lt;/li>
&lt;li>To create a new “peer review report” record type with a specific metadata schema and a bundled fee of $1.25 to be charged for a content item and all the reports associated with it.&lt;/li>
&lt;li>To update the metadata delivery offering to have a single agreement that covers all metadata APIs/delivery routes, to adopt a single (updated) fee structure, and to remove case-by-case opt-outs for metadata.&lt;/li>
&lt;/ol>
&lt;p>Item number 3 involves a number of big changes - for example the removal of the case-by-case opt outs requires a change to the main Membership Agreement - so we will be sending out more information to members and Affiliates in September and October about the changes and our implementation plans.&lt;/p>
&lt;p>You can see the &lt;a href="https://www.crossref.org/board-and-governance/#motions">full history of the motions from every Board meeting&lt;/a> on our website.&lt;/p>
&lt;p>Another major issue that the board discussed is the upcoming &lt;a href="https://www.crossref.org/board-and-governance/elections/">election for the board of directors&lt;/a>. In order to broaden participation and be inclusive there was a new process this year. The Nominating Committee put out a call for expressions of interest for candidates to be on the slate for the election. We had a great response and there were 25 expressions of interest reviewed by the Nominating Committee who came up with a slate of nine excellent candidates for the six seats up for election. This is the first time that there are more candidates than seats on the slate so it’s particularly important for members to vote this year. See the recent &lt;a href="https://doi.org/10.64000/68s5b-35b32" target="_blank">blog post about the election process and the slate&lt;/a> for more details.&lt;/p>
&lt;p>The next board meeting is in November in conjunction with &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2017">Crossref LIVE17 in Singapore&lt;/a>.&lt;/p></description></item><item><title>Coming to a venue near you</title><link>https://www.crossref.org/blog/coming-to-a-venue-near-you/</link><pubDate>Thu, 24 Aug 2017 00:00:00 +0000</pubDate><author>Vanessa Fairhurst</author><guid>https://www.crossref.org/blog/coming-to-a-venue-near-you/</guid><description>&lt;p>First of all – hello! I’m Vanessa. I’m fairly new to Crossref, having just joined our outreach team a few weeks ago. I previously worked in International Development, enabling individuals and institutions in Africa, Asia and Latin America to access cutting edge scholarly research and knowledge, supporting national development and transforming lives.&lt;/p>
&lt;p>A firm belief in the importance of connecting research and information around the world led me to Crossref where my role of International Community Outreach Manager connects me with a range of different people working across diverse disciplines and sectors. I’ll be supporting the coordination of our local LIVE events and helping to set up an ambassador program (more information on this coming soon) to deepen regional connections around the globe. You can read more about myself and my colleagues at Crossref on our &lt;a href="https://www.crossref.org/people/">People&lt;/a> page.&lt;/p>
&lt;p>As Crossref membership continues to grow globally, it becomes increasingly important for us to look at new ways to engage with our international membership base.&lt;/p>
&lt;p>You may have heard about our LIVE local events, or even attended one in person before. These are free-to-attend, one day, regional events (local to you), providing a tailored program of activities which include information on the key concepts of Crossref, the services we offer and our future plans.&lt;/p>
&lt;p>In the past year we have held LIVE local events in Brazil, Beijing, Boston and most recently Seoul. We also have a &lt;a href="https://www.eventbrite.com/e/crossref-live-london-tickets-35757538761" target="_blank">London LIVE&lt;/a> event coming up soon. Next year we are aiming to be even more ambitious, hoping to expand our activities to a number of different countries around the world.&lt;/p>
&lt;p>&lt;em>Images left to right, Crossref LIVE participants in Seoul, Crossref LIVE speakers in Brazil, and literature we use at our LIVE events&lt;/em>&lt;/p>
&lt;p>|&lt;img src="https://www.crossref.org/images/blog/live-seoul2-2017.jpg" alt="Participants at Crossref LIVE Seoul" height="250px" width="300px"/>|&lt;img src="https://www.crossref.org/images/blog/live-brazil2-2017.jpg" alt="Speakers at Crossref LIVE Brazil" height="250px" width="300px"/>|&lt;img src="https://www.crossref.org/images/blog/live-literature2.jpg" alt="LIVE literature" height="250px" width="300px"/>|&lt;/p>
&lt;p>When running our LIVE local events, we collaborate with local organisations to ensure they are appropriate, accessible, and applicable to the country context. Members support us by lending their local expertise with regards to venue selection, suggestions for speakers, tailored content, translation of materials and participant enrolment. We collaborate on logistics, content, Crossref speakers and the promotion of the event to our members and the wider community.&lt;/p>
&lt;p>When running our LIVE local events, we collaborate with local organisations to ensure they are appropriate, accessible, and applicable to the country context. Members support us by lending their local expertise with regards to venue selection, suggestions for speakers, tailored content, translation of materials and participant enrollment. We collaborate on logistics, content, Crossref speakers and the promotion of the event to our members and the wider community.&lt;/p>
&lt;p>We will release more information of upcoming regional events in due course, but we are working on the following countries as priorities for 2018-19:&lt;/p>
&lt;ul>
&lt;li>Asia-Pacific: Malaysia, Indonesia, Japan, Taiwan, Australia&lt;/li>
&lt;li>Central Asia: India&lt;/li>
&lt;li>Latin America: Mexico, Colombia, Chile, Brazil&lt;/li>
&lt;li>Middle East: UAE (Dubai or Abu Dhabi)&lt;/li>
&lt;li>Africa: South Africa, Kenya&lt;/li>
&lt;li>Eastern Europe: Turkey, Greece, Bulgaria, Romania, Serbia, Poland&lt;/li>
&lt;li>Western Europe: Germany, Spain, UK&lt;/li>
&lt;li>North America: Canada, USA&lt;/li>
&lt;/ul>
&lt;p>If you are interested in hosting a LIVE local event or have any suggestions for one in your region, then we would love to hear from you. View more information on our &lt;a href="https://www.crossref.org/events/">LIVE locals&lt;/a> page or &lt;a href="mailto:feedback@crossref.org">contact us&lt;/a> to hear more or get involved.&lt;/p>
&lt;hr></description></item><item><title>2017 election slate</title><link>https://www.crossref.org/blog/2017-election-slate/</link><pubDate>Thu, 17 Aug 2017 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/2017-election-slate/</guid><description>&lt;h3 id="slate-of-2017-board-candidates-announced-and-its-going-to-be-exciting">Slate of 2017 board candidates announced, and it’s going to be exciting&lt;/h3>
&lt;p>Crossref is always evolving and the board knows it must evolve with us so we can continue to provide the right kind of services and support for you, as members of the research community.&lt;/p>
&lt;p>This year two things happened for the first time: we used our updated bylaws &lt;a href="https://www.crossref.org/board-and-governance/bylaws/">see article VII, section 2&lt;/a> agreed by the board last year, to allow more candidates than available seats; and secondly, to issue an &lt;a href="https://doi.org/10.64000/a9z2j-c9a52" target="_blank">open call for expressions of interest&lt;/a>. Many members of the current board felt it was vital to move to this more transparent process.&lt;/p>
&lt;p>With Crossref developing new services for new types of members at a rapid pace, it’s an exciting time to be on the board of directors. With 25 expressions of interest it seems we’re not the only ones who think so!&lt;/p>
&lt;p>From these 25 applications, the Nominating Committee has proposed the following nine candidates to fill the six seats open for election to our board of directors:&lt;/p>
&lt;p>&lt;strong>American Institute of Physics (AIP)&lt;/strong>, Jason Wilde, USA&lt;br>
&lt;strong>F1000 Research&lt;/strong>, Liz Allen, UK&lt;br>
&lt;strong>Institute of Electronic and Electrical Engineers (IEEE)&lt;/strong>, Gerry Grenier, USA&lt;br>
&lt;strong>The Institution of Engineering and Technology (IET)&lt;/strong>, Vincent Cassidy, UK&lt;br>
&lt;strong>Massachusetts Institute of Technology Press (MIT Press)&lt;/strong>, Amy Brand, USA&lt;br>
&lt;strong>OpenEdition&lt;/strong>, Marin Dacos, France&lt;br>
&lt;strong>SciELO&lt;/strong>, Abel Packer, Brazil&lt;br>
&lt;strong>SPIE&lt;/strong>, Eric Pepper, USA&lt;br>
&lt;strong>Vilnius Gediminas Technical University Press (VGTU Press)&lt;/strong>, Eleonora Dagiene, Lithuania&lt;br>&lt;/p>
&lt;h3 id="read-the-candidates-organisational-and-personal-statementsboard-and-governanceelections2017-slate">&lt;a href="https://www.crossref.org/board-and-governance/elections/2017-slate">Read the candidates’ organisational and personal statements&lt;/a>&lt;/h3>
&lt;p>&lt;br>Candidates were chosen based on the following criteria:&lt;/p>
&lt;ul>
&lt;li>That board representation should be reflective of membership&lt;br>&lt;/li>
&lt;li>A balance of types and sizes of organisations&lt;br>&lt;/li>
&lt;li>That all committee choices and recommendations were unanimous&lt;br>&lt;/li>
&lt;/ul>
&lt;h2 id="you-can-be-part-of-this-important-process-by-voting-in-the-election">You can be part of this important process, by voting in the election&lt;/h2>
&lt;p>If your organisation is a member of Crossref on September 15 2017, you are eligible to vote when voting opens on September 28 (affiliates, however, are not eligible to vote).&lt;/p>
&lt;h2 id="how-can-you-vote">How can you vote?&lt;/h2>
&lt;p>On September 28, your organisation’s designated voting contact will receive an email with a link to the formal Notice of Meeting and Proxy Form with concise instructions on how to vote. An additional email will be sent with a username and password along with a link to our online voting platform. It is important to make sure your voting contact is up-to-date.&lt;/p>
&lt;h2 id="want-to-add-your-voice">Want to add your voice?&lt;/h2>
&lt;p>We are accepting independent nominations until 7 November 2017. organisations interested in standing as an independent candidate should contact me by this date with the endorsements of ten other Crossref members.&lt;/p>
&lt;p>The election itself will be held at &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2017">LIVE17 Singapore&lt;/a>, our annual meeting, on 14 November 2017. We hope you’ll be there to hear the results.&lt;/p></description></item><item><title>You do want to see how it's made — seeing what goes into altmetrics</title><link>https://www.crossref.org/blog/you-do-want-to-see-how-its-made-seeing-what-goes-into-altmetrics/</link><pubDate>Mon, 14 Aug 2017 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/you-do-want-to-see-how-its-made-seeing-what-goes-into-altmetrics/</guid><description>&lt;p>There&amp;rsquo;s a saying about oil, something along the lines of &amp;ldquo;you really don&amp;rsquo;t want to see how it&amp;rsquo;s made&amp;rdquo;. And whilst I&amp;rsquo;m reluctant to draw too many parallels between the petrochemical industry and scholarly publishing, there are some interesting comparisons to be drawn.&lt;/p>
&lt;p>Oil starts its life deep underground as an amorphous sticky substance. Prospectors must identify oil fields, drill, extract the oil and refine it. It finds its way into things as diverse as aspirin, paint and hammocks. And as I lie in my hammock watching paint dry, I&amp;rsquo;m curious to know how crude oil made its way into the aspirin that I’ve taken for the headache brought on by the paint fumes. Whilst it would be better if I did know how these things were made, not knowing doesn&amp;rsquo;t impair the efficacy of my aspirin.&lt;/p>
&lt;p>Altmetrics start life deep inside a number of systems. Data buried in countless blogs, social media and web platforms must be identified, extracted and refined before it can be used in products like impact assessments, prompts to engagement, and even tenure decisions. But there the similarity ends. Like the benzene in my aspirin, the data that goes into my favourite metric has come a long way from its origins. But that doesn&amp;rsquo;t mean that I shouldn&amp;rsquo;t know how it was made. In fact, knowing what went into it can help me reason about it, explain it and even improve it.&lt;/p>
&lt;h3 id="heavy-industry-or-backyard-refinery">Heavy industry or backyard refinery?&lt;/h3>
&lt;p>When you head out to fill your car, you buy fuel from a company that probably did the whole job itself. It found the crude oil, extracted it, refined it, transported it and pumped it into your car. Of course there are exceptions, but a lot of fuel is made by vertically integrated companies who do the whole job. And whilst there are research scientists who brew up special batches for one-off pieces of research, if you wanted to make a batch of fuel for yourself you&amp;rsquo;d have to set up your own back-yard fractional distillation column.&lt;/p>
&lt;p>Because the collection of a huge amount of data must be boiled down into altmetrics, organisations who want to produce these metrics have a big job to do. They must find data sources, retrieve the data, process it and produce the end product. The foundation of altmetrics is the measurement of impact, and whilst the intermediary data is very interesting, the ultimate goal of a metric is the end product. If you wanted to make a new metric you&amp;rsquo;d have two choices: set up an oil refinery (i.e. build a whole new system, complete with processing pipeline) or a back-yard still (a one-off research item). Either option involves going out and querying different systems, processing the data and producing an output.&lt;/p>
&lt;p>Being able to demonstrate the provenance of a given measurement is important because no measurement is perfect. It&amp;rsquo;s impossible to query every single extant source out there. And even if you could, it would be impossible to prove that you had. And even then, the process of refinement isn&amp;rsquo;t always faultless. Every measurement out there has a story behind it, and being able to tell that story is important when using the measurement for something important. Data sources and algorithms change over time, and comparing a year-old measurement to one made today might be difficult without knowing what underlying observations went into it. A solution to this is complete transparency about the source data, how it was processed, and how it relates to the output.&lt;/p>
&lt;h3 id="underlying-data">Underlying data&lt;/h3>
&lt;p>This is where Crossref comes in. It turns out that the underlying data that goes into altmetrics is just our kind of thing. As the DOI Registration Agency for scholarly literature, it&amp;rsquo;s our job to work with publishers to keep track of everything that&amp;rsquo;s published, assign DOIs and be the central collection and storage point for metadata and links. Examples of links stored in Crossref are between articles and funders, clinical trial numbers, preprints, datasets etc. With the Event Data project, we are now collecting links between places on the web and our registered content when they&amp;rsquo;re made via DOIs or article landing pages.&lt;/p>
&lt;p>This data has wider use than just than altmetrics. For example, an author might want to know over what time period a link to their article was included in Wikipedia, and which edit to the article was responsible for removing it and why. Or, in these days of &amp;ldquo;fake news&amp;rdquo;, someone may want to know everywhere on Twitter that a particular study is referenced so they can engage in conversation.&lt;/p>
&lt;p>Whilst the field of altmetrics was the starting point for this project, our goal isn’t to provide any kind of metric. Instead, we provide a stream of Events that occurred concerning a given piece of registered content with a DOI. If you want to build a metric out of it, you&amp;rsquo;re welcome to. There are a million different things you could build out of the data, and each will have a different methodology. By providing this underlying data set, we hope we&amp;rsquo;ve found the right level of abstraction to enable people to build a wide range of things.&lt;/p>
&lt;p>Every different end-product will use different data and use different algorithms. By providing an open dataset at the right level of granularity, we allow the producers of these end-products to say exactly which input data they were working with. By making the data open, we allow anyone else to duplicate the data if they wish.&lt;/p>
&lt;h3 id="sticky-mess">Sticky mess&lt;/h3>
&lt;img src="https://www.crossref.org/images/blog/2017/refinery.png" style="float: right">
&lt;p>To finish, let me return to the sticky mess of the distillation column. We identify sources (websites, APIs and RSS feeds). We visit each one, and collect data. We process that data into Events. And we provide Events via an API. At each stage of processing, we make the data open:&lt;/p>
&lt;ul>
&lt;li>The Artifact Registry lists all of the sources, RSS feeds and domains we query.&lt;/li>
&lt;li>The Evidence Registry lists which sites we visited, what input we got, what version of each Artifact was used, and which Events were produced.&lt;/li>
&lt;li>The Evidence Log describes exactly what every part of the system did, including if it ran into problems along the way.&lt;/li>
&lt;li>The Events link back to the Evidence so you can trace exactly what activity led up to the Event.&lt;/li>
&lt;li>All the code is open source and the version is linked in the Evidence Record, so you can see precisely which algorithms were used to generate a given Event.&lt;/li>
&lt;li>Anyone using the Data can link back to Events, which in turn link back to their Evidence.&lt;/li>
&lt;/ul>
&lt;p>The end-product, Events, can be used to answer altmetrics-y questions like &amp;ldquo;who tweeted my article?&amp;rdquo;. But the layers below that can be put to a range of other uses. For example:&lt;/p>
&lt;ul>
&lt;li>&amp;ldquo;Why does publisher X have a lower Twitter count?&amp;rdquo;. The Evidence Logs might show that they tend to block bots from their site, preventing data from being collected.&lt;/li>
&lt;li>&amp;ldquo;Why did their Twitter count rise?&amp;rdquo;. The Evidence Logs might show that they stopped blocking bots.&lt;/li>
&lt;li>&amp;ldquo;What does Crossref think the DOI is for landing page X?&amp;rdquo;. A search of the Evidence Logs might show that the Event Data system visited the page on a given date and decided that it corresponded to DOI Y.&lt;/li>
&lt;li>&amp;ldquo;Which domains hold DOI landing pages?&amp;rdquo;. The &amp;ldquo;Domains&amp;rdquo; Artifact will show the domains that Event Data looked at, and the Evidence Logs will show which versions were used over time.&lt;/li>
&lt;/ul>
&lt;p>By producing not only Events, but being completely transparent about the refinement process, we hope that people can build things beyond traditional altmetrics, and also make use of the intermediary products as well. And by using open licenses, we allow reuse of the data.&lt;/p>
&lt;h3 id="see-you-in-toronto">See you in Toronto!&lt;/h3>
&lt;p>There&amp;rsquo;s so much more to say but I&amp;rsquo;ve run out of ink. To find out more, come to &lt;a href="https://www.altmetric.com/events/" target="_blank">4:AM Altmetrics Conference&lt;/a>! I&amp;rsquo;ll be speaking at the conference in Session 10 on the 28th. I&amp;rsquo;ll also be at the Altmetrics Workshop on the 26th. Stacy Konkiel and I are hosting the Hackathon on the 29th, where you can get your hands on the data. See you there!&lt;/p>
&lt;p>This blog post was originally posted on the &lt;a href="https://web.archive.org/web/20170729190940/http://altmetricsconference.com/category/blog/" target="_blank">4:AM Altmetrics Conference Blog&lt;/a>.&lt;/p></description></item><item><title>Using the Crossref REST API. Part 4 (with CLA)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-4-with-cla/</link><pubDate>Tue, 25 Jul 2017 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-4-with-cla/</guid><description>&lt;p>As a follow-up to our &lt;a href="https://www.crossref.org/categories/api-case-study/">blog posts&lt;/a> on the Crossref REST API we talked to the Copyright Licensing Agency (CLA) about the work they’re doing, and how they’re using the Crossref REST API as part of their workflow.&lt;/p>
&lt;p>&lt;span >&lt;b>Alex Cole, Senior Business Analyst at the Copyright Licensing Agency introduces the DCS&lt;/b>&lt;/span>&lt;/p>
&lt;p>The Digital Content Store (DCS) is an innovative rights, technology and content platform for UK Higher Education Institutions (HEIs), which was developed collaboratively with HEIs, publishers and technology partners. The platform is included in the CLA annual licence fee and is an optional tool for licensees.&lt;/p>
&lt;p>At its core, the system is a searchable repository of digital copies that have been created under the licence by HEIs (the CLA Digital Content Store), it also functions as a workflow management tool. When extracts are digitised by HEIs under the CLA Licence, they are uploaded directly to the DCS. Once an extract is uploaded and assigned to a course, students are able to access the extract via a secure link. Every year HEIs are obliged to report all of these digitised items to CLA as part of the terms of their copyright blanket licence. Prior to the DCS, HEIs were having to submit this data manually, a process that could take days, if not weeks. The system removes the need for annual census reporting to CLA, reducing the data collection burden on the HE sector and creating administrative efficiencies through streamlining the digital course pack creation process.&lt;br>&lt;/p>
&lt;p>&lt;span >&lt;b>Can you talk about how you&amp;rsquo;re using the &lt;a href="https://www.cla.co.uk/blog-crossref-api#_msocom_1" target="_blank">Crossref REST API&lt;/a> within CLA Digital Content Store (DCS)?&lt;/b>&lt;/span>&lt;/p>
&lt;p>When a DCS user adds a new extract to a course they need to include relevant metadata. This metadata is necessary, as it ultimately helps CLA in correctly identifying the copyright owner of the extract so that we can make sure they receive fair payment in our royalties distributions.
The Crossref REST API supplies the DCS user with article and journal metadata so that they can provide the correct information about the content they are uploading. Using the API saves the user the time they would have otherwise spent searching for this data, streamlining their workflow and making the process more efficient.&lt;/p>
&lt;p>Searching for and adding content in the DCS
&lt;img src="https://www.crossref.org/images/blog/CLA_blog.jpg" alt="Screen shot" class="img-responsive"/>&lt;/p>
&lt;p>&lt;span >&lt;b>What are your future development plans?&lt;/b>&lt;/span>&lt;/p>
&lt;p>We’re continuing to develop the DCS in order to improve user experience for our customers. We’re currently looking into opening up access for our users by allowing academics to submit requests to
the DCS via a web-form and our own DCS Course Content URL API. We are also looking into incorporating the Crossref REST API into some of our back office workflows to improve efficiency and simplify our workflow. The metadata that we can retrieve from Crossref can help us match customer usage to our rights database.&lt;/p>
&lt;p>&lt;span >&lt;b>What else would you like to see in &lt;a href="https://www.cla.co.uk/blog-crossref-api#_msocom_1" target="_blank">Crossref metadata&lt;/a>?&lt;/b>&lt;/span>&lt;/p>
&lt;p>Going forward we’d like to see:&lt;br>&lt;/p>
&lt;ul>
&lt;li>More books included in the database.&lt;br>&lt;/li>
&lt;li>Indicating if an ISSN is associated with the print or digital edition of a journal.&lt;br>&lt;br>
Thanks Alex!&lt;/li>
&lt;/ul></description></item><item><title>Event Data enters Beta</title><link>https://www.crossref.org/blog/event-data-enters-beta/</link><pubDate>Wed, 05 Jul 2017 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/event-data-enters-beta/</guid><description>&lt;p>We’ve been talking about it at events, blogging about it on our site, living it, breathing it, and even sometimes dreaming about it, and now we are delighted to announce that Crossref Event Data has entered Beta.&lt;/p>
&lt;img src="http://assets.crossref.org/logo/crossref-event-data-logo-200.svg" alt="Crossref Event Data logo" width="200" height="83" />
&lt;p>A collaborative initiative by Crossref and DataCite, Event Data offers transparency around the way interactions with scholarly research occur online, allowing you to discover where it’s bookmarked, linked, liked, shared, referenced, commented on etc., across the web, and beyond publisher platforms.&lt;/p>
&lt;p>The name Event Data reflects the nature of the service, as it collects and stores digital actions that occur on the web, from the quick and simple, such as bookmarking and referencing, through to deeper interconnectivity such as exposing the links between research artifacts. Each individual action is timestamped and recorded in our system as an Event, and made available to the community via an API.&lt;/p>
&lt;p>Event Data will be available for absolutely anyone to use; publishers, third party vendors, editors, bibliometricans, researchers, authors, funders etc., and with tens of thousands of events occurring every day, there’s a wealth of insight to be gained for those interested in analyzing and interpreting the data.&lt;/p>
&lt;p>It’s important to note that Event Data does not provide metrics. What is does provide is the raw data to help you facilitate your own analysis, giving you the freedom to integrate the data into your own systems.&lt;/p>
&lt;p>We are currently working very closely with a few organisations with specific use cases who are helping us to test and refine Beta before we launch our production service later this year. If you decide to take a look at Beta yourself, all the data you collect from Event Data is licensed for public sharing and reuse &lt;a href="https://www.crossref.org/services/event-data/terms/">according to our Terms of Use.&lt;/a>&lt;/p>
&lt;p>&lt;em>Until Event Data is in production mode, we do not recommend building any commercial or customer-based tools off the data.&lt;/em>
 
If you are not in the Beta test group but are interested in participating, please contact me below. For more information about Event Data, &lt;a href="https://www.eventdata.crossref.org/guide/index.html" target="_blank">please see our user guide.&lt;/a>&lt;/p>
&lt;p>Please contact me, &lt;a href="mailto:eventdata@crossref.org">Jennifer Kemp&lt;/a>&amp;mdash;Outreach Manager for Event Data&amp;mdash;with any questions.&lt;/p></description></item><item><title>Crossref and colleagues in South Korea</title><link>https://www.crossref.org/blog/crossref-and-colleagues-in-south-korea/</link><pubDate>Fri, 30 Jun 2017 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/crossref-and-colleagues-in-south-korea/</guid><description>&lt;h3 id="connecting-crossref-orcid-datacite-and-our-communities">Connecting Crossref, ORCID, DataCite, and our communities&lt;/h3>
&lt;p>&lt;strong>Q:&lt;/strong> What do you get if you combine our three organisations for a week to catch up with our Korean community - publishers, librarians, universities, researchers, and service providers?
&lt;strong>A:&lt;/strong> Two events, plenty of meetings, great conversations and feedback, fabulous Korean hospitality, and a little jet-lag.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/tweet-south-korea-blog.jpg" alt="tweet image" width="350px" />
&lt;/div>
&lt;p>Over the past few years, Crossref has seen huge growth in our members in Korea. We have nine &lt;a href="https://www.crossref.org/community/sponsors">Sponsoring Affiliates&lt;/a> (who look after nearly 1,000 members between them), two Sponsoring Members and nearly 80 Library members. With the &lt;a href="http://www.doi.org" target="_blank">International DOI Foundation (IDF)&lt;/a> strategy meeting taking place in Daejon, it seemed sensible to combine that with our own events and meetings with key organisations. This also fitted nicely with some plans that ORCID and DataCite had, so we combined forces.&lt;/p>
&lt;p>We (that&amp;rsquo;s me, Rachael Lammey, Ed Pentz, and Geoffrey Bilder) hosted a &lt;a href="https://www.crossref.org/events/">Crossref LIVE local&lt;/a> event on Monday 12th June for around 80 members and affiliates. We were joined by Alice Meadows and Nobuko Maiyairi (ORCID), Martin Fenner (DataCite), and Professor Sun-Tae Hong (Seoul National University) as co-presenters. We looked at the global reach of Korean research, and how registering content with Crossref and participating in services like Reference Linking helps create valuable connections between research outputs. With so many established members in Korea, we were able to go beyond the basics and emphasize the importance of metadata input, metadata delivery, and preview our upcoming &lt;a href="https://www.crossref.org/services/event-data/">Event Data&lt;/a> service. We also talked data-sharing and the value of integrating ORCID iDs into publisher and institution workflows.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/growth-research-outputs-asia-pacific.png" alt="Growth chart" class="img-responsive"/>
_Growth in research outputs in Asia Pacific 2009-2017. Source: Web of Science databases SCI-E, SSCI and AHCI only, downloaded 19/4/2017. Data provided by Wiley (thank you!)_
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/jgic-seoul.jpg" alt="JGIC image" width="350px" />
&lt;/div>
&lt;p>Later in the week we took a multi-pronged approach to highlight the many shared principles of our organisations and discuss the specific initiatives we’re collaborating on. We held the &lt;a href="https://orcid.org/content/joint-global-infrastructure-conference" target="_blank">Joint Global Infrastructure Conference&lt;/a> covering the global nature of what we do and the connections/interoperability between ORCID, DataCite and Crossref. This interoperability and our governance structures lend themselves to cooperation on other initiatives such as &lt;a href="https://twitter.com/metadata2020?lang=en" target="_blank">Metadata 2020&lt;/a> and &lt;a href="https://doi.org/10.64000/g720f-z9z14" target="_blank">The OI Project&lt;/a>, which we were able to share.&lt;/p>
&lt;blockquote>
&lt;p>&lt;a class="twitter-timeline" href="https://twitter.com/hashtag/jgic_seoul" data-widget-id="879259929458225152">Check out all #jgic_seoul tweets.&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;p>Guest speakers volunteered to talk about how they work with our organisations - we were joined by Choon Shil Lee from the &lt;a href="https://www.kamje.or.kr/" target="_blank">Korean Association of Medical Journal Editors (KAMJE)&lt;/a> to demonstrate their ORCID integrations, and Hideaki Takeda from the &lt;a href="https://japanlinkcenter.org/top/english.html" target="_blank">Japan Link Centre (JaLC)&lt;/a> who discussed the infrastructure and services they use to register and disseminate content globally. User stories like this are great - they highlight how people work with our services, give others ideas, and also flag up where we can do more.&lt;/p>
&lt;p>Part of doing more involved providing clarification on Crossref’s position alongside other DOI Registration Agencies. With a new Registration Agency in Korea, we needed to communicate the global nature of what we do to help our members achieve their discoverability goals, as &lt;a href="https://www.crossref.org/membership/#member-obligations-and-benefits/">not all DOIs are made equal&lt;/a>. Through working with ORCID and DataCite colleagues we were able to place great importance both on our work worldwide, and on the benefits to Korean societies in collaborating outside national boundaries.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/plug-image.jpg" alt="Plug socket image" width="300px" />
&lt;/div>
&lt;p>Combining talks from our three organisations was a great opportunity to emphasize the importance of shared global infrastructure. Geoffrey Bilder’s plug socket analogy is apt - services that work cross-border, cross-language, and cross-subject areas streamline processes for all of our different communities and enable research to travel beyond national boundaries and help it be found, linked, cited and assessed.&lt;/p>
&lt;p>Want to find out more? Slides from both meetings are available &lt;a href="https://www.slideshare.net/Crossref/tag/live-seoul-2017" target="_blank">here&lt;/a> and &lt;a href="https://orcid.org/content/joint-global-infrastructure-conference" target="_blank">here&lt;/a>, and watch out for further collaborative events.&lt;/p></description></item><item><title>Crossref receives SOC accreditation for data integrity and security</title><link>https://www.crossref.org/blog/crossref-receives-soc-accreditation-for-data-integrity-and-security/</link><pubDate>Wed, 21 Jun 2017 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/crossref-receives-soc-accreditation-for-data-integrity-and-security/</guid><description>&lt;p>We are delighted to announce that Crossref has been awarded the Service organisation Control (SOC) 2® accreditation after an independent assessment of our controls and procedures by the American Institute of CPA’s (AICPA).&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/soc-logo-2.jpg" alt="SOC logo" width="200px" />
&lt;/div>
&lt;p>The SOC 2® accreditation is awarded to service organisations that have passed standard trust services criteria relating to the security, availability, and processing integrity of systems used to process users’ data and the confidentiality and privacy of the information processed by these systems.&lt;/p>
&lt;p>The AICPA’s assessment also reviewed our vendor management programs, internal corporate governance and risk management processes, and regulatory oversight.&lt;/p>
&lt;p>&lt;a href="https://www.ssae-16.com/soc-2/" target="_blank">Find out more about the SOC accreditation structure&lt;/a>&lt;/p></description></item><item><title>Now put your hands up! (for a Similarity Check update)</title><link>https://www.crossref.org/blog/now-put-your-hands-up-for-a-similarity-check-update/</link><pubDate>Tue, 06 Jun 2017 00:00:00 +0000</pubDate><author>Madeleine Watson</author><guid>https://www.crossref.org/blog/now-put-your-hands-up-for-a-similarity-check-update/</guid><description>&lt;p>Today, I’m thinking back to 2008. A time when khaki and gladiator sandals dominated my wardrobe. The year when Obama was elected, and Madonna and Guy Ritchie parted ways. When we were given both the iPhone 3G and the Kindle, and when the effects of the global financial crisis lead us to come to terms with the notion of a ‘staycation’. In 2008 we met both Wall-E and Benjamin Button, were enthralled by the Beijing Olympics, and became addicted to Breaking Bad. And lest we forget, 2008 was also the year in which Beyoncé brought us Single Ladies; in all its sassy hand-waving, monochrome glory. For Crossref though, 2008 holds another important milestone as it was the year we launched our Similarity Check initiative. Today, the artist formerly known as CrossCheck provides our members with cost-effective access to Turnitin’s powerful text comparison tool, &lt;a href="https://www.ithenticate.com/" target="_blank">iThenticate&lt;/a>.&lt;/p>
&lt;p>Fast forward nearly a decade, and it’s wonderful to see just how Similarity Check membership has grown in the nine years since launch; from 16 original members in 2008 to over 1,300 today.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/Sim Check member graph_Fig 1.1.png" alt="Membership graph" width="800px" height="450" class="img-responsive" />
&lt;p>&lt;em>Figure 1.1 The number of publishers participating in the Similarity Check service each year between 2008 – 2017 (to April)&lt;/em>&lt;/p>
&lt;br>
Usage of iThenticate is also consistent with this growth in membership, and throughout 2016 our members checked over four million manuscripts for similarity using the tool. As Similarity Check members contribute their full-text content into Turnitin’s database, this increase in membership also has a dramatic impact on the volume of content indexed by Turnitin. Today, members can compare their manuscripts against Turnitin’s database of over 60 million full-text works provided by Similarity Check members. With over 88 million works currently registered with Crossref, this means that 68% of all content deposited with us is now available for comparison in iThenticate.
&lt;p>Over the years we have worked very closely with Turnitin to help champion new iThenticate feature developments that best support our member’s use of the tool as a core function of their editorial workflow. Many of our members too have also worked together with Turnitin to provide feedback on user experience and design.&lt;/p>
&lt;p>Below, Turnitin’s Product Manager for iThenticate, Sun Oh, shares an insight into their research process and how Similarity Check member’s feedback has been critical in developing new and improved functionality in iThenticate.&lt;/p>
&lt;p>&lt;strong>Read on to learn more from Sun&amp;hellip;&lt;/strong>&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/turnitin-logo-primary-rgb.png" alt="Turnitin logo" width="400px" class="img-responsive"/>
&lt;p>&lt;em>Sun Oh is a Senior Product Manager at Turnitin. She is currently the Product Manager for iThenticate and backend systems including the Content Intake System and similarity reports.&lt;/em>&lt;/p>
&lt;br>
Last year we surveyed our Crossref customers to find out what Similarity Check improvements they would like to see and noticed a recurring request for the ability to compare two or more personally sourced documents.
&lt;p>We were intrigued and decided to run with it. We contacted the respondents who had asked for this, and started conversations to find out more. This helped us gather invaluable data, which in turn helped us to build the feature based on real use cases and with a clear view of what was wanted.&lt;/p>
&lt;p>The design prototypes were reviewed for usability and effectiveness each step of the way by the respondents and once we had the feature up and running, those who requested it in our initial survey were among the first to trial it.&lt;/p>
&lt;p>We’re thrilled to announce that we’ve now launched the new Doc-to-Doc comparison feature, available through iThenticate’s native interface. Simply select the Doc-to-Doc comparison upload method from the document submission panel.&lt;/p>
&lt;p>If you are a Crossref member using Similarity Check, you have exclusive early access to this new feature, which allows you to use iThenticate’s powerful similarity check functionality and apply it to your own, private documents.&lt;/p>
&lt;h2 id="how-does-doc-to-doc-comparison-work">How does Doc-to-Doc Comparison work?&lt;/h2>
&lt;p>Doc-to-Doc comparison allows users to upload one primary document and compare it against up to five other documents.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/SimCheck_Doc-to-doc_ Fig 1.2.png" alt="Doc-to-Doc Comparison screenshot" class="img-responsive"/>
&lt;p>&lt;em>Figure 1.2 The document upload screen for Doc-to-Doc comparison&lt;/em>&lt;/p>
&lt;br>
When the upload is complete, a similarity score is generated for the primary document based on the amount of similar content found in the comparison documents. A full comparison report is also available.
&lt;p>The comparison report will open in the document viewer, and will display the primary document along with a list of the comparison documents and with their similarity percentage. If one of the comparison documents doesn’t include text that matches the primary document, iThenticate will still display it anyway, with a 0% score, allowing users to rule it out of their inspection. The similarity report will be stored securely in the user’s folder until they delete it.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/SimCheck_Doc-to-doc_Fig 1.3.png" alt="Document viewer screenshot" class="img-responsive"/>
&lt;p>&lt;em>Figure 1.3 Similarity report for Doc-to-Doc comparison&lt;/em>&lt;/p>
&lt;br>
As these documents will not be stored in a shared database, they won’t affect the similarity score of any future submissions. Primary and comparison documents remain completely private and will not be indexed into the shared iThenticate content database.
&lt;p>To get a better idea of how Doc-to-Doc comparison works, check out the &lt;a href="https://guides.turnitin.com/iThenticate/Doc-to-Doc_Comparison" target="_blank">iThenticate feature guide &lt;/a>on the Turnitin website.&lt;/p>
&lt;h2 id="start-using-doc-to-doc-comparison-now">Start using Doc-to-Doc Comparison now!&lt;/h2>
&lt;p>If you’re a Crossref member using Similarity Check, you can log in to your iThenticate account now and select the Doc-to-Doc comparison link on the homepage.&lt;/p>
&lt;h2 id="what-else-is-new-in-ithenticate-in-this-new-release">What else is new in iThenticate in this new release?&lt;/h2>
&lt;h3 id="new-look">New Look&lt;/h3>
&lt;p>In addition to Doc-to-Doc comparison, we decided to refresh the look and feel of iThenticate; the same tools our users know and trust, now with a modern interface. Users will also notice that iThenticate now has more readable font and friendlier styling throughout.&lt;/p>
&lt;h3 id="report-mode-memory">Report Mode Memory&lt;/h3>
&lt;p>To make life easier, iThenticate now remembers whether users were in the All Sources or Match Overview mode when they last used the Document Viewer. iThenticate will then open documents in this mode automatically hereafter.&lt;/p>
&lt;h3 id="improved-submission-process">Improved Submission Process&lt;/h3>
&lt;p>We’re also enhancing our submission process by making the upload requirements more inclusive. We’ve increased the possible file size limit from 40MB to 100MB when uploading to either the database or to Doc-to-Doc comparison, and PowerPoint (.ppt) and Excel (.xlsm) file formats are now accepted.&lt;/p>
&lt;h2 id="developments-completed-in-2016">Developments completed in 2016&lt;/h2>
&lt;p>If Similarity Check members haven’t had a chance to check out the improvements we introduced in iThenticate throughout 2016, here’s a quick recap. You can always find our updates on the What&amp;rsquo;s New page of the iThenticate website.&lt;/p>
&lt;h3 id="download-user-list">Download User List&lt;/h3>
&lt;p>The ability for administrators to download a list of all the users in their account has been added. This list will allow administrators to easily send emails to users.&lt;/p>
&lt;h3 id="similarity-score-calculation-update">Similarity Score Calculation Update&lt;/h3>
&lt;p>We updated how the similarity score is calculated when bibliographic material is excluded from a similarity report. Now, when bibliography exclusion is enabled, the word count of the bibliography is not included when calculating the overall percentage. This update to the similarity report calculation helps to provide users with a more accurate similarity score.&lt;/p>
&lt;h3 id="improved-security">Improved Security&lt;/h3>
&lt;p>We are fully committed to keeping user’s data safe and secure at all times. To that end, we’ve added additional security logging, put in measures to enforce stronger passwords, and enabled Captcha after failed login attempts.&lt;/p>
&lt;h3 id="faster-report-generation">Faster Report Generation&lt;/h3>
&lt;p>We’ve increased the number of resources dedicated to the generation of similarity reports for our iThenticate service. As a result, users should see faster turnaround times for similarity reports.&lt;/p>
&lt;h3 id="support-for-eight-additional-languages">Support for Eight Additional Languages&lt;/h3>
&lt;p>The iThenticate user interface is now available in eight additional languages: German, Dutch, Latin American Spanish, Brazilian Portuguese, Italian, French, and both Simplified &amp;amp; Traditional Chinese. When adding new users to an account, administrators can specify the language of the new user, which will then send a welcome email in the selected language. Individual users can also set their preferred language by selecting a language from the Language dropdown in the Settings menu.&lt;/p>
&lt;h3 id="content-intake-system">Content Intake System&lt;/h3>
&lt;p>We’ve developed a new Content Intake System which enables our publication content database to scale so that our users can compare against a constantly growing database of the most recently published content. This allows us to index Similarity Check members’ data in a much more reliable and efficient way than legacy intake methods. And recently, we’ve made the collecting and processing of content from Crossref members using Similarity Check even faster by parallelising our processors. This means that we have more processors running simultaneously to process data.&lt;/p>
&lt;p>By removing the need for crawling, we will also minimize our impact on traffic to a Similarity Check member’s public-facing website. The Content Intake System is able to directly collect full text URLs from members DOI metadata. This results in a huge reduction in the time it takes from when a publisher first deposits a new DOI with Crossref, to when the content is indexed by us into our full-text publication database. To date, we’ve been able to index the content associated with 60 million Crossref DOIs, and have indexed more than 165 million published works in total which submissions are compared against in iThenticate.&lt;/p>
&lt;h3 id="walker-web-crawler">Walker (web crawler)&lt;/h3>
&lt;p>We’ve developed a new web crawler. Referred to as “Walker”, the crawler makes it possible to provide quicker and more reliable similarity matches to content available on the web. Not to be confused with the Content Intake System mentioned above, Walker’s purpose is to crawl the public web and is not used for indexing full-text content from Similarity Check members.&lt;/p>
&lt;p>Using Walker, we’re adding an average of nearly 10 million new web pages to our content database per day, ensuring we have the freshest internet content available to find matches against.&lt;/p>
&lt;h2 id="wed-love-to-get-your-feedback">We’d love to get your feedback!&lt;/h2>
&lt;p>As we design and develop new features, we want to make sure we’re fully understanding Similarity Check member’s needs and would love the opportunity to engage with users for further research. If you’d like to sign up to participate in user research for upcoming feature developments, please take a few minutes to fill out our Feedback Program Form. We look forward to connecting with you!&lt;/p>
&lt;h2 id="contact-turnitin-edit-300424-support-for-ithenticate-contact-details-updated-">&lt;del>Contact Turnitin&lt;/del> (EDIT 30/04/24: Support for iThenticate, contact details updated )&lt;/h2>
&lt;p>Please go to our &lt;a href="https://www.crossref.org/documentation/similarity-check/ithenticate-account-use/help/" target="_blank">Get help with Similarity Check page&lt;/a>&lt;/p>
&lt;p>&lt;del>For iThenticate technical and billing support, please email &lt;a href="mailto:tiisupport@turnitin.com">tiisupport@turnitin.com&lt;/a>&lt;/del>&lt;/p>
&lt;p>&lt;del>For questions about content indexing, please contact Gareth at &lt;a href="mailto:gmalcolm@turnitin.com">gmalcolm@turnitin.com&lt;/a>&lt;/del>&lt;/p>
&lt;p>&lt;del>For iThenticate product development questions, please contact Sun at &lt;a href="mailto:soh@turnitin.com">soh@turnitin.com&lt;/a>&lt;/del>&lt;/p>
&lt;p>           * &lt;del>Sun Oh, Product Manager for iThenticate*&lt;/del>&lt;/p>
&lt;br>
**Thanks to Sun and the whole team at Turnitin for sharing this update.**
&lt;p>For more information about Similarity Check, visit our &lt;a href="https://www.crossref.org/services/similarity-check/">service page&lt;/a>.&lt;/p>
&lt;p>Want to join Crossref Similarity Check? Please contact our &lt;a href="mailto:member@crossref.org">membership specialist&lt;/a>.&lt;/p></description></item><item><title>Data citations and the eLife story so far</title><link>https://www.crossref.org/blog/data-citations-and-the-elife-story-so-far/</link><pubDate>Thu, 18 May 2017 00:00:00 +0000</pubDate><author>Melissa Harrison</author><guid>https://www.crossref.org/blog/data-citations-and-the-elife-story-so-far/</guid><description>&lt;p>When we set up the eLife journal in 2012, we knew datasets were an important component of research content and decided to give them prominence in a section entitled ‘Major datasets’ (see images below). Within this section, major previously published and generated datasets are listed. We also strongly encourage data citations in the reference list.&lt;/p>
&lt;img src="https://www.crossref.org/images/blog/elife-blog.png" alt="Major datasets" class="img-responsive"/>
&lt;p>&lt;em>Major Datasets for &lt;a href="https://doi.org/10.7554/eLife.24487" target="_blank">“Structural basis of protein translocation by the Vps4-Vta1 AAA ATPase”&lt;/a> by N. Monroe, H. Han, P. Shen, et. al.&lt;/em>&lt;/p>
&lt;p>Almost five years on and I feel we have still not cracked it! We have signed up to the &lt;a href="https://www.force11.org/group/joint-declaration-data-citation-principles-final" target="_blank">Force11 data citation principles&lt;/a>, which were published three years back; we have been actively involved in working groups of Force11 and others, for example the &lt;a href="http://biorxiv.org/content/early/2017/01/19/100784" target="_blank">Data Citation Roadmap for Scientific Publishers&lt;/a> and the JATS XML &lt;a href="http://jats4r.org/data-citations" target="_blank">data citation recommendation&lt;/a> of &lt;a href="http://jats4r.org" target="_blank">JATS4R&lt;/a>. I am also currently working with other publishers to come up with recommended JATS XML tagging for data availability statements, which is easier said than done considering the nuances of dataset uses and also how different publishers approach this.&lt;/p>
&lt;p>Added to this, there is still significant push-back from authors about putting all dataset citations in the reference list (for example, authors are concerned about self-citing by citing a dataset created as part of the research article; “dataset citations” that are in effect a link to a search results page on a database; and the necessitation of hundreds of reference entries if an author has used a large base for the research).&lt;/p>
&lt;p>While eLife is very active in this space, and aims to arrange and mark up the datasets and citations produced by our authors in line with recommendations, the recommendations still have some gaps and the complete picture is not yet clear.&lt;/p>
&lt;p>In late 2014, we brought in-house the process of depositing Crossref metadata (previously our online host did this for us). It gave us control of our processes and, at the time, we sent all the information we could to Crossref and have ensured our references are open and available in the Crossref public API. The code for this conversion process is all open-source and available for reuse. It can be &lt;a href="https://github.com/elifesciences/elife-crossref-feed" target="_blank">found on GitHub&lt;/a>. Since then, besides small improvements to the code and troubleshooting problems, we’ve not updated the code. I have been keeping a list of Crossref features and new deposit metadata we can add to our deposits, and now is the time for us to start working on this again.&lt;/p>
&lt;p>One of the items we’ll be addressing is data citations.&lt;/p>
&lt;p>The Crossref reference schema does not cater well for non-book or -journal content, and if an item does not have a DOI, the “reference” is not very useful because of the few tags available in the Crossref schema.&lt;/p>
&lt;p>However, Crossref have introduced the relationship type to their schema, so data references can be well linked and mineable. As I see Crossref as a potential broker between publishers and data repositories in the future, using the relationship-type deposit for all datasets will assist this and also allow these data points to more easily be seen within the article Nexus framework (see the recent blog post, &lt;a href="https://doi.org/10.64000/hdj5p-8vy92" target="_blank">How do you deposit data citations?&lt;/a>).&lt;/p>
&lt;p>At eLife, we already distinguish between Dataset generated as part of research results (relationship type in the Crossref schema: “isSupplementedBy”) and Dataset produced by a different set of researchers or previously published (relationship type: “references”). Therefore, it will not be hard for us to convert all the information about data referencing that is within the dataset section into a relationship-type deposit in the conversion to Crossref XML.&lt;/p>
&lt;p>We have also recently gone through an exercise of defining a set of rules for all our references and, of the 12 allowed types, one is data. The rules for Schematron (a rule-based validation language for making assertions about the presence or absence of patterns in XML trees; see also this useful &lt;a href="http://jats4r.org/schematron-a-handy-xml-tool-thats-not-just-for-villains" target="_blank">article about Schematron&lt;/a> on the JATS4R learning centre) have been written for the eLife ‘business’ rules. Subject to final testing, these will be integrated into our workflow (the Schematron is open source and available for reuse on &lt;a href="https://github.com/elifesciences/reference-schematron" target="_blank">GitHub&lt;/a>, and we will also build an API for people to use the Schematron direct). This will allow us to easily identify all data references and convert them into relationship types in the XML delivered to Crossref. This way, they will not be lost in the references section of our deposits, but properly identified.&lt;/p>
&lt;p>However, we do appreciate this will become harder for us as authors become more familiar with datasets as references, because we will not be able to identify the difference between generated and analysed datasets so easily.&lt;/p>
&lt;p>The code developed and used to complete these conversions will, again, be on Github and open source, and we actively encourage the reuse of this.&lt;/p>
&lt;p>While the industry is still working on the best way to deal with data and ensuring it is given the prominence it requires, we feel this is the best approach we can take. Nothing is forever and we can still change what we do in the future. The beauty of open-source code also means that if there is an alternative approach now or in the future, the code we wrote at eLife can be developed by someone else in the future and we can all benefit.&lt;/p>
&lt;p>If you have any questions, please do not hesitate to &lt;a href="mailto:feedback@crossref.org">contact us&lt;/a>.&lt;/p></description></item><item><title>Want to be on our Board?</title><link>https://www.crossref.org/blog/want-to-be-on-our-board/</link><pubDate>Fri, 28 Apr 2017 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/want-to-be-on-our-board/</guid><description>&lt;blockquote>
&lt;p>Do you want to affect change for the scholarly community?&lt;/p>
&lt;/blockquote>
&lt;p>Our Nominating Committee is inviting expressions of interest to serve on the Board as it begins its consideration of a slate for the November 2017 election.&lt;/p>
&lt;p>Key responsibilities of the Board are setting the strategic direction for the organisation, providing financial oversight, and approving new policies and services. Some of the decisions the board has made in recent years include:&lt;/p>
&lt;ul>
&lt;li>Establishing &lt;a href="https://doi.org/10.64000/g720f-z9z14" target="_blank">The OI Project&lt;/a> to create a persistent organisation Identifier;&lt;/li>
&lt;li>Inclusion of &lt;a href="https://www.crossref.org/news/2016-11-02-crossref-now-accepts-preprints">preprints in the Crossref metadata&lt;/a>; and&lt;/li>
&lt;li>The approval to &lt;a href="https://www.crossref.org/services/event-data">develop Event Data&lt;/a> which will track online activity from multiple sources.&lt;!--more-->&lt;/li>
&lt;/ul>
&lt;h2 id="any-member-can-express-interest-in-serving-on-the-board">Any member can express interest in serving on the Board&lt;/h2>
&lt;p>We are seeking people who know about scholarly communications and would like to be part of our future. If you have a vision for the international Crossref community, we are interested in hearing from you. Crossref members that are eligible to vote, and would like to be considered, can &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSdwqraD2fjb3eqZgLpTQWsMYPQvvz4LARLq6k8H8mA7xGbZAw/viewform" target="_blank">express their interest&lt;/a> together with statements of interest from you and from your organisation. The form should be completed and sent to us before 01 June 2017.&lt;/p>
&lt;h2 id="the-role-of-the-nominating-committee">The role of the Nominating Committee&lt;/h2>
&lt;p>The Nominating Committee meets to discuss change, process, criteria, and potential candidates, ensuring a fair representation of membership. The Committee is made up of three board members not up for election, and two non-board members.&lt;/p>
&lt;p>Current Nominating Committee members:&lt;/p>
&lt;ul>
&lt;li>John Shaw, Sage (Chair)&lt;/li>
&lt;li>Mark Patterson, eLife&lt;/li>
&lt;li>Paul Peters, Hindawi&lt;/li>
&lt;li>Chris Fell, Cambridge University Press&lt;/li>
&lt;li>Rebecca Lawrence, F1000 Research&lt;/li>
&lt;/ul>
&lt;h2 id="about-the-election-and-our-board">About the election and our Board&lt;/h2>
&lt;p>We have a principle of &lt;a href="https://www.crossref.org/truths">one member, one vote&lt;/a>; our board comprises a cross-section of members and it doesn’t matter how big or small you are, every member gets a single vote. Board terms are three years, and one third of the Board is eligible for election every year. There are six seats up for election in 2017. The board meets in a variety of international locations in March, July, and November each year. View a list of the &lt;a href="https://www.crossref.org/board-and-governance">current Crossref Board members and a history of the decisions they’ve made (motions)&lt;/a>. The election opens online in late September 2017 and voting is done by proxy online or in person at the annual business meeting during Crossref LIVE in November 2017. Election materials and instructions for voting will be available to all Crossref members online in late September 2017. The board needs to be truly representative of Crossref’s global and diverse membership of organisations who publish.&lt;/p>
&lt;p>Please &lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSdwqraD2fjb3eqZgLpTQWsMYPQvvz4LARLq6k8H8mA7xGbZAw/viewform" target="_blank">express interest using the form&lt;/a>, or &lt;a href="mailto:lhart@crossref.org">email me&lt;/a> with any questions.&lt;/p></description></item><item><title>The OI Project gets underway planning an open organisation identifier registry</title><link>https://www.crossref.org/blog/the-oi-project-gets-underway-planning-an-open-organisation-identifier-registry/</link><pubDate>Tue, 28 Mar 2017 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/the-oi-project-gets-underway-planning-an-open-organisation-identifier-registry/</guid><description>&lt;p>At the end of October 2016, Crossref, DataCite, and ORCID &lt;a href="https://doi.org/10.64000/224cc-a0w76" target="_blank">reported on&lt;/a> collaboration in the area of organisation identifiers. We issued three papers for community comment and after input we subsequently announced the formation of The OI Project, along with a call for expressions of interest from people interested in serving on the working group.&lt;/p>
&lt;p>We had a great response and are happy to report that the Working Group has now been established, and is already underway with work to develop a plan for an open, independent, not-for-profit, sustainable, organisation identifier registry. &lt;!--more-->&lt;/p>
&lt;p>There is &lt;a href="https://orcid.org/content/organisation-identifier-working-group" target="_blank">information about the OI Project Working Group on the ORCID website&lt;/a> including a list of the &lt;a href="https://orcid.org/content/organisation-id-working-group" target="_blank">17 working group members&lt;/a>. They represent a broad range of scholarly communications stakeholders. Our scope of work includes three separate but interdependent areas:&lt;/p>
&lt;ul>
&lt;li>Governance;&lt;/li>
&lt;li>Registry Product Definition; and&lt;/li>
&lt;li>Business Model &amp;amp; Funding.&lt;/li>
&lt;/ul>
&lt;p>The initial goal of the Working Group is to create a thorough and robust implementation plan by the end of 2017.&lt;/p>
&lt;p>Please take a look at the website for more information and we’ll provide updates as things progress throughout the course of the year.&lt;/p>
&lt;p>&lt;strong>Please &lt;a href="mailto:oi-project@orcid.org">contact us&lt;/a> with any questions.&lt;/strong>&lt;/p></description></item><item><title>Revised Crossref DOI display guidelines are now active</title><link>https://www.crossref.org/blog/revised-crossref-doi-display-guidelines-are-now-active/</link><pubDate>Wed, 15 Mar 2017 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/revised-crossref-doi-display-guidelines-are-now-active/</guid><description>&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/crossref-doi-display-march-2017.jpg
" alt="Crossref DOI Display" width="300px" />
&lt;/div>
&lt;p>We have updated our DOI display guidelines as of March 2017, this month! I described the what and the why in my previous blog post &lt;a href="https://doi.org/10.64000/h1se5-5kq62" target="_blank">New Crossref DOI display guidelines are on the way&lt;/a> and in an email I wrote to all our members in September 2016. I’m pleased to say that the updated Crossref &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">DOI display guidelines are available via this fantastic new website&lt;/a> and are now active. Here is the URL of the full set of guidelines in case you want to bookmark it (&lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">https://doi.org/10.13003/5jchdy&lt;/a>) and a shareable image to spread the word on social media.&lt;/p>
&lt;p>This blog is a quick reminder that all Crossref members should now be displaying DOIs in the &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">recommended new format&lt;/a> from this month, on any new content you publish online. Please note these guidelines are for Crossref DOIs only, we have nearly 90 million registered but there are others, and &lt;a href="https://www.crossref.org/membership/#member-obligations-and-benefits/">not all DOIs are made equal&lt;/a>.&lt;/p>
&lt;p>The main changes are to display the DOI as a full, linked URL using HTTPS:&lt;/p>
&lt;p>&lt;code>https://doi.org/10.xxxx/xxxxx&lt;/code>&lt;/p>
&lt;p>For background on the HTTPS issue please read Geoffrey Bilder’s blog post, &lt;a href="https://doi.org/10.64000/6xkdj-gzr09" target="_blank">Linking DOIs using HTTPS&lt;/a>.&lt;/p>
&lt;h2 id="what-will-happen-if-you-dont-update-your-crossref-doi-display">What will happen if you don’t update your Crossref DOI display?&lt;/h2>
&lt;p>We tell members that they should be working towards making the change even if they can’t do it until later - we recognize that it is not always an easy change to make.&lt;/p>
&lt;p>However, if members don’t make the change, nothing immediate will happen (Crossref won’t fine you!) although as more members make the change your display will look odd and out of place compared with other members’ content.&lt;/p>
&lt;h3 id="if-you-have-any-questions-please-do-not-hesitate-to-contact-usmailtofeedbackcrossreforg">If you have any questions please do not hesitate to &lt;a href="mailto:feedback@crossref.org">contact us&lt;/a>.&lt;/h3></description></item><item><title>How do you deposit data citations?</title><link>https://www.crossref.org/blog/how-do-you-deposit-data-citations/</link><pubDate>Thu, 02 Mar 2017 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/how-do-you-deposit-data-citations/</guid><description>&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/Data_within_XML.png" alt="An exemplary image" width="300px" />
&lt;/div>
&lt;h3 id="please-visit-crossrefs-official-data--software-citations-deposit-guidehttpsupportcrossreforghcen-usarticles215787303-crossref-data-software-citation-deposit-guide-for-publishers-for-deposit-details">Please visit Crossref&amp;rsquo;s official &lt;a href="http://support.crossref.org/hc/en-us/articles/215787303-Crossref-Data-Software-Citation-Deposit-Guide-for-Publishers" target="_blank">Data &amp;amp; Software Citations Deposit Guide&lt;/a> for deposit details.&lt;/h3>
&lt;p>&lt;strong>Very carefully, one at a time? However you wish.&lt;/strong>&lt;/p>
&lt;p>Last year, we introduced linking publication metadata to associated data and software when registering publisher content with Crossref &lt;a href="https://doi.org/10.64000/hnzd5-aew22" target="_blank">Linking Publications to Data and Software&lt;/a>. This blog post follows the “whats” and “whys” with the all-important “how(s)” for depositing data and software citations. We have made the process simple and fairly straightforward: publishers deposit data &amp;amp; software links by adding them directly into the standard metadata deposit via &lt;strong>relation type and/or references&lt;/strong>. This is part of the **existing Content Registration ** process and requires no new workflows.&lt;/p>
&lt;h2 id="relationships">Relationships&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/data_article_nexus_short.png" alt="An exemplary image" width="500px" />
&lt;/div>
&lt;p>Data &amp;amp; software citations are a valuable part of the “&lt;a href="https://doi.org/10.64000/n0zjv-z6c66" target="_blank">research article nexus&lt;/a>”, comprised of the publication linked to a variety of associated research objects, including data and software, supporting information, protocols, videos, published peer reviews, a preprint, conference papers, etc. For all of these resources, we use relation types in the metadata deposit to “anchor” the article in the article nexus and link to it.&lt;/p>
&lt;h3 id="for-data--software-we-ask-for">For data &amp;amp; software, we ask for:&lt;/h3>
&lt;ul>
&lt;li>identifier of the dataset/software&lt;/li>
&lt;li>identifier type: “DOI”, “Accession”, “PURL”, “ARK”, “URI”, “Other” *&lt;/li>
&lt;li>&lt;a href="https://support.crossref.org/hc/en-us/articles/214357426" target="_blank">relationship type&lt;/a>: “isSupplementedBy” or “references”&lt;/li>
&lt;li>description of dataset or software.
&lt;br/>
*&lt;em>Additional identifier types beyond those used for data or software are also accepted, including ARXIV, ECLI, Handle, ISSN, ISBN, PMID, PMCID, and UUID.&lt;/em>&lt;/li>
&lt;/ul>
&lt;p>Crossref maintains an expansive set of relationship types to support the various resources linked in the research article nexus. For data and software, we recommend “isSupplementedBy” and “references” as relationship types in the metadata. Use the former if it was generated de novo as part of the research results. For those generated by another project and then reused, we recommend applying “references” in the relationship type. These were selected in consultation with DataCite and data working groups. They will provide the level of specificity requested by the community.&lt;/p>
&lt;p>To illustrate how to represent the link within the metadata deposit, we offer two examples from two popular dataset identifiers, one for each of the relationship types.&lt;/p>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>Dataset&lt;/th>
&lt;th>Snippet of deposit XML containing link&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;strong>Dataset with DOI:&lt;/strong> &lt;br/> Data from: Extreme genetic structure in a social bird species despite high dispersal capacity. &lt;br/> &lt;strong>Database:&lt;/strong> Dryad Digital Repository&lt;br/>&lt;strong>DOI:&lt;/strong> &lt;a href="https://doi.org/10.5061/dryad.684v0" target="_blank">https://doi.org/10.5061/dryad.684v0&lt;/a>&lt;/td>
&lt;td>&lt;code>&amp;lt;program xmlns=&amp;quot;http://www.crossref.org/relations.xsd&amp;quot;&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;related_item&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;description&amp;gt;Data from: Extreme genetic structure in a social bird species despite high dispersal capacity&amp;lt;/description&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;inter_work_relation relationship-type=&amp;quot;isSupplementedBy&amp;quot; identifier-type=&amp;quot;doi&amp;quot;&amp;gt;10.5061/dryad.684v0&amp;lt;/inter_work_relation&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;/related_item&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;/program&amp;gt;&lt;/code>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;strong>Dataset with accession number:&lt;/strong>&lt;br/> NKX2-5 mutations causative for congenital heart disease retain functionality and are directed to hundreds of targets &lt;br/>&lt;strong>Database:&lt;/strong> Gene Expression Omnibus (GEO) &lt;br/> &lt;strong>Accession number:&lt;/strong> GSE44902 &lt;br/> &lt;strong>URL:&lt;/strong> &lt;a href="https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE44902" target="_blank">https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE44902&lt;/a>&lt;/td>
&lt;td>&lt;code>&amp;lt;program xmlns=&amp;quot;http://www.crossref.org/relations.xsd&amp;quot;&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;related_item&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;description&amp;gt;NKX2-5 mutations causative for congenital heart disease retain and are directed to hundreds of targets&amp;lt;/description&amp;gt;&lt;/code>&lt;br/> &lt;code>&amp;lt;inter_work_relation relationship-type=&amp;quot;references&amp;quot; identifier-type=&amp;quot;Accession&amp;quot;&amp;gt;GSE44902&amp;lt;/inter_work_relation&amp;gt;&lt;/code> &lt;br/> &lt;code>&amp;lt;/related_item&amp;gt;&lt;/code> &lt;br/>&lt;code>&amp;lt;/program&amp;gt;&lt;/code>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;br/>&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>In the examples above, the Dryad dataset was generated as part of the research published in an article. Hence, it contains the “isSupplementedBy” relationship type. The GEO dataset was reused by and referenced in a scholarly article published separate from the project that generated this dataset. Hence, it contains the “references” relationship type.&lt;/td>
&lt;td>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;p>Both Crossref and DataCite employ this method of linking. Data repositories who register their content with DataCite follow the same process and apply the same metadata tags. This means that we achieve direct data interoperability with links in the reverse direction (data and software repositories to journal articles).&lt;/p>
&lt;h2 id="references">References&lt;/h2>
&lt;p>Another mechanism for depositing data and software citations is to insert it into the manuscript’s references. Publishers then deposit it as part of the article’s references. To do so, publishers follow the general process for depositing references. (Visit Crossref’s &lt;a href="https://support.crossref.org/hc/en-us/articles/215578403-Adding-references-to-your-metadata-record" target="_blank">Support page&lt;/a> for step-by-step instructions.)&lt;/p>
&lt;p>Publishers can deposit the full data or software citation as a unstructured reference.
&lt;br/>
&lt;code>&amp;lt;citation key=&amp;quot;ref=3&amp;quot;&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;unstructured_citation&amp;gt;Morinha F, Dávila JA, Estela B, Cabral JA, Frías Ó, González JL, Travassos P, Carvalho D, Milá B, Blanco G (2017) Data from: Extreme genetic structure in a social bird species despite high dispersal capacity. Dryad Digital Repository. http://dx.doi.org/10.5061/dryad.684v0&amp;lt;/unstructured_citation\&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;/citation&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;/citation_list&amp;gt;&lt;/code>&lt;/p>
&lt;p>Or they can employ any number of &lt;a href="https://support.crossref.org/hc/en-us/articles/215578403-Adding-references-to-your-metadata-record" target="_blank">reference tags&lt;/a> currently accepted by Crossref. Most do not readily suit datasets and software as the suite was originally established to match article and book references. This leaves out substantial metadata needed to identify and describe the dataset, however, if the resource does not have a DOI.
&lt;br/>
&lt;code>&amp;lt;citation key=&amp;quot;ref2&amp;quot;&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;doi&amp;gt;10.5061/dryad.684v0&amp;lt;/doi&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;cYear&amp;gt;2017&amp;lt;/cYear&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;author&amp;gt;Morinha F, Dávila JA, Estela B, Cabral JA, Frías Ó, González JL, Travassos P, Carvalho D, Milá B, Blanco G&amp;lt;/author&amp;gt;&lt;/code>
&lt;br/>
&lt;code>&amp;lt;/citation&amp;gt;&lt;/code>
&lt;br/>
We are exploring the &lt;a href="http://jats4r.org/data-citations" target="_blank">JATS4R&lt;/a> recommendations while we consider expanding the current collection. We welcome additional suggestions from the community.&lt;/p>
&lt;h2 id="precise-accessible-links">Precise, accessible links&lt;/h2>
&lt;p>Crossref’s infrastructure is setup to facilitate the flow of information about scholarly works across the research network. We maintain a fair degree of flexibility both in the structure and completeness of metadata deposited. The aim, though, is to make the links rich in metadata, accurate in associating literature to corresponding resource, and available to both human and machine consumers as per Principle #5 and #7 in the &lt;a href="https://www.force11.org/group/joint-declaration-data-citation-principles-final" target="_blank">Joint Declaration of Data Citation Principles&lt;/a>.&lt;/p>
&lt;p>As with the other associated resources in the article nexus, we recommend depositing data/software links in the publication metadata via relationships. Publishers are free to do this &lt;em>on top of&lt;/em> or &lt;em>independent of&lt;/em> references. Relationship metadata offer a high degree of precision. References are a hodgepodge of various resources cited by the publication, including articles, books, media, blogs, reference materials, etc. and data citations are hard to isolate. Furthermore, the unstructured, “spaghetti string” text is difficult for systems to parse and extract specific information.&lt;/p>
&lt;p>With relationship metadata, data and software resources are expressly designated. We obtain a more accurate link that specifies identifier type and explicitly identifies data generated as part of the research shared in the paper or as reuse of existing data). The richer metadata contained here enables consumers to conduct powerful queries based on different attributes (identifier type, description, relationship), taking data discovery and mining to the next level.&lt;/p>
&lt;p>Furthermore, relationships are important for achieving full accessibility of data and software citations. Access to references is based on publisher permission so not all data citations can be shared (excluding DataCite DOIs). In contrast, all links deposited via relationships are publicly available.&lt;/p>
&lt;p>Publishers play an important role in supporting research validation and reproducibility. Data &amp;amp; software citation is a basic part of of this practice, and instrumental in enabling the reuse and verification of these research outputs, tracking their impact, and creating a scholarly structure that recognizes and rewards those involved in producing them. For the full scoop of how to deposit (i.e., technical details and more), we encourage you to reference the Crossref &lt;a href="http://support.crossref.org/hc/en-us/articles/215787303-Crossref-Data-Software-Citation-Deposit-Guide-for-Publishers" target="_blank">Data &amp;amp; Software Citations Deposit Guide&lt;/a> and contact us (&lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>) with questions or feedback.&lt;/p></description></item><item><title>Taking the "con" out of conferences</title><link>https://www.crossref.org/blog/taking-the-con-out-of-conferences/</link><pubDate>Wed, 15 Feb 2017 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/taking-the-con-out-of-conferences/</guid><description>&lt;p>TL;DR&lt;/p>
&lt;p>Crossref and DataCite are forming a working group to explore conference identifiers and project identifiers. If you are interested in joining this working group &lt;em>and&lt;/em> in doing some actual work for it, please contact us at &lt;code>community@crossref.org&lt;/code> and include the text &lt;code>conference identifiers WG&lt;/code> in the subject heading. &lt;br>&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/mouse-ears.png" alt= "Mouse ears"/>
&lt;/p>
&lt;h2 id="all-the-times-i-could-have-gone-to-walt-disney-world--br-br">All the times I could have gone to Walt Disney World&amp;hellip; &lt;br> &lt;br>&lt;/h2>
&lt;p>Back around 2010 I added a filter to my email settings that automatically flagged and binned any email that contained the word &amp;ldquo;Orlando.&amp;rdquo; Back then this was a remarkably effective way of detecting and ignoring spam from the numerous fake technology conferences that all seemed to advertise the city of Orlando, Florida as the location for their non-events. I suspected they all chose Orlando as it would provide the &lt;a href="http://dictionary.cambridge.org/dictionary/english/punter" target="_blank">punter&lt;/a> that little bit of extra motivation to pay and register for the conference as they simultaneously plotted how they could tag-on some holiday time at Walt Disney World. I finally had to remove the filter last year when I realised that the scammers had moved on to advertising more realistically gritty cities in their calls for submissions and that meanwhile I had managed to miss all the mail informing me of the &lt;a href="http://2016.alaannual.org/" target="_blank">ALA&amp;rsquo;s summer 2016 meeting&lt;/a> held in, you guessed it&amp;hellip; Orlando. &lt;br> &lt;br>&lt;/p>
&lt;p>Clearly we need better mechanisms to flag dubious conferences. &lt;br> &lt;br>&lt;/p>
&lt;p>Late last year Crossref&amp;rsquo;s Strategic initiatives group was approached by “CounterMock,” a group of Crossref members (including major proceedings publishers like Springer Nature, Elsevier, IEEE, ACM, IET, etc) who were actively exploring the establishment of an identifier system and registry for scholarly conferences. &lt;br> &lt;br>&lt;/p>
&lt;p>The long term goal of the group is to make it easier for publishers, researchers and other stakeholders to identify fraudulent and/or low-quality conferences. There has recently been a proliferation of conferences that seem to have been developed specifically to dupe international and early-career researchers into paying substantial conference and publication fees. Sometimes these conferences are intentionally named after long-standing and well-respected conferences. At worst these conferences are entirely fake - no meetings are held and no publications are issued. At best they produce subpar publications of questionable academic integrity. Members of the group are concerned that these &amp;ldquo;mock conferences&amp;rdquo; (Hence &amp;ldquo;COUNTERMOCK&amp;rdquo;) will: &lt;br> &lt;br>&lt;/p>
&lt;ul>
&lt;li>Waste researcher time.&lt;br> &lt;br>&lt;/li>
&lt;li>Waste publisher time.&lt;br> &lt;br>&lt;/li>
&lt;li>Undermine academic trust in conferences and conference proceedings as a trustworthy means of scholarly communication.&lt;br> &lt;br>&lt;/li>
&lt;/ul>
&lt;p>The group understands that the &amp;ldquo;evaluation of a conference quality&amp;rdquo; and the &amp;ldquo;unambiguous identification of conferences&amp;rdquo; are separate concerns (as they are with publications, contributors, etc). But they also realise that it will be hard to address the quality issue without an infrastructure for unambiguously identifying conferences and providing meaningful provenance metadata about those conferences. Moreover, having unique identifiers for conference series would enable a number of other applications. Examples include conference-level metrics, better and more structured info about forthcoming conferences on a certain topic, and more visibility of conferences in research evaluation. &lt;br> &lt;br>&lt;/p>
&lt;p>Springer Nature has built a &lt;a href="http://lod.springer.com/data/search" target="_blank">POC prototype of a conference identifier system&lt;/a> and shown it to a number of other parties. The feedback has been that there is interest in the project, but that the consensus is that it should be managed a run by a neutral industry group. They have approached us to form a working group and explore how this project can be advanced. &lt;br> &lt;br>&lt;/p>
&lt;p>This is all good. Crossref itself doesn&amp;rsquo;t make value judgements on the quality of content registered with us. &lt;a href="https://doi.org/10.64000/3gjb5-tkm69" target="_blank">Crossref DOIs are not quality marks&lt;/a>. But we do believe that unambiguous identification of research artifacts is a perquisite to building effective trust and reputation tools.&lt;br> &lt;br>&lt;/p>
&lt;p>It is possible that the issue of conference identifiers can be folded into &lt;a href="https://doi.org/10.64000/224cc-a0w76" target="_blank">the work we are doing with DataCite and ORCID on organisation identifiers&lt;/a>. For example, some have argued that organisation identifiers should include identifiers for projects or other less formal and more ephemeral corporate entities that are often included in affiliation and/or bibliographic data. It is possible to make the similar arguments in the case of conferences.&lt;br> &lt;br>&lt;/p>
&lt;p>On the other hand we have also been interested in the issue of &amp;ldquo;project identifiers.&amp;rdquo; &lt;a href="http://orcid.org/0000-0003-1419-2405" target="_blank">Martin Fenner&lt;/a> and &lt;a href="http://orcid.org/0000-0003-0902-4386" target="_blank">Tom Demeranville&lt;/a> have &lt;a href="https://doi.org/10.6084/m9.figshare.4216323.v2" target="_blank">made a strong argument&lt;/a> that &amp;lsquo;projects&amp;rsquo; can be thought of as containers for collections of project outputs, project members and project funders. Again, it seems plausible that one could make the same case for conferences.&lt;br> &lt;br>&lt;/p>
&lt;p>At the very least it is important to coordinate any work that is done on conference, project and organisation identifiers. This why we have decided to form a joint Crossref/DataCite working group to specifically explore conference and project identifiers and determine how they relate both to each other and to our already ongoing work with ORCID on organisation identifiers.
&lt;br> &lt;br>
Additionally, it is likely that the working group will discuss and explore how conference/project identifiers might be used for increasing the transparency of peer review at conferences, better attribution for programme chairs and program committee members, and how they might be incorporated into other services like &lt;a href="https://search.crossref.org" target="_blank">Crossref Metadata Search&lt;/a>, &lt;a href="https://search.datacite.org/" target="_blank">DataCite search&lt;/a>, &lt;a href="https://www.crossref.org/services/crossmark/">CrossMark&lt;/a>, etc.&lt;br> &lt;br>&lt;/p>
&lt;p>If you are interested in doing some work on this- then please indicate your interest in joining a working group by sending email to &lt;code>community@crossref.org&lt;/code> and include the text &lt;code>conference identifiers WG&lt;/code> in the subject heading.&lt;br> &lt;br>&lt;/p>
&lt;p>We will update this blog as the group convenes and makes progress.&lt;br> &lt;br>&lt;/p>
&lt;p align="center">
&lt;img src="https://www.crossref.org/images/blog/florida.png" alt= "Florida"/>
&lt;/p></description></item><item><title>Linking DOIs using HTTPs: the background to our new guidelines</title><link>https://www.crossref.org/blog/linking-dois-using-https-the-background-to-our-new-guidelines/</link><pubDate>Tue, 17 Jan 2017 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/linking-dois-using-https-the-background-to-our-new-guidelines/</guid><description>&lt;p>Recently we announced that we were making some new recommendations in our DOI display guidelines. One of them was to use the secure HTTPS protocol to link Crossref DOIs, instead of the insecure HTTP.&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">Some people asked whether the move to HTTPS might affect their ability to measure referrals (i.e. where the people who visit your site come from).&lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">TL;DR: Yes&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">Yes. If you do &lt;/span>&lt;b>not&lt;/b>&lt;span style="font-weight: 400;"> move your DOI links to HTTPS, Crossref, its members and the members of &lt;/span>&lt;a href="http://www.doi.org/registration_agencies.html">&lt;span style="font-weight: 400;">other DOI registration agencies&lt;/span>&lt;/a>&lt;span style="font-weight: 400;"> (e.g. DataCite, JLC, CNKI)  will find it increasingly difficult to accurately measure referrals. You should link DOIs using HTTPS.&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">In fact, if you do not support HTTPS on your site &lt;/span>&lt;b>now&lt;/b>&lt;span style="font-weight: 400;">, it is likely that your ability to measure referrals is already impaired. If you do not already have a plan to move your site to HTTPS, you should develop one.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">If you have already transitioned your site to HTTPS, you should follow the new guidelines and link DOIs via HTTPS as soon as possible. As it stands, you are not sending any referrer information when DOIs are clicked on and followed from your site. You should also make sure that the URLs you have registered with Crossref are HTTPS URLs, otherwise &lt;em>you&lt;/em> will not get referrer information on your site when they are followed.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">Read on if you want some grody details. We&amp;rsquo;ll try to keep it as non-technical as possible.&lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">Two protocols, one web&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">To start with your web browser supports two closely related protocols, HTTP and HTTPS.&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">The first, HTTP, is the protocol that the web started out with. It is an unencrypted protocol and it is also easy to intercept and modify. It is also very easy and inexpensive to implement.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">The second protocol, HTTPS, is a secure version of the first protocol. It is very difficult to intercept and modify. It has historically been more complex and expensive to implement. &lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">Here you might say - &amp;ldquo;Great, but HTTPS has been around for a long time. We&amp;rsquo;ve used it for sensitive transactions like authentication and credit card transactions. Why do we want to use DOI links with HTTPS?&amp;rdquo; Why are you suggesting that we should even consider moving our entire site to HTTPS? &lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">The pressure to move to HTTPS&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">The insecure HTTP protocol has become a major vector for a lot of security issues on the web. It allows user web pages to be intercepted and modified between the server and the browser. This flaw is being abused for everything from spying, to inserting unwanted advertisements into web pages, to distributing viruses, ransomware and botnets. &lt;/span>
&lt;p>&lt;span style="font-weight: 400;">As such, there has been a steady drumbeat of industry encouragement to move to the more secure HTTPS protocol for all website functions.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">We are not going to argue all the points here. Instead we will mention the major constituencies that are advocating for a move to HTTPS and provide you with some pointers. We apologise that these are all so US-centric, but a lot of the web&amp;rsquo;s global direction does seem to be presaged by US adoption trends.&lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">Google&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">It is probably easiest to start with Google, since its practices tend to focus the attention of those managing websites.&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">Back in 2014 &lt;/span>&lt;a href="https://webmasters.googleblog.com/2014/08/https-as-ranking-signal.html">&lt;span style="font-weight: 400;">Google announced that they would slowly move toward including the use of HTTPS as a ranking signal&lt;/span>&lt;/a>&lt;span style="font-weight: 400;">. In 2015 they upped the ante by announcing that &lt;/span>&lt;a href="https://security.googleblog.com/2015/12/indexing-https-pages-by-default.html">&lt;span style="font-weight: 400;">they would start indexing HTTPS versions of pages by default&lt;/span>&lt;/a>&lt;span style="font-weight: 400;">. It looks like in early 2017 they will really start to take the gloves off as they &lt;/span>&lt;a href="https://www.chromium.org/Home/chromium-security/marking-http-as-non-secure">&lt;span style="font-weight: 400;">modify their Chrome browser to flag sites that do not use HTTPS as being &lt;code>insecure&lt;/code>&lt;/span>&lt;/a>&lt;span style="font-weight: 400;">.&lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">Every top website, &lt;/span>&lt;a href="http://www.urbandictionary.com/define.php?term=evah">&lt;span style="font-weight: 400;">evah&lt;/span>&lt;/a>&lt;/h2>
&lt;span style="font-weight: 400;">It looks like Google's plan is working too. Their &lt;/span>&lt;a href="https://www.google.com/transparencyreport/https/grid/?hl=en">&lt;span style="font-weight: 400;">2016 transparency report&lt;/span>&lt;/a>&lt;span style="font-weight: 400;"> shows that most top websites have already transitioned to HTTPS and that this translates to approximately 25% of all web traffic worldwide taking place using HTTPS. Indeed, over 50% of all web pages viewed by desktop users are delivered via HTTPS.&lt;/span>
&lt;h2>&lt;span style="font-weight: 400;">Government agencies&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">The USA’s Whitehouse issued [&lt;/span>&lt;a href="https://www.whitehouse.gov/blog/2015/06/08/https-everywhere-government">&lt;span style="font-weight: 400;">a directive instructing all Federal websites to adopt HTTPS&lt;/span>&lt;/a>&lt;span style="font-weight: 400;">]. As of December 2016 &lt;/span>&lt;a href="https://pulse.cio.gov/">&lt;span style="font-weight: 400;">64%&lt;/span>&lt;/a>&lt;span style="font-weight: 400;"> of federal websites have made the transition.&lt;/span>
&lt;h2>&lt;span style="font-weight: 400;">Libraries&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">Much of the pressure to move to HTTPS is coming from the library community who have a historical tradition of protecting patron privacy and resisting efforts to censor content. The third principle of the American Library Association's code of ethics reads:&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">We protect each library user&amp;rsquo;s right to privacy and confidentiality with respect to information sought or received and resources consulted, borrowed, acquired or transmitted.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">Recently there has been &lt;/span>&lt;a href="https://www.eff.org/deeplinks/2016/12/librarians-act-now-protect-your-users-its-too-late">&lt;span style="font-weight: 400;">a major push by the Electronic Frontier Foundation&lt;/span>&lt;/a>&lt;span style="font-weight: 400;"> to get libraries to adopt a number of security and privacy practices, including the use of HTTPS by all library systems as well as those used by library vendors.&lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">What are Crossref members doing about HTTPS?&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">How big an issue is this? How many of our members have moved to HTTPS? How many plan to? Well, we looked at the URLs that are registered with Crossref and we tested them with both protocols. Eventually we will write a blog post detailing our findings - but the highlights are:&lt;/span>
&lt;ul>
&lt;li style="font-weight: 400;">&lt;span style="font-weight: 400;">Slightly fewer than half of the member domains tested only support HTTP.&lt;/span>&lt;/li>
&lt;li style="font-weight: 400;">&lt;span style="font-weight: 400;">Slightly fewer than half of the member domains tested support both HTTP and HTTPS.&lt;/span>&lt;/li>
&lt;li style="font-weight: 400;">&lt;span style="font-weight: 400;">About 370 of the member domains tested only support HTTPS.&lt;/span>&lt;/li>
&lt;/ul>
&lt;h2>&lt;span style="font-weight: 400;">The transition to HTTPS and the issue of DOI referrals&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">The HTTP referrer is a piece of information passed on by a browser that indicates the site from which the user navigated.&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">So, for example, if a user visiting site &lt;code>A&lt;/code> clicks on a link which takes them to site &lt;code>B&lt;/code>, site &lt;code>B&lt;/code> will then record in its logs that a user visited them from site A. Obviously, this is important information for understanding where your web site traffic comes from. &lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">The default rules for referrals are&lt;sup>&lt;a href="#fn1">1&lt;/a>&lt;/sup>&lt;/span>&lt;span style="font-weight: 400;">:&lt;/span>&lt;/p>
&lt;ol>
&lt;li style="font-weight: 400;">&lt;span style="font-weight: 400;">If you link between two sites with the same level of security, all referral information is retained.&lt;/span>&lt;/li>
&lt;li style="font-weight: 400;">&lt;span style="font-weight: 400;">When you follow a link from an insecure (HTTP) web site to a secure (HTTPS) site, referral data is passed on to the secure web site. &lt;/span>&lt;/li>
&lt;li style="font-weight: 400;">&lt;span style="font-weight: 400;">If you follow a link from a secure (HTTPS) web site to an insecure (HTTP) site, referral data is not passed on to the insecure web site.&lt;/span>&lt;/li>
&lt;/ol>
&lt;span style="font-weight: 400;">So let's see what the situation would look like with normal links. If we had two sites, `A` &amp;amp; `B`, the following table maps the possible combinations of protocols that can be used to link from `A` to `B`. So, for example, row #2 reads:&lt;/span>
&lt;blockquote>&lt;span style="font-weight: 400;">A user browses site A using HTTP and clicks on a HTTPS link to publisher B who hosts their site using HTTPS. &lt;/span>&lt;/blockquote>
&lt;span style="font-weight: 400;">The last column indicates if the referrer information is passed along by the browser. In the case of row #2, the answer is “yes”. The user has navigated from a less secure site to a more secure site.&lt;/span>
&lt;table>
&lt;tbody>
&lt;tr>
&lt;td>&lt;b>User views site A using&lt;/b>&lt;/td>
&lt;td>&lt;b>Site A links to site B using&lt;/b>&lt;/td>
&lt;td>&lt;b>Browser reports referrer to site B&lt;/b>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">No&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;span style="font-weight: 400;">
But this gets a little more complicated with DOIs. In this case publisher `A` links to publisher `B` through the DOI system. This means there are two parts to the link. The first `(A-&amp;gt;doi.org)` results in a redirect (A-&amp;gt;B). Again we use the last columns to indicate when referrer information is passed along to site B. Again, let’s look at row #2. It reads:&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">A user browses the site of member A using HTTP and clicks on a HTTP DOI link. The DOI system redirects the browser to member B using an HTTPS link registered with Crossref by member B. The middle column and the last column records whether Crossref and the publisher were able to see referrer information. The answer in both cases is “yes”. In the first case (A-&amp;gt;DOI) because the link was from a less secure site (HTTP on A) to a more secure site (HTTPS at DOI). The second case because the link is between two sites at the same security level (HTTP).&lt;/span>&lt;/p>
&lt;table>
&lt;tbody>
&lt;tr>
&lt;td>&lt;/td>
&lt;td>&lt;b>User views site A using&lt;/b>&lt;/td>
&lt;td>&lt;b>Site A links DOI using&lt;/b>&lt;/td>
&lt;td>&lt;b>Browser reports referrer to Crossref&lt;sup>&lt;a href="#fn2">2&lt;/a>&lt;/sup>&lt;/b>&lt;/td>
&lt;td>&lt;b>Crossref redirects to site B using&lt;sup>&lt;a href="#fn3">3&lt;/a>&lt;/sup>&lt;/b>&lt;/td>
&lt;td>&lt;b>Browser reports referrer to site B&lt;/b>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">1&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">2&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">3&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">4&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">5&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">No&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">No&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">6&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">No&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">No&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">7&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTP&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">No&lt;/span>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;span style="font-weight: 400;">8&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">HTTPS&lt;/span>&lt;/td>
&lt;td>&lt;span style="font-weight: 400;">Yes&lt;/span>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;h2>&lt;span style="font-weight: 400;">
So what does this mean?&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">Our old display guidelines recommended linking DOIs using HTTP. Rows #1, #2, #5, #6 represent the status quo.&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">About half of our members support HTTPS. A few support it exclusively and it seems, given the industry pressures mentioned above, those who support both protocols are likely doing so as a transition stage to HTTPS-only sites.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">This means that &lt;/span>&lt;b>the scenarios represented in row #5 &amp;amp; #6 are already happening&lt;/b>&lt;span style="font-weight: 400;">. The referral information for any user viewing one of our member sites using HTTPS is being lost when they click on DOIs that use the HTTP protocol. Crossref doesn&amp;rsquo;t get the referral data and neither does the member whose DOI has been clicked on.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">Of course this applies to non-member sites that link to DOIs as well. Wikipedia is the largest referrer of DOIs from outside the industry. In 2015 The Wikimedia Foundation &lt;/span>&lt;a href="https://blog.wikimedia.org/2015/06/12/securing-wikimedia-sites-with-https/">&lt;span style="font-weight: 400;">made a highly publicised transition&lt;/span>&lt;/a>&lt;span style="font-weight: 400;"> to HTTPS on all of their sites. This means that any of our members who are running HTTP sites have already lost the ability to see any referral information from Wikipedia on their own sites. However, Crossref &lt;/span>&lt;a href="http://blog.crossref.org/2016/05/https-and-wikipedia.html">&lt;span style="font-weight: 400;">worked closely with Wikimedia&lt;/span>&lt;/a>&lt;span style="font-weight: 400;"> to ensure that, at the very least, Crossref was still able to record Wikimedia referral data on behalf of our members.&lt;/span>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">A solution&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">It is largely this work with Wikimedia that has helped us to understand just how important it is for Crossref to get ahead of the curve in helping our community to transition to HTTPS.&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">As long as our members are running a combination of HTTP and HTTPS sites, there is no way for our community to avoid some disruption in the flow of referral data. And we certainly would never entertain the notion of asking our members to keep using HTTP.The best we can do is recommend a practice that will help smooth the transition to HTTPS. That is what we are doing.Our new recommendation is to move to linking DOIs using HTTPS. This is represented in rows #3, #4, #7 and #8 in the table above. &lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">This is a particularly important step for our members who have already moved to hosting their sites on HTTPS. As long as they are using HTTP DOIs on their site, they will be sending no referral traffic to Crossref, other Crossref members or other users of the DOI infrastructure. This is captured in scenarios #5 and #6.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">If our linking guidelines are followed during the industry’s transition to HTTPS, then scenario #5 and #6 will eventually be replaced with scenario #7. It is still not perfect, but at least it means that, during the transition, publishers who are still running HTTP sites will be able to get some DOI referral data via Crossref. And of course, once our members have widely transitioned to HTTPS, everything will go back to normal and they will be able to see referral data on their own sites as well (i.e.they will have moved from the state represented in row #1 to state represented in row #8.)&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">In summary, please change your sites to use HTTPS to link DOIs. They should look like this:&lt;/span>&lt;/p>
&lt;p>&lt;a href="https://doi.org/10.7554/eLife.20320">&lt;span style="font-weight: 400;">&lt;a href="https://doi.org/10.7554/eLife.20320" target="_blank">https://doi.org/10.7554/eLife.20320&lt;/a>&lt;/span>&lt;/a>&lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">FAQ&lt;/span>&lt;/h2>
&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> If I have moved my site to HTTPS, do I need to redeposit my URLs to that they use the HTTPS protocol instead?&lt;/span>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> Yes. If you want to be able to still collect referrer information on your site (scenario #8) as opposed to via Crossref (scenario #7).&lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> But can’t I avoid redepositing my URLs and get referrer data again if I simply redirect HTTP URLs to HTTPS on my own site?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> No. The browser will strip referrer information if there is any HTTP step in the redirects. Even if the redirect is done on your own site.&lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> Can I avoid having to redeposit all my URLs? Can’t Crossref just update the protocol on our existing DOIs for us?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> Contact &lt;/span>&lt;a href="mailto:support@crossref.org">&lt;span style="font-weight: 400;">&lt;a href="mailto:support@crossref.org">support@crossref.org&lt;/a>&lt;/span>&lt;/a>&lt;span style="font-weight: 400;">. We’ll see what we can do.&lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> What about all the old PDFs that are are there? They link to DOIs using HTTP. &lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> That is true. But links followed from PDFs don’t send referrer information anyway.&lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> And what about my new PDFs? Should I start linking DOIs from them using HTTPS.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> Probably. But not because of the DOI referrer problem. Simply because HTTPS is a more secure, private, and future-proof protocol.&lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> Don’t some countries block HTTPS?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> Typically countries block specific sites and/or services. We do not know of any countries that have a blanket block on the HTTPS protocol.&lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> I use a link resolver that uses OpenURL + a  cookie pusher to redirect my users to local resources. What do I need to do?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> You need to change your cookie pusher script to enable the &lt;code>Secure&lt;/code> attribute for cookies for HTTPS-linked DOIs.   &lt;/span>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> Can I use protocol-relative URLs (e.g. &lt;/span>&lt;a href="https://doi.org/10.7554/eLife.20320">&lt;span style="font-weight: 400;">//doi.org/10.7554/eLife.20320&lt;/span>&lt;/a>&lt;span style="font-weight: 400;">)?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> Protocol-relative URLs can be used in HTML HREFs to help ease the transition from HTTP to HTTPS, but use the full protocol in the text of the DOI link itself. So, for example, the following is fine:&lt;/span>&lt;span style="font-weight: 400;">
&lt;/span>&lt;span style="font-weight: 400;">
&lt;/span>&lt;/p>
&lt;pre>&lt;a href="//doi.org/10.7554/eLife.20320">https://doi.org/10.7554/eLife.20320&lt;/a>&lt;/pre>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> I hear that HTTP and HTTPS versions of URI identifiers are considered to be different identifiers. Doesn’t this mean that by moving to HTTPS we are essentially doubling the number of DOI-based identifiers out there?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> Yes. It isn’t a problem that is only being faced by DOIs. Basically all HTTP-URI based identifiers face the same issue. We will put in place appropriate same-as assertions in our metadata and HTTP headers to allow people to understand that the HTTP and HTTPS representations of the DOI point to the same thing. &lt;/span>&lt;/p>
&lt;p>&lt;em>On a personal note (@gbilder speaking- don’t blame @CrossrefOrg) - it breaks my brain that the official line is that the protocol difference means they are different identifiers. As a practical matter (a concept the W3C seems to be increasingly alienated from), it would be insane for anybody to follow this policy to the letter. You can probably be pretty safe swapping the protocols on DOIs and being sure you will get the same thing.&lt;/em>&lt;/p>
&lt;hr />
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>Q:&lt;/strong> I see that the Crossref site isn’t running on HTTPS. Are you just a bunch of hypocrites?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400;">&lt;strong>A:&lt;/strong> &lt;del>Yes. The site will be moving to HTTPS-only very soon. Then we won’t be.&lt;/del> We do now.&lt;/span>&lt;/p>
&lt;p> &lt;/p>
&lt;h2>&lt;span style="font-weight: 400;">References&lt;/span>&lt;/h2>
&lt;ol>
&lt;li id="fn1">These rules can be tweaked using meta referrer tags (https://www.w3.org/TR/referrer-policy/), but not in any way that both avoids the fundamental problems outlined here &lt;b>and&lt;/b> that preserves the security/privacy characteristics that are the very reason to implement HTTPS in the first place.&lt;/li>
&lt;li id="fn2">To be pedantic- it actually passes referrer information to the DOI proxy (https://doi.org/), which in turn is reported to Crossref.&lt;/li>
&lt;li id="fn3">To continue with the pedantry- the DOI proxy does the redirect based on the URL member B has deposited with Crossref.&lt;/li>
&lt;/ol></description></item><item><title>Included, registered, available: let the preprint linking commence.</title><link>https://www.crossref.org/blog/included-registered-available-let-the-preprint-linking-commence./</link><pubDate>Mon, 05 Dec 2016 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/included-registered-available-let-the-preprint-linking-commence./</guid><description>&lt;p>We &lt;a href="https://doi.org/10.64000/5tcfp-vf140" target="_blank">began accepting preprints&lt;/a> as a new record type last month (in a category known as “posted content” in our XML schema). Over 1,000 records have already been registered in the first few weeks since we launched the service.&lt;/p>
&lt;p>By extending our existing services to preprints, we want to help make sure that:&lt;/p>
&lt;ul>
&lt;li>links to these publications persist over time&lt;/li>
&lt;li>they are connected to the full history of the shared research&lt;/li>
&lt;li>the citation record is clear and up-to-date.&lt;/li>
&lt;/ul>
&lt;p>It’s not just collecting the metadata however, it’s also making it available so that it can be as widely used as possible. Preprint metadata is no different. As with all record types, we make the metadata available for machine and human access, across multiple interfaces (e.g. &lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md" target="_blank">REST API&lt;/a>, &lt;a href="https://support.crossref.org/hc/en-us/articles/213679866-OAI-PMH-subscriber-only" target="_blank">OAI-PMH&lt;/a>, &lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//" target="_blank">Crossref Metadata Search&lt;/a>)&lt;/p>
&lt;p>For example, you can see information on the preprint &lt;a href="https://doi.org/10.20944/preprints201608.0191.v1" target="_blank">https://doi.org/10.20944/preprints201608.0191.v1&lt;/a> in a number of ways:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://api.crossref.org/v1/works/10.20944/preprints201608.0191.v1/transform/application/vnd.crossref.unixsd&amp;#43;xml" target="_blank">https://api.crossref.org/v1/works/10.20944/preprints201608.0191.v1/transform/application/vnd.crossref.unixsd+xml&lt;/a>&lt;/li>
&lt;li>&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//?q=10.20944%2Fpreprints201608.0191.v1" target="_blank">https://web.archive.org/web/20131229210637/http://search.crossref.org//?q=10.20944%2Fpreprints201608.0191.v1&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>If you want to see all the preprint metadata deposited so far, try &lt;a href="https://api.crossref.org/v1/types/posted-content/works" target="_blank">https://api.crossref.org/v1/types/posted-content/works&lt;/a>. Over 1,000 records have already been registered in the first few weeks since we launched the service.&lt;/p>
&lt;p>Crossref members depositing preprints need to make sure they:&lt;/p>
&lt;ul>
&lt;li>Register content using the &lt;a href="https://support.crossref.org/hc/en-us/articles/213126346-Posted-content-includes-preprints-#examples" target="_blank">posted content&lt;/a> metadata schema.&lt;/li>
&lt;li>Respond to our match notifications that a manuscript / version of record (AM/VOR) has been registered and link to that within seven days.&lt;/li>
&lt;li>Label the manuscript as a preprint clearly, above the scroll on the preprint landing page, and ensure that any link to the AM/VOR is also prominently displayed above the scroll.&lt;/li>
&lt;/ul>
&lt;blockquote>
&lt;p>It’s important to clearly label the record type so we can ensure that the connections between preprints and the associated literature are clearly visible, to both humans and machines.&lt;/p>
&lt;/blockquote>
&lt;p>As with other record types, there is a registration fee to include content in the Crossref system. For preprints, it’s $0.25 fee for current preprint files and $0.15 for back-year records.&lt;/p>
&lt;p>Are you an existing Crossref member who wants to assign preprint DOIs? &lt;a href="mailto:support@crossref.org">Let&amp;rsquo;s talk about&lt;/a> getting started or migrating any existing content over to the dedicated preprint deposit schema.&lt;/p>
&lt;p>Interested in becoming a Crossref member to assign DOIs to your preprints? &lt;a href="mailto:member@crossref.org">Contact our membership specialist&lt;/a> so we can answer any questions and get you set up as a member.&lt;/p></description></item><item><title>Using the Crossref REST API. Part 3 (with SHARE)</title><link>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-3-with-share/</link><pubDate>Thu, 01 Dec 2016 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-rest-api.-part-3-with-share/</guid><description>&lt;p>&lt;span >As a follow-up to our &lt;a href="https://www.crossref.org/blog/crossref-metadata-api-part-1-authorea/">blog posts on the Crossref REST API&lt;/a>&lt;/span>&lt;span > we talked to SHARE about the work they’re doing, and how they’re employing the Crossref metadata as a piece of the puzzle.  Cynthia Hudson-Vitale from &lt;a href="http://share-research.org" target="_blank">SHARE&lt;/a> explains in more detail…&lt;/span>&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/09/SHARE_logo-300x240.jpg" alt="share logo" width="350px" />
&lt;/div>
&lt;p>&lt;strong>Cynthia Hudson-Vitale, digital data librarian in Research Data and GIS Services at Washington University in St. Louis Libraries and visiting program office for SHARE&lt;/strong>&lt;/p>
&lt;p>&lt;span >SHARE (&lt;/span>&lt;a href="http://share-research.org" target="_blank">&lt;span >http://share-research.org&lt;/span>&lt;/a>&lt;span >) is building a free, open, data set about research and scholarly activities across their life cycle. It is a higher education initiative whose mission is to maximize research impact by making research widely accessible, discoverable, and reusable. SHARE’s data set is free, openly licensed, and built with open source technology developed at the Center for Open Science (COS). Launched in beta in April 2015 the data set has grown to more than 6 million records from 100+ providers, including Crossref, Social Science Research Network (SSRN), DataONE, 50+ library institutional repositories, and more.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>How is the Crossref REST API used within SHARE?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >SHARE currently harvests metadata from Crossref using the Crossref application programming interface (API). We pull such metadata values as journal title, author, DOI, journal name, and publisher, to name just a few. This metadata is then fed into our data processing pipeline, normalized, and aggregated into the full data set.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What are the future plans for SHARE?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >Phase II of SHARE, launched in late 2015, focuses on adding metadata providers, enhancing the metadata, and making connections and links between the metadata records. These links will show the entire life cycle of research and scholarship—connecting a data management plan, grant award information, data deposits, analytic/software code, pre-publications, final manuscripts, and more.&lt;/span>&lt;/p>
&lt;p>&lt;span >To move these plans forward, SHARE is applying machine-learning and automation techniques and working with the community to verify metadata enhancements and curate the metadata. Current technology work focuses on imputing subject domain keywords and object types into the SHARE data set using learning models and heuristics. Data models and schemas are in development to connect the research lifecycle, connect multiple instances of an object to a single entity, and capture metadata provenance.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What else would SHARE like to see in Crossref metadata?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >We would love to see rights-declaration metadata elements and article references/citations included in the metadata about digital objects. The rights-declaration information is invaluable for individuals who want to know what category the object is in (public domain, copyrighted, etc.), what constraints or permission requirements exist, contact information, and more. Additionally, networks of research can be discovered and meta-scholarship facilitated by making article reference lists machine-readable and openly available. &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What’s next?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >Does this give you any ideas? Feel free to get in touch with questions or &lt;/span>&lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md" target="_blank">&lt;span >take the API for a spin&lt;/span>&lt;/a> &lt;span >yourself and let us know what you can do with it! &lt;/span>&lt;/p>
&lt;p> &lt;/p></description></item><item><title>Call for participation: Membership &amp; Fees Committee</title><link>https://www.crossref.org/blog/call-for-participation-membership-fees-committee/</link><pubDate>Tue, 29 Nov 2016 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/call-for-participation-membership-fees-committee/</guid><description>&lt;p>Crossref was founded to enable collaboration between publishers.  As our &lt;a href="https://doi.org/10.64000/vwgzw-6zk15" target="_blank">membership has grown and diversified over recent years&lt;/a>, it’s becoming even more vital that we take input from a representative cross-section of the membership. This is especially important when considering how fees and policies will affect our diverse members in different ways.&lt;/p>
&lt;h2 id="about-the-mf-committee">About the M&amp;amp;F Committee&lt;/h2>
&lt;p>The Membership &amp;amp; Fees Committee (M&amp;amp;F Committee) was established in 2001 and plays an important role in Crossref’s governance.  Made up of 10-12 organisations of both board members and regular members, the group makes recommendations to the board about fees and policies for all of our services. They regularly review existing fees to discuss if any changes are needed. They also review new services while they are being developed, to assess if fees should be charged and if so, what those fees should be. For example, the committee recently made recommendations to the board about the fees for a new service called Event Data that we’ll launch soon, and the Content Registration fees for preprints.  In addition, the board can also ask the committee to address specific issues about policies and services. Increasingly, the committee works with the outreach team to include research and survey insights.&lt;/p>
&lt;h2 id="about-committee-participation">About committee participation&lt;/h2>
&lt;p>The M&amp;amp;F Committee meets via one-hour conference calls about six times a year, although this can vary depending on what issues the committee is considering. Often proposals are developed by staff and then reviewed and discussed by the committee - so there is reading to do in preparation for the calls.&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/11/header-chairs.jpg">&lt;img class="alignright wp-image-2393 size-large" src="https://www.crossref.org/wp/blog/uploads/2016/11/header-chairs-1024x509.jpg" alt="Join a Crossref committee" width="840" height="418" srcset="https://www.crossref.org/wp/blog/uploads/2016/11/header-chairs-1024x509.jpg 1024w, https://www.crossref.org/wp/blog/uploads/2016/11/header-chairs-300x149.jpg 300w, https://www.crossref.org/wp/blog/uploads/2016/11/header-chairs-768x382.jpg 768w, https://www.crossref.org/wp/blog/uploads/2016/11/header-chairs-1200x596.jpg 1200w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 1362px) 62vw, 840px" />&lt;/a>&lt;/p>
&lt;p>This is very important work and in order to ensure that the committee is broadly representative of Crossref’s diverse membership we are seeking expressions of interest from members who would like to serve on the M&amp;amp;F Committee for 2017. Appointments are for one year and members can serve multiple terms.&lt;/p>
&lt;h2 id="about-you">About you&lt;/h2>
&lt;p>In view of our commitment to be representative of the membership we are refreshing the committee and want to have engaged and interested people from a diverse set of members join.&lt;/p>
&lt;p>If you are interested in joining the committee and helping Crossref fulfil its mission please email &lt;a href="mailto:feedback@crossref.org">&lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>&lt;/a> with your name, title, organisation and a short statement about why you want to serve on the committee by December 19th, 2016.      &lt;/p>
&lt;p>&lt;strong>Scott Delman, Director of Group Publishing, ACM is the current Chair of the committee and will review the expressions of interest with me, Ed Pentz, Executive Director, to form the committee.&lt;/strong>&lt;/p>
&lt;p>Thanks for your interest.&lt;/p></description></item><item><title>A look back at LIVE16</title><link>https://www.crossref.org/blog/a-look-back-at-live16/</link><pubDate>Thu, 24 Nov 2016 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/a-look-back-at-live16/</guid><description>&lt;p>Crossref LIVE16 opened with a Mashup Day on 1st November 2016 in London. Attendees from the scholarly communications world met to chat with Crossref team members in an open house atmosphere. The Crossref team put their latest projects on display and were met with questions, comments, and ideas from members and other metadata folks. Here’s what it looked like — you may recognize a few familiar faces. &lt;/p>
&lt;h3 id="crossref-live16-in-london">Crossref LIVE16 in London&lt;/h3>
&lt;table>
&lt;thead>
&lt;tr>
&lt;th>&lt;/th>
&lt;th>&lt;/th>
&lt;th>&lt;/th>
&lt;th>&lt;/th>
&lt;th>&lt;/th>
&lt;/tr>
&lt;/thead>
&lt;tbody>
&lt;tr>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-1-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-2-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-3-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-4-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-5-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-6-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-7-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-8-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-9-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-10-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-11-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-12-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-13-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-14-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;td>&lt;img src="https://www.crossref.org/images/blog/LIVE16-15-72-.jpg" alt="image of people at the event" width="175px"/>&lt;/td>
&lt;/tr>
&lt;/tbody>
&lt;/table>
&lt;br>
LIVE16 continued with the Conference Day on 2nd November, a plenary session with invited speakers and presentations by the Crossref team. Here are the presentations, in chronological order.
&lt;ul>
&lt;li>
&lt;p>Dario Taraborelli speaks on “Wikipedia’s role in the dissemination of scholarship” &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Ian Calvert speaks on: “You don’t have metadata (and how to befriend a data scientist)” &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Ed Pentz speaks on “Crossref’s outlook &amp;amp; key priorities”&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Ginny Hendricks speaks on “A vision for membership”&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Geoffrey Bilder speaks on “The case of the missing leg” &lt;/p>
&lt;/li>
&lt;li>
&lt;p>Lisa Hart Martin speaks on “The meaning of governance”&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Jennifer Lin speaks on “New territories in the Scholarly Research Map”&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Chuck Koscher speaks on “Relationships and other notable things”&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Carly Strasser speaks on “Funders and Publishers as Agents of Change” &lt;/p>
&lt;/li>
&lt;li>
&lt;p>April Hathcock speaks on “Opening Up the Margins”&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Your survey feedback&lt;/strong>&lt;/p>
&lt;p>We’re serious about making Crossref LIVE a useful and welcoming annual event for the Crossref membership as well as members of the wider scholarly communications community. That’s why we appreciate responses from the attendees who answered our survey. Here’s what we have learned from your feedback:&lt;/p>
&lt;p>&lt;em>Content&lt;/em>&lt;/p>
&lt;ul>
&lt;li>You want speakers to tell you something new, even if you don’t agree with their points of view&lt;/li>
&lt;li>Your favorite speakers were those who inspired you&lt;/li>
&lt;li>You prefer an unscripted presentation style that makes complex topics accessible to all&lt;/li>
&lt;li>You’re not as interested in the mechanics of Crossref’s annual election as we are&lt;/li>
&lt;/ul>
&lt;p>&lt;em>Format&lt;/em>&lt;/p>
&lt;ul>
&lt;li>&lt;span >You enjoyed the diversity of presenters and would like even more external speakers &lt;/span>&lt;/li>
&lt;li>&lt;span >You want more opportunity to ask us technical questions on the Mashup Day  &lt;/span>&lt;/li>
&lt;li>&lt;span >You want to see panel discussions in addition to individual presentations on the Conference Day&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;span >Those who attended the Conference Day only wished they had also attended the Mashup Day&lt;/span> &lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;em>Atmosphere&lt;/em>&lt;/p>
&lt;ul>
&lt;li>&lt;span >You liked the casual atmosphere but wanted more seating and more dessert.  So noted!&lt;/li>
&lt;/ul>
&lt;p>LIVE17 will be held next November 14-15 in Asia. Until then, we hope you’ll have the chance to see us at the regional Crossref LIVE events we are planning around the world throughout the year. Our next local event is Crossref LIVE in Brazil, held 13 December in Campinas and 16 December in Sao Paulo. &lt;/p></description></item><item><title>URLs and DOIs: a complicated relationship</title><link>https://www.crossref.org/blog/urls-and-dois-a-complicated-relationship/</link><pubDate>Fri, 04 Nov 2016 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/urls-and-dois-a-complicated-relationship/</guid><description>&lt;p>As the linking hub for scholarly content, it’s our job to tame URLs and put in their place something better. Why? Most URLs suffer from link rot and can be created, deleted or changed at any time. And that’s a problem if you’re trying to cite them.&lt;/p>
&lt;p>Thus the Crossref DOI was born: an Identifier which is Persistent, which means that it’s designed to live forever (or, as Geoff Bilder rather more &lt;a href="https://www.crossref.org/blog/january-2015-doi-outage-followup-report/">prosaically puts it&lt;/a>, as long as we do), and also Resolvable, which means that you can click on it. A DOI &lt;strong>is&lt;/strong> a URL, but it’s imbued with special properties. I say special, not magical, because all of the things that make Crossref DOIs what they are, are obtained through agreements and common standards rather than any kind of magic.&lt;/p>
&lt;p>As part of the development of Crossref Event Data I’ve been doing some research about the relationship between DOIs and URLs. It’s a problem we have to solve in order to make Event Data work, but it’s a much broader and more interesting story, and the results have wide applicability. I’ll be telling this story at &lt;a href="http://pidapalooza.org/">PIDapalooza&lt;/a>. If you’re interested in Persistent Identifiers you should go and &lt;a href="http://pidapalooza.org/">registration is open&lt;/a>, though hurry, as it’s next week and in Rejkjavik, Iceland!&lt;/p>
&lt;p>This is also a story in progress. As I write not all of the data is in, and we can be certain that it will evolve in ways we have no idea about. It’s also quite long but I’ll do my best to disqualify it from the bedtime reading list.&lt;/p>
&lt;h2 id="full-circle">Full circle&lt;/h2>
&lt;p>Crossref was established just over fifteen years ago with the purpose of forming the linking hub between publishers. Our job was — and still is — to register content for publishers and then continue to work with them to ensure their DOIs always point to the right location of the content. To do this we need to do one main thing: send people in the right direction when they click on a DOI, and know which direction to point them in.&lt;/p>
&lt;p>Today, linking is still an important part of what Crossref does, but we do a huge amount more. One of the new things we’re working on is Crossref Event Data. It’s a service for tracking how and where people use scholarly content (such as articles) across the web and social media. Early research suggested that if we limited ourselves to just looking for DOIs we wouldn’t find much. Instead we broadened our aims a little: rather than looking for mentions of registered content exclusively via their DOIs, we look for them via the most suitable mechanism. In most cases this means the actual URL of the Item. So we have come full circle: we started linking DOIs to URLs. Now we’re trying to link URLs back to DOIs.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/1.png" alt="urls-back-to-dois" class="img-responsive"/>
&lt;p>Which URL are we talking about here? The Crossref Guidelines say:&lt;/p>
&lt;blockquote>
&lt;p>DOI-routed reference links enabled by Crossref must resolve to a response page containing no less than complete bibliographic information about the target content …&lt;/p>
&lt;/blockquote>
&lt;p >
&lt;a href="http://www.crossref.org/02publishers/59pub_rules.html">http://www.crossref.org/02publishers/59pub_rules.html&lt;/a>
&lt;/p>
&lt;p>This is what’s referred to as the Landing Page. Every Landing Page has a URL. Usually when you want to read information about an Article, it’s the Landing Page that you’re looking at. I should also say at this point that when I say Article I mean any item of Crossref Registered Content with a DOI. So the same applies to books, chapters, conference proceedings etc. But as most items are Articles, I’ll stick with that for now.&lt;/p>
&lt;p>I’m going to make some assumptions. Unfortunately, and I don’t want to spoil the surprise here, they all turn out to be false. They’re all reasonable assumptions, though, and you would be forgiven for thinking, or at least wishing, that they were true.&lt;/p>
&lt;p>So suspend your disbelief and follow me down the rabbit-hole…&lt;/p>
&lt;h2 id="assumption-1-a-doi-points-directly-to-a-landing-page-url">Assumption 1: A DOI points directly to a Landing Page URL&lt;/h2>
&lt;p>When you click on a DOI you are taken to the Article Landing Page. It seems like a perfectly valid assumption to think that you are taken directly there.&lt;/p>
&lt;p>The DOI system is essentially a big lookup table. In the first column is the DOI and in the second column is the URL. Publishers request that we register each item’s DOI and supply us with the URL it should point to. We work with CNRI and the International DOI Foundation to keep the system running and it means that when you, the reader at home, click on a DOI, you end up on the article’s Landing Page.&lt;/p>
&lt;p>It would be very convenient if our assumption were true. If we wanted to turn a URL back into an article page, we could just swap the two columns and find the DOI by looking up the URL.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/2.png" alt="flip DOIs" class="img-responsive" />
&lt;p>It turns out that it’s not quite so simple.&lt;/p>
&lt;p>The Landing Page is under control of the publisher, as is the URL that they supply us with. They don’t need to supply us with the final landing page URL, only with one that &lt;em>&lt;strong>leads&lt;/strong>&lt;/em> to the landing page.&lt;/p>
&lt;h2 id="http-redirects">HTTP redirects&lt;/h2>
&lt;p>When you request a URL, either by typing it into your browser or by clicking on a link, your browser contacts the server and gets a reply. That reply can be “200 OK, here’s your page”, “303, look over there” or the dreaded “404, I can’t find it”. Other HTTP response codes are available, including well-known classics such as 201, 500 and 418.&lt;/p>
&lt;p>If it’s a 303, your browser will follow the redirect URL. The response that comes back from that redirect could be another 303. You could end up following a whole chain of redirects. You wouldn’t notice anything, except having to wait an extra few milliseconds.&lt;/p>
&lt;h2 id="extraordinary-diversity">Extraordinary diversity&lt;/h2>
&lt;p>Crossref was created by a group of publishers who needed a way to link between articles. It was an ambitious goal: create a central system with which any publisher can integrate their own systems; one that allows linking to any article no matter who published it. Today we have over 5,000 members and counting, all contributing to our metadata engine. And up to 2 million DOIs are resolved every day, by all kinds of people and systems. Our wide range of members means a wide range of systems with a wide range of designs.&lt;/p>
&lt;p>This brings an extraordinary diversity of behavior. If we want to make observations about DOIs we can’t just take a random sample of the over 80 million. Instead, we need to take a sample of DOIs per Publisher System. Even taking a sample per publisher might not do the job because some publishers run a variety of systems.&lt;/p>
&lt;h2 id="experiment-1-does-crossref-know-all-landing-pages">Experiment 1: Does Crossref know all Landing Pages?&lt;/h2>
&lt;figure>&lt;img src="https://upload.wikimedia.org/wikipedia/commons/thumb/e/e7/Atomic_Laboratory_Experiment_on_Atomic_Materials_-_GPN-2000-000663.jpg/256px-Atomic_Laboratory_Experiment_on_Atomic_Materials_-_GPN-2000-000663.jpg"
alt="Atomic Laboratory Experiment on Atomic Materials - GPN-2000-000663" width="40%">&lt;figcaption>
&lt;h4>By NASA / Paul Riedel (Great Images in NASA: Home - info - pic) [Public domain], via Wikimedia Commons&lt;/h4>
&lt;/figcaption>
&lt;/figure>
&lt;p>&lt;strong>Hypothesis:&lt;/strong> Crossref knows the Landing Page URL for all DOIs.&lt;/p>
&lt;p>For a sample of Items, we can follow the DOI link all the way through to the Landing Page, following any redirects, then compare the final Landing Page URL to the one that Crossref knows about. If there are extra redirects, that means that the one we have on file isn’t the final one.&lt;/p>
&lt;p>We need to tighten up the terminology at this stage:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>DOI URL&lt;/strong> - The full DOI, e.g. &lt;a href="https://doi.org/10.5555/12345678">&lt;a href="https://doi.org/10.5555/12345678" target="_blank">https://doi.org/10.5555/12345678&lt;/a>&lt;/a> .&lt;/li>
&lt;li>&lt;strong>Resource URL&lt;/strong> - The URL that Crossref has on file (stored in our system). This is where the browser is initially redirected.&lt;/li>
&lt;li>&lt;strong>Destination URL&lt;/strong> - The URL that we end up at if we follow all the redirects.&lt;/li>
&lt;li>&lt;strong>Article Landing Page&lt;/strong> - The page that represents the item. If everything works, this should be the same as the Destination URL.&lt;/li>
&lt;/ul>
&lt;p>The reason we’re talking about the Destination URL as distinct from the Article Landing Page when they should be the same thing will become clear later. Consider yourself foreshadowed.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/3-2.png" alt="redirects" class="img-responsive" />
&lt;p>So let’s re-word our hypothesis:&lt;/p>
&lt;p>&lt;strong>Hypothesis:&lt;/strong> The Destination URL is the same as the Resource URL.&lt;/p>
&lt;p>&lt;strong>Method:&lt;/strong> A sample of DOIs was taken (most items updated in 2016, all from 2009 or earlier). The Resource URL was obtained for all of them. The DOIs were split by the domain name of the Resource URL (to give a good coverage of all Publisher systems). A sample of Resource URLs was followed per domain, at least 200 (or fewer if that exceeds the number of DOIs available). Where there were HTTP redirects they were followed.&lt;/p>
&lt;p>&lt;strong>Observations:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Number of Items sampled Destination URL: 253,381&lt;/li>
&lt;li>Number where Resource URL = Destination URL: 46,995 or 19.96%&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Conclusion:&lt;/strong> Not all Resource URLs are the same as the Destination URL by a long shot. Crossref does not automatically know every landing page URL.&lt;/p>
&lt;p>Now we know the truth about our first assumption: DOIs don’t point directly to Landing Pages. If we want to reverse Landing Pages back into DOIs, we’re going to need to go a bit deeper…&lt;/p>
&lt;h2 id="interlude">Interlude&lt;/h2>
&lt;p>But first, an interlude with some information about publishers, owners, and systems, because now seems like the right time to do it.&lt;/p>
&lt;h2 id="assumption-2-you-can-tell-the-publisher-of-a-doi-by-looking-at-its-prefix">Assumption 2: You can tell the publisher of a DOI by looking at its prefix&lt;/h2>
&lt;p>This is a real one one that people believe. Again, it’s entirely understandable. People look at a DOI like &lt;a href="https://doi.org/10.1371/journal.pone.0136117.g001">&lt;a href="https://doi.org/10.1371/journal.pone.0136117.g001" target="_blank">https://doi.org/10.1371/journal.pone.0136117.g001&lt;/a>&lt;/a> , which takes them to PLoS and naturally assume that another DOI like &lt;a href="https://doi.org/10.1371/journal.pone.0136053.t003">&lt;a href="https://doi.org/10.1371/journal.pone.0136053.t003" target="_blank">https://doi.org/10.1371/journal.pone.0136053.t003&lt;/a>&lt;/a> — because it has the same prefix of 10.1371 — is also for a PLoS item.&lt;/p>
&lt;p>Whilst this turns out to be true most of the time, it’s not true for all Items, which makes it a dangerous assumption to make.&lt;/p>
&lt;p>It is true that every publisher is given a prefix. They can then register DOIs with this prefix. It is also true that Items can be transferred between publishers. Because DOIs are persistent, the prefix in the DOI doesn’t change. So you might find a DOI that belongs to a publisher that has an unexpected prefix. Publishers can also be bought and sold, merged and split, which means that whilst most publishers have a single prefix, some, like Elsevier, have several. Take the case of Elsevier, who has 26 at the time of writing (you can see this in &lt;a href="https://api.crossref.org/v1/members/78">Elsevier’s entry in the Crossref Metadata API&lt;/a>).&lt;/p>
&lt;p>Every Item has an ‘owner prefix’ in addition to the prefix in the DOI. The owner prefix is the same as the DOI prefix when the Item is created, but over time, as articles are transferred, that can change to indicate that it is owned by another publisher.&lt;/p>
&lt;p>Every Item has a DOI, and every DOI has a prefix. But every Item also has an Owner Prefix (you can check this in the Metadata API in the ‘prefix’ field).&lt;/p>
&lt;p>So Assumption 2 has been laid to rest. The only thing you can tell from looking at a DOI is that it is, in fact, a DOI (you can tell by the “10.” index code).&lt;/p>
&lt;p>Why do we care about identifying publishers anyway?&lt;/p>
&lt;h2 id="a-fair-test">A Fair Test&lt;/h2>
&lt;p>We fundamentally want to conduct a fair test. The reason we can’t just take a random sample from the set of all DOIs is that there are lots of members who all do things slightly differently. Therefore we need to take a sample per publisher ‘system’. The word ‘system’ is a bit fuzzy, but my assumption is that two articles in the same system will behave the same way so we can treat them the same.&lt;/p>
&lt;p>We also know that each Crossref member may be running more than one system, or a mixture. Therefore just looking at the owner of a DOI may not give accurate results if we want to conduct a survey of all the systems out there.&lt;/p>
&lt;p>There’s no perfect answer, but the approach I’m taking is to look at the domain name of the Resource URL. We often find lots of subdomains for the same publisher, for example, “psw.sagepub.com”, “pol.sagepub.com”, “psx.sagepub.com” and “bpi.sagepub.com”. It’s clear that these are all operated by Sage, but they might or might not all be running on different ‘systems’.&lt;/p>
&lt;p>Therefore I’m splitting DOIs up into groups based on the domain of their Resource URL. It may turn out that some publishers use a single system running on many domains, or it may turn out that some publishers use a different system for each domain they use. The key point is to find a sampling technique that broadly works, and that allows us to explore and differentiate, as keenly as possible, the variety of systems and behaviours.&lt;/p>
&lt;h2 id="why-all-the-redirects">Why all the redirects?&lt;/h2>
&lt;p>Curious minds might at this stage be wondering about all these extra redirects. Surely it’s extra stuff for the publisher to maintain. Why don’t they just point the DOI directly to the landing page?&lt;/p>
&lt;p>The answer must be prefaced by repeating that there is a huge number of publishers, running a variety of systems, so we’ll never be able to completely answer that. But some humble suggestions:&lt;/p>
&lt;ol>
&lt;li>They might want to be able to change the URLs of the Landing Pages. It may be easier to update their internal systems than send the update to Crossref, especially in bulk.&lt;/li>
&lt;li>Different parts of their technology stack may be owned by different parts of the company, or outsourced. It’s easier to define internal boundaries than to co-ordinate business units and cross an external one.&lt;/li>
&lt;li>A publisher may run a mix of different technology. As part of their systems integration process, they set up a redirect server to make everything work together.&lt;/li>
&lt;li>A publisher assigns DOIs to articles but also has their own internal IDs. They maintain their own DOI-to-internal-ID lookup service.&lt;/li>
&lt;/ol>
&lt;h3 id="internal-doi-resolvers">Internal DOI resolvers&lt;/h3>
&lt;p>That last point is an interesting one. The DOI system is the canonical “DOI-to-URL resolver”. That doesn’t prevent publishers from running their own. Indeed, many do.&lt;/p>
&lt;p>To take a real example of &lt;a href="https://plos.org">PLoS&lt;/a>, an Open Access publisher who registers lots of content with Crossref. To follow one of their DOIs we go on the following journey of redirects:&lt;/p>
&lt;ul>
&lt;li>http://dx.doi.org/10.1371/journal.pone.0164910&lt;/li>
&lt;li>http://dx.plos.org/10.1371/journal.pone.0164910&lt;/li>
&lt;li>http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0164910&lt;/li>
&lt;li>http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0164910&lt;/li>
&lt;/ul>
&lt;p>Given that the last step uses a DOI, this suggests that they use the DOI as an internal identifier. All those redirects were for some purpose, but they weren’t mapping a DOI to an internal ID. This is therefore &lt;strong>not&lt;/strong> an internal DOI resolver.&lt;/p>
&lt;p>Another example from JAMA Surgery:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://doi.org/10.1001/archsurg.142.7.595" target="_blank">http://doi.org/10.1001/archsurg.142.7.595&lt;/a>&lt;/li>
&lt;li>&lt;a href="http://archsurg.jamanetwork.com/article.aspx?doi=10.1001/archsurg.142.7.595" target="_blank">http://archsurg.jamanetwork.com/article.aspx?doi=10.1001/archsurg.142.7.595&lt;/a>&lt;/li>
&lt;li>&lt;a href="http://jamanetwork.com/journals/jamasurgery/fullarticle/487551" target="_blank">http://jamanetwork.com/journals/jamasurgery/fullarticle/487551&lt;/a>&lt;/li>
&lt;li>&lt;a href="http://jamanetwork.com/journals/jamasurgery/article-abstract/487551" target="_blank">http://jamanetwork.com/journals/jamasurgery/article-abstract/487551&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>In this case we see a mapping from the DOI 10.1001/archsurg.142.7.595 to the ID 487551.&lt;/p>
&lt;p>Can we define a heuristic for this pattern? Yes, but not a perfect one. My test is this:&lt;/p>
&lt;ul>
&lt;li>Does the resource URL contain the DOI?&lt;/li>
&lt;li>If so, does it redirect to a different destination URL?&lt;/li>
&lt;li>If so, does the destination URL not contain the DOI?&lt;/li>
&lt;/ul>
&lt;p>The last step is important, because we can’t really say the publisher is running a DOI resolver if they use the DOI all the way through.&lt;/p>
&lt;p>It’s not perfect and no doubt has false negatives. But we’re just trying to find out whether &lt;strong>some&lt;/strong> publishers run their own DOI resolver systems.&lt;/p>
&lt;h2 id="experiment-2-determine-how-widespread-use-of-internal-doi-resolvers-is">Experiment 2: Determine how widespread use of internal DOI resolvers is:&lt;/h2>
&lt;p>&lt;a title="By MacVicar, N. - National Institutes of Health [Public domain], via Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File%3AMarshall_Nirenberg_performing_experiment.jpg">&lt;img src="https://upload.wikimedia.org/wikipedia/commons/thumb/1/10/Marshall_Nirenberg_performing_experiment.jpg/256px-Marshall_Nirenberg_performing_experiment.jpg" alt="Marshall Nirenberg performing experiment" class="img-responsive" />&lt;/a>&lt;/p>
&lt;p>&lt;strong>Hypothesis:&lt;/strong> Some publishers run their own DOI resolvers.&lt;/p>
&lt;p>&lt;strong>Method:&lt;/strong> A number of Destination URLs were sampled per Resource URL Domain. If the Resource URL contains the DOI but the Destination URL doesn’t, that’s marked as a Publisher DOI resolver redirect.&lt;/p>
&lt;p>&lt;strong>Observations:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Number of Items sampled with Resource URL and Destination URL: 253,381&lt;/li>
&lt;li>Number of Items that appear to be DOI resolvers: 166,352 = 65.6%&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Conclusions: Some publishers run their own DOI resolvers.&lt;/strong>&lt;/p>
&lt;p>This isn’t of much practical use, but it’s interesting to know, and hints at the way the Crossref system and DOIs are integrated with Publishers’ systems. Now that we’ve got a little insight into the reasons that publishers might run their own DOI resolvers, we can resume our journey of assumptions.&lt;/p>
&lt;h2 id="assumption-3-we-can-find-the-landing-page-for-every-doi">Assumption 3: We can find the Landing Page for Every DOI&lt;/h2>
&lt;p>Now we know that we can’t just use the lookup table in reverse, but have to follow the links all the way to their destination. Does this approach actually work?&lt;/p>
&lt;p>This is a pretty big question and we need to be clear about what we mean by ‘every’ DOI. The set of DOIs I’m using (although I’m using a subset) is “all DOIs in our Metadata API that are found in doi.org”.&lt;/p>
&lt;p>What is a DOI? Geoff Bilder went over it in the &lt;a href="https://www.crossref.org/blog/doi-like-strings-and-fake-dois/">DOI-like-strings blog post&lt;/a> earlier this year. The definition I’m working to here is:&lt;/p>
&lt;blockquote>
&lt;p>A DOI is an identifier for an item of content registered in the DOI system.&lt;/p>
&lt;/blockquote>
&lt;p>That is, if you resolve the DOI on &lt;a href="https://doi.org/" target="_blank">https://doi.org/&lt;/a> and it’s recognised, that counts as a DOI. I’m working from the set of DOIs found in the Crossref system as I’m primarily concerned with Crossref DOIs. However, we collaborate closely with DataCite.&lt;/p>
&lt;p>Back to our assumption: “we can find the Landing Page for every DOI”. The answer is that we can, most of the time. But because Crossref Event Data has to work as well as possible, and therefore work with as many DOIs as possible, we have to scour all the nooks and crannies.&lt;/p>
&lt;h3 id="assumption-4-every-doi-points-somewhere-unique">Assumption 4: Every DOI points somewhere unique&lt;/h3>
&lt;p>Stop me when you find the deliberate mistake:&lt;/p>
&lt;ol>
&lt;li>Every Item corresponds to a different thing&lt;/li>
&lt;li>Every Item has a single DOI&lt;/li>
&lt;li>Every DOI is different&lt;/li>
&lt;li>Every DOI points to a landing page&lt;/li>
&lt;li>Therefore every DOI points to a different landing page&lt;/li>
&lt;/ol>
&lt;p>Two things immediately suggest themselves:&lt;/p>
&lt;p>&lt;em>“Every item has a single DOI”&lt;/em> should be true, but it isn’t. We find that sometimes two DOIs are assigned to the same item. This can happen when publications change hands between publishers, or when mistakes are made, or for a variety of other reasons. We also find that in some cases Publishers registered a DOI for the metadata and one for the article abstract. The two DOIs point to the same place. In some cases where there were two DOIs registered for the same thing we create an Alias.&lt;/p>
&lt;p>When we alias a DOI we simply say “this DOI should actually point to this one”. Both DOIs still exist, and both still point to the ‘correct’ thing, it’s just that they both point to the same place. If we have two DOIs pointing to the same place, then there isn’t a one-to-one mapping, and Assumption 4 is incorrect.&lt;/p>
&lt;h2 id="experiment-4-aliased-dois">Experiment 4: Aliased DOIs&lt;/h2>
&lt;p>&lt;a title="By The Air Force Research Laboratory’s Directed Energy Directorate [Public domain], via Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File%3ALasertests.jpg">&lt;img src="https://upload.wikimedia.org/wikipedia/commons/thumb/4/4c/Lasertests.jpg/256px-Lasertests.jpg" alt="Lasertests" class="img-responsive" />&lt;/a>&lt;/p>
&lt;p>&lt;strong>Hypothesis:&lt;/strong> There isn’t a one-to-one mapping between DOIs and URLs because some DOIs are aliased to others.&lt;/p>
&lt;p>&lt;strong>Method:&lt;/strong> We collected a sample of Resource URLs from the DOI API. We count how many DOIs are classified as Aliases in the DOI system.&lt;/p>
&lt;p>&lt;strong>Observations&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>From a sample of 11,227,458 DOIs&lt;/li>
&lt;li>14,566 are aliased to others, or 0.129%&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Conclusion:&lt;/strong> There aren’t many aliases. But there are some, and we should be aware of them.&lt;/p>
&lt;h2 id="experiment-5-duplicate-resource-urls">Experiment 5: Duplicate Resource URLs&lt;/h2>
&lt;p>&lt;a title="By Ms. Barbara Hertz (Ms. Barbara Hertz) [Public domain], via Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File%3AHertz-experiment.jpg">&lt;img src="https://upload.wikimedia.org/wikipedia/commons/8/88/Hertz-experiment.jpg" alt="Hertz-experiment" class="img-responsive" />&lt;/a>&lt;/p>
&lt;p>&lt;strong>Hypothesis&lt;/strong>: There isn’t a one-to-one mapping between DOIs and URLs because some DOIs have duplicate Resource URLs.&lt;/p>
&lt;p>&lt;strong>Method&lt;/strong>: A sample of Resource URLs was collected from the DOI API. We counted how many DOIs have Resource URLs that aren’t unique. We subtract the number of deleted DOIs because all deleted DOIs have the same resource URL.&lt;/p>
&lt;p>&lt;strong>Observations&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>From a sample size of 11,227,458&lt;/li>
&lt;li>a total of 112,195 have duplicate resource URLs, or 0.99%&lt;/li>
&lt;li>of these duplicates, 77,896 have the ‘deleted’ URL&lt;/li>
&lt;li>leaving 34,229, or 0.30% having non-unique Resource URLs&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Conclusion&lt;/strong>: A small number of DOIs have duplicate Resource URLs, even if we exclude those that have been deleted, which means that not every DOI can have a unique URL.&lt;/p>
&lt;h2 id="assumption-5-the-landing-page-is-the-same-as-the-destination-page">Assumption 5: The Landing Page is the same as the Destination Page.&lt;/h2>
&lt;p>HTTP has a very neat system for doing redirects. If it were that simple, then we could easily look up every Destination page and confidently say that it was the Landing Page. Not so.&lt;/p>
&lt;h2 id="cookies">Cookies&lt;/h2>
&lt;p>Web browsers aren’t the only tools that use HTTP. Most programming languages have HTTP capabilities built in.&lt;/p>
&lt;p>Using cookies is a requirement of some websites, but it’s not a requirement of HTTP. Most websites use cookies in some way or another. When you log into a site, you expect cookies. But when you’re just browsing there isn’t any technical need. A small number of websites absolutely require cookies to be enabled to use the site, even if you’re just browsing and not logged in. Unfortunately, this includes some publishers.&lt;/p>
&lt;p>Requiring cookies to use a publisher site means that you can’t fully resolve a DOI without enabling cookies. Most tools out there don’t. Some privacy-conscious people quite reasonably don’t enable cookies from all sites.&lt;/p>
&lt;p>Using cookies when resolving a DOI adds considerable overhead and isn’t fool-proof.&lt;/p>
&lt;p>Let’s try a quick experiment to see when we land up on a cookie page. Here’s an example page that tells us that we should have enabled cookies: &lt;a href="http://www.tandfonline.com/action/cookieAbsent">&lt;a href="http://www.tandfonline.com/action/cookieAbsent" target="_blank">http://www.tandfonline.com/action/cookieAbsent&lt;/a>&lt;/a> . It’s reachable from the DOI: &lt;a href="https://doi.org/10.1016/j.envhaz.2007.09.007">&lt;a href="https://doi.org/10.1016/j.envhaz.2007.09.007" target="_blank">https://doi.org/10.1016/j.envhaz.2007.09.007&lt;/a>&lt;/a> .&lt;/p>
&lt;h2 id="experiment-6-some-dois-cant-be-resolved-without-cookies">Experiment 6: Some DOIs can’t be resolved without cookies&lt;/h2>
&lt;p>&lt;a title="By National Eye Institute (Laboratory Experiment) [CC BY 2.0 (http://creativecommons.org/licenses/by/2.0)], via Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File%3ALaboratory_scientist_conducts_an_experiment_with_a_Rotary_evaporator.jpg">&lt;img src="https://upload.wikimedia.org/wikipedia/commons/thumb/a/af/Laboratory_scientist_conducts_an_experiment_with_a_Rotary_evaporator.jpg/512px-Laboratory_scientist_conducts_an_experiment_with_a_Rotary_evaporator.jpg" alt="Laboratory scientist conducts an experiment with a Rotary evaporator" class="img-responsive" />&lt;/a>&lt;/p>
&lt;p>&lt;strong>Hypothesis&lt;/strong>: We can’t resolve some DOIs to the Landing Page using standard tools because cookies are required.&lt;/p>
&lt;p>&lt;strong>Method&lt;/strong>: A sample of DOIs was taken per Resource URL Domain. They were resolved by following HTTP links. Where the Destination URL contains the word ‘cookie’, we mark that as a DOI requiring a cookie.&lt;/p>
&lt;p>&lt;strong>Observations&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>A sample of 253,381 DOIs were resolved following HTTP redirects where necessary&lt;/li>
&lt;li>a total of 6305 resolved to a page with ‘cookie’ in the URL or 2.48%&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Conclusion: &lt;/strong>There are cookies at play for at least 2.48% of DOIs. This is probably a very conservative estimate, as we’re using a blunt tool looking for ‘cookie’ in the URL.&lt;/p>
&lt;h2 id="cookies-required">Cookies Required&lt;/h2>
&lt;p>For one DOI I found, the publisher system set cookies, then sent us on a series of redirects which set cookies that expired in the past and then, as far as I can tell, checked whether or not they were sent back. My working hypothesis is that it was profiling the behaviour to see what browser I was using.&lt;/p>
&lt;p>I have also seen javascript-based redirects. This is where a web page loads a javascript file, which executes and sends the browser onto another URL. This seems to be to be a browser detection method. There is no way you can follow these DOIs without actually using a real browser.&lt;/p>
&lt;p>This is a problem for Crossref Event Data. We can’t fire up a browser and follow every DOI: it isn’t practical. When I tried this for a sample as an experiment I got an email from another publisher who was worried that we were scraping data (good bot operators always put contact details in their request headers!).&lt;/p>
&lt;p>The &lt;a href="http://www.crossref.org/02publishers/59pub_rules.html">Crossref member rules&lt;/a> leave some wiggle-room about whether this is allowed, but for the Event Data service, we can say that it’s a physical impossibility to collect all Event Data for DOIs like this.&lt;/p>
&lt;h2 id="bring-in-the-browser">Bring in the Browser&lt;/h2>
&lt;p>To quantify the size of the problem, we need to bring in a web browser. If we assume that some Publishers design their sites to work only with real browsers, that’s what we’ll use. Luckily there are web browsers packaged up into an automatable package, and we can use these to visit the DOI.&lt;/p>
&lt;p>Using one of these is considerably slower than just following link headers.&lt;/p>
&lt;p>I have split the ‘destination’ concept into two:&lt;/p>
&lt;ol>
&lt;li>Naïve destination URL: The URL that you get from following HTTP redirects acccording to the HTTP specification&lt;/li>
&lt;li>Browser destination URL: The URL that you get from letting a browser follow the DOI doing whatever a browser does.&lt;/li>
&lt;/ol>
&lt;p>Rather than defining a complicated spectrum of types of DOI resolution behaviour, I am classifying DOIs into two groups: those where standard HTTP redirects are sufficient and everything else.&lt;/p>
&lt;p>The method I am using is to resolve a sample of URLs using the browser. I can then compare the Naïve Destination URL with the Browser Destination URL. If they are the same, then I didn’t need to use the browser after all. If they give a different result however, I trust the Browser one better and declare that DOI to require a browser to resolve.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/4.png" alt="naive vs browser" class="img-responsive" />
&lt;p>Again, I took a sample of DOIs per Resource URL domain.&lt;/p>
&lt;h2 id="experiment-7-quantify-proportion-of-dois-that-require-a-browser-to-redirect">Experiment 7: Quantify proportion of DOIs that require a browser to redirect&lt;/h2>
&lt;p>&lt;a title="By NASA / Paul Riedel (Great Images in NASA: Home - info - pic) [Public domain], via Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File%3AAtomic_Laboratory_Experiment_on_Atomic_Materials_-_GPN-2000-000663.jpg">&lt;img src="https://upload.wikimedia.org/wikipedia/commons/thumb/e/e7/Atomic_Laboratory_Experiment_on_Atomic_Materials_-_GPN-2000-000663.jpg/256px-Atomic_Laboratory_Experiment_on_Atomic_Materials_-_GPN-2000-000663.jpg" alt="Atomic Laboratory Experiment on Atomic Materials - GPN-2000-000663" class="img-responsive" />&lt;/a>&lt;/p>
&lt;p>&lt;strong>Hypothesis&lt;/strong>: A number of DOIs can’t be resolved with standard tools but instead require a browser.&lt;/p>
&lt;p>&lt;strong>Method&lt;/strong>: A sample of DOIs was selected per Resource URL domain. The links were followed using standard HTTP and using a browser. Where the URLs between the two were different, the DOI was counted as requiring a browser to resolve.&lt;/p>
&lt;p>&lt;strong>Observations&lt;/strong>:&lt;/p>
&lt;ul>
&lt;li>A total of 59,453 items were followed both using the Naïve and Browser methods.&lt;/li>
&lt;li>Of these 5,883 items have a different URL between the two methods, or 9.88%&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>Conclusion&lt;/strong>: We can’t rely on the Naïve redirect, and would have to fire up the browser in about 10% of cases in the sample.&lt;/p>
&lt;h2 id="other-gnarly-things">Other gnarly things&lt;/h2>
&lt;p>There are one or two supplementary gnarly things that crop up.&lt;/p>
&lt;p>First, session IDs are sometimes embedded in the URL. This is a tracking technique similar to cookies, but instead of sending cookies, which are invisible to the user, a unique code is placed on the end of the URL. This means that everyone gets a different URL. The most popular of these is the JSESSIONID, which is used by servers in the Java ecosystem. An example URL is:&lt;/p>
&lt;p>&lt;a href="http://onlinelibrary.wiley.com/doi/10.1002/047084289X.rn00615.pub3/abstract;jsessionid=0D1B7AC4689A494E0EA78BD2F0A710C4.f04t04" target="_blank">http://onlinelibrary.wiley.com/doi/10.1002/047084289X.rn00615.pub3/abstract;jsessionid=0D1B7AC4689A494E0EA78BD2F0A710C4.f04t04&lt;/a>&lt;/p>
&lt;p>We can easily remove these if they appear at the end of a URL. Sometimes they occur in the middle of a URL, as above. Sometimes they appear as query parameters:&lt;/p>
&lt;p>&lt;a href="http://jpharmsci.org/action/consumeSharedSessionAction?SERVER=WZ6myaEXBLGvmNGtLlDx7g%3D%3D&amp;amp;MAID=npYBLvZTaUI3JTHw%2BH63WQ%3D%3D&amp;amp;JSESSIONID=aaajjhdDL5ssK6d1HHrFv&amp;amp;ORIGIN=207988872&amp;amp;RD=RD" target="_blank">http://jpharmsci.org/action/consumeSharedSessionAction?SERVER=WZ6myaEXBLGvmNGtLlDx7g%3D%3D&amp;amp;MAID=npYBLvZTaUI3JTHw%2BH63WQ%3D%3D&amp;amp;JSESSIONID=aaajjhdDL5ssK6d1HHrFv&amp;amp;ORIGIN=207988872&amp;amp;RD=RD&lt;/a>&lt;/p>
&lt;p>In this case we make no attempt to remove them. These URLs won’t be any use for matching, and we have to acknowledge that and move on.&lt;/p>
&lt;h2 id="interpreting-the-results">Interpreting the results&lt;/h2>
&lt;p>All the above experiments involved taking as many DOIs as we had time for, gathering the Resource URLs, and then grouping the DOIs per Resource URL Domain. A sample of DOIs was investigated per each Resource URL domain to give the best chance at even coverage. The above figures have been presented as a proportion of the sampled data-set.&lt;/p>
&lt;p>Now it’s time to draw some practical conclusions. I grouped the results per Resource URL Domain, so I can say that “for this domain, X% of DOIs was deleted, or aliased, or whatever”. This means that we can look at the statistics for a given domain and work out the best method for working with DOIs that belong to it.&lt;/p>
&lt;p>I have created histograms of domains by their various proportions.&lt;/p>
&lt;p>Our first chart is histogram of Resource URL Domains where the Naïve Destination = the Resource URL. Each domain is given a proportion which represents how many DOIs sampled on that domain have a Landing Page equal to the Resource URL.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/h_proportion_resource_equals_naive_destination_url.png" alt="h_proportion_resource_equals_naive_destination_url" class="img-responsive" />
&lt;p>There’s a clear bimodal distribution here. The conclusion here is “&lt;strong>most domains require you to follow the link to find the destination URL&lt;/strong>“. Furthermore, the domains are consistent: there are virtually no domains that have a mix of DOIs that behave differently.&lt;/p>
&lt;p>Our second chart is a histogram of Resource URLs where the Browser-based redirect = the Naive URL. Each domain is given a proportion which represents how many DOIs sampled on that domain require us to fire up a browser.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/h_proportion_naive_equals_browser_destination_url.png" alt="h_proportion_naive_equals_browser_destination_url" class="img-responsive" />
&lt;p>Overwhelmingly, the Browser Redirect URL is the same as the Naïve Redirect URL, meaning that we don’t need to fire up the browser, we can just use the Naïve URL, which is much easier to compute. There are some resource URL domains which require every DOI to be followed in a browser rather than just following links.&lt;/p>
&lt;p>We know from this that we don’t have to use the browser most of the time. There is a small number of domains where we’re unsure (under 500) and a small number of domains where we know that we have to use a browser. This means we can focus our efforts.&lt;/p>
&lt;h3 id="there-are-lots-of-dois-and-they-all-behave-differently">There are lots of DOIs and they all behave differently.&lt;/h3>
&lt;p>There are thousands of publishers out there registering DOIs. There are thousands of domains. Some publishers have lots of domains. This makes it impossible to make many general observations about DOIs.&lt;/p>
&lt;h3 id="you-cant-tell-anything-by-looking-at-the-doi">You can’t tell anything by looking at the DOI&lt;/h3>
&lt;p>Just by looking at the DOI you can’t tell who published it, or which publisher’s system is hosting it. Therefore you can’t tell how it’s going to behave.&lt;/p>
&lt;p>We’ve looked at five kinds of URLs:&lt;/p>
&lt;ol>
&lt;li>The DOI itself&lt;/li>
&lt;li>The Resource URL&lt;/li>
&lt;li>The “naïve” redirect URL&lt;/li>
&lt;li>The “browser” redirect URL&lt;/li>
&lt;li>The Article Landing Page&lt;/li>
&lt;/ol>
&lt;p>In some cases, the Resource URL, naïve redirect URL, browser redirect and Article Landing Page are the same. In some cases they aren’t. Of these, the fifth is somewhat mythical.&lt;/p>
&lt;h3 id="dois-fall-into-classifications">DOIs fall into classifications&lt;/h3>
&lt;p>Each DOI falls into a category, most preferable first:&lt;/p>
&lt;ol>
&lt;li>The Resource URL is the same as the Landing Page.&lt;/li>
&lt;li>The Landing Page can be discovered by following HTTP redirects.&lt;/li>
&lt;li>The Landing Page can be discovered by firing up a web browser to follow redirects.&lt;/li>
&lt;li>The Landing Page can’t be determined.&lt;/li>
&lt;/ol>
&lt;h3 id="we-can-predictively-group-dois">We can predictively group DOIs&lt;/h3>
&lt;p>We can group DOIs by their Resource URLs and take a sample per Resource URL Domain. If all samples for a domain behave a certain way, we can place the DOIs into one of the above four groups with a probability.&lt;/p>
&lt;h3 id="well-never-know-the-full-story">We’ll never know the full story.&lt;/h3>
&lt;p>Because of the diversity of Publisher Systems and the long history of Crossref DOIs, we’ll never be able to describe exactly what’s going on for all DOIs.&lt;/p>
&lt;h2 id="what-next">What next?&lt;/h2>
&lt;p>We’re continuing to develop Crossref Event Data. The part of the system that handles turning URLs back into DOIs will never be perfect, but we know from this research that we can at least work with a subset.&lt;/p>
&lt;p>I’m also working on another project which will attempt to reverse a Landing Page URL back into a DOI by looking at the metadata on the Landing Page. You can &lt;a href="https://github.com/Crossref/doi-destinations">read about it here&lt;/a>. Ultimately we’re going to have to take a blended approach. Building a useful set of Landing Page URL to DOI mappings will be part of the mix.&lt;/p>
&lt;p>As Event Data matures we’ll be sharing all the datasets automatically as part of our infrastructure, including our DOI-to-URL mapping.&lt;/p>
&lt;blockquote>
&lt;p>And any members reading, please make your DOIs as easy to follow as possible! Please don’t require JavaScript or cookies when resolving DOIs.&lt;/p>
&lt;/blockquote>
&lt;p>If you’re read this far, perhaps you’re as interested in DOIs as we are. There’s a lot more to say on the subject, but that’s enough for now. See you at &lt;a href="http://pidapalooza.org/">PIDapalooza&lt;/a>!&lt;/p>
&lt;p> &lt;/p>
&lt;h3 id="image-credits">Image Credits&lt;/h3>
&lt;p>All images from Wikipedia Commons. Click or hover on the image to see the attribution.&lt;/p></description></item><item><title>Preprints are go at Crossref!</title><link>https://www.crossref.org/blog/preprints-are-go-at-crossref/</link><pubDate>Wed, 02 Nov 2016 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/preprints-are-go-at-crossref/</guid><description>&lt;p>&lt;span >We’re excited to say that we’ve finished the work on our infrastructure to allow members to register preprints. Want to know why we’re doing this? Jennifer Lin &lt;/span>&lt;a href="https://www.crossref.org/blog/getting-ready-to-run-with-preprints-any-day-now">&lt;span >explains the rationale in detail&lt;/span>&lt;/a> &lt;span >in an earlier post, but in short we want to help make sure that:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>links to these publications persist over time&lt;/li>
&lt;li>they are connected to the full history of the shared research results&lt;/li>
&lt;li>the citation record is clear and up-to-date&lt;/li>
&lt;/ul>
&lt;p>Doing so will help fully integrate preprint publications into the formal scholarly record.&lt;/p>
&lt;h2 id="whats-new">What’s new?&lt;/h2>
&lt;p>&lt;span >We’ve had to do some work on our own infrastructure to facilitate the inclusion of preprints, enabling: &lt;/span>&lt;/p>
&lt;ul>
&lt;li>Crossref membership for preprint repositories by updating our membership criteria and creating a &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/213126346-Posted-content-includes-preprints-#policies">&lt;span >policies&lt;/span>&lt;/a>&lt;span > for preprints&lt;/span>&lt;/li>
&lt;li>The deposit of persistent identifiers for preprints to ensure successful links to the scholarly record over the course of time via the DOI resolver.&lt;/li>
&lt;li>Content Registration for preprints with &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/213126346-Posted-content-includes-preprints-#depositing">&lt;span >custom metadata&lt;/span>&lt;/a>&lt;span > that reflect researcher workflows from preprint to formal publication (this custom metadata will then be visible to anyone using the Crossref metadata).&lt;/li>
&lt;li>Notification of links between preprints and formal publications that may follow (journal articles, monographs, etc.).&lt;/li>
&lt;li>&lt;a href="https://info.orcid.org/auto-update-has-arrived-orcid-records-move-to-the-next-level/">&lt;span >Auto-update of ORCID records&lt;/span>&lt;/a>&lt;span > to ensure that preprint contributors get credit for their work.&lt;/li>
&lt;li>&lt;a href="https://www.crossref.org/blog/a-healthy-infrastructure-needs-healthy-funding-data/">&lt;span >Preprint and funder registration&lt;/span>&lt;/a>&lt;span > to automatically report research contributions based on funder and grant identification.&lt;/li>
&lt;li>It will also allow for the collection of “&lt;/span>&lt;a href="https://www.crossref.org/blog/crossref-event-data-early-preview-now-available/">&lt;span >event data&lt;/span>&lt;/a>&lt;span >” that capture activities surrounding preprints (usage, social shares, mentions, discussions, recommendations, links to datasets and other research entities, etc.).&lt;/li>
&lt;/ul>
&lt;/li>
&lt;p>Now we’re ready to go!&lt;/p>
&lt;h2 id="early-adopters">Early adopters&lt;/h2>
&lt;p>&lt;span >We have been working with various preprint publishers who are launching (or planning to launch) their own preprint initiatives. &lt;/span>&lt;/p>
&lt;p>&lt;span >Preprints.org is the first to successfully make preprints deposits using the dedicated schema. For example, this preprint &lt;/span>&lt;a href="https://doi.org/10.20944/preprints201608.0191.v1" target="_blank">&lt;span >https://doi.org/&lt;/span>&lt;span >10.20944/preprints201608.0191.v1&lt;/span>&lt;/a> &lt;span >is registered with Crossref. It is linked to a published journal article &lt;/span>&lt;a href="https://doi.org/10.3390/data1030014" target="_blank">&lt;span >https://doi.org/10.3390/data1030014&lt;/span>&lt;/a> &lt;span >both in the online display as well &lt;/span>&lt;a href="https://api.crossref.org/v1/works/10.20944/preprints201608.0191.v1/transform/application/vnd.crossref.unixsd&amp;#43;xml" target="_blank">&lt;span >the preprint’s Crossref metadata record&lt;/span>&lt;/a>&lt;span >. Others are getting ready to go - will your organisation be next? (Technical documentation available &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/213126346-Posted-content-includes-preprints-" target="_blank">&lt;span >here&lt;/span>&lt;/a>&lt;span >.)&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>Martyn Rittman, from Preprints, operated by MDPI said: Preprints.org is delighted to be the very first to integrate the Crossref schema for preprints. We believe it is an important step in allowing working papers and preliminary results to be fully citable as soon as they are available. It also makes it easy to link to the final peer-reviewed version, regardless of where it is published. Thanks to the hard work of Crossref and clear documentation, the schema was very simple to implement and has been applied retrospectively to all preprints at Preprints.org.&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Jessica Polka, Director, ASAPbio adds: ASAPbio is a scientist-driven community initiative to promote the productive use of preprints in the life sciences. We’re thrilled to see Crossref’s development of a service that enables preprints to better contribute to the scholarly record. This infrastructure lays a necessary foundation for increasing acceptance of preprints as a valuable form of scientific communication among biologists.&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;h2 id="questions">Questions?&lt;/h2>
&lt;p>&lt;a href="mailto:feedback@crossref.org">&lt;span >Get in touch&lt;/span>&lt;/a> &lt;span >with any questions or comments, or join our upcoming &lt;/span>&lt;a href="https://attendee.gotowebinar.com/register/7523925461867007490" target="_blank">&lt;span >webinar&lt;/span>&lt;/a> &lt;span >to talk about preprints, infrastructure and where we go from here. &lt;/span>&lt;/p></description></item><item><title>The Organisation Identifier Project: a way forward</title><link>https://www.crossref.org/blog/the-organisation-identifier-project-a-way-forward/</link><pubDate>Mon, 31 Oct 2016 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/the-organisation-identifier-project-a-way-forward/</guid><description>&lt;p>The scholarly communications sector has built and adopted a series of open identifier and metadata infrastructure systems to great success.  Content identifiers (through Crossref and DataCite) and contributor identifiers (through ORCID) have become foundational infrastructure to the industry.  &lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/10/Screenshot-2016-10-31-15.42.15-300x201.png" alt="organisation Identifier Project" width="300px" />
&lt;/div>
&lt;p>But there still seems to be one piece of the infrastructure that is missing.  There is as yet no open, stakeholder-governed infrastructure for organisation identifiers and associated metadata.&lt;/p>
&lt;p>In order to understand this gap, Crossref, DataCite and ORCID have been collaborating to:&lt;/p>
&lt;ul>
&lt;li>Explore the current landscape of organisational identifiers;&lt;/li>
&lt;li>Collect the use-cases that would benefit our respective stakeholders in scholarly communications industry;&lt;/li>
&lt;li>Identify those use-cases that can be more feasibly addressed in the near term; and&lt;/li>
&lt;li>Explore how the three organisations can collaborate (with each other and with others) to practically address this key missing piece of scholarly infrastructure.&lt;/li>
&lt;/ul>
&lt;p>The result of this work is in three related papers being released by Crossref, DataCite and ORCID for community review and feedback. The three papers are:&lt;/p>
&lt;ul>
&lt;li>organisation Identifier Project: A Way Forward (&lt;a href="https://doi.org/10.5438/2906" target="_blank">PDF&lt;/a>; &lt;a href="https://docs.google.com/document/d/1PpWRBnlrU_X6TwYzQlB89w4FNXMLqieJv-RW0irNTsg/edit?usp=sharing" target="_blank">GDoc&lt;/a>)&lt;/li>
&lt;li>organisation Identifier Provider Landscape (&lt;a href="https://doi.org/10.5438/4716" target="_blank">PDF&lt;/a>; &lt;a href="https://docs.google.com/document/d/1lcKXWm9PxDvVWBxdlH7BVU7w8esnW0F_dppNiCJ9BW8/edit#" target="_blank">GDoc&lt;/a>)&lt;/li>
&lt;li>Technical Considerations for an organisation Identifier Registry (&lt;a href="https://doi.org/10.5438/7885" target="_blank">PDF&lt;/a>; &lt;a href="https://docs.google.com/a/datacite.org/document/d/1Zj5sRRdnjKLjY81AbaeUdal3n6VuQgi1H66vRMaayiA/edit?usp=sharing" target="_blank">GDoc&lt;/a>)&lt;/li>
&lt;/ul>
&lt;p>We invite the community to comment on these papers both via email (&lt;a href="mailto:oi-project@orcid.org">&lt;a href="mailto:oi-project@orcid.org">oi-project@orcid.org&lt;/a>&lt;/a>) and at&lt;/span> &lt;a href="http://pidapalooza.org">PIDapalooza&lt;/a> on November 9th and 10th and at &lt;a href="https://crossreflive16.sched.org">Crossref LIVE16&lt;/a> on November 1st and 2nd. To move The OI Project forward, we will be forming a Community Working Group with the goal of holding an initial meeting before the end of 2016. The Working Group’s main charge is to develop a plan to launch and sustain an open, independent, non-profit organisation identifier registry to facilitate the disambiguation of researcher affiliations.&lt;/p>
&lt;h3 id="crossref-use-casesspan">Crossref Use Cases&lt;/span>&lt;/h3>
&lt;p>Crossref has also been discussing the needs of its members over the last year and there is value in focusing on the affiliation name ambiguity problem with research outputs and contributors. In terms of the metadata that Crossref collects, something that is missing has been affiliations for the authors of publications. Over the last couple of years, Crossref has been expanding what it collects - for example, funding and licensing data and ORCID iDs - and this enables a fuller picture of what we are calling the &lt;a href="https://doi.org/10.64000/n0zjv-z6c66" target="_blank">article nexus&lt;/a>. In order to continue to fill out the metadata we collect - and for our members to use in their own systems and publications - we need an organisation identifier.&lt;/p>
&lt;p>Another use case for Crossref is identifying funders as part of collecting funder data to enable connecting funding sources with the published scholarly literature. In order to enable the reliable identification of funders in the Crossref system we created the Open Funder Registry that now has over 15,000 funders available as Open Data under a &lt;a href="https://creativecommons.org/publicdomain/zero/1.0/">CC0 waiver&lt;/a>. While this has been very successful, it is a very narrowly focused registry and is not suitable for a broad, community-run organisation identifier registry that addresses the affiliation use case. In future, our goal will be to merge the Open Funder Registry into the identifier registry that the organisation Identifier Working Group will work on.&lt;/p>
&lt;p>By working collaboratively we can define a pragmatic and cost-effective service that will meet a fundamental need of all scholarly communication stakeholders.&lt;/p>
&lt;p>Geoffrey Bilder will be focusing &lt;a href="https://crossreflive16.sched.com/event/8hqy/geoffrey-bilder-the-case-of-the-missing-leg">his talk at Crossref LIVE16&lt;/a> this week on this initiative, dubbed The OI Project. The talk is scheduled for 2pm UK time and will be live streamed along with the rest of that day’s program.&lt;/p></description></item><item><title>Smart alone; brilliant together. Community reigns at Crossref LIVE16</title><link>https://www.crossref.org/blog/smart-alone-brilliant-together.-community-reigns-at-crossref-live16/</link><pubDate>Sat, 29 Oct 2016 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/smart-alone-brilliant-together.-community-reigns-at-crossref-live16/</guid><description>&lt;p>&lt;span >A bit different from our traditional meetings, &lt;a href="https://www.eventbrite.com/e/crossref-live16-registration-25928526922#">Crossref LIVE16&lt;/a> next week is the first of a totally new annual event for the scholarly communications community.  Our theme is &lt;span >&lt;strong>Smart alone; brilliant together&lt;/strong>&lt;/span>.  We have a broad program of both informal and plenary talks across two days. There will be stations to visit, conversation starters, and entertainment, that highlight what our community can achieve if it works together. &lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>&lt;span >&lt;a href="https://crossreflive16.sched.com/">Check out the final program&lt;/a>.&lt;/span>&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >We’re now opening the doors to all parties—our 5,000+ members of all shapes and sizes—as well as the technology providers, funders, libraries, and researchers that we work with.  &lt;/span>&lt;span >Our aim is to gather the ‘metadata-curious’ and have more opportunities to talk face-to-face to share ideas and information, see live demos, and get to know one another.&lt;/span>&lt;/p>
&lt;p >
&lt;span >&lt;strong>&lt;span >Mashup Day - Tuesday 1st November 12-5pm.&lt;/span>&lt;/strong>  An &amp;#8216;open house’ vibe, we’ll have several stations to visit each Crossref team, a LIVE Lounge, good food, and guest areas run by our friends at &lt;span >DataCite&lt;/span>, &lt;span >ORCID&lt;/span>, and &lt;span >Turnitin&lt;/span>.  We’ll have some special programming too, on-the-hour lightning talks, including &lt;/span>&lt;span >a wild talk at 2pm from a primatologist who speaks baboon! &lt;/span>
&lt;/p>
&lt;p >
&lt;span >&lt;strong>&lt;span >Conference Day - Wednesday 2nd November 9am-5pm.&lt;/span>&lt;/strong>  There is more of a formal plenary agenda this day, with keynote speakers from across the scholarly communications landscape.  Our primary goal is to share Crossref strategy and plans, alongside thought-provoking perspectives from our guest speakers.  We’ll hear from many corners of our community including:&lt;/span>
&lt;/p>
&lt;li >
&lt;span >Funder program officer, Carly Strasser (Moore Foundation) on &amp;#8220;&lt;span >Publishers and funders as agents of change&lt;/span>&amp;#8220;, &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Data scientist, Ian Calvert (Digital Science) on &amp;#8220;&lt;span >You don’t have metadata&lt;/span>&amp;#8220;, &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Open knowledge advocate, Dario Taraborelli (The Wikimedia Foundation) on &amp;#8220;&lt;span >Citations for the sum of all human knowledge&lt;/span>&amp;#8220;, and&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Scholarly communications librarian, April Hathcock (New York University) on &amp;#8220;&lt;span >Opening up the margins&lt;/span>&amp;#8220;. &lt;/span>
&lt;/li>
&lt;p >
&lt;span >&lt;span >&lt;span >For our part, we will set out Crossref’s &amp;#8220;&lt;span >strategy and key priorities&lt;/span>&amp;#8221; (Ed Pentz), &amp;#8220;&lt;span >A vision for membership&lt;/span>&amp;#8221; (me, Ginny Hendricks), &amp;#8220;&lt;span >The meaning of governance&lt;/span>&amp;#8221; (Lisa Hart Martin), &amp;#8220;&lt;span >The case of the missing leg&lt;/span>&amp;#8221; (Geoffrey Bilder),&amp;#8221;&lt;span >New territories in the scholarly research map&lt;/span>&amp;#8221; (Jennifer Lin), and &amp;#8220;&lt;span >Relationships and other notable things&lt;/span>&amp;#8221; (Chuck Koscher).  &lt;/span>&lt;/span>&lt;/span>
&lt;/p>
&lt;p >
&lt;span >&lt;span >&lt;span >We will also set aside thirty minutes fo&lt;/span>r the important Crossref annual business meeting, when we will announce the results of the &lt;span >&lt;a href="https://www.crossref.org/blog/one-member-one-vote-crossref-board-election-opens-today-september-30th/">membership’s vote&lt;/a>, and welcome new board members.&lt;/span>&lt;/span>&lt;/span>
&lt;/p>
&lt;p>&lt;span >I can’t wait to welcome you all.&lt;/span>&lt;/p>
&lt;h1 id="span-have-you-votedspan">&lt;span >Have you voted?&lt;/span>&lt;/h1>
&lt;p>&lt;span >If you’re a voting member of Crossref you’ll have cast your vote already I hope! I’m so happy to see that people have voted in record numbers although it’s under 7% of our eligible members which is not high… more on member participation next week.&lt;/span>&lt;/p></description></item><item><title>Important changes to Similarity Check</title><link>https://www.crossref.org/blog/important-changes-to-similarity-check/</link><pubDate>Fri, 21 Oct 2016 00:00:00 +0000</pubDate><author>Madeleine Watson</author><guid>https://www.crossref.org/blog/important-changes-to-similarity-check/</guid><description>&lt;h2 id="new-features-new-indexing-new-name---oh-my">New features, new indexing, new name - oh my!&lt;/h2>
&lt;div style="float:right;margin:10px">
&lt;img src="https://assets.crossref.org/logo/crossref-similarity-check-logo-200.svg" width="200" height="98" alt="Crossref Similarity Check logo">
&lt;/div>
&lt;p>&lt;strong>TL;DR&lt;/strong> The indexing of Similarity Check users’ content into the shared full-text database is about to get a lot faster. Now we need members assistance in helping Turnitin (the company who own and operate the iThenticate plagiarism checking tool) to transition to a new method of indexing content.&lt;/p>
&lt;p>&lt;span >For existing Similarity Check users: please check that your metadata includes full-text URLs so that Turnitin can quickly and easily locate and index your content. Full-text URLs need to be included in 90% of journal article metadata by 31st December 2016.&lt;/span>&lt;/p>
&lt;h2 id="span-2016-has-seen-some-exciting-new-developmentsspan">&lt;span >2016 has seen some exciting new developments&lt;/span>&lt;/h2>
&lt;p>&lt;span >(And there are plenty more in store as we strive towards 2017). But first: i&lt;/span>&lt;span >&lt;span >n April we renamed the service from CrossCheck to Similarity Check and we now have a new service logo available to reference via our &lt;/span>&lt;a href="https://www.crossref.org/brand">&lt;span >logo CDN&lt;/span>&lt;/a>&lt;span > using the following code.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;code>&amp;lt;img src=&amp;quot;https://assets.crossref.org/logo/crossref-similarity-check-logo-200.svg&amp;quot; width=&amp;quot;200&amp;quot; height=&amp;quot;98&amp;quot; alt=&amp;quot;Crossref Similarity Check logo&amp;quot;&amp;gt;&lt;/code>&lt;/p>
&lt;p>&lt;span >Earlier this year Crossref also signed a new contract with Turnitin. As part of this, we negotiated the inclusion of dedicated development time each year from Turnitin’s engineering and product teams to focus on developments in the iThenticate tool that will specifically support Similarity Check users and their needs.  Many of our members will have been contacted recently by Turnitin and asked to complete a survey regarding how they use the tool and what improvements they would like to see made in the future. The results of this survey are currently being analyzed and will be used by Turnitin to inform a development plan.&lt;/span>&lt;/p>
&lt;p>&lt;span >Finally, throughout 2016 we have also been working with Turnitin to help them develop a new Content Intake System that provides a faster, more reliable and robust method for collecting data from Crossref and indexing users’ content into the Similarity Check full-text database.  Previously Turnitin was only able to collect prefix data from Crossref’s system on a monthly basis whereas today, with the new Content Intake System up and running, they are able to pull full-text content links from deposited metadata on a daily basis. This means that if you are a Similarity Check user currently depositing full-text URLs with Crossref, your content is being indexed by Turnitin faster than ever before.&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >There are plenty of other benefits this new method provides. This is why we have agreed with Turnitin that from 1st January 2017 onwards, indexing via full-text URLs will be the only method supported for Similarity Check.&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Not convinced? Let me share my top four reasons for advocating Turnitin’s exclusive use of the full-text URL indexing method for Similarity Check:&lt;/span>&lt;/p>
&lt;p >
&lt;span >&lt;b>1. Reduced traffic to publisher servers.&lt;/b>&lt;span > Indexing via full-text URLs means that the crawl is targeted specifically to the location of the full-text PDF or HTML content, thereby reducing the amount of traffic Turnitin puts through publisher’s servers.&lt;/span>&lt;/span>
&lt;/p>
&lt;p >
&lt;span >&lt;b>2. Lower margin for error and simplified issue recovery.&lt;/b>&lt;span > Turnitin will no longer need to make multiple fetches for any content item, meaning there are now fewer steps in the process. This means there will be fewer places for indexing errors to occur and also reduces the reliance on users setting meta tags or span tags correctly in their markup. Furthermore, if problems do arise, using the one method of indexing for all users will mean that Turnitin is able to pinpoint the issue faster and work with members to resolve it quickly. &lt;/span>&lt;/span>
&lt;/p>
&lt;p >
&lt;span >&lt;b>3. Quicker turnaround on indexing with fewer delays.&lt;/b>&lt;span > Turnitin will no longer need to investigate and set up bespoke indexing methods for different Similarity Check users and they will be able to access the location of full-text content from the one place (ie. within the specific &lt;iparadigms> resource tag in member’s metadata deposits). More accurate data from only one location will result in a quicker turnaround on indexing, meaning newly published content will be added into the Similarity Check content database sooner for all members to check other new manuscripts against.&lt;/span>&lt;/span>
&lt;/p>
&lt;p >
&lt;span >&lt;b>4. Daily ingest is better than monthly!&lt;/b> Full-text links can be collected daily from Crossref-rather than monthly for other methods-meaning a more regular ingest of content.&lt;/span>
&lt;/p>
&lt;p>&lt;span >The presence of full-text URLs within the metadata is critical to the functioning of Turnitin’s new indexing system. All new Similarly Check participants are now asked to ensure they have these links in place within their deposited metadata before they participate in the service.&lt;/span>&lt;/p>
&lt;h2 id="span-already-a-userof-similarity-checkspan">&lt;span >Already a user of Similarity Check? &lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;span >If you’re an existing Similarity Check participant who joined the service before 2016, your content is likely to be currently indexed via different methods, such as following links contained in your page meta tags. If you’re not currently depositing full-text links with Crossref for Similarity Check, you will have received an email from us about this in August. If you’re unsure though, you can check your XML to see if you have included the full-text link in the &lt;iparadigms> field or you can send us an email at &lt;/span>&lt;a href="mailto:similaritycheck@crossref.org">&lt;span >&lt;a href="mailto:similaritycheck@crossref.org">similaritycheck@crossref.org&lt;/a>&lt;/span>&lt;/a>&lt;span > as we’d be happy to check for you. &lt;/span>&lt;/span>&lt;/p>
&lt;h3 id="span-help-dont-leave-me-behindspan">&lt;span >Help, don’t leave me behind!&lt;/span>&lt;/h3>
&lt;p>&lt;span >&lt;span >Us? Never! We’re here to help. But we really do need those full-text links… Everything existing Similarity Check publishers need to know about adding full-text links into new or existing metadata can be found on our &lt;/span>&lt;a href="http://help.crossref.org/similaritycheck">&lt;span >help site&lt;/span>&lt;/a>&lt;span >. These URLs should be included as part of all standard metadata deposits going forward and can be easily added into existing files in bulk. So there’s no need to redeposit the full metadata, unless of course you would prefer to do so!&lt;/span>&lt;/span>&lt;/p>
&lt;h2 id="span-thats-a-wrapspan">&lt;span >That’s a wrap&lt;/span>&lt;/h2>
&lt;p>&lt;span >Looking back, it really has been a busy year for Similarity Check and it will continue to be so as we persevere in laying the groundwork for a more streamlined, robust and scalable service for 2017 and beyond. Remember, we need Similarity Check users to ensure they have full-text URLs in at least 90% of their journal article metadata by 31st December 2016 in order to continue using Similarity Check from 2017 onwards.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >And please keep us updated!  With over 1,200 publishers using Similarity Check, we’ll need a little nudge to know when metadata has been updated to include these links. So once updates have been deposited, please email &lt;/span>&lt;a href="mailto:similaritycheck@crossref.org">&lt;span >&lt;a href="mailto:similaritycheck@crossref.org">similaritycheck@crossref.org&lt;/a>&lt;/span>&lt;/a>&lt;span > to confirm. And of course, as always, if there are any questions or if some advice would help, we’re just an &lt;a href="mailto:similaritycheck@crossref.org">email&lt;/a> away.  &lt;/span>&lt;/span>&lt;/p>
&lt;p> &lt;/p></description></item><item><title>One member, one vote: Crossref Board Election opens today, September 30th</title><link>https://www.crossref.org/blog/one-member-one-vote-crossref-board-election-opens-today-september-30th/</link><pubDate>Fri, 30 Sep 2016 00:00:00 +0000</pubDate><author>Lisa Hart Martin</author><guid>https://www.crossref.org/blog/one-member-one-vote-crossref-board-election-opens-today-september-30th/</guid><description>&lt;h2 id="span-bwatch-for-two-important-emails-on-september-30bbthbb--one-with-a-voting-link-and-material-and-one-with-your-username-and-passwordbspan">&lt;span >&lt;b>Watch for two important emails on September 30&lt;/b>&lt;b>th&lt;/b>&lt;b> – one with a voting link and material, and one with your username and password.&lt;/b>&lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;b>Running&lt;/b>&lt;span > Crossref well is a key part of our mission. It’s important that we be as neutral and fair as possible, and we are always striving for that balance. One of our stated principles is “One member, one vote”. And each year we encourage each of our members-standing at over 6000 today-to participate in the election of new board members.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >It is hard to believe that November 2&lt;/span>&lt;span >nd&lt;/span>&lt;span > will be Crossref’s 17&lt;/span>&lt;span >th&lt;/span>&lt;span > annual meeting and our 16&lt;/span>&lt;span >th&lt;/span>&lt;span > annual Board of Directors election. How time flies, and &lt;a href="https://www.crossref.org/blog/the-membership-boom-why-metadata-isnt-like-beer/">oh, how we have grown&lt;/a>!&lt;/span>&lt;/span>&lt;figure id="attachment_2215" class="wp-caption alignnone">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths.png">&lt;img class="size-large wp-image-2215" src="https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths-1024x663.png" alt="Crossref's Truths, taken from our forthcoming new website. " width="840" height="544" srcset="https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths-1024x663.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths-300x194.png 300w, https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths-768x497.png 768w, https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths-1200x777.png 1200w, https://www.crossref.org/wp/blog/uploads/2016/09/screencapture-crossref-org-about-truths.png 1287w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 1362px) 62vw, 840px" />&lt;/a>&lt;figcaption class="wp-caption-text">Crossref’s Truths, taken from our forthcoming new website.&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >&lt;span >I am hoping that we can &lt;/span>&lt;b>rally&lt;/b>&lt;span > the membership to participate in this important process!&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Candidates will be elected at &lt;a href="https://www.eventbrite.com/e/crossref-live16-registration-25928526922">Crossref LIVE16&lt;/a> for three-year terms to fill five of the 16 Board seats whose terms expire this year.  The slate of candidates was recommended by the Nominating Committee, which consisted of three Board members not up for re-election, and two Crossref members that are not on the Board. &lt;/span>&lt;/p>
&lt;p>&lt;span >This year, Jasper Simons, APA; Paul Peters, Hindawi; Jason Wilde, AIP; Chris Fell, Cambridge University Press; and Rebecca Lawrence, f1000 served on the Nominating Committee.  The Committee met to discuss the process, criteria, and potential candidates, and put forward a slate which was required to be at least equal to the number of Board seats up for election. The slate may or may not consist of Board members up for re-election.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Crossref members are welcome to run as independent candidates, as long as they have ten member endorsements sent to &lt;/span>&lt;a href="mailto:lhart@crossref.org">&lt;span >&lt;a href="mailto:lhart@crossref.org">lhart@crossref.org&lt;/a>&lt;/span>&lt;/a>&lt;span > with the intent to run. We sent a notification of the process in advance (this year on August 26&lt;/span>&lt;span >th&lt;/span>&lt;span >), so any nominations could be included in the voting materials that will be sent via email on September 30&lt;/span>&lt;span >th&lt;/span>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;h3 id="span-you-can-access-online-voting-from-today-atspan">&lt;strong>&lt;span >You can access online voting from today at:&lt;/span>&lt;/strong>&lt;/h3>
&lt;h3 id="span-a-hrefhttpseballot4votenetcompilaadminhttpseballot4votenetcompilaadmina-watch-your-inbox-today-for-emails-with-your-username-and-passwordspan">&lt;strong>&lt;span >&lt;a href="https://eballot4.votenet.com/PILA/admin">&lt;a href="https://eballot4.votenet.com/PILA/admin" target="_blank">https://eballot4.votenet.com/PILA/admin&lt;/a>&lt;/a>. Watch your inbox today for emails with your username and password!&lt;/span>&lt;/strong>&lt;/h3></description></item><item><title>New Crossref DOI display guidelines are on the way</title><link>https://www.crossref.org/blog/new-crossref-doi-display-guidelines-are-on-the-way/</link><pubDate>Tue, 27 Sep 2016 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/new-crossref-doi-display-guidelines-are-on-the-way/</guid><description>&lt;h3 id="span-tldrspan">&lt;span >TL;DR&lt;/span>&lt;/h3>
&lt;p>&lt;span >Crossref will be updating its DOI Display Guidelines within the next couple of weeks.  This is a big deal.  We last made a change in 2011 so it’s not something that happens often or that we take lightly.  In short, the changes are to drop “dx” from DOI links and to use “http&lt;span >&lt;strong>s&lt;/strong>&lt;/span>:” rather than “http:”.  An example of the new best practice in displaying a Crossref DOI link is: &lt;a href="https://doi.org/10.1629/22161">&lt;a href="https://doi.org/10.1629/22161" target="_blank">https://doi.org/10.1629/22161&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;h3 id="span-hey-ho-8220doi8221-and-8220dx8221-have-got-to-gospan">&lt;span >Hey Ho, “doi:” and “dx” have got to go&lt;/span>&lt;/h3>
&lt;p>&lt;span >The updated Crossref DOI Display guidelines recommend that &lt;a href="https://doi.org/" target="_blank">https://doi.org/&lt;/a> be used and not &lt;a href="http://dx.doi.org/" target="_blank">http://dx.doi.org/&lt;/a> in DOI links.  Originally the “dx” separated the DOI resolver from the International DOI Foundation (IDF) website but this has changed and the IDF has already updated its recommendations so we are bringing ours in line with theirs.&lt;/span>&lt;/p>
&lt;p>&lt;span >We are also recommending the use of HTTP&lt;span >&lt;strong>S&lt;/strong>&lt;/span> because it makes for more sec&lt;/span>ure browsing.  When you use an HTTPS link, the connection between the person who clicks the DOI and the DOI resolver is secure.  This means it can’t be tampered with or eavesdropped on.  The DOI resolver will redirect to both HTTP and HTTPS URLs.&lt;/p>
&lt;h3 id="span-timing-and-backwards-compatibilityspan">&lt;span >Timing and backwards compatibility&lt;/span>&lt;/h3>
&lt;p>&lt;span >&lt;span >We are requesting all Crossref member publishers and anyone using Crossref DOIs to start following the updated guidelines as soon as possible.  But realistically we are setting a goal of &lt;span >&lt;strong>six months&lt;/strong>&lt;/span> for implementation; we realize that updating systems and websites can take time.  We at Crossref will also be updating our systems within six months - &lt;/span>&lt;span >we already use HTTPS for some of our services and our new website (coming very soon!) will use HTTPS. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >An important point about backwards compatibility is that “&lt;/span>&lt;a href="http://dx.doi.org/">&lt;span >&lt;a href="http://dx.doi.org/" target="_blank">http://dx.doi.org/&lt;/a>&lt;/span>&lt;/a>&lt;span >” and “&lt;/span>&lt;a href=http://doi.org/>&lt;span >&lt;a href="http://doi.org/" target="_blank">http://doi.org/&lt;/a>&lt;/span>&lt;/a>&lt;span >” are valid and will continue to work forever-or as long as Crossref DOIs continue to work-and we plan to be around a long time.&lt;/span>&lt;/span>&lt;/p>
&lt;h3 id="span-we-need-to-do-betterspan">&lt;span >We need to do better&lt;/span>&lt;/h3>
&lt;p>&lt;span >Reflecting on the 2011 update to the display guidelines it’s fair to say that we have been disappointed.  It is still much too common to see unlinked DOIs in the form doi:10.1063/1.3599050 or DOI: 10.1629/22161 or even unlinked in this form: &lt;a href="http://dx.doi.org/10.1002/poc.3551" target="_blank">http://dx.doi.org/10.1002/poc.3551&lt;/a> &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >What’s so wrong with this approach?  To demonstrate, please click on this DOI doi:10.1063/1.3599050 - oh, you can’t click on it?  How about I send you to a real example of a publisher page.  What I’d like you to do is click the following link and then copy the DOI you find there and come back - &lt;/span>&lt;a href="http://dx.doi.org/10.1002/poc.3551">&lt;span >&lt;a href="http://dx.doi.org/10.1002/poc.3551" target="_blank">http://dx.doi.org/10.1002/poc.3551&lt;/a>&lt;/span>&lt;/a>&lt;span >. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Are you back? I expect you had to carefully highlight the “10.1063/1.3599050” and then do “edit”, “copy”.  That wasn’t too bad but the next step is to put the DOI into an email and send it to someone.  But wait - what are they going to do with “10.1063/1.3599050”?  It’s useless.  If you want it to be useful you’ll have to add “&lt;/span>&lt;a href="http://doi.org">&lt;span >&lt;a href="http://doi.org" target="_blank">http://doi.org&lt;/a>&lt;/span>&lt;/a>&lt;span >” or &lt;/span>&lt;a href="https://doi.org/">&lt;span >&lt;a href="https://doi.org/" target="_blank">https://doi.org/&lt;/a>&lt;/span>&lt;/a>&lt;span > in the front. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >When publishers follow the guidelines it makes things easier - if you go to &lt;/span>&lt;a href="https://doi.org/10.1063/1.3599050">&lt;span >&lt;a href="https://doi.org/10.1063/1.3599050" target="_blank">https://doi.org/10.1063/1.3599050&lt;/a>&lt;/span>&lt;/a>&lt;span > you’ll note that you can just right click on the full DOI link on the page and get a full menu of options of what to do with it.  One of which is to copy the link and then you can easily paste into an email or anywhere else.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >However-putting a positive spin on the spotty adherence to the 2011 update to the DOI display guidelines-everyone has another chance with the latest set of updates to make all the changes at once! &lt;/span>&lt;/p>
&lt;h3 id="span-more-on-https-future-proofing-scholarly-linkingspan">&lt;span >More on HTTPS (future-proofing scholarly linking)&lt;/span>&lt;/h3>
&lt;p>&lt;span >We take providing the central linking infrastructure for scholarly publishing seriously.  Because we form the link between publisher sites all over the web, it’s important that we do our bit to enable secure browsing from start to finish.  In addition, HTTPS is now a ranking signal for Google &lt;a href="https://webmasters.googleblog.com/2014/08/https-as-ranking-signal.html">who gives sites using HTTPS a small ranking boost&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >The process of enabling HTTPS on publisher sites will be a long one and, given the number of members we have, it may a while before everyone’s made the transition.  But by using HTTPS we are future-proofing scholarly linking on the web.&lt;/span>&lt;/p>
&lt;p>&lt;span >Some years ago we started the process of making our new services available exclusively over HTTPS.  The Crossref Metadata API is HTTPS enabled, and Crossmark and our Assets CDN use HTTPS exclusively. Last year we collaborated with Wikipedia to make all of their DOI links HTTPS.  We hope that we’ll start to see more of the scholarly publishing industry doing the same.&lt;/span>&lt;/p>
&lt;p>&lt;span >So-it’s simple-always make the DOI a full link - &lt;a href="https://doi.org/10.1006/jmbi.1995.0238">&lt;a href="https://doi.org/10.1006/jmbi.1995.0238" target="_blank">https://doi.org/10.1006/jmbi.1995.0238&lt;/a>&lt;/a> - even when it’s on the abstract or full text page of the content that the DOI identifies - and use “&lt;a href="https://doi.org/">&lt;a href="https://doi.org/" target="_blank">https://doi.org/&lt;/a>&lt;/a>”. &lt;/span>&lt;/p></description></item><item><title>The membership boom &amp; why metadata isn’t like beer</title><link>https://www.crossref.org/blog/the-membership-boom-why-metadata-isnt-like-beer/</link><pubDate>Fri, 23 Sep 2016 00:00:00 +0000</pubDate><author>Susan Collins</author><guid>https://www.crossref.org/blog/the-membership-boom-why-metadata-isnt-like-beer/</guid><description>&lt;p>&lt;span >You might recognize my name if you’ve ever applied for Crossref membership on behalf of your organisation. It recently occurred to me that, since I’ve been working in our membership department for eight years, I’ve been a part of shepherding new members for half of our history. And my, how we’ve grown. &lt;/span>&lt;/p>
&lt;h3 id="span-bmembership-growth-by-countrybspan">&lt;span >&lt;b>Membership growth by country&lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Though it may be easy to see our membership growth by looking at the numbers, I think it’s interesting to consider where we’ve grown.  The top ten member countries have dramatically changed since Crossref began sixteen years ago.  At the end of our first year of operations, our membership included 54 publishers and affiliated organisations.  The majority were from the US and the UK, with a small number from Germany, the Netherlands, and Japan. &lt;/span>&lt;/p>
&lt;p>&lt;span >In 2012, participation in our sponsors program began to increase. Sponsors are affiliated organisations that act on behalf of smaller publishers and societies who wish to register their content with Crossref.  Several organisations from Turkey and South Korea were among the first sponsors to join and were very successful in representing a large number of publishers and societies from their regions. Soon to follow were sponsors from India, Ukraine, Russia and Brazil. In 2014, the Public Knowledge Project (PKP) became a sponsoring affiliate, focusing on smaller publishers with the aim of increasing the quality and global reach of scholarly publishing.  With the introduction of our sponsor program, the past few years have seen a steady increase in the geographical diversity of our members.  &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >There are &lt;/span>&lt;a href="http://www.worldatlas.com/nations.htm">&lt;span >194 countries&lt;/span>&lt;/a>&lt;span > in the world.  It’s pretty amazing that organisations in 112 of the world’s countries are now represented in our membership. Do I think we’ll see members joining from the other 82 nations? I don’t know but I hope so.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >A look at our trending nations chart shows the diversity of our membership as we’ve grown, depicting the countries that produced the most new members over the last two years.  There has been tremendous growth from South Korea! What I find just as interesting is that we have new members from so many different nations that they form their own special bloc, shown here as “Other.” &lt;/span>&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/blog_membergrowth.jpg" alt="Membership growth">&lt;/p>
&lt;p>&lt;span >Our growth has taken place at a remarkable rate.  When I joined Crossref in 2008, we had over 1800 publishers and affiliates and we were adding about 300 new members per year.  In 2015, nearly 1500 members joined and we are seeing even larger numbers so far in 2016.  Counting all publishers, affiliates, libraries, sponsors and represented members, our new member total through the end of August is nearly 1200 and will most certainly overtake the 2015 figure.  &lt;/span>&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/images/blog/blog_membersbyyear.png" alt="Members by year">&lt;/p>
&lt;h3 id="span-member-perceptionsspan">&lt;strong>&lt;span >Member perceptions&lt;/span>&lt;/strong>&lt;/h3>
&lt;p>&lt;span >With such a range of new members each month it’s even more important that we help people understand the benefits of joining Crossref.  That it’s not just registering metadata and DOIs but maintaining and improving records over time, and participating in reference linking.  We are adding and improving some educational tools that will help everyone understand how our services can enhance the discoverability of content, and why sharing richer metadata supports their full participation in the scholarly community.  We are in the process of developing a new, cleaner website with videos that better explain our services-to be released in the next few weeks,-a new onboarding experience, and new and improved query and deposit tools. &lt;/span>&lt;/p>
&lt;h3 id="span-bconnected-metadata-isnt-like-beerbspan">&lt;span >&lt;b>Connected metadata isn’t like beer &lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Sometimes inviting more people to a party means there is less beer to go around.  Fortunately for everyone, metadata isn’t like beer. In fact, the more metadata you draw from the tap, the more useful it becomes.  So inviting new members to join Crossref makes our community better and more valuable for everyone.  Every member uses that metadata to link their content to every other member’s content.  This makes all members’ content easier to find, link, and cite, not just at the moment it is published, but over time.   &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Members from around the globe join Crossref everyday and help guide our growing community.  If you are interested in joining please contact me at &lt;/span>&lt;a href="mailto:member@crossref.org">&lt;span >&lt;a href="mailto:member@crossref.org">member@crossref.org&lt;/a>&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p></description></item><item><title>Crossmark 2.0 - grab the code and you’re ready to go!</title><link>https://www.crossref.org/blog/crossmark-2.0-grab-the-code-and-youre-ready-to-go/</link><pubDate>Thu, 15 Sep 2016 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/crossmark-2.0-grab-the-code-and-youre-ready-to-go/</guid><description>&lt;p>&lt;span >&lt;span >On September 1st we completed the final stage of the Crossmark v2.0 release and sent an email to all participating publishers containing instructions for upgrading. The first phase of v2.0 happened when we changed the design and layout of the Crossmark box back in May of this year. That allowed us to better display the growing set of additional metadata that our members are depositing, and saw the introduction of the &lt;/span>&lt;a href="https://www.crossref.org/blog/linked-clinical-trials-are-here/">&lt;span >Linked Clinical Trials&lt;/span>&lt;/a>&lt;span > feature.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;img class="size-full wp-image-2120 alignright" src="https://www.crossref.org/wp/blog/uploads/2016/09/crossmark_stack.png" alt="crossmark_stack" width="277" height="187" />&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Now all publishers have the opportunity to &lt;/span>&lt;a href="http://crossmarksupport.crossref.org/">&lt;span >complete the upgrade&lt;/span>&lt;/a>&lt;span > by simply replacing the Crossmark button and the piece of code that calls the box. The new button designs are, we think, a much better fit for most websites, and are designed to look more like a button than a flat logo. The new buttons are also &lt;/span>&lt;a href="https://assets.crossref.org/">&lt;span >available&lt;br /> as .eps&lt;/span>&lt;/a>&lt;span > files for placement in PDFs.&lt;/span>&lt;/span>&lt;/p>
&lt;p> &lt;figure id="attachment_2125" class="wp-caption alignleft">&lt;/p>
&lt;p>&lt;img class="size-medium wp-image-2125" src="https://www.crossref.org/wp/blog/uploads/2016/09/Screenshot_20160915-154051-300x182.jpg" alt="Crossmark box on a mobile phone" width="300" height="182" srcset="https://www.crossref.org/wp/blog/uploads/2016/09/Screenshot_20160915-154051-300x182.jpg 300w, https://www.crossref.org/wp/blog/uploads/2016/09/Screenshot_20160915-154051-768x467.jpg 768w, https://www.crossref.org/wp/blog/uploads/2016/09/Screenshot_20160915-154051-1024x623.jpg 1024w, https://www.crossref.org/wp/blog/uploads/2016/09/Screenshot_20160915-154051.jpg 1184w" sizes="(max-width: 300px) 85vw, 300px" />&lt;figcaption class="wp-caption-text">Crossmark box on a mobile phone&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >Most importantly, switching to 2.0 makes the Crossmark box responsive for better display on mobile devices.&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p> &lt;/p>
&lt;p>&lt;span >&lt;span >Just two weeks after the code release a number of publishers have already upgraded and are running Crossmark 2.0 on their content. Congrats to the &lt;/span>&lt;a href="http://www.dx.doi.org/10.11604/pamj.2016.24.338.8455">&lt;span >Pan African Medical Journal&lt;/span>&lt;/a>&lt;span > who were the first member to upgrade just a couple of days after the release.  Of course we realise that many members will need time to schedule the upgrade, and while we are keen to see as many early adopters as possible, we will support version 1.5 of Crossmark through to the end of March 2017.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >If your content is running Crossmark 2.0 we would love to see it. &lt;/span>&lt;a href="mailto:crossmark_info@crossref.org">&lt;span >Drop us a line&lt;/span>&lt;/a>&lt;span > or put a link in the comments below.&lt;/span>&lt;/span>&lt;/p></description></item><item><title>Using the Crossref Metadata API. Part 2 (with PaperHive)</title><link>https://www.crossref.org/blog/using-the-crossref-metadata-api.-part-2-with-paperhive/</link><pubDate>Thu, 08 Sep 2016 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/using-the-crossref-metadata-api.-part-2-with-paperhive/</guid><description>&lt;p>&lt;span >We first met the team from &lt;/span>&lt;a href="https://paperhive.org/" target="_blank">&lt;span >PaperHive&lt;/span>&lt;/a> &lt;span >at SSP in June, pointed them in the direction of the &lt;/span>&lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md" target="_blank">&lt;span >Crossref Metadata API&lt;/span>&lt;/a> &lt;span >and let things progress from there. That’s the nice thing about having an API - because it’s a common and easy way for developers to access and use metadata, it makes it possible to use with lots of diverse systems and services.&lt;/span>&lt;/p>
&lt;p>&lt;span >So how are things going? Alexander Naydenov, PaperHive’s Co-founder gives us an update on how they’re working with the Crossref metadata: &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>PaperHive&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >PaperHive is a web-platform for collaborative reading and a cross­-publisher layer of   interaction on top of research documents. It lets researchers communicate in published documents in a productive and time-saving way. PaperHive thus puts academic literature, which is integrated with the platform, in the limelight and increases content usage and reader engagement.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://paperhive.org/">&lt;img class="size-medium wp-image-2051 alignright" src="https://www.crossref.org/wp/blog/uploads/2016/09/Logo-PaperHive-300x59.png" alt="Logo PaperHive" width="300" height="59" srcset="https://www.crossref.org/wp/blog/uploads/2016/09/Logo-PaperHive-300x59.png 300w, https://www.crossref.org/wp/blog/uploads/2016/09/Logo-PaperHive-768x151.png 768w, https://www.crossref.org/wp/blog/uploads/2016/09/Logo-PaperHive.png 800w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>Transforming reading into a process of collaboration gives researchers a reason to return to the content and discover new enrichments they can benefit from. Functionality like hiving, deep linking, and the PaperHive browser extension embeds communication in the researcher’s workflow. PaperHive is free to use!&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>How is the Crossref API used within PaperHive?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >PaperHive extends the concept of a living document and offers an innovative way of displaying content without hosting it. Instead, academic documents are dynamically pulled from the publisher’s servers thus ensuring compliance with content licensing. It enables readers to stay in touch with the articles of interest beyond just saving them in an offline folder.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref is the common ground on which third party companies and initiatives can build valuable services for publishers and researchers. It facilitates the integration of content into PaperHive by providing the metadata of articles and books from numerous publishers independent of the technology behind their content platforms. Moreover, if the publishers provide ORCID identifiers of authors in the Crossref metadata, researchers can immediately interact with the readers of their works.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What are the future plans for PaperHive?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >In addition to integrating further publishers’ content and extending PaperHive’s feature set for readers, we also plan to extend our partnerships with other technology providers.&lt;/span>&lt;/p>
&lt;p>&lt;span >As far as our cooperation with Crossref is concerned, we are looking forward to the implementation of the&lt;/span> &lt;a href="http://eventdata.crossref.org/" target="_blank">&lt;span >Crossref Event Data API&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What else would you like to see in Crossref metadata?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >-&lt;/span> &lt;span >      &lt;/span>&lt;span >The quality of the existing metadata should be improved significantly. We noticed that important fields such as author or title are missing in the metadata of many documents. PaperHive ignores articles and books with incomplete metadata because it impairs the user experience. Publishers, authors and readers can only benefit from the wider and more active usage of content, so we hope that more publishers will improve the data their provide Crossref with.&lt;/span>&lt;/p>
&lt;p>&lt;span >-&lt;/span> &lt;span >      &lt;/span>&lt;span >Since researchers are working with full texts on PaperHive, it would be great if  links to the full text are provided in the metadata of all articles and books. The metadata should also contain information about the format of the full text (e.g., PDF, EPUB, HTML).&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Thanks Alex!&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Just getting started with the API or what to know more? Get in touch via &lt;a href="mailto:feedback@crossref.org">&lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>&lt;/a> and pass on your questions and comments.&lt;/b>&lt;/span>&lt;/p>
&lt;p> &lt;/p></description></item><item><title>Linking Publications to Data and Software</title><link>https://www.crossref.org/blog/linking-publications-to-data-and-software/</link><pubDate>Wed, 07 Sep 2016 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/linking-publications-to-data-and-software/</guid><description>&lt;h3 id="tldr">TL;DR&lt;/h3>
&lt;p>Crossref and Datacite provide a service to link publications and data. The easiest way for Crossref members to participate in this is to cite data using DataCite DOIs and to include them in the references within the metadata deposit. These data citations are automatically detected. Alternatively and/or additionally, Crossref members can deposit data citations (regardless of identifier) as a relation type in the metadata. Data &amp;amp; software citations from both methods are freely propagated. This blog post also describes how to retrieve the links collected between publication and data &amp;amp; software.&lt;/p>
&lt;!--
&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/09/Data-blog-post.002-1.jpeg">&lt;img class="alignright wp-image-2075 " src="https://www.crossref.org/wp/blog/uploads/2016/09/Data-blog-post.002-1-300x199.jpeg" width="280" height="186" srcset="https://www.crossref.org/wp/blog/uploads/2016/09/Data-blog-post.002-1-300x199.jpeg 300w, https://www.crossref.org/wp/blog/uploads/2016/09/Data-blog-post.002-1.jpeg 542w" sizes="(max-width: 280px) 85vw, 280px" />&lt;/a>&lt;/span>
-->
&lt;hr>
&lt;p align="center">
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/09/Data-blog-post.002-1-300x199.jpeg"/>
&lt;/p>
&lt;p>Data &amp;amp; software citation is good research practice (&lt;a href="http://www.stm-assoc.org/2012_06_14_STM_DataCite_Joint_Statement.pdf">DataCite-STM Joint Statement&lt;/a> and FORCE11 &lt;a href="https://www.force11.org/group/joint-declaration-data-citation-principles-final">Joint Declaration of Data Citation Principles&lt;/a>) and is part of the scholarly ecosystem supporting research validation and reproducibility&lt;/span>&lt;span >. Data &amp;amp; software citation is also instrumental in enabling the reuse and verification of these research outputs, tracking their impact, and creating a scholarly structure that recognises and rewards those involved in producing them.&lt;/p>
&lt;p>&lt;span >Crossref supports the propagation of data &amp;amp; software citations alongside a publisher’s standard bibliographic metadata. members deposit the data citation link as part of the overall publication metadata when registering their content. Crossref partners with DataCite and together, we jointly provide a clearinghouse for the citations collected. These are all made freely available to the community as open data.&lt;/span>&lt;/p>
&lt;p>Citation practices are evolving across different communities of practice. Crossref’s offering is flexible and easily accommodates variations and changes, since it does not rely on a specific set of citation metadata elements, citation format, nor manner of credit and attribution. Publishers deposit data &amp;amp; software citations in their metadata deposit via a) references and/or b) relation type.&lt;/p>
&lt;h3 id="method-a-bibliographic-references">Method A: Bibliographic references&lt;/h3>
&lt;p>&lt;span >Crossref and DataCite have partnered to provide automatic linking between publications registered with Crossref and datasets bearing DataCite DOIs. This is the most efficient and effective way to ensure that data citations are fully integrated into the scholarly research information network with full and accurate metadata.&lt;/span>&lt;/p>
&lt;p>&lt;span >All data &amp;amp; software citations that include datasets bearing a DataCite DOI are eligible for auto-update linking with Crossref. In this method: authors cite the dataset or software containing the DataCite DOI per journal article submission guidelines and add it to the article citation list (c.f. &lt;/span>&lt;a href="https://web.archive.org/web/20171019061351/https://force11.org/node/4771" target="_blank">&lt;span >FORCE11 citation placement&lt;/span>&lt;/a>&lt;span >, &lt;/span>&lt;a href="https://www.force11.org/software-citation-principles" target="_blank">&lt;span >FORCE11 Software Citation Principles&lt;/span>&lt;/a>&lt;span >). Publishers then deposit references as part of their standard practice when registering content. Crossref checks every reference deposited for a DOI. If the DOI is identified as DataCite’s, we automatically link it to the article. &lt;/span>&lt;strong>With this method, no additional action is needed when publishers register their content with Crossref.&lt;/strong>&lt;/p>
&lt;p>Data citation links to non-DataCite DOIs can only be exposed in the references if the publisher makes references openly available. Even in the event that the data citation is shared, it remains undifferentiated from other references. Method B described below offers another approach.&lt;/p>
&lt;h3 id="method-b-relation-type">Method B: Relation type&lt;/h3>
&lt;p>&lt;span >Publishers can link their publication to a variety of associated research objects as part of the article metadata directly in the metadata deposited to Crossref, including data &amp;amp; software, protocols, videos, published peer reviews, preprints, conference papers, etc. Doing so not only groups digital objects together, but formally associates them with the publication. Each link is a relationship and the sum of all these relationships constitutes a ‘&lt;/span>&lt;a href="https://www.crossref.org/blog/the-article-nexus-linking-publications-to-associated-research-outputs/">&lt;span >research article nexus&lt;/span>&lt;/a>&lt;span >.’ Data &amp;amp; software citations are a valuable part of this.&lt;/span>&lt;/p>
&lt;p>&lt;span >To tag the citation in the metadata deposit, we ask for: &lt;/span>&lt;/p>
&lt;li >
&lt;span >description of dataset or software (optional) &lt;/span>
&lt;/li>
&lt;li >
&lt;span >dataset or software identifier &lt;/span>
&lt;/li>
&lt;li >
&lt;span >identifier type&lt;/span>
&lt;/li>
&lt;li >
&lt;a href="https://support.crossref.org/hc/en-us/articles/214357426">&lt;span >relationship type&lt;/span>&lt;/a>&lt;span >. &lt;/span>
&lt;/li>
&lt;span >Crossref can accommodate research outputs with any identifier, though we currently only validate DOI relationships during metadata processing. Technical details are documented in the &lt;/span>[&lt;span >Data &amp; Software Citations Deposit Guide&lt;/span>][4]&lt;span >. &lt;/span>
&lt;h3 id="combining-methods-increases-total-available-citations">Combining methods increases total available citations&lt;/h3>
&lt;p>&lt;span >The two methods are independent and can be used exclusively or jointly. Each caters to a different set of conditions and their practical considerations. See &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/215787303#benefits" target="_blank">&lt;span >the comparison of benefits and limitations&lt;/span>&lt;/a> &lt;span >for each method in the deposit guide. We recommend that publishers use both methods where possible at this time for optimum specificity and coverage. &lt;/span>&lt;/p>
&lt;h3 id="how-to-access-data--software-citations">How to access data &amp;amp; software citations&lt;/h3>
&lt;p>&lt;span >Crossref and DataCite make the data &amp;amp; software citations deposited by Crossref members and DataCite data repositories openly available to a wide host of parties, including both Crossref and DataCite communities as well as the extended research ecosystem (funders, research organisations, technology and service providers, research data frameworks such as Scholix, etc.).&lt;/span>&lt;/p>
&lt;p>&lt;span >Data &amp;amp; software citations from references can be accessed via the &lt;/span>&lt;a href="http://eventdata.crossref.org/guide/" target="_blank">&lt;span >Crossref Event Data API&lt;/span>&lt;/a> &lt;span > Citations included directly into the metadata by relation type can be accessed via &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/213420286" target="_blank">&lt;span >Crossref’s APIs&lt;/span>&lt;/a> &lt;span >in a number of formats (REST, OAI-­PMH, OpenURL). (A single channel containing data &amp;amp; software citations across interfaces is in development and will be released next year.)&lt;/span>&lt;/p>
&lt;p>&lt;span >Publishers, visit our detailed &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/215787303-Crossref-Data-Software-Citation-Deposit-Guide-for-Publishers" target="_blank">&lt;span >guide on how to deposit data and software citations&lt;/span>&lt;/a>&lt;span >. We welcome your questions and concerns at &lt;/span>&lt;a href="mailto:Feedback@crossref.org">&lt;span >feedback@crossref.org&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/p>
&lt;p> &lt;/p>
&lt;p>&lt;em>&lt;span >Special thanks to the following who provided valuable feedback in developing the guide: Martin Fenner (DataCite), Amye Kenall (Springer Nature), Brooks Hanson (AGU), Shelley Stall (AGU), and the &lt;/span>&lt;/em>&lt;a href="https://web.archive.org/web/20201024154446/https://force11.org/group/dcip/eg3publisherearlyadopters" target="_blank">&lt;em>&lt;span >FORCE11 Data Citation Implementation Pilot publisher’s subgroup&lt;/span>&lt;/em>&lt;/a>&lt;em>&lt;span >.&lt;/span>&lt;/em>&lt;/p></description></item><item><title>Crossref’s Annual Meeting is now Crossref LIVE16</title><link>https://www.crossref.org/blog/crossrefs-annual-meeting-is-now-crossref-live16/</link><pubDate>Fri, 02 Sep 2016 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/crossrefs-annual-meeting-is-now-crossref-live16/</guid><description>&lt;p>Everyone is invited to our free annual event this 1-2 November in London. &lt;a href="https://www.eventbrite.com/e/crossref-live16-registration-25928526922?aff=ehomesaved" target="_blank">(Register here)&lt;/a>!&lt;/p>
&lt;/p>
&lt;p class="p1">
In years past, only Crossref members typically attended the [Crossref Annual Meeting](/crossref-live-annual). &lt;span class="s1" >This year, we looked at the event with new eyes. We realized that we’d have even richer conversations, more creative energy, and the meeting would be even better for our members if we could rally the entire community together.  So we decided to re-develop our annual event from the ground-up. &lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/08/crossref_live16_rgb.jpg">&lt;img class="size-medium wp-image-2008 alignleft" src="https://www.crossref.org/wp/blog/uploads/2016/08/crossref_live16_rgb-300x115.jpg" alt="Logo for Crossref LIVE 16" width="300" height="115" srcset="https://www.crossref.org/wp/blog/uploads/2016/08/crossref_live16_rgb-300x115.jpg 300w, https://www.crossref.org/wp/blog/uploads/2016/08/crossref_live16_rgb-768x295.jpg 768w, https://www.crossref.org/wp/blog/uploads/2016/08/crossref_live16_rgb-1024x393.jpg 1024w, https://www.crossref.org/wp/blog/uploads/2016/08/crossref_live16_rgb-1200x461.jpg 1200w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >The result is Crossref LIVE16, an event with a new format and a new focus on the entirety of the scholarly communications community.  We are opening doors for the whole community, welcoming publishers, librarians, researchers, funders, technology providers, and Crossref members alike. &lt;/span>
&lt;/p>
&lt;ul class="ul1">
&lt;li class="li1">
&lt;span >&lt;b>&lt;/b>&lt;span class="s1">&lt;b>1st November - Mashup Day, from 12 noon&lt;/b>: an afternoon of interactive activities including mingling with the Crossref team and special guests, trying out our services, live troubleshooting, and exclusive previews of some exciting things we’re working on. Plus entertainment and refreshments at an early evening reception.&lt;/span>&lt;/span>
&lt;/li>
&lt;/ul>
&lt;ul class="ul1">
&lt;li class="li1">
&lt;span >&lt;b>&lt;/b>&lt;span class="s1">&lt;b>2nd November - Conference Day&lt;/b>: a full-day plenary session with distinguished keynote speakers including &lt;a href="http://nycdh.org/members/ah160/">April Hathcock&lt;/a> (NYU), &lt;a href="https://strasser.github.io/">Carly Strasser&lt;/a> (Moore Foundation), &lt;a href="https://www.digital-science.com/people/ian-calvert/">Ian Calvert&lt;/a> (Digital Science), and &lt;a href="https://meta.wikimedia.org/wiki/User:Dario_(WMF)">Dario Taraborelli&lt;/a> (Wikimedia Foundation). We will provide the most important updates about our services, and share our vision and strategies for the future.&lt;/span>&lt;/span>
&lt;/li>
&lt;/ul>
&lt;p class="p1">
&lt;span class="s1" >&lt;em>Note:&lt;/em> You are welcome to join us for both days or just one day, as you like.&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&lt;b>Location: &lt;/b>The Royal Society, London, UK.   &lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span >&lt;b>&lt;/b>&lt;span class="s1">We hope you will join us, and extend this invitation to your colleagues.&lt;/span>&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >This is going to be fun.&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span >&lt;a href="https://www.eventbrite.com/e/crossref-live16-registration-25928526922?aff=ehomesaved">&lt;span class="s1">&lt;b>Register here&lt;/b>&lt;/span>&lt;/a>&lt;/span>
&lt;/p></description></item><item><title>Announcing PIDapalooza - a festival of identifiers</title><link>https://www.crossref.org/blog/announcing-pidapalooza-a-festival-of-identifiers/</link><pubDate>Tue, 30 Aug 2016 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/announcing-pidapalooza-a-festival-of-identifiers/</guid><description>&lt;div style="float:left;margin:10px">
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/08/sideA-300x213.jpg" alt="sideA" width="300" height="213" />
&lt;/div>
&lt;p>The buzz is building around PIDapalooza - the first open festival of scholarly research persistent identifiers (PID), to be held at the &lt;a href="https://www.radissonblu.com/en/sagahotel-reykjavik" target="_blank">Radisson Blu Saga Hotel Reykjavik&lt;/a>on November 9-10, 2016.&lt;/p>
&lt;p>&lt;span >PIDapalooza will bring together creators and users of PIDs from around the world to shape the future PID landscape through the development of tools and services for the research community. PIDs support proper attribution and credit, promote collaboration and reuse, enable reproducibility of findings, foster faster and more efficient progress, and facilitate effective sharing, dissemination, and linking of scholarly works.&lt;/span>&lt;/p>
&lt;p>&lt;span >We believe that by bringing together everyone who’s working with PIDs for two days of discussions, demos, workshops, brainstorming, updates on the state of the art, and more, we can make this happen faster. And you can help by giving us your input on which sessions would be most valuable. Please send us your ideas, using this &lt;/span>&lt;a href="https://docs.google.com/forms/d/e/1FAIpQLSej7YKQVCPTTCo8zeIS-ODjtsb5SIS299uZZBo8ZN6yD0WI5Q/viewform?c=0&amp;amp;w=1&amp;amp;usp=send_form" target="_blank">&lt;span >form&lt;/span>&lt;/a> &lt;span >by September 18. We will send session proposal notifications the first week of October with the festival lineup.&lt;/span>&lt;/p>
&lt;h5 id="register-to-attend">&lt;strong>Register to attend&lt;/strong>&lt;/h5>
&lt;p>&lt;a href="http://pidapalooza.eventbrite.com" target="_blank">&lt;strong>Registration is now open&lt;/strong>&lt;/a> &lt;strong>— c&lt;/strong>&lt;span >ome join the festival with a crowd of like-minded innovators. And please help us spread the word about PIDapalooza in your community! &lt;/span>&lt;/p>
&lt;p>&lt;span >Stay updated with the latest news on on the &lt;/span>&lt;a href="http://pidapalooza.org/" target="_blank">&lt;span >PIDapalooza website&lt;/span>&lt;/a> &lt;span >and on Twitter (&lt;/span>&lt;a href="http://twitter.com/pidapalooza" target="_blank">&lt;span >@PIDapalooza&lt;/span>&lt;/a>&lt;span >) in the coming weeks.&lt;/span>&lt;/p>
&lt;p>&lt;span >Looking forward to seeing you in November! &lt;/span>&lt;/p>
&lt;p> &lt;/p></description></item><item><title>Preprints and Crossref’s metadata services</title><link>https://www.crossref.org/blog/preprints-and-crossrefs-metadata-services/</link><pubDate>Mon, 29 Aug 2016 00:00:00 +0000</pubDate><author>Chuck Koscher</author><guid>https://www.crossref.org/blog/preprints-and-crossrefs-metadata-services/</guid><description>&lt;p>We’re putting the final touches on the changes that will allow preprint publishers to register their metadata with Crossref and assign DOIs. These changes support Crossref’s CitedBy linking between the preprint and other scholarly publications (journal articles, books, conference proceedings). Full preprint support will be released over the next few weeks.&lt;/p>
&lt;p>I’d like to mention one change that will be immediately visible to Crossref members who use our OAI based service to retrieve CitedBy links to their content.&lt;/p>
&lt;p>This API, show in an example here, is intended to retrieve large quantities of data detailing all the CitedBy links to a given publication. The example request shows pulling the data for an IEEE conference proceeding.&lt;/p>
&lt;p >
example:
&lt;/p>
&lt;p >
http://oai.crossref.org/OAIHAndler?verb=ListRecords&amp;usr=*** pwd=****&amp;set=B:10.1109:1070762&amp;metadataPrefix=cr_citedby
&lt;/p>
&lt;div class="" >
&lt;/div>
&lt;p>With the new change, results will now identify the type of content that is doing the citing. The example results below shows that the DOI 10.1109/CSMR.2012.14  is cited by five other items and displays the DOIs of those items and their record type.&lt;/p>
&lt;p >
&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.12.24-AM.png">&lt;img class="alignnone wp-image-2031" src="https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.12.24-AM-300x235.png" alt="Screen Shot 2016-08-29 at 9.12.24 AM" width="436" height="341" srcset="https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.12.24-AM-300x235.png 300w, https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.12.24-AM.png 536w" sizes="(max-width: 436px) 85vw, 436px" />&lt;/a>
&lt;/p>
&lt;p >
When preprint content that cites other scholarly work starts being registered with Crossref, members using this API will start seeing data like the following:
&lt;/p>
&lt;p >
&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.20.15-AM.png">&lt;img class="alignnone wp-image-2032" src="https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.20.15-AM-300x197.png" alt="Screen Shot 2016-08-29 at 9.20.15 AM" width="432" height="284" srcset="https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.20.15-AM-300x197.png 300w, https://www.crossref.org/wp/blog/uploads/2016/08/Screen-Shot-2016-08-29-at-9.20.15-AM.png 490w" sizes="(max-width: 432px) 85vw, 432px" />&lt;/a>
&lt;/p>
&lt;p >
For many users of Crossref metadata the introduction of preprints will be transparent until preprint content starts being registered. However, a few changes like the one above have benefits not limited to just preprints.
&lt;/p></description></item><item><title>The article nexus: linking publications to associated research outputs</title><link>https://www.crossref.org/blog/the-article-nexus-linking-publications-to-associated-research-outputs/</link><pubDate>Thu, 25 Aug 2016 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/the-article-nexus-linking-publications-to-associated-research-outputs/</guid><description>&lt;p>&lt;span >Crossref began its service by linking publications to other publications&lt;/span> &lt;span >via references.&lt;/span> &lt;span >Today, this extends to relationships with associated entities. People (authors, reviewers, editors, other collaborators), funders, and research affiliations are important players in this story. Other metadata also figure prominently in it as well: references, licenses and access indicators, publication history (updates, revisions, corrections, retractions, publication dates), clinical trial and study information, etc. The list goes on.&lt;/span>&lt;/p>
&lt;p>What is lesser known (and utilized) is that Crossref is increasingly linking publications to associated scholarly artifacts. At the bottom of it all, these links can help researchers better understand, reproduce, and build off of the results in the paper. But associated research objects can enormously bolster the research enterprise in many ways (e.g., discovery, reporting, evaluation, etc.).&lt;/p>
&lt;blockquote>
&lt;p>With all the relationships declared across all 80+ million Crossref metadata records, Crossref creates a global metadata graph across subject areas and disciplines that can be used by all.&lt;/p>
&lt;/blockquote>
&lt;h3 id="research-article-nexus">Research article nexus&lt;/h3>
&lt;p>&lt;span >As research increasingly goes digital, more research artifacts associated with the formal publication are stored or shared online. We see a plethora of materials closely connected to publications, including: versions, peer reviews, datasets generated or analysed in the research, software packages used in the analysis, protocols and related materials, preprints, conference posters, language translations, comments, etc. Occasionally, these resources are linked from the publication. But very rarely are these relationships made available beyond the publisher platform. &lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref will make these relationships available to the broader research ecosystem. When publishers register content for a publication, they can identify the associated scholarly artifacts directly in the article metadata. Doing so not only groups digital objects together, but formally associates with the publication. Each link is a relationship and the sum of all these relationships constitutes a “&lt;/span>&lt;strong>research article nexus.&lt;/strong>&lt;span >”&lt;/span>&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px.png">&lt;img class="alignnone wp-image-1990 size-large" src="https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px-1024x956.png" width="840" height="784" srcset="https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px-1024x956.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px-300x280.png 300w, https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px-768x717.png 768w, https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px-1200x1120.png 1200w, https://www.crossref.org/wp/blog/uploads/2016/08/DOI-network-diagram_v3_600x560px.png 1250w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 1362px) 62vw, 840px" />&lt;/a>&lt;/p>
&lt;p>An assortment of connections already abound in the wild today. Examples include:&lt;/p>
&lt;li >
&lt;span >F1000Research article &lt;a href="http://doi.org/10.12688/f1000research.2-198.v3">http://doi.org/10.12688/f1000research.2-198.v3&lt;/a>&lt;/span>&lt;span > connected to initial version &lt;a href="http://doi.org/10.12688/f1000research.2-198.v1">http://doi.org/10.12688/f1000research.2-198.v1&lt;/a> &lt;/span>
&lt;/li>
&lt;li >
&lt;span >OECD publication &lt;/span>&lt;a href="http://doi.org/10.1787/empl_outlook-2014-en">&lt;span >http://dx.doi.org/10.1787/empl_outlook-2014-en&lt;/span>&lt;/a>&lt;span > and its German translation &lt;/span>&lt;a href="http://doi.org/10.1787/empl_outlook-2014-de">&lt;span >http://dx.doi.org/10.1787/empl_outlook-2014-de&lt;/span>&lt;/a>
&lt;/li>
&lt;li >
&lt;span >PeerJ article &lt;/span>&lt;a href="https://doi.org/10.7717/peerj.1135">&lt;span >http://doi.org/10.7717/peerj.1135&lt;/span>&lt;/a>&lt;span > and its peer review &lt;a href="http://doi.org/10.7287/peerj.1135v0.1/reviews/3">http://doi.org/10.7287/peerj.1135v0.1/reviews/3&lt;/a> &lt;/span>
&lt;/li>
&lt;li >
&lt;span >eLife article &lt;/span>&lt;a href="http://doi.org/10.7554/eLife.09771">&lt;span >http://doi.org/10.7554/eLife.09771&lt;/span>&lt;/a>&lt;span > and its BioArXiv preprint &lt;/span>&lt;a href="http://doi.org/10.1101/018317">&lt;span >http://doi.org/10.1101/018317&lt;/span>&lt;/a>
&lt;/li>
&lt;li >
&lt;span >PLOS ONE article &lt;/span>&lt;a href="http://dx.doi.org/10.1371/journal.pone.0161541">&lt;span >http://doi.org/10.1371/journal.pone.0161541&lt;/span>&lt;/a>&lt;span > with underlying data in Dryad &lt;/span>&lt;a href="http://dx.doi.org/10.5061/dryad.d2vf8">&lt;span >http://doi.org/10.5061/dryad.d2vf8&lt;/span>&lt;/a>
&lt;/li>
&lt;li >
&lt;span >Frontiers article &lt;/span>&lt;a href="http://dx.doi.org/10.3389/fevo.2015.00015">&lt;span >http://doi.org/10.3389/fevo.2015.00015&lt;/span>&lt;/a>&lt;span > with a figshare &lt;/span>&lt;a href="https://dx.doi.org/10.6084/m9.figshare.1305089.v1">&lt;span >http://doi.org/10.6084/m9.figshare.1305089.v1&lt;/span>&lt;/a>&lt;span > video &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Journal of Chemical Theory and Computation article &lt;/span>&lt;a href="http://doi.org/10.1021/ct400399x">&lt;span >http://doi.org/10.1021/ct400399x&lt;/span>&lt;/a>&lt;span > with software archived in Zenodo &lt;/span>&lt;a href="http://doi.org/10.5281/zenodo.60678">&lt;span >http://doi.org/10.5281/zenodo.60678&lt;/span>&lt;/a>
&lt;/li>
&lt;li >
&lt;span >Nature Biotech article &lt;/span>&lt;a href="http://doi.org/10.1038/nbt.3481">&lt;span >http://doi.org/10.1038/nbt.3481&lt;/span>&lt;/a>&lt;span > with a Protocols.io protocol &lt;/span>&lt;a href="http://doi.org/10.17504/protocols.io.dm649d">&lt;span >http://doi.org/10.17504/protocols.io.dm649d&lt;/span>&lt;/a>
&lt;/li>
&lt;p>To date, almost all these relationships are not directly recorded in the article metadata (great job, PeerJ!). And as a result, they are more than likely “invisible” to the broader scholarly research ecosystem. Publishers can remedy these gaps by depositing associations when registering content with Crossref or updating the records after registration. That is how the article nexus is formed.&lt;/p>
&lt;p>&lt;span >(Associated datasets can also be identified in the reference list as per &lt;/span>&lt;a href="https://www.force11.org/group/joint-declaration-data-citation-principles-final" target="_blank">&lt;span >Joint Declaration of Data Citation Principles&lt;/span>&lt;/a> &lt;span >as with the &lt;/span>&lt;a href="https://www.force11.org/software-citation-principles" target="_blank">&lt;span >FORCE11 Software Citation Principles&lt;/span>&lt;/a>&lt;span >. &lt;/span>&lt;em>&lt;span >Stay tuned next week for a follow up blog post on Crossref’s support for publisher data and software citations through its metadata.&lt;/span>&lt;/em>&lt;span >)&lt;/span>&lt;/p>
&lt;h3 id="forming-the-nexus">Forming the nexus&lt;/h3>
&lt;p>&lt;span >The mechanism of declaring these relationships is straightforward and a longstanding part of the standard deposit process. For each associated research object, simply provide the identifier and identifier type for the object, an optional description of it, as well as name the relationship into the metadata record. For the latter, Crossref and DataCite share a closed list of relationship types, which ensures interoperability between mappings. See Crossref &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/214357426-Relationships-between-DOIs-and-other-objects" target="_blank">&lt;span >technical documentation&lt;/span>&lt;/a> &lt;span >for more details. &lt;/span>&lt;/p>
&lt;p>&lt;span >We maintain a &lt;/span>&lt;a href="http://support.crossref.org/hc/en-us/articles/214357426#aro" target="_blank">&lt;span >list of the recommended relation types&lt;/span>&lt;/a> &lt;span >for a host of associated research objects to promote standardization across publishers. If you have relationships not specified, please contact us at &lt;/span>&lt;a href="mailto:Feedback@crossref.org">&lt;span >feedback@crossref.org&lt;/span>&lt;/a> &lt;span >to identify a suitable one considered best practice. Common adoption of relation types will make relationship metadata useful to tool builders and systems. For example, programmatic queries on supporting materials require proper tagging of their respective relationship types.&lt;/span>&lt;/p>
&lt;p>This approach is highly extensible and accommodates the introduction of new research object forms as they emerge. It also supports associated research objects regardless of identifier type. When an associated entity has a DOI, however, we can validate the relationship during metadata processing as well as provide a more reliable representation of the article nexus.&lt;/p>
&lt;h3 id="article-nexus-a-far-richer-scholarly-map">Article nexus: a far richer scholarly map&lt;/h3>
&lt;p>&lt;span >Bibliographic metadata is like a ship’s manifest that catalogs each item of cargo in a ship’s hold - crate, drum, sack, and barrel. It identifies the components that have an internal relation to the publication (contributor, funder, article update, license, etc.), each of which are well-understood points on the scholarly map. But when we integrate the article nexus into the graph, new territories become visible - not isolated islands, but places with highways connecting them to addresses already known.&lt;/span>&lt;/p>
&lt;p>&lt;span >When a publication has its relationships clearly identified, the connections both go out as well as lead back to it. The more connections, the more visibility on the &lt;/span>&lt;a href="https://www.crossref.org/blog/crossref-the-art-of-cartography-an-open-map-for-scholarly-communications/">&lt;span >scholarly map, as the Art of Cartography&lt;/span>&lt;/a> &lt;span >goes. Numerous systems tap into this map: publishing, funders, research institutions, research councils, indexers &amp;amp; repositories, indexers, research information systems, lab &amp;amp; diagnostics systems, reference management and literature discovery, other PID suppliers. So publishers, you can provide the fullest value to your own publishing operation, your authors, their research communities, and the overall research enterprise by ensuring that all publications are fully linked both inside and out.&lt;/span>&lt;/p></description></item><item><title>Using the Crossref Metadata API. Part 1 (with Authorea)</title><link>https://www.crossref.org/blog/crossref-metadata-api-part-1-authorea/</link><pubDate>Thu, 18 Aug 2016 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/crossref-metadata-api-part-1-authorea/</guid><description>&lt;p>&lt;span >&lt;span >Did you know that we have a shiny, not so new, &lt;/span>&lt;a href="https://github.com/CrossRef/rest-api-doc">&lt;span >API&lt;/span>&lt;/a>&lt;span > kicking around? If you missed &lt;/span>&lt;a href="https://www.crossref.org/blog/researchers-just-wanna-have-funds/">&lt;span >Geoffrey’s post in 2014 &lt;/span>&lt;/a>&lt;span >(or don’t want a Cyndi Lauper song stuck in your head all day), the short explanation is that the Crossref Metadata API exposes the information that publishers provide Crossref when they register their content with us. And it’s not just the bibliographic metadata either-funding and licensing information, full-text links (useful for text-mining), ORCID iDs and update information (via Crossmark)-are all available, if included in the publishers’ metadata. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Interested? This is the kickoff a series of case studies on the innovative and interesting things people are doing with the Metadata API. Welcome to Part 1.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What can you do with the Metadata API?&lt;/b>&lt;/span>&lt;/p>
&lt;li >
&lt;span >&lt;span >Build search interfaces. We’ve built some ourselves. Check out &lt;/span>&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//">&lt;span >Crossref Metadata Search&lt;/span>&lt;/a>&lt;span > to search the metadata of over 80 million journal articles, books, standards, datasets &amp; more. Or &lt;/span>&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//funding">&lt;span >Crossref Funder Search&lt;/span>&lt;/a>&lt;span > t&lt;/span>&lt;span >o search nearly 15,000 funders and the 982,162 records we have that contain funding data. &lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >Provide cross-publisher support for &lt;a href="https://www.crossref.org/documentation/retrieve-metadata/rest-api/text-and-data-mining/">text and data mining&lt;/a> applications.&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Get really interesting top-level reports on the metadata Crossref holds - or look at subsets of the information you’re interested in. &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Third parties are free to build their own products and tools that build off of the Metadata API (below are some of the many examples that we will highlight in this series).&lt;/span>
&lt;/li>
&lt;p>&lt;span >Importantly, there’s no sign-up required to use the Metadata API - the data are facts from members, therefore not subject to copyright and free to use for whatever purpose anyone chooses. &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >To help, Scott Chamberlain of rOpenSci has built a set of &lt;/span>&lt;a href="https://github.com/ropensci/rcrossref">&lt;span >robust libraries for accessing the Metadata API&lt;/span>&lt;/a>&lt;span >. These libraries are now available in the R, Python and Ruby languages. &lt;/span>&lt;a href="https://www.crossref.org/blog/python-and-ruby-libraries-for-accessing-the-crossref-api/">&lt;span >Scott’s blog post&lt;/span>&lt;/a>&lt;span > has some great information on those. For those using the libraries, there have been a few updates since Scott’s post - &lt;/span>&lt;a href="https://github.com/sckott/serrano/blob/master/CHANGELOG.md#022-2016-06-07">&lt;span >to serrano&lt;/span>&lt;/a>&lt;span >, and support for field queries has been &lt;/span>&lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md#field-queries">&lt;span >added to habanero &lt;/span>&lt;/a>&lt;span >(coming to serrano and rCrossref soon). Any feedback/bug reports can be submitted via the GitHub repos &lt;a href="https://github.com/sckott/serrano">serrano&lt;/a> or &lt;a href="https://github.com/sckott/habanero">habanero&lt;/a>&lt;/span>&lt;span >. &lt;/span>&lt;span >There’s also a&lt;a href="https://github.com/scienceai/crossref"> javascript library&lt;/a>, &lt;/span>&lt;span >authored by &lt;/span>&lt;span >Robin Berjon&lt;/span>&lt;span >. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Who’s using the Crossref Metadata API?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >We get around 30 million requests a month. We’d like to share a few case studies to showcase what they’re doing and how they’re using it. Look out for a series of posts over the next few months where we’ll open the floor to those using the API and let them explain how and why. &lt;/span>&lt;/p>
&lt;p>&lt;span >We’ll let Authorea kick things off…       &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>&lt;span >Alberto Pepe, co-founder of Authorea explains:&lt;/span>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/08/Authorea.png">&lt;img class="size-full wp-image-1941 alignright" src="https://www.crossref.org/wp/blog/uploads/2016/08/Authorea.png" alt="Authorea" width="297" height="124" />&lt;/a>&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;a href="https://www.authorea.com/">Authorea&lt;/a> is a word processor for researchers and scholars. It is a collaboration platform to write, share and openly research &lt;/span>&lt;span >in real-time: write manuscripts and include rich media, such as data sets, software, source code and videos. The media-rich, data-driven capabilities of Authorea make it the perfect platform to create and disseminate a new generation of research articles, which are natively web-based, open, and reproducible. Authorea is free to use.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>How is the Crossref Metadata API used within Authorea?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >Authorea is specifically made for scholarly documents such as research articles, conference papers, grey literature, class notes, student papers, and problem sets. What makes scholarly documents so peculiar are their citations and references, mathematical notation, tables, and data. For citations and references, we built a citation tool which allows authors to search and cite scholarly papers with ease, without having to leave the editor. While in the middle of writing a sentence, authors can click the “cite” button and a citation tool opens up:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/05/Authorea-screenshot.jpg">&lt;img class="size-medium wp-image-1715 alignleft" src="https://www.crossref.org/wp/blog/uploads/2016/05/Authorea-screenshot-241x300.jpg" alt="Authorea screenshot" width="241" height="300" srcset="https://www.crossref.org/wp/blog/uploads/2016/05/Authorea-screenshot-241x300.jpg 241w, https://www.crossref.org/wp/blog/uploads/2016/05/Authorea-screenshot.jpg 576w" sizes="(max-width: 241px) 85vw, 241px" />&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >We currently use two engines for searching scholarly literature via their APIs: Crossref and Pubmed. Our authors love being able to search (by author name, paper title, topic, etc) and add references to their papers on the fly, in one click.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What are the future plans for Authorea?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >Among the many plans we have for the future, there is one which is also tied to Crossref: we are going to let authors assign DOIs to Authorea articles such as blog posts, preprints, “data papers”, “software papers” and other kinds of grey literature which does not fit in the traditional scholarly journals.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>What else would you like to see in our metadata?&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Well, since you ask: we would love to see unique BibTex IDs being served by the Metadata API (right now, you create the ID automatically using author name and year). Also, in some cases, some important metadata fields are missing (even author or title). I think it is actually more important to fix existing metadata rather than add new fields! &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Keen to share what you’re doing with the Crossref Metadata API? Contact &lt;/b>&lt;a href="mailto:feedback@crossref.org">&lt;b>&lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>&lt;/b>&lt;/a>&lt;b> and share your story.&lt;/b>&lt;/span>&lt;/p></description></item><item><title>Get ready for Crossmark 2.0!</title><link>https://www.crossref.org/blog/get-ready-for-crossmark-2.0/</link><pubDate>Wed, 17 Aug 2016 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/get-ready-for-crossmark-2.0/</guid><description>&lt;p>&lt;span >&lt;span >&lt;span >TL;DR… In a few weeks, publishers can upgrade to the new and improved Crossmark 2.0 including a mobile-friendly pop-up box and new button. We will provide a new snippet of code for your landing pages, and we’ll support version v1.5 until March 2017.&lt;/span>&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >We recently revealed a new look for the Crossmark box, bringing it up-to-date in design and offering extra space for more metadata. The new box pulls all of a publication’s Crossmark metadata into the same space, so readers no longer have to click between tabs. &lt;/span>&lt;a href="https://www.crossref.org/blog/linked-clinical-trials-are-here/">&lt;span >Linked Clinical Trials&lt;/span>&lt;/a>&lt;span > and author names (including ORCID iDs) now have their own sections alongside funding information and licenses. Feedback so far tells us that the new box is a vast improvement.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >However, this was only phase one of the Crossmark makeover. We will soon complete the upgrade to display a fully responsive, mobile-friendly box. The Crossmark button has been given a facelift too, and we are excited to offer the first public preview today:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;img class="wp-image-1955 size-medium alignnone" src="https://www.crossref.org/wp/blog/uploads/2016/08/CROSSMARK_LOGO-300x65.png" alt="CROSSMARK_LOGO" width="300" height="65" srcset="https://www.crossref.org/wp/blog/uploads/2016/08/CROSSMARK_LOGO-300x65.png 300w, https://www.crossref.org/wp/blog/uploads/2016/08/CROSSMARK_LOGO.png 355w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/span>&lt;/p>
&lt;p>&lt;span >The new button brings the Crossmark icon up to date and is designed to be more “clickable” than the current button. It will be available in several different ratios and also in greyscale.&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >The first phase of the new design was rolled out in the existing Crossmark pop up window (Crossmark v1.5) without the need for changes within publisher systems. For the Crossmark v2.0 upgrade, publishers will need to update their landing pages with a new snippet of code, to ‘unlock’ the new button and functional enhancements.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossmark 2.0 will be available to adopt in a few weeks, and each publi&lt;/span>&lt;span >&lt;span >sher can decide when to switch over. We encourage members to upgrade sooner rather than later to get the benefits of the new box, but we also understand there are planned development schedules and the need for a testing period so &lt;strong>w&lt;/strong>&lt;/span>&lt;span >&lt;strong>e will continue to support Crossmark v1.5 until March 2017&lt;/strong>.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Many thanks to all of those who completed our surveys to help us shape the new button. And congratulations to &lt;strong>Elizabeth Ramsey&lt;/strong>, a researcher from &lt;strong>Trent University in Canada&lt;/strong>, who will be receiving a limited edition Crossref Moleskine notebook from the survey prize draw.&lt;/span>&lt;/p>
&lt;p>&lt;span >Our User Experience Designer, Rakesh Masih, will be blogging soon with details about the research and testing for this project, as well as more about our new approach to user experience at Crossref.&lt;/span>&lt;/p></description></item><item><title>Getting ready to run with preprints, any day now</title><link>https://www.crossref.org/blog/getting-ready-to-run-with-preprints-any-day-now/</link><pubDate>Tue, 16 Aug 2016 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/getting-ready-to-run-with-preprints-any-day-now/</guid><description>&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/08/Preprints-ready-to-go-shoelaces.jpg" alt="run" width="300" height="200" />
&lt;/div>
&lt;p>While preprints have been a formal part of scholarly communications for decades in certain communities, they have not been fully adopted to date across most disciplines or systems. That may be changing very soon and quite rapidly, as new &lt;a href="https://en.wikipedia.org/wiki/Preprint#Preprint_server_by_research_field" target="_blank">initiatives&lt;/a> come thick and fast from researchers, funders, and publishers alike. This flurry of activity points to the realization from these parties of preprints’ potential benefits:&lt;/p>
&lt;li >
&lt;span >Accelerating the sharing of results; &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Catalyzing research discovery; &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Establishing priority of discoveries and ideas; &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Facilitating career advancement; and &lt;/span>
&lt;/li>
&lt;li >
&lt;span >Improving the culture of communication within the scholarly community. &lt;/span>
&lt;/li>
&lt;p>&lt;span >To acknowledge them as a legitimate part of the research story, we need to fully build preprints into the broader research infrastructure. Preprints need infrastructure support just like journal articles, monographs, and other formal research outputs. Otherwise, we (continue to) have a &lt;span >two-tiered scholarly communications system&lt;/span>, unlinked and operating independently.&lt;br /> &lt;/span>&lt;/p>
&lt;h5 id="span-binfrastructure-for-preprintsbspan">&lt;span >&lt;b>Infrastructure for preprints&lt;/b>&lt;/span>&lt;/h5>
&lt;p>&lt;span >&lt;span >For this reason, the team at Crossref is extending its infrastructure services to &lt;/span>&lt;a href="https://www.crossref.org/blog/members-will-soon-be-able-to-assign-crossref-dois-to-preprints/">&lt;span >allow members to register preprints&lt;/span>&lt;/a>&lt;span >. This new development is designed to provide custom support for preprints. It will ensure that: links to these publications persist over time; they are connected to the full history of the shared research results; and the citation record is clear and up-to-date. We established this preprints service to fully integrate preprint publications into the formal scholarly record with features such as:&lt;/span>&lt;/span>&lt;/p>
&lt;li >
&lt;span >Crossref membership for preprint repositories, joining the community of publishers who have made a commitment to maintain and connect scholarly publications.&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Persistent identifiers for preprints to ensure successful links to the scholarly record over the course of time via the DOI resolver.&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Content Registration for preprints with custom metadata that reflect researcher workflows from preprint to formal publication.&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Notification of links between preprints and formal publications that may follow (journal articles, monographs, etc.).&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >Collection of “&lt;/span>&lt;a href="https://www.crossref.org/blog/crossref-event-data-early-preview-now-available/">&lt;span >event data&lt;/span>&lt;/a>&lt;span >” that capture activities surrounding preprints (usage, social shares, mentions, discussions, recommendations, links to datasets and other research entities, etc.).&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >Reference linking for preprints, connecting up the scholarly record to associated literature&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >&lt;a href="https://info.orcid.org/auto-update-has-arrived-orcid-records-move-to-the-next-level/">Auto-update of ORCID records&lt;/a>&lt;/span>&lt;span > to ensure that preprint contributors get credit for their work.&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;a href="https://www.crossref.org/blog/a-healthy-infrastructure-needs-healthy-funding-data/">&lt;span >Preprint and funder registration&lt;/span>&lt;/a>&lt;span > to automatically report research contributions based on funder and grant identification.&lt;/span>&lt;/span>
&lt;/li>
&lt;h5 id="span-bsupporting-utility--effectiveness-of-preprints-for-allbspan">&lt;span >&lt;b>Supporting utility &amp;amp; effectiveness of preprints for all&lt;/b>&lt;/span>&lt;/h5>
&lt;p>&lt;span >&lt;span >To build the service, we are listening to the research community tell us their vision of what preprints will do. &lt;/span>&lt;a href="https://www.crossref.org/blog/hello-preprints-whats-your-story/">&lt;span >We solicited &lt;/span>&lt;/a>&lt;span >use cases from the community and have built a &lt;/span>&lt;a href="https://docs.google.com/spreadsheets/d/1UoTuzVVFe5qdMGenxAAbD9xEDOrnxuqpi29tO-frMXU/edit#gid=488933191">&lt;span >registry of preprint user stories&lt;/span>&lt;/a>&lt;span > with researchers, publishers, funding agencies, tenure and promotion committees in academic institutions, and technology providers. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >To realize the user stories, the research enterprise will no doubt need brand new tools and existing systems enhancements. Crossref’s preprints infrastructure will support the development of all needs currently registered. The community at large can focus on building effective solutions, instead of finding or securing access to data. All data are available without restriction to all so that participants as well the services and systems supporting them can access the data and reuse it for advancing early dissemination, literature discovery, research tracking, promotion and funding assessment, etc. &lt;/span>&lt;/p>
&lt;p>&lt;span >These are exciting days for scholarly communications. Over time, we envision an even more vibrant ecosystem of research outputs that include existing artefacts linked up to preprints. And Crossref is committed to providing infrastructure for the dynamic enterprise all along the way.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >We plan to announce the availability of the preprints infrastructure and further technical details within the next few weeks. If you’re interested in learning more about how these will be supported, &lt;/span>&lt;a href="mailto:feedback@crossref.org">&lt;span >get in touch&lt;/span>&lt;/a>&lt;/span>&lt;span >&lt;span >!&lt;/span> &lt;/span>&lt;/p>
&lt;p> &lt;/p></description></item><item><title>Using AWS S3 as a large key-value store for Chronograph</title><link>https://www.crossref.org/blog/using-aws-s3-as-a-large-key-value-store-for-chronograph/</link><pubDate>Tue, 02 Aug 2016 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/using-aws-s3-as-a-large-key-value-store-for-chronograph/</guid><description>&lt;p>&lt;span >One of the cool things about working in Crossref Labs is that interesting experiments come up from time to time. One experiment, entitled “what happens if you plot DOI referral domains on a chart?” turned into the &lt;a href="http://chronograph.labs.crossref.org">Chronograph&lt;/a> project. In case you missed it, Chronograph analyses our DOI resolution logs and shows how many times each DOI link was resolved per month, and also how many times a given domain referred traffic to DOI links per day.&lt;/span>&lt;/p>
&lt;p>&lt;span >We’ve released a new version of Chronograph. This post explains how it was put together. One for the programmers out there.&lt;/span>&lt;/p>
&lt;h2 id="span-big-enough-to-be-annoyingspan">&lt;span >Big enough to be annoying&lt;/span>&lt;/h2>
&lt;p>&lt;span >Chronograph sits on the boundary between normal-sized data and large-enough-to-be-annoying-size data. It doesn’t store data for all DOIs (it includes only those that are used on average once a day), but it has information on up to 1 million DOIs per month over about 5 years, and about 500 million data points in total.&lt;/span>&lt;/p>
&lt;p>&lt;span >Storing 500 million data points is within the capabilities of a well-configured database. In the first iteration of Chronograph a MySQL database was used. But that kind of data starts to get tricky to back up, move around and index.&lt;/span>&lt;/p>
&lt;p>&lt;span >Every month or two new data comes in for processing, and it needs to be uploaded and merged into the database. Indexes need to be updated. Disk space needs to be monitored. This can be tedious.&lt;/span>&lt;/p>
&lt;h2 id="span-key-valuesspan">&lt;span >Key values&lt;/span>&lt;/h2>
&lt;p>&lt;span >Because the data for a DOI is all retrieved at once, it can be stored together. So instead of a table that looks like&lt;/span>&lt;/p>
&lt;table>
&lt;tr>
&lt;td>
&lt;span >10.5555/12345678&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;2010-01-01&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;5&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >10.5555/12345678&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;2010-02-01&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;7&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >10.5555/12345678&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;2010-03-01&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;3&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>&lt;span >Instead we can store&lt;/span>&lt;/p>
&lt;table>
&lt;tr>
&lt;td>
10.5555/12345678
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
{&amp;amp;#8220;2010-01-01&amp;amp;#8221;: 5, &amp;amp;#8220;2010-02-01&amp;amp;#8221;: 7, &amp;amp;#8220;2010-03-01&amp;amp;#8221;: 3}
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>&lt;span >This is much lighter on the indexes and takes much less space to store. However, it means that adding new data is expensive. Every time there’s new data for a month, the structure must be parsed, merged with the new data, serialised and stored again millions of times over.&lt;/span>&lt;/p>
&lt;p>&lt;span >After trials with &lt;a href="https://www.mysql.com/">MySql&lt;/a>, &lt;a href="https://www.mongodb.com/">MongoDB&lt;/a> and &lt;a href="http://www.mapdb.org/">MapDB&lt;/a>, this approach was taken with MySQL in the original Chronograph.&lt;/span>&lt;/p>
&lt;h2 id="span-keep-it-simple-storage-service-stupidspan">&lt;span >Keep it Simple Storage Service Stupid&lt;/span>&lt;/h2>
&lt;p>&lt;span >In the original version of Chronograph the data was processed using &lt;a href="http://spark.apache.org/">Apache Spark&lt;/a>. There are various solutions for storing this kind of data, including Cassandra, time-series databases and so on.&lt;/span>&lt;/p>
&lt;p>&lt;span >The flip side of being able to do interesting experiments is wanting them to stick around without having to bother a sysadmin. The data is important to us, but we’d rather not have to worry about running another server and database if possible.&lt;/span>&lt;/p>
&lt;p>&lt;span >Chronograph fits into the category of ‘interesting’ rather than ‘mission-critical’ projects, so we’d rather not have to maintain expensive infrastructure if possible.&lt;/span>&lt;/p>
&lt;p>&lt;span >I decided to look into using Amazon Web Services &lt;a href="https://aws.amazon.com/s3/">Simple Storage Service&lt;/a> (AWS S3) to store the data. AWS itself is a key-value store, so it seems like a good fit. S3 is a great service because, as the name suggests, it’s a simple service for storing a large number of files. It’s cheap and its capabilities and cost scale well.&lt;/span>&lt;/p>
&lt;p>&lt;span >However, storing and updating up to 80 million very small keys (one per DOI) isn’t very clever, and certainly isn’t practical. I looked at &lt;a href="https://aws.amazon.com/documentation/dynamodb/">DynamoDB&lt;/a>, but we still face the overhead of making a large number of small updates.&lt;/span>&lt;/p>
&lt;h2 id="span-is-it-weirdspan">&lt;span >Is it weird?&lt;/span>&lt;/h2>
&lt;p>&lt;span >In these days of plentiful databases with cheap indexes (and by ‘these days’ I mean the 1970s onward) it seems somehow wrong to use plain old text files. However, the whole Hadoop “Big Data” movement was predicated on a return to batch processing files. Commoditisation of services like S3 and the shift to do more in the browser have precipitated a bit of a rethink. The movement to abandon LAMP stacks and use static site generators is picking up pace. The term ‘serverless architecture’ is hard to avoid if you read &lt;a href="https://hn.algolia.com/?query=serverless%20architecture&amp;sort=byDate&amp;prefix&amp;page=0&amp;dateRange=all&amp;type=story">certain news sites&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >Using Apache Spark (with its brilliant &lt;a href="http://spark.apache.org/docs/latest/programming-guide.html#resilient-distributed-datasets-rdds">RDD concept&lt;/a>) was useful for bootstrapping the data processing for Chronograph, but the new code has an entirely flat-file workflow. The simplicity of not having to unnecessarily maintain a &lt;a href="https://hadoop.apache.org/docs/r1.2.1/hdfs_design.html">Hadoop HDFS&lt;/a> instance seems to be the right choice in this case.&lt;/span>&lt;/p>
&lt;h2 id="span-repurposing-the-wheelspan">&lt;span >Repurposing the Wheel&lt;/span>&lt;/h2>
&lt;p>&lt;span >The solution was to use S3 as a big &lt;a href="https://en.wikipedia.org/wiki/Hash_table">hash table&lt;/a> to store the final data that’s served to users.&lt;/span>&lt;/p>
&lt;p>&lt;span >The processing pipeline uses flat files all the way through from input log files to projections to aggregations. At the penultimate stage of the pipeline blocks of CSV per DOI are produced that represent date-value pairs.&lt;/span>&lt;/p>
&lt;table>
&lt;tr>
&lt;td>
10.5555/12345678
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
2010-01
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
2010-01-01,05&amp;lt;br /&amp;gt; 2010-02-01,02&amp;lt;br /&amp;gt; 2010-01-03,08&amp;lt;br /&amp;gt; &amp;amp;#8230;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
10.5555/12345678
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
2010-02
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
2010-02-1,10&amp;lt;br /&amp;gt; 2010-02-01,7&amp;lt;br /&amp;gt; 2010-02-03,22&amp;lt;br /&amp;gt; &amp;amp;#8230;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>&lt;span >At the last stage, these are combined into blocks of all dates for a DOI&lt;/span>&lt;/p>
&lt;table>
&lt;tr>
&lt;td>
10.5555/12345678
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
2010-01
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
2010-01-01,05&amp;lt;br /&amp;gt; 2010-02-01,02&amp;lt;br /&amp;gt; 2010-01-03,08&amp;lt;br /&amp;gt; &amp;amp;#8230;&amp;lt;br /&amp;gt; 2010-02-1,10&amp;lt;br /&amp;gt; 2010-02-01,7&amp;lt;br /&amp;gt; 2010-02-03,22&amp;lt;br /&amp;gt; &amp;amp;#8230;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>&lt;span >The DOIs are then hashed into 12 bits and stored as chunks of CSV&lt;/span>&lt;/p>
&lt;p>&lt;span >day-doi.csv-chunks_8841:&lt;/span>&lt;/p>
&lt;pre class="">10.1038/ng.3020
2014-06-24,4
2014-06-25,4
2014-06-26,3
...
10.1007/978-94-007-2869-1_7
2012-06-01,12
2012-06-02,8
...
10.1371/journal.pone.0145509
2016-02-01,13
2016-02-02,75
2016-02-03,30
...&lt;/pre>
&lt;p>&lt;span >There are 65,536 (0x000 to 0xFFFF) possible files, each with about a thousand DOIs worth of data in each.&lt;/span>&lt;/p>
&lt;p>&lt;span >When the browser requests data for a DOI, it is hashed and then the request for the appropriate file in S3 is made. The browser then has to perform a linear scan of the file to find the DOI it is looking for.&lt;/span>&lt;/p>
&lt;p>&lt;span >This is the simplest possible form of hash table: simple addressing with separate &lt;a href="https://en.wikipedia.org/wiki/Hash_table#Separate_chaining_with_linked_lists">linear chaining&lt;/a>. The hash function is a 16-bit mask of MD5, chosen because of availability in the browser. It does a great job of evenly distributing the DOIs over all 65,536 possible files.&lt;/span>&lt;/p>
&lt;h2 id="span-striking-the-balancespan">&lt;span >Striking the balance&lt;/span>&lt;/h2>
&lt;p>&lt;span >In any data structure implementation, there are balances to be struck. Traditionally these concern memory layout, the shape of the data, practicalities of disk access and CPU cost.&lt;/span>&lt;/p>
&lt;p>&lt;span >In this instance, the factors in play included the number of buckets that need to be uploaded and the cost of the browser downloading an over-large bucket. The size of the bucket doesn’t matter much for CPU (as far as the user is concerned it takes about the same time to scan 10 entries as it does 10,000), but it does make a difference asking  user to download a 10kb bucket or a 10MB one.&lt;/span>&lt;/p>
&lt;p>&lt;span >I struck the balance at 4096 buckets, resulting in files of around 100k, which is the size of a medium sized image.&lt;/span>&lt;/p>
&lt;h2 id="span-it-worksspan">&lt;span >It works&lt;/span>&lt;/h2>
&lt;p>&lt;span >The result is a simple system that allows people to look up data for millions of DOIs, without having to look after another server. It’s also portable to any other file storage service.&lt;/span>&lt;/p>
&lt;p>&lt;span >The approach isn’t groundbreaking, but it works.&lt;/span>&lt;/p></description></item><item><title>A fairer approach to waiting for deposits</title><link>https://www.crossref.org/blog/a-fairer-approach-to-waiting-for-deposits/</link><pubDate>Wed, 20 Jul 2016 00:00:00 +0000</pubDate><author>Chuck Koscher</author><guid>https://www.crossref.org/blog/a-fairer-approach-to-waiting-for-deposits/</guid><description>&lt;p>If you ever see me in the checkout line at some store do not &lt;em>ever&lt;/em> get in the line I’m in. It is always the absolute slowest.&lt;/p>
&lt;p>Crossref’s metadata system has a sort of checkout line, when members send in their data they got processed essentially in a first come first served basis. It’s called the deposit queue. We had controls to prevent anyone from monopolizing the queue and ways to jump forward in the queue but our primary goal was to give everyone a fair shot at getting processed as soon as possible. With many different behaviors by our members this could often be a challenge and at times some folks were not 100% happy.&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/07/depositwars.png">&lt;img class="alignleft wp-image-1903 size-medium" src="https://www.crossref.org/wp/blog/uploads/2016/07/depositwars-300x75.png" width="300" height="75" srcset="https://www.crossref.org/wp/blog/uploads/2016/07/depositwars-300x75.png 300w, https://www.crossref.org/wp/blog/uploads/2016/07/depositwars-768x192.png 768w, https://www.crossref.org/wp/blog/uploads/2016/07/depositwars-1024x256.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/07/depositwars.png 1200w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>We recently made a change where the queue now cycles through all waiting users and selects a job from each. This means that low-frequency users will always get a pretty fast service even if there are a lot of unique users waiting. Everyone gets one bite of the apple on each cycle through the waiting list. Of course, we still have some special controls to help deal with large quantities of files from a single user and ways to jump the queue under really special circumstances.&lt;/p>
&lt;p>We believe this will, on average,  yield a better experience and minimize the backups that formerly required administrator attention to resolve.&lt;/p></description></item><item><title>2016 upcoming events - we’re out and about!</title><link>https://www.crossref.org/blog/2016-upcoming-events-were-out-and-about/</link><pubDate>Wed, 06 Jul 2016 00:00:00 +0000</pubDate><author>Rosa Morais Clark</author><guid>https://www.crossref.org/blog/2016-upcoming-events-were-out-and-about/</guid><description>&lt;div>
&lt;p>
&lt;span >Check out the events below where Crossref will attend or present in 2016. We have been busy over the past few months, and we have more planned for the rest of year. If we will be at a place near you, please come see us (and support these organisations and events)!&lt;/span>
&lt;/p>
&lt;p>
&lt;span >Upcoming Events&lt;/span>&lt;br /> &lt;span >&lt;a href="http://www.share-research.org/2016/04/share-2016-community-meeting/">SHARE Community Meeting&lt;/a>, July 11-14, Charlottesville, VA, USA&lt;/span>&lt;br /> &lt;span >Crossref Outreach Day - July 19-21 - Seoul, South Korea&lt;/span>&lt;br /> &lt;span >&lt;a href="http://asianeditor.org/event/2016/index.php">CASE 2016 Conference&lt;/a> - July 20-22 - Seoul, South Korea&lt;/span>&lt;br /> &lt;span >&lt;a href="http://theacse.com/meeting2016/">ACSE Annual Meeting 2016&lt;/a> - August 10-11 - Dubai, UAE&lt;/span>&lt;br /> &lt;span >&lt;a href="http://vivoconference.org/">Vivo 2016 Conference&lt;/a> - August 17-19 - Denver CO, USA&lt;/span>&lt;br /> &lt;span >&lt;a href="https://www.scidatacon.org/2016/">SciDataCon&lt;/a> - September 11-17 - Denver CO, USA&lt;/span>&lt;br /> &lt;span >&lt;a href="https://www.youtube.com/watch?v=qNgPrQGfSb0">ALPSP&lt;/a> - September 14-16 - London, UK&lt;/span>&lt;br /> &lt;span >&lt;a href="http://oaspa.org/conference/">OASPA&lt;/a> - September 21-22 - Arlington VA, USA&lt;/span>&lt;br /> &lt;span >&lt;a href="https://www.altmetric.com/events/">3:AM Conference&lt;/a> - September 26 - 28 - Bucharest, Romania&lt;/span>&lt;br /> &lt;span >&lt;a href="https://orcid.org/about/events/">ORCID Outreach Conference&lt;/a> - October 5-6 - Washington DC, USA&lt;/span>&lt;br /> &lt;span >&lt;a href="http://www.buchmesse.de/en/">Frankfurt Book Fair&lt;/a> - October 19-23 - Frankfurt, Germany (Hall 4.2, Stand #4.2 M 85)&lt;/span>&lt;br /> &lt;span >&lt;a href="https://www.eventbrite.com/e/crossref-annual-community-meeting-2016-tickets-25928526922">Crossref Annual Community Meeting #Crossref16&lt;/a> - November 1-2 - London, UK**&lt;/span>&lt;br /> &lt;span >&lt;a href="http://pidapalooza.org/">PIDapalooza&lt;/a> - November 9-10 - Reykjavik, Iceland&lt;/span>&lt;br /> &lt;span >&lt;a href="http://www.opencon2016.org/updates">OpenCon 2016&lt;/a> - November 12-14 - Washington DC, USA&lt;/span>&lt;br /> &lt;span >&lt;a href="http://www.stm-assoc.org/events/stm-digital-publishing-2016/">STM Digital Publishing Conference&lt;/a> - December 6-8 - London, UK&lt;/span>
&lt;/p>
&lt;/div>
&lt;div>
&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/06/DC4.jpeg">&lt;img class="alignnone size-medium wp-image-1831" src="https://www.crossref.org/wp/blog/uploads/2016/06/DC4-300x225.jpeg" alt="DC4" width="300" height="225" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/DC4-300x225.jpeg 300w, https://www.crossref.org/wp/blog/uploads/2016/06/DC4-768x576.jpeg 768w, https://www.crossref.org/wp/blog/uploads/2016/06/DC4.jpeg 948w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a> &lt;a href="https://www.crossref.org/wp/blog/uploads/2016/06/DC2.jpeg">&lt;br /> &lt;/a>
&lt;/div>
&lt;div>
&lt;span >The Crossref outreach team will host a number of outreach events around the globe. Updates about events are shared through social media so please connect with us via @CrossrefOrg.&lt;/span>
&lt;/div>
&lt;div>
&lt;span > &lt;/span>
&lt;/div>
&lt;div>
&lt;/div></description></item><item><title>DOI-like strings and fake DOIs</title><link>https://www.crossref.org/blog/doi-like-strings-and-fake-dois/</link><pubDate>Wed, 29 Jun 2016 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/doi-like-strings-and-fake-dois/</guid><description>&lt;h2 id="span-tldrspan">&lt;span >TL;DR&lt;/span>&lt;/h2>
&lt;p>&lt;span >Crossref discourages our members from using DOI-like strings or fake DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;a href="#">&lt;img class="alignnone wp-image-1850 size-thumbnail" src="https://www.crossref.org/wp/blog/uploads/2016/06/prohibited-150x150.png" alt="discouraged" width="150" height="150" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/prohibited-150x150.png 150w, https://www.crossref.org/wp/blog/uploads/2016/06/prohibited-300x300.png 300w, https://www.crossref.org/wp/blog/uploads/2016/06/prohibited.png 729w" sizes="(max-width: 150px) 85vw, 150px" />&lt;/a>&lt;/p>
&lt;h2 id="span-detailsspan">&lt;span >Details&lt;/span>&lt;/h2>
&lt;p>&lt;span >Recently we have seen quite &lt;a href="https://go-to-hellman.blogspot.co.uk/2016/06/wileys-fake-journal-of-constructive.html">a bit of debate&lt;/a> around the use of so-called “fake-DOIs.” We have also been quoted as saying that we discourage the use of “fake DOIs” or “DOI-like strings”. This post outlines some of the cases in which we’ve seen fake DOIs used and why we recommend against doing so.&lt;/span>&lt;/p>
&lt;h2 id="span-using-doi-like-strings-as-internal-identifiersspan">&lt;span >Using DOI-like strings as internal identifiers&lt;/span>&lt;/h2>
&lt;p>&lt;span >Some of our members use DOI-like strings as internal identifiers for their manuscript tracking systems. These only get registered as real DOIs with Crossref once an article is published. This seems relatively harmless, except that, frequently, the unregistered DOI-like strings for unpublished (e.g. under review or rejected manuscripts) content ‘escape’ into the public as well. People attempting to use these DOI-like strings get understandably confused and angry when they don’t resolve or otherwise work as DOIs. After years of experiencing the frustration that these DOI-like things cause, we have taken to recommending that our members not use DOI-like strings as their internal identifiers.&lt;/span>&lt;/p>
&lt;h3 id="span-using-doi-like-strings-in-access-control-compliance-applicationsspan">&lt;span >Using DOI-like strings in access control compliance applications&lt;/span>&lt;/h3>
&lt;p>&lt;span >We’ve also had members use DOI-like strings as the basis for systems that they use to detect and block tools designed to bypass the member’s access control system and bulk-download content. The methods employed by our members have fallen into two broad categories:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Spider (or robot) traps.&lt;/span>&lt;/li>
&lt;li>&lt;span >Proxy bait.&lt;/span>&lt;/li>
&lt;/ul>
&lt;h3 id="spider-traps">Spider traps&lt;/h3>
&lt;p>&lt;a href="#">&lt;img class="alignnone wp-image-1849 size-thumbnail" src="https://www.crossref.org/wp/blog/uploads/2016/06/web-150x150.png" alt="spider trap" width="150" height="150" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/web-150x150.png 150w, https://www.crossref.org/wp/blog/uploads/2016/06/web-300x300.png 300w, https://www.crossref.org/wp/blog/uploads/2016/06/web.png 729w" sizes="(max-width: 150px) 85vw, 150px" />&lt;/a>&lt;/p>
&lt;p>&lt;span >A “&lt;a href="https://en.wikipedia.org/wiki/Spider_trap">spider trap&lt;/a>” is essentially a tripwire that allows a site owner to detect when a spider/robot is crawling their site to download content. The technique involves embedding a special trigger URL in a public page on a web site. The URL is embedded such that a normal user should not be able see it or follow it, but an automated bot (aka “spider”) will detect it and follow it. The theory is that when one of these trap URLs is followed, the website owner can then conclude that the ip address from which it was followed harbours a bot and take action. Usually the action is to inform the organisation from which the bot is connecting and to ask them to block it. But sometimes triggering a spider trap has resulted in the IP address associated with it being instantly cut off. This, in turn, can affect an entire university’s access to said member’s content.&lt;/span>&lt;/p>
&lt;p>&lt;span >When a spider/bot trap includes a DOI-like string, then we have seen some particularly pernicious problems as they can trip-up legitimate tools and activities as well. For example, a bibliographic management browser plugin might automatically extract DOIs and retrieve metadata on pages visited by a researcher. If the plugin were to pick up one of these spider traps DOI-like strings, it might inadvertently trigger the researcher being blocked- or worse- the researcher’s entire university being blocked. In the past, this has even been a problem for Crossref itself. We periodically run tools to test DOI resolution and to ensure that our members are properly displaying DOIs, Crossmarks, and metadata as per their member obligations. We’ve occasionally been blocked when we ran across the spider traps as well.&lt;/span>&lt;/p>
&lt;h3 id="proxy-bait">Proxy bait&lt;/h3>
&lt;p>&lt;a href="#">&lt;img class="alignnone wp-image-1848 size-thumbnail" src="https://www.crossref.org/wp/blog/uploads/2016/06/bait-150x150.png" alt="proxy bait" width="150" height="150" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/bait-150x150.png 150w, https://www.crossref.org/wp/blog/uploads/2016/06/bait-300x300.png 300w, https://www.crossref.org/wp/blog/uploads/2016/06/bait.png 729w" sizes="(max-width: 150px) 85vw, 150px" />&lt;/a>&lt;/p>
&lt;p>&lt;span >Using proxy bait is similar to using a spider trap, but it has an important difference. It does not involve embedding specially crafted DOI like strings on the member’s website itself. The DOI-like strings are instead fed directly to tools designed to subvert the member’s access control systems. These tools, in turn, use proxies on a subscriber’s network to retrieve the “bait” DOI-like string. When the member sees one of these special DOI-like strings being requested from a particular institution, they then know that said institution’s network harbours a proxy. In theory this technique never exposes the DOI-like strings to the public and automated tools should not be able to stumble upon them. However, recently one of our members had some of these DOI-like strings “escape” into the public and at least one of them was indexed by Google. The problem was compounded because people clicking on these DOI-like strings sometimes ended having their university’s IP address banned from the member’s web site. As you can imagine, there has been a lot of gnashing of teeth. We are convinced, in this case, that the member was doing their best to make sure the DOI-like strings never entered the public. But they did nonetheless. We think this just underscores how hard it is to ensure DOI-like strings remain private and why we recommend our members not use them.&lt;/span>&lt;/p>
&lt;h2 id="span-pedantry-and-terminologyspan">&lt;span >Pedantry and terminology&lt;/span>&lt;/h2>
&lt;p>&lt;span >Notice that we have not used the phrase “fake DOI” yet. This is because, internally, at least, we have distinguished between “DOI-like strings” and “fake DOIs.” The terminology might be daft, but it is what we’ve used in the past and some of our members at least will be familiar with it. We don’t expect anybody outside of Crossref to know this.&lt;/span>&lt;/p>
&lt;p>&lt;span >To us, the following is not a DOI:&lt;/span>&lt;/p>
&lt;p>&lt;span >10.5454/JPSv1i220161014&lt;/span>&lt;/p>
&lt;p>&lt;span >It is simply a string of alphanumeric characters that copy the DOI syntax. We call them “DOI-like strings.” It is not registered with any DOI registration agency and one cannot lookup metadata for it. If you try to “resolve” it, you will simply get an error. Here, you can try it. Don’t worry- clicking on it will not disable access for your university.&lt;/span>&lt;/p>
&lt;p>&lt;a href="http://doi.org/10.5454/JPSv1i220161014" target="_blank">http://doi.org/10.5454/JPSv1i220161014&lt;/a>&lt;/p>
&lt;p>&lt;span >The following is what we have sometimes called a “fake DOI”&lt;/span>&lt;/p>
&lt;p>&lt;span >10.5555/12345678&lt;/span>&lt;/p>
&lt;p>&lt;span >It is registered with Crossref, resolves to a fake article in a fake journal called The Journal of Psychoceramics (the study of Cracked Pots) run by a fictitious author (Josiah Carberry) who has a fake ORCID (&lt;a href="http://orcid.org/0000-0002-1825-0097">&lt;a href="http://orcid.org/0000-0002-1825-0097" target="_blank">http://orcid.org/0000-0002-1825-0097&lt;/a>&lt;/a>) but who is affiliated with a real university (&lt;a href="http://www.brown.edu">Brown University&lt;/a>).&lt;/span>&lt;/p>
&lt;p>&lt;span >Again, you can try it.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="http://doi.org/10.5555/12345678">&lt;a href="http://doi.org/10.5555/12345678" target="_blank">http://doi.org/10.5555/12345678&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >And you can even look up metadata for it.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://api.crossref.org/v1/works/10.5555/12345678">&lt;a href="https://api.crossref.org/v1/works/10.5555/12345678" target="_blank">https://api.crossref.org/v1/works/10.5555/12345678&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Our dirty little secret is that this “fake DOI” was registered and is controlled by Crossref.&lt;/span>&lt;/p>
&lt;p>&lt;span >Why does this exist? Aren’t we subverting the scholarly record? Isn’t this awful? Aren’t we at the very least hypocrites? And how does a real university feel about having this fake author and journal associated with them?&lt;/span>&lt;/p>
&lt;p>&lt;span >Well- the DOI is using a prefix that we use for testing. It follows a long tradition of test identifiers starting with “5”. &lt;a href="https://en.wikipedia.org/wiki/555_(telephone_number)">Fake phone numbers in the US start with “555”&lt;/a>. Many credit card companies &lt;a href="https://web.archive.org/web/20160707151357/https://www.paypalobjects.com/en_US/vhelp/paypalmanager_help/credit_card_numbers.htm">reserve fake numbers starting with “5”&lt;/a>. For example, Mastercard’s are “5555555555554444” and “5105105105105100.”&lt;/span>&lt;/p>
&lt;p>&lt;span >We have created this fake DOI, the fake journal and the fake ORCID so that we can test our systems and demonstrate interoperable features and tools. The fake author, Josiah Carberry, is &lt;a href="http://library.brown.edu/hay/carberry.php">a long-running joke at Brown University&lt;/a>. He even has a &lt;a href="https://en.wikipedia.org/wiki/Josiah_S._Carberry">Wikipedia entry&lt;/a>. There are also a lot of other DOIs under the test prefix “5555.”&lt;/span>&lt;/p>
&lt;p>&lt;span >We acknowledge that the term “fake DOI” might not be the best in this case- but it is a term we’ve used internally at least and it is worth distinguishing it from the case of DOI-like strings mentioned above.&lt;/span>&lt;/p>
&lt;p>&lt;span >But back to the important stuff….&lt;/span>&lt;/p>
&lt;p>&lt;span >As far as we know, none of our members has ever registered a “fake DOI” (as defined above) in order to detect and prevent the circumvention of their access control systems. If they had, we would consider it much more serious than the mere creation of DOI-like strings. The information associated with registered DOIs becomes part of the persistent scholarly citation record. Many, many third party systems and tools make use of our API and metadata including bibliographic management tools, &lt;a href="https://en.wikipedia.org/wiki/Text_mining">TDM&lt;/a> tools, &lt;a href="https://en.wikipedia.org/wiki/Current_research_information_system">CRIS&lt;/a> systems, &lt;a href="https://en.wikipedia.org/wiki/Altmetrics">altmetrics&lt;/a> services, etc. It would be a very bad thing if people started to worry that the legitimate use of registered DOIs could inadvertently block them from accessing content. Crossref DOIs are designed to encourage discovery and access- not block it.&lt;/span>&lt;/p>
&lt;p>&lt;span >And again, we have absolutely no evidence that any of our members has registered fake DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >But just in case, we will continue to discourage our members from using DOI-like strings and/or registering fake DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >This has been a public service announcement from the identifier dweebs at Crossref.&lt;/span>&lt;/p>
&lt;h2 id="span-image-creditsspan">&lt;span >Image Credits&lt;/span>&lt;/h2>
&lt;p>&lt;span >Unless otherwise noted, included images purchased from &lt;a href="https://thenounproject.com/">The Noun Project&lt;/a>&lt;/span>&lt;/p></description></item><item><title>Outreach Day DC. Next Up? You Tell Us</title><link>https://www.crossref.org/blog/outreach-day-dc.-next-up-you-tell-us/</link><pubDate>Tue, 28 Jun 2016 00:00:00 +0000</pubDate><author>Jennifer Kemp</author><guid>https://www.crossref.org/blog/outreach-day-dc.-next-up-you-tell-us/</guid><description>&lt;p>&lt;span >Rallying the community is a key Crossref role. Sometimes this means collaborating on new initiatives but it is also an ongoing process, a cornerstone of our outreach efforts. Part of rallying the community is bringing people together, literally, in a series of outreach days around the globe. It means we encourage dialog with us and among members and non-publisher affiliates. We want to hear from the community and we hope to facilitate conversations in it. Not just about Crossref, but larger issues of scholarly communications and your particular part in it. The Crossref outreach team is doing a number of events around the world to bring together the community for updates, feedback and discussion.&lt;/span>&lt;/p>
&lt;p>&lt;span >On 16 June, Crossref hosted an all day session in Washington, DC where we were joined by about 35 attendees from the region, mostly publishers. The size of the group made for lots of discussion, and we are grateful for the feedback. Here is what we took away from the event:&lt;/span>&lt;/p>
&lt;h3 id="span-strongwe-all-need-a-better-understanding-of-who-is-using-crossref-metadata-and-howstrongspan">&lt;span >&lt;strong>We all need a better understanding of who is using Crossref metadata and how&lt;/strong>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Sure, we all know that, for example, submission systems, libraries and hosting platforms use Crossref metadata (‘metadata out’), but pinpointing where in workflows (often multiple instances) and the interplay between publishers and these systems? Not so much. &lt;strong>Help us change that:&lt;/strong> &lt;a href="http://goo.gl/forms/E1l4rYHHLEHb8bsj1" target="_blank">take this short survey&lt;/a> to tell us how publisher metadata quality affects your systems and workflows and we will, in turn, make use cases (anonymized if you wish) available as part of an ongoing effort to promote the value of more, better and enriched metadata.&lt;/span>&lt;/p>
&lt;p>&lt;span >Here I must say a big thank you to our guest speaker for the day, Carly Robinson, who provided an excellent presentation on the work of &lt;a href="http://www.osti.gov/home/about.html" target="_blank">OSTI&lt;/a>, of the U.S. Department of Energy. Carly shared examples of how OSTI uses the Crossref metadata in their systems to aid compliance and compliment the DOE public access model. A live use case is a welcome way to partner with our community!&lt;/span>&lt;/p>
&lt;h3 id="span-strongthe-more-things-change-the-more-they-emphasize-core-best-practicesstrongspan">&lt;span >&lt;strong>The more things change, the more they emphasize core best practices&lt;/strong>&lt;/span>&lt;/h3>
&lt;p>&lt;span >A good part of the day was spent on new initiatives such as: &lt;a href="https://www.crossref.org/blog/members-will-soon-be-able-to-assign-crossref-dois-to-preprints/" target="_blank">DOIs for preprints&lt;/a>, &lt;a href="https://www.crossref.org/blog/auto-update-has-arrived-orcid-records-move-to-the-next-level/" target="_blank">auto-update of ORCID records&lt;/a>, &lt;a href="https://www.crossref.org/blog/community-responses-to-our-proposal-for-early-content-registration/" target="_blank">&amp;lsquo;early content registration&amp;rsquo; &lt;/a>, &lt;a href="https://www.crossref.org/blog/linked-clinical-trials-are-here/" target="_blank">linked clinical trials&lt;/a> and &lt;a href="https://www.crossref.org/blog/" target="_blank">more&lt;/a>. All good stuff-the industry evolves and workflows must keep pace-but none of which generated a great deal of questions or expressed concern.&lt;/span>&lt;/p>
&lt;p>&lt;span >One session that did spur a lot of discussion was a simple overview of where Crossref services sit in the publishing process (including pre- and post-). Perhaps this is because it was early in the day but the much-appreciated discussion underscored the need to make the case for enriched metadata in a well-understood workflow that reflects the roles of publishers and affiliate users of metadata.&lt;/span>&lt;/p>
&lt;h3 id="span-boutreach-is-an-experiment-in-which-we-are-all-subjectsbspan">&lt;span >&lt;b>Outreach is an experiment in which we are all subjects&lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Finally, it must be noted here that we actively seek feedback on our Community Outreach days! We are not a large team and we can’t do as many outreach days as we’d like, but we are very open to hearing from you: So, tell us in &lt;a href="http://goo.gl/forms/MDDRy8WUgyiwzo4m2" target="_blank">this quick survey:&lt;/a> what should we discuss? And where should we head next?&lt;/span>&lt;/p>
&lt;img class="alignleft" src="https://www.crossref.org/wp/blog/uploads/2016/06/Crossref-DC-Outreach-Day-2016.jpg" alt="D.C. Crossref Outreach Day" width="436" height="327" /></description></item><item><title>Hello preprints, what’s your story?</title><link>https://www.crossref.org/blog/hello-preprints-whats-your-story/</link><pubDate>Thu, 23 Jun 2016 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/hello-preprints-whats-your-story/</guid><description>&lt;h3 id="span-the-role-of-preprints">&lt;span >The role of preprints&lt;/h3>
&lt;p>&lt;span >&lt;span >Crossref provides infrastructure services and therefore we support scholarly communications as it evolves over time. Today, preprints are increasingly discussed as a valuable part of the research story (beyond physics, math, and a small set of sub-disciplines). Preprints might play a positive role in catalyzing research discovery, establishing priority of discoveries and ideas, facilitating career advancement, and improving the culture of communication within the scholarly community.&lt;/p>
&lt;p>&lt;span >&lt;span >As we shared in an earlier blog post last month, &lt;a href="https://www.crossref.org/blog/members-will-soon-be-able-to-assign-crossref-dois-to-preprints/">&lt;span >members will be able to register Crossref DOIs for preprints &lt;/a>&lt;span > later this year. We will connect the full history of a research work, and ensure the citation record is clear and up-to-date. As we build out this new record/resource type, we’d love to hear how the research community envisions what preprints will do.&lt;/p>
&lt;h3 id="span-whats-your-story-preprint">&lt;span >What’s your story, preprint?&lt;/h3>
&lt;p>&lt;span >&lt;span >So we can develop a service that supports the whole host of potential uses for all stakeholders, we ask the entire research community to contribute &lt;b>preprints user stories &lt;/b>&lt;span >. &lt;a href="https://en.wikipedia.org/wiki/User_story">&lt;span >User stories &lt;/a>&lt;span > are concrete descriptions of a specific need, typically used in technology development: &lt;i>&lt;span >As a [x], I want to [y] to that I can [z]&lt;/i>&lt;span >. User stories take the “end-user’s” perspective as they focus on a discrete result and its value. They are essential when implementing solutions that must meet a wide range of needs, across a diverse set of constituents. For example:&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >As an author, I want to share results before my paper is submitted to a journal so that I can get rapid feedback on it and make improvements before publication.&lt;/p>
&lt;p>&lt;span >As a researcher who is part of a tenure and promotion committee or funder review panel, I want to know the reach of early results published from the candidate so that I can more quickly track the impact of results, rather than relying only on journal articles that take much longer to publish.&lt;/p>
&lt;p>&lt;span >As a journal publisher, I want to know whether a preprint exists for a manuscript submitted to me so that I can decide whether I will accept the submission based on my editorial policy.&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >&lt;span >We aim to assemble a full catalog that cuts across research disciplines and stakeholder groups. We want to hear from you: &lt;b>researchers, publishers, funding agencies, scholarly societies, academic institutions, technology providers, other infrastructure providers &lt;/b>, etc &lt;span >.&lt;/p>
&lt;h3 id="span-tell-us-your-story-here">&lt;span >Tell us your story here&lt;/h3>
&lt;p>&lt;span >&lt;span >To ensure that your needs are included, please send us your user stories via this &lt;a href="https://docs.google.com/spreadsheets/d/1UoTuzVVFe5qdMGenxAAbD9xEDOrnxuqpi29tO-frMXU/edit#gid=488933191">&lt;span >user story “deposit” form &lt;/a>&lt;span >. They will be added to the &lt;a href="https://docs.google.com/spreadsheets/d/1UoTuzVVFe5qdMGenxAAbD9xEDOrnxuqpi29tO-frMXU/edit#gid=488933191">&lt;span >full registry of contributions &lt;/a>&lt;span > from the community, which we hope will serve as a key resource for all those developing preprints into a core part of scholarly communications (e.g., ASAPbio, etc.).&lt;/p></description></item><item><title>Linked Clinical Trials initiative gathers momentum</title><link>https://www.crossref.org/blog/linked-clinical-trials-initiative-gathers-momentum/</link><pubDate>Tue, 21 Jun 2016 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/linked-clinical-trials-initiative-gathers-momentum/</guid><description>&lt;p>&lt;span >We now have &lt;a href="https://www.crossref.org/blog/linked-clinical-trials-are-here/">linked clinical trials&lt;/a> deposits coming in from five publishers: BioMedCentral, BMJ, Elsevier, National Institute for Health Research and PLOS. It’s still a relatively small pool of metadata - &lt;a href="https://api.crossref.org/v1/works?filter=has-clinical-trial-number:true">around 4000 DOIs&lt;/a> with associated clinical trial numbers - but we’re delighted to see that “threads” of publications are already starting to form.&lt;/span>&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;img src="https://www.crossref.org/images/blog/clinical-trials-blog.png" alt="An exemplary image" width="300px" />
&lt;/div>
&lt;p>&lt;span >If you look at &lt;a href="https://doi.org/10.1016/s0140-6736(14)61836-5">this article in &lt;em>The Lancet&lt;/em>&lt;/a> and click on the Crossmark button you will see that in the Clinical Trials section there are links to three other articles reporting on the same trial: two from the &lt;em>American Heart Journal&lt;/em> and one from BMJ’s &lt;em>Heart&lt;/em>. Readers can navigate between these four articles in three separate journals using the Crossmark functionality- a new set of links and routes for discovery have appeared.&lt;/span>&lt;/p>
&lt;p>&lt;span >In another example, three articles from &lt;em>&lt;a href="https://doi.org/10.1371/journal.pone.0017554">PLOS ONE&lt;/a> &lt;/em>are threaded together around a trial for the treatment of Type 1 diabetes. And here another PLOS journal, &lt;a href="https://doi.org/10.1371/journal.pone.0017554">&lt;em>Neglected Tropical Diseases&lt;/em>&lt;/a> links through to a &lt;em>PLOS ONE&lt;/em> article about the same trial.&lt;/span>&lt;/p>
&lt;p>&lt;span >If you publish in the health sciences please do consider joining this exciting initiative so that we can expand these threads and build up the metadata. Read the &lt;a href="https://www.crossref.org/documentation/crossmark/linked-clinical-trials/">tech specs here&lt;/a> or drop me an email if you have questions.&lt;/span>&lt;/p></description></item><item><title>Distributing references via Crossref</title><link>https://www.crossref.org/blog/distributing-references-via-crossref/</link><pubDate>Fri, 17 Jun 2016 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/distributing-references-via-crossref/</guid><description>&lt;h2 id="known-unknowns">Known unknowns&lt;/h2>
&lt;p>If you follow this blog, you are going to notice a theme over the coming months- Crossref supports the deposit and distribution of &lt;a href="https://www.crossref.org/blog/beyond-the-doi-to-richer-metadata/">a lot more kinds of metadata&lt;/a> than people usually realise.&lt;/p>
&lt;p>We are in the process of completely revamping our web site, help documentation, and marketing to better promote our metadata distribution capabilities, but in the mean time we think it would be useful highlight one of our most under-promoted functions- the ability to distribute references via Crossref.&lt;/p>
&lt;p>One of the questions we most often get from members is- “can we distribute references via Crossref?” The answer is an emphatic &lt;strong>yes&lt;/strong>. But to do so, you have to take an extra and hitherto obscure step to enable reference distribution.
&lt;em>[EDIT 6th June 2022 - all references are now open by default with the March 2022 board vote to remove any restrictions on reference distribution].&lt;/em>&lt;/p>
&lt;h2 id="how">How?&lt;/h2>
&lt;p>Many members deposit references to Crossref as part of their participation in Crossref’s &lt;a href="http://www.crossref.org/citedby/index.html" target="_blank">CitedBy&lt;/a> service. However - for historical reasons too tedious to go into- participation in CitedBy does not automatically make references available via Crossref’s standard APIs. In order for publishers to distribute references along with standard bibliographic metadata, publishers need to either:&lt;/p>
&lt;ul>
&lt;li>Contact Crossref &lt;a href="mailto:support@crossref.org">support&lt;/a> and ask them to turn on reference distribution for all of the prefixes they manage.&lt;/li>
&lt;li>Set the &lt;a href="http://data.crossref.org/reports/help/schema_doc/4.4.1/schema_4_4_1.html#reference_distribution_opts.att" target="_blank">&lt;code>reference_distribution_opt&lt;/code>&lt;/a> element to &lt;code>any&lt;/code> for each content item registered where they want to make references openly available.&lt;/li>
&lt;/ul>
&lt;p>Either of these steps will allow references for the affected member DOIs to be distributed without restriction through all of Crossrefs APIs and bulk metadata dumps.&lt;/p>
&lt;p>Note that by doing this, you are &lt;strong>not&lt;/strong> enabling the open querying of your CitedBy data- you are simply allowing the references that you already deposit to be redistributed to interested parties via our public APIs.&lt;/p>
&lt;h2 id="who">Who?&lt;/h2>
&lt;p>So who does this now? Well, at the moment not many members have enabled this feature. How could they? They probably didn’t know it existed.  At the time of writing this 29 publishers have enabled reference distribution for at least some of their DOIs.&lt;/p>
&lt;p>But that’s why we are writing this post. Given the interest expressed by our members, we expect the list to start growing quickly over the next few months. Particularly now that they know they &lt;strong>can&lt;/strong> do it and have clear instructions on &lt;strong>how&lt;/strong> to do it. 🙂&lt;/p>
&lt;p>If you are of a geeky persuasion and want to see the list of publishers who are doing this, you can check via our API.&lt;/p>
&lt;p>The following query will just show you the total number of members who are distributing references for at least some of their DOIs.&lt;/p>
&lt;p >
&lt;span >https://api.crossref.org/v1/members?filter=has-public-references:true&amp;rows=0&lt;/span>
&lt;/p>
&lt;p>And this query will allow you to page through the member records and see who is distributing references.&lt;/p>
&lt;p >
&lt;span >https://api.crossref.org/v1/members?filter=has-public-references:true&lt;/span>
&lt;/p>
&lt;p>That cool, but can you see how many total DOIs have reference distribution enabled? No, but will will be adding that capability to our API soon.&lt;/p>
&lt;h2 id="omg-omg-omg-does-this-mean-i-can-get-references-from-apicrossreforg">OMG! OMG! OMG! Does this mean I can get references from api.crossref.org?&lt;/h2>
&lt;p>&lt;del>Yep. But before you get too excited- note above that not many of our members are doing this yet and that our API is still being updated to allow you to better query this information. At the moment references are not included in our JSON representation- they are only included in our XML representation. You can get the XML for a Crossref DOI either through &lt;a href="http://www.crosscite.org/cn/" target="_blank">content negotiation&lt;/a>, or by using the following incantation on our API (using an &lt;a href="https://elifesciences.org/" target="_blank">eLife&lt;/a> DOI as an example):&lt;/del>&lt;/p>
&lt;p>&lt;del>&lt;code>https://api.crossref.org/v1/works/10.7554/eLife.10288.xml&lt;/code>&lt;/del>&lt;/p>
&lt;p>&lt;del>As we update our API to better support querying DOIs that include references, you will see the new functionality reflected in our documentation at:&lt;/del>&lt;/p>
&lt;div class="shortcode-divwrap service-red">
&lt;span>&lt;strong>Yes.&lt;/strong> 🤗. See the API docs below.&lt;/span>
&lt;/div>
&lt;p>&lt;a href="https://api.crossref.org" target="_blank">&lt;code>https://api.crossref.org&lt;/code>&lt;/a>&lt;/p></description></item><item><title>Beyond the DOI to richer metadata</title><link>https://www.crossref.org/blog/beyond-the-doi-to-richer-metadata/</link><pubDate>Wed, 15 Jun 2016 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/beyond-the-doi-to-richer-metadata/</guid><description>&lt;p>&lt;span >&lt;span >The act of registering a DOI (Digital Object Identifier) for scholarly content is sometimes conflated with the notion of conferring a seal of approval or other mark of good quality upon an item of content.  &lt;/span>&lt;a href="https://www.crossref.org/blog/dois-unambiguously-and-persistently-identify-published-trustworthy-citable-online-scholarly-literature-right/">&lt;span >This is a fundamental misunderstanding&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>A DOI is a tool, not a badge of honor.&lt;/b>&lt;span >  &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >The presence of a Crossref DOI on content sends a signal that:&lt;/span>&lt;/p>
&lt;ol>
&lt;li>&lt;span >The owner of the content would like to be formally cited if the content is used in a scholarly context.&lt;/span>&lt;/li>
&lt;li>&lt;span >The owner of the content considers that it is worthy of being made persistent.&lt;/span>&lt;/li>
&lt;/ol>
&lt;p>&lt;span >&lt;b>Beyond the DOI&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;span >For Crossref, a &lt;/span>&lt;a href="http://www.doi.org/factsheets/DOIKeyFacts.html">&lt;span >DOI&lt;/span>&lt;/a>&lt;/span>&lt;span >&lt;span > is just one of several types of metadata we register, albeit an important one.&lt;/span>  &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Metadata about scholarly works extends beyond the DOI.  In addition to bibliographic details, layers of information accompanying published works may now extend to data that describes the research, such as the source of research funding.  It may also include non-descriptive information that facilitates usage, such as copyright and access permissions.&lt;/span>&lt;/p>
&lt;p>&lt;span >In fact, this “richer” metadata can tell you more about the context of the content deposited for a published work than you might realize.  &lt;/span>&lt;/p>
&lt;p>&lt;span >For example:&lt;/span>&lt;/p>
&lt;li >
&lt;span >&lt;b>Author data - &lt;/b>&lt;span >Crossref metadata may include information specifying the author’s unique &lt;/span>&lt;a href="https://web.archive.org/web/20160610063458/http://orcid.org/about/what-is-orcid/mission">&lt;span >ORCID&lt;/span>&lt;/a>&lt;span >, allowing you to find other works by the same person.  &lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;b>Copyright and access indicators - &lt;/b>&lt;span >You can view the license terms under which the full content may be available, which is very helpful for scholars who want to access the full content for research and teaching or for text and data mining.  &lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;b>Funding data - &lt;/b>&lt;span >Metadata may also include the &lt;/span>&lt;a href="https://www.crossref.org/services/funder-registry/index.html">&lt;span >identity of the grant-making institution&lt;/span>&lt;/a>&lt;span > that funded the research, so that the funder and, in the case of publicly funded research, the general public and other researchers, have visibility on the resulting research outputs.  &lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;b>Clinical Trials data - &lt;/b>&lt;span >Similarly, when research involves a clinical trial, (testing of medicines and treatments on human beings), Crossref metadata can enhance output visibility by displaying the &lt;/span>&lt;a href="https://www.crossref.org/blog/linked-clinical-trials-are-here/">&lt;span >clinical trial number and the related clinical trial registry&lt;/span>&lt;/a>&lt;span >.     &lt;/span>&lt;/span>
&lt;/li>
&lt;p>&lt;span >Like the full content they describe, these metadata have become research resources in their own right.  Unfortunately, too much metadata is entered into Crossref with missing, incomplete, or duplicated fields.  This “bad” metadata slows the pace of discovery, confounding attempts to find and understand scholarly content and its context.  &lt;/span>&lt;/p>
&lt;p>&lt;span >As a community, we really need to do something about that.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>“The Map is not the Territory”&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map.png">&lt;img class="alignright wp-image-1751" src="https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map-300x300.png" alt="scholarly-road-map" width="148" height="148" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map-300x300.png 300w, https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map-150x150.png 150w, https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map-768x768.png 768w, https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map-1024x1024.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/06/scholarly-road-map.png 1200w" sizes="(max-width: 148px) 85vw, 148px" />&lt;/a>And the metadata is not the content.  In &lt;a href="https://mitpress.mit.edu/books/metadata-0">&lt;em>Metadata&lt;/em>&lt;/a> (MIT Press), Jeffrey Pomerantz quotes Alfred Korzybski’s insight that a map is a simplified representation of a territory, a tool of abstraction that allows us to find our way.  Jennifer Lin contributed the concept of the scholarly road map as a useful metaphor for the way we use metadata about scholarly works to find our way between and among them in the digital world. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Metadata deposited with Crossref amounts to pieces of information-structured, descriptive, administrative, contextual-about published works that humans can read and machines can use to automate linking and retrieval.  The systematic development of such metadata allows us to make sense of such complex information by finding, linking, citing, and assessing scholarly content. &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >If you want to understand how Crossref acts as a map of scholarly metadata, try searching for content on &lt;/span>&lt;span >&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//?q=zika+virus">search.crossref.org&lt;/a> (our human API interface)&lt;/span>&lt;span >.  Or simply talk with us &lt;a href="https://twitter.com/CrossrefOrg">&lt;strong>@CrossrefOrg&lt;/strong>&lt;/a> and via  &lt;a href="mailto:member@crossref.org">&lt;a href="mailto:member@crossref.org">member@crossref.org&lt;/a>&lt;/a>.  &lt;/span>&lt;/span>&lt;/p></description></item><item><title>Our memories of #SSP2016</title><link>https://www.crossref.org/blog/our-memories-of-ssp2016/</link><pubDate>Wed, 08 Jun 2016 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/our-memories-of-ssp2016/</guid><description>&lt;p class="p1">
&lt;b>&lt;/b>&lt;span class="s1" >Last week a bunch of Crossref’s staff traveled to the 2016 Society for Scholarly Publishing Annual Meeting in Vancouver, BC.  After we returned en masse, all nine of us put our heads together to share some of our personal memories of the event.   &lt;/span>
&lt;/p>&lt;figure id="attachment_1732" class="wp-caption alignright">
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_.jpg">&lt;img class="wp-image-1732 " src="https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_-240x300.jpg" alt="Enjoying-the-High-Wire-Run-Walk-at-SSP2016_" width="314" height="393" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_-240x300.jpg 240w, https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_-768x959.jpg 768w, https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_-820x1024.jpg 820w, https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_-1200x1499.jpg 1200w, https://www.crossref.org/wp/blog/uploads/2016/06/Enjoying-the-High-Wire-Run-Walk-at-SSP2016_.jpg 1500w" sizes="(max-width: 314px) 85vw, 314px" />&lt;/a>&lt;figcaption class="wp-caption-text">&lt;span >Crossref’s Rosa and Susan at the Fun Walk/Run sponsored by High Wire. 5K before breakfast!&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&lt;b>On &lt;em>Cybersecurity and the Scholarly World&lt;/em> —&lt;/b>“The session described the many and complicated security threats that IT systems face and how threat detection and defense is a constantly ongoing activity. Certainly system administrators are challenged with the technology issues that build firewalls, block intrusions and divert disruptive activity. But perhaps even more important are the social issues that must be managed to develop an informed user community that is immune to the less technical but probably more effective hacks like phishing for user passwords.”&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&lt;b>On &lt;em>Persistent Identifiers in Scholarly Communications: What, Why, How, Where, and Who? &lt;/em>&lt;/b>“Everyone from Crossref loved this panel, which should come as no surprise (wink).  Persistent identifiers such as DOIs and ORCID iDs enable machine and human readers to discover, cite, link, and correctly attribute works across different platforms.  David Crotty of the Oxford University Press said it best with &amp;#8216;If you’re not actively building these persistent identifiers into your systems, get busy!’ Alice Meadows of ORCID represented the scholarly communications infrastructure with an image of shiny copper plumbing - don’t tell me we don’t have glamorous jobs!  Laura Rueda of DataCite had particularly helpful diagrams to explain how persistent identifiers ease and speed the workflow of a research object as it travels from researcher to publisher to the greater community.” &lt;/span>
&lt;/p>
&lt;p class="p4">
&lt;span >&lt;span class="s3">&lt;b>On&lt;/b> &lt;em>&lt;b>Crossing Boundaries: Encouraging Diversity in Scientific Communication&lt;/b>&lt;/em> with &lt;/span>&lt;span class="s1">Dr. Margaret-Ann Armour&lt;i> — &lt;/i>“I decided to attend this keynote when I saw that men as well as women were in the audience.  Dr. Armour had great anecdotes that supported formal data on women’s roles in STEM.  It made me reflect on how the path to a career in scholarly publishing is often not direct, and relies on personal networking.  She was very witty and deserved her standing ovation.”&lt;/span>&lt;/span>
&lt;/p>
&lt;p class="p4">
&lt;span >&lt;span class="s1">&lt;b>On &lt;i>Standards and Recommended Practices to Support Adoption of Altmetrics&lt;/i> —  &lt;/b>“Todd Carpenter summed up the intent behind many altmetrics initiatives when he said that understanding how many people are using and reading scholarly content is important because &amp;#8216;we all want to know how we’re doing’ but &amp;#8216;this project should never become the number’ because the intent is about ‘trying to add flavor and nuance to the conversation in a meaningful way’.  Stuart Maxwell of Scholarly IQ also made a really astute observation that “all assessment is in some way subjective - impact is relative to how you compare yourself to other researchers in your field.” What especially appealed to me about this session was learning that NISO extends its remit to include the data quality performance of altmetrics aggregators themselves.  Asking each aggregator to self-report a publicly available, annual accounting of how they comply with the &lt;a href="https://www.niso.org/publications/rp-25-2016-altmetrics">Altmetrics Data Quality Code of Conduct&lt;/a> will likely increase consistency, transparency and trust.&amp;#8221; &lt;/span>&lt;span class="s1"> &lt;/span>&lt;/span>
&lt;/p>
&lt;h3 class="p1" id="span-classs1-bssp-receptions--evening-events-where-mashed-potato-sundaes-were-a-thingbspan">&lt;span class="s1" >&lt;b>SSP receptions &amp;amp; evening events, where mashed potato sundaes were a thing&lt;/b>&lt;/span>&lt;/h3>
&lt;p class="p1">
&lt;span class="s1" >Yes, the sessions are great, but some of the really interesting sights, sounds and discussions occur at the evening events. It’s impossible for one person to cover all of them (or is it?), but our idea of a few memorable highlights from this year’s SSP are, in no particular order:&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&amp;#8220;Tuesday’s reception—bar conveniently located just steps from the Crossref booth meant lots of good traffic! The convivial atmosphere made it easy to ignore that we were all tantalizingly close to the glorious view just outside the hotel doors. Wednesday’s reception was a chance to meet all the folks who didn’t make it in Tuesday. Though it seems most of us were delayed arriving in Vancouver, it was well worth the trip and arriving to find a few hundred colleagues all enjoying happy hour is a fine way to start a meeting.&amp;#8221;&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&amp;#8220;HighWire’s reception at the Vancouver Rowing Club provided a lovely walk on the way there, a great band at the party and a shrimp tower almost (but not quite) too good looking to eat. The pouring rain on the walk back made for a memorable bonding experience.&amp;#8221;&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&amp;#8220;Wildebeest was the atmospheric site of the Silverchair reception and great chance to see a bit of downtown before enjoying some good cheese and fine company. At least two of us attending made plans to save the world through better metadata. Over sparkling rose wine no less.&amp;#8221;&lt;/span>
&lt;/p>
&lt;p class="p7">
&lt;span >&lt;span class="s5">&lt;br /> &lt;a href="https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1.jpeg">&lt;img class=" wp-image-1736 alignleft" src="https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1-300x225.jpeg" alt="Sheridan-at-Vancouver-Aquarium" width="360" height="270" srcset="https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1-300x225.jpeg 300w, https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1-768x576.jpeg 768w, https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1-1024x768.jpeg 1024w, https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1-1200x900.jpeg 1200w, https://www.crossref.org/wp/blog/uploads/2016/06/Sheridan-at-Vancouver-Aquarium-1.jpeg 1227w" sizes="(max-width: 360px) 85vw, 360px" />&lt;/a>&amp;#8220;Dolphins and sea otters made merry in a pool outside the Sheridan Group reception at the Vancouver Aquarium, while we noshed and drank with the fishes inside.  But the food rivalled the undersea sights. &lt;/span>&lt;span class="s1">A very nice gentleman with an ice cream scoop filled a parfait glass with a perfectly round dollop of mashed potatoes and told me to help myself to toppings. Shut the front door! I got the works.  Delicious creamy mashed (whipped) potatoes of a perfect consistency, a ladle full of warm brown gravy topped with a generous sprinkle of finely sliced green onions (scallions), and a healthy spray of large, crispy bacon pieces!!  It looked like a sundae … that you eat with a fork!!&amp;#8221;&lt;br /> &lt;/span>&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >&amp;#8220;The President’s reception was in the world’s largest hotel suite (approximately), with some very photogenic desserts and a lot of happy people who know that it’s well worth sacrificing some sleep for the event.&amp;#8221;&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span class="s1" >Of course, the hotel bar in the evenings had some memorable discussions too but what happens in the bar stays in the bar, right? And we should probably all be grateful for the early last call …&lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;span >&lt;strong>&lt;span class="s1">’Til next year!&lt;/span>&lt;/strong>&lt;/span>
&lt;/p></description></item><item><title>HTTPS and Wikipedia</title><link>https://www.crossref.org/blog/https-and-wikipedia/</link><pubDate>Tue, 31 May 2016 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/https-and-wikipedia/</guid><description>&lt;p>&lt;span >&lt;em>This is a joint blog post with Dario Taraborelli, coming from &lt;a href="https://meta.wikimedia.org/wiki/WikiCite_2016">WikiCite 2016&lt;/a>.&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >In 2014 we were taking our first steps along the path that would lead us to &lt;a href="http://eventdata.crossref.org">Crossref Event Data&lt;/a>. At this time I started looking into the DOI resolution logs to see if we could get any interesting information out of them. This project, which became &lt;a href="https://www.crossref.org/blog/introducing-chronograph/">Chronograph&lt;/a>, showed which domains were driving traffic to Crossref DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >You can read about the latest results from this analysis in the &lt;a href="https://www.crossref.org/blog/where-do-doi-clicks-come-from/">“Where do DOI Clicks Come From”&lt;/a> blog post.&lt;/span>&lt;/p>
&lt;p>&lt;span >Having this data tells us, amongst other things:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >where people are using DOIs in unexpected places&lt;/span>&lt;/li>
&lt;li>&lt;span >where people are using DOIs in unexpected ways&lt;/span>&lt;/li>
&lt;li>&lt;span >where we knew people were using DOIs but the links are more popular than we realised&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >By the time the &lt;a href="http://www.lagotto.io/workshop_2014/">ALM Workshop 2014&lt;/a> rolled around there was some preliminary data and we realised that Wikipedia came into the third category. There are lots of DOIs in Wikipedia and people click them!&lt;/span>&lt;/p>
&lt;p>&lt;span >I met with Dario Taraborelli, head of research at the Wikimedia Foundation, and shared the data. Dario — who co-authored in 2010 the Altmetrics Manifesto — has been interested in understanding how scholarly citations are used in Wikipedia. Over the years, Wikipedia contributors have made extensive use of references to the scientific literature using DOIs, and by doing so they have created a resource that represents today in many ways the &lt;a href="https://meta.wikimedia.org/wiki/Wikipedia_as_the_front_matter_to_all_research">“front matter to all research”&lt;/a>. There is growing interest in the community in understanding how DOIs are being used in Wikipedia and in non traditional scholarship.&lt;/span>&lt;/p>
&lt;p>&lt;span >During our discussions the subject of Wikipedia’s gradual transition to HTTPS was raised: we anticipated that this change would affect our data gathering.&lt;/span>&lt;/p>
&lt;h2 id="span-changesspan">&lt;span >Changes&lt;/span>&lt;/h2>
&lt;p>&lt;span >When you’re reading webpage and click on a link to another page, your web browser will usually tell the server of that second page the last page you were on. This forms the basis of trackers like Google Analytics.&lt;/span>&lt;/p>
&lt;p>&lt;span >In the days before HTTPS, the next site would know the full URL that you were previously on. With the change to HTTPS, this was reduced to just sending the domain name and not the full URL, or no data at all if you click from an HTTPS page to HTTP.&lt;/span>&lt;/p>
&lt;p>&lt;span >DOI hyperlinks are just like any other hyperlink, and are mostly HTTP not HTTPS.&lt;/span>&lt;/p>
&lt;p>&lt;span >Up until 2015, Wikipedia was served over HTTP, only switching to HTTPS when users were logged in or if they requested it. The Wikimedia Foundation started planning to move to HTTPS and we knew that if they did that, and continued to use HTTP DOIs then we would lose valuable research data.&lt;/span>&lt;/p>
&lt;h2 id="span-a-planspan">&lt;span >A Plan&lt;/span>&lt;/h2>
&lt;p>&lt;span >We decided that the best course of action was to try and change the DOIs in Wikipedia to use HTTPS. Simple, right?&lt;/span>&lt;/p>
&lt;p>&lt;span >After some further research, Dario &lt;a href="https://meta.wikimedia.org/wiki/Research:Wikimedia_referrer_policy">posted a proposal&lt;/a> on how to mitigate the impact of the HTTPS rollout, to make sure that Wikipedia can still signal its importance as a traffic source, while preserving the privacy of its users. &lt;a href="https://meta.wikimedia.org/wiki/Research_talk:Wikimedia_referrer_policy">Discussion followed&lt;/a> and the conclusion was to change the format of every single DOI on Wikipedia, which fortunately could be done without having to edit millions of pages. You can read the full story in &lt;a href="https://www.crossref.org/blog/real-time-stream-of-dois-being-cited-in-wikipedia/">this post from a year ago&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >The result of this effort was that well in advance of the HTTPS switchover, the DOI links were ready to continue reporting referral data.&lt;/span>&lt;/p>
&lt;h2 id="span-the-switchspan">&lt;span >The Switch&lt;/span>&lt;/h2>
&lt;p>&lt;span >In June 2015 the Wikimedia foundation made the &lt;a href="http://blog.wikimedia.org/2015/06/12/securing-wikimedia-sites-with-https/">announcement that they were finalising the switch&lt;/a>, and that within a few weeks all traffic would be HTTPS.&lt;/span>&lt;/p>
&lt;p>&lt;span >We held our breath. Would it work? Would we lose all referral data from Wikipedia sites? In February 2016 &lt;a href="https://phabricator.wikimedia.org/T99174#2053812">the last piece of the puzzle fell into place&lt;/a> as Wikipedia gained a ‘meta referrer’ tag to explicitly specify how they would like referrers to be sent: a detailed report on the effect of this change is coming up on the Wikimedia Foundation’s blog.&lt;/span>&lt;/p>
&lt;h2 id="span-the-resultsspan">&lt;span >The results&lt;/span>&lt;/h2>
&lt;p>&lt;span >As detailed in &lt;a href="https://www.crossref.org/blog/where-do-doi-clicks-come-from/">the last blog post&lt;/a> the traffic that we measured coming from Wikipedia doesn’t seem to have slowed down during 2015:&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/05/month-top-10-filtered-domains-1.png" alt="month-top-10-filtered-domains" class="img-responsive" />
&lt;p>&lt;span >I’d call that a success! Over the period covered in the graph, Wikipedia remained prominent as a non-publisher referral of traffic to DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >Looking at the balance of HTTP vs HTTPS traffic coming from wikipedia.org, the switchover was dramatic:&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/05/day-code-area.png" alt="day-code-area" class="img-responsive" />
&lt;p>&lt;span >Thank you to Dario Taraborelli, Nemo (Federico Leva), Aaron Halfaker, Alex Stinson and everyone who put in this effort.&lt;/span>&lt;/p>
&lt;p>&lt;span >I’ll leave the last word to Dario:&lt;/span>&lt;/p>
&lt;p>&lt;span >It’s great to see this data. It shows that the switchover happened successfully, which better protects the privacy of our users whilst still reporting the fact that Wikipedia is a prominent source of traffic. This is important validation of the increasing role that Wikipedia plays in the education and scientific community.&lt;/span>&lt;/p></description></item><item><title>Where do DOI clicks come from?</title><link>https://www.crossref.org/blog/where-do-doi-clicks-come-from/</link><pubDate>Thu, 19 May 2016 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/where-do-doi-clicks-come-from/</guid><description>&lt;p>As part of our &lt;a href="http://eventdata.crossref.org" target="_blank">Event Data&lt;/a> work we’ve been investigating where DOI resolutions come from. A resolution could be someone clicking a DOI hyperlink, or a search engine spider gathering data or a publisher’s system performing its duties. Our server logs tell us every time a DOI was resolved and, if it was by someone using a web browser, which website they were on when they clicked the DOI. This is called a referral.&lt;/p>
&lt;p>This information is interesting because it shows not only where DOI hyperlinks are found across the web, but also when they are actually followed. This data allows us a glimpse into scholarly citation beyond references in traditional literature.&lt;/p>
&lt;p>Last year Crossref Labs &lt;a href="https://www.crossref.org/blog/introducing-chronograph/">announced Chronograph&lt;/a>, an experimental system for browsing some of this data. We’re working toward a new version, but in the meantime I’d like to share the results for 2015 and some of 2016. We have filtered out domains that belong to Crossref member publishers to highlight citations beyond traditional publications.&lt;/p>
&lt;h2 id="top-10-doi-referrals-from-websites-in-2015">Top 10 DOI referrals from websites in 2015&lt;/h2>
&lt;p>This chart shows the top 10 referring non-primary-publisher domains of DOIs per month. Note that if browsers don’t send the referrer (e.g. from an HTTPS page), we don’t get to find out. Because the top 10 can be different month to month, the total number of domains mentioned can be more than 10. Subdomains are combined, which means that, for example, the wikipedia.org entry covers all Wikipedia languages. This chart covers all of 2015 and the first two months of 2016.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/05/month-top-10-filtered-domains-1.png" alt="month-top-10-filtered-domains" class="img-responsive" />
&lt;p>The top 10 referring domains for the period:&lt;/p>
&lt;ol>
&lt;li>webofknowledge.com&lt;/li>
&lt;li>baidu.com&lt;/li>
&lt;li>serialssolutions.com&lt;/li>
&lt;li>scopus.com&lt;/li>
&lt;li>exlibrisgroup.com&lt;/li>
&lt;li>wikipedia.org&lt;/li>
&lt;li>google.com&lt;/li>
&lt;li>uni-trier.de&lt;/li>
&lt;li>ebsco.com&lt;/li>
&lt;li>google.co.uk&lt;/li>
&lt;/ol>
&lt;p>It’s not surprising to see some of these domains here: for example serialssolutions.com and exlibrisgroup.com are effectively proxies for link resolvers, Baidu and Google are incredibly popular search engines which would show up anywhere. But it is exciting to see Wikipedia ranked amongst these. For more detail look out for the new Chronograph.&lt;/p>
&lt;h2 id="http-vs-https-in-2015">HTTP vs HTTPS in 2015&lt;/h2>
&lt;p>We’ve also seen a steady increase in HTTPS referral traffic, i.e. people clicking on DOIs from sites that are using HTTPS. While it is still dwarfed by HTTP, there was a steady uptick throughout 2015.&lt;/p>
&lt;p>This chart shows HTTP vs HTTPS referrals per day, which shows up the weekly spikes. It doesn’t include resolutions where we don’t know the referrer.&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/05/day-code.png" alt="HTTP vs HTTPS DOI Referrals" class="img-responsive"/>
&lt;p>Increasing numbers of people are moving to HTTPS for reasons of security, privacy and protection from tampering. &lt;a href="https://webmasters.googleblog.com/2014/08/https-as-ranking-signal.html" target="_blank">Google has announced plans&lt;/a> to take HTTPS into account when ranking search results. Wikipedia has moved exclusively to HTTPS, and I’ll be telling the story of how Crossref and Wikipedia collaborated in an upcoming blog post.&lt;/p>
&lt;h2 id="chronograph">Chronograph&lt;/h2>
&lt;p>Another version of Chronograph will be available soon. It will contain full data for all non-primary-publisher referring domains. Stay tuned!&lt;/p></description></item><item><title>Clinical trial data and articles linked for the first time</title><link>https://www.crossref.org/blog/linked-clinical-trials-are-here/</link><pubDate>Tue, 17 May 2016 00:00:00 +0000</pubDate><author>Daniel Shanahan</author><guid>https://www.crossref.org/blog/linked-clinical-trials-are-here/</guid><description>&lt;p>&lt;span >It’s here. After years of hard work and with a huge cast of characters involved, I am delighted to announce that you will now be able to instantly link to all published articles related to an individual clinical trial through the Crossmark dialogue box. Linked Clinical Trials are here!&lt;/span>&lt;/p>
&lt;p>&lt;span >In practice, this means that anyone reading an article will be able to pull a list of both clinical trials relating to that article and all other articles related to those clinical trials – be it the protocol, statistical analysis plan, results articles or others – all at the click of a button.&lt;/span> &lt;figure id="attachment_1644" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/05/crossmark_example-2_720.jpg">&lt;img class="wp-image-1644 size-medium" src="https://www.crossref.org/wp/blog/uploads/2016/05/crossmark_example-2_720-300x286.jpg" width="300" height="286" srcset="https://www.crossref.org/wp/blog/uploads/2016/05/crossmark_example-2_720-300x286.jpg 300w, https://www.crossref.org/wp/blog/uploads/2016/05/crossmark_example-2_720.jpg 720w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">Linked Clinical Trials interface&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Now I’m sure you’ll agree that this sounds nifty. It’s definitely a ‘nice-to-have’. But why was it worth all the effort? Well, simply put: “to move a mountain, you begin by carrying away the small stones”.&lt;/span>&lt;/p>
&lt;p>&lt;span >Science communication in its current form is an anachronism, or at the very least somewhat redundant.&lt;/span>&lt;/p>
&lt;p>&lt;span >You may have read about the &lt;a href="http://www.apa.org/monitor/2015/10/share-reproducibility.aspx">‘crisis in reproducibility’&lt;/a>. Good science, at its heart, should be testable, falsifiable and reproducible, but an historical over-emphasis on results has led to a huge number of problems that seriously undermine the integrity of the scientific literature.&lt;/span>&lt;/p>
&lt;p>&lt;span >Issues such as publication bias, selective reporting of outcome and analyses, hypothesising after the results are known (HARKing) and p-hacking are widespread, and can seriously distort the literature base (unless anyone seriously considers &lt;a href="http://tylervigen.com/spurious-correlations">Nicholas Cage to be causally related to people drowning in swimming pools&lt;/a>).&lt;/span>&lt;/p>
&lt;p>&lt;span >This is, of course, nothing new. Calls for prospective registration of clinical trials &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/3760920">date back to the 1980s&lt;/a> and it is now becoming increasingly commonplace, recognising that the quality of research lies in the questions it asks and the methods it uses, not the results observed.&lt;/span>&lt;figure id="attachment_1581" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Trial-registration.jpg">&lt;img class="wp-image-1581" src="https://www.crossref.org/wp/blog/uploads/2016/04/Trial-registration.jpg" alt="Uptake of trial registration since 2000" width="600" height="350" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/Trial-registration.jpg 868w, https://www.crossref.org/wp/blog/uploads/2016/04/Trial-registration-300x175.jpg 300w, https://www.crossref.org/wp/blog/uploads/2016/04/Trial-registration-768x448.jpg 768w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 984px) 61vw, (max-width: 1362px) 45vw, 600px" />&lt;/a>&lt;figcaption class="wp-caption-text">Uptake of trial registration year-on-year since 2000&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Building on this, a number of journals and funders – starting with BioMed Central’s &lt;em>Trials&lt;/em> &lt;a href="http://trialsjournal.biomedcentral.com/articles/10.1186/1468-6708-6-15">over 10 years ago&lt;/a> – have also pushed for the prospective publication of a study’s protocol and, more recently, statistical analysis plan. The idea that null and non-confirmatory results have value and should be published has also gained increasing support.&lt;/span>&lt;/p>
&lt;p>&lt;span >Over the last ten years, there has been a general trend towards increasing transparency. So what is the problem? Well, to borrow an analogy from Jeremy Grimshaw, co-Editor-in-Chief of &lt;a href="http://trialsjournal.biomedcentral.com/">&lt;em>Trials&lt;/em>&lt;/a> – we’ve gone from &lt;a href="http://blogs.biomedcentral.com/on-medicine/2014/05/30/the-consort-statement-in-2014/">Miró to Pollock&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >Although a results paper may reference a published study protocol, there is nothing to link that report to subsequent published articles; and no link from the protocol itself to the results article.&lt;/span>&lt;/p>
&lt;p>&lt;span >A &lt;a href="http://trialsjournal.biomedcentral.com/articles/10.1186/1745-6215-15-369">single clinical trial can result in multiple publications&lt;/a>: the study protocol and traditional results paper or papers, as well as commentaries, secondary analyses and, eventually, systematic reviews, among others, many published in different journals, years apart. This situation is further complicated by an ever-growing body of literature.&lt;/span>&lt;/p>
&lt;p>&lt;span >Researchers need access to all of these articles if they are to reliably evaluate bias or selective reporting in a research object, but – as any systematic reviewer can tell you – actually finding them all is like looking for a needle in a haystack. When you don’t know how many needles there are. With the haystack still growing.&lt;/span>&lt;/p>
&lt;p>&lt;span >That’s where we come in. The advent of trial registration means that there is a unique identifier associated with every clinical trial, at the study-level, rather than the article level. Building on this, the &lt;a href="http://blogs.biomedcentral.com/on-medicine/2014/01/31/threaded-publications-one-step-closer/">Linked Clinical Trials project&lt;/a> set out to connect all articles relating to an individual trial together using its trial registration number (TRN).&lt;/span>&lt;/p>
&lt;p>&lt;span >By adapting the existing Crossmark standard, we have captured additional metadata about an article, namely the TRN and the trial registry, with this information then associated with the article’s DOI on publication. This means that you will be able to pull all articles related to an individual clinical trial from the Crossmark dialogue box on any relevant article. &lt;/span>&lt;/p>
&lt;p>&lt;span >This obviously has huge implications for the way science is reported and used. By quickly and easily linking to related published articles, it will enable editors, reviewers and researchers to evaluate any selective reporting in the study, and help to provide far greater context for the results.&lt;/span>&lt;/p>
&lt;p>&lt;span >As all the metadata will be open access (CC0), with no copyright, it will also be possible to access this article ‘thread’ through the Crossref Metadata Search, or independently through an application programming interface (API). This provides a platform for others to build on, with many already looking to take the next step, such as Ben Goldacre’s new &lt;a href="http://trialsjournal.biomedcentral.com/articles/10.1186/s13063-016-1290-8">Open Trials initiative&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >However, in order for this to work, we must capture as many articles and trials as possible to create a truly comprehensive thread of publications. We currently have data from the NIHR Libraries, PLoS and, of course, BioMed Central, but need more publishers and journals to join us in depositing clinical trial metadata. After all, without metadata, this is all merely wishful thinking.&lt;/span>&lt;/p>
&lt;p>&lt;span >Let’s hope we’re the pebble that starts the landslide.&lt;/span>&lt;/p></description></item><item><title>Members will soon be able to assign Crossref DOIs to preprints</title><link>https://www.crossref.org/blog/members-will-soon-be-able-to-assign-crossref-dois-to-preprints/</link><pubDate>Thu, 05 May 2016 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/members-will-soon-be-able-to-assign-crossref-dois-to-preprints/</guid><description>&lt;h2 id="span-strongtldrstrongspan">&lt;span >&lt;strong>TL;DR&lt;/strong>&lt;/span>&lt;/h2>
&lt;p>&lt;span >By August 2016, Crossref will enable its members to assign Crossref DOIs to preprints. Preprint DOIs will be assigned by the Crossref member responsible for the preprint and that DOI will be different from the DOI assigne&lt;/span>&lt;span >d by the publisher to the accepted manuscript and version of record. Crossref’s display guidelines, tools and APIs will be modified in order to enable researchers to easily identify and link to the best available version of a document (BAV). We are doing this in order to support the changing publishing models of our members and in order to clarify the scholarly citation record.&lt;/span>&lt;/p>
&lt;h3 id="span-bbackgroundbspan">&lt;span >&lt;b>Background&lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Why is this news? Well, to understand that you need to know a little Crossref history.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;i>&lt;span >(cue music and fade to sepia) &lt;/span>&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;br /> &lt;a href="https://www.crossref.org/wp/blog/uploads/2016/05/cue-music-fade-to-sepia.jpg">&lt;img class="alignright wp-image-1606" src="https://www.crossref.org/wp/blog/uploads/2016/05/cue-music-fade-to-sepia.jpg" alt="ukelele memory" width="283" height="425" srcset="https://www.crossref.org/wp/blog/uploads/2016/05/cue-music-fade-to-sepia.jpg 800w, https://www.crossref.org/wp/blog/uploads/2016/05/cue-music-fade-to-sepia-200x300.jpg 200w, https://www.crossref.org/wp/blog/uploads/2016/05/cue-music-fade-to-sepia-768x1152.jpg 768w, https://www.crossref.org/wp/blog/uploads/2016/05/cue-music-fade-to-sepia-683x1024.jpg 683w" sizes="(max-width: 283px) 85vw, 283px" />&lt;/a>When Crossref was founded, one of its major goals was to clarify the scholarly record by uniquely identifying formally published scholarly content on the web so that it could be cited precisely. At the time, our members had two primary concerns:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;span >That a Crossref DOI should point to one intellectually discrete scholarly document. That is, they did not want one Crossref DOI to be assigned to two documents that appeared largely similar, but which might vary in intellectually significant ways.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span >That two DOIs should not point to the same intellectually discrete document. They wanted it to be easy for all to tell when the same discrete intellectual content was cited.&lt;/span>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;span >As such, when Crossref was founded, we developed a complex set of rules that were colloquially known by our members as Crossref’s rules “prohibiting the assignment of DOIs to duplicative content.”&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;i>&lt;span >(cue music, show wavy lines, return to color)&lt;/span>&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;span >Well… as we gained experience in assigning DOIs, many of these rules have been amended or discarded when it became apparent that they didn’t actually support common scholarly citation practice and/or otherwise muddied the scholarly citation record.&lt;/span>&lt;/p>
&lt;p>&lt;span >For example, sometimes a document will be re-published in a special issue or an anthology. Before the advent of the DOI, it was common citation practice to always cite a document in the context in which it was read. The context of the document could, after all, affect the interpretation or crediting of the work. But it would be impossible to support this common citation practice if we were to assign the same Crossref DOI to the article on both its original context and in its re-published form. Our current recommendation in these situations is to assign separate DOIs to content that is republished in another context.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Another example occurs when a particular copy of a two identical documents has been annotated. For example, though the &lt;/span>&lt;i>&lt;span >Handbook to The birds of Australia&lt;/span>&lt;/i>&lt;span > By John Gould has its own Crossref DOI (&lt;a href="http://doi.org/10.5962/bhl.title.8367" target="_blank">http://doi.org/10.5962/bhl.title.8367&lt;/a>), another copy of the same book has been hand-annotated by &lt;/span>&lt;a href="https://en.wikipedia.org/wiki/Charles_Darwin">&lt;span >Charles Darwin&lt;/span>&lt;/a>&lt;span > and &lt;/span>&lt;i>&lt;span >also&lt;/span>&lt;/i>&lt;span > has its own, different Crossref DOI (&lt;a href="http://doi.org/10.5962/bhl.title.50403" target="_blank">http://doi.org/10.5962/bhl.title.50403&lt;/a>). Historians of science quite reasonably may want to refer and cite the particular annotated copy of this historic document.&lt;/span>&lt;/span>&lt;/p>
&lt;p>__&lt;span >&lt;i>&lt;span >[So much for not assigning two separate Crossref DOIs to identical documents.]&lt;/span>&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;span >Finally, we should note a far more common example practice in our industry. Our members often make content available online with a Crossref DOI before they consider it to be formally published. This practice goes by a number of names including “publish ahead of print,” “article in progress,” “article in press,” “online ahead of print,” “online first”, etc.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >But in each case, the process is the same- the publisher is assigning a Crossref DOI to the document soon after it has been accepted for publication and this &lt;/span>&lt;i>&lt;span >same&lt;/span>&lt;/i>&lt;span > Crossref DOI is carried over to the finally published article. Again, this practice just reflects that the “intellectual” content of the accepted manuscript should not change between the point of acceptance and the point of publication, so of the purposes of “citation” they are largely interchangeable.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;i>&lt;span >[So much for not assigning one Crossref DOI to two versions of the same document.]&lt;/span>&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;span >Now, in the above cases it also helps to clarify the scholarly record to also specify that the respective Crossref DOIs of the original and the “duplicative” work are related, and we encourage our members to make these connections explicit when they can. Nonetheless, it is paramount in both cases to allow the “duplicative works” to be cited precisely and independently.&lt;/span>&lt;/p>
&lt;p>&lt;span >Which brings us back to preprints.&lt;/span>&lt;/p>
&lt;h3 id="span-bthe-case-for-preprintsbspan">&lt;span >&lt;b>The case for preprints&lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >First we should define what was meant by preprints because even this commonly used term sometimes means different things to different communities. We have historically considered preprints to be any version of a manuscript that is intended for publication but that has not yet been submitted to a publisher for formal review. Note that this definition does not include “accepted manuscripts” which -as we noted above- often already have Crossref DOIs assigned to them soon after acceptance.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref members originally worried that, by assigning DOIs to preprints, we would end up muddying the scholarly record. They worried that the very presence of a Crossref DOI would be interpreted to mean that the content to which it had been applied had gone through a formal publishing process. And unlike the case with “accepted manuscripts”, the difference between intellectual content of a preprint and the final published version can sometimes be substantial. At the time, it seemed that the scholarly record would be clarified by prohibiting the assignment of DOIs to preprints.&lt;/span>&lt;/p>
&lt;p>&lt;span >But again, changes in the scholarly communication landscape have led us to -as the youngsters say- pivot.&lt;/span>&lt;/p>
&lt;h3 id="span-ba-koanbspan">&lt;span >&lt;b>A Koan&lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >&lt;i>&lt;span >When is a preprint a preprint?&lt;/span>&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/05/contemplative-hand.jpg">&lt;img class="alignleft size-medium wp-image-1609" src="https://www.crossref.org/wp/blog/uploads/2016/05/contemplative-hand-200x300.jpg" alt="contemplative hand" width="200" height="300" srcset="https://www.crossref.org/wp/blog/uploads/2016/05/contemplative-hand-200x300.jpg 200w, https://www.crossref.org/wp/blog/uploads/2016/05/contemplative-hand.jpg 577w" sizes="(max-width: 200px) 85vw, 200px" />&lt;/a>Crossref has always been &lt;/span>&lt;a href="https://www.google.co.uk/?ion=1&amp;espv=2#q=define:catholic">&lt;span >catholic&lt;/span>&lt;/a>&lt;span > in its definition of “publisher.” Many of our members do not consider “publishing” to be their primary mission. The OECD and World Bank are two obvious cases here. But our membership also includes government departments, universities and archives. In these latter cases they have traditionally assigned Crossref DOIs to things like internal reports, grey literature, working papers, etc. This activity was clearly within the original rules set out by Crossref. And this is where our koan comes into play- “when is a preprint a preprint?”&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >It is often difficult to predict when something &lt;/span>&lt;i>&lt;span >might&lt;/span>&lt;/i>&lt;span > eventually be formally published. How do you &lt;/span>&lt;a href="https://www.google.co.uk/?ion=1&amp;espv=2#q=define:a+priori">&lt;i>&lt;span >a priori&lt;/span>&lt;/i>&lt;/a>&lt;span > know that working paper will never be submitted for publication? After all, &lt;/span>&lt;i>&lt;span >everything&lt;/span>&lt;/i>&lt;span > could potentially be submitted for publication (Sometimes it seems everything is.)&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >This is the dilemma that was faced by a few of our members. For example, Cold Spring Harbor Laboratory, which runs &lt;/span>&lt;a href="http://biorxiv.org/">&lt;span >bioRxiv&lt;/span>&lt;/a>&lt;span > has been a Crossref member since 2000 and has assigned over 35,000 Crossref DOIs. They have been assiduous in trying to stick to Crossref’s rules about preprints. Furthermore, they have taken equal care to ensure that preprints in bioRxiv are labeled as such and linked to the final publication (via a Crossref journal DOI) when it is available. This takes a lot of work. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >But often bioRxiv simply has no way of telling when the authors of a working paper or report might suddenly decide to submit their work for publication. So they have found themselves occasionally and inadvertently violating Crossref’s rules on preprints because they had no way of predicting when something would magically transform from being an innocuous working paper into a fraught preprint.&lt;/span>&lt;/p>
&lt;p>&lt;span >It is a testament to bioRxiv that they have persevered. We have other members who face the same problem. They have not given up. They have not gone elsewhere for their DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >Which brings us to our next point.&lt;/span>&lt;/p>
&lt;h3 id="span-bnot-all-doisbspan">&lt;span >&lt;b>Not All DOIs&lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >&lt;span >Have you noticed how often we use the phrase “Crossref DOIs?” Were you wondering if this was an annoying affectation or an example of a marketing department gone mad? It’s neither. It is an essential distinction that we make because Crossref is just one of &lt;/span>&lt;a href="https://www.doi.org/registration_agencies.html">&lt;span >several DOI registration agencies&lt;/span>&lt;/a>&lt;span >. Although &lt;/span>&lt;i>&lt;span >all DOIs&lt;/span>&lt;/i>&lt;span > are “compatible” in the minimal sense that you can “resolve” them to a location on the web, that does not mean that all DOIs work identically. Different DOI registration agencies have different constituencies, different services, different governance models and different rules covering what their members can assign their respective DOIs to.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >This was not the case when Crossref was founded and our rules were first drafted. At the time, Crossref was the &lt;/span>&lt;i>&lt;span >only&lt;/span>&lt;/i>&lt;span > registration agency and, as such, the rule which prohibited the assignment of Crossref DOIs to preprints kinda worked. But it was unworkable in the longer term.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Quite naturally, new DOI registration agencies have been established for different communities with different primary use-cases. While Crossref could have a rule prohibiting the assignment of Crossref DOIs to preprints, there was nothing stopping another registration agency from allowing (indeed, encouraging) its members to assign DOIs to preprints.&lt;/span>&lt;/p>
&lt;p>&lt;span >So the simple fact is that DOIs could be assigned to preprints regardless of Crossref’s old rules. By continuing to prohibit the practice at Crossref we were just making life for some of our existing members more difficult.&lt;/span>&lt;/p>
&lt;p>&lt;span >And it has become clear that the situation would only get worse as more of our members started to roll-out new publishing and business models.&lt;/span>&lt;/p>
&lt;h3 id="span-bbusiness-model-neutral-bspan">&lt;span >&lt;b>Business model neutral  &lt;/b>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Crossref has always been business model neutral. We need to adapt and change to support our members’ business models, not the other way around.&lt;/span>&lt;/p>
&lt;p>&lt;span >A number of our members are starting to adopt publishing workflows that are more fluid and public than established publishing models. These new workflows make much of the submission and review process open, which, in turn often blurs the historically hard distinctions between a draft manuscript, a preprint, a revised proof, an accepted manuscript, the “final” published version, and subsequent corrections and updates. Where as in classic publishing models a document went through a series of discrete state-changes (some in public, many in private) new publishing workflows treat document versions as a continuum, most of which are made available publicly and which consequently may be used cited at almost any point in the publishing process.&lt;/span>&lt;/p>
&lt;p>&lt;span >In short, Crossref’s members increasingly need the flexibility to assign DOIs at different points in the publishing lifecycle. Rather than enforce rules that enshrined an existing publishing or business model, we need to work with our members to establish and adopt new DOI assignment practices which support evolving publishing models whilst maintaining a clear citation record and which lets researchers easily identify the best available version (BAV) of a document or research object.&lt;/span>&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/05/flinty-exterior.jpg">&lt;img class="alignright size-medium wp-image-1615" src="https://www.crossref.org/wp/blog/uploads/2016/05/flinty-exterior-200x300.jpg" alt="flinty-exterior" width="200" height="300" srcset="https://www.crossref.org/wp/blog/uploads/2016/05/flinty-exterior-200x300.jpg 200w, https://www.crossref.org/wp/blog/uploads/2016/05/flinty-exterior.jpg 577w" sizes="(max-width: 200px) 85vw, 200px" />&lt;/a>So you see, not all of our motivations for this change in policy are opportunistic or prosaic. Underneath our gruff and flinty exterior is a soft, idealistic center. There are principles at work here as well.&lt;/p>
&lt;p>&lt;span >&lt;b>What next&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >So this isn’t just matter of changing our rules and display guidelines. We also have to make some schema changes, and adjust our services and APIs to clearly distinguish between preprints and accepted manuscripts/versions of record. Additionally, we will be building tools to make it much easier for our members to link preprints to the final published article (and vice versa). Finally, we need to update our documentation to help our members take advantage of the new functionality. We expect that everything will be in place by the end of August, 2016, at which point you will see another announcement from us.&lt;/span>&lt;/p></description></item><item><title>Crossref Brand update: new names, logos, guidelines, + video</title><link>https://www.crossref.org/blog/crossref-brand-update-new-names-logos-guidelines-and-video/</link><pubDate>Fri, 29 Apr 2016 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/crossref-brand-update-new-names-logos-guidelines-and-video/</guid><description>&lt;p>It can be a pain when companies rebrand as it usually requires some coordinated updating of wording and logos on websites, handouts, and slides. Nevermind changing habits and remembering to use the new names verbally in presentations.&lt;/p>
&lt;h2 id="why-bother">Why bother?&lt;/h2>
&lt;p>As our infrastructure and services expanded, we sometimes branded services with no reference to Crossref. As explained in our &lt;a href="https://doi.org/10.64000/ggwer-c7839" target="_blank">The Logo Has Landed post&lt;/a> last November, this has led to confusion, and it was not scalable nor sustainable. &lt;/p>
&lt;p>With a cohesive approach to naming and branding, the benefits of changing to (some) new names and logos should help everyone. Our aim is to stem confusion and be in a much better position to provide clear messages and useful resources so that people don’t have to try hard to understand what Crossref enables them to do. &lt;/p>
&lt;p>So while it may be a bit of a pain short-term, it will be worth it!&lt;/p>
&lt;h2 id="what-are-the-new-names">What are the new names?&lt;/h2>
&lt;p>As a handy reference, here is a slide-shaped image giving an overview of our services with their new names:&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/wp/blog/uploads/2016/04/Overview-of-brand-name-changes-April-2016.png"
alt="Overview of brand name changes, April 2016" width="100%">&lt;figcaption>
&lt;p>Overview of brand name changes, April 2016&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;h2 id="its-a-lowercase-8216r-in-crossref">It’s a lowercase ‘r’ in Crossref&lt;/h2>
&lt;p>That’s right, you’ve spent fifteen years learning to capitalize the second R in Crossref, and now we’re asking you to lowercase it! Please say hello to and start to embrace the more natural and contemporary &lt;strong>Crossref&lt;/strong>.&lt;/p>
&lt;h2 id="reference-logos-from-our-new-cdn-viaassetscrossreforghttpassetscrossreforg">Reference logos from our new CDN via &lt;a href="http://assets.crossref.org" target="_blank">assets.crossref.org&lt;/a>&lt;/h2>
&lt;p>I’m hoping we can count on our community to update logos and names on your end, keeping consistent with new brand guidelines. And I hope we can make it as easy as possible to do: &lt;/p>
&lt;ul>
&lt;li>
&lt;p>This Content Delivery Network (CDN) at &lt;a href="http://assets.crossref.org" target="_blank">assets.crossref.org&lt;/a> allows you to reference logos using a snippet of code. Please do not copy/download the logos.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>This &lt;a href="http://outreach.crossref.org/acton/ct/16781/s-0038-1604/Bct/l-001d/l-001d:282/ct2_0/1?sid=xd9u0mOai" target="_blank">set of brand guidelines for members&lt;/a>.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>We also have a new website in development which will put support and resources front and center of the user experience. More on that in the next month or two.&lt;/p>
&lt;p>By using the snippets of code provided via our new CDN at &lt;a href="http://assets.crossref.org" target="_blank">assets.crossref.org&lt;/a>, these kind of manual updates should never be a problem in the future if the logo changes again (no plans anytime soon!).&lt;/p>
&lt;p>Of course, we don’t expect people to update new logos and names immediately, there is always a period of transition. Please &lt;a href="mailto:feedback@crossref.org">let us know&lt;/a> let us know if we can help you to update your sites and materials in the coming weeks.&lt;/p>
&lt;p>Also, check out &lt;a href="https://www.youtube.com/watch_popup?v=_Bm2r59TG1I" target="_blank">the launch video&lt;/a>, which presents five key Crossref brand messages:&lt;/p></description></item><item><title>Getting Started with Crossref DOIs, courtesy of Scholastica</title><link>https://www.crossref.org/blog/getting-started-with-crossref-dois-courtesy-of-scholastica/</link><pubDate>Mon, 25 Apr 2016 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/getting-started-with-crossref-dois-courtesy-of-scholastica/</guid><description>&lt;p>I had a great chat with &lt;a href="https://twitter.com/djpadula5" target="_blank">Danielle Padula&lt;/a> of &lt;a href="https://scholasticahq.com/" target="_blank">Scholastica&lt;/a>, a journals &lt;em>platform with an integrated peer-review process that was founded in 2011.  We talked about how journals&lt;/em> get started with Crossref, and she turned our conversation into a blog post that describes the steps to begin registering content and depositing metadata with us.  Since the result is a really useful description of our new member on-boarding process, I want to share it with you here as well.  As always, comments and questions are welcome here, at &lt;a href="mailto:member@Crossref.org">member@Crossref.org&lt;/a>, and &lt;a href="http://twitter.com/crossreforg" target="_blank">@CrossrefOrg&lt;/a>.  - Anna_&lt;/p>
&lt;p>The internet is in a constant state of change, with new content being added to the web by the minute and old content sometimes getting moved around. While the benefit of publishing scholarly outputs online is that it’s possible to update them at any moment, moving or modifying content can also …&lt;/p>
&lt;p>Read more at: &lt;a href="https://blog.scholasticahq.com/post/getting-started-with-dois-at-your-journal-interview-with-anna-tolwinska-crossref/" target="_blank">https://blog.scholasticahq.com/post/getting-started-with-dois-at-your-journal-interview-with-anna-tolwinska-crossref/&lt;/a>&lt;/p></description></item><item><title>Crossref Event Data: early preview now available</title><link>https://www.crossref.org/blog/crossref-event-data-early-preview-now-available/</link><pubDate>Mon, 18 Apr 2016 00:00:00 +0000</pubDate><author>Madeleine Watson</author><guid>https://www.crossref.org/blog/crossref-event-data-early-preview-now-available/</guid><description>&lt;img src="http://assets.crossref.org/logo/crossref-event-data-logo-200.svg" alt="Crossref Event Data logo" width="200" height="83" />
&lt;p>&lt;span >Test out the early preview of Event Data while we continue to develop it. Share your thoughts. And be warned: we may break a few eggs from time to time!&lt;/span>&lt;figure id="attachment_1530" class="wp-caption alignright">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Screen-Shot-2016-04-18-at-14.43.59.png" rel="attachment wp-att-1530">&lt;img class="wp-image-1530 size-full" src="https://www.crossref.org/wp/blog/uploads/2016/04/Screen-Shot-2016-04-18-at-14.43.59.png" alt="Egg" width="197" height="243" />&lt;/a>&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >Chicken by anbileru adaleru from the The Noun Project&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Want to discover which research works are being shared, liked and commented on? What about the number of times a scholarly item is referenced? Starting today, you can whet your appetite with an early preview of the forthcoming Crossref Event Data service. We invite you to start exploring the activity of DOIs as they permeate and interact with the world after publication.&lt;/span>&lt;/p>
&lt;h2 id="span-but-first-a-bit-of-backgroundspan">&lt;span >But first, a bit of background&lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;span >Discussion around scholarly research increasingly occurs online after publication, for example on blogs, sharing services, social media, and wikis. These ‘events’ occur across the web on numerous platforms and are a critical part of the scholarly enterprise. We are developing an infrastructure service (&lt;/span>&lt;a href="http://eventdata.crossref.org">&lt;span >Crossref Event Data&lt;/span>&lt;/a>&lt;span >) that collects, stores, and delivers raw data of the events occurring with Crossref DOIs. We will store the data in an open, auditable and portable form for the community to access. Publishers, platforms, funders, bibliometricians and service providers may benefit from access to this raw data, and it can be used to feed into research records or proprietary tools and services that offer aggregation and analysis. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >For more information, see our &lt;/span>&lt;a href="https://www.crossref.org/blog/det-poised-for-launch/">&lt;span >pilot blog post&lt;/span>&lt;/a>&lt;span > and description of &lt;/span>&lt;a href="https://www.crossref.org/blog/event-data-open-for-your-interpretation/">&lt;span >potential use cases&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;h2 id="span-collaborative-transparent-development-spanfigure-idattachment_1524--classwp-caption-alignright">&lt;span >Collaborative, transparent development &lt;/span>&lt;figure id="attachment_1524" class="wp-caption alignright">&lt;/h2>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/JoeMartin.png" rel="attachment wp-att-1524">&lt;img class="size-medium wp-image-1524" src="https://www.crossref.org/wp/blog/uploads/2016/04/JoeMartin-300x236.png" alt="Photo of collaborators Martin Fenner and Joe Wass enjoying a meal together. " width="300" height="236" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/JoeMartin-300x236.png 300w, https://www.crossref.org/wp/blog/uploads/2016/04/JoeMartin.png 438w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >Developers Martin Fenner (DataCite) and Joe Wass (Crossref) enjoy a tofu break&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >&lt;span >Lagotto, the software originally developed at PLOS, has been extended and improved in a joint effort between DataCite and Crossref. The two DOI Registration Agencies have partnered to envision, build and release the service. On the 13th of April, after a year of&lt;/span> &lt;span >collaboration, we jointly released Lagotto 5.0. You can read about the collaboration on the &lt;/span>&lt;/span>&lt;span >&lt;a href="https://doi.org/10.5438/pe54-zj5t">&lt;span >DataCite blog post&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref and DataCite will continue to work closely together to develop Lagotto and the Event Data service. Although Crossref Event Data has mostly Crossref DOIs at launch, you will be able to find DataCite DOIs if they are cited in Crossref or Wikipedia.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >All of the software that runs Event Data, including Lagotto, is developed in the open and is open source. Please refer to the &lt;/span>&lt;a href="http://eventdata.crossref.org/guide/">&lt;span >Crossref Event Data Technical User Guide&lt;/span>&lt;/a>&lt;span > for full details.&lt;/span>&lt;/span>&lt;/p>
&lt;h2 id="span-preview-the-dataspan">&lt;span >Preview the data&lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;span >This service is currently under development with a full launch expected the second half of 2016. Before it is launched however, we invite you to take a look around and preview a subset of the data sources we plan to include. Y&lt;/span>&lt;span >ou may experience occasional hiccups while we continue building the service.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >At this stage, we are working with data from three sources although we will greatly expand the variety of platforms from which we collect data as development progresses. At this stage, you can view Mendeley bookmarks, Wikipedia references, and Crossref to DataCite links.&lt;/span>&lt;/p>
&lt;h3 id="span-mendeleyspan">&lt;span >Mendeley&lt;/span>&lt;/h3>
&lt;p>&lt;span >Mendeley is a reference manager and academic social network for scholars. View the number of social bookmarks from scholars or groups on Mendeley.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >For example,  &lt;/span>&lt;a href="http://doi.org/10.1016/J.JIP.2016.03.007">&lt;span >doi.org/10.1016/J.JIP.2016.03.007&lt;/span>&lt;/a>&lt;span > currently has &lt;/span>&lt;a href="https://www.mendeley.com/research/hygienic-food-reduce-pathogen-risk-bumblebees/">&lt;span >8 readers on Mendeley&lt;/span>&lt;/a>&lt;span > to date.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example.png" rel="attachment wp-att-1525">&lt;img class="alignnone wp-image-1525 size-large" src="https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example-1024x446.png" alt="Example of event data in Mendeley." width="840" height="366" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example-1024x446.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example-300x131.png 300w, https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example-768x334.png 768w, https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example-1200x522.png 1200w, https://www.crossref.org/wp/blog/uploads/2016/04/Medeley-example.png 1300w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 1362px) 62vw, 840px" />&lt;/a>&lt;/span>&lt;/p>
&lt;h3 id="span-wikipedia-span">&lt;span >Wikipedia &lt;/span>&lt;/h3>
&lt;p>&lt;span >Wikipedia is an online encyclopaedia, the Internet’s largest and most popular general reference work. View references in Wikipedia of Crossref publications in Wikipedia articles in all languages.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >For example, &lt;/span>&lt;a href="http://doi.org/10.3897/ZOOKEYS.565.7185">&lt;span >doi.org/10.3897/ZOOKEYS.565.7185&lt;/span>&lt;/a>&lt;span > was referenced in the &lt;/span>&lt;a href="https://ru.wikipedia.org/wiki/Oxyscelio">&lt;span >Russian Wikipedia page on Oxyscelio&lt;/span>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Wikipedia-example.png" rel="attachment wp-att-1526">&lt;img class="alignnone wp-image-1526 size-large" src="https://www.crossref.org/wp/blog/uploads/2016/04/Wikipedia-example-1024x472.png" alt="Example of event data for a DOI referenced in a Wikipedia page" width="840" height="387" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/Wikipedia-example-1024x472.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/04/Wikipedia-example-300x138.png 300w, https://www.crossref.org/wp/blog/uploads/2016/04/Wikipedia-example-768x354.png 768w, https://www.crossref.org/wp/blog/uploads/2016/04/Wikipedia-example-1200x553.png 1200w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 1362px) 62vw, 840px" />&lt;/a>&lt;/span>&lt;/p>
&lt;h3 id="span-crossref-to-datacite-linksspan">&lt;span >Crossref to DataCite links&lt;/span>&lt;/h3>
&lt;p>&lt;span >DataCite is a global consortium that assigns DOIs to research data. This enables people to find, share, use, and cite data. You can view all the data citations to DataCite research outputs found in Crossref publications (work is underway to make the links found in DataCite metadata available in Event Data). &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >For example, Global, Regional, and National Fossil-Fuel CO2 Emissions (&lt;a href="http://doi.org/10.3334/CDIAC/00001" target="_blank">doi.org/10.3334/CDIAC/00001&lt;/a>) dataset &lt;/span>&lt;span >has been referenced by &lt;/span>&lt;a href="http://api.eventdata.crossref.org/works/doi.org/10.3334/CDIAC/00001">&lt;span >six Crossref publications&lt;/span>&lt;/a>&lt;span > to date. Software links are also included. Another&lt;/span>&lt;span > example is&lt;/span>&lt;span > &lt;/span>&lt;span >PGOPHER (&lt;a href="http://doi.org/10.5523/bris.huflggvpcuc1zvliqed497r2">doi.org/10.5523/bris.huflggvpcuc1zvliqed497r2&lt;/a>)&lt;/span>&lt;span >, a general purpose software for simulating and fitting rotational, vibrational and electronic spectra, which has been referenced by &lt;/span>&lt;a href="http://api.eventdata.crossref.org/works/doi.org/10.5523/BRIS.HUFLGGVPCUC1ZVLIQED497R2">&lt;span >seven Crossref publications&lt;/span>&lt;/a>&lt;span > to date.&lt;/span>&lt;/span>&lt;/p>
&lt;h2 id="span-ready-to-take-a-spinspan">&lt;span >Ready to take a spin?&lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;span >You can explore the Crossref Event Data early preview by visiting &lt;/span>&lt;a href="http://eventdata.crossref.org">&lt;span >&lt;a href="http://eventdata.crossref.org" target="_blank">http://eventdata.crossref.org&lt;/a>&lt;/span>&lt;/a>&lt;span > and following the links to featured examples within our interim application for inspecting the data, technical documentation, and our &lt;/span>&lt;a href="http://eventdata.crossref.org/guide/#quick-start">&lt;span >Quick Start guide&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;h2 id="span-share-your-thoughtsspan">&lt;span >Share your thoughts&lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;span >This service is currently under development and as such we welcome your thoughts and feedback on the data we are collecting curren&lt;/span>&lt;span >tly from our three active sources. As a reminder, we expect to include the following sources as part of our full service launch later this year &lt;/span>&lt;span >(pending confirmation):&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >[table id=1 /]&lt;/span>&lt;/p>
&lt;p> &lt;/p>
&lt;p>&lt;span >&lt;span >We’re also on the lookout for new data sources to investigate for future inclusion in the Event Data service so please do &lt;/span>&lt;a href="mailto:eventdata@crossref.org">&lt;span >get in touch&lt;/span>&lt;/a>&lt;span > with requests and recommendations. As we continue to build the service throughout 2016, we will be committing to a model of continuous development so that we can make new sources available as they are completed.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >Watch this blog for regular updates on our progress, or subscribe to receive new blog posts by email (just add your details to the upper right side of this page).&lt;/span>&lt;/p>
&lt;p> &lt;/p>
&lt;p> &lt;/p>
&lt;p> &lt;/p>
&lt;p> &lt;/p></description></item><item><title>What are there 80 million of?</title><link>https://www.crossref.org/blog/what-are-there-80-million-of/</link><pubDate>Fri, 08 Apr 2016 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/what-are-there-80-million-of/</guid><description>&lt;p>&lt;span >&lt;span >As of this week, there are 80,000,000 scholarly items registered with Crossref!&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >By the way, we update &lt;a href="https://data.crossref.org/reports/statusReport.html">these interesting Crossref stats&lt;/a> regularly and you can &lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//">search the metadata&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >The 80 millionth scholarly item is [drumroll…] &lt;a href="http://doi.org/10.12816/0016504">Management Approaches in Beihagi History&lt;/a> from the journal &lt;em>Oman Chapter of Arabian Journal of Business and Management Review&lt;/em>&lt;span class="s1">, p&lt;/span>&lt;span class="s1">ublished by &lt;strong>Al Manhal&lt;/strong> in the United Arab Emirates.&lt;/span>&lt;/span>&lt;/p>
&lt;p>There have been loads of changes since Wiley registered &amp;ldquo;Designer selves: Construction of technologically mediated identity within graphical, multiuser virtual environments&amp;rdquo; with the DOI &lt;code>http://dx.doi.org/10.1002/(SICI)1097-4571(1999)50:10&amp;lt;855::AID-ASI3&amp;gt;3.0.CO;2-6)&lt;/code>, which happens to have been Crossref’s first official DOI (after many prototype deposits).&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Trending-Nations.png" rel="attachment wp-att-1507">&lt;img class="alignright wp-image-1507" src="https://www.crossref.org/wp/blog/uploads/2016/04/Trending-Nations-300x198.png" alt="Crossref Membership - Trending Nations" width="401" height="265" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/Trending-Nations-300x198.png 300w, https://www.crossref.org/wp/blog/uploads/2016/04/Trending-Nations-768x508.png 768w, https://www.crossref.org/wp/blog/uploads/2016/04/Trending-Nations.png 978w" sizes="(max-width: 401px) 85vw, 401px" />&lt;/a>In the beginning, most of our new members came from the United States and Europe.  Now, lots of our members and affiliates come from other parts of the world.&lt;br /> &lt;/span>&lt;/span>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Crossref-Membership-Trending-Nations.png" rel="attachment wp-att-1503">&lt;br /> &lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Ed Pentz was Crossref’s first (and only) employee in February 2000. Now it takes 30 of us to manage the 80 million records and over 5,300 participating organisations and to work on projects like &lt;a href="https://www.crossref.org/blog/event-data-open-for-your-interpretation/">&lt;span >Crossref Event Data&lt;/span>&lt;/a>&lt;span >,  &lt;/span>&lt;a href="https://www.crossref.org/blog/community-responses-to-our-proposal-for-early-content-registration/">&lt;span >&amp;lsquo;early content registration&amp;rsquo; &lt;/span>&lt;/a>&lt;span >, and all the new stuff you’ll be hearing about later this year&lt;/span>.&lt;/span>&lt;/p>
&lt;p>&lt;span >Maybe in the context of social media services (e.g. Facebook users) 80,000,000 does not seem like such a big number. But 80,000,000 is an important milestone. Just think — &lt;/span>&lt;/p>
&lt;p>&lt;span >There are also &lt;a href="http://doi.org/10.1186/2049-2618-2-41">80 million microbes in a 10 second kiss&lt;/a> [&lt;span class="JournalTitle">&lt;em>Microbiome&lt;/em>, &lt;/span>&lt;span class="ArticleCitation_Year">2014, &lt;/span>&lt;span class="ArticleCitation_Volume">2:41, &lt;/span>Kort et al].&lt;/span>&lt;/p>
&lt;p>&lt;span >And after &lt;a href="http://dx.doi.org/10.1515/9781400874248" target="_blank">80 million years of extinction events&lt;/a>, we’re all still here!  &lt;/span>&lt;/p>
&lt;h2 id="span-strongwhat-else-is-80-millionstrong-tell-us-in-a-tweet-using-a-hrefhttpstwittercomsearchftweetsq23crossref80milsrctypdcrossref80mila-there-may-be-a-prizespanfigure-idattachment_1482--classwp-caption-alignnone">&lt;span >&lt;strong>What else is 80 million?&lt;/strong> Tell us in a tweet using &lt;a href="https://twitter.com/search?f=tweets&amp;q=%23crossref80mil&amp;src=typd">#Crossref80mil&lt;/a>. There may be a prize!&lt;/span>&lt;figure id="attachment_1482" class="wp-caption alignnone">&lt;/h2>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/2.png" rel="attachment wp-att-1482">&lt;img class="wp-image-1482 size-medium" src="https://www.crossref.org/wp/blog/uploads/2016/04/2-300x150.png" alt="Crossref has 80 million registered content items" width="300" height="150" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/2-300x150.png 300w, https://www.crossref.org/wp/blog/uploads/2016/04/2-768x384.png 768w, https://www.crossref.org/wp/blog/uploads/2016/04/2.png 1024w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">Crossref has 80 million registered content items&lt;/figcaption>&lt;/figure>&lt;/p></description></item><item><title>Dr Norman Paskin</title><link>https://www.crossref.org/blog/dr-norman-paskin/</link><pubDate>Wed, 06 Apr 2016 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/dr-norman-paskin/</guid><description>&lt;figure id="attachment_1484" class="wp-caption alignright">&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/04/Norman.png" rel="attachment wp-att-1484">&lt;img class="size-full wp-image-1484" src="https://www.crossref.org/wp/blog/uploads/2016/04/Norman.png" alt="Dr Norman Paskin" width="197" height="197" srcset="https://www.crossref.org/wp/blog/uploads/2016/04/Norman.png 197w, https://www.crossref.org/wp/blog/uploads/2016/04/Norman-150x150.png 150w" sizes="(max-width: 197px) 85vw, 197px" />&lt;/a>&lt;figcaption class="wp-caption-text">Dr Norman Paskin&lt;/figcaption>&lt;/figure>
&lt;p>&lt;span >It was with great sadness and shock that I learned that Dr Norman Paskin had passed away unexpectedly on the 27th March. This is a big loss to the DOI, Crossref and digital information communities. Norman was the driving force behind the DOI System and was a key supporter and ally of Crossref from the start. Norman founded the International DOI Foundation in 1998 and ran it successfully until the end of 2015 when he moved to a strategic role as an Independent Board Member.&lt;/span>&lt;/p>
&lt;p>&lt;span >Norman was an early proponent of the value of persistent digital identifiers paired with standardised metadata and laid the groundwork for the system and infrastructure that has made Crossref and eight other Registration Agencies so successful. Norman was also a key adviser and participant in many standards organisations and initiatives where he regularly provided key intellectual input to help improve digital communications.&lt;/span>&lt;/p>
&lt;p>&lt;span >Personally, it was a great pleasure to work with Norman over the last twenty years and I greatly appreciated his intelligence, humour, advice, and particularly his help and generous support when I relocated to Oxford.&lt;/span>&lt;/p>
&lt;p>&lt;span >The International DOI Foundation has &lt;a href="http://www.doi.org/index.html">posted a notice&lt;/a>, and has created &lt;a href="mailto:condolences@doi.org">&lt;a href="mailto:condolences@doi.org">condolences@doi.org&lt;/a>&lt;/a> for people to send messages. &lt;/span>&lt;/p></description></item><item><title>The Wikipedia Library: A Partnership of Wikipedia and Publishers to Enhance Research and Discovery</title><link>https://www.crossref.org/blog/the-wikipedia-library-a-partnership-of-wikipedia-and-publishers-to-enhance-research-and-discovery/</link><pubDate>Mon, 04 Apr 2016 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/the-wikipedia-library-a-partnership-of-wikipedia-and-publishers-to-enhance-research-and-discovery/</guid><description>&lt;p>&lt;span >&lt;span >Back in 2014, Geoffrey Bilder blogged about the kick-off of &lt;/span>&lt;a href="https://www.crossref.org/blog/citation-needed/">&lt;span >an initiative between Crossref and Wikimedia&lt;/span>&lt;/a>&lt;span > to better integrate scholarly literature into the world’s largest knowledge space, Wikipedia. Since then, Crossref has been working to coordinate activities with Wikimedia: Joe Wass has worked with them to create &lt;/span>&lt;a href="https://live.eventdata.crossref.org/live.html">&lt;span >a live stream of content being cited in Wikipedia&lt;/span>&lt;/a>&lt;span >; and we’re including Wikipedia in &lt;/span>&lt;a href="https://www.crossref.org/blog/event-data-open-for-your-interpretation/">&lt;span >Event Data&lt;/span>&lt;/a>&lt;span >, a new service to launch later this year. In that time, we’ve also seen Wikipedia importance grow in terms of the volume of DOI referrals.&lt;/span>&lt;/span>&lt;figure id="attachment_1412" class="wp-caption alignright">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/03/Stinson_Alex_June_2015_2.jpg" rel="attachment wp-att-1412">&lt;img class="wp-image-1412 size-medium" src="https://www.crossref.org/wp/blog/uploads/2016/03/Stinson_Alex_June_2015_2-300x200.jpg" alt="Alex Stinson, Project Manager for the Wikipedia Library, and our guest blogger! This file is licensed under the Creative Commons Attribution-Share Alike 3.0 Unported license (Source: Myleen Hollero Photography) " width="300" height="200" srcset="https://www.crossref.org/wp/blog/uploads/2016/03/Stinson_Alex_June_2015_2-300x200.jpg 300w, https://www.crossref.org/wp/blog/uploads/2016/03/Stinson_Alex_June_2015_2-768x512.jpg 768w, https://www.crossref.org/wp/blog/uploads/2016/03/Stinson_Alex_June_2015_2.jpg 800w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">Alex Stinson, Project Manager for the Wikipedia Library, and our guest blogger! This file is licensed under the &lt;a href="https://creativecommons.org/licenses/by-sa/3.0/deed.en" target="_blank">Creative Commons Attribution-Share Alike 3.0 Unported license&lt;/a> (Source: Myleen Hollero Photography)&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;em>&lt;span >&lt;span >Alex Stinson, Project Manager for the Wikipedia Library, and guest blogger! This file is licensed under the &lt;a href="https://creativecommons.org/licenses/by-sa/3.0/deed.en">Creative Commons Attribution-Share Alike 3.0 Unported license&lt;/a> (Source: Myleen Hollero Photography)&lt;/span>&lt;/span>&lt;/em>&lt;/p>
&lt;p>&lt;span >&lt;span >How can we keep this momentum going and continue to improve the way we link Wikipedia articles with the formal literature? We invited Alex Stinson, a project manager at &lt;/span>&lt;a href="https://en.wikipedia.org/wiki/Wikipedia:The_Wikipedia_Library">&lt;span >The Wikipedia Library &lt;/span>&lt;/a>&lt;span >(and one of our first guest bloggers) to explain more:&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Wikipedia provides the most public gateway to academic and scholarly research. With millions of citations to academic as well as non-academic but reliable sources, like those produced by newspapers, its ecosystem of 5 million English Wikipedia articles and 35 million articles in &lt;/span>&lt;a href="https://www.wikipedia.org/">&lt;span >hundreds of languages&lt;/span>&lt;/a>&lt;span > provides the first stop for researchers in both scholarly and informal research situations. The practice of “checking Wikipedia” has become ubiquitous in a number of fields; for example, Wikipedia is the most visited &lt;/span>&lt;a href="http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376174/">&lt;span >source of medical information online&lt;/span>&lt;/a>&lt;span >, even providing the first stop for many &lt;/span>&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/23137251">&lt;span >medical students and medical practitioners when looking for medical literature&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >&lt;span >The Wikipedia Library prog&lt;/span>ram helps Wikipedia’s volunteer editors access and use the best sources in their research and citations.  Through &lt;/span>&lt;a href="https://en.wikipedia.org/wiki/Wikipedia:TWL/Publishers">&lt;span >partnerships&lt;/span>&lt;/a>&lt;span > with over fifty leading publishers and aggregators, like JSTOR, Project Muse, Elsevier, Newspapers.com, Highbeam, Oxford University Press and others, we have been able to give over 3000 of our most prolific volunteers access to over 5500 accounts. These are clear, win-win relationships where Wikipedia editors get to use these databases to improve Wikipedia, while in turn linking to authoritative resources and enhancing their discovery. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >JSTOR has been working with us since 2012, providing over 500 accounts to our editors. Kristen Garlock at JSTOR writes: &lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;span >“We’re very happy to collaborate with the Wikipedia Library to provide JSTOR access to Wikipedia editors. Supporting the initiative to increase editor access to scholarly resources and improve the quality of information and sources on Wikipedia has the potential to help all Wikipedia readers. In addition to providing more discoverability for our institutional subscribers, introducing new audiences to the scholarship on JSTOR them discover access opportunities like our &lt;/span>&lt;a href="http://about.jstor.org/rr">&lt;span >Register &amp;amp; Read program&lt;/span>&lt;/a>&lt;span >.”&lt;/span>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >There are strong signals that Wikipedia’s role in the citation ecosystem helps ensure the best materials reach the public through its over 400 million monthly readers: &lt;/span>&lt;/p>
&lt;li >
&lt;span >&lt;span >The latest estimates by Crossref show that Wikipedia has &lt;/span>&lt;a href="https://youtu.be/8qO3BYDN67k?t=11m15s">&lt;span >risen from the 8th most prolific referrer to DOIs to the 5th&lt;/span>&lt;/a>&lt;span >. &lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Two of our access partners have found that around half of the referrals arriving from Wikipedia were able to authenticate into their subscription resources, suggesting that a large portion of our readers can take advantage of subscriptions provided by scholarly institutions. &lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >Wikipedia is highly influential in the open access ecosystem as well, with a recent study showing &lt;/span>&lt;a href="http://arxiv.org/abs/1506.07608">&lt;span >higher citation rates for OA materials &lt;/span>&lt;/a>&lt;span >than those behind a paywall.&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;a href="https://en.wikipedia.org/wiki/Altmetrics">&lt;span >Altmetrics&lt;/span>&lt;/a>&lt;span > tools (such as Altmetric.com, ImpactStory or Plum Analytics) are recognizing Wikipedia’s importance by including Wikipedia citations &lt;/span>&lt;a href="http://www.altmetric.com/blog/new-source-alert-wikipedia/">&lt;span >in their impact metrics&lt;/span>&lt;/a>&lt;span >. &lt;/span>&lt;/span>
&lt;/li>
&lt;p>&lt;span >&lt;span >Despite these advances, we think this is only the beginning of Wikipedia’s impact on the landscape of scholarly research and discovery. Wikipedia can become a highly integrated research platform within the broader research ecosystem, where the best scholarship is summarized and discoverable-where Wikipedia effectively becomes the &lt;/span>&lt;a href="https://meta.wikimedia.org/wiki/Wikipedia_as_the_front_matter_to_all_research">&lt;span >front matter to all research&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >However, there are some clear barriers to fulfilling this vision. Currently, most citations on Wikipedia are stored in free-text and not readily available in machine-readable formats; our community is working to fix this. Wikipedia also has major systematic gaps in topics where either we lack volunteer interest or Wikipedia reflects &lt;/span>&lt;a href="https://en.wikipedia.org/wiki/Wikipedia:Systemic_bias">&lt;span >larger systemic biases within society or scholarship&lt;/span>&lt;/a>&lt;span >.We need the help of volunteers, experts, industry partners, and information technologists to grow Wikipedia’s collection of citations, especially around key missing areas, and to transform existing citations into structured formats. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.wikidata.org/">&lt;i>&lt;span >WikiData&lt;/span>&lt;/i>&lt;/a>&lt;span >, Wikipedia’s sister project which crowdsources structured metadata, offers an excellent opportunity for improving the impact of Wikipedia in research.  Having Wikipedia citations &lt;/span>&lt;a href="https://www.wikidata.org/wiki/Wikidata:WikiProject_Source_MetaData">&lt;span >stored in this structured ecosystem&lt;/span>&lt;/a>&lt;span >, connecting metadata with semantic meaning, would allow the citations in Wikipedia to become the backbone for discovery tools which emphasize the hand-curated interrelationships between authoritative sources and the knowledge collected by Wikipedia and Wikidata editors.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >We need more collaborators to realize the full vision of Wikipedia supporting research in the most effective ways:&lt;/span>&lt;/p>
&lt;li >
&lt;span >&lt;span >We need help from publishers with subscription databases, to help us give our editors access to the databases through &lt;/span>&lt;a href="https://en.wikipedia.org/wiki/Wikipedia:The_Wikipedia_Library/Publishers">&lt;span >The Wikipedia Library’s access partnership program&lt;/span>&lt;/a>&lt;span >. These high-quality source materials allow our editors to expose that research in a number of languages and for millions of readers. &lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >We need help from the open access community, to figure out how to better support increased citation and strategic use of open access materials within Wikipedia and other Wikimedia projects. &lt;/span>&lt;a href="http://blog.wikimedia.org/2015/09/16/open-access-in-a-closed-world/">&lt;span >Our community has some ideas, but we need your input and collaboration&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >We need your expertise to build our structured metadata ecosystem, by helping Wikidata &lt;/span>&lt;a href="https://www.wikidata.org/wiki/Wikidata:WikiProject_Source_MetaData">&lt;span >map and collect citation data&lt;/span>&lt;/a>&lt;span >.&lt;/span>&lt;/span>
&lt;/li>
&lt;li >
&lt;span >&lt;span >We need the larger research community to promote Wikipedia as a scholarly communications tool and make contributing to Wikipedia an important part of &lt;/span>&lt;a href="https://en.wikipedia.org/wiki/Wikipedia:Research_help/Scholars_and_experts">&lt;span >the social responsibility of experts&lt;/span>&lt;/a>&lt;span >. Wider citation of sources in Wikipedia ensures widespread discovery and dissemination of that research.&lt;/span>&lt;/span>
&lt;/li>
&lt;p>&lt;span >&lt;span >If you think you can help, we invite you to contact us at &lt;/span>&lt;a href="mailto:wikipedialibrary@wikimedia.org">&lt;span >&lt;a href="mailto:wikipedialibrary@wikimedia.org">wikipedialibrary@wikimedia.org&lt;/a>&lt;/span>&lt;/a>&lt;span > or via &lt;/span>&lt;a href="https://twitter.com/wikilibrary">&lt;span >Twitter @WikiLibrary&lt;/span>&lt;/a>&lt;span >. &lt;/span>&lt;/span>&lt;/p>
&lt;p> &lt;/p></description></item><item><title>Python and Ruby Libraries for accessing the Crossref API</title><link>https://www.crossref.org/blog/python-and-ruby-libraries-for-accessing-the-crossref-api/</link><pubDate>Fri, 04 Mar 2016 00:00:00 +0000</pubDate><author>Scott Chamberlain</author><guid>https://www.crossref.org/blog/python-and-ruby-libraries-for-accessing-the-crossref-api/</guid><description>&lt;p>&lt;span >I’m a co-founder with &lt;a href="https://ropensci.org/">rOpenSci&lt;/a>, a non-profit that focuses on making software to facilitate reproducible and open science. &lt;a href="https://github.com/ropensci/rcrossref/commit/a264da3177d2bdbdfce289a4fdccc43c8df36da1">Back in 2013&lt;/a> we started to make an R client working with various Crossref web services. I was lucky enough to attend &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/">last year’s Crossref annual meeting in Boston&lt;/a>, and gave one talk on &lt;a href="https://youtu.be/_2iRjK5QjKU?si=qzAvJ70n_kaMJpmU">details of the programmatic clients&lt;/a>, and another higher level talk on &lt;a href="https://youtu.be/j8qlHw7UqlI?si=qWY4NXls4w4jwZ3I">text mining and use of metadata for research&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref has a newish API encompassing works, journals, members, funders and more (check out &lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md">the API docs&lt;/a>), as well as a few other services. Essential to making the Crossref APIs easily accessible—and facilitating easy tool/app creation and exploration—are programmatic clients for popular languages. I’ve maintained an R client for a while now, and have been working on Python and Ruby clients for the past four months or so.&lt;/span>&lt;/p>
&lt;p>&lt;span >The R client falls squarely into the analytics/research use cases, while the Python and Ruby clients are ideal for general data access and use in web applications (the Javascript library below as well).&lt;/span>&lt;/p>
&lt;p>&lt;span >I’ve strived to make each client in idiomatic fashion according to the language. Due to this fact, there is not generally correspondence between the different clients with respect to data outputs. However, I’ve tried to make method names similar across Ruby and Python; although the R client is quite a bit older, so method names differ from the other clients and I’m resistant to changing them so as not to break current users’ projects. In addition, R users are likely to want a data.frame (i.e., table) of results, so we give back that - whereas with Python and Ruby we give back dictionaries and hashes, respectively.&lt;/span>&lt;/p>
&lt;h3 id="span-strongcrossref-clientsstrongspan">&lt;span >&lt;strong>Crossref clients&lt;/strong>&lt;/span>&lt;/h3>
&lt;ul>
&lt;li>&lt;span >Python:&lt;/span>
&lt;ul>
&lt;li>&lt;span >Source: &lt;a href="https://github.com/sckott/habanero">&lt;a href="https://github.com/sckott/habanero" target="_blank">https://github.com/sckott/habanero&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >Pypi: &lt;a href="https://pypi.python.org/pypi/habanero">&lt;a href="https://pypi.python.org/pypi/habanero" target="_blank">https://pypi.python.org/pypi/habanero&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;span >Ruby:&lt;/span>
&lt;ul>
&lt;li>&lt;span >Source: &lt;a href="https://github.com/sckott/serrano">&lt;a href="https://github.com/sckott/serrano" target="_blank">https://github.com/sckott/serrano&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >Rubygems: &lt;a href="https://rubygems.org/gems/serrano">&lt;a href="https://rubygems.org/gems/serrano" target="_blank">https://rubygems.org/gems/serrano&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;code>serrano&lt;/code> also comes with a command line tool of the same name that’s installed when you install &lt;code>serrano&lt;/code> (examples below)&lt;/span>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;span >R:&lt;/span>
&lt;ul>
&lt;li>&lt;span >Source: &lt;a href="https://github.com/ropensci/rcrossref">&lt;a href="https://github.com/ropensci/rcrossref" target="_blank">https://github.com/ropensci/rcrossref&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >CRAN: &lt;a href="https://cran.rstudio.com/web/packages/rcrossref/">&lt;a href="https://cran.rstudio.com/web/packages/rcrossref/" target="_blank">https://cran.rstudio.com/web/packages/rcrossref/&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>&lt;span >Javascript:&lt;/span>
&lt;ul>
&lt;li>&lt;span >Source: &lt;a href="https://github.com/scienceai/crossref">&lt;a href="https://github.com/scienceai/crossref" target="_blank">https://github.com/scienceai/crossref&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >NPM: &lt;a href="https://www.npmjs.com/package/crossref">&lt;a href="https://www.npmjs.com/package/crossref" target="_blank">https://www.npmjs.com/package/crossref&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>&lt;span >I’ll cover the Python, Ruby, and R libraries below.&lt;/span>&lt;/p>
&lt;h3 id="span-stronginstallationstrongspan">&lt;span >&lt;strong>Installation&lt;/strong>&lt;/span>&lt;/h3>
&lt;p>&lt;span >&lt;strong>&lt;em>Python&lt;/em>&lt;/strong>&lt;/span>&lt;/p>
&lt;p>&lt;span >on the command line&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:sh decode:true">&lt;span >pip install habanero&lt;/span>&lt;/pre>
&lt;p>&lt;span >&lt;em>&lt;strong>Ruby&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >on the command line&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:sh decode:true">&lt;span >gem install serrano&lt;/span>&lt;/pre>
&lt;p>&lt;span >&lt;em>&lt;strong>R&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >in an R session&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:r decode:true">&lt;span >install.packages("rcrossref")&lt;/span>&lt;/pre>
&lt;p> &lt;/p>
&lt;h3 id="span-strongexamplesstrongspan">&lt;span >&lt;strong>Examples&lt;/strong>&lt;/span>&lt;/h3>
&lt;p>&lt;span >Output is indicated by the syntax &lt;code>#&amp;gt;&lt;/code> in all examples below.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;em>&lt;strong>Python&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >in a Python REPL (e.g. &lt;em>iPython&lt;/em>)&lt;/span>&lt;/p>
&lt;p>&lt;span >Import the &lt;em>Crossref&lt;/em> module from within &lt;em>habanero&lt;/em>, and initialize a client&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:python decode:true">&lt;span >from habanero import Crossref
cr = Crossref()&lt;/span>&lt;/pre>
&lt;p>&lt;span >Query for the phrase “ecology”&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:python decode:true">&lt;span >x = cr.works(query = "ecology", limit = 5)&lt;/span>&lt;/pre>
&lt;p>&lt;span >Index to various parts of the output&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:python decode:true">&lt;span >x['message']['total-results']
#&amp;gt; 276188&lt;/span>&lt;/pre>
&lt;p>&lt;span >Extract similar data items from each result. The records are in the “items” slot&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:python decode:true">&lt;span >[ z['DOI'] for z in x['message']['items'] ]
#&amp;gt; [u'10.1002/(issn)1939-9170',
#&amp;gt; u'10.4996/fireecology',
#&amp;gt; u'10.5402/ecology',
#&amp;gt; u'10.1155/8641',
#&amp;gt; u'10.1111/(issn)1439-0485']&lt;/span>&lt;/pre>
&lt;p>&lt;span >In &lt;em>habanero&lt;/em> for some methods we require you to instantiate a client.&lt;/span>&lt;/p>
&lt;p>&lt;span >You can set a base URL and API key. This is a future looking feature&lt;/span>&lt;/p>
&lt;p>&lt;span >as Crossref API does not require an API key.&lt;/span>&lt;/p>
&lt;p>&lt;span >Note: I’ve tried to make sure habanero is Python 2 and 3 compatible. Hopefully you’ll find that’s true.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;em>&lt;strong>Ruby&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >in a Ruby repl (e.g., &lt;a href="https://web.archive.org/web/20160312125404/http://pryrepl.org//">pry&lt;/a>), load &lt;em>serrano&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:ruby decode:true ">&lt;span >require 'serrano'&lt;/span>&lt;/pre>
&lt;p>&lt;span >Query for “peerj” on the journals route&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:ruby decode:true">&lt;span >x = Serrano.journals(query: "peerj")&lt;/span>&lt;/pre>
&lt;p>&lt;span >Collect just ISSN’s from each result&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:ruby decode:true">&lt;span >x['message']['items'].collect { |z| z['ISSN'] }
#&amp;gt; =&amp;gt; [["2376-5992"], ["2167-8359"]]&lt;/span>&lt;/pre>
&lt;p> &lt;/p>
&lt;p>&lt;span >&lt;em>&lt;strong>Shell&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >The &lt;code>serrano&lt;/code> command line tool is quite powerful if you are used to doing things there.&lt;/span>&lt;/p>
&lt;p>&lt;span >Here, search for one article; summary data is shown.&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:sh decode:true">&lt;span >serrano works 10.1371/journal.pone.0033693
#&amp;gt; DOI: 10.1371/journal.pone.0033693
#&amp;gt; type: journal-article
#&amp;gt; title: Methylphenidate Exposure Induces Dopamine Neuron Loss and Activation of Microglia in the Basal Ganglia of Mice&lt;/span>&lt;/pre>
&lt;p>&lt;span >There’s also a &lt;code>-json&lt;/code> flag to give back JSON data, which can be parsed with the command line tool &lt;a href="https://stedolan.github.io/jq/">jq&lt;/a>.&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:sh decode:true">&lt;span >serrano works --filter=has_full_text:true --json --limit=5 | jq '.message.items[].link[].URL'
#&amp;gt; "http://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1002%2F9781119208082.ch9"
#&amp;gt; "http://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1002%2F9781119208082.index"
#&amp;gt; "http://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1002%2F9781119208082.ch11"
#&amp;gt; "http://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1002%2F9781119208082.ch15"
#&amp;gt; "http://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1002%2F9781119208082.ch4"&lt;/span>&lt;/pre>
&lt;p> &lt;/p>
&lt;p>&lt;span >&lt;em>&lt;strong>R&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;p>&lt;span >In an R session, load &lt;code>rcrossref&lt;/code>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:r decode:true ">&lt;span >library("rcrossref")&lt;/span>&lt;/pre>
&lt;p>&lt;span >Search the &lt;code>works&lt;/code> route for the phrase “science”&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:r decode:true">&lt;span >res &amp;lt;- cr_works(query = "science", limit = 5)
#&amp;gt; $meta
#&amp;gt; total_results search_terms start_index items_per_page
#&amp;gt; 1 4333827 science 0 5
#&amp;gt;
#&amp;gt; $data
#&amp;gt; Source: local data frame [5 x 23]
#&amp;gt;
#&amp;gt; alternative.id container.title created deposited DOI funder indexed
#&amp;gt; (chr) (chr) (chr) (chr) (chr) (chr) (chr)
#&amp;gt; 1 2013-11-21 2013-11-21 10.1126/science &amp;lt;NULL&amp;gt; 2015-12-27
#&amp;gt; 2 Science Askew 2004-11-26 2013-12-16 10.1887/0750307145/b426c18 &amp;lt;NULL&amp;gt; 2015-12-24
#&amp;gt; 3 2006-04-10 2010-07-30 10.1002/(issn)1557-6833 &amp;lt;NULL&amp;gt; 2015-12-25
#&amp;gt; 4 2013-08-27 2013-08-27 10.1002/(issn)1469-896x &amp;lt;NULL&amp;gt; 2015-12-27
#&amp;gt; 5 2013-12-19 2013-12-19 10.5152/bs. &amp;lt;NULL&amp;gt; 2015-12-28
#&amp;gt; Variables not shown: ISBN (chr), ISSN (chr), issued (chr), link (chr), member (chr), prefix (chr), publisher
#&amp;gt; (chr), reference.count (chr), score (chr), source (chr), subject (chr), title (chr), type (chr), URL
#&amp;gt; (chr), assertion (chr), author (chr)
#&amp;gt;
#&amp;gt; $facets
#&amp;gt; NULL&lt;/span>&lt;/pre>
&lt;p>&lt;span >Index through to get the DOIs&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:r decode:true">&lt;span >res$data$DOI
#&amp;gt; [1] "10.1126/science" "10.1887/0750307145/b426c18" "10.1002/(issn)1557-6833"
#&amp;gt; [4] "10.1002/(issn)1469-896x" "10.5152/bs."&lt;/span>&lt;/pre>
&lt;blockquote>
&lt;p>&lt;span >rcrossref also has faster versions of most functions with an underscore at the end (&lt;code>_&lt;/code>) which only do the http request and give back json (e.g., &lt;code>cr_works_()&lt;/code>)&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;h3 id="span-strongcomparisonof-crossref-client-methodsstrongspan">&lt;span >&lt;strong>Comparison of Crossref Client Methods&lt;/strong>&lt;/span>&lt;/h3>
&lt;p>&lt;span >After installation and loading the libraries above, the below methods are available&lt;/span>&lt;/p>
&lt;table style="width: 100%;">
&lt;tr>
&lt;th>
&lt;span >API Route&lt;/span>
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;Python&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;Ruby&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;R&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>works&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.works()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.works()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_works()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>members&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.members()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.members()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_members()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>funders&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.funders()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.funders()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_funders()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>types&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.types()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.types()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_types()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>licenses&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.licenses()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.licenses()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_licenses()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>journals&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.journals()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.journals()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_journals()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>members&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.members()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.members()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_members()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>registration agency&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.registration_agency()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.registration_agency()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_agency()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>random DOIs&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr.random_dois()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.random_dois()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_r()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;h3 id="span-strongother-crossref-servicesstrongspan">&lt;span >&lt;strong>Other Crossref Services&lt;/strong>&lt;/span>&lt;/h3>
&lt;table style="width: 100%;">
&lt;tr>
&lt;th>
&lt;span >Service&lt;/span>
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;Python&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;Ruby&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;R&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>content negotiation&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cn.content_negotiation()&amp;lt;/code&amp;gt;&amp;lt;a href=&amp;quot;#footnote-1&amp;quot;&amp;gt;[1]&amp;lt;/a&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.content_negotiation()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_cn()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>CSL styles&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cn.csl_styles()&amp;lt;/code&amp;gt;&amp;lt;a href=&amp;quot;#footnote-1&amp;quot;&amp;gt;[1]&amp;lt;/a&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.csl_styles()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;get_styles()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>citation count&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;counts.citation_count()&amp;lt;/code&amp;gt;&amp;lt;a href=&amp;quot;#footnote-2&amp;quot;&amp;gt;[2]&amp;lt;/a&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano.citation_count()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;cr_citation_count()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p id="footnote-1">
&lt;span >[1] &lt;code>from habanero import cn&lt;/code>&lt;/span>
&lt;/p>
&lt;p id="footnote-2">
&lt;span >[2] &lt;code>from habanero import counts&lt;/code>&lt;/span>
&lt;/p>
&lt;p> &lt;/p>
&lt;h3 id="span-featuresspan">&lt;span >Features&lt;/span>&lt;/h3>
&lt;p>&lt;span >These are supported in all 3 libraries:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Filters (see below)&lt;/span>&lt;/li>
&lt;li>&lt;span >Deep paging (see below)&lt;/span>&lt;/li>
&lt;li>&lt;span >Pagination&lt;/span>&lt;/li>
&lt;li>&lt;span >Verbose curl output&lt;/span>&lt;/li>
&lt;/ul>
&lt;h3 id="span-filtersspan">&lt;span >Filters&lt;/span>&lt;/h3>
&lt;p>&lt;span >Filters (see &lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md#filter-names">API docs&lt;/a> for details) are a powerful way to get closer to exactly what you want in your queries. In the Crossref API filters are passed as query parameters, and are comma-separated like &lt;span class="lang:default decode:true crayon-inline ">filter=has-orcid:true,is-update:true&lt;/span> . In the client libraries, filters are passed in idiomatic fashion according to the language.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;em>&lt;strong>Python&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:python decode:true">&lt;span >from habanero import Crossref
cr = Crossref()
cr.works(filter = {'award_number': 'CBET-0756451', 'award_funder': '10.13039/100000001'})&lt;/span>&lt;/pre>
&lt;p>&lt;span >&lt;em>&lt;strong>Ruby&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:ruby decode:true">&lt;span >require 'serrano'
Serrano.works(filter: {award_number: 'CBET-0756451', award_funder: '10.13039/100000001'})&lt;/span>&lt;/pre>
&lt;p>&lt;span >&lt;em>&lt;strong>R&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:r decode:true">&lt;span >library("rcrossref")
cr_works(filter=c(award_number=TRUE, award_funder='10.13039/100000001'))
&lt;/span>&lt;/pre>
&lt;p>&lt;span >Note how syntax is quite similar among languages, though keys don’t have to be quoted in Ruby and R, and in R you pass in a vector or list instead of a hash as in the other two.&lt;/span>&lt;/p>
&lt;p>&lt;span >All 3 clients have helper functions to show you what filters are available and what the options are for each filter.&lt;/span>&lt;/p>
&lt;table style="width: 100%;">
&lt;tr>
&lt;th>
&lt;span >Action&lt;/span>
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;Python&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;Ruby&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
&amp;lt;span &amp;gt;R&amp;lt;/span&amp;gt;
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>Filter names&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;filters.filter_names&amp;lt;/code&amp;gt;&amp;lt;a href=&amp;quot;#footnote-3&amp;quot;&amp;gt;[3]&amp;lt;/a&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano::Filters.names&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;filter_names()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;span >&lt;strong>Filter details&lt;/strong>&lt;/span>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;filters.filter_details&amp;lt;/code&amp;gt;&amp;lt;a href=&amp;quot;#footnote-3&amp;quot;&amp;gt;[3]&amp;lt;/a&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;Serrano::Filters.filters&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;code&amp;gt;filter_details()&amp;lt;/code&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p id="footnote-3">
&lt;span >[3] &lt;code>from habanero import filters&lt;/code>&lt;/span>
&lt;/p>
&lt;p> &lt;/p>
&lt;h3 id="span-deep-pagingspan">&lt;span >Deep paging&lt;/span>&lt;/h3>
&lt;p>&lt;span >Sometimes you want a lot of data. The Crossref API has parameters for paging (see &lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md#rows">rows&lt;/a> and &lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md#offset">offset&lt;/a>), but large values of either can lead to long response times and potentially timeouts (i.e., request failure). The API has a deep paging feature that can be used when large data volumes are desired. This is made possible via Solr’s cursor feature (e.g., &lt;a href="http://solr.pl/en/2014/03/10/solr-4-7-efficient-deep-paging/">blog post on it&lt;/a>). Here’s a run down of how to use it:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >&lt;code>cursor&lt;/code>: each method in each client library that allows deep paging has a &lt;code>cursor&lt;/code> parameter that if you set to &lt;code>*&lt;/code> will tell the Crossref API you want deep paging.&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;code>cursor_max&lt;/code>: for boring reasons we need to have feedback from the user when they want to stop, since each request comes back with a cursor value that we can make the next request with, thus, an additional parameter &lt;code>cursor_max&lt;/code> is used to indicate the number of results you want back.&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;code>limit&lt;/code>: this parameter when not using deep paging determines number of results to get back. however, when deep paging, this parameter sets the chunk size. (note that the max. value for this parameter is 1000)&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >For example, &lt;code>cursor=&amp;amp;#8221;*&amp;amp;#8221;&lt;/code> states that you want deep paging, &lt;code>cursor_max&lt;/code> states maximum results you want back, and &lt;code>limit&lt;/code> determines how many results per request to fetch.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;em>&lt;strong>Python&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:python decode:true ">&lt;span >from habanero import Crossref
cr = Crossref()
cr.works(query = "widget", cursor = "*", cursor_max = 500)&lt;/span>&lt;/pre>
&lt;p>&lt;span >&lt;em>&lt;strong>Ruby&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:ruby decode:true">&lt;span >require 'serrano'
Serrano.works(query: "widget", cursor: "*", cursor_max: 500)&lt;/span>&lt;/pre>
&lt;p>&lt;span >&lt;em>&lt;strong>R&lt;/strong>&lt;/em>&lt;/span>&lt;/p>
&lt;pre class="theme:solarized-light lang:r decode:true">&lt;span >library("rcrossref")
cr_works(query = "widget", cursor = "*", cursor_max = 500)
&lt;/span>&lt;/pre>
&lt;p> &lt;/p>
&lt;h3 id="span-text-mining-clientsspan">&lt;span >Text mining clients&lt;/span>&lt;/h3>
&lt;p>&lt;span >Just a quick note that I’ve begun a few text-mining clients for Python and Ruby, focused on using the low level clients discussed above.&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Python: &lt;a href="https://github.com/sckott/pyminer">&lt;a href="https://github.com/sckott/pyminer" target="_blank">https://github.com/sckott/pyminer&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >Ruby: &lt;a href="https://github.com/sckott/textminer">&lt;a href="https://github.com/sckott/textminer" target="_blank">https://github.com/sckott/textminer&lt;/a>&lt;/a>&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >Do try them out!&lt;/span>&lt;/p></description></item><item><title>Community responses to our proposal for early content registration</title><link>https://www.crossref.org/blog/community-responses-to-our-proposal-for-early-content-registration/</link><pubDate>Tue, 01 Mar 2016 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/community-responses-to-our-proposal-for-early-content-registration/</guid><description>&lt;h2 id="span-tldrspan">&lt;span >TL;DR:&lt;/span>&lt;/h2>
&lt;p>&lt;span >We will proceed with implementing the &lt;a href="https://www.crossref.org/blog/rfc-registering-content-before-online/" target="_blank">proposed support for registering content before online availability&lt;/a>. Adopting the workflow will be optional and will involve no extra fees.&lt;/span>&lt;/p>
&lt;h2 id="span-backgroundspan">&lt;span >Background&lt;/span>&lt;/h2>
&lt;p>&lt;span >At the end of January, Crossref issued a “request for community comment” on a proposed new process to support the registration of content including DOIs before online availability. We promised that we would summarize the results of the survey once we had received and analyzed all the responses.&lt;/span>&lt;/p>
&lt;p>&lt;span >Support for Crossref implementing the proposed new workflow was overwhelming. Of the 104 responses, 90 were positive, 7 were neutral and 7 were negative. As such we will proceed to make the necessary changes to better support registering content before online availability. We aim to enable this functionality in the second half of 2016.&lt;/span>&lt;/p>
&lt;p>&lt;span >We received survey responses varying in length from one or two sentences to multiple pages. A lot of the responses also interspersed questions and observations about entirely different issues that were of interest to respondents. As such, it has taken a while for us to analyze the results. We also found it was pretty much impossible for us to tabulate a summary of the responses to the direct questions. Instead we’ll summarize the responses at a high level and then drill down into some of the nuances in the answers and issues that were raised from the responses.&lt;/span>&lt;/p>
&lt;h2 id="span-the-positive-responsesspan">&lt;span >The positive responses&lt;/span>&lt;/h2>
&lt;p>&lt;span >By “positive” we mean the respondent understood the issues we were trying to address and thought what we were proposing was a reasonable way to address the problems. Here are a few (anonymized) excerpts from the responses:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“[This] is very timely as we have been made aware of changes to manuscript deposit requirements for UK authors. Authors who partake in the REF system will have to deposit articles at their manuscript stage before publication. We need to set an embargo on the articles so that they only become discoverable at some point after the publication date. Ideally we would like this to happen with all articles regardless of where they are from as authors will put their own work up on open access sites.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“Your proposal and the associated workflow look good to us and will help with our media embargo timelines, as well as our authors’ institutional requirements.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“The workflows and solutions seem reasonable … The temporary landing page seems like a sustainable technical solution. Hosting by Crossref is key to this – there is no way that all publishers would otherwise take on maintaining temporary pages. And having a standard display for metadata consistency is crucial too.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“Early assignment and recording of DOIs from the point of acceptance forms a key step in [the university’s] proposed ‘Submit-accept-deposit’ workflow. We welcome the proposal by Crossref to enable early assignment of DOIs for publications.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Note that a positive response did not mean the respondents thought the problems necessarily applied to them or that they would necessarily be implementing the changes - just that what we were proposing seemed sound for those who needed to address the issues.&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“While not directly relevant to our business the proposal seems aimed to protect the integrity of DOIs and Crossref’s role and that is not a bad thing.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“I would consider it an irresponsible use of the system on the part of a publisher to circulate dois that don’t (yet) work. This is bound to lead to frustration with users encountering errors. However I appreciate that this situation may arise in some workflows and therefore your proposals to implement temporary landing pages make sense.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“I was not aware of these issues, but think that your solutions seem feasible. We are a small journal and generally don’t add doi’s or publish until the article is complete (i.e., we don’t post anything that’s just accepted - only finalized). So we would be unlikely to update our workflow.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Also, though respondents might have been generally positive about the proposal - that didn’t always mean they were also sanguine about it. For example, several shared concerns about the potential costs of changing their workflows.&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“[we] would consider implementing this change into our workflow. Limiting factors would include the effort and additional cost to enable our paper management system vendor…”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“My only comment is that the process needs to be streamlined as much as possible so small publishers without great technical capacity will not be burdened with twice the work or with additional expense. After reading through Crossref’s proposal, I believe you have taken such things into account and will implement an efficient and worthwhile system.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“The workflow makes sense as a solution to the problem you describe […] but will require extensive workflow changes on our end in order to implement. Speaking for a small publishing house I’m not sure it’s reasonable to expect this from us on any short term.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Several of the positive respondents also wondered about how we would handle particular edge cases (e.g. rescinding acceptances) and/or offered suggestions to improve the proposal. We will discuss these further at the end of this post.&lt;/span>&lt;/p>
&lt;h2 id="span-the-neutral-responsesspan">&lt;span >The neutral responses&lt;/span>&lt;/h2>
&lt;p>&lt;span >The responses we categorised as “neutral” were generally too short to conclude much about. They consisted of one or two sentences that said something like “this doesn’t apply to me.” It wasn’t clear whether it didn’t apply to them because they didn’t have the problems we described or because they’d already solved the problems we described (e.g. by providing their own interim landing pages). They also didn’t comment on the applicability to other members or whether they thought the issues might eventually affect them.&lt;/span>&lt;/p>
&lt;h2 id="span-the-negative-responsesspan">&lt;span >The negative responses&lt;/span>&lt;/h2>
&lt;p>&lt;span >We categorised responses as “negative” when the member rejected that the issues we outlined were actual problems or they rejected the mechanisms we were proposing to address the problems.&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“…a formal letter of acceptance on a letter in PDF will be OK for &lt;/span>&lt;span >authors. Why a DOI is better?”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“…I am aware of funder and institutional requirements for authors to take action on acceptance of manuscripts for publication in journals but don’t think the time pressure is so high that it has to happen in short time between acceptance and published ahead of print online…”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“Of all the accepted-but-not-yet-published papers in existence at any time, the number whose existence must be demonstrated to promotion and tenure review boards must be awfully small.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >There are a few common themes here. The first is that, historically, the industry has been content with acceptance letters as proof of publication and that it was relatively rare for authors to have to produce such proof.&lt;/span>&lt;/p>
&lt;p>&lt;span >The problem that has led us to propose support for a modified workflow is that now we have situations where all the researchers in a country require such letters on a regular basis - not just when they are up for promotion or tenure. This is the new reality faced by researchers and institutions who are subject to regular national evaluation schemes like the &lt;a href="http://www.ref.ac.uk/" target="_blank">REF&lt;/a> and &lt;a href="https://web.archive.org/web/20160229001254/http://www.arc.gov.au/era-faqs" target="_blank">ERA&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >One of the negative respondents added:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“This is very familiar territory. It’s definitely coming out of STEM.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Indeed, the initial pressure to support the earlier registration of DOIs is certainly falling on our members who focus on STEM publishing. Researchers in the STEM fields are generally under more pressure to publish articles frequently and they are primarily affected by emerging funder mandates. The relatively high research output in STEM fields combined with the need for regular compliance checks and regular evaluation schemes is creating an environment that requires more automated mechanisms to keep track of publications. Asking for and processing letters of acceptance in these situations just doesn’t scale.&lt;/span>&lt;/p>
&lt;p>&lt;span >Some of the negative responses also questioned our assertions about the hazards of promulgating unregistered DOI-like strings and/or the problems associated with the delay between when content is made available online and when the content is registered.&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“I don’t buy the argument that people lose trust in DOIs in general because they once tried to resolve one and it didn’t lead to an article. By the same argument, URLs in general are similarly undermined.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >“where authors ask me for their DOIs so they can accurately cite the paper in another publication or use it for grants and applications. I explain that it won’t work until the issue as a whole posts and I have never heard back about confusion or distrust of the system.”&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >To this all we can say is, we have the data. Next to typos, unregistered DOIs account for the second greatest category of failed resolutions on the Crossref system. Our help desk has to explain them to researchers constantly. We have promoted DOIs as being more robust, persistent identifiers than ordinary URLs. People are not surprised when URLs don’t work. They are surprised when DOIs don’t work. We’d like to keep it that way.&lt;/span>&lt;/p>
&lt;p>&lt;span >What seems to be at the the root of the few negative responses - is that most assumed that Crossref was mandating that publishers change their workflow - even if they didn’t face any of the issues outlined in the proposal. There is very little that Crossref &lt;em>mandates&lt;/em> to participate. This is by design. Our membership is just too diverse for us to have mandates that can be sensibly applied to all. Still - we should have made it clearer in the proposal that the proposed changes would not be mandated. We will certainly need to make this clearer when we roll out support for the new processes.&lt;/span>&lt;/p>
&lt;p>&lt;span >Oh yeah - one respondent called us out for using the phrase “advanced publication” instead of “advance publication”. For this we are truly sorry. The employee who made this mistake has been dragged out and shotted.&lt;/span>&lt;/p>
&lt;h2 id="span-issues-raised-and-questions-askedspan">&lt;span >Issues raised and questions asked&lt;/span>&lt;/h2>
&lt;p>&lt;span >Both the positive and negative respondents raised issues, asked questions and provided suggestions regarding the proposal. We will make sure that, when the proposal is implemented, we address all of these issues more clearly, but in the meantime, we thought it would be helpful if we answered some of them briefly here.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Would Crossref charge extra for the new workflow?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> No. We should have made this clear in the proposal. We should have also mentioned that, in the “Crossref-facilitated Early Registration” scenario members will only be charged once they have replaced the “registered_content” metadata with metadata for the published item using one of the existing content schemas (e.g. article, book, confproc).&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Would Crossref require that publishers adopt the new proposed workflows?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> No. But we will recommend them to members who need to address the issues outlined in the proposal. And in general, we will recommend that our members register DOIs as early in the process as practicable.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> What does “acceptance” mean? It was pointed out that there were lots of variations of “acceptance” including “acceptance pending revisions”, etc.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> We would expect that “contingent acceptance” does not constitute final acceptance and that in this case “acceptance” should mean that the publisher has a copy of the manuscript in which the author has made all of the changes asked of them.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Doesn’t “acceptance” works both ways? A researcher has to grant permission to publish to the publisher.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> This is a vital point - the publisher should only register content for which they have already secured the rights to publish.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Collecting and verifying the metadata associated with a paper is expensive and time consuming. As such, some publishers only produce complete and robust metadata after a paper has been accepted. We face a Catch-22. if we deposit metadata immediately after acceptance, it will be sparse and unreliable. If we wait to collect and verify the metadata, then we risk violating some of the emerging mandates. How do we resolve this dilemma?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> This is clearly beyond our control, but we expect that those issuing the mandates will have to make some reasonable accommodations if they expect publishers to register content both early and with reasonably useful metadata.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> How would publishers handle rescinded acceptances?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> Publishers can handle this the same way they handle retractions or withdrawals. Additionally, the registered record type and the “intent to publish” landing page will both support Crossmark for those members who use Crossmark to promulgate corrections to the literature. We will explore adding a new “acceptance rescinded” update type to the Crossmark schema.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> The Crossref DOIs we generate contain embedded publication information such as volume and issue. We don’t know these details at acceptance so how can we register DOIs early? &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> Many of our members generate Crossref DOIs with embedded semantic information in them such as volume/issue, publication date or even author initials and title. After 16 years of experience, we have found that this tends to be a bad idea. Publication schedules slip. Metadata changes. We will soon be revising our guidelines on DOI best practice in Crossref DOI generation to recommend against embedding such information into the DOI itself. Clearly, if you decide to assign Crossref DOIs at acceptance, you will need to adopt a DOI structure that accommodates this.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Our hosting provider manages DOI registrations for us. If we have to register DOIs earlier in the process, can we have one party (e.g. a manuscript tracking system vendor) register the initial “registered_content” metadata and then have different party (e.g. hosting provider or typesetter) replace that record with the final metadata?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> Yes.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Will you be working with industry vendors to help them support this new workflow?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> Yes.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Will we support the pre-registration of DOIs in the the deposit forms on the Crossref site?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> Yes.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> If Crossref hosts the “intent to publish” landing page, how will publishers be able to account for visits to the page and incorporate that information into their metrics?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> While visitors to the Crossref hosted page will not show up in the publisher’s own hosting platform logs, publishers will be able to easily see how many times their “intent to publish” landing page was accessed by looking at their standard Crossref DOI resolution logs.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Could the Crossref-hosted landing page also include the URL that the DOI will eventually be associated with?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> This is an interesting idea and was suggested by two separate respondents. The challenge will be in explaining to the user that the URL might or might not work. We are also concerned that this would reduce the incentive for publishers to replace the holding page in a timely manner. We’ll explore this option as we continue to work on implementation.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> Would the Crossref-hosted landing page be open to indexing by Google and others? If so, wouldn’t this undermine articles under press embargoes?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong> The idea behind the limited metadata required for registering content is that it allows the publisher to control the balance between discovery (needed to meet funder requirements) and discretion (needed to manage publicity). So yes, the Crossref-hosted landing pages would be open to indexing, but publishers can still control what gets indexed by withholding metadata as needed.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Q:&lt;/strong> The table of required metadata elements for the “registered content” type does not include the author. How are such records supposed to be used as proof of acceptance if they do not include the author name?&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>A:&lt;/strong>&lt;strike> We made a mistake. The table should have included the contributor in the required element column.&lt;/strike>&lt;strong> Update:&lt;/strong> We retract our retraction! We are trying to accommodate several different use cases for &amp;rsquo;early content registration&amp;rsquo; and these different use cases often have contradictory metadata implications. So, for example, including the author is certainly important for monitoring mandate compliance. However, including the author might be problematic when the publisher is trying to manage publicity around an upcoming publication. Again, Crossref is not in a position to resolve this dilemma and we expect that those issuing the mandates will make some reasonable accommodations with publishers who need to manage publicity around publications. In short, “authors” will remain optional metadata.&lt;/span>&lt;/p>
&lt;h2 id="span-summary-and-conclusionsspan">&lt;span >Summary and conclusions&lt;/span>&lt;/h2>
&lt;p>&lt;span >We were delighted with the response rate on the proposal. It is clear to us that a lot of the respondents really appreciated both being alerted to a set of issues that they were not yet aware of and that they valued the chance to comment on our proposed mechanisms for addressing said issues. We also learned some lessons on how to better structure any such future surveys in order to make them easier for us to summarise and respond to. The wide variety of responses and detailed descriptions of different workflows reconfirmed our sense that Crossref members vary widely in their working practices. We need to continue to work directly with members and understand these different working practices so that we can provide appropriately flexible services to our membership and to the scholarly community in general.&lt;/span>&lt;/p>
&lt;p>&lt;span >Finally, the feedback we received will be used by our product team and our communications &amp;amp; outreach teams to refine our rollout plans for registering content before online availability. We expect that we will rollout this functionality in the second half of 2016.&lt;/span>&lt;/p>
&lt;p>&lt;span >Thanks to those who responded to our RFC. Some of those responses included questions about other matters relating to Crossref. We have attempted to extract these and answer them directly- but if we have not yet answered one of your questions, please follow-up with us at &lt;a href="mailto:feedback@crossref.org">&lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>&lt;/a>&lt;/span>&lt;/p></description></item><item><title>Event Data: open for your interpretation</title><link>https://www.crossref.org/blog/event-data-open-for-your-interpretation/</link><pubDate>Thu, 25 Feb 2016 00:00:00 +0000</pubDate><author>Madeleine Watson</author><guid>https://www.crossref.org/blog/event-data-open-for-your-interpretation/</guid><description>&lt;h3 id="span-strongwhat-happens-to-a-research-work-outside-of-the-formal-literature-thats-what-event-data-will-aim-to-answer-when-the-service-launches-later-this-yearstrongspan">&lt;span >&lt;strong>What happens to a research work outside of the formal literature? That’s what Event Data will aim to answer when the service launches later this year.&lt;/strong>&lt;/span>&lt;/h3>
&lt;p>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo.png" rel="attachment wp-att-1356">&lt;img class="alignnone wp-image-1356 size-medium" src="https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo-300x124.png" alt="Crossref Event Data Logo" width="300" height="124" srcset="https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo-300x124.png 300w, https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo-768x319.png 768w, https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo-1024x425.png 1024w, https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo-1200x498.png 1200w, https://www.crossref.org/wp/blog/uploads/2016/02/CROSSREF_EventData_logo.png 1374w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Following the successful &lt;a href="https://www.crossref.org/blog/crossrefs-doi-event-tracker-pilot/" target="_blank">DOI Event Tracker pilot&lt;/a> in Spring 2014, development has been underway to build our new service, newly re-named Crossref Event Data. It’s an open data service that registers online activity (specifically, events) associated with Crossref metadata. Event Data will collect and store a record of any activity surrounding a research work from a defined set of web sources. The data will be made available as part of our metadata search service or via our Metadata API and normalised across a diverse set of sources. Data will be open, audit-able and replicable.&lt;/span>&lt;/p>
&lt;p>&lt;span >We expect to include the following sources at the launch of the clearinghouse in Q3 (pending final confirmation):&lt;/span>&lt;/p>
&lt;p>&lt;span >[table id=1 /]&lt;/span>&lt;/p>
&lt;h1 id="span-what-could-you-achievespan">&lt;span >What could you achieve?&lt;/span>&lt;/h1>
&lt;p>&lt;span >Anyone interested in metrics and analytics will have direct and open access to a single collection of DOI activity data of events occurring outside of the formal literature. As Event Data records are time-stamped, you can be assured that the data you receive is both auditable and replicable. Collected and stored by Crossref in the one location, we invite researchers, publishers, funders and altmetrics providers to consider the possibilities Event Data offers to enrich and expand your work. &lt;/span>&lt;/p>
&lt;h3 id="span-strongwith-such-a-corpus-of-open-transferable-and-auditable-raw-data-at-your-fingertips-what-could-you-achieve-strongspan">&lt;span >&lt;strong>With such a corpus of open, transferable and auditable raw data at your fingertips, what could you achieve? &lt;/strong>&lt;/span>&lt;/h3>
&lt;h2 id="span-general-and-altmetrics-service-providersspan">&lt;span >General and altmetrics service providers&lt;/span>&lt;/h2>
&lt;p>&lt;span >Crossref Event Data is a centrally-managed resource, therefore as a third party vendor you will have the ability to collect real-time data from a central location to enrich, analyze, interpret and report via your own tools. Using our API, you will gain regular access to our collection of raw, auditable data to feed into your own tools and services ready for aggregation and analysis. Additionally, the optional benefit of an SLA with Crossref will ensure that your clients have access to a reliable and flexible source of event data.&lt;/span>&lt;/p>
&lt;h2 id="span-journal-editorsspan">&lt;span >Journal editors&lt;/span>&lt;/h2>
&lt;p>&lt;span >&lt;span >Using the data collected in our service, as an editor you can attract authors by offering data on the audience’s research interest, track the full-scope of article dissemination and gain a better understanding of how the publications you manage compare to each other. By analysing the Event Data records, you can q&lt;/span>&lt;span >uickly find reviewers based on publication network analysis, identify new areas to grow author submissions and track the reach of submissions selected for publication. &lt;/span>&lt;/span>&lt;/p>
&lt;h2 id="span-fundersspan">&lt;span >Funders&lt;/span>&lt;/h2>
&lt;p>&lt;span >As a Funder, you can use Event Data to isolate and track the dissemination and usage of the research you funded outside of the scholarly literature. As the data is portable, you can be assured that should a journal move, your ability to track its dissemination moves with it. Using the Event Data records collection, you can:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Efficiently track progress of the research impact of grant awardees in an automated fashion, with the signals most relevant to your organisation&lt;/span>&lt;/li>
&lt;/ul>
&lt;li >
&lt;span >Develop measurements of research engagement at the article level which reflect your mission and current funding priorities&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Gain visibility into the potent success stories highlighting the impact of your work for your development campaigns&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Analyze trends of past and future funding programs&lt;/span>
&lt;/li>
&lt;li >
&lt;span >More effectively pursue your funding strategy and manage your portfolio based on data-driven decision making. &lt;/span>
&lt;/li>
&lt;h2 id="span-publishers-and-publishing-platforms-span">&lt;span >Publishers and publishing platforms &lt;/span>&lt;/h2>
&lt;p>&lt;span >By analyzing and interpreting the Event Data collection, as a publisher or content distributor you can use the records to undertake the following metric-lead analysis to help drive your business needs: &lt;/span>&lt;/p>
&lt;li >
&lt;span >Conduct more robust publication growth analysis across titles, subject areas, or all published literature&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Gain a balanced understanding of the engagement on your publications across subject areas, titles, or managing editors&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Enhance author services (personalization, content discovery, profile management, etc.)&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Focused and data-driven product development of tools and services to drive audience engagement&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Provide content distributors data on downstream reach of publications.&lt;/span>
&lt;/li>
&lt;h2 id="span-bibliometriciansspan">&lt;span >Bibliometricians&lt;/span>&lt;/h2>
&lt;p>&lt;span >Event Data heavily supports Bibliometric research by facilitating the tracking of DOI-related research activity across different platforms and channels. As a Bibliometrician, use trusted raw data as the underlying data for your research, which you can easily obtain from Crossref in a single, normalized format across a variety of sources. Additionally, as Event Data data is replicable, portable and auditable, you will be assured of high quality results in your research projects.&lt;/span>&lt;/p>
&lt;h2 id="span-research-institutions-span">&lt;span >Research institutions &lt;/span>&lt;/h2>
&lt;p>&lt;span >All of the stakeholders in your institution, from the research, development and marketing offices to the researchers themselves, will benefit from access to data about where and how your research is being discussed in mainstream and social media. As a research institution, Event Data can help you:&lt;/span>&lt;/p>
&lt;li >
&lt;span >Track dissemination of publications (types of channels, rate of growth, etc.) by members of the institution&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Access up-to-date information on the research progress of faculty members, useful for tenure and promotion decisions&lt;/span>
&lt;/li>
&lt;li >
&lt;span >View data on downstream impact of publications&lt;/span>
&lt;/li>
&lt;li >
&lt;span >Roll up data for custom reporting of department’s research activities&lt;/span>
&lt;/li>
&lt;h1 id="span-stay-tuned-testing-begins-soonspan">&lt;span >Stay tuned, testing begins soon!&lt;/span>&lt;/h1>
&lt;p>&lt;span >&lt;span >With development work on the MVP (Minimum Viable Product) scheduled to complete shortly, we will soon be releasing a small subset of data sources that are collecting event data as well as a testing environment for interested parties to explore a very preliminary version of the software as we continue to work towards implementation of the full Event Data clearinghouse release in Q3. Look out for our MVP announcement, with full technical specifications and confirmation of the selected initial pull and push sources, over the coming weeks.&lt;/span> &lt;/span>&lt;/p></description></item><item><title>Revived: Crossref Books Interest Group</title><link>https://www.crossref.org/blog/revived-crossref-books-interest-group/</link><pubDate>Wed, 24 Feb 2016 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/revived-crossref-books-interest-group/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/02/books_interest_group_3.png" rel="attachment wp-att-1333">&lt;img class="wp-image-1333 alignright" src="https://www.crossref.org/wp/blog/uploads/2016/02/books_interest_group_3.png" alt="books_interest_group_3" width="312" height="312" srcset="https://www.crossref.org/wp/blog/uploads/2016/02/books_interest_group_3.png 800w, https://www.crossref.org/wp/blog/uploads/2016/02/books_interest_group_3-150x150.png 150w, https://www.crossref.org/wp/blog/uploads/2016/02/books_interest_group_3-300x300.png 300w, https://www.crossref.org/wp/blog/uploads/2016/02/books_interest_group_3-768x768.png 768w" sizes="(max-width: 312px) 85vw, 312px" />&lt;/a>We’re reviving the Books Interest Group, and inviting new members!&lt;/p>
&lt;p>After a hiatus, Crossref’s Books Interest Group is back.  We’re excited to announce that Emily Ayubi of the American Psychological Association has agreed to chair the group.&lt;/p>
&lt;p>&lt;span >In reviving the group, our intention is to create opportunities to talk about issues that are important to scholarly book publishers.  For example, we hope to explore whether it is time to revise the &lt;/span>&lt;a href="https://www.crossref.org/documentation/principles-practices/books-and-chapters/" target="_blank">&lt;span >Crossref best practices&lt;/span>&lt;/a> &lt;span >for depositing, versioning, and linking book content.   &lt;/span>&lt;/p>
&lt;p>&lt;span >We are seeking interested members from the book publishing community, and want to hear your ideas for agenda items and topics for discussion.  &lt;/span>&lt;/p>
&lt;p>&lt;span >Our first meeting will be a teleconference held at 11:00 am Eastern time on Wednesday, March 23rd.  You will receive dial-in details by email. &lt;/span>** **&lt;/p>
&lt;p>&lt;strong>If you’d like to join—and we’re hoping you will—please email me at &lt;strong>&lt;a href="mailto:aondis@crossref.org">&lt;strong>aondis@crossref.org&lt;/strong>&lt;/a>&lt;/strong>.&lt;/strong>&lt;/p></description></item><item><title>Request for Community Comment: registering content before online availability</title><link>https://www.crossref.org/blog/rfc-registering-content-before-online/</link><pubDate>Thu, 21 Jan 2016 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/rfc-registering-content-before-online/</guid><description>&lt;p>&lt;span >Crossref is proposing a process to support the registration of content—including DOIs and other metadata—prior to that content being made available, or published, online. We’ve drafted a paper providing background on the reasons we want to support this and highlighting the use cases. One of the main needs is in journal publishing to support registration of Accepted Manuscripts immediately on or shortly after acceptance, and dealing with press embargoes.&lt;/span>&lt;figure id="attachment_1303" class="wp-caption alignright">&lt;/p>
&lt;p>&lt;a href="http://outreach.crossref.org/acton/attachment/16781/f-000b/1/-/-/-/-/RFC4Feb-RegisterContentBeforeOnline.pdf" target="_blank" rel="attachment wp-att-1292">&lt;img class="wp-image-1303 size-medium" src="https://www.crossref.org/wp/blog/uploads/2016/01/Screenshot-2016-01-20-00.00.24-225x300.png" alt="Proposal doc for community comment" width="225" height="300" srcset="https://www.crossref.org/wp/blog/uploads/2016/01/Screenshot-2016-01-20-00.00.24-225x300.png 225w, https://www.crossref.org/wp/blog/uploads/2016/01/Screenshot-2016-01-20-00.00.24.png 754w" sizes="(max-width: 225px) 85vw, 225px" />&lt;/a>&lt;figcaption class="wp-caption-text">&lt;span >Proposal doc for community comment&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;em>&lt;strong>&lt;span >We request community comment on the &lt;/span>&lt;/strong>__&lt;strong>&lt;span >proposed approach as outlined &lt;a href="http://outreach.crossref.org/acton/attachment/16781/f-000b/1/-/-/-/-/RFC4Feb-RegisterContentBeforeOnline.pdf">in this report&lt;/a>.&lt;/span>&lt;/strong>&lt;/em>&lt;/p>
&lt;p>&lt;span >&lt;strong>Some examples of what we’d like to know:&lt;/strong>&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Are you aware of the issues outlined in this proposal?&lt;/span>&lt;/li>
&lt;li>&lt;span >Are you aware of the funder and institutional requirements for authors to take action on acceptance of manuscripts for publication in journals?&lt;/span>&lt;/li>
&lt;li>&lt;span >Do you think the proposed solution and workflows are reasonable?&lt;/span>&lt;/li>
&lt;li>&lt;span >Are you likely to update your workflow to register content early?&lt;/span>&lt;/li>
&lt;li>&lt;span >If you are likely to update your workflow, how long do you estimate it will take?&lt;/span>&lt;/li>
&lt;li>&lt;span >Any other general comments, questions or feedback on anything raised in this document. &lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >&lt;b>&lt;i>Please send comments, feedback and questions to me, Ginny, at &lt;a href="mailto:feedback@crossref.org">&lt;a href="mailto:feedback@crossref.org">feedback@crossref.org&lt;/a>&lt;/a>. The deadline for comments is February 4th. Thanks!&lt;/i>&lt;/b>&lt;/span>&lt;/p></description></item><item><title>Linking clinical trials = enriched metadata and increased transparency</title><link>https://www.crossref.org/blog/linking-clinical-trials-enriched-metadata-and-increased-transparency/</link><pubDate>Mon, 18 Jan 2016 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/linking-clinical-trials-enriched-metadata-and-increased-transparency/</guid><description>&lt;p>We will shortly be adding a new feature to Crossmark. In a section called “Clinical Trials” we will be using new metadata fields to link together all of the publications we know about that reference a particular clinical trial.&lt;/p>
&lt;p>Most medical journals make clinical trial registration a prerequisite for publication. Trials should be registered with one of the fifteen &lt;a href="https://web.archive.org/web/20160220120635/http://www.who.int/ictrp/network/primary/en" target="_blank">WHO-approved public trial registries&lt;/a> , or with &lt;a href="http://www.clinicaltrials.gov/%22" target="_blank">clinicaltrials.gov&lt;/a> which is run by the US National Library of Medicine. Once registered, a trial is assigned a &lt;strong>clinical trial number (CTN)&lt;/strong> which is subsequently used to identify that trial in any publications that report on it.&lt;/p>
&lt;p>Publications that result from any one trial are likely to be released in multiple journals from different publishers and at different times, for example secondary
analyses coming some time after the publication of the initial results. Cross-publisher collaboration is paramount to linking all of these publications together so that researchers, funders, and regulatory agencies can understand the whole set of results from clinical trials. With this in mind, a group of medical publishers, led by BioMedCentral, approached Crossref to establish a working group, and here, &lt;a href="http://blogs.biomedcentral.com/on-medicine/2014/01/31/threaded-publications-one-step-closer" target="_blank">they designed an approach to address this problem:&lt;/a> “thread” all the various documents together surrounding a clinical trial.&lt;/p>
&lt;h2 id="updated-upstream">Updated upstream&lt;/h2>
&lt;p>To implement threaded publications, publishers extract clinical trial numbers from papers, or ask authors to submit those numbers to them. Publishers add the CTNs to the Crossref DOI metadata via three new fields: clinical trial number, clinical trial registry where trial is registered, and trial stage (pre-results, results or post-results of the trial). Crossref has assigned unique IDs to each trial registry (much the same as we have done for funders in our &lt;a href="https://www.crossref.org/services/funder-registry">Funder Registry&lt;/a> and for the same reason - trial registry names and URIs can change over time and we need a persistent identifier). U&lt;/span>&lt;span >sing a combination of trial registry ID and clinical trial number, we can easily identify other content in the Crossref database that cites the same trial. Finally, Crossref displays the clinical trial metadata on the respective papers for all participating Crossmark publishers. Crossmark is a convenient place for readers to access the clinical trial information and is readily accessible directly from the journal article (online and PDF versions). And of course all of the data also goes into our &lt;a href="https://api.crossref.org" target="_blank">open API&lt;/a> so that anyone can make use of it.&lt;/p>
&lt;p>The reporting of clinical trial results is notoriously inconsistent, something that the &lt;a href="http://www.alltrials.net/%22" target="_blank">AllTrials initiative&lt;/a> is also seeking to address. Publishers can help by collecting this information upstream and disseminating it using the existing Crossref infrastructure.&lt;/p>
&lt;p>We ask all publishers to deposit the clinical trial data which is so critical to transparency in this area of research, and have already had the &lt;a href="https://api.crossref.org/v1/works/10.3310/hta191010" target="_blank">first data&lt;/a> in from Crossref member the &lt;a href="http://www.nihr.ac.uk/" target="_blank">National Institute of Health Research&lt;/a>. Once we launch the initial set of linked clinical trials, we will expand coverage of the threaded publications to include all content that reports on or references a clinical trial, from protocol to results to supporting data and systematic reviews.&lt;/p>
&lt;p>Stay tuned and watch this space as threaded publications rolls out to journal articles across publishers!&lt;/p></description></item><item><title>Crossref &amp; the Art of Cartography: an Open Map for Scholarly Communications</title><link>https://www.crossref.org/blog/crossref-the-art-of-cartography-an-open-map-for-scholarly-communications/</link><pubDate>Fri, 08 Jan 2016 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/crossref-the-art-of-cartography-an-open-map-for-scholarly-communications/</guid><description>&lt;p> &lt;/p>
&lt;p>&lt;span >In the &lt;a href="https://www.crossref.org/crossref-annual-meeting/archive/#2015">2015 Crossref Annual Meeting&lt;/a>, I introduced a metaphor for the work that we do at Crossref. I re-present it here for broader discussion as this narrative continues to play a guiding role in the development of products and services this year.&lt;/span>&lt;/p>
&lt;h5 id="span-bmetadata-enable-connectionsbspan">&lt;span >&lt;b>Metadata enable connections&lt;/b>&lt;/span>&lt;/h5>
&lt;p>&lt;span >&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/01/pasted-image-0.png" rel="attachment wp-att-1214">&lt;img class="alignright wp-image-1214" src="https://www.crossref.org/wp/blog/uploads/2016/01/pasted-image-0-200x300.png" alt="Cartography Borges" width="250" height="375" srcset="https://www.crossref.org/wp/blog/uploads/2016/01/pasted-image-0-200x300.png 200w, https://www.crossref.org/wp/blog/uploads/2016/01/pasted-image-0.png 540w" sizes="(max-width: 250px) 85vw, 250px" />&lt;/a>At Crossref, we make research outputs easy to find, cite, link, and assess through DOIs. Publishers register their publications and deposit metadata through a variety of channels (XML, CSV, PDF, manual entry), which we process and transform into Crossref XML for inclusion into our corpus. This data infrastructure which makes possible scholarly communications without restrictions on publisher, subject area, geography, etc. is far more than a reference list, index or directory.&lt;/span> &lt;/span>&lt;/p>
&lt;p>&lt;span >If research builds on what came before, one could claim that the process of knowledge production is partly the story of the very relationships between results disseminated (i.e., publications). So let’s consider each publication as a node in a graph where &lt;a href="https://www.crossref.org/wp/blog/uploads/2016/01/Map-entities.jpeg" rel="attachment wp-att-1247">&lt;img class="wp-image-1250 alignright" src="https://www.crossref.org/wp/blog/uploads/2016/01/Map-entities-300x237.jpeg" alt="" width="211" height="166" srcset="https://www.crossref.org/wp/blog/uploads/2016/01/Map-entities-300x237.jpeg 300w, https://www.crossref.org/wp/blog/uploads/2016/01/Map-entities.jpeg 651w" sizes="(max-width: 211px) 85vw, 211px" />&lt;/a>each has a coordinate and is connected by its citations to other publications (as well those that cite it). Additionally, each is associated with a set of people and places, along with a whole host of elements involved in the research and dissemination process.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >But take a wider berth, and we begin to capture relationships between all such contributing agents and objects involved in the research process. Here we find an array of entities belonging to the scholarly graph, including different types of research artifacts, publisher and journal, funders, ORCIDs, peer reviews, publication status updates (corrections, retractions, etc.), citations, license information, additional URLs (machine destinations, hosting platforms, etc.), underlying data, software and protocols, materials, discussions and blog posts, recommendations, reference work mentions, etc. The entities on the graph multiply at an even higher rate as researchers share more outputs across more channels. And over time, the graph expands exponentially, producing a webbing that is far more dense and far more vast than we can currently imagine. Perhaps even to the point we realize Borges’ story where a cartographer builds a map so large it replicates the territory itself (&lt;/span>&lt;em>&lt;a href="http://www.borges.pitt.edu/node/144">&lt;span >On Exactitude in Science&lt;/span>&lt;/a>&lt;/em>&lt;span >)!&lt;/span>&lt;/span>&lt;/p>
&lt;!--more-->
&lt;h5 id="span-bfrom-graph-to-cartographybspan">&lt;span >&lt;b>From graph to cartography&lt;/b>&lt;/span>&lt;/h5>
&lt;p>&lt;span >At the heart of Borges’s poignant story is the map. Crossref’s graph of scholarly communications could be seen in the same light. It has a representational aspect, which is not purely abstract and can be visualized. Here, a map becomes an incredibly potent metaphor. Each link enabled by publisher-deposited metadata is a new street, bridge, or highway that takes us to a particular place (i.e., entity) of interest. These roads lead to articles, researchers, funders, institutions, etc., and in doing so, make them discoverable. They tell a story about the roles of each in the broader research in the landscape dotted with a plethora of places. &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >The scholarly web has a growing corpus of more than &lt;/span>&lt;a href="https://data.crossref.org/reports/statusReport.html">&lt;span >78 million publications&lt;/span>&lt;/a>&lt;span > at this very moment registered with Crossref. On average ten to fifteen thousand new objects appear every day. Maps are all the more essential for getting around in a bewildering environment of new and unfamiliar places, even for known ones in areas of exploding growth. They are critical for orienteering, discovering relationships, identifying sets of associated objects, naming new neighborhoods that emerge (i.e., new research specialties), etc. And if each connection on the map is seen as an event, maps can also represent micro-narratives about the research process and the agents involved. A multi-dimensional map containing all these entities, which serves as an evolving representation of spacetime that is constantly updated and always available, would finally begin to depict the process of scholarly activity as a dynamic, evolving, almost living system.&lt;/span>&lt;/span>&lt;/p>
&lt;h5 id="span-ban-open-map-for-scholarly-communicationbspan">&lt;span >&lt;b>An open map for scholarly communication&lt;/b>&lt;/span>&lt;/h5>
&lt;p>&lt;span >&lt;span >Crossref builds such a scholarly map of the research enterprise and makes it openly available for the entire research ecosystem. Call this a meta map or, more recently, call it &lt;/span>&lt;a href="http://www.wired.com/2016/01/the-metastructure-transportation/">&lt;span >metastructure&lt;/span>&lt;/a>&lt;span >. No matter what name it goes by we call it infrastructure at Crossref.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Crossref’s open map for scholarly communications is a core part of the open information infrastructure for scholarly research. Crossref map data are open, portable, as well as licensed and provisioned for maximum reuse to serve the whole community. This open resource has two entrances: one for humans, another for machines. The &lt;/span>&lt;a href="https://github.com/Crossref/rest-api-doc/blob/master/rest_api.md">&lt;span >Crossref REST API&lt;/span>&lt;/a>&lt;span > enables machines to traverse this environment and mine it in equal measure to the humans behind them. It is configured so that a robot can learn, a phone can access, and platforms can be built.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.openstreetmap.org/">&lt;span >OpenStreetMap&lt;/span>&lt;/a>&lt;span > and &lt;/span>&lt;a href="https://developers.google.com/maps/?hl=en">&lt;span >Google Maps&lt;/span>&lt;/a>&lt;span >, both widely used and mature infrastructure maps, are instructive examples when we consider a map of this kind for scholarly communications. Map data can be represented in unlimited ways, depending on any variety of needs and users. Third parties can add content via &lt;/span>&lt;a href="http://googlegeodevelopers.blogspot.co.uk/2015/04/interactive-data-layers-in-javascript.html">&lt;span >interactive layers&lt;/span>&lt;/a>&lt;span > that tell different stories such as &lt;/span>&lt;a href="https://mapsengine.google.com/10237621067095735108-16932951632409324660-4/mapview/?authuser=0">&lt;span >health expenditure by country based on GDP&lt;/span>&lt;/a>&lt;span > and &lt;/span>&lt;a href="https://mapsengine.google.com/06900458292272798243-13579632754418963048-4/mapview/?authuser=0">&lt;span >coral reefs at risk&lt;/span>&lt;/a>&lt;span >. They have a broad base of users across business models from philanthropic services aimed at disaster relief (&lt;/span>&lt;a href="http://refugeemaps.eu/">&lt;span >Refugeemaps.eu&lt;/span>&lt;/a>&lt;span >) to commercial entities providing drivers with locations on open parking spaces (&lt;/span>&lt;a href="https://www.appyparking.com/">&lt;span >AppyParking&lt;/span>&lt;/a>&lt;span > on Google Map, &lt;/span>&lt;a href="https://twitter.com/pocketparker">&lt;span >PocketParker&lt;/span>&lt;/a>&lt;span > on OpenStreetMap). They power platforms and services that build maps for others (&lt;/span>&lt;a href="http://www.mapquest.com/">&lt;span >MapQuest&lt;/span>&lt;/a>&lt;span >, &lt;/span>&lt;a href="https://www.mapbox.com/">&lt;span >MapBox&lt;/span>&lt;/a>&lt;span >). They have applications far beyond the business of maps. For example, &lt;/span>&lt;a href="https://web.archive.org/web/20170716112842/https://developers.google.com/places/android-api/placepicker">&lt;span >Place picker&lt;/span>&lt;/a>&lt;span > is a Google Maps widget that supports easy auto-complete the entry of any place or location on a mobile app where typing is a chore. And as far use cases close to home, the two have served as raw data for academic research (ex: &lt;/span>&lt;a href="http://svn.vsp.tu-berlin.de/repos/public-svn/publications/vspwp/2011/11-10/2011-06-20_openstreetmap_for_traffic_simulation_sotm-eu.pdf">&lt;span >workflow for generating multi-agent traffic simulation scenarios&lt;/span>&lt;/a>&lt;span >, &lt;/span>&lt;a href="http://www.tandfonline.com/doi/abs/10.1080/13658816.2012.692791?journalCode=tgis20#.Vo11aJMrIo8">&lt;span >automatic classification of GPS trajectories for transportation modes&lt;/span>&lt;/a>&lt;span >, etc.).&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >In kind, the Crossref infrastructure map also supports: the development of any variety of new maps which re-present the data, the makers of map platforms that power the research enterprise, tools that use map data, as well as academic research (bibliometrics). We extract slices of data of common interest from the map and add them as additional layers by which anyone can access and create applications on or across these bands of data: &lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Contributors (authors, editors, reviewers)&lt;/span>&lt;/li>
&lt;li>&lt;span >Funding information (funding body, grant number)&lt;/span>&lt;/li>
&lt;li>&lt;span >Trial &amp;amp; study information (clinical trials registry number, registered report, replication study)&lt;/span>&lt;/li>
&lt;li>&lt;span >Publication history (versions, updates, revisions, corrections, retractions, dates received/accepted/published)&lt;/span>&lt;/li>
&lt;li>&lt;span >Peer review (status, type, reviews)&lt;/span>&lt;/li>
&lt;li>&lt;span >Access indicators (publication license for text &amp;amp; data mining, machine mining URLs)&lt;/span>&lt;/li>
&lt;li>&lt;span >Resources &amp;amp; associated research artifacts (preprints, figures &amp;amp; tables, datasets, software, protocols, research resource IDs)&lt;/span>&lt;/li>
&lt;li>&lt;span >Activity surrounding the publication (peer reviews, comments &amp;amp; discussions, bookmarks, social shares, recommendations).&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >Today, the map powers a host of public and commercial organisations alike for a wide range of scholarly and non-scholarly purposes:&lt;/span>&lt;/p>
&lt;table style="border: 1px solid #ffffff;" border="0" width="400" cellspacing="0" cellpadding="0">
&lt;tr>
&lt;td style="border: 1px solid #ffffff;">
&lt;ul>
&lt;li>
&lt;span >Publishers&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Funders&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Research institutions&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Archives &amp; repositories&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Research councils&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Data centres&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Professional networks&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Patent offices&lt;/span>
&lt;/li>
&lt;li>
&lt;span >Registration Agencies&lt;/span>
&lt;/li>
&lt;/ul>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td style=&amp;quot;border: 1px solid #ffffff;&amp;quot;&amp;gt;
&amp;lt;ul&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Indexing services&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Publishing vendors&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Peer review systems&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Reference manager systems&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Lab &amp;amp; diagnostics suppliers&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Info management systems&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Educational tools&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Data analytics systems&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;li&amp;gt;
&amp;lt;span &amp;gt;Literature discovery services&amp;lt;/span&amp;gt;
&amp;lt;/li&amp;gt;
&amp;lt;/ul&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>&lt;span >We will follow up this post to highlight a cross-section of these consumers in the Crossref map ecosystem and elaborate on what &amp;amp; how they have built from our data. An infrastructure map offers endless potential to third parties across publishers, funders, research institutions, and vendors working to serve the scholarly research enterprise.&lt;/span>&lt;/p>
&lt;h5 id="span-bthe-art-of-cartographybspan">&lt;span >&lt;b>The art of cartography&lt;/b>&lt;/span>&lt;/h5>
&lt;p>&lt;span >&lt;span >In the Crossref Product Management team, we have ambitious plans for map enhancements this year. They focus on expanding information density and ease of access to the data. In the former case, we will introduce a new class of locations where activity surrounding the publications are occurring when we launch the &lt;/span>&lt;a href="https://www.crossref.org/blog/det-poised-for-launch/">&lt;span >DOI Event Tracker&lt;/span>&lt;/a>&lt;span >. We will also initiate an extensive publisher campaign to achieve full metadata deposit completeness across our membership. No one can keep pace with the sheer volume of research activity happening online nor wander the &lt;a href="http://fusion.net/story/251095/lonely-web-the-dress-viral-social-media-profit/">Lonely Web&lt;/a> of research alone. The more metadata publishers provide for a publication, the more roads lead to its map location. After all, discoverability is closely associated with connectedness on a map.&lt;/span>&lt;span > And finally, in the latter case, we will refresh and enhance the user interface to make it more powerful for humans to traverse the ever-changing landscape (as easily as the REST API enables machines!).&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;i>&lt;span >I gratefully acknowledge the feedback received from the following who served as  generous and insightful sounding boards: &lt;/span>&lt;/i>&lt;i>&lt;a href="https://twitter.com/GinnyBarbour">Virginia Barbour&lt;/a>&lt;/i>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/TheoBloom">&lt;i>&lt;span >Theo Bloom&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/martin_eve">&lt;i>&lt;span >Martin Eve,&lt;/span>&lt;/i>&lt;/a> &lt;a href="https://twitter.com/danielskatz">&lt;i>&lt;span >Daniel S. Katz&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/AmyeKenall">&lt;i>&lt;span >Amye Kenall&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/catmacOA">&lt;i>&lt;span >Catriona MacCullum&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/CameronNeylon">&lt;i>&lt;span >Cameron Neylon&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/marknpatterson">&lt;i>&lt;span >Mark Patterson&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/KristenRatan">&lt;i>&lt;span >Kristen Ratan&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, &lt;/span>&lt;/i>&lt;a href="https://twitter.com/carlystrasser">&lt;i>&lt;span >Carly Strasser&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >, and &lt;/span>&lt;/i>&lt;a href="https://twitter.com/kaythaney">&lt;i>&lt;span >Kaitlin Thaney&lt;/span>&lt;/i>&lt;/a>&lt;i>&lt;span >.&lt;/span>&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2016/01/You-decide-where-to-go.001.jpeg" rel="attachment wp-att-1215">&lt;img class="wp-image-1215 aligncenter" src="https://www.crossref.org/wp/blog/uploads/2016/01/You-decide-where-to-go.001-300x169.jpeg" alt="Crossref map" width="405" height="228" srcset="https://www.crossref.org/wp/blog/uploads/2016/01/You-decide-where-to-go.001-300x169.jpeg 300w, https://www.crossref.org/wp/blog/uploads/2016/01/You-decide-where-to-go.001-768x432.jpeg 768w, https://www.crossref.org/wp/blog/uploads/2016/01/You-decide-where-to-go.001.jpeg 960w" sizes="(max-width: 405px) 85vw, 405px" />&lt;/a>&lt;/p></description></item><item><title>ORCID tipping point?</title><link>https://www.crossref.org/blog/orcid-tipping-point/</link><pubDate>Thu, 07 Jan 2016 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/orcid-tipping-point/</guid><description>&lt;p >
&lt;span >Today eight publishers have presented an open letter that sets out the rationale for &lt;a href="https://info.orcid.org/requiring-orcid-in-publications/" target="_blank">making ORCID iDs a requirement&lt;/a> for all corresponding authors, a move that is being backed by even more publishers and researchers as the news spreads on twitter with &lt;a href="https://twitter.com/hashtag/publishORCID?src=hash">#publishORCID&lt;/a>. Crossref is a founding organisation of ORCID and an ongoing supporter so it’s great to see further uptake and even more benefit for the research community.&lt;/span>
&lt;/p>
&lt;p >
&lt;span >We encourage all our members to strive for complete metadata and that should include ORCID iDs, whether their workflows are able to require them at submission or not. Since we launched the &lt;a href="https://www.crossref.org/blog/auto-update-has-arrived-orcid-records-move-to-the-next-level/">ORCID auto-update process&lt;/a> a couple of months ago, over 10,000 authors have given Crossref permission to automatically update their ORCID records.&lt;/span>
&lt;/p>
&lt;p >
&lt;span >The open letter—signed by eLife, PLOS, The Royal Society, AGU, EMBO, Hindawi, IEEE, and Science—also offers minimum implementation guidelines for the process:&lt;/span>
&lt;/p>
&lt;ol>
&lt;li>&lt;span >&lt;span >Require&lt;/span>. ORCID iDs are required for corresponding authors of published papers, ideally at submission.&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;span >Collect&lt;/span>. The collection of ORCID iDs is done via the ORCID API, so authors are not asked to type in or search for their iD.&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;span >Auto-update&lt;/span>. Crossref metadata is updated to include ORCID iDs for authors, so this information can automatically populate ORCID records.&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;span >Publish&lt;/span>. Author/co-author ORCID iDs are embedded into article metadata.&lt;/span>&lt;/li>
&lt;/ol>
&lt;p class="p1">
&lt;span >&lt;a href="http://orcid.org/blog/2016/01/07/publishers-start-requiring-orcid-ids" target="_blank">ORCID’s own announcement&lt;/a> gives further background and describes the benefits for researchers, such as single sign-on across journals and ultimately, increased discovery of their works. Everybody wins.&lt;/span>
&lt;/p></description></item><item><title>A healthy infrastructure needs healthy funding data</title><link>https://www.crossref.org/blog/a-healthy-infrastructure-needs-healthy-funding-data/</link><pubDate>Wed, 16 Dec 2015 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/a-healthy-infrastructure-needs-healthy-funding-data/</guid><description>&lt;p>&lt;span >&lt;span >We’ve been talking a lot about infrastructure here at Crossref, and how the metadata we gather and organize is the foundation for so many services - those we provide directly - and those services that use our APIs to access that metadata, such as &lt;/span>&lt;span >&lt;a href="http://www.growkudos.com" target="_blank">Kudos&lt;/a>&lt;/span>&lt;span > and &lt;/span>&lt;a href="http://www.chorusaccess.org/about/about-chorus/">&lt;span >CHORUS&lt;/span>&lt;/a>&lt;span >, which in turn provide the wider world of researchers, administrators, and funders with tailored information and tools.&lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>The initiative formerly known as FundRef &lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >Together Crossref’s &lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//funding" target="_blank">funding data&lt;/a> (previously known as FundRef  – we simplified the name)  and the &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">Open Funder Registry&lt;/a>, our taxonomy of grant-giving organisations, comprise a hub for gathering and querying metadata related to the questions: &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>&lt;i>“Who funded this research?” &lt;/i>&lt;/b>&lt;span >and &lt;/span>&lt;b>&lt;i>“Where has the research we funded been published?”&lt;/i>&lt;/b>&lt;/p>
&lt;/span>
&lt;p>&lt;span >To support the funding data initiative, three key pieces of metadata are needed from publishers:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Funder ID &lt;/span>&lt;/li>
&lt;li>&lt;span >Funder Name  &lt;/span>&lt;/li>
&lt;li>&lt;span >DOI&lt;i>&lt;/i>&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >Unfortunately only around half of the 950,000 Crossref DOIs with funding data contain funder IDs, the unique funder identifiers from the Open Funder Registry that are needed to link up all of the data.  So, only half of the data is useful. (And 950,000 DOIs is only a fraction of the 77 million DOIs in our database, but more on that later).&lt;/span>&lt;/p>
&lt;p>&lt;span >When we looked at the funding data that was coming in without funder IDs we were a little surprised. We had expected that most of these would be names that simply aren’t in the Open Funder Registry yet, and we thought there would be a certain amount of incorrect information that had been entered into the “funder_name” field. &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >Instead, what we found was that many of the names were correct, and the funder IDs were just &lt;/span>&lt;i>&lt;span >missing&lt;/span>&lt;/i>&lt;span >. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Tidying the data&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >To help correct this, we decided to match incoming names to funder IDs where we could do so with the highest level of confidence. After much testing to minimize false positives, we switched this on at the end of August 2015. &lt;/span>&lt;/p>
&lt;p>&lt;span >Throughout September and October, we inserted funder IDs for about 25% of the names that have been deposited without IDs. For October, the real numbers were 68,000 funder names with no IDs deposited, and 18,000 funder IDs inserted by Crossref. &lt;/span>&lt;/p>
&lt;p>&lt;span >In the same period 42,000 funder IDs were deposited by publishers. With our matching on top of this, we are achieving a little over a 50% overall success rate of “good” funding data (funder names and funder IDs together). &lt;/span>&lt;/p>
&lt;p>&lt;span >We have been very careful to distinguish the funder IDs that we have added from those deposited by publishers - provenance of data is an extremely important part of what we do. All funder IDs are tagged as provided either by the publisher or Crossref. Every time we insert an ID into a deposit, the publisher is notified in the deposit report. &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >We have also now added these tags to our &lt;/span>&lt;a href="https://api.crossref.org">&lt;span >REST API&lt;/span>&lt;/a>&lt;span > so that publishers can query to find out &lt;a href="https://api.crossref.org/v1/works?filter=funder-doi-asserted-by:crossref&amp;rows=100" target="_blank">exactly which DOIs&lt;/a> we have amended*. The ideal scenario at this point is that the publisher checks that they are happy with the matching and then redeposits the funding data for those DOIs, over-writing the &lt;/span>&lt;code>&amp;lt;span &amp;gt;doi-asserted-by: “crossref”&amp;lt;/span&amp;gt;&lt;/code>&lt;span > tag and claiming the metadata as their own. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Setting some limits &lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >The second largest problem with funding data was &lt;/span>&lt;i>&lt;span >incorrectly entered funder name&lt;/span>&lt;/i>&lt;span > – e.g. concatenation of several names or authors entering overly long or vague program names instead of the official funder name. &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >To help weed this out, we have made a couple of changes to the funding data deposit system:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >&lt;span >Funder_name field can no longer contain a numerical string over &lt;/span>&lt;b>4 digits&lt;/b>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;span >Funder_name field can no longer contain a text string over &lt;/span>&lt;b>200 characters&lt;/b>&lt;/span>&lt;/li>
&lt;li>&lt;span >Funder names that that do not adhere to these two rules will now cause the funding data section of the metadata deposit (not the whole deposit) to fail and return an error message.&lt;/span>&lt;/li>
&lt;/ul>
&lt;/span>
&lt;p>&lt;span >&lt;b>Getting the growth we need&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >As of today, 198 publishers deposit funding data with Crossref. This amounts to about 3.5% of Crossref’s membership&lt;/span> &lt;span >(although it’s a larger proportion of our total deposits). We need more publishers to deposit funding data so that &lt;/span>&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//funding">&lt;span >funding data search&lt;/span>&lt;/a>&lt;span > can become a truly useful tool for the community. There’s no sign-up process or additional fee - read about how to &lt;/span>&lt;a href="https://www.crossref.org/services/funder-registry/">&lt;span >get started&lt;/span>&lt;/a>&lt;span >, and take a look at our &lt;/span>&lt;a href="https://www.crossref.org/blog/best-practices-for-depositing-funding-data/">&lt;span >best practices for depositing funding data&lt;/span>&lt;/a>&lt;span >.  &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>Finally, we ask you: how can we get more and better funder metadata in 2016?&lt;br /> &lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >This is not a rhetorical question. Please tweet your thoughts @CrossrefOrg or email your replies to &lt;a href="mailto:info@crossref.org">info@crossref.org&lt;/a>. You will receive something special via snail mail if you reply to us – just Crossref’s way of saying thank you.&lt;br /> &lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;sup>&lt;em>*At the time of posting our database is re-indexing and the “asserted-by” tags are still filtering through to the API. Check back in a day or two for the full picture. &lt;/em>&lt;/sup>&lt;/span>&lt;/p></description></item><item><title>Distributed Usage Logging: A private channel for private data</title><link>https://www.crossref.org/blog/distributed-usage-logging-a-private-channel-for-private-data/</link><pubDate>Fri, 04 Dec 2015 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/distributed-usage-logging-a-private-channel-for-private-data/</guid><description>&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/11/PSM_V70_D236_Forty_wire_telephone_switchboard.png" alt="image 1907 forty wire telephone switchboard" width="263px" height="300px" class="img-responsive" />
&lt;p>&lt;em>Forty wire telephone switchboard, 1907, Author unknown, Popular Science Monthly Vol 70, Wikimedia Commons.&lt;/em>&lt;/p>
&lt;p>A few months ago Crossref announced that we will be launching a new service for the community in 2016 that tracks activities around DOIs recording user content interactions. These “events” cover a broad spectrum of online activities including publication usage, links to datasets, social bookmarks, blog mentions, social shares, comments, recommendations, etc. The &lt;a href="https://doi.org/10.64000/m57rd-n9868" target="_blank">Event Data&lt;/a> service collects the data and make it available to all in an open clearinghouse so that data are open, comparable, audit-able, and portable. These data are all publicly available from external platform partners, and they meet the terms of distribution from each partner.&lt;/p>
&lt;p>But Crossref and its members are also concerned about privacy. We recognise that not all data can be made open and public. Particularly if it is sensitive, personally identifiable data about usage. With this in mind, we are also launching an affiliated service, Distributed Usage Logging (DUL), for external parties to transmit sensitive data on user content interactions directly to authorized end points. As researchers are increasingly using “alternative” (non-publisher) platforms to store, access and share literature, publishers are correspondingly  interested in incorporating the activity on their publications into their COUNTER reports.&lt;/p>
&lt;p>Interested third-party sites might include the following:&lt;/p>
&lt;ul>
&lt;li>Institutional and subject repositories&lt;/li>
&lt;li>Aggregator platforms (EBSCOhost, IngentaConnect)&lt;/li>
&lt;li>Researcher-oriented social-networking sites (e.g. Academia.edu, ResearchGate, Mendeley)&lt;/li>
&lt;li>Reading environments and tools (e.g. ReadCube, Utopia Documents)&lt;/li>
&lt;/ul>
&lt;p>For publishers to process such events via their COUNTER-compliant usage reporting streams, they need private usage information and a secure channel by which to receive the data from the external platforms. Crossref will provide a switchboard that will enable these non-publisher platforms can safely transmit private data directly to the publisher.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;span >The work ahead entails close collaboration between Crossref, &lt;a href="http://www.projectcounter.org/" target="_blank">COUNTER&lt;/a>, and the partners who will be sending and receiving the private data. The cross-organisational team will be working towards the following before launch: technical infrastructure development for production service, semantic definition of the usage logging message, assignment and validation of credentials to participants in the scheme, participant integration of the DUL API, and incorporation of this data type into the COUNTER Code of Practice. We will also continue to consult with data privacy and security authorities to ensure that the scheme respects all governmental obligations and community best practice regarding the processing of personal data.&lt;/p>
&lt;p>We will share more about the launch of the service as we make progress along the way. Please contact &lt;a href="mailto:feedback@crossref.org">Jennifer Lin&lt;/a> for more information.&lt;/p></description></item><item><title>Crossref Labs plays with the Raspberry Pi Zero</title><link>https://www.crossref.org/blog/crossref-labs-plays-with-the-raspberry-pi-zero/</link><pubDate>Wed, 02 Dec 2015 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/crossref-labs-plays-with-the-raspberry-pi-zero/</guid><description>&lt;p>&lt;span >If you’re anything like us at Crossref Labs (and we know some of you are) you would have been very excited about the launch of the &lt;a href="https://www.raspberrypi.org/products/">Raspberry Pi Zero&lt;/a> a couple of days ago. In case you missed it, this is a new edition of the tiny low-priced Raspberry Pi computer. Very tiny and very low-priced. At $5 we just had to have one, and ordered one before we knew exactly what we want to do with it. You would have done the same. Bad luck if it was out of stock.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/12/run.jpg" alt="run" class="img-responsive" />
&lt;p>&lt;span >We love the way &lt;a href="https://www.crossref.org/blog/coming-to-you-live-from-wikipedia/">DOIs are being used in Wikipedia&lt;/a>, but you probably already &lt;a href="https://www.crossref.org/blog/real-time-stream-of-dois-being-cited-in-wikipedia/">know that by now&lt;/a>. Not only is it a brilliant source of information, mostly well cited, it’s also an organic living thing, with countless people and bots working together on countless articles. Our live stream of edits that cite (or uncite) DOIs shows new scholarly literature unfold, as it happens. From new articles to new references to improved citations to edit wars to bots cleaning up all the mess, it captivates everyone we show it to. The &lt;a href="https://live.eventdata.crossref.org/live.html">latest version has a live chart&lt;/a> to show exactly how much activity is going on.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref works in five ways: &lt;a href="https://www.crossref.org/blog/the-logo-has-landed/">Rally, Tag, Run, Play, and Make&lt;/a> and this definitely comes under ‘Play’. By the time our Raspberry Pi Zero arrived it was clear what we had to do. We ordered a &lt;a href="https://en.wikipedia.org/wiki/Servo_(radio_control)">servo&lt;/a>, a driver board and a wireless adapter and got to work.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/12/servo.jpg" alt="servo" class="img-responsive" />
&lt;p>&lt;span >We have some new neighbours in the basement. &lt;a href="https://web.archive.org/web/20160305183505/http://oxhack.org/">Oxford Hackspace&lt;/a> is a community of people who want to work on projects from electronics to metalwork, hack things to improve them or find out how they work. A diverse bunch who at the last visit were working on squeezing unprecedented color capabilities from the 30 year old &lt;a href="https://en.wikipedia.org/wiki/ZX_Spectrum">ZX Spectrum&lt;/a>, a nixie tube display, a smartphone controlled doorbell and a robotic glockenspiel. They let us use their soldering iron to solder a few header pins.&lt;/span>&lt;/p>
&lt;p>&lt;span >A bit of hacky Python, a pictureframe and lots of duck tape later, we have a live display of how many DOIs are cited and uncited per hour. It updates live every minute, fetches the latest numbers from the &lt;a href="https://live.eventdata.crossref.org/live.html">Wikipedia DOI citation stream&lt;/a> and moves the hand.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/12/tape.jpg" alt="tape" class="img-responsive" />
&lt;p>&lt;span >(For the worried engineers amongst you, rest assured that sufficient duck tape was added after this picture)&lt;/span>&lt;/p>
&lt;p>&lt;span >It’s extraordinary to think that a fully fledged computer with very capable specifications can be manufactured and sold for $5. Within the space of a lunchtime we had it up and running, all connected and fetching data over the internet via wireless. A generation ago you would have had to use &lt;a href="https://en.wikipedia.org/wiki/Computer_programming_in_the_punched_card_era">punched cards&lt;/a>, send them by post and load them in by hand. The live stream would have been at least a month behind.&lt;/span>&lt;/p>
&lt;img class="alignnone size-full wp-image-1069" src="https://www.crossref.org/wp/blog/uploads/2015/12/desk.jpg" alt="desk" class="img-responsive" />
&lt;p>&lt;span >It now sits in our Oxford office reminding us that DOIs Aren’t Just for Traditional Bibliographies. Below &lt;a href="http://twitter.com/gbilder">Geoff Bilder’s&lt;/a> reminder about what happens when you have too many standards (they’re telephone plugs from round the world).&lt;/span>&lt;/p>
&lt;img class="alignnone size-full wp-image-1073" src="https://www.crossref.org/wp/blog/uploads/2015/12/wall.jpg" alt="wall" class="img-responsive" />
&lt;p>&lt;span >You can find &lt;a href="https://github.com/Crossref/wiki.gauge">source code and instructions on the github repository&lt;/a> so you can make your own if you want.&lt;/span>&lt;/p></description></item><item><title>Watch Speaker Videos from the 2015 Annual Meeting</title><link>https://www.crossref.org/blog/watch-speaker-videos-from-the-2015-annual-meeting/</link><pubDate>Tue, 24 Nov 2015 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/watch-speaker-videos-from-the-2015-annual-meeting/</guid><description>&lt;p>&lt;strong>You might have missed it, but you haven’t missed out.&lt;/strong>  If you want to watch – or savor re-watching – the presentations from last week’s 2015 Crossref Annual Meeting, we’ve embedded each video below in chronological order. Sit back, relax, and take it all in (again) just as though you were in an air-conditioned ballroom at the Taj.&lt;/p>
&lt;p>&lt;strong>Note: You can find the playlist containing all the videos &lt;a href="https://www.youtube.com/playlist?list=PLe_-TawAqQj2wPA-gjYglTPk_PEc_0wKz**" target="_blank">on our YouTube channel.&lt;/a>&lt;/strong>&lt;/p>
&lt;p>&lt;strong>Ed Pentz&lt;/strong>, Crossref Executive Director, focuses on the best practice of writing DOIs as actionable hyperlinks in his presentation, &lt;em>Crossref Best Practice:&lt;/em> &lt;a href="http://www.slideshare.net/Crossref/ed-pentz-crossref15-55435481" target="_blank">http://www.slideshare.net/Crossref/ed-pentz-crossref15-55435481&lt;/a> (slides only)&lt;/p>
&lt;p>&lt;strong>Martin Paul Eve&lt;/strong> senior lecturer at Birkbeck University, London, delivers a trenchant criticism of the process small publishers must go through when getting and depositing their first Crossref DOI in his presentation, &lt;em>Crossref Deposit: A Scholar-Publisher Experience&lt;/em>:&lt;/p>
&lt;p>&lt;strong>Anne Coghill&lt;/strong>, Manager of Peer Review Operations for the American Chemical Society, detailed their process for deciding where in the manuscript workflow to insert CrossCheck plagiarism screening in her presentation, &lt;em>American Chemical Society Publications and CrossCheck&lt;/em>: &lt;a href="http://www.slideshare.net/Crossref/ann-coghill-crossref15" target="_blank">http://www.slideshare.net/Crossref/ann-coghill-crossref15&lt;/a> (slides only)&lt;/p>
&lt;p>&lt;strong>Ben Hogan&lt;/strong>, Regional Manager in Wiley’s Peer Review Management team, shares Wiley’s pain points as well as its positive experiences in using CrossCheck to detect plagiarism in his presentation, _CrossCheck Usage and Case Studies: _&lt;/p>
&lt;p>&lt;strong>Jure Triglav&lt;/strong>, Lead Developer for the PubSweet Publishing Framework at the Collaborative Knowledge Foundation,  demonstrates how to mine data from the corpus of open science using Crossref’s metadata via its API and open source tools from the Collaborative Knowledge Foundation in his presentation, &lt;em>Making Science Writing Smarter:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Scott Chamberlain,&lt;/strong> open science researcher, shows the several advantages of using programmatic tools such as R, Python, and Ruby to mine text and data, including Crossref metadata, in his presentation, _Text and Data Mining: _&lt;/p>
&lt;p>&lt;strong>Helen Duriez&lt;/strong>, ePublishing Manager at the Royal Society, describes the Royal Society’s experience with providing Crossmark data as a means of communicating document version information in her presentation, &lt;em>Crossmark – a journey through time (and space?) 2015&lt;/em>&lt;/p>
&lt;p>&lt;strong>John Chodacki&lt;/strong>, chair of Crossref’s DET committee, describes the future state of the DOI Event Tracker as an open hub for collecting and sharing data around web events that involve DOIs in his presentation, &lt;em>DOI Event Tracker 2015&lt;/em>:&lt;/p>
&lt;p>&lt;strong>Marc Abrahams&lt;/strong>, editor and co-founder of the Annals of Improbable Research, makes you LAUGH, then THINK with his keynote speech, &lt;em>Improbable Research, the Ig Nobel Prizes, and You:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Juan Pablo Alperin&lt;/strong> describes the ways that Crossref and the Public Knowledge Project can work together to support common goals, in his presentation, _PKP and Crossref: &lt;em>Two P’s in a Cross&lt;/em>&lt;/p>
&lt;p>&lt;strong>Ed Pentz&lt;/strong>, Crossref Executive Director, summarizes the organisation’s expansion over the past year with his presentation, &lt;em>Crossref Growth and Change&lt;/em>:&lt;/p>
&lt;p>&lt;strong>Ginny Hendricks&lt;/strong>, Director of Member &amp;amp; Community Outreach, details the findings of Crossref’s recent stakeholder research and the organisation’s future plans to enhance member experience with her presentation, &lt;em>Member &amp;amp; Community Outreach&lt;/em>:&lt;/p>
&lt;p>&lt;strong>Jennifer Lin&lt;/strong>, Director of Product Management, visualizes Crossref’s role as a map maker for the scholarly web in her presentation, &lt;em>Crossref: Building an Open Map for the Scholarly Enterprise:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Chuck Koscher&lt;/strong>, Director of Technology, gives us performance stats for the Crossref system, including aggregate uptimes and how long it takes to deposit metadata, in his presentation, &lt;em>Crossref System Performance:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Geoffrey Bilder&lt;/strong>, Director of Strategic Initiatives, sheds light on the status of current and future research projects that are part of Crossref’s new product development process in his presentation, &lt;em>Strategic Initiatives Update:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Scott Chamberlain&lt;/strong>, open science researcher, proposes the use of programmatic tools, such as the R programming language working with the Crossref search API, to undertake scientific research in his presentation, &lt;em>Thinking Programmatically:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Martin Paul Eve&lt;/strong>, senior lecturer at Birkbeck University, London, bears us back to the origins of the scholarly mission, considers the implications of the notion that researchers work within a symbolic economy, and looks at the practical challenges brought about by open access modes of publication for works in the Humanities in his wide-ranging presentation, &lt;em>Open Access &amp;amp; the Humanities: Digital Approaches:&lt;/em>&lt;/p>
&lt;p>&lt;strong>Slideshare, Too!&lt;/strong>&lt;/p>
&lt;p>Finally, each speaker has generously made their slides available here: &lt;a href="http://www.slideshare.net/Crossref/tag/crossref15" target="_blank">http://www.slideshare.net/Crossref/tag/crossref15&lt;/a>&lt;/p>
&lt;p> &lt;/p>
&lt;p>_ _&lt;/p></description></item><item><title>The logo has landed</title><link>https://www.crossref.org/blog/the-logo-has-landed/</link><pubDate>Wed, 11 Nov 2015 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/the-logo-has-landed/</guid><description>&lt;div style="float:left;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/11/Crossref_Logo_Stacked_RGB_SMALL.png" width="100%">
&lt;/figure>
&lt;/div>
&lt;p>The rebranding of Crossref was top priority when I joined in May in a new role called &amp;ldquo;Director of Member &amp;amp; Community Outreach&amp;rdquo;. Since then I’ve been working to understand the array of services, attributes, and audiences we have developed; to answer the questions &amp;ldquo;What do we do, for whom, and why?&amp;rdquo;&lt;/p>
&lt;p>As Crossref prepares to celebrate turning fifteen at our annual meeting next week, I am thrilled to present our new brand identity with key messages and logo. And along with “thrilled” you may also detect “nervous excitement”.&lt;/p>
&lt;p>Over the last few months we have reviewed earlier research and talked with a number of members, affiliates, and academics. Turns out we’re the plain talkers of the industry, the do-ers, the scrappy people who get stuff done, chivvy others along, and in some cases we are—dare I say it—the voice of reason!&lt;/p>
&lt;p>While balancing differing views within the scholarly community, we’re all about making connections – literally and figuratively. We help bring together people and metadata in pursuit of an excellent research communications system for all. And, to mirror one of Ed Pentz’s new catchphrases, we are &amp;ldquo;keeping it real&amp;rdquo;; with down-to-earth language.&lt;/span>&lt;figure id="attachment_978" class="wp-caption alignnone">&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/11/Screen-Shot-2015-11-09-at-16.52.41.png"
alt="Crossref Key Messages" width="785" height="478">&lt;figcaption>
&lt;p>&lt;em>Crossref Key Messages&lt;/em>&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>New logos and names for all our products will come soon (in some cases it’ll be a ‘de-brand’ rather than a re-brand!). We’ll gradually phase in the new identity over the next month or two, starting with our annual meeting, and with a complete website relaunch following in 2016. We will contact all of our members and partners in the coming weeks with information about using the new logo, using a content delivery network (CDN) so that sites can reference the correct file.&lt;/p>
&lt;h2 id="why-rebrand">Why rebrand?&lt;/h2>
&lt;p>We have not rebranded because we plan on doing something different but rather to better express the things we already do. Our ‘problem’ was that often people didn’t know Crossref was behind initiatives like CrossCheck, Crossmark and FundRef. Our products had become unlinked from the organisation. And since we’re all about linking things together, that just made no sense.&lt;/p>
&lt;ul>
&lt;li>We needed an icon to give more flexibility across the web that a word mark cannot do alone. The icon is made up of two interlinked angle brackets familiar to those who work with metadata, and can also act as arrows depicting &lt;span style="color: #3eb1c8;">Metadata In&lt;/span> and &lt;span style="color: #3eb1c8;">Metadata Out&lt;/span>, two themes under which our services can generally be grouped.&lt;/span>&lt;/li>
&lt;li>Sentence case helps to avoid splitting the word; we do not want to tempt the Cross and the Ref to divide again. So that lowercase R you see in the middle of our name is indeed an official change. (Hopefully we can change the habit!)&lt;/li>
&lt;li>The palette gives a nod to the history of Crossref with red &amp;amp; dark grey, but brings in contemporary colors for a fresh palette that is distinctive in our industry (we researched a lot - everyone has circles, and traditional shades abound). Our aesthetic embodies classic Swiss design principles and is minimalist in keeping with our straight-talking personality.&lt;/li>
&lt;/ul>
&lt;p>So, in the words of Board Chair, Ian Bannerman, &lt;strong>&lt;span style="color: #3eb1c8;">it’s time for Crossref to step forward&lt;/span>&lt;/strong>.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/11/Screen-Shot-2015-11-09-at-16.28.57.png"
alt="About Crossref - Boilerplate copy" width="937" height="527">&lt;figcaption>
&lt;p>&lt;em>About Crossref&lt;/em>&lt;/p>
&lt;/figcaption>
&lt;/figure>
&lt;p>I’m looking forward to revealing more of the story at our annual meeting next week!&lt;/p></description></item><item><title>Auto-Update Has Arrived! ORCID Records Move to the Next Level</title><link>https://www.crossref.org/blog/auto-update-has-arrived-orcid-records-move-to-the-next-level/</link><pubDate>Mon, 26 Oct 2015 00:00:00 +0000</pubDate><author>Rachael Lammey</author><discourseUsername>rlammey</discourseUsername><guid>https://www.crossref.org/blog/auto-update-has-arrived-orcid-records-move-to-the-next-level/</guid><description>&lt;p class="p1">
&lt;span class="s1">Crossref goes live in tandem with DataCite to push both publication and dataset information to ORCID profiles automatically. All organisations that deposit ORCID iDs with Crossref and/or DataCite will see this information going further, automatically updating author records. &lt;/span>
&lt;/p>
&lt;p class="p1">
&lt;/p>
&lt;p>We’re cross-posting &lt;a href="https://info.orcid.org/auto-update-has-arrived-orcid-records-move-to-the-next-level/" target="_blank">ORCID’s blog&lt;/a> below with all the details:&lt;/p>
&lt;p>&lt;span >Since ORCID’s inception, our key goal has been to unambiguously identify researchers and provide tools to automate the connection between researchers and their creative works.  We are taking a big step towards achieving this goal today, with the launch of &lt;a href="https://info.orcid.org/new-functionality-friday-auto-update-your-orcid-record/" target="_blank">Auto-Update functionality&lt;/a> in collaboration with &lt;a href="http://www.crossref.org/" target="_blank">Crossref&lt;/a> and [DataCite](&lt;a href="https://www.datacite.org/" target="_blank">https://www.datacite.org/&lt;/a>.  &lt;/p>
&lt;p>&lt;span >There’s already been a lot of excitement about Auto-Update: Crossref’s recent announcement about the imminent launch generated a flurry of discussion and celebration on social media. Our own &lt;a href="https://twitter.com/ORCID_Org/status/647020600192581633" target="_blank">tweet&lt;/a> on the topic was viewed over 10,500 times and retweeted by 60 other accounts. &lt;/span>&lt;/p>
&lt;p>&lt;span >So why all the fuss? We think Auto-Update will transform the way researchers manage their scholarly record.  Until now, researchers have had to manually maintain their record, connecting new activities as they are made public.  In ORCID, that meant using &lt;a href="https://support.orcid.org/hc/en-us/articles/360006973653-Add-works-by-direct-import-from-other-systems" target="_blank">Search &amp;amp; Link tools&lt;/a> developed by our member organisations to claim works manually.  Researchers frequently ask,  “Why, if I include my ORCID iD when I submit a manuscript or dataset, isn’t my ORCID record “automagically” updated when the work is published?”&lt;/span>&lt;/p>
&lt;p>&lt;span >With the launch of Auto-Update, that is just what will happen. &lt;/span>&lt;/p>
&lt;p>&lt;strong>It might seem like magic but there are a few steps to make it work:&lt;/strong>&lt;/p>
&lt;p>&lt;strong>Researchers.&lt;/strong> &lt;span >You need to do two things:  (1) use your ORCID iD when submitting a paper or dataset, and (2) &lt;a href="https://support.orcid.org/hc/en-us/articles/360006973653-Add-works-by-direct-import-from-other-systems" target="_blank">authorize Crossref and DataCite to update your ORCID record&lt;/a>.   In keeping with &lt;a href="https://orcid.org/privacy-policy" target="_blank">our commitment to ensuring that researchers maintain full control of their ORCID record&lt;/a>, you may revoke this permission at any time, and may also choose privacy settings for the information posted on your record.&lt;/span>&lt;/p>
&lt;p>&lt;strong>Publishers and data centers.&lt;/strong> These organisations also have two things to do: (1) collect ORCID identifiers during the submission workflow, using a process that involves authentication (not a type-in field!), and (2) embed the iD in the published paper and include the iD when submitting information to Crossref or DataCite.  &lt;/p>
&lt;p>&lt;strong>Crossref and DataCite.&lt;/strong> Upon receipt of data from a publisher or data center with a valid identifier, Crossref or DataCite can automatically push that information to the researcher’s ORCID record.  &lt;/p>
&lt;p>&lt;span >More information about how to opt out of this service can be found here: &lt;a href="https://support.orcid.org/hc/en-us/articles/360006972953-ORCID-inbox-notifications-and-frequency-settings" target="_blank">the ORCID Inbox&lt;/a>.&lt;/p>
&lt;div style="float:right;margin:10px">
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/ORCID-graphic-223x300.png" width="350">
&lt;/figure>
&lt;/div>
&lt;p>&lt;span >Why is this so revolutionary? &lt;/span>&lt;/p>
&lt;p>A bit of background, first. Crossref and DataCite, both non-profit organisations, are leaders in minting DOIs (Digital Object Identifiers) for research publications and datasets.  A &lt;a href="http://www.crossref.org/01company/16fastfacts.html#sthash.o7NGwOnP.dpuf" target="_blank">DOI&lt;/a> is a unique alphanumeric string assigned to a digital object – in this case, an electronic journal article, book chapter, or a dataset. Each DOI is associated with a set of basic metadata and a URL pointer to the full text, so that it uniquely identifies the content item and provides a persistent link to its location on the internet.&lt;/p>
&lt;p>&lt;span >Crossref, working with over a thousand scholarly publishers, has generated well over 75 million DOIs for journal articles and book chapters.  DataCite works with nearly 600 data centers worldwide and has generated over 6.5 million DOIs to date. Between them, Crossref and DataCite have already received almost a half a million works from publishers and data centers that include an ORCID iD validated by the author/contributor.  With Auto-Update functionality in place, information about these articles can transit (with the author’s permission) to the author’s ORCID record. &lt;/span>&lt;/p>
&lt;p>&lt;span >Auto-Update doesn’t stop at a researcher’s ORCID record.  Systems that have integrated ORCID APIs and have a researcher’s ORCID record connected to that system — their faculty profile system, library repository, webpage, funder reporting system — can receive alerts from ORCID.  Information can move easily and unambiguously across systems. &lt;/span>&lt;/p>
&lt;p>&lt;span >This is the beginning of the end for the endless rekeying of information that plagues researchers — and anyone involved in research reporting.  Surely something to celebrate!&lt;/span>&lt;/p>
&lt;p>&lt;span >Questions you may have:&lt;/span>&lt;/p>
&lt;p>&lt;strong>Q. What do I need to do to sign up for auto-update?&lt;/strong>&lt;/p>
&lt;p>&lt;span >You need to grant permission to Crossref and DataCite to post information to your ORCID record.  You can do this today by using the Search and Link wizard for DataCite available through the ORCID Registry or the DataCite &lt;a href="https://web.archive.org/web/20151123212630/http://search.labs.datacite.org/" target="_blank">Metadata Search page&lt;/a>.  We also have added a new ORCID Inbox, so that you can receive a message from Crossref or DataCite if they receive a datafile with your iD, and you can grant permission directly. See &lt;a href="https://support.orcid.org/hc/en-us/articles/360006972953-ORCID-inbox-notifications-and-frequency-settings" target="_blank">More on the ORCID Inbox&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Q. Will Crossref and DataCite be able to update my ORCID record with already published works for which I did not use my ORCID iD?&lt;/strong>&lt;/p>
&lt;p>&lt;span >No.  The auto-update process only applies to those works that these organisations receive that include your ORCID iD. For previous works that did not include your ORCID iD, you will need to use the DataCite and Crossref Search and Link wizards to connect information with your iD.&lt;/span>&lt;/p>
&lt;p>&lt;strong>Q. What information will be posted to my record?&lt;/strong>&lt;/p>
&lt;p>&lt;span >With your permission, basic information about the article (such as title, list of contributors, journal or publisher) or dataset (such as data center name and date of publication) will be posted, along with a DOI that allows users to navigate to the source paper or dataset landing page.&lt;/span>&lt;/p>
&lt;p>&lt;strong>Q. What if my journal or data center doesn’t collect ORCID iDs?&lt;/strong>&lt;/p>
&lt;p>Ask them to!  This simple step can be accomplished using either the Public or Member ORCID APIs. Information about integrating ORCID iDs in &lt;a href="https://info.orcid.org/documentation/workflows/" target="_blank">publishing&lt;/a> and &lt;a href="http://members.orcid.org/repository-systems" target="_blank">repository&lt;/a> workflows is publicly available.&lt;/p></description></item><item><title>Nov 9th - New Webinar: Crossref for Open Access Publishers</title><link>https://www.crossref.org/blog/nov-9th-new-webinar-crossref-for-open-access-publishers/</link><pubDate>Mon, 19 Oct 2015 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/nov-9th-new-webinar-crossref-for-open-access-publishers/</guid><description>&lt;p>&lt;em>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2015/10/Nov-9-Crossref-Webinar-Open-Access-Publishers.jpg">&lt;img class="alignleft size-medium wp-image-929" src="https://www.crossref.org/wp/blog/uploads/2015/10/Nov-9-Crossref-Webinar-Open-Access-Publishers-300x300.jpg" alt="November 9 Crossref Webinar for Open Access Publishers" width="300" height="300" srcset="https://www.crossref.org/wp/blog/uploads/2015/10/Nov-9-Crossref-Webinar-Open-Access-Publishers-300x300.jpg 300w, https://www.crossref.org/wp/blog/uploads/2015/10/Nov-9-Crossref-Webinar-Open-Access-Publishers-150x150.jpg 150w, https://www.crossref.org/wp/blog/uploads/2015/10/Nov-9-Crossref-Webinar-Open-Access-Publishers-624x624.jpg 624w, https://www.crossref.org/wp/blog/uploads/2015/10/Nov-9-Crossref-Webinar-Open-Access-Publishers.jpg 693w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;a href="https://attendee.gotowebinar.com/register/4198524003003451650" target="_blank">Register for our webinar&lt;/a>&lt;/span>&lt;strong>&lt;span > to learn best practices for&lt;/span>&lt;/strong> &lt;span >depositing metadata and ways to help with the dissemination and discoverability of OA content.&lt;/span>&lt;/em>&lt;/p>
&lt;p>&lt;span style="font-weight: 400; color: #000000;">New Crossref services are being developed that have particular application to OA publishers. Did you know that our upcoming &lt;a style="color: #000000;" href="https://www.crossref.org/blog/det-poised-for-launch/">DOI Event Tracker service&lt;/a> was inspired by a group of OASPA publishers asking if there was a way to centrally support the gathering of data that could be analyzed as altmetrics?&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400; color: #000000;">A large number of Crossref members classify their content as Open Access, and we’ve been thinking about how our infrastructure can support and communicate this.  In many ways, it already does:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span style="font-weight: 400; color: #000000;">Crossref supports the deposit of license and funding information in the DOI metadata.&lt;/span>&lt;/li>
&lt;li>&lt;span style="font-weight: 400; color: #000000;">Crossref’s Crossmark Service is useful to OA publishers who need to have the means to update info about their content, no matter where it sits.&lt;/span>&lt;/li>
&lt;li>&lt;span style="color: #000000;">&lt;span >Crossref’s APIs allow&lt;/span> &lt;span >publishers to make it easier for researchers to mine full-text content.&lt;/span>&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span style="color: #000000;">&lt;b>Register for the Crossref Open Access Webinar&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span style="color: #ff0000;">&lt;b>Date:&lt;/b>&lt;/span> &lt;span style="font-weight: 400; color: #000000;">November 9, 2015&lt;/span>&lt;/p>
&lt;p>&lt;span style="color: #ff0000;">&lt;b>Time:&lt;/b>&lt;/span>&lt;span style="color: #000000;">&lt;span > 8:00 am (San Francisco), &lt;/span>&lt;span >11:00 am (New York), 4:00 pm (London) &lt;/span>&lt;/span>&lt;/p>
&lt;p>&lt;span style="color: #000000;">&lt;b>Register:&lt;/b>&lt;/span> &lt;a href="https://attendee.gotowebinar.com/register/4198524003003451650" target="_blank">&lt;span >https://attendee.gotowebinar.com/register/4198524003003451650&lt;/span>&lt;/a>&lt;/p>
&lt;p>&lt;span style="color: #000000;">Please join us for this new webinar that gives an overview of Crossref and its network of member publishers, along with information on Crossref services that have specific relevance to OA scholarly content.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref will be joined by two guest speakers - &lt;a href="http://www.frontiersin.org" target="_blank">Frontiers&lt;/a> &lt;/span>&lt;span >&lt;span style="color: #000000;">will talk about their OA workflows and how Crossref services integrate with these, and James MacGregor from &lt;a href="http://pkp.sfu.ca" target="_blank">PKP&lt;/a> will show &lt;/span>&lt;/span>&lt;span >participants the Crossref Export/Registration Plugin which journals can enable to assign DOIs with Crossref and to help them participate in other Crossref services.&lt;/span>&lt;/p>
&lt;p>&lt;span style="font-weight: 400; color: #000000;">There will be time for questions and discussion during the webinar. The webinar will be recorded.&lt;/span>&lt;/p></description></item><item><title>2015 Annual Meeting: Speakers Announced</title><link>https://www.crossref.org/blog/2015-annual-meeting-speakers-announced/</link><pubDate>Tue, 13 Oct 2015 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/2015-annual-meeting-speakers-announced/</guid><description>&lt;p>&lt;span style="color: #000000;">&lt;a href="https://www.crossref.org/wp/blog/uploads/2015/10/15th-Anniversary.jpg">&lt;img class=" wp-image-904 alignleft" src="https://www.crossref.org/wp/blog/uploads/2015/10/15th-Anniversary-300x240.jpg" alt="15th Anniversary" width="187" height="149" srcset="https://www.crossref.org/wp/blog/uploads/2015/10/15th-Anniversary-300x240.jpg 300w, https://www.crossref.org/wp/blog/uploads/2015/10/15th-Anniversary-1024x819.jpg 1024w, https://www.crossref.org/wp/blog/uploads/2015/10/15th-Anniversary-624x499.jpg 624w" sizes="(max-width: 187px) 85vw, 187px" />&lt;/a>Curious about who will be speaking at Crossref’s Annual Meeting this year? We have a flock of scholarly communications talent gathering at the Taj Hotel in Boston from November 17-18, 2015.  In addition to our line-up of keynote speeches and technical workshops, we will be celebrating Crossref’s 15&lt;sup>th&lt;/sup> Anniversary with a quindecennial fête on Wednesday evening, November 18&lt;sup>th&lt;/sup>. There’s&lt;/span> &lt;a href="https://www.eventbrite.com/e/crossref15-tech-workshops-member-meeting-tickets-17921679225" target="_blank">still time to register&lt;/a>&lt;span style="color: #000000;">, so please join us!  &lt;/span>&lt;/p>
&lt;p>&lt;span style="color: #000000;">&lt;strong>&lt;u>Distinguished Guest Speaker Bios:&lt;/u>&lt;/strong>&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span style="color: #000000;">&lt;strong>Marc Abrahams&lt;/strong> w&lt;/span>ill be a keynote speaker at Crossref’s 2015 Annual Meeting.  Marc writes about research that makes people LAUGH, then THINK. He is editor and co-founder of the magazine &lt;a href="https://web.archive.org/web/20151115193315/http://www.improbable.com/magazine" target="_blank">&lt;em>Annals of Improbable Research&lt;/em>&lt;/a> (AIR), host and main writer of the &lt;a href="http://www.improbable.com/category/the-weekly-improbable-research-podcast/" target="_blank">Improbable Research weekly podcast&lt;/a> (distributed by CBS), and author of &lt;em>&lt;a href="https://www.oneworld-publications.com/books/marc-abrahams/this-is-improbable-too" target="_blank">This is Improbable Too&lt;/a>&lt;/em> and other books. He edits and writes much of the web site and blog&lt;span style="color: #000000;"> &lt;/span>&lt;a href="http://www.improbable.com/" target="_blank">www.improbable.com&lt;/a>, and for thirteen years wrote a column (called “Improbable Research”) for &lt;em>&lt;a href="http://www.theguardian.com/education/series/improbableresearch" target="_blank">The Guardian&lt;/a>&lt;/em> newspaper.&lt;/li>
&lt;/ul>
&lt;p >
Marc is the father and Master of Ceremonies of the &lt;a href="https://improbable.com/ig/about-the-ig-nobel-prizes/">Ig Nobel Prize Ceremony&lt;/a>, honoring achievements that make people LAUGH, then THINK. The Prizes are handed out by genuine Nobel Laureates at a gala ceremony held each autumn at Harvard University and broadcast on the internet and on National Public Radio.
&lt;/p>
&lt;p >
Marc is author of the books &lt;em>The Ig Nobel Prizes, The Man Who Cloned Himself&lt;/em>, &lt;em>Why Chickens Prefer Beautiful Humans&lt;/em>,&lt;span style="color: #000000;"> &lt;/span>&lt;em>&lt;a href="http://www.amazon.com/gp/product/1851689311/ref=as_li_tf_tl?ie=UTF8&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1851689311&amp;linkCode=as2&amp;tag=annalsofimprobab">This Is Improbable&lt;/a>&lt;/em>, &lt;em>&lt;a href="https://www.oneworld-publications.com/books/marc-abrahams/this-is-improbable-too">This is Improbable Too&lt;/a>&lt;/em>, &lt;em>&lt;a href="http://www.amazon.com/Ig-Nobel-Cookbook-1/dp/1939385164/ref=sr_1_1?s=books&amp;ie=UTF8&amp;qid=1410121636&amp;sr=1-1&amp;keywords=ig+nobel+cookbook">The Ig Nobel Cookbook, volume 1&lt;/a>&lt;/em> (co-authored with Corky White and Gus Rancatore). He edited (and wrote much of) the science humor anthologies &lt;em>The Best of Annals of Improbable Research&lt;/em> and &lt;em>Sex As a Heap of Malfunctioning Rubble (and other improbabilities)&lt;/em>.&lt;span style="color: #000000;">  &lt;/span>
&lt;/p>
&lt;p >
Marc has a degree in applied mathematics from Harvard College, spent several years developing optical character recognition computer systems (including a reading machine for the blind) at Kurzweil Computer Products, and later founded Wisdom Simulators, a creator of educational software.
&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Juan Pablo Alperin&lt;/strong> will be a keynote speaker at Crossref’s 2015 Annual Meeting. Juan is an Assistant Professor and a Research Associate with the Public Knowledge Project (PKP) at Simon Fraser University. Juan started working with the PKP in 2007, and has continued to be involved as systems developer, project manager, and researcher. Juan leads and advises on several of PKP’s R&amp;amp;D and Scholarly Inquiry initiatives as a complement to his research and work on scholarly communications more broadly. He can be reached via @juancommander.  ORCID iD: orcid.org/0000-0002-9344-7439.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Scott Chamberlain&lt;/strong> will be a keynote speaker as well as a presenter at Crossref’s 2015 Annual Meeting. Scott is a scientific programmer who contributes to the field of scholarly literature by developing software for accessing open data on the web.  He co-founded a developer collective called rOpenSci to help connect open source data into the R environment, a free software environment for statistical computing and graphics that runs on all major platforms.  Scott maintains a few clients to work with Crossref APIs, and a text mining client that leverages Crossref’s TDM service.  In addition, Scott maintains clients in R, Ruby, and Python to interact with Legotto, a platform for collecting and delivering altmetric data.  A former ecologist, Scott is currently working full time on rOpenSci at the University of California at Berkeley.  He can be reached via @recology_/@opensci.  ORCID iD: &lt;a href="http://orcid.org/0000-0003-1444-9135" target="_blank">http://orcid.org/0000-0003-1444-9135&lt;/a>.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>John Chodacki&lt;/strong> will be a presenter at Crossref’s 2015 tech workshops. John is Director of University of California Curation Center (UC3) at California Digital Library (CDL).  At UC3, John works with UC campuses and the broader community to ensure that CDL’s digital curation services meet the emerging needs of the scholarly community, including digital preservation, data management, and reuse.  Prior to joining UC3, John was Product Director at PLOS where he led cross-departmental strategic projects such as the Article-Level Metrics (ALM) initiative.  He has served on the Crossref board and is currently the Committee Chair for DOI Event Tracker (DET). He can be reached via @chodacki.  ORCID iD: orcid.org/0000-0002-7378-2408. &lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Anne Coghill &lt;/strong>will be a presenter at Crossref’s 2015 Annual Meeting. Anne is Manager, Peer Review Operations, in the American Chemical Society Publications Division.  She and her colleagues manage the manuscript submission and peer review environment for ACS’ scholarly journals and books publishing program. Anne holds a Bachelor of Science in chemistry from Illinois State University and a Master in Science in Management Studies from Northwestern University.  She is also the co-editor of The ACS Style Guide, third edition.  She can be reached via @AnneCoghill.  ORCID iD: orcid.org/0000-0002-2773-2282. &lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Helen Duriez&lt;/strong> will be a presenter at Crossref’s 2015 tech workshops. Helen is the ePublishing Manager at the Royal Society, responsible for developing the Society’s digital journals strategy as well as the day-to-day management of its journal websites. Since digital innovation transcends the traditional boundaries of scholarly publishing, she spends a lot of time pondering a variation of Freud’s musings, ‘what do researchers want?’ Helen can be contacted via @HDuriez and @RSocPublishing.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Martin Paul Eve&lt;/strong> will be a keynote speaker as well as a presenter at Crossref’s 2015 Annual Meeting. Martin is Senior Lecturer in Literature, technology and Publishing at Birkbeck, University of London and a founder of the Open Library of Humanities. He is the author of three books: Pynchon and Philosophy: Wittgenstein, Foucault and Adorno (Palgrave, 2014); Open Access and the Humanities: Contexts, Controversies and the Future (Cambridge University Press, 2014); and Password [a cultural history (Bloomsbury, forthcoming 2016) and many journal articles. A strong advocate for open access to scholarly material, Martin has given evidence to the UK House of Commons Select Committee Inquiry into Open Access; served on the Jisc OAPEN-UK Advisory Board, the Jisc National Monograph Strategy Group, and the Jisc Scholarly Communications Advisory Board; been a member of the HEFCE Open Access Monographs Expert Reference Group; and is a member of the SCONUL Strategy Group on Academic Content and Communications. Martin is also a qualified computer programmer (Microsoft Professional in C# and the .NET Framework) and is the author of the digital publishing tools meTypeset and CaSSius.  He can be reached via @martin_eve. ORCID iD: orcid.org/0000-0002-5589-8511.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Ben Hogan&lt;/strong> will be a presenter at Crossref’s 2015 tech workshops.  Ben is a Regional Manager in Wiley’s Peer Review Management team, responsible for leading the North America and Open Access teams. He works with internal and external stakeholders to bring in new work and refine the peer review experience to be as efficient as possible for authors and editorial offices. Ben’s worked in publishing since 2007 in a variety of capacities, including books and journals production, training, and peer review. His interests include user experience and publication ethics.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;span style="color: #000000;">&lt;strong>Jure Triglav&lt;/strong> will be a presenter at Crossref’s 2015 tech workshops.  His presentation,&lt;/span>&lt;/span>&lt;span style="color: #000000;">&lt;em>Using Crossref’s API to Make Smarter Science Writing , &lt;/em>will explore how continuously talking to Crossref’s API can help us write better scientific content. Topics will include calling the API from JavaScript, combining Crossref data with modern web-based text editors, and more.&lt;/span>&lt;span style="color: #000000;">Jure is an open science software developer. Jure graduated from medical school 4 years ago, but started working as a developer for Academia.edu shortly after. Now he focuses on technology issues present in open science and runs several projects in this space: @ScienceGist, @ScienceToolbox and @ScholarNinja. Jure also works with open science organisations like PLOS, working on software that will power the future of scientific publishing. He can be reached via @juretriglav.&lt;/span>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p >
&lt;span style="color: #000000;">&lt;strong>&lt;u>Crossref Staff Speaker Bios:&lt;/u>&lt;/strong>&lt;/span>
&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Geoffrey Bilder&lt;/strong> is Director of Strategic Initiatives at Crossref, where he has led the technical development and launch of a number of industry initiatives including CrossCheck, Crossmark, ORCID and FundRef. He co-founded Brown University’s Scholarly Technology Group in 1993, providing the Brown academic community with advanced technology consulting in support of their research, teaching and scholarly communication. He was subsequently head of IT R&amp;amp;D at Monitor Group, a global management consulting firm. From 2002 to 2005, Geoffrey was Chief Technology Officer of scholarly publishing firm Ingenta, and just prior to joining Crossref, he was a Publishing Technology Consultant at Scholarly Information Strategies.  He can be reached via @Geoffrey Bilder.&lt;strong>  &lt;/strong>ORCID iD: orcid.org/0000-0003-1315-5960.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Ginny Hendricks&lt;/strong> is Director of Member &amp;amp; Community Outreach for Crossref, and is responsible for Crossref’s communications, business development, member services, and product support initiatives. Before joining Crossref, she ran Ardent Marketing for nine years, where she consulted with publishers to craft multichannel marketing strategies, develop, brand, and launch online products, and build engaged communities. She previously managed Elsevier’s launch of Scopus, the abstract and citation database of peer-reviewed literature.  While at Elsevier, she established advisory boards and outreach programs with library and scientific communities. In 1998, Ginny started an early e-resources help desk for Blackwell’s information Services and later led training and communication programs for Swets’ digital portfolio in Asia Pacific, Middle East, and Africa. She’s lived and worked in many parts of the world, has managed globally dispersed creative, technical, and commercial teams, and co-hosts the Scholarly Social networking events in London.  She can be reached via @GinnyLDN.  ORCID iD: &lt;a href="http://orcid.org/0000-0002-0353-2702" target="_blank">http://orcid.org/0000-0002-0353-2702&lt;/a>.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Chuck Koscher&lt;/strong> has been the Director of Technology for Crossref since 2002. His primary responsibility has been the development and operation of Crossref’s core services and technical infrastructure. As a senior staff member he also contributes to the definition of Crossref’s mission and the expansion of its services such as the recent launch of Fundref. His role includes management of technical support and back-end business operations. Chuck and his team interface directly with members in dealing with issues effected by new or evolving industry practices such as those involving non-journal content like books, standards and databases. Chuck has been active within the industry having served 9 years on the NISO board of directors, and a participant in initiatives such as the NISO/NFAIS Best Practices in Journal Publishing and NISO’s Supplemental Material Working Group. Prior to Crossref Chuck has over 20 years in software engineering experience primarily in the aerospace industry. ORCID iD: orcid.org/0000-0003-2181-9595.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Rachael Lammey&lt;/strong> is a Product Manager on Crossref’s Crosscheck plagiarism screening and Text and Data Mining API initiatives, among other tools that Crossref make available for publishers build upon.  Rachael has been with Crossref since March 2012. She previously worked in journals publishing for Taylor &amp;amp; Francis for nearly six years, managing a team who worked with online submission and peer review systems. She has a degree in English Literature from St. Andrews University and a MA in Publishing Studies from the University of Stirling. She can be reached via @rachaellammey.  ORCID iD: &lt;a href="http://orcid.org/0000-0001-5800-1434" target="_blank">http://orcid.org/0000-0001-5800-1434&lt;/a>.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Jennifer Lin&lt;/strong> is the Director of Product Management at Crossref.  She has worked in product development, project management, community outreach, and change management within the scholarly communications, education, and public sectors since 2000. She spent four years at the Public Library of Science (PLOS) where she oversaw product strategy and development for their data program, article-level metrics initiative, and open assessment activities. Prior to PLOS, she was a consultant with Accenture, working with Fortune 500 companies as well as governments, to develop and deploy new products and services. Jennifer earned her PhD at Johns Hopkins University. Jennifer can be reached via @jenniferlin15.  ORCID iD: &lt;a style="color: #000000;" href="http://orcid.org/0000-0002-9680-2328">&lt;a href="http://orcid.org/0000-0002-9680-2328" target="_blank">http://orcid.org/0000-0002-9680-2328&lt;/a>&lt;/a>.&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span style="color: #000000;">&lt;strong>Ed Pentz&lt;/strong> is the Executive Director of Crossref, a not-for-profit membership association of publishers set up to provide a cross-publisher reference linking service to organise publisher metadata, run the infrastructure that makes Digital Object Identifier (DOI) links work, and rally multiple community stakeholders to develop tools and services that enable advancements in scholarly publishing.  Ed was appointed as Crossref’s first Executive Director when the organisation was created in 2000.  Crossref is now the largest DOI registrar in the world with over 75,000,000 DOIs.  Ed is also Chair of the Board of ORCID, a registry of unique identifiers for researchers established in 2010. Prior to joining Crossref, Ed held electronic publishing, editorial and sales positions at Harcourt Brace in the US and UK and managed the launch of Academic Press’ first online journal, the Journal of Molecular Biology, in 1995. Ed has a degree in English Literature from Princeton University and lives in Oxford, England. He can be reached via @epentz. ORCID iD &lt;a href="http://orcid.org/0000-0002-5993-8592" target="_blank">http://orcid.org/0000-0002-5993-8592&lt;/a>.&lt;/span>&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title>DOIs in Reddit</title><link>https://www.crossref.org/blog/dois-in-reddit/</link><pubDate>Wed, 30 Sep 2015 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/dois-in-reddit/</guid><description>&lt;p>&lt;span >Skimming the headlines on Hacker News yesterday morning, I noticed something exciting. A dump of &lt;a href="https://news.ycombinator.com/item?id=10289220">all the submissions to Reddit since 2006&lt;/a>. “How many of those are DOIs?”, I thought. Reddit is a very broad community, but has some very interesting parts, including some great science communication. How much are DOIs used in Reddit?&lt;/span>&lt;/p>
&lt;p>&lt;span >(There has since been a &lt;a href="https://news.ycombinator.com/item?id=10309581">discussion about this blog post&lt;/a> on Hacker News)&lt;/span>&lt;/p>
&lt;p>&lt;span >We have a whole &lt;a href="https://www.crossref.org/categories/event-data">strategy for DOI Event Tracking&lt;/a>, but nothing beats a quick hack or is more irresistible than a data dump.&lt;/span>&lt;/p>
&lt;h1 id="span-what-is-a-doispan">&lt;span >What is a DOI?&lt;/span>&lt;/h1>
&lt;p>&lt;span >If you know what a DOI is, skip this! The DOI system (Digital Object Identifier) is a link redirection service. When a publisher puts some content online they could just hand out the URL. But the URL can change, and within a very short space of time, &lt;a href="https://en.wikipedia.org/wiki/Link_rot">link-rot&lt;/a> happens. DOIs are designed to fight link rot. When a publisher mints a DOI to an article they just published, they can change the article’s URL and then update the DOI to point to the new place. DOIs are persistent. They are URLs. They’re also identifiers (kind of like ISBNs), and they’re used in scholarly publishing as to do citations.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref is the DOI registration agency for scholarly publishing. That means mostly things like journal articles. There are other registration agencies, for example, DataCite, who do DOIs for research datasets. But at this point in time, most DOIs are Crossref’s.&lt;/span>&lt;/p>
&lt;h1 id="span-what-does-finding-dois-in-reddit-meanspan">&lt;span >What does finding DOIs in Reddit mean?&lt;/span>&lt;/h1>
&lt;p>&lt;span >It means someone used a DOI to cite something! DOIs can be used for any kind of content, but because of the sheer volume of scientific publishing, lots of DOIs are for science. Having a DOI doesn’t say anything about quality or content. But it does indicate that the person who created the DOI probably intended it to be cited. We care because it means that every time a DOI is used a tiny bit of link-rot doesn’t have the opportunity to take hold. Every time something is discussed on Reddit and the DOI is used, it means that archaeologists using the data dump in 100 years will have identifiers to find the things being discussed, even if the web and URLs have long since crumbled to dust.&lt;/span>&lt;/p>
&lt;p>&lt;span >Or, more likely, in five year’s time when a few URLs will have shuffled around.&lt;/span>&lt;/p>
&lt;h1 id="span-the-resultsspan">&lt;span >The results&lt;/span>&lt;/h1>
&lt;p>&lt;span >DOIs have been used on Reddit since 2008 (the logs start in 2006). After a rocky start, we see hundreds being used per year.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/09/year-count.png" class="img-responsive" alt="DOI submissions per year" >
&lt;p>&lt;span >That’s dozens per month.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/09/year-month-count.png" class="img-responsive" alt="DOI submissions per month" >
&lt;p>&lt;span >The best subreddit to find DOIs is &lt;a href="http://reddit.com/r/Scholar">/r/Scholar&lt;/a>, followed by &lt;a href="http://reddit.com/r/science">/r/science&lt;/a>. And then a lot of others with one or two per year.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/09/year-subreddit-count.png" class="img-responsive" alt="DOI submissions per subreddit per year" >
&lt;h1 id="span-opportunitiesspan">&lt;span >Opportunities&lt;/span>&lt;/h1>
&lt;p>&lt;span >It’s great to see DOIs being used in Reddit. But let’s be honest, it’s not a massive amount.&lt;/span>&lt;/p>
&lt;p>&lt;span >We have a list of domains that our DOIs point to. They mostly belong to publishers, so every time we see a link to a domain on the list, there’s a chance (not a certainty) that the link could have been made using a DOI. We found a large number of these, orders of magnitude more than DOIs. We’re still crunching the data.&lt;/span>&lt;/p>
&lt;h1 id="span-the-dataspan">&lt;span >The data&lt;/span>&lt;/h1>
&lt;p>&lt;span >The data is quite large. It’s a 40 Gigabyte download compressed, which comes to about 170 GB that uncompressed. It contains the submissions to reddit between 2006 and 2015, not the comments, so each data point represents a thread of conversation &lt;em>about&lt;/em> a DOI.&lt;/span>&lt;/p>
&lt;h1 id="span-reproducibility-updatedspan">&lt;span >Reproducibility (updated)&lt;/span>&lt;/h1>
&lt;p>&lt;span >You can find the source code and reproduce the figures at &lt;a href="http://github.com/crossref/reddit-dump-experiment">&lt;a href="http://github.com/crossref/reddit-dump-experiment" target="_blank">http://github.com/crossref/reddit-dump-experiment&lt;/a>&lt;/a>. We use Apache Spark for this kind of thing.&lt;/span>&lt;/p>
&lt;p>&lt;span >The data and methodology are very experimental. You can download all results here:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://s3-eu-west-1.amazonaws.com/crossref-labs-data/2015-10-06/reddit-dump-experiment.zip">&lt;a href="https://s3-eu-west-1.amazonaws.com/crossref-labs-data/2015-10-06/reddit-dump-experiment.zip" target="_blank">https://s3-eu-west-1.amazonaws.com/crossref-labs-data/2015-10-06/reddit-dump-experiment.zip&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >It includes all data for charts in this post, as well as the full list of DOIs, the full list of URLs that could possibly have DOIs, and the full JSON input line for each of these.&lt;/span>&lt;/p>
&lt;h2 id="span-more-infospan">&lt;span >More info&lt;/span>&lt;/h2>
&lt;p>&lt;span >Read about our &lt;a href="https://www.crossref.org/categories/altmetrics">DOI Event Tracking strategy&lt;/a>, including &lt;a href="https://www.crossref.org/blog/real-time-stream-of-dois-being-cited-in-wikipedia/">our live stream of Wikipedia citations&lt;/a>.&lt;/span>&lt;/p></description></item><item><title>Scheduled Booth Presentations at the Frankfurt Book Fair</title><link>https://www.crossref.org/blog/scheduled-booth-presentations-at-the-frankfurt-book-fair/</link><pubDate>Tue, 29 Sep 2015 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/scheduled-booth-presentations-at-the-frankfurt-book-fair/</guid><description>&lt;p>Oktoberfest is in full swing and that makes me think that it’s almost Frankfurt Book Fair time again!&lt;/p>
&lt;p>This year in addition to individual meetings we’ll have scheduled flash presentations on our booth, &lt;strong>M91 in Hall 4.2&lt;/strong>. These short (10-minute) presentations are great for anyone wanting a quick intro to what Crossref is all about. &lt;strong>Running on Wednesday, Thursday, and Friday&lt;/strong> - at the following times each of those days:&lt;/p>
&lt;ul>
&lt;li>10am - &lt;strong>Small Publisher Tools&lt;/strong>&lt;/li>
&lt;li>12pm - &lt;strong>DOIs &amp;amp; Metadata Basics&lt;/strong>&lt;/li>
&lt;li>3pm - &lt;strong>Exploring through APIs&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>If you’d like to meet with us (Ed Pentz, Ginny Hendricks, Rachael Lammey, or Anna Tolwinska) please contact &lt;a href="mailto:rclark@crossref.org">Rosa Morais Clark&lt;/a> to set up a meeting.&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/images/blog/fbm-logo.png"
alt="FBM logo" width="40%">
&lt;/figure>
&lt;br/>
We look forward to seeing you there!</description></item><item><title>Taxonomies Meet-up at #FBM15</title><link>https://www.crossref.org/blog/taxonomies-meet-up-at-fbf15/</link><pubDate>Fri, 25 Sep 2015 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/taxonomies-meet-up-at-fbf15/</guid><description>&lt;p>The &lt;a href="https://www.crossref.org/labs/" target="_blank">Taxonomies Interest Group&lt;/a> would like to invite Crossref members to an informal drop-in at the Frankfurt Book Fair:&lt;/p>
&lt;p>&lt;strong>4-5pm on Wednesday 14th October at the TEMIS booth H76&lt;/strong>&lt;/p>
&lt;p>The group would like to discuss how different publishers use their taxonomies for content enrichment and to explore the role that the Crossref interest group can play in promoting industry collaboration and emerging standards. TEMIS have kindly offered to host the event at their booth and provide refreshments: Please come by from 4pm at Booth H76.&lt;/p>
&lt;p>Graham McCann from IOP Publishing and Christian Kohl from De Gruyter will be coordinating the event. For background information on the work the group is doing, take a look at this &lt;a href="https://www.crossref.org/webinars" target="_blank">webinar recording from March 2015&lt;/a>.&lt;/p></description></item><item><title>Crossref to Auto-Update ORCID Records</title><link>https://www.crossref.org/blog/crossref-to-auto-update-orcid-records/</link><pubDate>Thu, 24 Sep 2015 00:00:00 +0000</pubDate><author>Ginny Hendricks</author><discourseUsername>ginny</discourseUsername><guid>https://www.crossref.org/blog/crossref-to-auto-update-orcid-records/</guid><description>&lt;p>&lt;span >In the next few weeks, authors with an ORCID iD will be able to have Crossref automatically push information about their published work to their ORCID record. It’s something that &lt;a href="https://info.orcid.org/new-functionality-friday-auto-update-your-orcid-record/">ORCID users have been asking for&lt;/a> and we’re pleased to be the first to develop the integration. 230 publishers already include ORCID iDs in their metadata deposits with us, and currently there are 248,000 DOIs that include ORCID iDs.&lt;/span>&lt;/p>
&lt;span >
&lt;/span>
&lt;p>&lt;span >&lt;strong>What this means for researchers&lt;/strong>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;img class="alignright" src="http://info.orcid.org/wp-content/uploads/2021/01/orcid_128x128.png" alt="ORCID iD icon" /> More visibility for your work! Crossref represents over 5000 scholarly publishers and many of them ask authors for their ORCID iD and include it in the publication information they send us. Also it will mean less manual searching and adding; you’ve always been able to &lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org/">search crossref metadata&lt;/a> for your name and/or publications and manually add them to your ORCID record, this auto-update simply means that when your publishers include the info we can update and add work(s) to your ORCID record automatically for you. You can still choose to hide/show whatever works you choose, and, of course, you’ll have the opportunity to authorize or switch off the integration completely (though future publications may trigger a new request). Overall, you’ll benefit from a more complete and up-to-date ORCID record to showcase your work.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>What this means for publishers&lt;/strong>&lt;/span>&lt;/p>
&lt;p>&lt;span >If you’re one of the 230 Crossref publishers who already supply ORCID iDs along with the usual metadata submissions, then you’re all good. If you don’t offer this yet, you might want to think about starting - it’s beneficial for funders, publishers, other researchers, libraries, and universities to be able to integrate with complete researcher records. You can ask for ORCIDs upon manuscript submission or acceptance and tag it in your metadata deposits with Crossref. We’ll ensure the rest.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Various caveats and important details to be aware of&lt;/strong>&lt;/span>&lt;/p>
&lt;ol>
&lt;li>&lt;span >Apparently not all publishers are members of Crossref (we know, crazy), and in addition only a subset of Crossref publishers (230 in total) are asking authors for ORCID iDs and/or including them in their metadata deposits.&lt;/span>&lt;/li>
&lt;li>&lt;span >Some publishers may choose to opt out of passing through the details to ORCID using the Crossref auto-update (perhaps they plan to send this directly at some point) but if you’ve included your ORCID with your submission and it isn’t automatically updated, then check with your publisher.&lt;/span>&lt;/li>
&lt;li>&lt;span >We have a “backlog” of almost 250,000 DOIs that include ORCID iDs so that may mean we do some bulk updates at a later date where authors will receive an email with a long list of works to add. Even if the works have been listed before, it’s worth accepting as it will add the most up-to-date metadata to ensure the most accurate record.&lt;/span>&lt;/li>
&lt;/ol>
&lt;p>&lt;span >Any questions can be directed to &lt;a href="http://mailto:support@crossref.org">our support team&lt;/a>.&lt;/span>&lt;/p></description></item><item><title>Annual Meeting: Join Crossref in Boston this November!</title><link>https://www.crossref.org/blog/annual-meeting-join-crossref-in-boston-this-november/</link><pubDate>Fri, 18 Sep 2015 00:00:00 +0000</pubDate><author>April Ondis</author><guid>https://www.crossref.org/blog/annual-meeting-join-crossref-in-boston-this-november/</guid><description>&lt;p>We’d like to invite the scholarly publishing community to get together in Boston this November with the Crossref Annual Meeting as a rally point. This is the event we hold just once a year to get the whole team under one roof, host a lively discussion with the leading voices in scholarly communications, present technical workshops, and offer you the chance to get hands’ on with our latest metadata services. &lt;strong>Our &lt;a href="https://www.eventbrite.com/e/crossref15-tech-workshops-member-meeting-tickets-17921679225" target="_blank">free two-day event&lt;/a> takes place from November 17-18, 2015 in Boston, MA.&lt;/strong>&lt;/p>
&lt;p>&lt;strong>Agenda:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Tuesday, November 17 - Tech Workshops:&lt;/li>
&lt;/ul>
&lt;p>The morning is an opportunity to get into small groups and talk directly with our development and support teams. We will present best practices around using Crossref’s metadata. After lunch, we will feature member case studies with tips on implementation and lessons learned. If you’re on the technical production side of scholarly publishing, you’ll want to be there — and not just for the beer &amp;amp; pretzels afterwards.&lt;/p>
&lt;ul>
&lt;li>Wednesday, November 18 - Member Meeting:&lt;/li>
&lt;/ul>
&lt;p>A day to hear from thought leaders from the larger scholarly publishing community as well as from inside Crossref. Our keynote speaker will be &lt;strong>Dr. Ben Goldacre&lt;/strong> (Bad Science), and our distinguished speakers include &lt;strong>Dr. Scott Chamberlain&lt;/strong> (rOpenSci), &lt;strong>Dr. Juan Pablo Alperin&lt;/strong> (Public Knowledge Project), and &lt;strong>Dr. Martin Eve&lt;/strong>, (Open Library of Humanities). We will share details about the road map for Crossref Labs’ current and future initiatives, hear about the latest organisational developments from new members of our team, and see the debut of our new brand logo and communications strategy. Following the formal discussion, we’ll continue the conversation over cocktails as part of our celebration of Crossref’s milestone 15th Anniversary!&lt;/p>
&lt;p>&lt;strong>✱ Tickets:&lt;/strong>&lt;/p>
&lt;p>Reserve your free tickets here: &lt;a href="https://www.eventbrite.com/e/crossref15-tech-workshops-member-meeting-tickets-17921679225" target="_blank">https://www.eventbrite.com/e/crossref15-tech-workshops-member-meeting-tickets-17921679225&lt;/a>&lt;/p>
&lt;p>&lt;strong>Who Should Attend?&lt;/strong>&lt;/p>
&lt;p>Scholarly publishers, technology providers, librarians, researchers, academic institutions, funders, journalists, and others who are keen to discuss tools and services to advance scholarly publishing are encouraged to attend.&lt;/p>
&lt;p>&lt;strong>✱ Venue:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://www.tajhotels.com/en-in/taj/taj-boston/" target="_blank">Hotel Taj Boston&lt;/a>&lt;/li>
&lt;li>15 Arlington Street&lt;/li>
&lt;li>Boston, MA 02116 USA&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>About Crossref&lt;/strong> Crossref is a not-for profit membership organisation that wants to improve research communication. We organize publisher metadata, run the infrastructure that makes DOI links work, and we rally multiple community stakeholders in order to develop tools and services to enable advancements in scholarly publishing.&lt;/p></description></item><item><title>DOI Event Tracker (DET): Pilot progresses and is poised for launch</title><link>https://www.crossref.org/blog/det-poised-for-launch/</link><pubDate>Tue, 15 Sep 2015 00:00:00 +0000</pubDate><author>Jennifer Lin</author><guid>https://www.crossref.org/blog/det-poised-for-launch/</guid><description>&lt;p>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2015/09/doi_tracker_graphic.001.jpg">&lt;img class="alignright size-medium wp-image-700" src="https://www.crossref.org/wp/blog/uploads/2015/09/doi_tracker_graphic.001-300x225.jpg" alt="" width="300" height="225" srcset="https://www.crossref.org/wp/blog/uploads/2015/09/doi_tracker_graphic.001-300x225.jpg 300w, https://www.crossref.org/wp/blog/uploads/2015/09/doi_tracker_graphic.001.jpg 1024w, https://www.crossref.org/wp/blog/uploads/2015/09/doi_tracker_graphic.001-624x468.jpg 624w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Publishers, researchers, funders, institutions and technology providers are all interested in better understanding how scholarly research is used. Scholarly content has always been discussed by scholars outside the formal literature and by others beyond the academic community. We need a way to monitor and distribute this valuable information.&lt;/p>
&lt;/span>
&lt;h2 id="span-the-crossref-doi-event-tracker-detspan">&lt;span >The Crossref DOI Event Tracker (DET)&lt;/span>&lt;/h2>
&lt;p>&lt;span >To meet this need, Crossref will be introducing a new service that tracks activity surrounding a research work from potentially any web source where an event is associated with a DOI. Following a successful &lt;a href="https://www.crossref.org/blog/crossrefs-doi-event-tracker-pilot/">pilot run&lt;/a> started Spring 2014, the service has been approved to move toward production and is expected to launch in 2016. Any party wishing to join this phase is welcome to contact Jennifer Lin. The DOI Event Tracker (DET) registers a wide variety of events such as bookmarks, comments, social shares, citations, and links to other research entities, from a growing list of online sources. DET aggregates them, and stores and delivers the data in many ways.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>Open, portable, and licensed for maximum reuse&lt;/strong>&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref has long served as the citation linking and metadata infrastructure provider for scholarly communication; the new DOI Event Tracker is a natural next step, providing a practical solution as a resource for the whole community. The tracker offers the following features:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Data on event activity across a common pool of online channels.&lt;/span>&lt;/li>
&lt;li>&lt;span >Near real-time alerting for select sources with push notifications to the system.&lt;/span>&lt;/li>
&lt;li>&lt;span >Cross-publisher monitoring to enable benchmarking and provide context to the data.&lt;/span>&lt;/li>
&lt;li>&lt;span >Common format for normalizing data results across the diverse set of sources via modern REST API.&lt;/span>&lt;/li>
&lt;li>&lt;span >Secure and regularly refreshed backups of critical data for long term data preservation.&lt;/span>&lt;/li>
&lt;li>&lt;span >Transparency of data collection so as to ensure auditable, replicable, and trustworthy results.&lt;/span>&lt;/li>
&lt;li>&lt;span >Query-initiated retrieval or real-time alerts when an event of interest occurs.&lt;/span>&lt;/li>
&lt;li>&lt;span >CC-0 license for open and flexible propagation of data.&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >A number of platforms are already confirmed and more parties are welcomed at any stage. So far we have confirmation to track DOI events on the following platforms:&lt;/span>&lt;/p>
&lt;p>&lt;span >[table id=1 /]&lt;/span>&lt;/p>
&lt;p>&lt;span >This set of sources reflects our initial focus on parties willing to allow their data to be redistributed in the common pool. Efforts are underway to expand the source list to include &lt;a href="http://twitter.com">Twitter&lt;/a> and &lt;a href="https://www.mysciencework.com/">MyScienceWork&lt;/a>, among others. Publishers can also act as sources by publishing and distributing DOI event data via the DET when an event occurs on its platform (for example, when a PDF is downloaded, or when a comment mentions a DOI in a locally hosted discussion forum, etc.). This would make local DOI activity globally available to funders, researchers, institutions, etc.&lt;/span>&lt;/p>
&lt;p>&lt;span >DET provides benefits of scale and ease of access as a central point for collecting and propagating data to the community. As a single point of access, it overcomes the business and technical hurdles that are a part of managing multiple online sources where scholarly activity occurs, in a rapidly changing landscape of online channels. This resource covers content across publishers and serves as a strong foundation to support the development of tools and services by any party. DET users will always be able to combine the DET data with those individually collected via negotiated or paid access. DET remains a utility separate from any value-added amenities, such as analytics, presentation, and reporting.&lt;/span>&lt;/p>
&lt;h2 id="span-det-service-level-agreementspan">&lt;span >DET Service-Level Agreement&lt;/span>&lt;/h2>
&lt;p>&lt;span >For those who seek the highest level of service and a more flexible range of access options, Crossref will provide a Service-Level Agreement (SLA) service for the DOI Event Tracker. The DET SLA includes the following additional features on top of the common data offering:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >Access to the complete suite of sources, which includes restricted and/or paid sources in addition to common data, providing the fullest picture of DOI usage activity possible.&lt;/span>&lt;/li>
&lt;li>&lt;span >Guaranteed uptime and response time to the latest raw data on the aggregate activity surrounding a DOI.&lt;/span>&lt;/li>
&lt;li>&lt;span >Guaranteed support response time to questions and issues surrounding data and data delivery.&lt;/span>&lt;/li>
&lt;li>&lt;span >Flexible data access options: on-demand real time data access and scheduled bulk downloads for processing batch analytics.&lt;/span>&lt;/li>
&lt;li>&lt;span >Optimum retrieval rates and accelerated delivery speeds with the dedicated SLA API.&lt;/span>&lt;/li>
&lt;li>&lt;span >Access to a webhook API for events of interest as an alternative to polling DET.&lt;/span>&lt;/li>
&lt;li>&lt;span >Standardized and enhanced linkback service for the difficult-to-track, grey literature.&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >The DET SLA service has a simple, value-based pricing model based on subscriber size. &lt;a href="https://docs.google.com/a/crossref.org/forms/d/1_pOnL6500eFebismbHMlAJINxVFqvDFMMkupZualmNo/viewform?usp=send_form">Register your interest&lt;/a> in Crossref’s DOI Event Tracker and the DET SLA service if you would like stay informed of the upcoming launch. Please contact &lt;a href="mailto:jlin@crossref.org">Jennifer Lin&lt;/a> for more information.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;em>Image modified from “&lt;a href="https://thenounproject.com/term/radar/50290/">Radar&lt;/a>” icon by Karsten Barnett from the Noun Project.&lt;/em>&lt;/span>&lt;/p></description></item><item><title>Best Practices for Depositing Funding Data</title><link>https://www.crossref.org/blog/best-practices-for-depositing-funding-data/</link><pubDate>Tue, 01 Sep 2015 00:00:00 +0000</pubDate><author>Kirsty Meddings</author><guid>https://www.crossref.org/blog/best-practices-for-depositing-funding-data/</guid><description>&lt;p>Crossref’s funding data initiative (FundRef) encourages publishers to deposit information about the funding sources of authors’ research as acknowledged in their papers. The funding data comprises funder name and identifier, and grant number or numbers. Funding data can be deposited on its own or with the rest of the metadata for an item of content.&lt;/p>
&lt;p>There are two ways that publishers can collect this funding information for any given piece of content: by asking authors to input the funder name(s) and award number(s) via their submission system, or extracting the funder names and award numbers from the acknowledgements in the paper.&lt;/p>
&lt;p>The funding data is only useful if it is standardised, and so it is absolutely critical that funder names are deposited with their associated funder IDs from the &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">Funder Registry&lt;/a>.&lt;/p>
&lt;p>For publishers considering or about to start collecting and depositing funding data, and for those already doing so, we have drawn up some guidelines that will help you to ensure good quality metadata.&lt;/p>
&lt;p>&lt;strong>If you are collecting funding information from authors via your submission system:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Provide very clear instructions for your authors. Your submission system should prompt the author towards the canonical name from Crossref’s Funder Registry as they type, or guide them through a pick-list. Make it clear to authors that they should choose funder names from this list and not copy and paste from their manuscript.&lt;/li>
&lt;li>Work with your submission system vendor or adapt your in-house system to make it easy for authors to select from the Funder Registry, and more difficult to paste incorrect names or ignore the suggested names. Consider a warning message if an unknown name is entered, and offer a list of close matches.&lt;/li>
&lt;li>Instruct authors to look for the name of the funding body rather than a specific program or project.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>If you or one of your vendors is extracting funding information from papers:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Provide the same clear instructions to your vendor(s). Stress the importance of matching the funder names in the acknowledgements to the names in the Funder Registry.&lt;/li>
&lt;li>Look for common text-extraction errors such as concatenated funder names, punctuation errors, and stop words such as “of/for” that are commonly used interchangeably, or the presence or absence of “the” at the start of a funder name.&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>For both workflows:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>Add QA into your workflow. Many of the names sent to Crossref without IDs are very obviously funders that are in the Registry, and a check by editorial or production staff could correct misspellings or fill in blanks. Check that grant numbers have been separated and are not being deposited as one long string.&lt;/li>
&lt;li>Be aware that funder names deposited without IDs are not valid funding data and will be hidden from Crossref’s search tools and APIs until such time as they are updated with a funder ID.&lt;/li>
&lt;li>The funding data section of a deposit (but not the rest of the deposit) will be rejected by the Crossref deposit system if
&lt;ul>
&lt;li>The funder_name field contains a numerical string longer than 4 digits&lt;/li>
&lt;li>The funder_id field contains a number that is not an ID from the Funder Registry&lt;/li>
&lt;li>The funder_name contains text that exceeds 200 characters&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>Consider only depositing data that has funder IDs and holding the rest to re-poll against the Funder Registry at a later date when more funder names have been added. The Funder Registry is updated at approximately two-monthly intervals. You can sign up to be alerted to updates &lt;a href="http://visitor.r20.constantcontact.com/manage/optin?v=001Vzv-UqW3G57-t0YXoJQ2YghheQfSiYyOAlZ1dw67TbFqm0n5SVhTn3urBLe_9ZlAoeQapfs9PznTGUB97pFIdgExWoqkEBPsXyDwctEP7L9znpQ1xb6mqZeJQPsq76yE9nG7WXAqcooSo0WzTw5BdDRRzENtU2lqcwXjSRYMI_H7ojX16927cuXlBbOXiprZsZVoValPqpg=" target="_blank">here&lt;/a>.&lt;/li>
&lt;li>If there are funders that appear regularly in your particular subject or geographical area that are not in the Registry, send a list to &lt;a href="mailto:funder.registry@crossref.org">funder.registry@crossref.org&lt;/a>.&lt;/li>
&lt;/ul></description></item><item><title>DOIs and matching regular expressions</title><link>https://www.crossref.org/blog/dois-and-matching-regular-expressions/</link><pubDate>Tue, 11 Aug 2015 00:00:00 +0000</pubDate><author>Andrew Gilmartin</author><guid>https://www.crossref.org/blog/dois-and-matching-regular-expressions/</guid><description>&lt;p>&lt;span >We regularly see developers using regular expressions to validate or scrape for DOIs. For modern Crossref DOIs the regular expression is short&lt;/span>&lt;/p>
&lt;p>&lt;code>/^10.\d{4,9}/[-._;()/:A-Z0-9]+$/i&lt;/code>&lt;/p>
&lt;p>&lt;span >For the 74.9M DOIs we have seen this matches 74.4M of them. If you need to use only one pattern then use this one.&lt;/p>
&lt;/span>
&lt;p>&lt;span >The other 500K are mostly from Crossref’s early days when the battle between “human-readable” identifiers and “opaque” identifiers was still being fought, the web was still new, and it was expected that “doi” would become as well a supported URI schema name as “gopher”, “wais”, …. Ok, that didn’t go so well.&lt;/span>&lt;/p>
&lt;p>&lt;span >An early Crossref’s member was John Wiley &amp;amp; Sons. They faced the need to design DOIs without much prior work to lean on. Many of those early DOIs are not expression friendly. Nevertheless, they are still valid and valuable permanent links to the work’s version of record. You can catch 300K more DOIs with&lt;/span>&lt;/p>
&lt;p>&lt;code>/^10.1002/[^\s]+$/i&lt;/code>&lt;/p>
&lt;p>&lt;span >While the DOI caught is likely to be the DOI within the text it may also contain trailing characters that, due to the lack of a space, are caught up with the DOI. Even the recommended expression catches DOIs ending with periods, colons, semicolons, hyphens, and underscores. Most DOIs found in the wild are presented within some visual design program. While pleasant to look at the visual design can misdirect machines. Is the period at the end of the line part of the DOI or part of the design? Is that endash actually a hyphen? These issues lead to a DOI bycatch.&lt;/span>&lt;/p>
&lt;p>&lt;span >Adding the following 3 expressions with the previous 2 leaves only 72K DOIs uncaught. To catch these 72K would require a dozen or more additional patterns. Each additional pattern, unfortunately, weakens the overall precision of the catch. More bycatch.&lt;/span>&lt;/p>
&lt;p>&lt;code>/^10.\d{4}/\d+-\d+X?(\d+)\d+&amp;lt;[\d\w]+:[\d\w]*&amp;gt;\d+.\d+.\w+;\d$/i&lt;/code>&lt;/p>
&lt;p>&lt;code>/^10.1021/\w\w\d++$/i&lt;/code>&lt;/p>
&lt;p>&lt;code>/^10.1207/[\w\d]+\&amp;amp;\d+_\d+$/i&lt;/code>&lt;/p>
&lt;p>&lt;span >Crossref is not the only DOI Registration Agency and while our members account for 65-75% of all registered DOIs this means there are tens of millions of DOIs that we have not seen. Luckily, the newer RAs and their publishers can copy our successes and avoid our mistakes.&lt;/span>&lt;/p></description></item><item><title>Rehashing PIDs without stabbing myself in the eyeball</title><link>https://www.crossref.org/blog/rehashing-pids-without-stabbing-myself-in-the-eyeball/</link><pubDate>Thu, 11 Jun 2015 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/rehashing-pids-without-stabbing-myself-in-the-eyeball/</guid><description>&lt;p>Anybody who knows me or reads this blog is probably aware that I don’t exactly &lt;a href="https://www.crossref.org/blog/dois-unambiguously-and-persistently-identify-published-trustworthy-citable-online-scholarly-literature-right/">hold back&lt;/a> when discussing &lt;a href="https://www.crossref.org/blog/january-2015-doi-outage-followup-report">problems&lt;/a> with the DOI system. But just occasionally I find myself actually defending the thing…&lt;/p>
&lt;p>About once a year somebody suggests that we could replace existing persistent citation identifiers (e.g. DOIs) with some new technology that would fix some of the weaknesses of the current systems. Usually said person is unhappy that current systems like&lt;/p>
&lt;p>&lt;a href="http://www.doi.org" target="_blank">DOI&lt;/a>, &lt;a href="http://www.handle.net" target="_blank">Handle&lt;/a>, &lt;a href="http://en.wikipedia.org/wiki/Archival_Resource_Key" target="_blank">Ark&lt;/a>, &lt;a href="http://perma.cc" target="_blank">perma.cc&lt;/a>, etc. depend largely on a social element to update the pointers between the identifier and the current location of the resource being identified. It just seems manifestly old-fashioned and ridiculous that we should still depend on &lt;a href="http://tvtropes.org/pmwiki/pmwiki.php/Main/CallAHumanAMeatbag" target="_blank">bags of meat&lt;/a> to keep our digital linking infrastructure from falling apart.&lt;/p>
&lt;p>In the past, &lt;a href="https://web.archive.org/web/20170811141334/http://blogs.plos.org/mfenner/2009/02/17/interview_with_geoffrey_bilder/" target="_blank">I’ve threatened to stab myself in the eyeball&lt;/a> if I was forced to have the discussion again. But the dirty little secret is that I play this game myself sometimes. After all, &lt;a href="http://cameronneylon.net/blog/principles-for-open-scholarly-infrastructures/" target="_blank">the best thing a mission-driven membership organisation could do for its members would be to fulfil its mission and put itself out of business&lt;/a>. If we could come up with a technical fix that didn’t require the social component, it would save our members a lot of money and effort.&lt;/p>
&lt;p>When one of these ideas is posed, there is a brief flurry of activity as another generation goes through the same thought processes and (so far) comes to the same conclusions.&lt;/p>
&lt;p>The proposals I’ve seen generally fall into one of the following groups:&lt;/p>
&lt;ul>
&lt;li>Replace persistent identifiers (PIDs) with &lt;a href="http://en.wikipedia.org/wiki/Hash_function" target="_blank">hashes&lt;/a>, &lt;a href="http://en.wikipedia.org/wiki/Checksum" target="_blank">checksums&lt;/a>, etc.&lt;/li>
&lt;li>Just use search (often, but not always coupled with 1 above)&lt;/li>
&lt;li>Automagically create PIDs out of metadata.&lt;/li>
&lt;li>Automagically redirect broken citations to archived versions of the content identified&lt;/li>
&lt;li>And more recently… use the &lt;a href="http://en.wikipedia.org/wiki/Blockchain" target="_blank">blockchain&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>I thought it might help advance the discussion and avoid a bunch of dead ends if I summarised (rehashed?) some of the issues that should be considered when exploring these options.&lt;/p>
&lt;p>Warning: Refers to &lt;a href="http://en.wikipedia.org/wiki/Functional_Requirements_for_Bibliographic_Records" target="_blank">FRBR&lt;/a> terminology. Those of a sensitive disposition might want to turn away now.&lt;/p>
&lt;ul>
&lt;li>DOIs, PMIDs, etc. and other persistent identifiers are primarily used by our community as “citation identifiers”. We generally cite at the “expression” level.&lt;/li>
&lt;li>Consider the difference between how a “citation identifier” a “work identifier” and a “content verification identifier” might function.&lt;/li>
&lt;li>How do you deal with “equivalent manifestations” of the same expression. For example the ePub, PDF and HTML representations of the same article are intellectually equivalent and interchangeable when citing. The same applies to csv &amp;amp; tsv representations of the same dataset. So, for example, how do hashes work here as a citation identifier?&lt;/li>
&lt;li>Content can be changed in ways that typically doesn’t effect the interpretation or crediting of the work. For example, by reformatting, correcting spelling, etc. In these cases the copies should share the same citation identifier, but the hashes will be different.&lt;/li>
&lt;li>Content that is virtually identical (and shares the same hash) might be republished in different venues (e.g. a normal issue and a thematic issue). Context in citation is important. How do you point somebody at the copy in the correct context?&lt;/li>
&lt;li>Some copies of an article or dataset are stewarded by publishers. That is, if there is an update, errata, corrigenda, retraction/withdrawal, they can reflect that on the stewarded copy, not on copies they don’t host or control. Location is, in fact, important here.&lt;/li>
&lt;li>Some copies of content will be nearly identical, but will differ in ways that would affect the interpretation and/or crediting of the work. A corrected number in a table for example. How would you create a citation form a search that would differentiate the correct version from the incorrect version?&lt;/li>
&lt;li>Some content might be restricted, private or under embargo. For example private patient data, sensitive data about archaeological finds or the migratory patterns of endangered animals.&lt;/li>
&lt;li>Some content is behind paywalls (cue jeremiads)&lt;/li>
&lt;li>Content is increasingly composed of static and dynamic elements. How do you identify the parts that can be hashed?&lt;/li>
&lt;li>How do you create an identifier out of metadata and not have them look like &lt;a href="http://en.wikipedia.org/wiki/Serial_Item_and_Contribution_Identifier" target="_blank">this&lt;/a>?&lt;/li>
&lt;/ul>
&lt;p>This list is a starting point that should allow people to avoid a lot of blind alleys.&lt;/p>
&lt;p>In the mean time, good luck to those seeking alternatives to the current crop of persistent citation identifier systems. I’m not convinced it is possible to replace them, but if it is- I hope I beat you to it. 🙂 And I hope I can avoid stabbing myself in the eye.&lt;/p></description></item><item><title>Coming to you Live from Wikipedia</title><link>https://www.crossref.org/blog/coming-to-you-live-from-wikipedia/</link><pubDate>Wed, 20 May 2015 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/coming-to-you-live-from-wikipedia/</guid><description>&lt;p>We’ve been collecting &lt;a href="https://www.crossref.org/blog/real-time-stream-of-dois-being-cited-in-wikipedia/">citation events from Wikipedia&lt;/a> for some time. We’re now pleased to announce a &lt;a href="https://web.archive.org/web/20150422055509/http://events.labs.crossref.org/events/types/WikipediaCitation" target="_blank">&lt;strong>live stream of citations&lt;/strong>&lt;/a>, as they happen, when they happen. Project this on your wall and watch live DOI citations as people edit Wikipedia, round the world.&lt;/p>
&lt;h2 id="view-live-stream-2">&lt;a href="https://web.archive.org/web/20150422055509/http://events.labs.crossref.org/events/types/WikipediaCitation" target="_blank">&lt;strong>View live stream »&lt;/strong>&lt;/a>&lt;/h2>
&lt;p>In the hours since this feature launched, there are events from Indonesian, Portugese, Ukrainian, Serbian and English Wikipedias (in that order).&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/05/Screen-Shot-2015-05-20-at-16.30.00-1024x760.png" class="img-responsive" alt="Live event stream" >
&lt;p>The usual weasel words apply. This is a labs project and so may not be 100% stable. If you experience any problems please email &lt;a href="mailto:labs@crossref.org">labs@crossref.org&lt;/a> .&lt;/p></description></item><item><title>January 2015 DOI Outage: Followup Report</title><link>https://www.crossref.org/blog/january-2015-doi-outage-followup-report/</link><pubDate>Tue, 17 Mar 2015 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/january-2015-doi-outage-followup-report/</guid><description>&lt;h2 id="span-backgroundspan">&lt;span >Background&lt;/span>&lt;/h2>
&lt;p>&lt;span >On January 20th, 2015 the main DOI HTTP proxy at doi.org experienced a partial, rolling global outage. The system was never completely down, but for at least part of the subsequent 48 hours, up to 50% of DOI resolution traffic was effectively broken. This was true for almost all DOI registration agencies, including Crossref, &lt;a href="http://www.datacite.org">DataCite&lt;/a> and &lt;a href="http://www.medra.org">mEDRA&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >At the time we kept people updated on what we knew via Twitter, mailing lists and our &lt;a href="https://www.crossref.org/blog/problems-with-dx.doi.org-on-january-20th-2015-what-we-know./">technical blog at CrossTech&lt;/a>. We also promised that, once we’d done a thorough investigation, we’d report back. Well, we haven’t finished investigating all implications of the outage. There are both substantial technical and governance issues to investigate. But last week we provided a preliminary report to the Crossref board on the basic technical issues, and we thought we’d share that publicly now.&lt;/p>
&lt;/span>
&lt;h2 id="span-the-gory-detailsspan">&lt;span >The Gory Details&lt;/span>&lt;/h2>
&lt;p>&lt;span >First, the outage of January 20th was not caused by a software or hardware failure, but was instead due to an administrative error at the Corporation for National Research Initiatives (CNRI). The domain name “doi.org” is managed by CNRI on behalf of the International DOI Foundation (IDF). The domain name was not on “auto-renew” and CNRI staff simply forgot to manually renew the domain. Once the domain name was renewed, it took about 48 hours for the fix to propagate through the DNS system and for the DOI resolution service to return to normal. Working with CNRI we analysed traffic through the Handle HTTP proxy and here’s the graph:&lt;/span>&lt;figure id="attachment_537" class="wp-caption alignnone">&lt;/p>
&lt;p>&lt;img class="size-full wp-image-537" src="https://www.crossref.org/wp/blog/uploads/2015/03/doi_outage_impact.jpeg" alt="Chart of Handle HTTP proxy traffic during outage" width="800" height="545" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/doi_outage_impact.jpeg 800w, https://www.crossref.org/wp/blog/uploads/2015/03/doi_outage_impact-300x204.jpeg 300w, https://www.crossref.org/wp/blog/uploads/2015/03/doi_outage_impact-624x425.jpeg 624w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 984px) 61vw, (max-width: 1362px) 45vw, 600px" />&lt;figcaption class="wp-caption-text">The above graph shows traffic over a 24 hour period on each day from January 12, 2015 through February 10th, 2015. The heavy blue line for January 20th and the heavy red line for January 21st show how referrals declined as the doi.org domain was first deleted, and then added back to DNS.&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >It could have been much worse. The domain registrar (GoDaddy) at least had a “&lt;a href="https://www.godaddy.com/en-ph/help/what-happens-when-my-domain-expires-609">renewal grace and registry redemption period&lt;/a>” which meant that even though CNRI forgot to pay its bill to renew the domain, the domain was simply “parked” and could easily be renewed by them. This is the standard setting for GoDaddy. Cheaper domain registrars might not include this kind of protection by default. Had there been no grace period, then it would have been possible for somebody other than CNRI to quickly buy the domain name as soon as it expired. There are many automated processes which search for and register recently expired domain names. Had this happened, at the very least it would have been expensive for CNRI to buy the domain back. The interruption to DOI resolutions during this period would have also been almost complete.&lt;/span>&lt;/p>
&lt;p>&lt;span >So we got off relatively easy. The domain name is now on auto-renew. The outage was not as bad as it could have been. It was addressed quickly and we can be reasonably confident that the same administrative error will not happen again. Crossref even managed to garner some public praise for the way in which we handled the outage. It is tempting to heave a sigh of relief and move on.&lt;/span>&lt;/p>
&lt;p>&lt;span >We also know that everybody involved at CNRI, the IDF and Crossref have felt truly dreadful about what happened. So it is also tempting to not re-open old wounds.&lt;/span>&lt;/p>
&lt;p>&lt;span >But it would be a mistake if we did not examine a fundamental strategic issue that this partial outage has raised: How can Crossref claim that its DOIs are ‘persistent’ if Crossref does not control some of the key infrastructure on which it depends? What can we do to address these dependencies?&lt;/span>&lt;/p>
&lt;h2 id="span-what-do-we-mean-by-persistentspanfigure-idattachment_540--classwp-caption-alignnone">&lt;span >What do we mean by “persistent?”&lt;/span>&lt;figure id="attachment_540" class="wp-caption alignnone">&lt;/h2>
&lt;p>&lt;img class="size-full wp-image-540" src="https://www.crossref.org/wp/blog/uploads/2015/03/image02.png" alt="@kaythaney tweets on definition of &amp;quot;persistent&amp;quot;" width="542" height="66" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image02.png 542w, https://www.crossref.org/wp/blog/uploads/2015/03/image02-300x37.png 300w" sizes="(max-width: 542px) 85vw, 542px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >@kaythaney tweets on definition of “persistent”&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >To start with, we should probably explore what we mean by ‘persistent’. We use the word “persistent” or “persistence” about 470 times on the Crossref web site. The word “persistent” appears central to our image of ourselves and of the services that we provide. We describe our core, mandatory service as the “Crossref Persistent Citation Infrastructure.”&lt;/span>&lt;/p>
&lt;p>&lt;span >The primary sense of the word “persistent” in the New Oxford American Dictionary is:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >Continuing firmly or obstinately in a course of action in spite of difficulty or opposition.&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >We play on this sense of the word as a synonym for “stubborn” when we half-jokingly say that, “Crossref DOIs are as persistent as Crossref staff.” Underlying this joke is a truth, which is that persistence is primarily a social issue, not a technical issue.&lt;/span>&lt;/p>
&lt;p>&lt;span >Yet presumably we once chose to use the word “persistent” instead of “perpetual” or “permanent” for other reasons. “Persistence” implies longevity, without committing to “forever.” Scholarly publishers, perhaps more than most industries, understand the long term. After all, the scholarly record dates back to at least 1665 and we know that the scholarly community values even our oldest journal backfiles. By using the word “persistent” as opposed to the more emphatic “permanent” we are essentially acknowledging that we, as an industry, understand the complexity and expense of stewarding the content for even a few hundred years to say nothing of “forever.” Only the chronologically naïve would recklessly coin terms like “permalink” for standard HTTP links which have a documented half-life of well under a decade.&lt;/span>&lt;/p>
&lt;p>&lt;span >So “persistent” implies longevity- without committing to forever- but this still begs questions. What time span is long enough to qualify as “persistent?” What, in particular, do we mean by “persistent” when we talk about Crossref’s “Persistent Citation Infrastructure?” or of Crossref DOIs being “persistent identifiers?”&lt;/span>&lt;/p>
&lt;h2 id="span-what-do-we-mean-by-persistent-identifiersspanfigure-idattachment_541--classwp-caption-alignnone">&lt;span >What do we mean by “persistent identifiers?”&lt;/span>&lt;figure id="attachment_541" class="wp-caption alignnone">&lt;/h2>
&lt;p>&lt;img class="size-full wp-image-541" src="https://www.crossref.org/wp/blog/uploads/2015/03/image03.png" alt="@violetailik tweets on outage and implication for term &amp;quot;persistent identifier&amp;quot;" width="543" height="64" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image03.png 543w, https://www.crossref.org/wp/blog/uploads/2015/03/image03-300x35.png 300w" sizes="(max-width: 543px) 85vw, 543px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >]&lt;a href="https://www.crossref.org/wp/blog/uploads/2015/03/image03.png">5&lt;/a> @violetailik tweets on outage and implication for term “persistent identifier”&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >First, we often make the mistake of talking about “persistent identifiers” as if there is some technical magic that makes them continue working when things like HTTP URIs break. The very term “persistent identifier” encourages this kind of magical thinking and, ideally, we would instead talk about “persist-able” identifiers. That is, those that have some form of indirection built into them. There are many technologies that do this- Handles, DOIs, Purls, ARKs and every URL shortener in existence. Each of them simply introduces a pointer mapping between an identifier and location where a resource or content resides. This mapping can be updated when the content moves, thus preserving the link. Of course, just because an identifier is persist-able doesn’t mean it is persistent. If Purls or DOIs are not updated when content moves, then they are no more persistent than normal URLs.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="http://andrew.treloar.net/">Andrew Treloar&lt;/a> points out that when we talk about “persistent identifiers,” we tend to conflate several things:&lt;/span>&lt;/p>
&lt;ol >
&lt;li>
&lt;span >The persistence of the identifier- that is the token or string itself.&lt;/span>
&lt;/li>
&lt;li>
&lt;span >The persistence of the thing being pointed at by the identifier. For example, the content.&lt;/span>
&lt;/li>
&lt;li>
&lt;span >The persistence of the mapping of the identifier to the thing being identified.&lt;/span>
&lt;/li>
&lt;li>
&lt;span >The persistence of the resolver that allows one to follow the mapping of the identifier to the thing being identified.&lt;/span>
&lt;/li>
&lt;li>
&lt;span >The persistence of a mechanism for updating the mapping of the identifier to the thing being identified.&lt;/span>
&lt;/li>
&lt;/ol>
&lt;p>&lt;span >If any of the above fails, then “persistence” fails. This is probably why we tend to conflate them in the first place.&lt;/span>&lt;/p>
&lt;p>&lt;span >Each of these aspects of “persistence” is worthy of much closer scrutiny, however, in the most recent case of the January outage of “doi.org,” the problem specifically occurred with item “D”- the persistence of the resolver. When CNRI failed to renew the domain name for “doi.org” on time, the DOI resolver was rendered unavailable to a large percentage of people over a period of about 48 hours as global DNS servers first removed, and then added back the “doi.org” domain.&lt;/span>&lt;/p>
&lt;h2 id="span-turtles-all-the-way-downa-hrefhttpenwikipediaorgwikiturtles_all_the_way_downaspan">&lt;span >Turtles all the way down&lt;a href="http://en.wikipedia.org/wiki/Turtles_all_the_way_down">*&lt;/a>&lt;/span>&lt;/h2>
&lt;p>&lt;span >The initial public reaction to the outage was, almost unanimous in one respect- people assumed that the problem originated with Crossref.&lt;/span>&lt;figure id="attachment_544" class="wp-caption alignnone">&lt;/p>
&lt;p>&lt;img class="size-full wp-image-544" src="https://www.crossref.org/wp/blog/uploads/2015/03/image06.png" alt="@iainh_z tweets to Crossref enquiring about failed DOI resoluton" width="543" height="69" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image06.png 543w, https://www.crossref.org/wp/blog/uploads/2015/03/image06-300x38.png 300w" sizes="(max-width: 543px) 85vw, 543px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >@iainh_z tweets to Crossref enquiring about failed DOI resoluton&lt;/span>&lt;/figcaption>&lt;/figure> &lt;figure id="attachment_543" class="wp-caption alignnone">&lt;img class="size-full wp-image-543" src="https://www.crossref.org/wp/blog/uploads/2015/03/image05.png" alt="@LibSkrat tweets at Crossref about DOI outage" width="540" height="65" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image05.png 540w, https://www.crossref.org/wp/blog/uploads/2015/03/image05-300x36.png 300w" sizes="(max-width: 540px) 85vw, 540px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >@LibSkrat tweets at Crossref about DOI outage&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >This is both surprising and unsurprising. It is surprising because we have fairly recent data indicating that lots of people recognise the DOI brand, but not the Crossref brand. Chances are, that this relatively superficial “brand” awareness does not correlate with understanding how the system works or how it relates to persistence. It is likely plenty of people clicked on DOIs at the time of the outage and, when they didn’t work, simply shrugged or cursed under their breath. They were aware of the term ‘DOI’ but not of the promise of “persistence”. Hence, they did not take to twitter to complain about it, and if they did, they probably wouldn’t have known who to complain to or even how to complain to them (neither CNRI or the IDF has a Twitter account).&lt;/span>&lt;/p>
&lt;p>&lt;span >But the focus on Crossref is also unsurprising. Crossref is by far the largest and most visible DOI Registration Agency. Many otherwise knowledgeable people in the industry simply don’t know that there are even other RAs.&lt;/span>&lt;/p>
&lt;p>&lt;span >They also generally didn’t know of the strategic dependencies that exist in the Crossref system. By “strategic dependencies” we are not talking about the vendors, equipment and services that virtually every online enterprise depends on. These kinds of services are largely fungible. Their failures may be inconvenient and even dramatic, but they are rarely existential.&lt;/span>&lt;/p>
&lt;p>&lt;span >Instead we are talking about dependencies that underpin Crossref’s ability to deliver on its mission. Dependencies that not only affect Crossref’s operations, but also its ability to self-govern and meet the needs of its membership. In this case there are three major dependencies: Two of which are specific to Crossref and other DOI registration agencies and one which is shared by virtually all online enterprises today. The organisations are: The International DOI Foundation (IDF), Corporation for National Research Initiatives (CNRI) and the Internet Corporation for Assigned Names and Numbers (ICANN).&lt;/span>&lt;figure id="attachment_545" class="wp-caption alignnone">&lt;/p>
&lt;p>&lt;img class="size-full wp-image-545" src="https://www.crossref.org/wp/blog/uploads/2015/03/image07.png" alt="Dependency of RAs on IDF, CNRI and ICANN. Turtles all the way down." width="800" height="571" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image07.png 800w, https://www.crossref.org/wp/blog/uploads/2015/03/image07-300x214.png 300w, https://www.crossref.org/wp/blog/uploads/2015/03/image07-624x445.png 624w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 984px) 61vw, (max-width: 1362px) 45vw, 600px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >Dependency of RAs on IDF, CNRI and ICANN. Turtles all the way down.&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Each of these agencies has technology, governance and policy impacts on Crossref and the other DOI registration agencies, but here we will focus on the technological dependencies.&lt;/span>&lt;/p>
&lt;p>&lt;span >At the top of the diagram are a subset of the various DOI Registration Agencies. Each RA uses the DOI for a particular constituency (e.g. scholarly publishers) and application (e.g. citation). Sometimes these constituencies/applications overlap (as with mEDRA, Crossref and DataCite), but sometimes they are orthogonal to the other RAs, as is the case with EIDR. All, however, are members of the IDF.&lt;/span>&lt;/p>
&lt;p>&lt;span >The IDF sets technical policies and development agendas for the DOI infrastructure. This includes recommendations about how RAs should display and link DOIs. Of course all of these decisions have an impact on the RAs. However, the IDF provides little technical infrastructure of its own as it has no full-time staff. Instead it outsources the operation of the system to CNRI, this includes the management of the doi.org domain which the IDF owns.&lt;/span>&lt;/p>
&lt;p>&lt;span >The actual DOI infrastructure is hosted on a platform called the &lt;a href="http://en.wikipedia.org/wiki/Handle_System">Handle System&lt;/a> which was developed by and is currently run by CNRI. The Handle System is part of a quite complex and sophisticated platform for managing digital objects that was originally developed for &lt;a href="http://en.wikipedia.org/wiki/DARPA">DARPA&lt;/a>. A subset of the Handle system is designated for use by DOIs and is identified by the “10” prefix (e.g. 10.5555/12345678). The Handle system itself is not based on HTTP (the web protocol). Indeed, one of the much touted features of the Handle System is that it isn’t based on any specific resolution technology. This was seen as a great virtue in the late 1990s when the DOI system was developed and the internet had just witnessed an explosion of seemingly transient, competing protocols (e.g. &lt;a href="http://en.wikipedia.org/wiki/Gopher_%28protocol%29">Gopher&lt;/a>, &lt;a href="http://en.wikipedia.org/wiki/Wide_area_information_server">WAIS&lt;/a>, &lt;a href="http://en.wikipedia.org/wiki/Archie_search_engine">Archie&lt;/a>, &lt;a href="http://www.mprove.de/diplom/text/2.1.15_hyperg.html">HyperWave/Hyper-G&lt;/a>, HTTP, etc.). But what looked like a wild-west of protocols quickly settled into an HTTP hegemony. In practice, virtually all DOI interactions with the Handle system are via HTTP and so, in order to interact with the web, the Handle System employs a “Handle proxy” which translates back and forth between HTTP, and the native Handle system. This all may sound complicated, and the backend of the Handle system is really very sophisticated, but it turns out that the DOI really uses only a fraction of the Handle system’s features. In fact, the vast majority of DOI interactions merely use the Handle system as a giant lookup table which allows one to translate an identifier into a web location. For example, it will take a DOI Handle like this:&lt;/span>&lt;/p>
&lt;pre>&lt;span >10.5555/12345678&lt;/span>&lt;/pre>
&lt;p>&lt;span >and redirect it to (as of this writing) the following URL:&lt;/span>&lt;/p>
&lt;pre>&lt;span >http://psychoceramics.labs.crossref.org/10.5555-12345678.html&lt;/span>&lt;/pre>
&lt;p>&lt;span >This whole transformation is normally never seen by a user. It is handled transparently by the web browser, which does the lookup and redirection in the background using HTTP and talking to the Handle Proxy. In the late 1990s, even doing this simple translation quickly, at scale with a robust distributed infrastructure, was not easy. These days however we see dozens if not hundreds of “URL Shorteners” doing exactly the same thing at far greater scale than the Handle System.&lt;/span>&lt;/p>
&lt;p>&lt;span >It may seem a shame that more of the Handle Systems features are not used, but the truth is the much touted platform independence of the Handle System rapidly became more of a liability and impediment to persistence than an aid. To be blunt, if in X years a new technology comes out that supersedes the web, what do we think the societal priority is going to be?&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >To provide a robust and transparent transition from the &lt;a href="http://en.wiktionary.org/wiki/squillion">squillions&lt;/a> of existing HTTP URI identifiers that the entire world depends on?&lt;/span>&lt;/li>
&lt;li>&lt;span >To provide a robust and transparent transition from the tiny subset of Handle-based identifiers that are used by about a hundred million specialist resources?&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >Quite simply, the more the Handle/DOI systems diverge from common web protocols and practice, then the more we will jeopardise the longevity of our so-called persistent identifiers.&lt;/span>&lt;/p>
&lt;p>&lt;span >So, in the end, DOI registration agencies really only use the Handle system for translating web addresses. All of the other services and features one might associate with DOIs (reference resolution, metadata lookup, content negotiation, OAI-PMH, REST APIs, Crossmark, CrossCheck, TDM Services, FundRef etc) are all provided at the RA level.&lt;/span>&lt;/p>
&lt;p>&lt;span >But this address resolution is still critical. And it is exactly what failed for many users on January 20th 2015. And to be clear, it wasn’t the robust and scaleable Handle System that failed. It wasn’t the Handle Proxy that failed. And it certainly wasn’t any RA-controlled technology that failed. These systems were all up and running. What happened was that the standard handle proxy that the IDF recommends RAs use, “dx.doi.org”, was effectively rendered invisible to wide portions the internet because the “doi.org” domain was not renewed. This underscores two important points.&lt;/span>&lt;/p>
&lt;p>&lt;span >The first is that it doesn’t much matter what precisely caused the outage. In this case it was an administrative error. But the effect would have been similar if the Handle proxies had failed of if the Handle system itself had somehow collapsed. In the end, Crossref and all DOI registration agencies are existentially dependent on the Handle system running and being accessible.&lt;/span>&lt;/p>
&lt;p>&lt;span >The second is that the entire chain of dependencies from the RAs down through CNRI are also dependent on the DNS system which, in turn, is governed by ICANN. We should really not be making too much of the purported technology independence of the DOI and Handle systems. To be fair, this limitation is inherent to all persistent identifier schemes that aim to work with the web. It really is “&lt;a href="http://en.wikipedia.org/wiki/Turtles_all_the_way_down">turtles all the way down.&lt;/a>”&lt;/span>&lt;/p>
&lt;h2 id="span-what-didnt-fail-on-january-19th20th-and-whyspan">&lt;span >What didn’t fail on January 19th/20th and why?&lt;/span>&lt;/h2>
&lt;p>&lt;span >You may have noticed a lot of hedging in our description of the outage of January 19th/20th. For one thing, we use the term “rolling outage.” Access to the Handle Proxy via “dx.doi.org” was never completely unavailable during the period. As we’ve explained, this is because the error was discovered very quickly and the domain was renewed hours after it expired. The nature of DNS propagation meant that even as some DNS servers were deleting the “doi.org” entry, others were adding it back to their tables. In some ways this was really confusing because it meant it was difficult to predict where the system was working and where it wasn’t. Ultimately it all stabilised after the standard 48-hour DNS propagation cycle.&lt;/span>&lt;/p>
&lt;p>&lt;span >But there were also some Handle-based services that simply were not affected at all by the outage. During the outage, a few people asked us if there was an alternative way to resolve DOIs. The answer was “yes,” there were several. It turns out that “doi.org” is not the only DNS name that points to the Handle Proxy. People could easily substitute “dx.doi.org” with “dx.crossref.org” or “dx.medra.org” or “hdl.handle.net” and “resolve” any DOI. Many of Crossref’s internal services use these internal names and so the services continued to work. This is partly why we only discovered the “doi.org” was down from people reporting it on Twitter.&lt;/span>&lt;/p>
&lt;p>&lt;span >And, of course, there were other services that were not affected by the outage. Crossmark, the REST API, and Crossref Metadata Search all continued to work during the outage.&lt;/span>&lt;/p>
&lt;h2 id="span-protecting-ourselvesspan">&lt;span >Protecting ourselves&lt;/span>&lt;/h2>
&lt;p>&lt;span >So what can we do to reduce our dependencies and/or the risks intrinsic to those dependencies?&lt;/span>&lt;/p>
&lt;p>&lt;span >Obviously, the simplest way to have avoided the outage would have been to ensure that the “doi.org” domain was set to automatically renew. That’s been done. Is there anything else we should do? A few ideas have been floated that might allow us to provide even more resilience. They range greatly in complexity and involvement.&lt;/span>&lt;/p>
&lt;ol>
&lt;li>&lt;span >Provide well-publicised public status dashboards that show what systems are up and which clearly map dependencies so that people could, for instance, see that the doi.org server was not visible to systems that depended on it. Of course, if such a dashboard had been hosted at doi.org, nobody would have been able to connect to it. Stoopid turtles.&lt;/span>&lt;/li>
&lt;li>&lt;span >Encourage DOI RAs to have the members point to Handle proxies using domain names under the RA’s control. Simply put, if Crossref members had been using “dx.crossref.org” instead of “dx.doi.org”, then Crossref DOIs would have continued to work throughout the outage of “doi.org”. The same with mEDRA, and the other RAs. This way each RA would have control over another critical piece of their infrastructure. It would also mean that if any single RA made a similar domain name renewal mistake, the impact would be isolated to a particular constituency. Finally, using RA-specific domains for resolving DOIs might also make it clear that different DOIs are managed by different RAs and might have different services associated with them. Perhaps Crossref would spend less time supporting non-Crossref DOIs?&lt;/span>&lt;/li>
&lt;li>&lt;span >Provide a parallel, backup resolution technology that could be pointed to in the event of a catastrophic Handle System failure. For example we could run a parallel system based on &lt;a href="http://en.wikipedia.org/wiki/Persistent_uniform_resource_locator">PURLs&lt;/a>, &lt;a href="http://en.wikipedia.org/wiki/Archival_Resource_Key">ARKs&lt;/a> or another persist-able identifier infrastructure.&lt;/span>&lt;/li>
&lt;li>&lt;span >Explore working with ICANN to get the handle resolvers moved under the special “.arpa” top level domain (TLD). This TLD (&lt;a href="https://tools.ietf.org/html/rfc3172">RFC 3172&lt;/a>) is reserved for services that are considered to be “critical to the operation of the internet.” This is an option that was first discussed at &lt;a href="http://www.circleid.com/posts/20111215_providing_persistent_domain_names_under_arpa/">a meeting of persistent identifier providers in 2011&lt;/a>.&lt;/span>&lt;/li>
&lt;/ol>
&lt;p>&lt;span >These are all tactical approaches to addressing the specific technical problem of the Handle System becoming unavailable, but they do not address deeper issues relating to our strategic dependence on several third parties. Even though the IDF and CNRI provide us with pretty simple and limited functionality, that functionality is critical to our operations and our claim to be providing persistent identifiers. Yet these technologies are not in our direct control. We had to scramble to get hold of people to fix the problem. For a while, we were not able to tell our users or members what was happening because we did not know ourselves.&lt;/span>&lt;/p>
&lt;p>&lt;span >The irony is that &lt;em>Crossref&lt;/em> was held to account, and we were in the firing line the entire time. Again, this was almost unavoidable. In addition to being the largest DOI RA, we are also the only RA that has any significant social media presence and support resources. Still, it meant that we were the public face of the outage while the IDF and CNRI remained in the background.&lt;/span>&lt;/p>
&lt;p>&lt;span >And this is partly why our board has encouraged us to investigate another option:&lt;/span>&lt;/p>
&lt;ol start="5">
&lt;li>
&lt;span >Explore what it would take to remove Crossref dependencies on the IDF and CNRI.&lt;/span>
&lt;/li>
&lt;/ol>
&lt;p>&lt;span >Crossref is just part of a chain of dependencies the goes from our members down through the IDF, CNRI and, ultimately, ICANN. Our claim to providing a persistent identifier structure depends entirely on the IDF and CNRI. Here we have explored some of the technical dependencies. But there are also complex governance and policy implications of these dependencies. Each organisation has membership rules, guidelines and governance structures which can impact Crossref members. Indeed, the IDF and CNRI are themselves members of groups (ISO and DONA, respectively) which might ultimately have policy or governance impact for DOI registration agencies. We will need to understand the strategic implications of these non technical dependencies as well.&lt;/span>&lt;/p>
&lt;p>&lt;span >Note that the Crossref board has merely asked us to “explore” what it would take to remove dependencies. They have not asked us to actually take any action. Crossref has been massively supportive of the IDF and CNRI, and they have been massively supportive of us. Still, over the years we have all grown and our respective circumstances have changed. It is important that occasionally we question what we might have once considered to be axioms. As we discussed above, we use the term “persistent” which, in turn, is a synonym for “stubborn.” At the very least we need to document the inter-dependencies that we have so that we can understand just how stubborn we can reasonably expect our identifiers to be.&lt;/span>&lt;/p>
&lt;p>&lt;span >The outage of January 20th was a humbling experience. But in a way we were lucky: Forgetting to renew the domain name was a silly and prosaic way to partially bring down a persistent identifier infrastructure, but it was also relatively easy to fix. Inevitably, there was a little snark and some pointed barbs directed at us during the outage, but we were truly overwhelmed by the support and constructive criticism we received as well. We have also been left with a clear message that, in order for this good-will to continue, we need to follow-up with a public, detailed and candid analysis of our infrastructure and its dependencies. Consider this to be the first section of a multi-part report.&lt;/span>&lt;figure id="attachment_539" class="wp-caption alignnone">&lt;/p>
&lt;p>&lt;img class="size-full wp-image-539" src="https://www.crossref.org/wp/blog/uploads/2015/03/image01.png" alt="@kevingashley tweets asking for followup analysis" width="544" height="63" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image01.png 544w, https://www.crossref.org/wp/blog/uploads/2015/03/image01-300x35.png 300w" sizes="(max-width: 544px) 85vw, 544px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >@kevingashley tweets asking for followup analysis&lt;/span>&lt;/figcaption>&lt;/figure> &lt;figure id="attachment_542" class="wp-caption alignnone">&lt;img class="size-full wp-image-542" src="https://www.crossref.org/wp/blog/uploads/2015/03/image04.png" alt="@WilliamKilbride tweets asking for followup and lessons learned" width="539" height="63" srcset="https://www.crossref.org/wp/blog/uploads/2015/03/image04.png 539w, https://www.crossref.org/wp/blog/uploads/2015/03/image04-300x35.png 300w" sizes="(max-width: 539px) 85vw, 539px" />&lt;figcaption class="wp-caption-text">&lt;/span> &lt;span >@WilliamKilbride tweets asking for followup and lessons learned&lt;/span>&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;h3 id="span-image-creditsspan">&lt;span >Image Credits&lt;/span>&lt;/h3>
&lt;p>&lt;span >Turtle image CC-BY “Unrecognised MJ” from the Noun Project&lt;/span>&lt;/p></description></item><item><title>Real-time Stream of DOIs being cited in Wikipedia</title><link>https://www.crossref.org/blog/real-time-stream-of-dois-being-cited-in-wikipedia/</link><pubDate>Tue, 03 Mar 2015 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/real-time-stream-of-dois-being-cited-in-wikipedia/</guid><description>&lt;h2 id="span-tldrspan">&lt;span >TL;DR&lt;/span>&lt;/h2>
&lt;p>&lt;span >Watch a real-time stream of DOIs being cited (and “un-cited!” ) in Wikipedia articles across the world: &lt;a href="https://live.eventdata.crossref.org/live.html" target="_blank">https://live.eventdata.crossref.org/live.html&lt;/a>&lt;/p>
&lt;h2 id="span-backgroundspan">&lt;span >Background&lt;/span>&lt;/h2>
&lt;p>&lt;span >For years we’ve known that the Wikipedia was a major referrer of Crossref DOIs and about a year ago &lt;a href="https://www.crossref.org/blog/many-metrics-such-data-wow/">we confirmed&lt;/a> that, in fact, the Wikipedia is the 8th largest refer of Crossref DOIs. We know &lt;a href="http://chronograph.labs.crossref.org/domain.html?domain=wikipedia.org">that people follow the DOIs&lt;/a>, too. This despite a fraction of Wikipedia citations to the scholarly literature even using DOIs. So back in August we decided to create a &lt;a href="https://www.crossref.org/blog/citation-needed/">Wikimedia Ambassador programme&lt;/a>. The goal of the programme was to promote the use of persistent identifiers in citation and attribution in Wikipedia articles.&lt;/span> We would do this through outreach and through the development of better citation-related tools.&lt;/p>
&lt;p>Remember when we &lt;a href="https://www.crossref.org/blog/many-metrics-such-data-wow">originally wrote about our experiments with the PLOS ALM code&lt;/a> and how that has transitioned into the &lt;a href="https://www.crossref.org/blog/crossrefs-doi-event-tracker-pilot/">DOI Event Tracking Pilot&lt;/a>? In those posts we mentioned that one of the hurdles in gathering information about DOI events is the actual process of polling third party APIs for activity related to millions of DOIs. Most parties simply wouldn’t be willing handle the load of a 100K API calls an hour. Besides, polling is a tremendously inefficient process, only a fraction of DOIs are ever going to generate events, but we’d have to poll for each of them, repeatedly, forever, to get an accurate picture of DOI activity. We needed a better way. We needed to see if we could reverse this process and convince some parties to instead “push” us information whenever they saw DOI related events (e.g. citations, downloads, shares, etc). If only we could convince somebody to try this…&lt;/p>
&lt;h2 id="wikipedia-doi-events">Wikipedia DOI Events&lt;/h2>
&lt;p>In December 2014 we took the opportunity of the &lt;a href="http://figshare.com/articles/ALM_Workshop_2014_Report/1287503" target="_blank">2014 PLOS/Crossref ALM Workshop&lt;/a> in San Francisco too meet with &lt;a href="https://en.wikipedia.org/wiki/User:Notconfusing" target="_blank">Max Klein&lt;/a> and &lt;a href="https://twitter.com/dfko_0" target="_blank">Anthony Di Franco&lt;/a> where we kicked off a very exciting project.&lt;/p>
&lt;p>There’s always someone editing a &lt;a href="https://en.wikipedia.org/wiki/List_of_Wikipedias" target="_blank">Wikipedia&lt;/a> somewhere in the world. In fact, you can see a dizzying &lt;a href="http://wikistream.wmflabs.org/" target="_blank">live stream of edits&lt;/a>. We thought that given that there are so many DOIs in Wikipedia, that live stream may contain some diamonds (DOIs are made of diamond, that’s how they can be persistent). Max and Anthony went away and came back with a demo that contains a surprising amount of DOI activity.&lt;/p>
&lt;p>That demo is evolving into a concrete service, called &lt;a href="https://github.com/notconfusing/cocytus" target="_blank">Cocytus&lt;/a>. It is running at Wikimedia Labs monitoring live edits as you read this.&lt;/p>
&lt;p>For now we’re feeding that data into the &lt;a href="https://web.archive.org/web/20150308012303/http://events.labs.crossref.org/" target="_blank">DOI Events Collection app&lt;/a> (which is an off-shoot of the &lt;a href="https://www.crossref.org/blog/introducing-chronograph/">Chronograph project&lt;/a>). We are in the process of modifying the &lt;a href="https://github.com/articlemetrics/lagotto" target="_blank">Lagotto code&lt;/a> so that we can instead push those events into the &lt;a href="http://det.labs.crossref.org/" target="_blank">DOI Event Tracking Instance&lt;/a>.&lt;/p>
&lt;p>The first DOI event we noticed was delightfully prosaic: The DOI for &lt;a href="http://dx.doi.org/10.1145/1978942.1979213" target="_blank">“The polymath project”&lt;/a> is cited by the Wikipedia page for &lt;a href="https://en.wikipedia.org/wiki/Polymath_Project" target="_blank">“Polymath Project”&lt;/a>. Prosaic perhaps, but the authors of that paper probably want to know. Maybe they can help edit the page.&lt;/p>
&lt;p>Or how about this. Someone wrote a a paper about &lt;a href="http://dx.doi.org/10.1080/0144929x.2014.929744" target="_blank">why people edit Wikipedia&lt;/a> and then it was cited by Wikipedia. And then &lt;a href="https://web.archive.org/web/20150321130048/http://events.labs.crossref.org/dois/10.1080/0144929x.2014.929744" target="_blank">the citation was removed&lt;/a>. The plot thickens…&lt;/p>
&lt;p>We’re interested in seeing how DOIs are used outside of the formal scholarly literature. What does that mean? We don’t fully know, that’s the point. We have retractions in scholarly literature (and our &lt;a href="https://www.crossref.org/services/crossmark" target="_blank">Crossmark metadata and service&lt;/a> allow publishers to record that), but it’s a bit different on Wikipedia. Edit wars are fought over … well you can &lt;a href="https://en.wikipedia.org/wiki/Wikipedia:Lamest_edit_wars" target="_blank">see for yourself&lt;/a>.&lt;/p>
&lt;p>Citations can slip in and out of articles. We saw the DOI &lt;a href="http://dx.doi.org/10.1001/archpediatrics.2011.832" target="_blank">10.1001/archpediatrics.2011.832&lt;/a> deleted from &lt;a href="https://en.wikipedia.org/wiki/Bipolar_disorder_in_children" target="_blank">“Bipolar disorder in children”&lt;/a>. If we’d not been monitoring the live feed (we had considered analysing snapshots of the Wikipedia in bulk) we might never have seen that. This is part of what non-traditional citations means, and it wasn’t obvious until we’d seen it.&lt;/p>
&lt;p>You can see this activity on the &lt;a href="https://web.archive.org/web/20150422055509/http://events.labs.crossref.org/events/types/WikipediaCitation" target="_blank">Chronograph’s stream&lt;/a>. Or &lt;a href="https://web.archive.org/web/20150308012303/http://events.labs.crossref.org/" target="_blank">check your favourite DOI&lt;/a>. Please be aware that we’re only collecting newly added citations as of today. We do intend to go back and back-fill, but that may take some time- as it * cough * requires polling again.&lt;/p>
&lt;h2 id="some-technical-things">Some Technical Things&lt;/h2>
&lt;p>A few interesting things that happened as a result of all this:&lt;/p>
&lt;h3 id="span-secure-urlsspan">&lt;span >Secure URLs&lt;/span>&lt;/h3>
&lt;p>&lt;span >SSL and HTTPS were invented so you could do things like banking on the web without fear of interception or tampering. As the web becomes a more important part of life, many sites are upgrading from HTTP to HTTPS, the secure version. This is not only because your confidential details may be tampered with, but because certain governments might not like you reading certain materials.&lt;/span>&lt;/p>
&lt;p>&lt;span >Because of this, some time ago, Wikipedia decided to embark on an upgrade to &lt;a href="https://blog.wikimedia.org/2013/08/01/future-https-wikimedia-projects/">HTTPS&lt;/a> last year, and they are a certain way along the path. The &lt;a href="http://www.doi.org/">IDF&lt;/a>, who are responsible for running the DOI system, upgraded to HTTPS this Summer, although most DOIs are referred to by HTTP still.&lt;/span>&lt;/p>
&lt;p>&lt;span >We met with &lt;a href="http://nitens.org/taraborelli/home">Dario Taraborelli&lt;/a> at the ALM workshop and discussed the DOI referral data that is fed into the &lt;a href="http://chronograph.labs.crossref.org">Chronograph&lt;/a>. We put two and two together and realised that Wikipedia was linking to DOIs (which are mostly HTTP) from pages which might be served over HTTPS. New policies in HTML5 specify that referrer URL headers shouldn’t be sent from HTTPS to HTTP (in case there was something secret in them). The upshot of this is, if someone’s browsing Wikipedia via HTTPS and click on a normal DOI, we won’t know that the user came from Wikipedia. Not a huge problem today, but as Wikipedia switches over to entirely secure, we’re going to miss out on very useful information.&lt;/span>&lt;/p>
&lt;p>&lt;span >Fortunately, the HTML5 specification includes a way to fix this (without leaking sensitive information). We discussed this with Dario, and he did some research, and &lt;a href="https://meta.wikimedia.org/wiki/Research:Wikimedia_referrer_policy">came up with a suggestion&lt;/a>, which got &lt;a href="https://meta.wikimedia.org/wiki/Research_talk:Wikimedia_referrer_policy">discussed&lt;/a>. It’s fascinating to watch a democratic process like this take place and take part in it.&lt;/span>&lt;/p>
&lt;p>&lt;span >We’re waiting to see how the discussion turns out, and hope that it all works out so we can continue to report on how amazing Wikipedia is at sending people to scholarly literature.&lt;/span>&lt;/p>
&lt;h3 id="span-how-shall-i-cite-theespan">&lt;span >How shall I cite thee?&lt;/span>&lt;/h3>
&lt;p>&lt;span >Another discussion grew out of that process, and we started talking to a Wikipedian called Nemo (note to Latin scholars: we weren’t just talking to ourselves). Nemo (real name Federico Leva) had a few suggestions of his own. Another way to solve the referrer problem is by using HTTPS URLs (HTML5 allows browsers to send the referrer domain when going from HTTPS to HTTPS).&lt;/span>&lt;/p>
&lt;p>&lt;span >This means going back to all the articles that use DOIs and change them from HTTP to HTTPS. Not as simple as it sounds, and it doesn’t sound simple. We started looking into how DOIs were cited on Wikipedia.&lt;/span>&lt;/p>
&lt;p>&lt;span >After some research we found that there are more ways that we expected to cite DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >First, there’s the URL. You can see it in action in &lt;a href="https://en.wikipedia.org/w/index.php?title=GridLAB-D&amp;action=edit">this article&lt;/a>. URLs can take various forms.&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >&lt;a href="http://dx.doi.org/10.5555/12345678" target="_blank">http://dx.doi.org/10.5555/12345678&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;a href="http://doi.org/10.5555/12345678" target="_blank">http://doi.org/10.5555/12345678&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;a href="https://dx.doi.org/10.5555/12345678" target="_blank">https://dx.doi.org/10.5555/12345678&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;a href="https://doi.org/10.5555/12345678" target="_blank">https://doi.org/10.5555/12345678&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;a href="http://doi.org/hvx" target="_blank">http://doi.org/hvx&lt;/a>&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;a href="https://doi.org/hvx" target="_blank">https://doi.org/hvx&lt;/a>&lt;/span>&lt;/li>
&lt;/ul>
&lt;p>&lt;span >Second there’s the &lt;a href="https://en.wikipedia.org/wiki/Template:Cite_journal">official template tag&lt;/a>, seen in action &lt;a href="https://en.wikipedia.org/w/index.php?title=Bird&amp;action=edit">here&lt;/a>:&lt;/span>&lt;/p>
&lt;pre>&amp;lt;ref name="SCI-20140731"&amp;gt;{{cite journal |title=Sustained miniaturization and anatomical innovation in the dinosaurian ancestors of birds |url=http://www.sciencemag.org/content/345/6196/562 |date=1 August 2014 |journal=[[Science (journal)|Science]] |volume=345 |issue=6196 |pages=562–566 |doi=10.1126/science.1252243 |accessdate=2 August 2014 |last1=Lee |first1=Michael S. Y. |first2=Andrea|last2=Cau |first3=Darren|last3=Naish|first4=Gareth J.|last4=Dyke}}&amp;lt;/ref&amp;gt;
&lt;/pre>
&lt;p>&lt;span >There’s a DOI in there somewhere. This is the best way to cite DOIs, firstly as it’s actually a proper traditional citation and there’s nothing magic about DOIs, secondly because it’s a template tag and can be re-rendered to look slightly different if needed.&lt;/span>&lt;/p>
&lt;p>&lt;span >Third there’s the old official &lt;a href="https://en.wikipedia.org/wiki/Template:Cite_doi">DOI template tag&lt;/a> that’s now discouraged:&lt;/span>&lt;/p>
&lt;pre>&amp;lt;ref name="Example2006"&amp;gt;{{Cite doi|10.1146/annurev.earth.33.092203.122621}}&amp;lt;/ref&amp;gt;&lt;/pre>
&lt;p>&lt;span >And then there’s another &lt;a href="https://en.wikipedia.org/wiki/Wikipedia:Template_messages/Links#Miscellanea">one&lt;/a>.&lt;/span>&lt;/p>
&lt;pre>{{doi|10.5555/123456789}}
&lt;/pre>
&lt;p>&lt;span >Knowing all this helps us find DOIs. But if we want to convert DOIs links in Wikipedia to use HTTPS, it means that there are more template tags to modify and more pages to re-render.&lt;/span>&lt;/p>
&lt;p>&lt;span >Nemo also put DOIs on the &lt;a href="https://meta.wikimedia.org/wiki/Interwiki_map">Interwiki Map&lt;/a> which should make automatically changing some of the URLs a lot easier.&lt;/span>&lt;/p>
&lt;p>&lt;span >We’re very grateful to Nemo for his suggestions and work on this. We’ll report back!&lt;/span>&lt;/p>
&lt;h3 id="span-the-elephant-in-the-roomspan">&lt;span >The elephant in the room&lt;/span>&lt;/h3>
&lt;p>&lt;span >Those of you who know how DOIs work will have spotted an unsecured elephant in the room. When you visit a DOI, you visit the URL, which hits the &lt;a href="http://www.doi.org/doi_handbook/3_Resolution.html#3.7.3">DOI resolver proxy server&lt;/a>, which returns a message to your browser to redirect to the landing page on the publisher’s site.&lt;/span>&lt;/p>
&lt;p>&lt;span >Securely talking to the DOI resolver by using HTTPS instead of HTTP means that no-one can eavesdrop and see which DOI you are visiting, or tamper with the result and send you off to a different page. But the page you are sent to will be, in nearly all cases, still HTTP. Upgrading infrastructure isn’t trivial, and, with over 4000 members (mostly publishers), most Crossref DOIs will still redirect to standard HTTP pages for the foreseeable future.&lt;/span>&lt;/p>
&lt;p>&lt;span >You can keep as secure as possible by using &lt;a href="https://www.eff.org/https-everywhere">HTTPS Everywhere&lt;/a>.&lt;/span>&lt;/p>
&lt;h2 id="span-finspan">&lt;span >Fin&lt;/span>&lt;/h2>
&lt;p>&lt;span >There’s lots going on, watch this space to see developments. Thanks for reading this, and all the links. We’d love to know what you think.&lt;/span>&lt;/p>
&lt;h2 id="span-bootnotespan">&lt;span >Bootnote&lt;/span>&lt;/h2>
&lt;p>&lt;span >Not long after this blog post was published we saw something very interesting.&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/03/Screen-Shot-2015-03-04-at-17.18.42.png" alt="Interesting DOI" class="img-responsive" />
&lt;p>&lt;span >That’s no DOI. We like interesting things, but they can panic us. This turned out to be a great example of why this kind of thing can be useful. A minute’s digging and we &lt;a href="https://ja.wikipedia.org/w/index.php?title=%E6%9C%80%E5%A4%A7%E3%83%95%E3%83%AD%E3%83%BC%E5%95%8F%E9%A1%8C&amp;diff=54616146&amp;oldid=54612246">found the article edit&lt;/a>:&lt;/span>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/03/Screen-Shot-2015-03-04-at-17.20.06.png" alt="Wikipedia typo" class="img-responsive" />
&lt;p>&lt;span >It turns out that this was a typo: someone put a title when they should have put in a DOI. And, as &lt;a href="http://events.labs.crossref.org/dois/a%20data%20structure%20for%20dynamic%20trees">the event&lt;/a> shows, this was removed from the Wikipedia article.&lt;/span>&lt;/p></description></item><item><title>Crossref’s DOI Event Tracker Pilot</title><link>https://www.crossref.org/blog/crossrefs-doi-event-tracker-pilot/</link><pubDate>Mon, 02 Mar 2015 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/crossrefs-doi-event-tracker-pilot/</guid><description>&lt;h2 id="tldr">TL;DR&lt;/h2>
&lt;p>Crossref’s “DOI Event Tracker Pilot”- 11 million+ DOIs &amp;amp; 64 million+ events. You can play with it at: &lt;a href="http://goo.gl/OxImJa" target="_blank">http://goo.gl/OxImJa&lt;/a>&lt;/p>
&lt;h2 id="tracking-doi-events">Tracking DOI Events&lt;/h2>
&lt;p>So have you been wondering what we’ve been doing &lt;a href="https://www.crossref.org/blog/many-metrics-such-data-wow/">since we posted about the experiments we were conducting using PLOS’s open source ALM code&lt;/a>? A lot, it turns out. About a week after our post, we were contacted by a group of our members from &lt;a href="http://oaspa.org/" target="_blank">OASPA&lt;/a> who expressed an interest in working with the system. Apparently they were all about to conduct similar experiments using the ALM code, and they thought that it might be more efficient and interesting if they did so together using our installation. Yippee. Publishers working together. That’s what we’re all about.&lt;/p>
&lt;p>So we convened the interested parties and had a meeting to discuss what problems they were trying to solve and how Crossref might be able to help them. That early meeting came to a consensus on a number of issues:&lt;/p>
&lt;ul>
&lt;li>The group was interested in exploring the role Crossref could play in providing an open, common infrastructure to track activities around DOIs, they were not interested in having Crossref play a role in the value-add services of reporting on an interpreting the meaning of said activities.&lt;/li>
&lt;li>The working group needed representatives from multiple stakeholders in the industry. Not just open access publishers from OASPA, but from subscription based publishers, funders, researchers and third party service providers as well.&lt;/li>
&lt;li>That it was desirable to conduct a pilot to see if the proposed approach was both technically feasible and financially sustainable.&lt;/li>
&lt;/ul>
&lt;p>And so after that meeting, the “experiment” graduated to becoming a “pilot.” This Crossref pilot is based on the premise that the infrastructure involved in tracking common information about “DOI events” can be usefully separated from the value-added services of analysing and presenting these events in the form of qualitative indicators. There are many forms of events and interactions which may be of interest. Service providers will wish to analyse, aggregate and present those in a range of different ways depending on the customer and their problem. The capture of the underlying events can be kept separate from those services.&lt;/p>
&lt;p>In order to ensure that the Crossref pilot is not mistaken for some sub rosa attempt to establish new metrics for evaluating scholarly output, we also decided eschew any moniker that includes the word “metrics” or synonyms. So the “ALM Experiment” is dead. Long live the “”DOI Event Tracker” (DET) pilot. Similarly PLOS’s &lt;a href="https://github.com/articlemetrics/lagotto" target="_blank">open source “ALM software”&lt;/a> has been resurrected under the name “&lt;a href="http://en.wikipedia.org/wiki/Lagotto_Romagnolo" target="_blank">Lagotto&lt;/a>.”&lt;/p>
&lt;h2 id="the-technical-issues">The Technical Issues&lt;/h2>
&lt;p>Crossref members are interested in knowing about “events” relating to the DOIs that identify their content. But our members face a now-classic problem. There are a large number of sources for scholarly publications (3k+ Crossref members) and that list is still growing. Similarly, there are an unbounded number of potential sources for usage information. For example:&lt;/p>
&lt;ul>
&lt;li>Supplemental and grey literature (e.g. data, software, working papers)&lt;/li>
&lt;li>Orthogonal professional literature (e.g. patents, legal documents, governmental/NGO/IGO reports, consultation reports, professional trade literature).&lt;/li>
&lt;li>Scholarly tools (e.g. citation management systems, text and data mining applications).&lt;/li>
&lt;li>Secondary outlets for scholarly literature (institutional and disciplinary repositories, A&amp;amp;I services).&lt;/li>
&lt;li>Mainstream media (e.g. BBC, New York Times).&lt;/li>
&lt;li>Social media (e.g. Wikipedia, Twitter, Facebook, Blogs, Yo).&lt;/li>
&lt;/ul>
&lt;p>Finally, there is a broad and growing audience of stakeholders who are interested in seeing how the literature is being used. The audience includes publishers themselves as well as funders, researchers, institutions, policy makers and citizens.&lt;/p>
&lt;p>Publishers (or other stakeholders) could conceivably each choose to run their own system to collect this information and redistribute it to interested parties. Or they can work with a vendor to do the same. But either case, they would face the following problems:&lt;/p>
&lt;ul>
&lt;li>The N sources will change. New ones will emerge. Old ones will vanish.&lt;/li>
&lt;li>The N audiences will change. New ones will emerge. Old ones will vanish.&lt;/li>
&lt;li>Each publisher/vendor will need to deal with N source’s different APIs, rate limits, T&amp;amp;Cs, data licenses, etc. This is a logistical headache for both the publishers/vendors and for the sources.&lt;/li>
&lt;li>Each audience will need to deal with N publisher/vendor APIs, rate limits, T&amp;amp;Cs, data licenses, etc. This is a logistical headache for both the audiences and for the publishers.&lt;/li>
&lt;li>If publishers/vendors use different systems which in turn look at different sources, it will be difficult to compare or audit results across publishers/vendors.&lt;/li>
&lt;li>If a journal moves from one publisher to another, then how are the metrics for that journal’s articles going to follow the journal?&lt;/li>
&lt;/ul>
&lt;p>And then there is the simple issue of scale. Most parties will be interested in comparing the data that they collect for their own content, with data about their competitors. Hence, if they all run their own system, they will each be querying much more than their own data. If, for example, just the commercial third-party providers were interested in collecting data covering the formal scholarly literature, they would &lt;em>each&lt;/em> find themselves querying the same sources for the same 80 million DOIs. To put this into perspective, to refresh the data for 10 million DOIs once a month, would require sources to support ~ 14K API calls an hour. 60 million DOIs would require 100K API calls an hour. Current standard API caps for many of the sources that people are interested in querying hover around 2K per hour. We may see these sources lift that cap for exceptional cases, but they are unlikely to do so for many different clients all of whom are querying essentially the same thing.&lt;/p>
&lt;p>These issues typify the “multiple bilateral relationships” problem that Crossref was founded to try and ameliorate. When we have many organisations trying to access the exact same APIs to process the exact same data (albeit to different ends), then it seems likely that Crossref could help make the process more efficient.&lt;/p>
&lt;h2 id="piloting-a-proposed-solution">Piloting A Proposed Solution&lt;/h2>
&lt;p>The Crossref DET pilot aims to show the feasibility of providing a hub for the collection, storage and propagation of DOI events from multiple sources to multiple audiences.&lt;/p>
&lt;h3 id="data-collection">Data Collection&lt;/h3>
&lt;ul>
&lt;li>&lt;strong>Pull&lt;/strong>: DET will collect DOI event data from sources that are of common interest to the membership, but which are unlikely to make special efforts to accommodate the scholarly communications industry. Examples of this class of source include large, broadly popular services like FaceBook, Twitter, VK, Sina Weibo, etc.&lt;/li>
&lt;li>&lt;strong>Push&lt;/strong>: DET will allow sources to send DOI event data directly to Crossref in one of three ways:
&lt;ul>
&lt;li>Standard Linkback: Using standards that are widely used on the web. This will automatically enable linkback-aware systems like WordPress, Moveable Type, etc. to alert DET to DOI events.&lt;/li>
&lt;li>Scholarly Linkback: A to-be-defined augmented linkback-style API which will be optimized to work with scholarly resources and which will allow for more sophisticated payloads including other identifiers (e.g. ORCIDs, FundRefs), metadata, provenance information and authorization information. This system could be used by tools designed for scholarly communications. So, for example, it could be used by publisher platforms to distribute events related to downloads or comments within their discussion forums. It could also be used by third party scholarly apps like Zotero, Mendeley, Papers, Authorea, IRUS-UK, etc. in order to alert interested parties in events related to specific DOIs.&lt;/li>
&lt;li>&lt;strong>Redirect&lt;/strong>: DET will also be able to serve as a service discovery layer that will allow sources to push DOI event data directly to an appropriate publisher-controlled endpoint using the above scholarly linkback mechanism. This can be used by sources like repositories in order to send sensitive usage data directly to the relevant publishers.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;h3 id="data-propagation">Data Propagation&lt;/h3>
&lt;p>Parties may want to use the DET in order to propagate information about DOI events. The system will support two broad data propagation patterns:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>one-to-many&lt;/strong>: DOI events that are commonly harvested (pulled) by the DET system from a single source will be distributed freely to anybody who queries the DET API. Similarly, sources that push DOI events via the standard or scholarly linkback mechanisms, will also propagate their DOI events openly to anybody who queries the DET API. DOI events that are propagated in either of these cases will be kept and logged by the DET system along with appropriate provenance information. This will be the most common, default propagation model for the DET system.&lt;/li>
&lt;li>&lt;strong>one-to-one&lt;/strong>: Sources of DOI events can also report (push) DOI event data directly to owner of the relevant DOI &lt;em>if&lt;/em> the DOI owner provides &amp;amp; registers a suitable end-point with the DET system. In these cases, data sources seeking to report information relating to a DOI, will be redirected (with a suitable 30X HTTP status and relevant headers) to the end-point specified by the DOI owner. The DET system will not keep the request or provenance information. One-to-one propagation model is designed to handle use cases where the source of the DOI event has put restrictions on the data and will only share the DOI events with the owner (registrant) of the DOI. This use case may be used, for example, by aggregators or A&amp;amp;I services that want to report confidential data directly back to a publisher. The advantage of the redirect mechanism is that Crossref is not put into the position of having to secure sensitive data as said data will never reside on Crossref systems.&lt;/li>
&lt;/ul>
&lt;p>Note that the two patterns can be combined. So, for example, a publisher might want to have public social media events reported to the DET and propagated accordingly, but to also to private third parties report confidential information directly to the publisher.&lt;/p>
&lt;h2 id="so-where-are-we">So Where Are We?&lt;/h2>
&lt;p>So to start with, the DET Working Group has grown substantially since the early days and we have representatives from a wide variety of stakeholders. The group includes:&lt;/p>
&lt;ul>
&lt;li>Cameron Neylon, PLOS&lt;/li>
&lt;li>Chris Shillum, Elsevier&lt;/li>
&lt;li>Dom Mitchell, Co-action Publishing&lt;/li>
&lt;li>Euan Adie, Altmetric&lt;/li>
&lt;li>Jennifer Lin, PLOS&lt;/li>
&lt;li>Juan Pablo Alperin, PKP&lt;/li>
&lt;li>Kevin Dolby, Wellcome Trust&lt;/li>
&lt;li>Liz Ferguson, Wiley&lt;/li>
&lt;li>Maciej Rymarz, Mendeley&lt;/li>
&lt;li>Mark Patterson, eLife&lt;/li>
&lt;li>Martin Fenner, PLOS&lt;/li>
&lt;li>Mike Thelwell, U Wolverhampton&lt;/li>
&lt;li>Rachel Craven, BMC&lt;/li>
&lt;li>Richard O’Beirne, OUP&lt;/li>
&lt;li>Ruth Ivimey-Cook, eLife&lt;/li>
&lt;li>Victoria Rao, Elsevier&lt;/li>
&lt;/ul>
&lt;p>As well as the usual contingent of Crossref cat-herders including: Geoffrey Bilder, Rachael Lammey &amp;amp; Joe Wass.&lt;/p>
&lt;p>When we announced the then-DET experiment, we said that one of the biggest challenges would be to create something that scaled to industry levels. At launch, we only loaded in about 317,500+ Crossref DOIs representing publications from 2014 and we could see the system was going to struggle. Since then Martin Fenner and Jennifer Lin at PLOS have been focusing on making sure that the Lagotto code scales appropriately and now it is currently humming along with just over 11.5 million DOIs for which we’ve gathered over 64 million “events.” We aren’t worried about scalability on that front any more.&lt;/p>
&lt;p>We’ve also shown that third parties should be able to access the API to provide value added reporting and metrics. As a demonstration of this, &lt;a href="https://web.archive.org/web/20150924184918/http://parascope.crowdometer.org/" target="_blank">PLOS configured a copy of its reporting software “Parascope”&lt;/a> to point at the Crossref DET instance. The next step we’re taking is to start testing the “push” API mechanism and the “point-to-point redirect” API mechanism. For the push API, we should have a really exciting demo available to show within the next few days. And on the point-to-point redirect, we have a sub-group exploring how the point-to-point redirect mechanism could potentially be used for reporting &lt;a href="http://www.projectcounter.org/about.html" target="_blank">COUNTER&lt;/a> stats as a compliment to the &lt;a href="http://www.niso.org/workrooms/sushi" target="_blank">Sushi&lt;/a> initiative.&lt;/p>
&lt;p>The other major outstanding task we have before us is to calculate what the costs will be of running the DET system as a production service. In this case we expect to have some pretty accurate data to go on as we will have had close to half a year of running the pilot with a non-trivial number of DOIs and sources. Note that the work group is concerned to ensure that the underlying data from the system remains open to all. Keeping this raw data open as seen as critical to establishing trust in the metrics and reporting systems that third parties build on the data. The group has also committed to leaving the creation of value-add services to third parties. As such we have been focusing on exploring business models based around service-level-agreement backed versions of the API to complement the free version of the same API. The free API will come with no guarantees of uptime, performance characteristics or support. For those users that depend on the API in order to deliver their services, we will offer paid-for SLA-backed versions of the free APIs. We can then configure our systems so that we can independently scale these SLA-backed APIs in order to meet SLA agreements.&lt;/p>
&lt;p>Our goal is to have these calculations complete in time for the working group to make a recommendation to the Crossref board meeting in July 2015.&lt;/p>
&lt;p>Until then, we’ll use CrossTech as a venue for notifying people when we’ve hit new milestones or added new capabilities to the DET Pilot system.&lt;/p></description></item><item><title>Problems with dx.doi.org on January 20th 2015- what we know.</title><link>https://www.crossref.org/blog/problems-with-dx.doi.org-on-january-20th-2015-what-we-know./</link><pubDate>Wed, 21 Jan 2015 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/problems-with-dx.doi.org-on-january-20th-2015-what-we-know./</guid><description>&lt;p>&lt;span >&lt;a href="http://www.urbandictionary.com/define.php?term=Hells%20Teeth">Hell’s teeth&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >So today (January 20th, 2015) the DOI HTTP resolver at dx.doi.org started to fail intermittently around the world. The doi.org domain is managed by &lt;a href="http://www.cnri.reston.va.us/">CNRI&lt;/a> on behalf of the &lt;a href="http://www.doi.org/">International DOI Foundation&lt;/a>. This means that the problem affected all DOI registration agencies including Crossref, &lt;a href="https://www.datacite.org/">DataCite&lt;/a>, &lt;a href="https://www.medra.org/">mEDRA&lt;/a> etc. This also means that more popularly known end-user services like &lt;a href="http://figshare.com/">FigShare&lt;/a> and &lt;a href="https://zenodo.org/">Zenodo&lt;/a> were affected. The problem has been fixed, but the fix will take some time to propagate throughout the DNS system. You can monitor the progress here:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.whatsmydns.net/#A/doi.org">&lt;a href="https://www.whatsmydns.net/#A/doi.org" target="_blank">https://www.whatsmydns.net/#A/doi.org&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Now for the embarrassing stuff…&lt;/span>&lt;/p>
&lt;p>&lt;span >At first lots of people were speculating that the problem had to do with somebody forgetting to renew the dx.doi.org domain name. Our information from CNRI was that the problem had to do with a mistaken change to a DNS record and that the domain name wasn’t the issue. We corrected people who were reporting that domain name renewal as the cause, but eventually we learned that it was actually true. We have had it confirmed that the problem originated with CNRI manually renewing the domain name at the last minute. Ugh. &lt;span >CNRI will issue a statement soon. We’ll link to it as soon as they do.&lt;/span> UPDATE (Jan 21st): CNRI has sent Crossref a statement. They do not have it on their site yet, so we have can included it &lt;a href="#cnri">below&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >In the mean time, if you are having trouble resolving DOIs, a neat trick to know is that you can do so using the Handle system directly. For example:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="http://hdl.handle.net/10.5555/12345678">&lt;a href="http://hdl.handle.net/10.5555/12345678" target="_blank">http://hdl.handle.net/10.5555/12345678&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref will, of course, also analyse what occurred, and issue a public report as well. Obviously, this report will include an analysis of how the outage effected DOI referrals to our members.&lt;/span>&lt;/p>
&lt;p>&lt;span >The amazingly cool thing is that everybody online has been very supportive and has helped us to diagnose the problem. Some have even said that the event underscores a point we often make about so-called “persistent-identifiers”- which is that they are not magic technology; the “persistence” is the result of a social contract. We like to say that Crossref DOIs are as persistent as Crossref staff. Well, to that phrase we have to add “and IDF staff” and “CNRI staff” and “ICANN staff”. It is &lt;a href="http://en.wikipedia.org/wiki/Turtles_all_the_way_down">turtles all the way down&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >We don’t want to dismiss this event as an inevitable consequence of interdependent systems.And we don’t want to pass the buck. We need to learn something practical from this. How can we guard against this type of problem in the future? Again, people following this issue on Twitter have already been helping with suggestions and ideas. Can we crowd-source the monitoring of persistent identifier SLAs? Could we leverage Wikipedia, Wikidata or something similar to monitor critical identifiers and other infrastructure like purls, DOIs, handles, PMIDs, perma.cc, etc? Should we be looking at designating special exceptions to the normal rules governing DNS names? Do we need to distribute the risk more? Or is it enough &lt;em>cough&lt;/em> to simply ensure that somebody, somewhere in the dependency chain had enabled DNS protection and auto-renewal for critical infrastructure DNS names?&lt;/span>&lt;/p>
&lt;p>&lt;span >Truly, we are humbled. For all the redundancy built into our systems (multiple servers, multiple hosting sites, Raid drives, redundant power), we were undone by a simple administrative task. Crossref, IDF and CNRI- we all feel &lt;a href="http://www.urbandictionary.com/define.php?term=a%20bit%20crap">a bit crap&lt;/a>. But we’ll get back. We’ll fix things. And we’ll let you know how we do it.&lt;/span>&lt;/p>
&lt;p>&lt;span >We will update this space as we know more. We will also keep people updated on twitter on @CrossrefNews. And we will report back in detail as soon as we can.&lt;/span>&lt;/p>
&lt;hr>
&lt;h3 id="cnri">CNRI Statement&lt;/h3>
&lt;p>&lt;code>&amp;quot;The doi.org domain name was inadvertently allowed to expire for a brief period this morning (Jan 20). It was reinstated shortly after 9am this morning as soon as the relevant CNRI employee learned of it. A reminder email sent earlier this month to renew the registration was apparently missed. We sincerely apologize for any difficulties this may have caused. The domain name has since been placed on automatic renewal, which should prevent any repeat of this event.&amp;quot;&lt;/code>&lt;/p></description></item><item><title>Introducing the Crossref Labs DOI Chronograph</title><link>https://www.crossref.org/blog/introducing-chronograph/</link><pubDate>Mon, 12 Jan 2015 00:00:00 +0000</pubDate><author>Joe Wass</author><guid>https://www.crossref.org/blog/introducing-chronograph/</guid><description>&lt;p>tl;dr &lt;a href="http://chronograph.labs.crossref.org" target="_blank">http://chronograph.labs.crossref.org&lt;/a>&lt;/p>
&lt;p>At Crossref we mint DOIs for publications and send them out into the world, but we like to hear how they’re getting on out there. Obviously, DOIs are used heavily within the formal scholarly literature and for citations, but they’re increasingly being used outside of formal publications in places we didn’t expect. With our DOI Event Tracking / ALM pilot project we’re collecting information about how DOIs are mentioned on the open web to try and build a picture about new methods of citation.&lt;/p>
&lt;p>As part of the &lt;a href="https://www.crossref.org/blog/many-metrics-such-data-wow">preparation for collaborating with Wikipedia&lt;/a>, we looked at our statistics about when DOIs are clicked and discovered that Wikipedia was, over a two year period from 2012, the eighth largest referrer of DOIs. This means that not only does Wikipedia have a lot of DOIs, but people click them too. This bit of one-off data analysis (which surprised us) gave us enough of a prod to kickstart our collaboration with Wikipedia.&lt;/p>
&lt;p>At the &lt;a href="https://www.crossref.org/">ALM Workshop 2014 in San Francisco&lt;/a> we talked to some Wikipedians and bibliometricians and realised that we were sitting on a really interesting data-set and that it would be churlish not to share it. At the hackathon (&lt;a href="http://dx.doi.org/10.6084/m9.figshare.1287503" target="_blank">read the report here&lt;/a>) we started work on a service to gather information about DOIs and, a month later, we’re ready to unveil the DOI Chronograph.&lt;/p>
&lt;p>&lt;strong>Show me the goods&lt;/strong>&lt;/p>
&lt;p>You can see:&lt;/p>
&lt;p>Daily referrals (clicks) from top level domains, e.g. Wikipedia.org: &lt;a href="http://chronograph.labs.crossref.org/domain.html?domain=wikipedia.org" target="_blank">http://chronograph.labs.crossref.org/domain.html?domain=wikipedia.org&lt;/a>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/01/wikipedia-referrals.png" alt="wikipedia-referrals" class="img-responsive" />
&lt;p>Daily referrals from specific subdomains, e.g. fr.wikipedia.org: &lt;a href="http://chronograph.labs.crossref.org/domain.html?domain=fr.wikipedia.org" target="_blank">http://chronograph.labs.crossref.org/domain.html?domain=fr.wikipedia.org&lt;/a>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/01/fr-wikipedia-referrals.png" class="img-responsive" />
&lt;p>Daily resolutions per DOI: &lt;a href="http://chronograph.labs.crossref.org/doi.html?doi=10.1787%2F20752288" target="_blank">http://chronograph.labs.crossref.org/doi.html?doi=10.1787%2F20752288&lt;/a>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/01/doi-referrals.png" alt="doi-referrals" class="img-responsive"/>
&lt;p>&lt;a name="ranking">&lt;/a>&lt;/p>
&lt;p>And, the chart that kicked this all off: DOI referring domains league tables. This shows that Wikipedia is the 3rd or 4th non-traditional referrer of DOIs (i.e. excluding referrals from Publishers’ domains): &lt;a href="http://chronograph.labs.crossref.org/top.html" target="_blank">http://chronograph.labs.crossref.org/top.html&lt;/a>&lt;/p>
&lt;img src="https://www.crossref.org/wp/blog/uploads/2015/01/top-domains.png" alt="top-domains" class="img-responsive" />
&lt;p>&lt;strong>Try it out&lt;/strong>&lt;/p>
&lt;p>Visit the Chronograph and give it a try &lt;a href="http://chronograph.labs.crossref.org" target="_blank">chronograph.labs.crossref.org&lt;/a> on your &lt;a href="http://chronograph.labs.crossref.org/doi.html?doi=10.1657%2F1938-4246-44.4.483" target="_blank">favourite DOI&lt;/a> (&lt;a href="http://chronograph.labs.crossref.org/doi.html?doi=10.1007%2Fs12110-002-1021-6" target="_blank">everyone&lt;/a> &lt;a href="http://chronograph.labs.crossref.org/doi.html?doi=10.1136%2Fbmj.327.7429.1459" target="_blank">has&lt;/a> &lt;a href="http://chronograph.labs.crossref.org/doi.html?doi=10.1016/j.imavis.2011.05.002" target="_blank">one&lt;/a>).&lt;/p>
&lt;p>&lt;strong>More data&lt;/strong>&lt;/p>
&lt;p>Talking to a bibliometrician we also realised we can correlate other data for DOIs. We’re getting the issue date (approximately the publication date) from our own metadata, as well as the date that the Crossref metadata was updated. This gives interesting results, like &lt;a href="http://chronograph.labs.crossref.org/doi.html?doi=10.1038%2Fncomms2953" target="_blank">the resolutions for 10.1038/ncomms2953&lt;/a>, which peak after publication and then tails off. We are attempting to collect the following information:&lt;/p>
&lt;ul>
&lt;li>daily resolution counts&lt;/li>
&lt;li>day on which resolution was first successful&lt;/li>
&lt;li>day on which it’s possible to resolve the DOI (we’ve got a bot running for new publications)&lt;/li>
&lt;li>day on which the publisher says the article was published&lt;/li>
&lt;li>day on which the metadata was most recently deposited with us&lt;/li>
&lt;li>day on which the metadata was first deposited with us&lt;/li>
&lt;/ul>
&lt;p>We’re not there yet, but we’ve made a start and we’ve already got some pretty interesting data!&lt;/p>
&lt;p>&lt;strong>Weasel words&lt;/strong>&lt;/p>
&lt;p>It’s a labs project so the usual weasel words apply. Specifically, we currently have the logs for 2012 to 2014 (we’re working at digging out the rest), and the referral information for 50 million DOIs (out of 71 million). That number will be higher by the time you read this. If your page is slow to load, be patient, as it’s currently working hard crunching numbers.&lt;/p>
&lt;p>This project is focused on exploring the use of DOIs outside of the formal literature. As such, we are only looking at referrals from domains that do not appear to belong to primary publishers (i.e. our members). If you try a domain and it doesn’t work, it could be that the domain belongs to one of our members. If you’ve notice any mistakes, please email us at &lt;a href="mailto:labs@crossref.org">labs@crossref.org&lt;/a> .&lt;/p>
&lt;p>Finally, these numbers contain all DOI resolutions. That’s human clicks but also content negotiation to retrieve metadata, robots etc. We might try to filter them in future, but for now be aware that not every visitor is a human.&lt;/p>
&lt;p>I’ll detail some of the the technical stuff (it’s very interesting) and what happened next with Wikipedia in a future post. Watch this space.&lt;/p></description></item><item><title>Linking data and publications</title><link>https://www.crossref.org/blog/linking-data-and-publications/</link><pubDate>Sun, 21 Sep 2014 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/linking-data-and-publications/</guid><description>&lt;p>&lt;span >Do you want to see if a Crossref DOI (typically assigned to publications) refers to DataCite DOIs (typically assigned to data)? Here you go:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://web.archive.org/web/20150121025249/http://api.labs.crossref.org/graph/doi/10.4319/lo.1997.42.1.0001">&lt;a href="https://web.archive.org/web/20150121025249/http://api.labs.crossref.org/graph/doi/10.4319/lo.1997.42.1.0001" target="_blank">https://web.archive.org/web/20150121025249/http://api.labs.crossref.org/graph/doi/10.4319/lo.1997.42.1.0001&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Conversely, do you want to see if a DataCite DOI refers to Crossref DOIs? Voilà:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://web.archive.org/web/20150321190744/http://api.labs.crossref.org/graph/doi/10.1594/pangaea.185321">&lt;a href="https://web.archive.org/web/20150321190744/http://api.labs.crossref.org/graph/doi/10.1594/pangaea.185321" target="_blank">https://web.archive.org/web/20150321190744/http://api.labs.crossref.org/graph/doi/10.1594/pangaea.185321&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;h3 id="span-backgroundspan">&lt;span >Background&lt;/span>&lt;/h3>
&lt;p>&lt;span >“How can we effectively integrate data into the scholarly record?” This is the question that has, for the past few years, generated an unprecedented amount of handwringing on the part researchers, librarians, funders and publishers. Indeed, this week I am in Amsterdam to attend the 4th RDA plenary in which this topic will no doubt again garner a lot of deserved attention.&lt;/span>&lt;/p>
&lt;p>&lt;span >We hope that the small example above will help push the RDAs agenda a little further. Like the recent &lt;a href="http://odin-project.eu">ODIN&lt;/a> project, It illustrates how we can simply combine two existing scholarly infrastructure systems to build important new functionality for integrating research objects into the scholarly literature.&lt;/span>&lt;/p>
&lt;p>&lt;span >Does it solve all of the problems associated with citing and referring to data? Can the various workgroups at RDA just cancel their data citation sessions and spend the week riding bikes and gorging on croquettes? Of course not. But my guess is that by simply integrating DataCite and Crossref in this way, we can make a giant push in the right direction.&lt;/span>&lt;/p>
&lt;p>&lt;span >There are certainly going to be differences between traditional citation and data citation. Some even claim that citing data isn’t “as simple as citing traditional literature.” But this is a caricature of traditional citation. If you believe this, go off an peruse the MLA, Chicago, Harvard, NLM and APA citation guides. Then read &lt;a href="http://www.hup.harvard.edu/catalog.php?isbn=9780674307605"> Anthony Grafton’s, &lt;em>The Footnote&lt;/em>&lt;/a>? Are you back yet? Good, so let’s continue…&lt;/span>&lt;/p>
&lt;p>&lt;span >Citation &lt;em>of any sort&lt;/em> is a complex issue- full of subtleties, edge-cases exceptions, disciplinary variations and kludges. Historically, the way to deal with these edge-cases has been social, not technical. For traditional literature we have simply evolved and documented citation practices which generally make contextually-appropriate use of the same technical infrastructure (footnotes, endnotes, metadata, etc.). I suspect the same will be true in citing data. The solutions will not be technical, they will mostly be social. Researchers, and publishers will evolve new, contextually appropriate mechanisms to use existing infrastructure deal with the peculiarities of data citation.&lt;/span>&lt;/p>
&lt;p>&lt;span >Does this mean that we will never have to develop new systems to handle data citation? Possibly But I don’t think we’ll know what those systems are or how they should work until we’ve actually had researchers attempting to use and adapt the tools we have.&lt;/span>&lt;/p>
&lt;h3 id="span-technical-backgroundspan">&lt;span >Technical background&lt;/span>&lt;/h3>
&lt;p>&lt;span >About five years ago, Crossref and DataCite explored the possibility of exposing linkages between DataCite and Crossref DOIs. Accordingly, we spent some time trying to assemble an example corpus that would illustrate the power of interlinking these identifiers. We encountered a slight problem. We could hardly find any examples. At that time, virtually nobody cited data with DataCite DOIs and, if they did, the Crossref system did not handle them properly. We had to sit back and wait a while.&lt;/span>&lt;/p>
&lt;p>&lt;span >And now the situation has changed.&lt;/span>&lt;/p>
&lt;p>&lt;span >This demonstrator harvests DataCite DOIs using their OAI-PMH API and links them in a graph database with Crossref DOIs.&lt;/span> &lt;span >We have exposed this functionality on the “labs” (i.e. experimental) version of our REST API as a graph resource. So…&lt;/span>&lt;/p>
&lt;p>&lt;span >Now you can get a list of Crossref DOIs that refer to DataCite DOIs using &lt;a href="https://www.crossref.org/services/event-data/" target="_blank">Event Data&lt;/a>.&lt;/p>
&lt;h3 id="span-caveats-and-weasel-wordsspan">&lt;span >Caveats and Weasel Words&lt;/span>&lt;/h3>
&lt;ul>
&lt;li>&lt;span >We have not finished indexing all the links.&lt;/span>&lt;/li>
&lt;li>&lt;span >The API is currently a very early labs project. It is about as reliable as a devolution promise from Westminster.&lt;/span>&lt;/li>
&lt;li>&lt;span >The API is run on a pair of raspberry-pi’s connected to the internet via bluetooth.&lt;/span>&lt;/li>
&lt;li>&lt;span >It is not fast.&lt;/span>&lt;/li>
&lt;li>&lt;span >The representation and the API is under active development.&lt;/span>&lt;span >Things will change. Watch the Crossref Labs site for updates on this collaboration with DataCite&lt;/span>&lt;/li>
&lt;/ul></description></item><item><title>Citation needed</title><link>https://www.crossref.org/blog/citation-needed/</link><pubDate>Thu, 07 Aug 2014 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/citation-needed/</guid><description>&lt;p>Remember when &lt;a href="https://www.crossref.org/blog/many-metrics-such-data-wow">I said that the Wikipedia was the 8th largest referrer of DOI links to published research&lt;/a>? This &lt;em>despite&lt;/em> only a fraction of eligible references in the free encyclopaedia using DOIs.&lt;/p>
&lt;p>We aim to fix that. Crossref and Wikimedia are launching a new initiative to better integrate scholarly literature in the world’s largest public knowledge space, Wikipedia.&lt;/p>
&lt;p>This work will help promote standard links to scholarly references within Wikipedia, which persist over time by ensuring consistent use of DOIs and other citation identifiers in Wikipedia references. Crossref will support the development and maintenance of Wikipedia’s citation tools on Wikipedia. This work will include bug fixes and performance improvements for existing tools, extending the tools to enable Wikipedia contributors to more easily look up and insert DOIs, and providing a “linkback” mechanism that alerts relevant parties when a persistent identifier is used in a Wikipedia reference.&lt;/p>
&lt;p>In addition, Crossref is creating the role of Wikimedia Ambassador (modeled after &lt;a href="https://outreach.wikimedia.org/wiki/Wikipedian_in_Residence" target="_blank">Wikimedian-in-Residence&lt;/a>) to act as liaison with the Wikimedia community, promote use of scholarly references on Wikipedia, and educate about DOIs and other scholarly identifiers (ORCIDs, PubMed IDs, DataCite DOIs, etc) across Wikimedia projects.&lt;/p>
&lt;p>Starting today, Crossref will be working with &lt;a href="https://www.crossref.org/blog/many-metrics-such-data-wow">Daniel Mietchen&lt;/a> to coordinate Crossref’s Wikimedia-related activities. Daniel’s team will be composed of &lt;a href="https://github.com/notconfusing" target="_blank">Max Klein&lt;/a> and &lt;a href="https://github.com/wrought" target="_blank">Matt Senate&lt;/a>, who will work to enhance Wikimedia citation tools, and will share the role of Wikipedia ambassador with &lt;a href="http://www.dorothyhoward.com/" target="_blank">Dorothy Howard&lt;/a>.&lt;/p>
&lt;p>Since the beginnings of Wikipedia, Daniel Mietchen has worked to integrate scholarly content into Wikimedia projects. He is part of an impressive community of active Wikipedians and developers who have worked extensively on linking Wikipedia articles to the formal literature and other scholarly resources. We’ve been talking to him about this project for nearly a year, and are happy to finally get it off the ground.&lt;/p>
&lt;p>-G&lt;figure id="attachment_367" class="wp-caption alignnone">&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/wp/blog/uploads/2014/08/IMG_0602-300x150.jpg" alt="Matt, Max and Daniel at #wikimania2014. Photo by Dorothy." width="300" height="150" class="size-medium wp-image-367" srcset="https://www.crossref.org/wp/blog/uploads/2014/08/IMG_0602-300x150.jpg 300w, https://www.crossref.org/wp/blog/uploads/2014/08/IMG_0602-1024x515.jpg 1024w, https://www.crossref.org/wp/blog/uploads/2014/08/IMG_0602-624x314.jpg 624w" sizes="(max-width: 300px) 85vw, 300px" />&lt;figcaption class="wp-caption-text">]&lt;a href="https://github.com/wrought" target="_blank">7&lt;/a> Matt, Max and Daniel at #wikimania2014. Photo by Dorothy.&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;h1 id="wikimania2014">wikimania2014&lt;/h1></description></item><item><title>♫ Researchers just wanna have funds ♫</title><link>https://www.crossref.org/blog/researchers-just-wanna-have-funds/</link><pubDate>Thu, 10 Apr 2014 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/researchers-just-wanna-have-funds/</guid><description>&lt;p>&lt;img src="https://www.crossref.org/wp/blog/uploads/2014/04/5788184739_03b5b2a20d_b-150x150.jpg" alt="Cindy Lauper">&lt;/p>
&lt;p>&lt;a href="https://www.flickr.com/photos/59935931@N05/5788184739/" target="_blank">photo credit&lt;/a>&lt;/p>
&lt;h2 id="summary">Summary&lt;/h2>
&lt;p>You can use a new Crossref &lt;a href="http://en.wikipedia.org/wiki/Application_programming_interface" target="_blank">API&lt;/a> to query all sorts of interesting things about who funded the research behind the content Crossref members publish.&lt;/p>
&lt;h2 id="background">Background&lt;/h2>
&lt;p>Back in May 2013 we launched Crossref’s &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">FundRef&lt;/a> service. It can be summarized like this:&lt;/p>
&lt;ul>
&lt;li>Crossref keeps and manages a &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">canonical list&lt;/a> of Funder Names (ephemeral) and associated identifiers (persistent).&lt;/li>
&lt;li>We encourage our members (or anybody, really- the list is available under A &lt;a href="http://creativecommons.org/choose/zero/" target="_blank">CC-Zero&lt;/a> license waiver) to use this list for collecting information on who funded the research behind the content that our members publish.&lt;/li>
&lt;li>We then ask that our members deposit this data in their normal Crossref metadata deposits.&lt;/li>
&lt;/ul>
&lt;p>And that was cool.&lt;/p>
&lt;p>But then people started asking us awkward questions. Questions like “what can I do with the funder data?” and “how do I query it?”.&lt;/p>
&lt;p>Stoopit people.&lt;/p>
&lt;p>Can’t you just let us bask for a few minutes in the sunny glow of actually conceiving of and launching a project within a year?&lt;/p>
&lt;p>But seriously, funders, were interested to see how they could use the funder metadata being collected in Crossref. In particular, some funding agencies were interested in being able to measure Key Performance Indicators (“KPIs” to management wonks) related to recent mandates such as the February 22nd 2013 OSTP memo, &lt;em>&lt;a href="http://www.whitehouse.gov/blog/2013/02/22/expanding-public-access-results-federally-funded-research" target="_blank">Public Access to the Results of Federally Funded Research&lt;/a>.&lt;/em> Two groups also approached us, &lt;a href="http://chorusaccess.org/" target="_blank">CHORUS&lt;/a> and &lt;a href="https://www.arl.org/resources/shared-access-research-ecosystem-share-proposal/" target="_blank">SHARE&lt;/a>. Both are interested in exploring how to build reporting tools for funders, institutions and researchers and each brought us a gigantic hairball of use-cases they were hoping we would be able to meet.&lt;/p>
&lt;p>Conveniently, we were in the process of creating a revised, modern Crossref API that is entirely &lt;a href="http://en.wikipedia.org/wiki/Buzzword_compliant" target="_blank">buzzword-compliant&lt;/a>, and so we set to work…&lt;/p>
&lt;p>We thought people might be interested in seeing what you can do with the Crossref &lt;a href="http://en.wikipedia.org/wiki/Representational_state_transfer" target="_blank">REST&lt;/a> API in relation to funding information and the expectations that are increasingly being attached to them. CHORUS is already using the Crossref REST API heavily and we expect that SHARE will soon start making use of it as well. The feedback from both groups has been very useful, but we are looking for broader feedback as well. The API is still in development, so now is your chance to help us shape it.&lt;/p>
&lt;h2 id="brief-examples">Brief Examples&lt;/h2>
&lt;p>&lt;em>Please note&lt;/em>, the following are APIs calls, although you can copy and paste the URIs into your browser, the data is returned in a machine readable representation called &lt;a href="http://en.wikipedia.org/wiki/JSON" target="_blank">JSON&lt;/a>. If you want the results to look a little more presentable, we advise you install the JSONView plugin:&lt;/p>
&lt;ul>
&lt;li>Firefox Users: &lt;a href="http://jsonview.com/" target="_blank">JSONView&lt;/a>&lt;/li>
&lt;li>Chrome Users: &lt;a href="https://chrome.google.com/webstore/detail/jsonview/chklaanhfefbnpoihckbnefhakgolnmc" target="_blank">JSONView&lt;/a>&lt;/li>
&lt;/ul>
&lt;p>Also note that publishers have only just started to deposit the metadata needed for these APIs to work, so the data is currently sparse. We know that many of our members are working feverishly to populate more of the needed metadata, but this requires updates to the their manuscript tracking systems, production systems and hosting systems. It takes time.&lt;/p>
&lt;p>But for now you can paste the relevant URIs below into your browser and see the results that we do have. Expect these numbers to increase sharply over the next few months&lt;/p>
&lt;p>To start with, you might want to know how many articles in Crossref have FundRef metadata:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/works?filter=has-funder:true&amp;amp;rows=0
&lt;/code>&lt;/pre>
&lt;p>You could then be interested in knowing how many works in Crossref use FundRef to credit the United States’ National Science Foundation (NSF) for funding their research? First you need to find out what the FundRef identifier is for the NSF:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/funders?query=NSF
&lt;/code>&lt;/pre>
&lt;p>You can see that there are several entries that match “NSF”, and that the one we are looking for has the identifier &lt;code>http://dx.doi.org/10.13039/100000001&lt;/code>. Remember, funding agency names can change frequently, the ID provides a persistent link to the funder even if their name changes.&lt;/p>
&lt;p>If you are curious, you can see the details for the NSF entry, including its location, parent and child organisations:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/funders/10.13039/100000001
&lt;/code>&lt;/pre>
&lt;p>Notice that the results also lists the &lt;code>work-count&lt;/code>. This is the number of works in the Crossref metadata that list the US NSF as having funded the research.&lt;/p>
&lt;p>So perhaps you would like to see the list of works. The following will list the first twenty:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/funders/10.13039/100000001/works
&lt;/code>&lt;/pre>
&lt;p>You can page through the results with the offset argument:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/funders/10.13039/100000001/works?offset=20
https://api.crossref.org/v1/funders/10.13039/100000001/works?offset=40
...
&lt;/code>&lt;/pre>
&lt;p>How many works that have listed the NSF as a funder have license information:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/funders/10.13039/100000001/works?filter=has-license:true&amp;amp;rows=0
&lt;/code>&lt;/pre>
&lt;p>Lets see the first batch that have license information:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/funders/10.13039/100000001/works?filter=has-license:true
&lt;/code>&lt;/pre>
&lt;p>Lets look at the metadata for one of the DOIs returned:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/works/10.1063/1.3593378
&lt;/code>&lt;/pre>
&lt;p>Interesting, the metadata shows an article published by &lt;a href="http://www.aip.org/" target="_blank">AIP&lt;/a>. It includes license information (CC-BY 3.0) as well as a link to the full text. If you follow the link to the full text, you can retrieve it:&lt;/p>
&lt;pre>&lt;code>http://link.aip.org/link/applab/v98/i21/p216101/pdf/CHORUS
&lt;/code>&lt;/pre>
&lt;p>Wow- A pretty short article. But you can see that it does credit the NSF and that the award number recorded in the text is the same as the award number recorded in the FundRef section of the Crossref metadata. Yay.&lt;/p>
&lt;p>You can see in the brief examples above that there is a lot of other metadata you may want to query on and explore. It can include ORCIDS, information about archiving arrangements- even abstracts. It all depends on what the Crossref member has decided to provide.&lt;/p>
&lt;p>You can get a simple overview of what a Crossref member has provided by looking at a member summary. Here is an example for &lt;a href="http://www.hindawi.com/" target="_blank">Hindawi&lt;/a>:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/members?query=hindawi
&lt;/code>&lt;/pre>
&lt;p>Note again that names are fickle, so the above query can also be accomplished using the member identifier like this:&lt;/p>
&lt;pre>&lt;code>https://api.crossref.org/v1/members/98
&lt;/code>&lt;/pre>
&lt;p>Groovy init?&lt;/p>
&lt;p>If you want more pointers on where you can learn how to use the API, read on…&lt;/p>
&lt;h2 id="more-examples-and-documentation">More examples and documentation.&lt;/h2>
&lt;p>We have a draft of the &lt;a href="https://api.crossref.org" target="_blank">full documentation for the Crossref REST API&lt;/a>. Note that this is undergoing active revision and we ask that you look at the updated documentation if things that once work cease to. We would also love your feedback and suggestions. Send them to:&lt;/p>
&lt;p>&lt;img src="https://www.crossref.org/wp/blog/uploads/2013/01/labs_email.png" alt="email address">&lt;/p>
&lt;p>We often get asked “what metadata does a publisher need to provide in order to enable this kind of functionality?” To answer that, we have developed a document titled &lt;a href="https://github.com/CrossRef/rest-api-doc/blob/master/funder_kpi_metadata_best_practice.md" target="_blank">Crossref metadata best practice to support key performance indicators (KPIs) for funding agencies&lt;/a>. Try saying that ten times very fast.&lt;/p>
&lt;h2 id="the-future-of-the-crossref-rest-api">The Future of the Crossref REST API.&lt;/h2>
&lt;p>Our aim is for the Crossref REST API to go into production this Summer (2014). As with most of our newer APIs, there will be a free API for public use and a paid for API for professional use. The only difference between the two will be that the professional version will come with a service level agreement (SLA) covering uptime, response time and support. Naturally, this also means that the professional one will be on dedicated hosting equipment so that we can meet these SLAs, whereas the performance of the free version will be subject to the vicissitudes inherent in using a shared, constrained resource (i.e. the server and network it is running on).&lt;/p>
&lt;p>Again, the basics of the API are in place. It should be fairly stable, but we do reserve the right to make changes to it over the next few months. Please send us feedback.&lt;/p>
&lt;p>— The Weasel&lt;/p></description></item><item><title>Many Metrics. Such Data. Wow.</title><link>https://www.crossref.org/blog/many-metrics-such-data-wow/</link><pubDate>Mon, 24 Feb 2014 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/many-metrics-such-data-wow/</guid><description>&lt;p>[&lt;img class=" wp-image-302 alignnone" title="many metrics. such data. wow." src="https://www.crossref.org/wp/blog/uploads/2014/02/many_metrics.jpg" alt="many_metrics" width="288" height="288" srcset="https://www.crossref.org/wp/blog/uploads/2014/02/many_metrics.jpg 480w, https://www.crossref.org/wp/blog/uploads/2014/02/many_metrics-150x150.jpg 150w, https://www.crossref.org/wp/blog/uploads/2014/02/many_metrics-300x300.jpg 300w" sizes="(max-width: 288px) 85vw, 288px" />&lt;/p>
&lt;blockquote>
&lt;p>Crossref Labs loves to be the last to jump on an internet trend, so what better than than to combine the &lt;a href="http://en.wikipedia.org/wiki/Doge_(meme)" target="_blank">Doge meme&lt;/a> with &lt;a href="http://en.wikipedia.org/wiki/Altmetrics" target="_blank">altmetrics&lt;/a>?&lt;/p>
&lt;/blockquote>
&lt;p>&lt;strong>Note:&lt;/strong> The API calls below have been superceeded with the development of the Event Data project. See &lt;a href="http://eventdata.crossref.org/" target="_blank">the latest API documentation&lt;/a> for equivalent functionality&lt;/p>
&lt;p>Want to know how many times a Crossref DOI is cited by the Wikipedia?&lt;/p>
&lt;pre tabindex="0">&lt;code>http://det.labs.crossref.org/works/doi/10.1371/journal.pone.0086859
&lt;/code>&lt;/pre>&lt;p>Or how many times one has been mentioned in Europe PubMed Central?&lt;/p>
&lt;pre tabindex="0">&lt;code>http://det.labs.crossref.org/works/doi/10.1016/j.neuropsychologia.2013.10.021
&lt;/code>&lt;/pre>&lt;p>Or DataCite?&lt;/p>
&lt;pre tabindex="0">&lt;code>http://det.labs.crossref.org/works/doi/10.1111/jeb.12289
&lt;/code>&lt;/pre>&lt;h2 id="background">Background&lt;/h2>
&lt;p>Back in 2011 &lt;a href="http://www.plos.org/" target="_blank">PLOS&lt;/a> released its awesome &lt;a href="https://web.archive.org/web/20190118175222if_/https://www.plos.org/article-level-metrics" target="_blank">ALM system&lt;/a> as &lt;a href="http://en.wikipedia.org/wiki/Open-source_software" target="_blank">open source software&lt;/a> (OSS). At &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a>, we thought it might be interesting to see what would happen if we ran our own instance of the system and loaded it up with a few Crossref DOIs. So we did. And the code fell over. Oops. Somehow it didn’t like dealing with 10 million DOIs. Funny that.&lt;/p>
&lt;p>But the beauty of OSS is that we were able to work with PLOS to scale the code to handle our volume of data. Crossref contracted with &lt;a href="http://cottagelabs.com/" target="_blank">Cottage Labs&lt;/a>  and we both worked with PLOS to make changes to the system. These eventually got fed back into the main &lt;a href="https://github.com/articlemetrics/alm/" target="_blank">ALM source on Github&lt;/a>. Now everybody benefits from our work. Yay for OSS.&lt;/p>
&lt;p>So if you want to know technical details, skip to &lt;a href="#details">Details for Propellerheads&lt;/a>. But if you want to know why we did this, and what we plan to do with it, read on.&lt;/p>
&lt;h2 id="span-whyspan">&lt;span >Why?&lt;/span>&lt;/h2>
&lt;p dir="ltr">
&lt;span >There are (cough) some problems in our industry that we can best solve with shared infrastructure. When publishers first put scholarly content online, they used to make bilateral reference linking agreements. These agreements allowed them to link citations using each other’s proprietary reference linking APIs. But this system didn’t scale. It was too time-consuming to negotiate all the agreements needed to link to other publishers. And linking through many proprietary citation APIs was too complex and too fragile. So the industry founded Crossref to create a common, cross-publisher citation linking API. Crossref has since obviated the need for bilateral linking arrangements.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >So-called &lt;a href="http://en.wikipedia.org/wiki/Altmetrics" target="_blank">altmetrics&lt;/a> look like they might have similar characteristics. You have ~4000 Crossref member publishers and N sources (e.g. Twitter, Mendeley, Facebook, CiteULike, etc.) where people use (e.g. discuss, bookmark, annotate, etc.) scholarly publications. Publishers could conceivably each choose to run their own system to collect this information. But if they did, they would face the following problems:&lt;/span>
&lt;/p>
&lt;ul>
&lt;li>&lt;span >The N sources will be volatile. New ones will emerge. Old ones will vanish.&lt;/span>&lt;/li>
&lt;li>&lt;span >Each publisher will need to deal with each source’s different APIs, rate limits, T&amp;amp;Cs, data licenses, etc. This is a logistical headache for both the publishers and for the sources.&lt;/span>&lt;/li>
&lt;li>&lt;span >If publishers use different systems which in turn look at different sources, it will be difficult to compare results across publishers.&lt;/span>&lt;/li>
&lt;li>&lt;span >If a journal moves from one publisher to another, then how are the metrics for that journal’s articles going to follow the journal? This isn’t a complete list, but it shows that there might be some virtue in publishers sharing an infrastructure for collecting this data. But what about commercial providers? Couldn’t they provide these ALM services? Of course - and some of them currently do. But normally they look on the actual collection of this data as a means to an end. The real value they provide is in the analysis, reporting and tools that they build on top of the data. Crossref has no interest in building front-ends to this data. If there is a role for us to play here, it is simply in the collection and distribution of the data.&lt;/span>&lt;/li>
&lt;/ul>
&lt;h2 id="span-no-really-whyspan">&lt;span >No, really, WHY?&lt;/span>&lt;/h2>
&lt;p dir="ltr">
&lt;span >Aren’t these altmetrics &lt;a href="https://web.archive.org/web/20170112105521/https://scholarlyoa.com/2013/08/01/article-level-metrics/" target="_blank">an ill-conceived and meretricious idea&lt;/a>? By providing this kind of information, isn’t Crossref just encouraging feckless, &lt;a href="http://blogs.lse.ac.uk/impactofsocialsciences/2014/01/27/its-the-neoliberalism-stupid-kansa/" target="_blank">neoliberal university administrators&lt;/a> to hasten academia’s slide into a &lt;a href="http://en.wikipedia.org/wiki/Stakhanovite_movement" target="_blank">Stakhanovite&lt;/a> dystopia? Can’t these systems be gamed?&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >FOR THE LOVE OF &lt;a href="http://en.wikipedia.org/wiki/Flying_Spaghetti_Monster" target="_blank">FSM&lt;/a>, WHY IS CROSSREF DABBLING IN SOMETHING OF SUCH QUESTIONABLE VALUE?&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >takes deep breath. wipes spittle from beard&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >These are all serious concerns. &lt;a href="http://en.wikipedia.org/wiki/Goodhart's_law" target="_blank">Goodhart’s Law&lt;/a> and all that… If a university’s appointments and promotion committee is largely swayed by &lt;a href="http://en.wikipedia.org/wiki/Impact_factor" target="_blank">Impact Factor&lt;/a>, it won’t improve a thing if they substitute or supplement Impact Factor with altmetrics. &lt;a href="http://www.linkedin.com/profile/view?id=8488638&amp;authType=NAME_SEARCH&amp;authToken=6zaC&amp;locale=en_US&amp;srchid=4700671392208272787&amp;srchindex=1&amp;srchtotal=32&amp;trk=vsrp_people_res_name&amp;trkInfo=VSRPsearchId%3A4700671392208272787%2CVSRPtargetId%3A8488638%2CVSRPcmpt%3Aprimary" target="_blank">Amy Brand&lt;/a> has repeatedly pointed out, &lt;a href="http://article-level-metrics.plos.org/files/2013/10/Brand.pptx" target="_blank">the best institutions simply don’t use metrics this way at all&lt;/a> (PowerPoint presentation). They know better.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >But yes, it is still likely that some powerful people will come to lazy conclusions based on altmetrics. And following that, other lazy, unscrupulous and opportunistic people will attempt to game said metrics. We may even see an industry emerge to exploit this mess and provide the scholarly equivalent of &lt;a href="http://en.wikipedia.org/wiki/Search_engine_optimization" target="_blank">SEO&lt;/a>. Feh. Now I’m depressed and I need a drink.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >So again, why is Crossref doing this? Though we have our doubts about how effective altmetrics will be in evaluating the quality of content, we do believe that they are a useful tool for understanding how scholarly content is used and interpreted. &lt;em>The most eloquent arguments against altmetrics for measuring quality, inadvertently make the case for altmetrics as a tool for monitoring attention.&lt;/em>&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >Critics of altmetrics point out that much of the attention that research receives outside of formal scholarly communications channels can be ascribed to:&lt;/span>
&lt;/p>
&lt;ul>
&lt;li>&lt;span >Puffery. Researchers and/or university/publisher “&lt;a href="http://www.dcscience.net/?p=6369" target="_blank">PR wonks&lt;/a>” over-promoting research results.&lt;/span>&lt;/li>
&lt;li>&lt;span >Innocent misinterpretation. A lay audience simply doesn’t understand the research results.&lt;/span>&lt;/li>
&lt;li>&lt;span >Deliberate misinterpretation. Ideologues misrepresent research results to support their agendas.&lt;/span>&lt;/li>
&lt;li>&lt;span >Salaciousness. The research appears to be about sex, drugs, crime, video games or other popular bogeymen.&lt;/span>&lt;/li>
&lt;li>&lt;span >Neurobollocks. &lt;a href="https://web.archive.org/web/20160405135736/http://www.wired.co.uk/news/archive/2012-11/08/neurobollocks" target="_blank">A category unto itself these days&lt;/a>.&lt;/span>&lt;/li>
&lt;/ul>
&lt;p dir="ltr">
&lt;span >In short, scholarly research might be misinterpreted. Shock horror. Ban all metrics. Whew. That won’t happen again.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >Scholarly research has always been discussed outside of formal scholarly venues. Both by scholars themselves and by interested laity. Sometimes these discussions advance the scientific cause. Sometimes they undermine it. The University of Utah didn’t depend on widespread Internet access or social networks to promote &lt;a href="http://en.wikipedia.org/wiki/Cold_fusion" target="_blank">yet-to-be peer-reviewed claims about cold fusion&lt;/a>. That was just old-fashioned analogue puffery. And the Internet played no role in the Laetrile or&lt;a href="http://www.cancer.org/treatment/treatmentsandsideeffects/complementaryandalternativemedicine/pharmacologicalandbiologicaltreatment/dmso" target="_blank"> DMSO crazes of the 1980s&lt;/a>. You see, there were once these things called “&lt;a href="http://en.wikipedia.org/wiki/Newspaper" target="_blank">newspapers.&lt;/a>” And another thing called “&lt;a href="http://en.wikipedia.org/wiki/Television" target="_blank">television.&lt;/a>” And a sophisticated &lt;a href="http://www.urbandictionary.com/define.php?term=meatspace" target="_blank">meatspace&lt;/a>-based social network called a “&lt;a href="http://en.wikipedia.org/wiki/Town_square" target="_blank">town square&lt;/a>.”&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >But there are critical differences between then and now. As &lt;a href="https://obamawhitehouse.archives.gov/blog/2013/02/22/expanding-public-access-results-federally-funded-research" target="_blank">citizens get more access to the scholarly literature&lt;/a>, it is far more likely that research is going to be discussed outside of formal scholarly venues. Now we can build tools to help researchers track these discussions. Now researchers can, if they need to, engage in the conversations as well. One would think that conscientious researchers would see it as their responsibility to remain engaged, to know how their research is being used. And especially to know when it is being misused.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >That isn’t to say that we expect researchers will welcome this task. We are no Pollyannas. Researchers are already famously overstretched. They &lt;a href="https://ddoi.org/10.1016/j.lisr.2009.02.002" target="_blank">barely have time to keep up with the formally published literature&lt;/a>. It seems cruel to expect them to keep up with the firehose of the Internet as well.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >Which gets us back to the value of altmetrics tools. Our hope is that, as altmetrics tools evolve, they will provide publishers and researchers with an efficient mechanism for monitoring the use of their content in non-traditional venues. Just in the way that citations were used before they were distorted into proxies for credit and kudos.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >We don’t think altmetrics are there yet. Partly because some parties are still tantalized by the prospect of usurping one metric for another. But mostly because the entire field is still nascent. People don’t yet know how the information can be combined and used effectively. So we still make naive assumptions such as “link=like” and “more=better.” Surely it will eventually occur to somebody that, instead, there may be a connection between &lt;a href="http://www.nytimes.com/2013/04/28/magazine/diederik-stapels-audacious-academic-fraud.html?_r=1&amp;" target="_blank">repeated headline-grabbing research and academic fraud&lt;/a>. A neuroscientist might be interested in a tool that alerts them if the MRI scans in their research paper are being misinterpreted on the web to promote neurobollocks. An immunologist may want to know if their research is being misused by the anti-vaccination movement. Perhaps the real value in gathering this data will be seen when somebody builds tools to help researchers DETECT puffery, social-citation cabals, and misinterpretation of research results?&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >But Crossref won’t be building those tools. What we might be able to do is help others overcome another hurdle that blocks the development of more sophisticated tools; getting hold of the needed data in the first place. This is why we are dabbling in altmetrics.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >Wikipedia is already the 8th largest referrer of Crossref DOIs. Note that this doesn’t just mean that the Wikipedia cites lots of Crossref DOIs, it means that people actually click on and follow those DOIs to the scholarly literature. As scholarly communication transcends traditional outlets and as the audience for scholarly research broadens, we think that it will be more important for publishers and researcher to be aware of how their research is being discussed and used. They may even need to engage more with non-scholarly audiences. In order to do this, they need to be aware of the conversations. Crossref is providing this experimental data source in the hope that we can spur the development of more sophisticated tools for detecting and analyzing these conversations. Thankfully, this is an inexpensive experiment to conduct - largely thanks to the decision on the part of PLOS to open source its ALM code.&lt;/span>
&lt;/p>
&lt;h2 id="what-now">What Now?&lt;/h2>
&lt;p dir="ltr">
Crossref’s instance of PLOS’s ALM code is an experiment. We mentioned that we had encountered scalability problems and that we had resolved some of them. But there are still big scalability issues to address. For example, assuming a response time of 1 second, if we wanted to poll the English-language version of the Wikipedia to see what had cited each of the 65 million DOIs held in Crossref, the process would take years to complete. But this is how the system is designed to work at the moment.&lt;span > It polls various source APIs to see if a particular DOI is “mentioned”. Parallelizing the queries might reduce the amount of time it takes to poll the Wikipedia, but it doesn’t reduce the work. Another obvious way in which we could improve the scalability of the system is to add a push mechanism to supplement the pull mechanism. Instead of going out and polling the Wikipedia 65 million times, we could establish a &amp;#8220;scholarly &lt;a href="http://en.wikipedia.org/wiki/Linkback" target="_blank">linkback&lt;/a>” mechanism that would allow third parties to alert us when DOIs and other scholarly identifiers are referenced (e.g. cited, bookmarked, shared). If the Wikipedia used this, then even in an extreme case scenario (i.e. everything in Wikipedia cites at least one Crossref DOI), this would mean that we would only need to process ~ 4 million trackbacks.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >The other significant advantage of adding a push API is that it would take the burden off of Crossref to know what sources we want to poll. At the moment, if a new source comes online, we’d need to know about it and build a custom plugin to poll their data. This needlessly disadvantages new tools and services as it means that their data will not be gathered until they are big enough for us to pay attention to. If the service in question addresses a niche of the scholarly ecosystem, they may never become big enough. But if we allow sources to push data to us using a common infrastructure, then new sources do not need to wait for us to take notice before they can participate in the system.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >Supporting (potentially) many new sources will raise another technical issue- tracking and maintaining the provenance of the data that we gather. The current ALM system does a pretty good job of keeping data, but if we ever want third parties to be able to rely on the system, we probably need to extend the provenance information so that the data is cheaply and easily auditable.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >Perhaps the most important thing we want to learn from running this experimental ALM instance is: what it would take to run the system as a production service? What technical resources would it require? How could they be supported? And from this we hope to gain enough information to decide whether the service is worth running and, if so, by whom. Crossref is just one of several organisations that could run such a service, but it is not clear if it would be the best one. We hope that as we work with PLOS, our members and the rest of the scholarly community, we’ll get a better idea of how such a service should be governed and sustained.&lt;/span>
&lt;/p>
&lt;h2 id="details">&lt;span >Details for Propellerheads&lt;/span>&lt;/h2>
&lt;h3 dir="ltr">
&lt;span >Warning, Caveats and Weasel Words&lt;/span>
&lt;/h3>
&lt;p dir="ltr">
&lt;span >The Crossref ALM instance is a &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a> project. It is running on R&amp;D equipment in a non-production environment administered by an orangutang on a diet of Redbulls and vodka.&lt;/span>
&lt;/p>
&lt;h3 dir="ltr">
&lt;span >So what is working?&lt;/span>
&lt;/h3>
&lt;p dir="ltr">
&lt;span >The system has been initially loaded with 317,500+  Crossref DOIs representing publications from 2014. We will load more DOIs in reverse chronological order until we get bored or until the system falls over again.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >We have activated the following sources:&lt;/span>
&lt;/p>
&lt;li dir="ltr">
&lt;span >PubMed&lt;/span>
&lt;/li>
&lt;li dir="ltr">
&lt;span >DataCite&lt;/span>
&lt;/li>
&lt;li dir="ltr">
&lt;span >PubMedCentral Europe Citations and Usage&lt;/span>
&lt;/li>
&lt;p dir="ltr">
&lt;span >We have data from the following sources but will need some work to achieve stability:&lt;/span>
&lt;/p>
&lt;li dir="ltr">
&lt;span >Facebook&lt;/span>
&lt;/li>
&lt;li dir="ltr">
&lt;span >Wikipedia&lt;/span>
&lt;/li>
&lt;li dir="ltr">
&lt;span >CiteULike&lt;/span>
&lt;/li>
&lt;li dir="ltr">
&lt;span >Twitter&lt;/span>
&lt;/li>
&lt;li dir="ltr">
&lt;span >Reddit&lt;/span>
&lt;/li>
&lt;p dir="ltr">
&lt;span >Some of them are faster than others. Some are more temperamental than others. WordPress, for example, seems to go into a sulk and shut itself off  after approximately 1,300 API calls.&lt;/span>
&lt;/p>
&lt;p dir="ltr">
&lt;span >In any case, we will be monitoring and tweaking the sources as we gather data. We will also add new sources as we get requested API keys. We will probably even create one or two new sources ourselves. Watch this blog and we’ll update you as we add/tweak sources.&lt;/span>
&lt;/p>
&lt;h3 dir="ltr">
&lt;span >Dammit, shut up already and tell me how to query stuff.&lt;/span>
&lt;/h3>
&lt;p dir="ltr">
&lt;span >You can &lt;a href="#" target="_blank">login to the Crossref ALM instance&lt;/a> simply using a &lt;a href="" target="_blank">Mozilla Persona&lt;/a> (yes, we’d eventually like to support ORCID too). Once logged-in, &lt;a href="" target="_blank">your account page&lt;/a> will list an API key. Using the API key, you can do things like:&lt;/span>
&lt;/p>
&lt;pre tabindex="0">&lt;code>http://det.labs.crossref.org/api/v5/articles?ids=10.1038/nature12990
&lt;/code>&lt;/pre>&lt;p>&lt;span >And you will see that (as of this writing), said Nature article has been cited by the Wikipedia article here:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;code>&lt;a href="http://en.wikipedia.org/wiki/HE0107-5240">&lt;a href="https://en.wikipedia.org/wiki/HE0107-5240#cite_ref-Keller2014_4-0;" target="_blank">https://en.wikipedia.org/wiki/HE0107-5240#cite_ref-Keller2014_4-0;&lt;/a>&lt;/code>&lt;/span>&lt;/p>
&lt;p dir="ltr">
&lt;span >PLOS has provided &lt;a href="#" target="_blank"> lovely detailed instructions for using the API&lt;/a>- &lt;span >So, please, play with the API and see what you make of it. On our side we will be looking at how we can improve performance and expand coverage. We don’t promise much- the logistics here are formidable. As we said above, once you start working with millions of documents, the polling process starts to hit API walls quickly. But that is all part of the experiment. We appreciate your helping us and would like your feedback. We can be contacted at:&lt;/span>&lt;/span>
&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/uploads/2013/01/labs_email.png">&lt;img class="alignnone size-full wp-image-261" src="https://www.crossref.org/wp/blog/uploads/2013/01/labs_email.png" alt="labs_email" width="233" height="42" />&lt;/a>&lt;/p></description></item><item><title>DOIs unambiguously and persistently identify published, trustworthy, citable online scholarly literature. Right?</title><link>https://www.crossref.org/blog/dois-unambiguously-and-persistently-identify-published-trustworthy-citable-online-scholarly-literature-right/</link><pubDate>Fri, 20 Sep 2013 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/dois-unambiguously-and-persistently-identify-published-trustworthy-citable-online-scholarly-literature-right/</guid><description>&lt;h2 id="span-span">&lt;span > &lt;/span>&lt;/h2>
&lt;p>&lt;span >The South Park movie , “Bigger, Longer &amp;amp; Uncut” has a DOI:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>a)&lt;/strong> &lt;a href="http://dx.doi.org/10.5240/B1FA-0EEC-C316-3316-3A73-L">&lt;a href="http://dx.doi.org/10.5240/B1FA-0EEC-C316-3316-3A73-L" target="_blank">http://dx.doi.org/10.5240/B1FA-0EEC-C316-3316-3A73-L&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >So does the pornographic movie, “Young Sex Crazed Nurses”:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>b)&lt;/strong> &lt;a href="http://dx.doi.org/10.5240/4CF3-57AB-2481-651D-D53D-Q">&lt;a href="http://dx.doi.org/10.5240/4CF3-57AB-2481-651D-D53D-Q" target="_blank">http://dx.doi.org/10.5240/4CF3-57AB-2481-651D-D53D-Q&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >And the following DOI points to a fake article on a “Google-Based Alien Detector”:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>c)&lt;/strong> &lt;a href="http://dx.doi.org/10.6084/m9.figshare.93964">&lt;a href="http://dx.doi.org/10.6084/m9.figshare.93964" target="_blank">http://dx.doi.org/10.6084/m9.figshare.93964&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >And the following DOI refers to an &lt;a href="http://en.wikipedia.org/wiki/Sokal_affair">infamous fake article&lt;/a> on literary theory:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>d)&lt;/strong> &lt;a href="http://dx.doi.org/10.2307/466856">&lt;a href="http://dx.doi.org/10.2307/466856" target="_blank">http://dx.doi.org/10.2307/466856&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >This scholarly article discusses the entirely fictitious Australian “Drop Bear”:&lt;/span>&lt;/p>
&lt;p>&lt;span >e) &lt;a href="http://dx.doi.org/10.1080/00049182.2012.731307">&lt;a href="http://dx.doi.org/10.1080/00049182.2012.731307" target="_blank">http://dx.doi.org/10.1080/00049182.2012.731307&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >The following two DOIs point to the same article- the first DOI points to the final author version, and the second DOI points to the final published version:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>f)&lt;/strong> &lt;a href="https://web.archive.org/web/20160423204031/https://figshare.com/articles/Relating_ion_channel_expression,_bifurcation_structure,_and_diverse_firing_patterns_in_a_model_of_an_identified_motor_neuron/96546">&lt;a href="https://web.archive.org/web/20160423204031/https://figshare.com/articles/Relating_ion_channel_expression,_bifurcation_structure,_and_diverse_firing_patterns_in_a_model_of_an_identified_motor_neuron/96546" target="_blank">https://web.archive.org/web/20160423204031/https://figshare.com/articles/Relating_ion_channel_expression,_bifurcation_structure,_and_diverse_firing_patterns_in_a_model_of_an_identified_motor_neuron/96546&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>g)&lt;/strong> &lt;a href="http://dx.doi.org/10.1007/s10827-012-0416-6">&lt;a href="http://dx.doi.org/10.1007/s10827-012-0416-6" target="_blank">http://dx.doi.org/10.1007/s10827-012-0416-6&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >This following two DOIs point to the same article- there is no apparent difference between the two copies:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>h)&lt;/strong> &lt;a href="http://dx.doi.org/10.6084/m9.figshare.91541">&lt;a href="http://dx.doi.org/10.6084/m9.figshare.91541" target="_blank">http://dx.doi.org/10.6084/m9.figshare.91541&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>i)&lt;/strong> &lt;a href="http://dx.doi.org/10.1038/npre.2012.7151.1">&lt;a href="http://dx.doi.org/10.1038/npre.2012.7151.1" target="_blank">http://dx.doi.org/10.1038/npre.2012.7151.1&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >Another example where two DOIs point to the same article and there is no apparent difference between the two copies:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>j)&lt;/strong> &lt;a href="http://dx.doi.org/10.1364/AO.39.005477">&lt;a href="http://dx.doi.org/10.1364/AO.39.005477" target="_blank">http://dx.doi.org/10.1364/AO.39.005477&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>k)&lt;/strong> &lt;a href="http://dx.doi.org/10.3929/ethz-a-005707391">&lt;a href="http://dx.doi.org/10.3929/ethz-a-005707391" target="_blank">http://dx.doi.org/10.3929/ethz-a-005707391&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >These journals assigned DOIs, but not through Crossref:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>l)&lt;/strong> &lt;a href="http://dx.doi.org/10.3233/BIR-2008-0496">&lt;a href="http://dx.doi.org/10.3233/BIR-2008-0496" target="_blank">http://dx.doi.org/10.3233/BIR-2008-0496&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>m)&lt;/strong> &lt;a href="https://web.archive.org/web/20160423192452/https://figshare.com/articles/Role_of_brain_glutamic_acid_metabolism_changes_in_neurodegenerative_pathologies/95564">&lt;a href="https://web.archive.org/web/20160423192452/https://figshare.com/articles/Role_of_brain_glutamic_acid_metabolism_changes_in_neurodegenerative_pathologies/95564" target="_blank">https://web.archive.org/web/20160423192452/https://figshare.com/articles/Role_of_brain_glutamic_acid_metabolism_changes_in_neurodegenerative_pathologies/95564&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>n)&lt;/strong> &lt;a href="https://web.archive.org/web/20160423192452/https://figshare.com/articles/Role_of_brain_glutamic_acid_metabolism_changes_in_neurodegenerative_pathologies/95564">&lt;a href="http://dx.doi.org/10.3205/cto000081" target="_blank">http://dx.doi.org/10.3205/cto000081&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >These two DOIs are assigned to two different data sets by two different RAs:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>o)&lt;/strong> &lt;a href="http://dx.doi.org/10.1107/S0108767312019034/eo5016sup1.xls">&lt;a href="http://dx.doi.org/10.1107/S0108767312019034/eo5016sup1.xls" target="_blank">http://dx.doi.org/10.1107/S0108767312019034/eo5016sup1.xls&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>p)&lt;/strong> &lt;a href="http://dx.doi.org/10.1594/PANGAEA.726855">&lt;a href="http://dx.doi.org/10.1594/PANGAEA.726855" target="_blank">http://dx.doi.org/10.1594/PANGAEA.726855&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >This DOI appears to have been published, but was not registered until well after it was published. There were 254 unsuccessful attempts to resolve it in September 2012 alone:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;strong>q)&lt;/strong> &lt;a href="http://dx.doi.org/10.4233/uuid:995dd18a-dc5d-4a9a-b9eb-a16a07bfcc6d">&lt;a href="http://dx.doi.org/10.4233/uuid:995dd18a-dc5d-4a9a-b9eb-a16a07bfcc6d" target="_blank">http://dx.doi.org/10.4233/uuid:995dd18a-dc5d-4a9a-b9eb-a16a07bfcc6d&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >The owner of prefix, ‘10.4223,’ who is responsible for the above DOI had 378,790 attempted resolutions in September 2012 of which there were 377,001 failures. The top 10 DOI failures for this prefix each garnered over 200 attempted resolutions. As of November 2012 the prefix had only registered 349 DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >Of the above 16 example DOIs 11 cannot be used for &lt;a href="http://www.crossref.org/crosscheck/index.html" target="_blank">CrossCheck&lt;/a> or &lt;a href="https://www.crossref.org/services/crossmark/" target="_blank">Crossmark&lt;/a>. 3 cannot be used with &lt;a href="https://en.wikipedia.org/wiki/Content_negotiation" target="_blank">content negotiation&lt;/a>. To search metadata for the above examples, you need to visit four sites:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org/">&lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org/" target="_blank">https://web.archive.org/web/20131229210637/http://search.crossref.org/&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://ui.eidr.org/search">&lt;a href="https://ui.eidr.org/search" target="_blank">https://ui.eidr.org/search&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.medra.org/en/search.htm">&lt;a href="https://www.medra.org/en/search.htm" target="_blank">https://www.medra.org/en/search.htm&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://search.datacite.org/">&lt;a href="https://search.datacite.org/" target="_blank">https://search.datacite.org/&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >The 14 examples come from just 4 of the 8 existing&lt;a href="http://www.doi.org/registration_agencies.html" target="_blank"> DOI registration agencies&lt;/a> (RAs) It is virtually impossible for somebody without specialized knowledge to tell which DOIs are Crossref DOIs and which ones are not.&lt;/span>&lt;/p>
&lt;h2 id="span-backgroundspan">&lt;span >Background&lt;/span>&lt;/h2>
&lt;p>&lt;span >So DOIs unambiguously and persistently identify published, trustworthy, citable online scholarly literature. Right? Wrong.&lt;/span>&lt;/p>
&lt;p>&lt;span >The examples above are useful because they help elucidate some misconceptions about the DOI itself, the nature of the DOI registration agencies and, in particular issues being raised by new RAs and new DOI allocation models.&lt;/span>&lt;/p>
&lt;h2 id="span-dois-are-just-identifiersspan">&lt;span >DOIs are just identifiers&lt;/span>&lt;/h2>
&lt;p>&lt;span >Crossref’s dominance as the primary DOI registration agency makes it easy to assume Crossref’s &lt;em>particular&lt;/em> application of the DOI as a scholarly citation identifier is somehow intrinsic to the DOI. The truth is, the DOI has nothing specifically to do with citation or scholarly publishing. It is simply an identifier that can be used for virtually any application. DOIs could be used as serial numbers on car parts, as supply-chain management identifiers for videos and music or as cataloguing numbers for museum artifacts. The first two identifiers listed in the examples &lt;strong>(a &amp;amp; b)&lt;/strong> illustrate this. They both belong to &lt;a href="http://www.movielabs.com/" target="_blank">MovieLabs&lt;/a> and are part of the &lt;a href="http://eidr.org/" target="_blank">EIDR&lt;/a> (Entertainment Identifier Registry) effort to create a unique identifier for television and movie assets. At the moment, the DOIs that MoveLabs are assigning are B2B-focused and users are unlikely to see them in the wild. But we should recall that Crossref’s application of DOIs was also initially considered a B2B identifier- but it has since become widely recognized and depended on by researchers, librarians and third parties. The visibility of EIDR DOIs could change rapidly as they become more popular.&lt;/span>&lt;/p>
&lt;h2 id="span-multiple-dois-can-be-assigned-to-the-same-objectspan">&lt;span >Multiple DOIs can be assigned to the same object&lt;/span>&lt;/h2>
&lt;p>&lt;span >There is no &lt;a href="http://www.doi.org/" target="_blank">International DOI Foundation&lt;/a> (IDF) prohibition against assigning multiple DOIs to the same object. At most the IDF suggests that RAs might coordinate to avoid duplicate assignments, but it provides no guidelines on how such cross-RA checks would work.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref, in its particular application of the DOI, attempts to ensure that we don’t assign two different copies of the same article with different DOIs, but that is designed in order to avoid having publishers mistakenly making duplicate submissions. Even then, there are subtle exceptions to this rule- the same article, if legitimately published in two different issues (e.g. a regular issue and a thematic issue) will be assigned different DOIs. This is because, though the actual article content might be identical, the &lt;em>context&lt;/em> in which it is cited is also important to record and distinguish. Finally, of course, we assign multiple DOIs to the same “object” when we assign book-level and chapter level DOIs. Or when we assign DOIs to components or reference work entries.&lt;/span>&lt;/p>
&lt;p>&lt;span >The likelihood of multiple DOIs being assigned to the same object increases as we have multiple RAs. In the future we might legitimately have a monograph that has different &lt;a href="http://www.bowker.co.uk/en-UK/" target="_blank">Bowker&lt;/a> DOIs for different e-book platforms (Kindle, iPad, Kobo.) yet all three might share the same Crossref DOI for citation purposes.&lt;/span>&lt;/p>
&lt;p>&lt;span >Again, the examples show this already happening. The examples &lt;strong>f &amp;amp; g&lt;/strong> are assigned by &lt;a href="http://www.datacite.org/" target="_blank">DataCite&lt;/a> (via &lt;a href="http://figshare.com/" target="_blank">FigShare&lt;/a>) and Crossref respectively. The first identifies the author version and was presumably assigned by said author. The second identifies the publisher version and was assigned by the publisher.&lt;/span>&lt;/p>
&lt;p>&lt;span >Although Crossref, as a publisher-focused RA, might have historically proscribed the assignment of Crossref DOIs to archive or author versions, there has never been and could never be any such restrictions on other DOI RAs. These are legitimate applications of two citation identifiers to two versions of the same article.&lt;/span>&lt;/p>
&lt;p>&lt;span >However, the next set of examples, &lt;strong>h, i, j&lt;/strong> and &lt;strong>k&lt;/strong> show what appears to be a slightly different problem. In these cases articles that appear to be in all aspects &lt;em>identical&lt;/em> have been assigned two separate DOIs by different RAs. In one respect this is a logistical or technical problem- although Crossref can check for such potential duplicate assignments within its own system, there is no way for us to do this across different RAs. But this is also a marketing and education problem- how do RAs with similar constituencies (publishers, researchers, librarians) and application of the DOI (scholarly citation) educate and inform their members about best practice in applying DOIs in that particular RAs context?&lt;/span>&lt;/p>
&lt;h2 id="span-doi-registration-agencies-are-not-focused-on-record-types-they-are-focused-on-constituencies-and-applicationsspan">&lt;span >DOI registration agencies are not focused on record types, they are focused on constituencies and applications&lt;/span>&lt;/h2>
&lt;p>&lt;span >The examples &lt;strong>f&lt;/strong> through &lt;strong>k&lt;/strong> also illustrate another area of fuzzy thinking about RAs- that they are somehow built around particular record types. We routinely hear people mistakenly explain that difference between Crossref and DataCite is that “Crossref assigns DOIs to journal articles” and that “DataCite assigns DOIs to data.” Sometimes this is supplemented with “and Bowker assigns DOIs to books.” This is nonsense. Crossref assigns DOIs to data (example &lt;strong>o&lt;/strong>) as well as conference proceedings, programs, images, tables, books, chapters, reference entries, etc. And DataCite covers a similar breadth of record types including articles (examples &lt;strong>c, h, f, l, m&lt;/strong> ). The difference between Crossref, DataCite and Bowker is their constituencies and applications- not the record types they apply DOIs to. Crossref’s constituency is publishers. DataCite’s constituency is data repositories, archives and national libraries. But even though Crossref and DataCite have different constituencies, they share a similar application of the DOI- that is the use of DOI as citation identifiers. This is in contrast to MovieLabs whose application of the DOI is supply chain management.&lt;/span>&lt;/p>
&lt;h2 id="span-doi-registration-agency-constituencies-and-applications-can-overlap-or-be-entirely-separatespan">&lt;span >DOI registration agency constituencies and applications can overlap &lt;em>or&lt;/em> be entirely separate&lt;/span>&lt;/h2>
&lt;p>&lt;span >Although Crossref’s constituency is “publishers”, we are catholic in our definition of “publisher” and have several members who run repositories that also “publish” content such as working papers and other grey literature (e.g. Woods Hole Oceanographic Institution, University of Michigan Library, University of Illinois Library). DataCite’s constituency is data repositories, archives and national libraries, but this doesn’t stop DataCite (through CDL/FigShare) from working with the publisher, PLoS, on their “&lt;a href="http://blogs.plos.org/everyone/2012/08/14/plos-one-launches-reproducibility-initiative/" target="_blank">Reproducibility Initiative&lt;/a>” which requires the archiving of article-related datasets. PloS has announced that they will host all supplemental data sets on FigShare but will assign DOIs to those items through Crossref.&lt;/span>&lt;/p>
&lt;p>&lt;span >Crossref’s constituency of publishers overlaps heavily with &lt;a href="http://doi.airiti.com/" target="_blank">Airiti&lt;/a>, &lt;a href="http://japanlinkcenter.org/jalc/" target="_blank">JaLC&lt;/a>, &lt;a href="http://www.medra.org/" target="_blank">mEDRA&lt;/a>, &lt;a href="http://www.doi.org.cn/portal/index.htm" target="_blank">ISTIC&lt;/a> and &lt;a href="http://www.bowker.co.uk/en-UK/" target="_blank">Bowker&lt;/a>. In the case of all but Bowker we also overlap in our application of the DOI in the service of citation identification. Bowker, though it shares Crossref’s constituency, uses DOIs for supply chain management applications.&lt;/span>&lt;/p>
&lt;p>&lt;span >Meanwhile, &lt;a href="http://eidr.org/" target="_blank">EIDR&lt;/a> is an outlier, its constituency does not overlap with Crossref’s &lt;em>and&lt;/em> its application of the DOI is different as well.&lt;/span>&lt;/p>
&lt;p>&lt;span >The relationship between RA constituency overlap (e.g. scholarly publishers vs television/movie studios) and application overlap (e.g. citation identification vs. supply chain management) can be visualized as such:&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://www.crossref.org/wp/blog/uploads/2013/06/ra_overlap.png">&lt;img class="alignnone size-full wp-image-280" src="https://www.crossref.org/wp/blog/uploads/2013/06/ra_overlap.png" alt="RA Application/Constituency overlap" width="602" height="452" srcset="https://www.crossref.org/wp/blog/uploads/2013/06/ra_overlap.png 602w, https://www.crossref.org/wp/blog/uploads/2013/06/ra_overlap-300x225.png 300w" sizes="(max-width: 709px) 85vw, (max-width: 909px) 67vw, (max-width: 984px) 61vw, (max-width: 1362px) 45vw, 600px" />&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >The differences (subtle or large) between the various RAs are not evident to anybody without a fairly sophisticated understanding of the identifier space and the constituencies represented by the various RAs. To the ordinary person these are all just DOIs, which in turn are described as simply being “persistent interoperable identifiers.”&lt;/span>&lt;/p>
&lt;p>&lt;span >Which of course begs the question, what do we mean by “persistent” and “interoperable?”&lt;/span>&lt;/p>
&lt;h2 id="span-dois-only-are-as-persistent-as-the-registration-agencys-application-warrantsspan">&lt;span >DOIs only are as persistent as the registration agency’s application warrants.&lt;/span>&lt;/h2>
&lt;p>&lt;span >The word “persistent” does not mean “permanent.” &lt;a href="http://andrew.treloar.net/">Andrew Treloar&lt;/a> is known to point out that the primary sense of the word “persistent” in the New Oxford American Dictionary is:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >Continuing firmly or obstinately in a course of action in spite of difficulty or opposition&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Yet presumably the IDF once chose to use the word “persistent” instead of “perpetual” or “permanent” for other reasons. “Persistence” implies longevity, without committing to “forever.”&lt;/span>&lt;/p>
&lt;p>&lt;span >It may sound prissy, but it seems reasonable to expect that the useful life-expectancy for the identifier used for managing inventory of the the movie “Young Sex Crazed Nurses” might be different than the life expectancy for the identifier used to cite Henry Oldenburg’s “Epistle Dedicatory” in the first issue of the Philosophical Transactions. In other words, some RAs have a mandate to be more “obstinate” than others and so their definitions of “persistence” may vary. Different RAs have different service level agreements.&lt;/span>&lt;/p>
&lt;p>&lt;span >The problem is that ordinary users of the “persistent” DOI have no way of distinguishing between those DOIs that are expected to have a useful life of 5 years and those DOIs that are expected to have a useful lifespan of 300+ years. Unfortunately, if one of the more than 6 million non-Crossref DOIs breaks today, it will likely be blamed on Crossref.&lt;/span>&lt;/p>
&lt;p>&lt;span >Similarly, if a DOI doesn’t work with an existing Crossref service, like CrossCheck, Crossmark or Crossref Metadata Search, it will also be laid at the foot of Crossref. This scenario is likely to become even more complex as different RAs provide different specialized services for their constituencies.&lt;/span>&lt;/p>
&lt;p>&lt;span >Ironically, the converse doesn’t always apply. Crossref oftentimes does not get credit for services that we instigated at the IDF level. For instance, FigShare has been widely praised for implementing content negotiation for DOIs even though this initiative had nothing to do with FigShare, instead it was implemented by DataCite with the prodding and active help of Crossref (DataCite even used Crossref’s code for a while). To be clear, we don’t begrudge praise for FigShare. We think FigShare is very cool- this just serves as an example of the confusion that is already occurring.&lt;/span>&lt;/p>
&lt;figure>&lt;img src="https://www.crossref.org/wp/blog/uploads/2013/06/impressed.png"
alt="screenshot of tweet by Leigh Dodds" width="595" height="210">
&lt;/figure>
&lt;h2 id="heading">&lt;/h2>
&lt;h2 id="span-dois-are-only-interoperable-at-a-least-common-denominator-level-of-functionalityspan">&lt;span >DOIs are only “interoperable” at a least common denominator level of functionality&lt;/span>&lt;/h2>
&lt;p>&lt;span >There is no question that use of Crossref DOIs has enabled the interoperability of citations across scholarly publisher sites. The extra level of indirection built into the DOI means that publishers do not have to worry about negotiating multiple bilateral linking agreements and proprietary APIs. Furthermore, at the mundane technical level of following HTTP links, publishers also don’t have to worry about whether the DOI was registered with mEDRA, DataCite or Crossref as long as the DOI in question was applied with citation linking in mind.&lt;/span>&lt;/p>
&lt;p>&lt;span >However, what happens if somebody wants to use metadata to search for a particular DOI? What happens if they expect that DOI to work with content negotiation or to enable a CrossCheck analysis or show a Crossmark dialog or carry &lt;a href="https://www.crossref.org/services/funder-registry/" target="_blank">FundRef&lt;/a> data? At this level, the purported interoperability of the DOI system falls apart. A publisher issuing DataCite DOIs cannot use CrossCheck. A user with a mEDRA DOI cannot use it with content negotiation. Somebody searching Crossref Metadata Search or using Crossref’s OpenURL API will not find DataCite records. Somebody depositing metadata in an RA other than Crossref or DataCite will not be able to deposit ORCIDs.&lt;/span>&lt;/p>
&lt;p>&lt;span >There are no easy or cheap technical solutions to fix this level of incompatibility baring the creation of a superset of all RA functionality at the IDF level. But even if we had a technical solution to this problem- it isn’t clear that such a high-level of interoperability is warranted across all RAs. The degree of interoperability that is desirable between RAs is only in proportion to the degree that they serve overlapping constituencies (e.g. publishers) or use the DOI for overlapping applications (e.g. citation)&lt;/span>&lt;/p>
&lt;h2 id="span-doi-interoperability-matters-more-for-some-registration-agencies-than-othersspan">&lt;span >DOI Interoperability matters more for some registration agencies than others&lt;/span>&lt;/h2>
&lt;p>&lt;span >This raises the question of what it even means to be “interoperable” between different RAs that share virtually no overlap in constituencies or applications. In what meaningful sense do you make a DOI used for inventory control “interoperable” with a DOI used for identifying citable scholarly works? Do we want to be able to check “Young Sex Crazed Nurses” for plagiarism? Or let somebody know when the South Park movie has been retracted or updated? Do we need to alert somebody when their inventory of citations falls below a certain threshold? Or let them know how many copies of a PDF are left in the warehouse?&lt;/span>&lt;/p>
&lt;p>&lt;span >The opposite, but equally vexing issue arrises for RAs that actually share constituencies and/or applications. Crossref, DataCIte and mEDRA have &lt;em>all&lt;/em> built separate metadata search capabilities, separate deposit APIs, separate OpenURL APIs, and separate stats packages- &lt;em>all&lt;/em> geared at handling scholarly citation linking.&lt;/span>&lt;/p>
&lt;p>&lt;span >Finally, it seems a shame that a third party, like ORCID, who wants to enable researchers to add &lt;em>any&lt;/em> DOI and its associated metadata to their ORCID profile, will end up having to interface with 4-5 different RAs.&lt;/span>&lt;/p>
&lt;h2 id="span-summary-and-closing-thoughtsspan">&lt;span >Summary and closing thoughts&lt;/span>&lt;/h2>
&lt;p>&lt;span >Crossref was founded by publishers who were prescient in understanding that, as scholarly content moved online, there was the potential to add great value to publications by directly linking citations to the documents cited. However, publishers also realized that many of the architectural attributes that made the WWW so successful (decentralization, simple protocols for markup, linking and display, etc.), also made the web a fragile platform for persistent citation.&lt;/span>&lt;/p>
&lt;p>&lt;span >The Crossref solution to this dilemma was to introduce the use of the DOI identifier as a level of citation indirection in order to layer a persist-able citation infrastructure onto the web. The success of this mechanism has been evident at a number of levels. A first-order effect of the system is that it has allowed publishers to create reliable and persistent links between copies of publisher content. Indeed uptake of the Crossref system by scholarly and professional publishers has been rapid and almost all serious scholarly publishers are now Crossref members.&lt;/span>&lt;/p>
&lt;p>&lt;span >The second order effects of the Crossref system have also been remarkable. Firstly, just as researchers have long expected that any serious paper-based publication would include citations, now researchers expect that serious online scholarly publications will also support robust online citation linking. Secondly, some have adopted a cargo-cult practice of seeing the mere presence of a DOI on a publication as a putative sign of “citability” or “authority.” Thirdly, interest in use of the DOI as a linking mechanism has started to filter out to researchers themselves, thus potentially extending the use of Crossref DOIs beyond being primarily a B2B citation convention.&lt;/span>&lt;/p>
&lt;p>&lt;span >The irony is that although the DOI system was almost single-handedly popularized and promoted by Crossref, the DOI brand is better known than Crossref itself. We now find that new RAs like EIDR, DataCite and new services like FigShare are building on the DOI brand and taking it in new directions. As such the first and second order benefits of Crossref’s pioneering work with DOIs are likely to be effected by the increasing activity of the new DOI RAs as well as the introduction of new models for assigning and maintaining DOIs.&lt;/span>&lt;/p>
&lt;p>&lt;span >How can you trust that a DOI is persistent if different RAs have different conceptions of persistence? How can you expect the presence of a DOI to indicate “authority” or “scholarliness” if DOIs are being assigned to porn movies? How can you expect a DOI to point to the “published” version of an article when authors can upload and assign DOIs to their own copies of articles?&lt;/span>&lt;/p>
&lt;p>&lt;span >It is precisely because we think that some of the qualities traditionally (and wrongly) accorded to DOIs (e.g. scholarly, published, stewarded, citable, persistent) are going to be diluted in the long term that we have focused so much of our recent attention on new initiatives that have a more direct and unambiguous connection to assessing the trustworthiness of Crossref member’s content. CrossCheck and the CrossCheck logos are designed to highlight the role that publishers play in detecting and preventing academic fraud. The Crossmark identification service will serve as a signal to researchers that publishers are committed to maintaining their scholarly content as well as giving scholars the information they need to verify that they are using the most recent and reliable versions of a document. FundRef is designed to make the funding sources for research and articles transparent and easily accessible. And finally we have been both adjusting Crossref’s branding and display guidelines as well as working with the IDF to refine its branding and display guidelines so as to help clearly differentiate different DOI applications and constituencies.&lt;/span>&lt;/p>
&lt;p>&lt;span >Whilst it might be worrying to some that DOIs are being applied in ways that Crossref has not expected and may not have historically endorsed, we should celebrate that the broader scholarly community is finally recognizing the importance of persist-able citation identifiers.&lt;/span>&lt;/p>
&lt;p>&lt;span >These developments also serve to reinforce a strong trend that we have encountered in several guises before. That is, the complete scholarly citation record is made up of more than citations to the formally published literature. Our work on &lt;a href="http://www.orcid.org" target="_blank">ORCID&lt;/a> underscored that researchers, funding agencies, institutions and publishers are interested in developing a more holistic view of the manifold contributions that are integral to research. The “C” in ORCID stands for “contributor” and ORCID profiles are designed to ultimately allow researchers to record “products” which include not only formal publications, but also data sets, patents, software, web pages and other research outputs. Similarly, Crossref’s analysis of the CitedBy references revealed that one in fifteen references in the scholarly literature published in 2012 included a plain, ordinary HTTP URI- clear evidence that researchers need to be able to cite informally published content on the web. If the trend in CitedBy data continues, then in two to three years one in ten citations will be of informally published literature.&lt;/span>&lt;/p>
&lt;p>&lt;span >The developments that we are seeing are a response to the need that users have to persistently identify and cite the full gamut of record types that make up the scholarly literature. If we can not persistently site these record types, the scholarly citation record will grow increasingly porous and structurally unsound.  We can either stand back and let these gaps be filled by other players under their terms and deal reactively with the confusion that is likely to ensue- or we can start working in these areas too and help to make sure that what gets developed interacts with the existing online scholarly citation record in a responsible way.&lt;/span>&lt;/p></description></item><item><title>Easily add publications to your ORCID profile</title><link>https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/</link><pubDate>Thu, 24 Jan 2013 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/</guid><description>&lt;p>&lt;span >You can now easily search for publications and add them to your &lt;a href="http://www.orcid.org" target="_blank">ORCID&lt;/a> profile in the new beta of &lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org/" target="_blank">Crossref Metadata Search&lt;/a> (CRMDS). The user interface is pretty self-explanatory, but if you want to read about it before trying it, here is a summary of how it works.&lt;/span>&lt;/p>
&lt;p>&lt;span >When you go to to CRMDS, you will see that there is now a small ORCID sign-in button on the top right-hand side of the screen.&lt;/span>&lt;figure id="attachment_244" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-244">&lt;img class="size-medium wp-image-244 " src="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_home-300x253.png" alt="crmds_home" width="300" height="253" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_home-300x253.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_home-624x527.png 624w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_home.png 859w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Clicking on this button allows you to connect CRMDS to your ORCID profile and authorises CRMDS to add publications to your profile. First, if you are not already logged into ORCID, CRMDS will ask ORCID to log you in:&lt;/span>&lt;figure id="attachment_245" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" rel="attachment wp-att-245">&lt;img class="size-medium wp-image-245 " src="https://www.crossref.org/wp/blog/uploads/2013/01/orcid_login_prompt-300x230.png" alt="orcid_login_prompt" width="300" height="230" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/orcid_login_prompt-300x230.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/orcid_login_prompt-624x479.png 624w, https://www.crossref.org/wp/blog/uploads/2013/01/orcid_login_prompt.png 915w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Once you have logged in, ORCID will ask you if you want to allow CRMDS to be able to view and update your ORCID profile:&lt;/span>&lt;figure id="attachment_248" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-248">&lt;img class="size-medium wp-image-248 " src="https://www.crossref.org/wp/blog/uploads/2013/01/orcid_authorize-300x230.png" alt="orcid_authorize" width="300" height="230" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/orcid_authorize-300x230.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/orcid_authorize-624x480.png 624w, https://www.crossref.org/wp/blog/uploads/2013/01/orcid_authorize.png 925w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >After you authorise CRMDS to access your profile, you will be returned to the CRMDS screen and the top right corner of the CRMDS page will indicate that you have connected to your ORCID profile (note, you can always de-authorise CRMDS from accessing your ORCID profile in your ORCID settings):&lt;/span>&lt;figure id="attachment_249" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-249">&lt;img class="size-medium wp-image-249 " src="https://www.crossref.org/wp/blog/uploads/2013/01/orcid_logged_in-300x231.png" alt="orcid_logged_in" width="300" height="231" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/orcid_logged_in-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/orcid_logged_in-624x481.png 624w, https://www.crossref.org/wp/blog/uploads/2013/01/orcid_logged_in.png 915w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Once you are logged in, you can enter search terms that are likely to return records of your publications:&lt;/span>&lt;figure id="attachment_250" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-250">&lt;img class="size-medium wp-image-250 " src="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_terms-300x231.png" alt="crmds_search_terms" width="300" height="231" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_terms-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_terms-624x481.png 624w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_terms.png 915w" sizes="(max-width: 300px) 85vw, 300px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Each search result will show an icon telling you whether that particular item is visible in your ORCID profile. If the item is not in your ORCID profile, you see an icon like this:&lt;/span>&lt;/p>
&lt;p >
&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" rel="attachment wp-att-251">&lt;img class="size-full wp-image-251 aligncenter" src="https://www.crossref.org/wp/blog/uploads/2013/01/add_to_orcid_button.png" alt="add_to_orcid_button" width="113" height="30" />&lt;/a>
&lt;/p>
&lt;p>&lt;span >And if the item is already in your ORCID profile, you will see an icon like this:&lt;/span>&lt;/p>
&lt;p >
&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" rel="attachment wp-att-252">&lt;img class="size-full wp-image-252 aligncenter" src="https://www.crossref.org/wp/blog/uploads/2013/01/in_your_profile.png" alt="in_your_profile" width="133" height="27" />&lt;/a>&lt;span >In the following search results you can see that 1 item is already in Josiah Carberry’s profile, and 2 items are not:&lt;/span>
&lt;/p>&lt;figure id="attachment_254" class="wp-caption aligncenter">
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-254">&lt;img class=" wp-image-254 " src="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_results.png" alt="crmds_search_results" width="329" height="254" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_results.png 915w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_results-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_search_results-624x481.png 624w" sizes="(max-width: 329px) 85vw, 329px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p>&lt;span >Clicking on the “Add to Profile” button will confirm that you want to add the specified publication to your ORCID profile:&lt;/span>&lt;figure id="attachment_255" class="wp-caption aligncenter">&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" rel="attachment wp-att-255">&lt;img class=" wp-image-255 " src="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_add_work.png" alt="crmds_add_work" width="329" height="254" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_add_work.png 915w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_add_work-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_add_work-624x481.png 624w" sizes="(max-width: 329px) 85vw, 329px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p >
&lt;span >After clicking on &amp;#8220;Yes&amp;#8221; to add the publication to your profile, the search results will refresh to reflect that the item has been added.&lt;/span>
&lt;/p>&lt;figure id="attachment_257" class="wp-caption aligncenter">
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-257">&lt;img class=" wp-image-257 " src="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_work_added.png" alt="crmds_work_added" width="329" height="254" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/crmds_work_added.png 915w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_work_added-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/crmds_work_added-624x481.png 624w" sizes="(max-width: 329px) 85vw, 329px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p >
&lt;span >You can then just continue searching for and adding any publications that are not in your ORCID profile.&lt;/span>
&lt;/p>
&lt;p >
&lt;span >Note that, occasionally, you may see an orange icon that says that an item is &amp;#8220;Not Visible&amp;#8221;&lt;/span>
&lt;/p>&lt;figure id="attachment_258" class="wp-caption aligncenter">
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-258">&lt;img class="wp-image-258 " src="https://www.crossref.org/wp/blog/uploads/2013/01/not_visible.png" alt="not_visible" width="329" height="254" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/not_visible.png 915w, https://www.crossref.org/wp/blog/uploads/2013/01/not_visible-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/not_visible-624x481.png 624w" sizes="(max-width: 329px) 85vw, 329px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p >
&lt;span >This only occurs when you have previously added an item to your profile using CRMDS and then either:&lt;/span>
&lt;/p>
&lt;ol>
&lt;li>&lt;span >Set the ORCID privacy for that particular work item to “Private” in your ORCID profile.&lt;/span>&lt;/li>
&lt;li>&lt;span >Deleted the work from your ORCID profile.&lt;/span>&lt;/li>
&lt;/ol>
&lt;p>&lt;span >Unfortunately, CRMDS has no way to determine which of these two events occurred  However, If you click on the “Not Visible” icon, you will be prompted with two ways to resolve this issue. Either you can:&lt;/span>&lt;/p>
&lt;ol>
&lt;li>&lt;span >Reset the privacy settings on the specified work to “Public” or “Limited”&lt;/span>&lt;/li>
&lt;li>&lt;span >Confirm to CRMDS that you have deleted the item from your profile.&lt;/span>&lt;figure id="attachment_259" class="wp-caption aligncenter">&lt;/li>
&lt;/ol>
&lt;p>&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" target="_blank" rel="attachment wp-att-259">&lt;img class=" wp-image-259 " src="https://www.crossref.org/wp/blog/uploads/2013/01/not_visible_prompt.png" alt="not_visible_prompt" width="329" height="254" srcset="https://www.crossref.org/wp/blog/uploads/2013/01/not_visible_prompt.png 915w, https://www.crossref.org/wp/blog/uploads/2013/01/not_visible_prompt-300x231.png 300w, https://www.crossref.org/wp/blog/uploads/2013/01/not_visible_prompt-624x481.png 624w" sizes="(max-width: 329px) 85vw, 329px" />&lt;/a>&lt;figcaption class="wp-caption-text">click on thumbnail to see larger image&lt;/figcaption>&lt;/figure>&lt;/p>
&lt;p >
&lt;span >If the issue was your privacy settings, then once you have changed the privacy settings to public/limited you can simply click on the &amp;#8220;Refresh&amp;#8221; button and CRMDS will reflect the correct status of the work.&lt;/span>
&lt;/p>
&lt;p >
&lt;span >The best way to avoid this kind of confusion is to go to your ORCID settings and set the default privacy level for &amp;#8220;works&amp;#8221; to either &amp;#8220;limited&amp;#8221; or &amp;#8220;public.&amp;#8221;&lt;/span>
&lt;/p>
&lt;p >
&lt;span >Crossref Metadata Search is still a &amp;#8220;&lt;a title="Crossref Labs" href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a>&amp;#8221; project and, as such, we are very interested to hear feedback on this new ORCID functionality for CRMDS. Please send comments, etc. to:&lt;/span>
&lt;/p>
&lt;p >
&lt;span >&lt;a href="https://www.crossref.org/blog/easily-add-publications-to-your-orcid-profile/" rel="attachment wp-att-261">&lt;img class="alignnone size-full wp-image-261" src="https://www.crossref.org/wp/blog/uploads/2013/01/labs_email.png" alt="labs_email" width="233" height="42" />&lt;/a>&lt;/span>
&lt;/p></description></item><item><title>Crossref Metadata Search++</title><link>https://www.crossref.org/blog/crossref-metadata-search-plus-plus/</link><pubDate>Thu, 11 Oct 2012 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/crossref-metadata-search-plus-plus/</guid><description>&lt;p>We have just released a bunch of new functionality for &lt;a href="https://web.archive.org/web/20131229210637/http://search.crossref.org//" target="_blank">Crossref Metadata Search&lt;/a>. The tool now supports the following features:&lt;/p>
&lt;ul class="disc" >
&lt;li>
A completely new UI
&lt;/li>
&lt;li>
&lt;a href="http://en.wikipedia.org/wiki/Faceted_search" rel="external" target="_blank" >Faceted&lt;/a>&lt;span class="Apple-converted-space">&amp;nbsp;&lt;/span>searches
&lt;/li>
&lt;li>
Copying of search results as formatted citations using&lt;span class="Apple-converted-space">&amp;nbsp;&lt;/span>&lt;a href="http://en.wikipedia.org/wiki/Citation_Style_Language" rel="external" target="_blank" >CSL&lt;/a>
&lt;/li>
&lt;li>
&lt;a href="http://en.wikipedia.org/wiki/COinS" rel="external" target="_blank" >COinS&lt;/a>, so that you can easily import results into Zotero and other document management tools
&lt;/li>
&lt;li>
&lt;a href="http://web.archive.org/web/20121014215757/http://search.labs.crossref.org/help/api" rel="external" target="_blank" >An API&lt;/a>, so that you can integrate Crossref Metadata Search into your own applications, plugins, etc.
&lt;/li>
&lt;li>
Basic&lt;span class="Apple-converted-space">&amp;nbsp;&lt;/span>&lt;a href="http://en.wikipedia.org/wiki/OpenSearch" rel="external" target="_blank" >OpenSearch&lt;/a>&lt;span class="Apple-converted-space">&amp;nbsp;&lt;/span>support- so that you can integrate Crossref Metadata Search into your browser’s search bar.
&lt;/li>
&lt;li>
Searching for a particular Crossref DOI
&lt;/li>
&lt;li>
Searching for a particular Crossref&lt;span class="Apple-converted-space">&amp;nbsp;&lt;/span>&lt;a href="http://shortdoi.org/" rel="external" target="_blank" >ShortDOI&lt;/a>
&lt;/li>
&lt;li>
Searching for articles in a particular journal via the journal’s ISSN
&lt;/li>
&lt;/ul>
&lt;p>At the moment, Crossref Metadata Search (CRMDS) is a &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs project&lt;/a> and, as such, should be used with some trepidation. Our goal is to release CRMS as a production service ASAP, but we wanted to get public feedback on the service before making the move to a production system.&lt;/p></description></item><item><title>PatentCite</title><link>https://www.crossref.org/blog/patentcite/</link><pubDate>Mon, 13 Aug 2012 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/patentcite/</guid><description>&lt;p>If you’ve ever thought that scholarly citation practice was antediluvian and perverse- you should check-out patents some day.&lt;/p>
&lt;p>Over the past year of so Crossref has been working with &lt;a href="https://web.archive.org/web/20201202050237/http://www.cambia.org/" target="_blank">Cambia&lt;/a> and the &lt;a href="http://beta.lens.org/lens/" target="_blank">The Lens&lt;/a> to explore how we can better link scholarly literature to and from the patent literature. The first object of our collaboration was to attempt to link patents hosted on the new, beta version of The Lens to the Scholarly literature. To do this, Crossref and Cambia been enhancing Crossref’s citation matching mechanisms in order to better resolve the wide variety of eclectic and terse patent citation styles to Crossref DOIs.&lt;/p>
&lt;p>You can see the results of these ongoing attempts on the The Lens beta site where all of The Len’s &lt;strike>8 million+&lt;/strike> 80 million+ patents and applications (obtained through subscriptions with &lt;a href="http://www.wipo.int/" target="_blank">WIPO&lt;/a>, &lt;a href="http://www.uspto.gov/" target="_blank">USPTO&lt;/a>, &lt;a href="http://www.epo.org/" target="_blank">EPO&lt;/a> and &lt;a href="mailto:http://www.ipaustralia.gov.au/">IP Australia&lt;/a>) are starting to be linked directly to the scholarly literature. See, for example:&lt;/p>
&lt;p>&lt;code>http://beta.lens.org/lens/patent/US\_RE42150\_E1/citations&lt;/code>&lt;br>
[&lt;em>Editor&amp;rsquo;s update: Link is broken. Removed January 2021&lt;/em>]&lt;/p>
&lt;p>Crossref has taken this matched data and has now released a &lt;a href="https://web.archive.org/web/20121023015419/http://patents.labs.crossref.org/" target="_blank">Crossref Labs *experimental* service , called PatentCite&lt;/a>, that allows you to take any Crossref DOI and see what Patents in the The Lens system cite it.&lt;/p>
&lt;p>As with all Crossref Labs services- this one is likely to be:&lt;/p>
&lt;p>a) As stable as the global economy&lt;/p>
&lt;p>c) As reliable as a UK train&lt;/p>
&lt;p>ii) Out-of-date. It is based on a snapshot of Crossref /Lens data.&lt;/p>
&lt;ol>
&lt;li>As accurate as my list ordering&lt;/li>
&lt;/ol>
&lt;p>Howzat for an SLA?&lt;/p>
&lt;p>As we get feedback from Crossref’s membership and as we gain more experience linking Patents to and from the scholarly literature, we will explore including this functionality in our production CitedBY service. But until then- please send us your feedback on this experimental service.&lt;/p></description></item><item><title>Crossref and DataCite unify support for HTTP content negotiation</title><link>https://www.crossref.org/blog/crossref-and-datacite-unify-support-for-http-content-negotiation/</link><pubDate>Thu, 17 May 2012 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/crossref-and-datacite-unify-support-for-http-content-negotiation/</guid><description>&lt;p>Last year Crossref and DataCite announced support for HTTP content negotiation for DOI names. Today, we are pleased to report further collaboration on the topic. We think it is very important that the two largest DOI Registration Agencies work together in order to provide metadata services to DOI names.&lt;/p>
&lt;p>The current implementation is documented in detail at &lt;a href="http://citation.crosscite.org/" target="_blank">http://citation.crosscite.org/&lt;/a>&lt;/p>
&lt;p>The documentation explains HTTP content negotiation as implemented by both Registration Agencies and provides a list of supported resource/content/record types.&lt;/p>
&lt;p>An example application of HTTP content negotiation is a citation formatting service. You can try it at &lt;a href="http://citation.crosscite.org/" target="_blank">http://citation.crosscite.org/&lt;/a>.&lt;/p>
&lt;p>This service will accept DOIs from both Crossref and DataCite, unlike the previous formatting service which accepted only Crossref DOI names.&lt;/p>
&lt;p>This is possible because Crossref and DataCite support a shared, common metadata format. When you input a DOI into the formatting service, it doesn’t know where the DOI was registered. The service will make an&lt;/p>
&lt;p>HTTP content negotiation request to the global DOI resolver specifying which format of the metadata should be returned in the HTTP Accept header. The global DOI resolver will notice (Accept header!) that this is not a regular DOI resolution request; it will turn to Crossref or DataCite accordingly for the relevant metadata instead of redirecting to a landing page. The format of metadata is shared between both registration agencies so the formatting service can interpret it without knowledge of the DOI origin.&lt;/p>
&lt;p>In summary HTTP content negotiation lets you process a DOI’s metadata without knowledge of its origin or specifics of the registration agency.&lt;/p>
&lt;p>If you have any problems, email us at
&lt;a href="mailto:tech@datacite.org">tech@datacite.org&lt;/a> or &lt;a href="mailto:labs@crossref.org">labs@crossref.org&lt;/a>. For general discussion please kindly leave a comment below.&lt;/p></description></item><item><title>PDF-Extract</title><link>https://www.crossref.org/blog/pdf-extract/</link><pubDate>Tue, 17 Apr 2012 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/pdf-extract/</guid><description>&lt;h1 id="pdf-extract">PDF-EXTRACT&lt;/h1>
&lt;p>&lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a> is happy to announce the first public release of “&lt;a href="https://www.crossref.org/labs/pdfextract/" target="_blank">pdf-extract&lt;/a>” an open source set of tools and libraries for extracting citation references (and, eventually, other semantic metadata) from PDFs. We first demonstrated this tool to Crossref members at our annual meeting last year. See the &lt;a href="https://www.crossref.org/labs/pdfextract/" target="_blank">pdf-extract labs page&lt;/a> for a detailed introduction to this new set of tools.&lt;/p>
&lt;p>If you are unable to download and install the tool, &lt;a href="https://www.crossref.org/labs/pdfextract/" target="_blank">you can play with a experimental web interface called “Extracto.”&lt;/a> Be warned, &lt;strong>Extracto is running on very feeble server using an erratic and slow internet connection&lt;/strong>. The only guarantee that we can make about using it is that &lt;strong>it will repeatedly fall over and annoy you.&lt;/strong> &lt;em>The weasel has spoken.&lt;/em>&lt;/p></description></item><item><title>DOIs for PHD Comics’ Valentine’s Day Reading List</title><link>https://www.crossref.org/blog/dois-for-phd-comics-valentines-day-reading-list/</link><pubDate>Tue, 14 Feb 2012 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/dois-for-phd-comics-valentines-day-reading-list/</guid><description>&lt;p class="p1">
&lt;a href="http://goo.gl/8OzY">PHD Comics&lt;/a> has posted its &lt;a href="http://goo.gl/V5hhs">Valentine’s Day Reading&lt;/a> list.
&lt;/p>
&lt;p class="p1">
&lt;/p>
&lt;p class="p1">
Without DOIs!&amp;nbsp;
&lt;/p>
&lt;p class="p1">
&amp;nbsp;
&lt;/p>
&lt;p class="p1">
So in order to preserve the scholarly citation record, we’ve resolved those that have DOIs&amp;#8230;.
&lt;/p>
&lt;p class="p1">
&lt;/p>
&lt;p class="p1">
Title:&amp;nbsp; &lt;i>The St. Valentine’s Day Frontal Passage&lt;/i>
&lt;/p>
&lt;p class="p1">
Citation:&amp;nbsp; Sassen, K, 1980, &amp;#8216;The St. Valentine’s Day Frontal Passage’, &lt;i>Bulletin of the American Meteorological Society&lt;/i>, vol. 61, no. 2, p. 122.
&lt;/p>
&lt;p class="p2">
&lt;span class="s1">Crossref DOI:&amp;nbsp; &lt;a href="http://dx.doi.org/10.1175/1520-0477(1980)061%3C0122:TSVDFP%3E2.0.CO;2">&lt;span class="s2">http://dx.doi.org/10.1175/1520-0477(1980)061&lt;0122:TSVDFP>2.0.CO;2&lt;/span>&lt;/a>&lt;/span>
&lt;/p>
&lt;p class="p3">
&lt;/p>
&lt;p class="p1">
Title:&amp;nbsp; &lt;i>SUICIDE AND HOMICIDE ON ST. VALENTINE’S DAY&lt;/i>
&lt;/p>
&lt;p class="p1">
Citation:&amp;nbsp; LESTER, D, 1990, &amp;#8216;SUICIDE AND HOMICIDE ON ST. VALENTINE’S DAY’, &lt;i>Perceptual and Motor Skills&lt;/i>, vol. 71, no. 7, p. 994.
&lt;/p>
&lt;p class="p2">
&lt;span class="s1">Crossref DOI:&amp;nbsp; &lt;a href="http://dx.doi.org/10.2466/PMS.71.7.994-994">&lt;span class="s2">http://dx.doi.org/10.2466/PMS.71.7.994-994&lt;/span>&lt;/a>&lt;/span>
&lt;/p>
&lt;p class="p3">
&lt;/p>
&lt;p class="p1">
Title:&amp;nbsp; &lt;i>The St. Valentineʼs Day Massacre&lt;/i>
&lt;/p>
&lt;p class="p1">
Citation:&amp;nbsp; Eckert, W, 1980, &amp;#8216;The St. Valentineʼs Day Massacre’, &lt;i>The American Journal of Forensic Medicine and Pathology&lt;/i>, vol. 1, no. 1, pp. 67-70.
&lt;/p>
&lt;p class="p2">
&lt;span class="s1">Crossref DOI:&amp;nbsp; &lt;a href="http://dx.doi.org/10.1097/00000433-198003000-00011">&lt;span class="s2">http://dx.doi.org/10.1097/00000433-198003000-00011&lt;/span>&lt;/a>&lt;/span>
&lt;/p>
&lt;p class="p3">
&lt;/p>
&lt;p class="p1">
Title:&amp;nbsp; &lt;i>For Valentine’s Day&lt;/i>
&lt;/p>
&lt;p class="p1">
Citation:&amp;nbsp; Kutzner, H, 2001, &amp;#8216;For Valentine’s Day’, &lt;i>Cancer&lt;/i>, vol. 91, no. 4, pp. 804-805.
&lt;/p>
&lt;p class="p2">
&lt;span class="s1">Crossref DOI:&amp;nbsp; &lt;a href="http://dx.doi.org/10.1002/1097-0142(20010215)91:4%3C804::AID-CNCR1067%3E3.3.CO;2-K">&lt;span class="s2">http://dx.doi.org/10.1002/1097-0142(20010215)91:4&lt;804::AID-CNCR1067>3.3.CO;2-K&lt;/span>&lt;/a>&lt;/span>
&lt;/p></description></item><item><title>Turning DOIs into formatted citations</title><link>https://www.crossref.org/blog/turning-dois-into-formatted-citations/</link><pubDate>Mon, 28 Nov 2011 00:00:00 +0000</pubDate><author>Karl Ward</author><guid>https://www.crossref.org/blog/turning-dois-into-formatted-citations/</guid><description>&lt;p>&lt;span >Today two new record types were added to dx.doi.org resolution for Crossref DOIs. These allow anyone to retrieve DOI bibliographic metadata as formatted bibliographic entries. To perform the formatting we’re using the &lt;a href="http://citationstyles.org/">citation style language&lt;/a> processor, &lt;a href="https://web.archive.org/web/20120113111420/https://bitbucket.org/fbennett/citeproc-js/wiki/Home">citeproc-js&lt;/a> which supports a shed load of citation styles and locales. &lt;/span>&lt;/p>
&lt;p>&lt;span >In fact, all the styles and locales found in the CSL repositories, including many common styles such as bibtex, apa, ieee, harvard, vancouver and chicago are supported. First off, if you’d like to try citation formatting without using content negotiation, there’s &lt;a href="https://web.archive.org/web/20120201085933/http://citation.crrd.dyndns.org/">&lt;strong>a simple web UI&lt;/strong>&lt;/a> that allows input of a DOI, style and locale selection. If you’re more into accessing the web via your favorite programming language, have a look at these content negotiation curl examples. To make a request for the new “text/bibliography” record type:&lt;/span> &lt;tt>$ curl -LH &amp;ldquo;Accept: text/bibliography; style=bibtex&amp;rdquo; &lt;a href="http://dx.doi.org/10.1038/nrd842" target="_blank">http://dx.doi.org/10.1038/nrd842&lt;/a> @article{Atkins_Gershell_2002, title={From the analyst&amp;rsquo;s couch: Selective anticancer drugs}, volume={1}, DOI={10.1038/nrd842}, number={7}, journal={Nature Reviews Drug Discovery}, author={Atkins, Joshua H. and Gershell, Leland J.}, year={2002}, month={Jul}, pages={491-492}}&lt;/tt> A locale can be specified with the “locale” record type parameter, like this: &lt;tt>$ curl -LH &amp;ldquo;Accept: text/bibliography; style=mla; locale=fr-FR&amp;rdquo; &lt;a href="http://dx.doi.org/10.1038/nrd842" target="_blank">http://dx.doi.org/10.1038/nrd842&lt;/a> Atkins, Joshua H., et Leland J. Gershell. « From the analyst&amp;rsquo;s couch: Selective anticancer drugs ». Nature Reviews Drug Discovery 1.7 (2002): 491-492.&lt;/tt> &lt;span >You may want to process metadata through CSL yourself. For this use case, there’s another new record type, “application/citeproc+json” that returns metadata in a citeproc-friendly JSON form:&lt;/span> &lt;tt>$ curl -LH &amp;ldquo;Accept: application/citeproc+json&amp;rdquo; &lt;a href="http://dx.doi.org/10.1038/nrd842" target="_blank">http://dx.doi.org/10.1038/nrd842&lt;/a> {&amp;ldquo;volume&amp;rdquo;:&amp;ldquo;1&amp;rdquo;,&amp;ldquo;issue&amp;rdquo;:&amp;ldquo;7&amp;rdquo;,&amp;ldquo;DOI&amp;rdquo;:&amp;ldquo;10.1038/nrd842&amp;rdquo;,&amp;ldquo;title&amp;rdquo;:&amp;ldquo;From the analyst&amp;rsquo;s couch: Selective anticancer drugs&amp;rdquo;,&amp;ldquo;container-title&amp;rdquo;:&amp;ldquo;Nature Reviews Drug Discovery&amp;rdquo;,&amp;ldquo;issued&amp;rdquo;:{&amp;ldquo;date-parts&amp;rdquo;:[[2002,7]]},&amp;ldquo;author&amp;rdquo;:[{&amp;ldquo;family&amp;rdquo;:&amp;ldquo;Atkins&amp;rdquo;,&amp;ldquo;given&amp;rdquo;:&amp;ldquo;Joshua H.&amp;rdquo;},{&amp;ldquo;family&amp;rdquo;:&amp;ldquo;Gershell&amp;rdquo;,&amp;ldquo;given&amp;rdquo;:&amp;ldquo;Leland J.&amp;rdquo;}],&amp;ldquo;page&amp;rdquo;:&amp;ldquo;491-492&amp;rdquo;,&amp;ldquo;type&amp;rdquo;:&amp;ldquo;article-journal&amp;rdquo;}&lt;/tt> &lt;span >Finally, to retrieve lists of supported styles and locales, see:&lt;/span>&lt;/p>
&lt;p>&lt;span >* &lt;a href="https://crosscite.org">&lt;a href="https://crosscite.org" target="_blank">https://crosscite.org&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://github.com/citation-style-language/styles">style&lt;/a> and &lt;a href="https://github.com/citation-style-language/locales">locale&lt;/a> repositories. There’s one big caveat to all this. The CSL processor will do its best with Crossref metadata which can unfortunately be quite patchy at times. There may be pieces of metadata missing, inaccurate metadata or even metadata items stored under the wrong field, all resulting in odd-looking formatted citations. Most of the time, though, it works.&lt;/span>&lt;/p></description></item><item><title>Determining the Crossref membership status of a domain</title><link>https://www.crossref.org/blog/determining-the-crossref-membership-status-of-a-domain/</link><pubDate>Tue, 22 Nov 2011 00:00:00 +0000</pubDate><author>Karl Ward</author><guid>https://www.crossref.org/blog/determining-the-crossref-membership-status-of-a-domain/</guid><description>&lt;p>We’ve been asked a few times if it is possible to determine whether or not a particular domain name belongs to a Crossref member. To address this we’re launching another small service that performs something like a “reverse look-up” of URLs and domain names to DOIs and Crossref member status.&lt;/p>
&lt;p>The service provides an API that will attempt to reverse look-up a URL to a DOI and return the membership status (member or non-member) of the root domain of the URL. In practice resolving URLs to DOIs has substantial limitations - many publishers redirect the resolution URL of DOIs to other online content and URLs become clogged up with session IDs and other cruft appearing in their query parameters. All of this means it is unlikely that the URLs that appear to be the end result of DOI resolution are actually the URLs pointed to.&lt;/p>
&lt;p>However, it’s also possible to provide only a host name, in which case, as with a URL, the Crossref membership status for the root domain will be returned.&lt;/p>
&lt;p>There’s also a downloadable list of hashed domains that belong to Crossref members which will be useful to those who want to determine the membership status of a domain locally. Also, a bookmarklet allows anyone to easily check a web page they are looking at to see if the domain it is hosted on belongs to a Crossref member.&lt;/p></description></item><item><title>DataCite supporting content negotiation</title><link>https://www.crossref.org/blog/datacite-supporting-content-negotiation/</link><pubDate>Mon, 10 Oct 2011 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/datacite-supporting-content-negotiation/</guid><description>&lt;p>In April &lt;a href="https://www.crossref.org/blog/content-negotiation-for-crossref-dois/">In April&lt;/a> for its DOIs. At the time I cheekily called-out &lt;a href="http://datacite.org/" target="_blank">DataCite&lt;/a> to start supporting content negotiation as well.&lt;/p>
&lt;p>Edward Zukowski (DataCite’s resident propellor-head) took up the challenge with gusto and, as of September 22nd &lt;a href="http://data.datacite.org/" target="_blank">DataCite has also been supporting content negotiation for its DOIs&lt;/a>. This means that one million more DOIs are now &lt;a class="zem_slink" href="http://en.wikipedia.org/wiki/Linked_Data" title="Linked Data" rel="wikipedia">linked-data&lt;/a> friendly. Congratulations to Ed and the rest of the team at DataCite.&lt;/p>
&lt;p>We hope this is a trend. Back in June &lt;a href="http://www.knowledge-exchange.info/" target="_blank">Knowledge Exchange&lt;/a> organized a seminar on Persistent Object Identifiers. One of the outcomes of the meeting was “&lt;a href="https://web.archive.org/web/20130808010317/http://www.knowledge-exchange.info/Default.aspx?ID=62&amp;amp;M=News&amp;amp;NewsID=124" target="_blank">Den Haag Manifesto&lt;/a>” a document outlining five relatively simple steps that different persistent identifier systems could take in order to increase interoperability. Most of these steps involved adopting linked data principles including support for content negotiation. We look forward to hearing about other persistent identifiers adopting these principles over the next year.&lt;/p>
&lt;p>Having said that, this time I will refrain from calling-out anybody specifically…&lt;/p>
&lt;div class="zemanta-pixie">
&lt;a class="zemanta-pixie-a" href="http://www.zemanta.com/" title="Enhanced by Zemanta">&lt;img class="zemanta-pixie-img" src="http://img.zemanta.com/zemified_e.png?x-id=f7639c9b-8fd7-4af4-9c08-4f283778f4c2" alt="Enhanced by Zemanta" />&lt;/a>
&lt;/div></description></item><item><title>Family Names Service</title><link>https://www.crossref.org/blog/family-names-service/</link><pubDate>Thu, 06 Oct 2011 00:00:00 +0000</pubDate><author>Karl Ward</author><guid>https://www.crossref.org/blog/family-names-service/</guid><description>&lt;p>Today I’m announcing &lt;a href="https://web.archive.org/web/20120113212842/http://names.crrd.dyndns.org/" target="_blank">a small web API&lt;/a> that wraps a family name database here at Crossref R&amp;amp;D. The database, built from Crossref’s metadata, lists all unique family names that appear as contributors to articles, books, datasets and so on that are known to Crossref. As such the database likely accounts for the majority of family names represented in the scholarly record.&lt;/p>
&lt;p>The web API comes with two services: a family name detector that will pick out potential family names from chunks of text and a family name autocompletion system.&lt;/p>
&lt;p>Very brief documentation can be found &lt;a href="https://web.archive.org/web/20120113212842/http://names.crrd.dyndns.org/" target="_blank">here&lt;/a> along with a jQuery example of autocompletion.&lt;/p>
&lt;p>The database is still in development so there may be some oddities and inaccuracies in there. Right now one obvious omission from the name list that I hope to address soon are double-worded names such as “von Neumann”. We’re not proposing this database as an authority but rather something that backs a practical service for family name detection and autocompletion.&lt;/p></description></item><item><title>Content Negotiation for Crossref DOIs</title><link>https://www.crossref.org/blog/content-negotiation-for-crossref-dois/</link><pubDate>Tue, 19 Apr 2011 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/content-negotiation-for-crossref-dois/</guid><description>&lt;p>&lt;span >So does anybody remember the posting &lt;a href="https://www.crossref.org/blog/dois-and-linked-data-some-concrete-proposals/">DOIs and Linked Data: Some Concrete Proposals&lt;/a>?&lt;/span>&lt;/p>
&lt;p>&lt;span >Well, we went with option “D.”&lt;/span>&lt;/p>
&lt;p>&lt;span >From now on, DOIs, &lt;i>expressed as &lt;a href="http://en.wikipedia.org/wiki/Uniform_Resource_Identifier">HTTP URI&lt;/a>s&lt;/i>, can be used with &lt;a href="http://en.wikipedia.org/wiki/Content_negotiation">content-negotiation&lt;/a>.&lt;/span>&lt;/p>
&lt;p>&lt;span >Let’s get straight to the point. If you have &lt;a href="http://curl.haxx.se/">curl&lt;/a> installed, you can start playing with content-negotiation and Crossref DOIs right away:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >curl -D - -L -H   “Accept: application/rdf+xml” “&lt;code>http://dx.doi.org/10.1126/science.1157784&lt;/code>” &lt;/span>&lt;/p>
&lt;p>&lt;span >curl -D - -L -H   “Accept: text/turtle” “&lt;code>http://dx.doi.org/10.1126/science.1157784&lt;/code>”&lt;br /> &lt;/span>&lt;/p>
&lt;p>&lt;span >curl -D - -L -H   “Accept: application/atom+xml” “&lt;code>http://dx.doi.org/10.1126/science.1157784&lt;/code>” &lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Or if you are already using Crossref’s “&lt;a href="https://www.crossref.org/schema/unixref1.1.xsd" target="_blank">unixref&lt;/a>” format:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >curl -D - -L -H “Accept: application/unixref+xml” “&lt;code>http://dx.doi.org/10.1126/science.1157784&amp;amp;&lt;/code>#8221; &lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >This will work with over 46 million Crossref DOIs as of today, but the beauty of the setup is that from now on, any &lt;a href="http://www.doi.org/registration_agencies.html">DOI registration agency&lt;/a> can enable content negotiation for their constituencies as well. &lt;a href="http://datacite.org/">DataCite&lt;/a>- we’re looking at you 😉 .&lt;/span>&lt;/p>
&lt;p>&lt;span >It also means that, as registration agency members (Crossref publishers, for instance) start providing more complete and richer representations of their content, we can simply redirect content-negotiated requests directly to them.&lt;/span>&lt;/p>
&lt;p>&lt;span >We expect that that this development will round-out Crossref’s efforts to support standard APIs including &lt;a href="https://support.crossref.org/hc/en-us/articles/214880143">OpenURL&lt;/a> and &lt;a href="https://support.crossref.org/hc/en-us/articles/213679866">OAI_PMH&lt;/a> and we look forward to seeing DOIs increasingly used in &lt;a href="http://en.wikipedia.org/wiki/Linked_Data">linked data&lt;/a> applications.&lt;/span>&lt;/p>
&lt;p>&lt;span >Finally, Crossref would just like to thank the &lt;a href="http://www.doi.org/foundation/bios.html">IDF&lt;/a> and &lt;a href="http://www.cnri.reston.va.us/">CNRI&lt;/a> for their hard work on this as well as &lt;a href="http://www.linkedin.com/in/tonyhammond">Tony Hammond&lt;/a> and &lt;a href="http://www.ldodds.com/">Leigh Dodds&lt;/a> for their valuable advice and persistent goading.&lt;/span>&lt;/p></description></item><item><title>Monitoring Crossref Technical Developments</title><link>https://www.crossref.org/blog/monitoring-crossref-technical-developments/</link><pubDate>Tue, 29 Mar 2011 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/monitoring-crossref-technical-developments/</guid><description>&lt;p>Announcements regarding Crossref system status or changes are posted in an Announcements forum on our support portal (&lt;a href="http://support.crossref.org" target="_blank">http://support.crossref.org&lt;/a>). We recommend that someone from your organisation monitor this forum to stay informed about Crossref system status, schema changes, or other issues affecting deposits and queries. Subscribe to this forum via RSS feed (&lt;a href="https://support.crossref.org/hc/en-us" target="_blank">https://support.crossref.org/hc/en-us&lt;/a>) or select the ‘Subscribe’ option in the forum to subscribe by email.&lt;/p>
&lt;p>The TWG Discussion forum replaces the TWG mailing list and can be accessed by members of the Crossref community who log in to our support portal. Intended topics include technical matters related to Crossref’s services, DOI issues and Crossref system operation.&lt;/p></description></item><item><title>Add linked images to PDFs</title><link>https://www.crossref.org/blog/add-linked-images-to-pdfs/</link><pubDate>Mon, 16 Aug 2010 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/add-linked-images-to-pdfs/</guid><description>&lt;p>While working on an internal project, we developed “&lt;a href="https://www.crossref.org/labs/pdfstamp/" target="_blank">pdfstamp&lt;/a>“, a command-line tool that allows one to easily apply linked images to PDFs. We thought some in our community might find it useful and have &lt;a href="http://github.com/Crossref/pdfstamp" target="_blank">released it on github.&lt;/a> Some more PDF-related tools will follow soon.&lt;/p></description></item><item><title>XMP in RSC PDFs</title><link>https://www.crossref.org/blog/xmp-in-rsc-pdfs/</link><pubDate>Tue, 03 Aug 2010 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/xmp-in-rsc-pdfs/</guid><description>&lt;p>Just a quick heads-up to say that we’ve had a go at incorporating InChIs and ontology terms into our PDFs with XMP. There isn’t a lot of room in an XMP packet so we’ve had to be a bit particular about what we include.&lt;/p>
&lt;ul>
&lt;li>InChIs: the bigger the molecule the longer the InChI, so we’ve standardized on the fixed-length InChIKey. This doesn’t mean anything on its own, so we’ve gone the Semantic Web route of including an InChI resolver HTTP URI. Alternatively you can extract the InChIKeys with a regular expression.&lt;/li>
&lt;li>Ontology terms: we’re using HTTP URIs again and pointing to either Open Biomedical Ontology URIs (biology, biomedicine; slashy) or RSC ontology terms (chemistry; hashy). Often the OBO URIs resolve to a specific web page, but for the moment the RSC URIs just point to a large OWL file. Slashy URIs are quite a bit more involved so we’ll have to see what the demand is like.&lt;/li>
&lt;/ul>
&lt;p>There’s only about 4K to play with, so it’s only ever going to be a best-of. More detailed article metadata has to go in either a sidecar file, as Tony has pointed out before, or ideally on the article landing page. The example files are &lt;a href="https://web.archive.org/web/20070314231423/http://www.rsc.org/Publishing/Journals/ProjectProspect/Examples.asp" target="_blank">here&lt;/a> and I’ve posted something with a different slant on the &lt;a href="http://blogs.rsc.org/technical/2010/08/02/pdfs-enhanced-with-xmp/" target="_blank">RSC technical blog&lt;/a>.&lt;/p></description></item><item><title>OpenSearch/SRU Integration Paper</title><link>https://www.crossref.org/blog/opensearch/sru-integration-paper/</link><pubDate>Mon, 19 Jul 2010 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/opensearch/sru-integration-paper/</guid><description>&lt;p>Since I’ve already blogged about this a number of times before here, I thought I ought to include a link to a fuller writeup in this month’s &lt;a href="http://dlib.org" target="_blank">D-Lib Magazine&lt;/a> of our &lt;a href="https://www.nature.com/opensearch/" target="_blank">nature.com OpenSearch&lt;/a> service which serves as a case study in OpenSearch and SRU integration:&lt;/p>
&lt;p>&lt;a href="http://dlib.org/dlib/july10/hammond/07hammond.html" target="_blank">&lt;img border="0" src="https://www.crossref.org/wp/blog/images/dlib-page.png" height="320" width="450" />&lt;/a>&lt;/p>
&lt;p>&lt;a href="http://dlib.org/dlib/july10/hammond/07hammond.html" target="_blank">doi:10.1045/july2010-hammond&lt;/a>&lt;/p></description></item><item><title>Search: An Evolution</title><link>https://www.crossref.org/blog/search-an-evolution/</link><pubDate>Wed, 28 Apr 2010 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/search-an-evolution/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/search-triple-store.png">&lt;img border=0 alt="doi-what-do-we-got.png" src="https://www.crossref.org/wp/blog/images/search-triple-store.png" width="416" height="325" />&lt;/a>&lt;/p>
&lt;p>(Click image for full size graphic.)&lt;/p>
&lt;p>I thought I could take this opportunity to demonstrate one evolution path from traditional record-based search to a more contemporary triple-based search. The aim is to show that these two modes of search do not have to be alternative approaches but can co-exist within a single workflow.&lt;/p>
&lt;p>Let me first mention a couple of terms I’m using here: ‘graphs’ and ‘properties’. I’m using ‘property’ loosely to refer to the individual RDF statement (or triple) containing a property, i.e. a triple is a ‘(subject, property, value)’ assertion. And a ‘graph’ is just a collection of ‘properties’ (or, more properly, triples). Oh, and I’ll also use the term ‘records’ when considering ‘graphs’ as pre-fabricated objects returned within a result set.&lt;/p>
&lt;p>So, what do we have here? We have on the left a traditional means of disseminating search results which is typically record based. A new set of records may be generated by querying using the API provided, whether proprietary or public such as &lt;a href="https://lucene.apache.org/" target="_blank">Lucene&lt;/a> or &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU/CQL&lt;/a>. We can thus consider this search service as a ‘record store’ – even though records tend to generated anew rather than retrieved. The individual records in the result set are collections or groupings of ‘properties’ about the subjects of the query. Note that this is somewhat similar to the way music is packaged for physical distribution with many tracks (‘properties’) combined onto a single album (‘record’ or ‘graph’) which contains a thematic coherence – either same artist or compilation around a given topic.&lt;/p>
&lt;p>Digital music distribution, on the other hand, allows for albums to be atomized so that individual tracks may be cherry-picked at will. This is not dissimilar from what happens in a ‘triple store’ where the basic properties (‘tracks’) that in a regular search engine were together combined in a ‘record’ (‘album’) to present a search result can now be plucked apart and recombined into newer bespoke ensembles. Note that this querying and recombination can be applied across the full triple store or even across this triple store and remote triple stores since the same data model is applied. Certainly, at the data model level federated searching thus becomes a non-issue.&lt;/p>
&lt;p>Suppose now that our search server (or record store) is an &lt;a href="http://www.opensearch.org/" target="_blank">OpenSearch&lt;/a>-type service, i.e. the result sets are distributed as some list-based format, typically RSS, and that the list-based format either provides an RDF graph or can be transformed to such a graph, we could then use that as a basis for feeding an RDF triple store.&lt;/p>
&lt;p>So, now then at right we have a triple store which is a large database of triples (or properties) compiled from all the records in the record store. And since this is a triple store we can query it using &lt;a href="http://www.w3.org/TR/rdf-sparql-query/" target="_blank">SPARQL&lt;/a>. For example, this trival SPARQL query:&lt;/p>
&lt;pre>&lt;tt>
PREFIX dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt;
PREFIX prism: &amp;lt;http://prismstandard.org/namespaces/basic/2.0/&amp;gt;
SELECT ?doi ?title
WHERE {
?s prism:doi ?doi .
?s dc:title ?title .
FILTER regex(?title, "boson", "i" )
}
LIMIT 5
&lt;/tt>&lt;/pre>
&lt;p>returns the first five articles (referenced by DOI) with title containing the word ‘boson’:&lt;/p>
&lt;pre>&lt;tt>
--------------------------------------------------------------------------------------------------
| doi | title |
==================================================================================================
| "10.1038/nature05513" | "Comparison of the Hanbury Brown–Twiss effect for bosons and fermions" |
| "10.1038/221999a0" | "Physics: The Intermediate Boson" |
| "10.1038/313506b0" | "The nuts and bolts of bosons" |
| "10.1038/301287a0" | "The search for bosons: A golden year for the weak force" |
| "10.1038/424003a" | "Below-par performance hampers Fermilab quest for Higgs boson" |
--------------------------------------------------------------------------------------------------
&lt;/tt>&lt;/pre>
&lt;p>Now let’s contrast this with a conventional record-based search, such as shown at left, to find the first five articles (referenced by DOI) with title containing the word ‘boson’ would use a query (here SRU/CQL, and CQL is bolded) such as:&lt;/p>
&lt;pre>&lt;tt>
?query=&lt;b>dc.title="boson"&lt;/b>&amp;maximumRecords=5&amp;httpAccept=application/rss+xml
&lt;/tt>&lt;/pre>
&lt;p>and would receive a set of result records (here RSS) like so:&lt;/p>
&lt;pre>&lt;tt>
...
&amp;lt;item rdf:about="http://dx.doi.org/10.1038/nature05513"&amp;gt;
&amp;lt;title&amp;gt;Comparison of the Hanbury Brown–Twiss effect for bosons and fermions&amp;lt;/title&amp;gt;
&amp;lt;link&amp;gt;http://dx.doi.org/10.1038/nature05513&amp;lt;/link&amp;gt;
&amp;lt;dc:identifier&amp;gt;doi:10.1038/nature05513&amp;lt;/dc:identifier&amp;gt;
&amp;lt;dc:title&amp;gt;Comparison of the Hanbury Brown–Twiss effect for bosons and fermions&amp;lt;/dc:title&amp;gt;
...
&amp;lt;/item&amp;gt;
&amp;lt;item rdf:about="http://dx.doi.org/10.1038/221999a0"&amp;gt;
&amp;lt;title&amp;gt;Physics: The Intermediate Boson&amp;lt;/title&amp;gt;
&amp;lt;link&amp;gt;http://dx.doi.org/10.1038/221999a0&amp;lt;/link&amp;gt;
&amp;lt;dc:identifier&amp;gt;doi:10.1038/221999a0&amp;lt;/dc:identifier&amp;gt;
&amp;lt;dc:title&amp;gt;Physics: The Intermediate Boson&amp;lt;/dc:title&amp;gt;
...
&amp;lt;/item&amp;gt;
...
&lt;/tt>&lt;/pre>
&lt;p>Note also that there is an interesting halfway house as shown in the diagram, whereby a set of result records presenting a single RDF graph can be queried as its own (very) restricted triple store.&lt;/p>
&lt;p>In general, because a triple store is so primitive and it can be queried alongside other triple stores the queries that can be put together can be highly complex and customized with arbitrary data. The result from such a query differs from a traditional ‘record’ where a fixed property set is bound together in a presentation. Such a result is user-determined as opposed to the server-determined nature of traditional result ‘records’.&lt;/p>
&lt;p>I hope that this post has been able to show in some degree that although there are some obvious differences there is nevertheless a synergy between these two modes of searching: prêt-à-porter and tailored.&lt;/p></description></item><item><title>DOIs and Linked Data: Some Concrete Proposals</title><link>https://www.crossref.org/blog/dois-and-linked-data-some-concrete-proposals/</link><pubDate>Thu, 25 Mar 2010 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/dois-and-linked-data-some-concrete-proposals/</guid><description>&lt;p>Since last month’s threads (&lt;a href="https://www.crossref.org/blog/doi-what-do-we-got/">here&lt;/a>, &lt;a href="https://www.crossref.org/blog/the-response-page/">here&lt;/a>, &lt;a href="https://www.crossref.org/blog/does-a-crossref-doi-identify-a-work/">here&lt;/a> and &lt;a href="https://www.crossref.org/blog/is-frbr-the-osi-for-web-architecture/">here&lt;/a>) talking about the issues involved in making the DOI a first-class identifier for linked data applications, I’ve had the chance to actually sit down with some of the thread’s participants (&lt;a href="http://uk.linkedin.com/in/tonyhammond" target="_blank">Tony Hammond&lt;/a>, &lt;a href="http://www.ldodds.com/" target="_blank">Leigh Dodds&lt;/a>, &lt;a href="https://web.archive.org/web/20100525232458/http://www.tertius.ltd.uk/" target="_blank">Norman Paskin&lt;/a>) and we’ve been able sketch-out some possible scenarios for migrating the DOI into a linked data world.&lt;/p>
&lt;p>I think that several of us were struck by how little actually needs to be done in order to fully address virtually all of the concerns that the linked data community has expressed about DOIs. Not only that- but in some of these scenarios we would put ourselves in a position to be able to semantically-enable over 40 million DOIs with what amounts to the flick of a switch.&lt;/p>
&lt;p>Given the huge interest in linked data on the part of researchers and Crossref members- it seems like it would be a fantastic boon to both the IDF (&lt;a href="http://www.doi.org/" target="_blank">International DOI Foundation&lt;/a>) and Crossref if we were able to do something quickly here.&lt;/p>
&lt;p>Anyway- The following are notes outlining several concrete proposals for addressing the limitations of DOIs as identifiers in linked data applications. They range in complexity/effort involved- with the simplest scenario providing minimal (yet functional) LD capabilities for just one RA’s members (Crossref’s) and the most complex providing per-RA and per-RA-member configurability on how DOIs would behave for LD applications.&lt;/p>
&lt;p>We’d appreciate comments, questions, suggestions, corrections, etc.&lt;/p>
&lt;h2 id="a-simplest-scenario">A: Simplest Scenario&lt;/h2>
&lt;h3 id="what-would-need-to-be-done">What would need to be done?&lt;/h3>
&lt;ol>
&lt;li>Crossref implements a linked data service. For example, hosted at rdf.crossref.org.&lt;/li>
&lt;li>Crossref recommends that any member publisher who wants to add rudimentary linked data capabilities to their site could simply insert some simple link elements into their landing Pages. So, for instance, for the article with the DOI 10.5555/1234567 in the &lt;em>Journal of Psychoceramics&lt;/em>, the publisher would put the following in the landing page for the article:&lt;/li>
&lt;/ol>
&lt;p>&lt;code>&amp;lt;link rel=&amp;quot;primarytopic&amp;quot; href=&amp;quot;http://doi.crossref.org/10.5555/1234567&amp;quot; /&amp;gt;&lt;/code>
&lt;code>&amp;lt;link rel=&amp;quot;alternate&amp;quot; type=&amp;quot;application/rdf+xml&amp;quot; href=&amp;quot;http://rdf.crossref.org/metadata/10.5555/1234567.rdf&amp;quot; title=&amp;quot;RDF/XML version of this document&amp;quot;/&amp;gt;&lt;/code>
&lt;code>&amp;lt;link rel=&amp;quot;alternate&amp;quot; type=&amp;quot;text/html&amp;quot; href=&amp;quot;http://www.journalofpsychoceramics.org/10.5555/1234567.html&amp;quot; title=&amp;quot;HTML version of this document&amp;quot;/&amp;gt; &lt;/code>
&lt;code>&amp;lt;link rel=&amp;quot;alternate&amp;quot; type=&amp;quot;application/json&amp;quot; href=&amp;quot;http://rdf.crossref.org/metadata/10.5555/1234567.json&amp;quot; title=&amp;quot;RDF/JSON version of this document&amp;quot;/&amp;gt;&lt;/code>
&lt;code>&amp;lt;link rel=&amp;quot;alternate&amp;quot; type=&amp;quot;text/turtle&amp;quot; href=&amp;quot;http://rdf.crossref.org/metadata/10.5555/1234567.ttl&amp;quot; title=&amp;quot;Turtle version of this document&amp;quot;/&amp;gt;&lt;/code>&lt;/p>
&lt;p>In the above snippet the HTML version of the document is the publisher’s existing landing page.&lt;/p>
&lt;h3 id="how-it-would-work">How it would work&lt;/h3>
&lt;ol>
&lt;li>A sem-web-enabled browser would query dx.doi.org/10.5555/1234567 and get a normal 302 redirect to the publisher’s landing page.&lt;/li>
&lt;li>The sem-web-enabled browser would sniff the page for the link elements and retrieve the representations it wanted from rdf.crossref.org&lt;/li>
&lt;li>The returned document would contain an appropriate representation of the metadata that the publisher has deposited with Crossref. It would also assert that:&lt;/li>
&lt;/ol>
&lt;p>&lt;code> doi.crossref.org/10.5555/12334567 owl:sameAs dx.doi.org/10.5555/1234567 .&lt;/code>
&lt;code>dx.doi.org/10.5555/12334567 owl:sameAs info:doi/10.5555/12334567&amp;lt;/p&amp;gt; &lt;/code>
&lt;code>info:doi/10.5555/12334567 owl:sameAs doi:10.5555/1234567&lt;/code>&lt;/p>
&lt;p>
Alternatively, the publisher could implement their own linked data support on their own domain using whatever appropriate method they want. So, for instance, a larger publisher could support content negotiation at their site and return different/enhanced metadata, etc.
&lt;/p>
&lt;h3 id="pros">Pros&lt;/h3>
&lt;ol>
&lt;li>Doesn’t require changes at DOI/Handle levels&lt;/li>
&lt;li>Is easy for publisher to opt-in or opt-out&lt;/li>
&lt;li>Requires minimal development on the part of Crossref.&lt;/li>
&lt;/ol>
&lt;h3 id="cons">Cons&lt;/h3>
&lt;ol>
&lt;li>Only applies to Crossref DOIs.&lt;/li>
&lt;li>It depends on publishers taking action. Might be a long time before publishers add the needed links to their landing pages or support content negotiation.&lt;/li>
&lt;li>DOI system is still not strictly LD compliant (e.g. it is returning 302 redirects. Naive sem-web browsers might ‘stop’ after getting a 302. Should ideally use 303s, content negotiation, etc.)&lt;/li>
&lt;li>Doesn’t work for DOIs that currently bypass landing pages and which go directly to content.&lt;/li>
&lt;/ol>
&lt;h2 id="b-simple--idf-global-semantic-compliance">B: Simple + IDF Global Semantic Compliance&lt;/h2>
&lt;h3 id="what-would-need-to-be-done-1">What would need to be done?&lt;/h3>
&lt;ol>
&lt;li>Same as “Simplest Scenario”&lt;/li>
&lt;li>IDF globally changes dx.doi.org to return 303 redirect&lt;/li>
&lt;/ol>
&lt;h3 id="how-would-it-work">How would it work?&lt;/h3>
&lt;div>
Same as Simplest Scenario, except that, because sem-web-enabled browser had been told it was being redirected to a NIR (via the 303), it would presumably be more likely to continue.
&lt;/div>
&lt;h3 id="pros-1">Pros&lt;/h3>
&lt;ol>
&lt;li>All DOIs conform to expectations for LD identifiers&lt;/li>
&lt;li>Easy for publisher to opt-in or opt-out&lt;/li>
&lt;li>Requires minimal development on part of Crossref&lt;/li>
&lt;li>Requires minimal work (?) on part of IDF&lt;/li>
&lt;/ol>
&lt;h3 id="cons-1">Cons&lt;/h3>
&lt;ol>
&lt;li>Requires global change on part of IDF. Global change might conflict with requirements of other RAs.&lt;/li>
&lt;li>It depends on publishers taking action. Might be a long time before publishers add needed links to their landing pages or support content negotiation.&lt;/li>
&lt;li>Doesn’t work for DOIs that currently bypass landing pages (e.g. OECD spreadhseets, UICR datasets, etc.)&lt;/li>
&lt;/ol>
&lt;h2 id="c-simple--idf-global-semantic-compliance--ra-cn-intercept">C: Simple + IDF Global Semantic Compliance + RA CN Intercept&lt;/h2>
&lt;h3 id="what-would-need-to-be-done-2">What would need to be done?&lt;/h3>
&lt;ol>
&lt;li>
&lt;p>Same as “B: Simple + IDF Global Semantic Compliance” Scenario&lt;/p>
&lt;/li>
&lt;li>
&lt;p>IDF  changes dx.doi.org to redirect content-negotiated dx.doi.org queries to RA-controlled resolver depending on the preferences of the RA.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RA implements DOI resolver (e.g. dx.crossref.org) that supports content negotiation. RA allows its members to specify to the RA  that they want either: &lt;code>&amp;lt;ol type=a&amp;gt;&lt;/code>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RA to forward all requests to the member’s site.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RA to “intercept” content-negotiations for non-HTML representations and direct them appropriately (e.g. return appropriate representation from rdf.crossref.org) &lt;/ol>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;h3 id="font-size3how-would-it-workfont">&lt;font size=3>How would it work?&lt;/font>&lt;/h3>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/scenario_c_flow_v3.html" onclick="window.open('/wp/blog/images/scenario_c_flow_v3.html','popup','width=1600,height=1200,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false">&lt;img src="https://www.crossref.org/wp/blog/images/scenario_c_flow_v3-thumb.png" width="400" height="300" alt="" />&lt;/a>&lt;/p>
&lt;h3 id="pros-2">Pros&lt;/h3>
&lt;ol>
&lt;li>All DOIs conform to expectations for LD identifiers&lt;/li>
&lt;li>Allows RA to potentially LD-enable its members very quickly.&lt;/li>
&lt;li>Easy for ra-members to opt-in or opt-out&lt;/li>
&lt;li>Requires minimal development on part of Crossref&lt;/li>
&lt;li>Would even work for DOIs that bypass landing pages&lt;/li>
&lt;/ol>
&lt;h3 id="cons-2">Cons&lt;/h3>
&lt;ol>
&lt;li>Requires global change on part of IDF. Global change might conflict with requirements of other RAs.&lt;/li>
&lt;li>Requires change to add decision logic implementation on part of IDF. &lt;/li>
&lt;li>Requires development of RA resolvers that implement per-member resolution logic (note- this would probably actually be done at DOI level)&lt;/li>
&lt;/ol>
&lt;h2 id="d-simple--idf-selective-semantic-compliance--ra-cn-intercept">D: Simple + IDF Selective Semantic Compliance + RA CN Intercept&lt;/h2>
&lt;h3 id="what-would-need-to-be-done-3">What would need to be done?&lt;/h3>
&lt;ol>
&lt;li>
&lt;p>Same as Simplest Scenario&lt;/p>
&lt;/li>
&lt;li>
&lt;p>IDF  changes dx.doi.org to return either 302 or 303 redirect depending on the preferences of the RA.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>IDF  changes dx.doi.org to redirect content-negotiated dx.doi.org queries to RA-controlled resolver depending on the preferences of the RA.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RA implements DOI resolver (e.g. dx.crossref.org) that supports content negotiation. RA allows its members to specify to the RA  that they want either: &lt;ol type=a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RA to forward all requests to the member’s site.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>RA to “intercept” content-negotiations for non-HTML representations and direct them appropriately (e.g. return appropriate representation from rdf.crossref.org) &lt;/ol>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;h3 id="how-would-it-work-1">How would it work?&lt;/h3>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/scenario_d_flow_v31.html" onclick="window.open('/wp/blog/images/scenario_d_flow_v31.html','popup','width=1600,height=1200,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false">&lt;img src="https://www.crossref.org/wp/blog/images/scenario_d_flow_v3-thumb.png" width="400" height="300" alt="" />&lt;/a>&lt;/p>
&lt;h3 id="pros-3">Pros&lt;/h3>
&lt;ol>
&lt;li>Allows RA to potentially LD-enable its members very quickly.&lt;/li>
&lt;li>Easy for ra-members to opt-in or opt-out&lt;/li>
&lt;li>Requires minimal development on part of Crossref&lt;/li>
&lt;li>Would even work for DOIs that bypass landing pages&lt;/li>
&lt;/ol>
&lt;h3 id="cons-3">Cons&lt;/h3>
&lt;ol>
&lt;li>Only some DOIs conform to expectations for LD identifiers&lt;/li>
&lt;li>Requires change to add decision logic implementation on part of IDF. &lt;/li>
&lt;li>Requires development of RA resolvers that implement per-member resolution logic (note- this would probably actually be done at DOI level)&lt;/li>
&lt;/ol></description></item><item><title>Is FRBR the OSI for Web Architecture?</title><link>https://www.crossref.org/blog/is-frbr-the-osi-for-web-architecture/</link><pubDate>Sat, 13 Feb 2010 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/is-frbr-the-osi-for-web-architecture/</guid><description>&lt;p>(This post is just a repost of a comment to Geoff’s &lt;a href="https://www.crossref.org/blog/does-a-crossref-doi-identify-a-work">last entry&lt;/a> made because it’s already rather long, because it contains one original thought - FRBR as OSI - and because, well, it didn’t really want to wait for moderation.)&lt;/p>
&lt;p>Hi Geoff:&lt;/p>
&lt;p>First off, there is no question but that Crossref was established to take on the reference linking challenge for scholarly literature. (Hell, it’s there, as you point out, in the organisation name - PILA - as well as in the application name - Crossref.)&lt;/p>
&lt;p>But one should also remember that DOI as it was sold at the time was promising so much more. I disagree with you that the participants back then were as wholly innocent of the FRBR terms as you might suggest. Certainly there were ample presentations on DOI that sought to elucidate those relationships.&lt;/p>
&lt;p>No matter. FRBR is a useful reference model to clarify some of these concepts. But not one that we are overly concerned with at this time. Nor even whether DOI maps one to one onto a given FRBR layer. What we are more concerned with on a pragmatic level is how DOI maps onto the Web architecture and especially how it plays along with Linked Data concepts.&lt;/p>
&lt;p>(Aside: A propos FRBR we might be in danger of repeating the OSI mistake for standardizing the network layer model. Ultimately that was maintained as a reference model but dropped as a concrete model in favour of the TCP/IP stack. Could be that FRBR is our OSI and Linked Data is our TCP/IP stack? That is, we might have to settle on the coarser data model in order to get a coherent story out the door where all can agree.)&lt;/p>
&lt;p>You say:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“we need a mechanism to distinguish between when we are getting the thing pointed to by the Crossref DOI (the PDF , HTML, etc.) as opposed to “something about the thing” (e.g. the landing page, metadata record, etc.)”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>But that is exactly what we were chasing up in the earlier posts (both my &lt;a href="https://www.crossref.org/blog/doi-what-do-we-got/">DOI: What Do We Got?&lt;/a> and John Erickson’s &lt;a href="http://bitwacker.wordpress.com/2010/02/04/dois-uris-and-cool-resolution/" target="_blank">DOIs, URIs and Cool Resolution&lt;/a>). You want to distinguish between a thing and a description about a thing. And Web architecture does just that: it distinguishes between Information Resources (i.e. the things) and Non-Information Resources (i.e. descriptions of the things).&lt;/p>
&lt;p>Now is this something that Crossref can truly distinguish and make apparent in its service architecture? If we retain the notion of landing page we are already essentially saying that a Crossref HTTP URI identifies a decsription of the resource, i.e. a Non-Information Resource, or Other Resource, and that is properly indicated within the architecture by returning a “303 See Other status” code.&lt;/p>
&lt;p>I think that’s all we’re saying at the moment as a first step.&lt;/p>
&lt;p>Web architecture wants to know if the DOI HTTP URI is a thing or description of a thing. I say the latter. You seem to suggest in your comment the latter too. I wonder if we could get a vote on that.&lt;/p>
&lt;p>And btw, I am not suggesting that Crossref needs to dive into the business of &lt;em>“tracking compoend documents in their entirety”&lt;/em>. Far from it. Lets just get a common resource architecture agreed publicly and then we can build on that.&lt;/p>
&lt;p>This observation I received in a private email is something I fully support:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“The real problem is what doi http uri identify on the web. Everything flows from the answer to that Q.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Tony&lt;/p></description></item><item><title>Does a Crossref DOI identify a "work?"</title><link>https://www.crossref.org/blog/does-a-crossref-doi-identify-a-work/</link><pubDate>Thu, 11 Feb 2010 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/does-a-crossref-doi-identify-a-work/</guid><description>&lt;p>Tony’s recent thread on &lt;a href="https://www.crossref.org/blog/doi-what-do-we-got/">making DOIs play nicely in a linked data world&lt;/a> has raised an issue I’ve meant to discuss here for some time- a lot of the thread is predicated on the idea that Crossref DOIs are applied at the abstract “work” level. Indeed, that it what it currently says in our guidelines. Unfortunately, this is a case where theory, practice and documentation all diverge.&lt;/p>
&lt;p>When the Crossref linking system was developed it was focused primarily on facilitating persistent linking amongst journals and conference proceedings. The system was quickly adapted to handle books and more recently to handle working papers, technical reports, standards and “components”- a catchall term used to refer to everything from individual article images to database records.&lt;/p>
&lt;p>In practice the content outside of the core journals and conference proceedings has accounted for relatively low volume. However, we expect that over the next few years this will change and that books and databases will increasingly drive the future growth in Crossref’s citation linking services. Interestingly, these record types all share characteristics that make them substantially different from the journals and conference proceedings that we have hitherto focused on.&lt;/p>
&lt;p>Both books and databases introduce new challenges to technology and policies of our citation linking service. The challenges revolved around two areas:&lt;/p>
&lt;ul>
&lt;li>Structure: Both books and databases can have complex structures and the publishers of this content are likely to require granular identification of these content substructures along with a mechanism for documenting the relationship between these substructures (e.g. this section is part of this chapter which is part of this monograph which is part of this series)&lt;/li>
&lt;li>Versioning: Unlike typical journals and conference proceedings, books and database records sometimes change over time.&lt;/li>
&lt;/ul>
&lt;p>When confronted with the issues of structure and versioning publishers are often tempted to take shortcuts and decide to simply assign DOIs at the highest level structure and to the “work” instead of a particular “manifestation” or version of that work. Indeed, section 5.5 of Crossref’s [DOI Name Information and Guidelines][2] recommends this. But this approach could have a negative impact on the integrity of the scholarly citation record that Crossref is attempting to maintain.&lt;/p>
&lt;p>Fundamentally, Crossref DOIs are aimed at providing a persistent online citation infrastructure for scholarly and professional publishers. Consequently, decisions about where to apply Crossref DOIs should be guided by common expectations about the way in which citations work. Citations are typically used to credit ideas or provide evidence. A reader follows a citation in order to obtain more detail or to verify that an author is accurately representing the item cited. A rule of thumb is that a reader has a reasonable expectation that when they follow a citation, they will be taken to what the author saw when creating the citation. Any divergent behavior could result in the reader concluding that the author was misrepresenting the item cited. A further implication of this is that any changes to content that are likely to effect the crediting or interpretation of the content should result in that changed content getting a new Crossref DOI.&lt;/p>
&lt;p>Typically, this means that Crossref DOIs should be probably assigned at the expression level and different expressions should be assigned different Crossref DOIs. This is because assigning a Crossref DOI at the higher “work” level is generally not granular enough to guarantee that a reader following the citation will see what the author saw when creating the citation. For example, one translation of a work might be substantially different from another translation of the same work. Similarly a draft version of a work might be substantially different from the final published version of the work. In each case, resolving a citation to a different expression of the work than the expression that was originally cited might result in the reader interpreting the content differently than the citing author.&lt;/p>
&lt;p>In general, different “equivalent manifestations” of the same work can safely be assigned the same Crossref DOI. So, for instance, the HTML formatted version an article and the PDF formatted version of an article can almost always be assigned the same Crossref DOI. Any differences between the two are unlikely to affect the crediting of, or reader’s interpretation of, the work. But sometimes it is even possible that different manifestations of an expression will differ enough to merit different Crossref DOIs. For instance, a semantically enhanced version of an article might require new crediting (e.g. the parties responsible for adding the semantic information) and the resulting semantic enhancement may conceivably alter the reader’s interpretation of the article.&lt;/p>
&lt;p>Unfortunately, there is no hard and fast rule about where and when to assign new Crossref DOIs. Instead there is only a guideline, namely:&lt;/p>
&lt;blockquote>
&lt;p>“Assign new Crossref DOIs to content in a way that will ensure that a reader following the citation will see something as close to what the original author cited as is possible.”&lt;/p>
&lt;/blockquote>
&lt;p>The implications of this to publishers are important, especially when they are assigning DOIs to protean records types. For instance, it may mean that:&lt;/p>
&lt;ul>
&lt;li>Book publishers should be expected to keep old editions of books available for link resolution purposes.&lt;/li>
&lt;li>Publishers of content that can change rapidly (e.g. by the second) should provide facilities for creating frozen, archived snapshots of content for citation purposes.&lt;/li>
&lt;li>All publishers of protean content should issue guidelines instructing researchers on when it is appropriate to cite a work, manifestation or version.&lt;/li>
&lt;/ul>
&lt;p>Crossref needs to actively consider these issues as publishers start assigning Crossref DOIs to more dynamic types of content. Minimally, we should be able to provide publishers with recommendations on how to make dynamic content citable. We may even want to consider enshrining certain types of behavior in our terms and conditions so as to ensure the future integrity of the scholarly citation record.&lt;/p>
&lt;p>In short, we need to update our guidelines.&lt;/p>
&lt;p>[2]: &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">Crossref DOI display guidelines&lt;/a>&lt;/p></description></item><item><title>The Response Page</title><link>https://www.crossref.org/blog/the-response-page/</link><pubDate>Wed, 10 Feb 2010 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/the-response-page/</guid><description>&lt;p>(&lt;strong>Update - 2010.02.10:&lt;/strong> I just saw that I posted &lt;a href="https://www.crossref.org/blog/machine-readable-are-we-there-yet">here&lt;/a> on this same topic over a year ago. Oh well, I guess this is a perennial.)&lt;/p>
&lt;p>I am opening a new entry to pick up one point that John Erickson made in his &lt;a href="https://www.crossref.org/blog/doi-what-do-we-got">last comment&lt;/a> to the previous entry:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“I am suggesting that one “baby step” might be to introduce (e.g.) RDFa coding standards for embedding the doi:D syntax.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Yea!&lt;/p>
&lt;p>It might be worth consulting the latest Crossref &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">DOI Name Information and Guidelines&lt;/a> to see what that has to say about this. &lt;em>Section 6.3 - The response page&lt;/em> has these two specific requirements for publishers:&lt;/p>
&lt;ol>
&lt;li>When metadata and DOIs are deposited with Crossref, the publisher &lt;strong>must&lt;/strong> have active response pages in place so that they can resolve incoming links.
&lt;ul>
&lt;li>
&lt;p>A minimal response page &lt;strong>must&lt;/strong> contain a full bibliographic citation displayed to the user. A response page without bibliographic information should never be presented to a user.&lt;/ol>
What is truly shocking about these requirements is that this are purely user focussed. There is no mention whatsoever of machines. One might have thought that with the &lt;a href="https://web.archive.org/web/20100413223720/http://linkeddata.org/" target="_blank">Linked Data&lt;/a> gospel in full swing there would at least be a nod to machine-readable metadata. But there’s none. I’m not saying that there should be any requirement, or even any recommendation. But a mention might have been useful to chivvy us all along.&lt;/p>
&lt;p>I agree with John that publishers could be encouraged (or even just reminded) that machine-readable metadata could be made available through various mechanisms: HTML META tags (such as we currently provide at Nature - and as blogged &lt;a href="https://www.crossref.org/blog/natures-metadata-for-web-pages">here&lt;/a> earlier), COinS objects, RDF/XML comments, or best of all RDFa markup as John mentions.&lt;/p>
&lt;p>The Web is getting semantic. It’s about time that Crossref members joined the wave. And would be helpful if Crossref were there to help us with some new guidelines too!&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol></description></item><item><title>DOI: What Do We Got?</title><link>https://www.crossref.org/blog/doi-what-do-we-got/</link><pubDate>Tue, 09 Feb 2010 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/doi-what-do-we-got/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/doi-what-do-we-got.png">&lt;img border=0 alt="doi-what-do-we-got.png" src="https://www.crossref.org/wp/blog/images/doi-what-do-we-got.png" width="467" height="323" />&lt;/a>&lt;/p>
&lt;p>(Click image for full size graphic.)&lt;/p>
&lt;p>Following the JISC seminar last week on persistent identifiers (&lt;code>#jiscpid&lt;/code> on Twitter) there was some discussion about DOI and its role within a &lt;a href="https://web.archive.org/web/20100413223720/http://linkeddata.org/" target="_blank">Linked Data&lt;/a> context. John Erickson has responded with a very thoughtful post &lt;a href="http://bitwacker.wordpress.com/2010/02/04/dois-uris-and-cool-resolution" target="_blank">DOIs, URIs and Cool Resolution&lt;/a>, which ably summarizes how the current problem with DOI in that the way the DOI is is implemented by the handle HTTP proxy may not have kept pace with actual HTTP developments. (For example, John notes that the proxy is not capable of dealing with ‘Accept’ headers.) He has proposed a solution, and the post has attracted several comments.&lt;/p>
&lt;p>I just wanted to offer here the above diagram in an attempt to corral some of the various facets relating to DOI that I am aware of. I realize that this may seem like an open invitation to flame on - and this is a very preliminary draft - but … be kind!&lt;/p>
&lt;p>So, this may be totally off the wall but it represents my best understanding of DOI as used by Crossref.&lt;/p>
&lt;p>I have distinguished three main contexts:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Generic Data&lt;/strong> - A generalized information context where the an object is identified with a DOI, an identifier system that is currently being ratified through the ISO process. This is the raw DOI number. (This definitely is not a first class object on the Web as it has no URI.)
&lt;ul>
&lt;li>&lt;strong>Web Data&lt;/strong> - An online information context (here I use the term ‘Web’ in its widest sense) where resources are identified by URI (not necessarily an HTTP URI). Here DOI is represented under two URI schemes: ‘doi:’ (unregistered but preferred by Crossref), and ‘info:’ (registered and available for general URI use). Also it has a presence on the Web via an HTTP proxy (dx.doi.org) URL where it is used as a slug to create a permalink (as listed at ‘A’). A simple HTTP redirect is used (with status code 302) to turn this permalink into the publisher response page http://example/1. (Note that typically a second redirect will occur on the publisher platform, here shown by the redirect to http://example/2.)
&lt;ul>
&lt;li>
&lt;p>&lt;strong>Linked Data&lt;/strong> - An online information context where resources are identified by HTTP URI and conform to Linked Data principles. Now this is where there is a tension arises between the common publisher perspective and the strict semantic viewpoint. Implicit in the general Web context given above was the notion that the permalink (‘A’) was somehow related to the abstract object and the redirection service applied to it associated the abstract resource with concrete representations of the object.&lt;/ol>
So how do we relate the DOI HTTP URI with the abstract (‘work’) identifier listed at ‘D’ in the diagram?&lt;/p>
&lt;p>Well the &lt;a href="http://www.w3.org/TR/webarch/" target="_blank">Architecture of the World Wide Web&lt;/a> recognizes two distinct classes of resources: Information Resources (IR) and Non-Information Resources (NR). (Note: Only the term ‘information resource’ is used in AWWW.) IR are those that can be directly retrieved using HTTP, whereas NR are not directly retrievable but have an associated description which is retrievable and is itself a proxy for the real world object.&lt;/p>
&lt;p>So either the HTTP URI denotes an IR (as listed at ‘B’) and is resolved (through HTTP status code ‘302 Found’) to a default representation, which is the view that the Linked Data community would currently have of DOI. But this is at odds with what the Crossref position which regards DOI as identifying the abstract work. Alternately to fit better the Crossref model of DOI the HTTP URI would denote an NR (as listed at ‘A’) which would be resolved (through HTTP status code ‘303 See Other’) to an associated description - a publisher response page.&lt;/p>
&lt;p>There will be those self-appointed URI czars who will bemoan the fact of there being multiple URIs. But frankly there is nothing inherently wrong with that. Just as in the real world there are many languages so in the online world there are multiple contexts and histories. We can attempt to make some sense of this by making use of the well-known semantic properties &lt;code>owl:sameAs&lt;/code> and &lt;code>ore:similarTo&lt;/code> and declare (as also shown in the diagram) the following assertions:&lt;/p>
&lt;blockquote>
&lt;p>``&lt;a href="https://www.crossref.org/wp/blog/images/doi-what-do-we-got.png">&lt;img border=0 alt="doi-what-do-we-got.png" src="https://www.crossref.org/wp/blog/images/doi-what-do-we-got.png" width="467" height="323" />&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol>
&lt;p>(Click image for full size graphic.)&lt;/p>
&lt;p>Following the JISC seminar last week on persistent identifiers (&lt;code>#jiscpid&lt;/code> on Twitter) there was some discussion about DOI and its role within a &lt;a href="https://web.archive.org/web/20100413223720/http://linkeddata.org/" target="_blank">Linked Data&lt;/a> context. John Erickson has responded with a very thoughtful post &lt;a href="http://bitwacker.wordpress.com/2010/02/04/dois-uris-and-cool-resolution" target="_blank">DOIs, URIs and Cool Resolution&lt;/a>, which ably summarizes how the current problem with DOI in that the way the DOI is is implemented by the handle HTTP proxy may not have kept pace with actual HTTP developments. (For example, John notes that the proxy is not capable of dealing with ‘Accept’ headers.) He has proposed a solution, and the post has attracted several comments.&lt;/p>
&lt;p>I just wanted to offer here the above diagram in an attempt to corral some of the various facets relating to DOI that I am aware of. I realize that this may seem like an open invitation to flame on - and this is a very preliminary draft - but … be kind!&lt;/p>
&lt;p>So, this may be totally off the wall but it represents my best understanding of DOI as used by Crossref.&lt;/p>
&lt;p>I have distinguished three main contexts:&lt;/p>
&lt;ol>
&lt;li>&lt;strong>Generic Data&lt;/strong> - A generalized information context where the an object is identified with a DOI, an identifier system that is currently being ratified through the ISO process. This is the raw DOI number. (This definitely is not a first class object on the Web as it has no URI.)
&lt;ul>
&lt;li>&lt;strong>Web Data&lt;/strong> - An online information context (here I use the term ‘Web’ in its widest sense) where resources are identified by URI (not necessarily an HTTP URI). Here DOI is represented under two URI schemes: ‘doi:’ (unregistered but preferred by Crossref), and ‘info:’ (registered and available for general URI use). Also it has a presence on the Web via an HTTP proxy (dx.doi.org) URL where it is used as a slug to create a permalink (as listed at ‘A’). A simple HTTP redirect is used (with status code 302) to turn this permalink into the publisher response page http://example/1. (Note that typically a second redirect will occur on the publisher platform, here shown by the redirect to http://example/2.)
&lt;ul>
&lt;li>
&lt;p>&lt;strong>Linked Data&lt;/strong> - An online information context where resources are identified by HTTP URI and conform to Linked Data principles. Now this is where there is a tension arises between the common publisher perspective and the strict semantic viewpoint. Implicit in the general Web context given above was the notion that the permalink (‘A’) was somehow related to the abstract object and the redirection service applied to it associated the abstract resource with concrete representations of the object.&lt;/ol>
So how do we relate the DOI HTTP URI with the abstract (‘work’) identifier listed at ‘D’ in the diagram?&lt;/p>
&lt;p>Well the &lt;a href="http://www.w3.org/TR/webarch/" target="_blank">Architecture of the World Wide Web&lt;/a> recognizes two distinct classes of resources: Information Resources (IR) and Non-Information Resources (NR). (Note: Only the term ‘information resource’ is used in AWWW.) IR are those that can be directly retrieved using HTTP, whereas NR are not directly retrievable but have an associated description which is retrievable and is itself a proxy for the real world object.&lt;/p>
&lt;p>So either the HTTP URI denotes an IR (as listed at ‘B’) and is resolved (through HTTP status code ‘302 Found’) to a default representation, which is the view that the Linked Data community would currently have of DOI. But this is at odds with what the Crossref position which regards DOI as identifying the abstract work. Alternately to fit better the Crossref model of DOI the HTTP URI would denote an NR (as listed at ‘A’) which would be resolved (through HTTP status code ‘303 See Other’) to an associated description - a publisher response page.&lt;/p>
&lt;p>There will be those self-appointed URI czars who will bemoan the fact of there being multiple URIs. But frankly there is nothing inherently wrong with that. Just as in the real world there are many languages so in the online world there are multiple contexts and histories. We can attempt to make some sense of this by making use of the well-known semantic properties &lt;code>owl:sameAs&lt;/code> and &lt;code>ore:similarTo&lt;/code> and declare (as also shown in the diagram) the following assertions:&lt;/p>
&lt;blockquote>
&lt;p>``&lt;/p>
&lt;/blockquote>
&lt;p>Note that ore:similarTo (stemming from the &lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a> work) is a weaker kind of relationship than owl:sameAs (which comes from &lt;a href="http://www.w3.org/TR/owl-ref/" target="_blank">OWL&lt;/a>) and may be appropriate in this usage.&lt;/p>
&lt;p>In sum, scenario ‘A’ is what we have currently implemented, scenario ‘B’ is what might be commonly perceived as being implemented, and scenario ‘C’ may be a more correct semantic position.&lt;/p>
&lt;p>Your comments (and not unkind comments, please;) are more than welcome.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol></description></item><item><title>A Christmas Reading List&amp;#8230; with DOIs</title><link>https://www.crossref.org/blog/a-christmas-reading-list-with-dois/</link><pubDate>Sun, 13 Dec 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/a-christmas-reading-list-with-dois/</guid><description>&lt;p>Was outraged (outraged, I tell you) that one of my favorite online comics, &lt;a href="http://www.phdcomics.com/comics.php" target="_blank">PhD&lt;/a>, didn’t include DOIs in &lt;a href="http://www.phdcomics.com/comics/archive.php?comicid=1262" target="_blank">their recent bibliography of Christmas-related citations.&lt;/a>. So I’ve compiled them below.&lt;/p>
&lt;p>We care about these things so that you don’t have to. Bet you will sleep better at night knowing this.&lt;/p>
&lt;p>Or perhaps not…&lt;/p>
&lt;h2 id="a-christmas-reading-list8230-with-dois">A Christmas Reading List… with DOIs.&lt;/h2>
&lt;p>Citation:  Biggs, R, Douglas, A, Macfarlane, R, Dacie, J, Pitney, W, Merskey, C &amp;amp; O’Brien, J, 1952, ‘Christmas Disease’, BMJ, vol. 2, no. 4799, pp. 1378-1382.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.1136/bmj.2.4799.1378" target="_blank">http://dx.doi.org/10.1136/bmj.2.4799.1378&lt;/a>&lt;/p>
&lt;p>Title:  More Than a Labor of Love: Gender Roles and Christmas Gift Shopping&lt;/p>
&lt;p>Citation:  Fischer, E &amp;amp; Arnold, S, 1990, ‘More Than a Labor of Love: Gender Roles and Christmas Gift Shopping’, Journal of Consumer Research, vol. 17, no. 3, p. 333.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.1086/208561" target="_blank">http://dx.doi.org/10.1086/208561&lt;/a>&lt;/p>
&lt;p>Title:  Looking at Christmas trees in the nucleolus&lt;/p>
&lt;p>Citation:  Scheer, U, Xia, B, Merkert, H &amp;amp; Weisenberger, D, 1997, ‘Looking at Christmas trees in the nucleolus’, Chromosoma, vol. 105, no. 7-8, pp. 470-480.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.1007/s004120050209" target="_blank">http://dx.doi.org/10.1007/s004120050209&lt;/a>&lt;/p>
&lt;p>Title:  The Vela glitch of Christmas 1988&lt;/p>
&lt;p>Citation:  McCulloch, P, Hamilton, P, McConnell, D &amp;amp; King, E, 1990, ‘The Vela glitch of Christmas 1988’, Nature, vol. 346, no. 6287, pp. 822-824.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.1038/346822a0" target="_blank">http://dx.doi.org/10.1038/346822a0&lt;/a>&lt;/p>
&lt;p>Title:  Cardiac Mortality Is Higher Around Christmas and New Year’s Than at Any Other Time: The Holidays as a Risk Factor for Death&lt;/p>
&lt;p>Citation:  Phillips, D, 2004, ‘Cardiac Mortality Is Higher Around Christmas and New Year’s Than at Any Other Time: The Holidays as a Risk Factor for Death’, Circulation, vol. 110, no. 25, pp. 3781-3788.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.1161/01.CIR.0000151424.02045.F7" target="_blank">http://dx.doi.org/10.1161/01.CIR.0000151424.02045.F7&lt;/a>&lt;/p>
&lt;p>Title:  Red Crabs in Rain Forest, Christmas Island: Biotic Resistance to Invasion by an Exotic Snail&lt;/p>
&lt;p>Citation:  Lake, P &amp;amp; O’Dowd, D, 1991, ‘Red Crabs in Rain Forest, Christmas Island: Biotic Resistance to Invasion by an Exotic Snail’, Oikos, vol. 62, no. 1, p. 25.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.2307/3545442" target="_blank">http://dx.doi.org/10.2307/3545442&lt;/a>&lt;/p>
&lt;p>Title:  The Carvedilol Hibernation Reversible Ischaemia Trial, Marker of Success (CHRISTMAS) study Methodology of a randomised, placebo controlled, multicentre study of carvedilol in hibernation and heart failure&lt;/p>
&lt;p>Citation:  Pennell, D, 2000, ‘The Carvedilol Hibernation Reversible Ischaemia Trial, Marker of Success (CHRISTMAS) study Methodology of a randomised, placebo controlled, multicentre study of carvedilol in hibernation and heart failure’, International Journal of Cardiology, vol. 72, no. 3, pp. 265-274.&lt;/p>
&lt;p>Crossref DOI:  &lt;a href="http://dx.doi.org/10.1016/S0167-5273%2899%2900198-9" target="_blank">http://dx.doi.org/10.1016/S0167-5273(99)00198-9&lt;/a>&lt;/p></description></item><item><title>Add Crossref metadata to PDFs using XMP</title><link>https://www.crossref.org/blog/add-crossref-metadata-to-pdfs-using-xmp/</link><pubDate>Wed, 09 Dec 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/add-crossref-metadata-to-pdfs-using-xmp/</guid><description>&lt;p>In order to encourage publishers and other content producers to embed metadata into their PDFs, we have released an experimental tool called “pdfmark”, This open source tool allows you to add &lt;a href="http://en.wikipedia.org/wiki/Extensible_Metadata_Platform" target="_blank">XMP&lt;/a> metadata to a PDF. What’s really cool, is that if you give the tool a Crossref DOI, it will lookup the metadata in Crossref and then apply said metadata to the PDF. More detail can be found on the &lt;a href="https://www.crossref.org/labs/pdfmark/" target="_blank">pdfmark page&lt;/a> on the &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a> site. The usual weasels words and excuses about “experiments” apply.&lt;/p></description></item><item><title>QR Codes and DOIs</title><link>https://www.crossref.org/blog/qr-codes-and-dois/</link><pubDate>Tue, 08 Dec 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/qr-codes-and-dois/</guid><description>&lt;p>Inspired by &lt;a href="http://www.techcrunch.com/2009/12/06/google-local-maps-qr-code/" target="_blank">Google’s recent promotion of QR Codes&lt;/a>, I thought it might be fun to experiment with encoding a Crossref DOI and a bit of metadata into one of the critters. I’ve put a &lt;a href="https://www.crossref.org/labs/qr-code-generator/" target="_blank">short write-up of the experiment&lt;/a> on the &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a> site, which includes a demonstration of how you can generate a &lt;a href="http://en.wikipedia.org/wiki/QR_Code" target="_blank">QR Code&lt;/a> for any given Crossref DOI. Put them on postcards and send them to your friends for the holidays. Tattoo them on your pets. The possibilities are endless.&lt;/p></description></item><item><title>got SEARCH if you want it!</title><link>https://www.crossref.org/blog/got-search-if-you-want-it/</link><pubDate>Tue, 24 Nov 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/got-search-if-you-want-it/</guid><description>&lt;p>[See this link if you’re short on time: &lt;a href="http://nurture.nature.com/opensearch/apps/client-facets.html" target="_blank">facets&lt;/a> search client. Only tested on Firefox at this point. &lt;strong>Caveat:&lt;/strong> At time of writing the Crossref Metadata Search was being &lt;em>very&lt;/em> slow but was still functional. Previously it was just slow.]&lt;/p>
&lt;p>Following on from Geoff’s &lt;a href="https://www.crossref.org/blog/crossref-labs/">announcement&lt;/a> last month of a prototype Crossref Metadata OpenSearch on &lt;a href="https://www.crossref.org/labs/" target="_blank">labs.crossref.org&lt;/a>, I wanted to show what typical OpenSearch responses might look like in a more mature implementation.&lt;/p>
&lt;p>I have taken the liberty of modelling these on the response formats that we are already providing in our nature.com OpenSearch service which in turn are based on the draft syndication formats that I &lt;a href="https://www.crossref.org/blog/opensearch-formats-for-review/">blogged here&lt;/a> earlier.&lt;/p>
&lt;p>I am therefore returning ATOM, JSON, JSONP and RSS responses from these four OpenSearch URL templates:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;amp;#038;out=atom&amp;amp;#038;q=%7bsearchTerms%7d" target="_blank">http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;#038;out=atom&amp;#038;q={searchTerms}&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;amp;#038;out=json&amp;amp;#038;q=%7bsearchTerms%7d" target="_blank">http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;#038;out=json&amp;#038;q={searchTerms}&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;amp;#038;out=jsonp&amp;amp;#038;q=%7bsearchTerms%7d" target="_blank">http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;#038;out=jsonp&amp;#038;q={searchTerms}&lt;/a>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;amp;#038;out=rss&amp;amp;#038;q=%7bsearchTerms%7d" target="_blank">http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;#038;out=rss&amp;#038;q={searchTerms}&lt;/a>&lt;/ul>
as this &lt;a href="http://nurture.nature.com/opensearch/xml/opencrossref.xml" target="_blank">OpenSearch description&lt;/a> file details. Note that the URL templates include no indexing or pagination parameters as the Crossref prototype does not currently support these features.&lt;/p>
&lt;p>An example query (‘apple’) returning an ATOM feed from a Crossref Metadata OpenSearch would be the following:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;amp;#038;out=atom&amp;amp;#038;q=apple" target="_blank">http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;#038;out=atom&amp;#038;q=apple&lt;/a>&lt;/ul>
And the same query returning a JSON version of that ATOM feed would look as follows:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;amp;#038;out=json&amp;amp;#038;q=apple" target="_blank">http://nurture.nature.com/cgi-bin/opensearch?db=crossref&amp;#038;out=json&amp;#038;q=apple&lt;/a>&lt;/ul>
By the way, this is just for demonstration purposes and there are still issues to be resolved including character encoding.&lt;/p>
&lt;p>This interface uses the existing Crossref OpenSearch response format and parses the &lt;a href="https://web.archive.org/web/20090927174724/http://ocoins.info/" target="_blank">COinS&lt;/a> objects embedded in that response to provide a more standard OpenSearch syndication result set format. The prototype implemenatation also has some bugs which I needed to work around. (I will forward on details of these.) And there is also a more fundamental issue of response time from the experimental search server.&lt;/p>
&lt;p>But still this should give some idea of what a Crossref Metadata OpenSearch service could look like.&lt;/p>
&lt;p>To show this all in action I’ve worked up one of my &lt;a href="http://nurture.nature.com/opensearch/apps/client-facets.html" target="_blank">demo OpenSearch clients&lt;/a> for &lt;a href="http://www.nature.com/opensearch/" target="_blank">nature.com OpenSearch&lt;/a> which displays a facetted search response for a Crossref search. For good measure this includes also an OpenSearch interface for PubMed and the search client allows for simple selection between three journals databases: nature.com, Crossref and PubMed.&lt;/p>
&lt;p>Of course, with a reasonably uniform set of search result formats such as presented here it then becomes a simple exercise to reuse these search responses in additional search clients.&lt;/p>
&lt;p>As can be anticipated it would be very straightforward to carry this over into a single metasearch service which could run across these multiple databases.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>A Cheatsheet for nature.com OpenSearch</title><link>https://www.crossref.org/blog/a-cheatsheet-for-nature.com-opensearch/</link><pubDate>Thu, 22 Oct 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/a-cheatsheet-for-nature.com-opensearch/</guid><description>&lt;img alt="opensearch-cheatsheet-fragment.png" src="https://www.crossref.org/wp/blog/images/opensearch-cheatsheet-fragment.png" width="328" height="267" />
&lt;p>Following on from my &lt;a href="https://www.crossref.org/blog/nature.com-opensearch-a-structured-search-service/">recent post&lt;/a> about our shiny new &lt;a href="http://www.nature.com/opensearch" target="_blank">nature.com OpenSearch&lt;/a> service we just put up a cheatsheet for users. I’m posting about this here as this may also be of interest especially to those exploring how SRU and OpenSearch intersect.&lt;/p>
&lt;p>The cheatsheet can be downloaded from our nature.com OpenSearch &lt;a href="http://nurture.nature.com/opensearch" target="_blank">test page&lt;/a> and is available in two forms:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://nurture.nature.com/opensearch/docs/opensearch-cheatsheet.pdf" target="_blank">Cheatsheet (PDF, 65K)&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://nurture.nature.com/opensearch/docs/opensearch-cheatsheet.png" target="_blank">Cheatsheet (PNG, 141K)&lt;/a>&lt;/ul>
Naurally, all comments welcome.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Recommendations on RSS Feeds for Scholarly Publishers</title><link>https://www.crossref.org/blog/recommendations-on-rss-feeds-for-scholarly-publishers/</link><pubDate>Mon, 19 Oct 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/recommendations-on-rss-feeds-for-scholarly-publishers/</guid><description>&lt;p>We’re pleased to announce that a Crossref working group has released a set of &lt;a href="http://oxford.crossref.org/best_practice/rss/" target="_blank">best practice recommendations&lt;/a> for scholarly publishers producing RSS feeds.&lt;/p>
&lt;p>Variations in practice amongst publisher feeds can be irritating for end-users, but they can be insurmountable for automated processes. RSS feeds are increasingly being consumed by knowledge discovery and data mining services. In these cases, variations in date formats, the practice of lumping all authors together in one &lt;font color="#3eb1c8">&amp;lt;dc:creator&amp;gt; &lt;/font> element, or generating invalid XML can render the RSS feed useless to the service accessing it.&lt;/p>
&lt;p>The recommendations intended to facilitate good practice in the production and provision of TOC RSS Feeds. The guidelines include general recommendations for good practice, specific recommendations on the use of RSS Modules and an example RSS TOC feed. Ultimately, we expect that industry wide adoption of these best practices will help drive more traffic to publisher web sites. Note that most of these recommendation can also be applied to non-TOC RSS feeds such as thematic feeds, automated search result feeds, etc.&lt;/p></description></item><item><title>Crossref Labs</title><link>https://www.crossref.org/blog/crossref-labs/</link><pubDate>Tue, 13 Oct 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/crossref-labs/</guid><description>&lt;p>The other day &lt;a href="http://www.redbrick.dcu.ie/~noel/" target="_blank">Noel O’Boyle&lt;/a> wrote to tell me that he had updated the Ubiquity plug-in that we had developed in order to to make it work with the latest version of Firefox. The problem was, I had *also* updated the Ubiquity plug-in, but I hadn’t really indicated to anybody how they could find updates to the plug-in. /me=embarrassed. So it seemed time to provide a home for some of the prototypes and experiments that we’ve been developing at Crossref. To that end, we have created a &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs&lt;/a> site. Here you can find links to various tools and services that either make it easier to use Crossref services (e.g. &lt;a href="https://www.crossref.org/labs/wordpress-moveable-type-plugins/" target="_blank">Blog&lt;/a>/&lt;a href="https://www.crossref.org/labs/ubiquity-plugin/" target="_blank">Ubiquity&lt;/a> plugins and &lt;a href="https://www.crossref.org/labs/" target="_blank">OpenSearch Description files&lt;/a>) or that serve to illustrate a concept that has been of interest to our members (&lt;a href="https://www.crossref.org/labs/inchi-lookup/" target="_blank">InChI lookup&lt;/a>, &lt;a href="https://www.crossref.org/labs/toi-doi-i-e-short-dois/" target="_blank">TOI-DOIs&lt;/a>). Oh, yeah- and when we update these experiments, you should be able to find the updates on their respective pages. Sorry about that Noel… Finally, I will quote from the &lt;a href="https://www.crossref.org/labs/" target="_blank">Crossref Labs home page&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>“Most of the experiments linked to here are running on R&amp;amp;D equipment in a non-production environment. They may disappear without warning and/or perform erratically. If one of them isn’t working for some reason, come back later and try again.” Have fun.&lt;/p>
&lt;/blockquote></description></item><item><title>nature.com OpenSearch: A Structured Search Service</title><link>https://www.crossref.org/blog/nature.com-opensearch-a-structured-search-service/</link><pubDate>Mon, 05 Oct 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/nature.com-opensearch-a-structured-search-service/</guid><description>&lt;map name="GraffleExport">
&lt;area shape=rect coords="266,26,519,220" href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/naturecom_opensearch.html"> &lt;area shape=rect coords="230,220,486,414" href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/desktop_widgets_naturecom_sear.html"> &lt;area shape=rect coords="231,220,487,414" href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/desktop_widgets_naturecom_sear.html"> &lt;area shape=rect coords="2,123,253,317" href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/web_clients_for_naturecom_open.html">
&lt;/map>
&lt;img border="0" alt="opensearch-triptych.jpg" src="https://www.crossref.org/wp/blog/images/opensearch-triptych.jpg" width="522" height="438" usemap="#GraffleExport" />
&lt;p>(Click panels in figure to read related posts.)&lt;/p>
&lt;p>Following up on my earlier posts here about the structured search technologies &lt;a href="http://www.opensearch.org/" target="_blank">OpenSearch&lt;/a> and &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU&lt;/a>, I wanted to reference three recent posts on our web publishing blog Nascent which discuss our new &lt;a href="http://www.nature.com/opensearch" target="_blank">&lt;em>nature.com OpenSearch&lt;/em>&lt;/a> service:&lt;/p>
&lt;dl>
&lt;dt>&lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/naturecom_opensearch.html" target="_blank">1. Service&lt;/a>&lt;/dt>
&lt;dd>Describes the new &lt;a href="http://www.nature.com/opensearch" target="_blank">&lt;em>nature.com OpenSearch&lt;/em>&lt;/a> service which provides a structured resource discovery facility for content hosted on nature.com.&lt;/dd>
&lt;dt>&lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/web_clients_for_naturecom_open.html" target="_blank">2. Clients&lt;/a>&lt;/dt>
&lt;dd>Points to a small gallery of &lt;a href="http://nurture.nature.com/opensearch/apps" target="_blank">demo web clients&lt;/a> for &lt;em>nature.com OpenSearch&lt;/em> which all use the text-based JSON interface.&lt;/dd>
&lt;dt>&lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2009/10/desktop_widgets_naturecom_sear.html" target="_blank">3. Widgets&lt;/a>&lt;/dt>
&lt;dd>Introduces the new &lt;em>nature.com search&lt;/em> &lt;a href="https://web.archive.org/web/20110309190725/http://www.nature.com/libraries/public_interfaces/widgets.html" target="_blank">desktop widgets&lt;/a> which interface with the &lt;em>nature.com OpenSearch&lt;/em> service via an RSS feed. (See also the &lt;a href="http://www.youtube.com/watch?v=sqf_ew4o3U8" target="_blank">screencast&lt;/a> posted to YouTube.)&lt;/dd>
&lt;/dl>
&lt;p>We hope that this new search service will prove to be useful and may also provide a model for other implementations.&lt;/p></description></item><item><title>Please join us for the 2009 Crossref Technical Meeting.</title><link>https://www.crossref.org/blog/please-join-us-for-the-2009-crossref-technical-meeting./</link><pubDate>Tue, 08 Sep 2009 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/please-join-us-for-the-2009-crossref-technical-meeting./</guid><description>&lt;p>Crossref Technical Meeting*&lt;/p>
&lt;p>The Charles Hotel, Cambridge, MA&lt;/p>
&lt;p>Monday, November 9th, 2009&lt;/p>
&lt;p>2:00 pm - 5:00 pm&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/crossref-annual-meeting/" target="_blank">Please register today!&lt;/a>&lt;/p>
&lt;p>We also encourage you to register for our 10th Anniversary Celebration Dinner, which will take place Monday, November 9th, 2009 at 6:30 pm following the Crossref Technical Meeting at the Museum of Science in Boston, MA. Transportation from the Charles Hotel to the Museum of Science will be provided. Our 2009 Annual Meeting will take place on Tuesday, November 10th at 9:00 am in the Charles Hotel in Cambridge, MA and we urge you to register soon (if you haven’t already done so)&lt;/p>
&lt;p>as space is limited. &lt;a href="https://www.crossref.org/crossref-annual-meeting/" target="_blank">You may register for both events here&lt;/a>.&lt;/p>
&lt;p>*Please note that this year’s Technical Meeting will be on Monday afternoon.&lt;/p></description></item><item><title>PRC Report and &amp;#8220;iPub&amp;#8221; revisited</title><link>https://www.crossref.org/blog/prc-report-and-ipub-revisited/</link><pubDate>Mon, 07 Sep 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/prc-report-and-ipub-revisited/</guid><description>&lt;p>OK, so this has nothing to do with any Crossref projects- but there is an interesting &lt;a href="http://www.publishingresearch.net/SMEaccess.htm" target="_blank">new PRC report&lt;/a> out by &lt;a href="http://mrkwr.wordpress.com/mark-ware-consulting/" target="_blank">Mark Ware&lt;/a> in which he explores how SMEs (small/medium-sized enterprises) make use of scholarly articles and whether the scholarly publishing industry is doing anything to make their lives easier. This is a topic that is close to my heart. For the past few years I’ve been saying (&lt;a href="http://www.slideshare.net/Crossref/itunes-for-scholarly-publishing" target="_blank">most recently at SSP09&lt;/a>) that I think scholarly publishers are much too quick to dismiss the possibility of creating an iTunes-like service for scholarly publications (aka “iPub”). The report certainly seems to indicate that there is an important audience that would benefit from such a service (SMEs) and even goes so far as to cite my occasional rants on the subject. The summary of my iPub argument has been that:&lt;/p>
&lt;ul>
&lt;li>A very large percentage of the web visits that hit publishers web sites come from sources that are unrecognised. That is, they don’t come from a subscribing institution and they don’t seem to come from a registered user or anybody who has visited the site previously. For many publishers the level of such unrecognised visitors can amount to over 90% of all the traffic that hits their sites. Most industries would look at this percentage and work hard to figure out how to monetize some of it. Our industry seems to treat it like “noise”, reasoning that only people in recognised academic and professional institutions are going to desire or understand the content on scholarly journal sites.&lt;/li>
&lt;li>&lt;a href="http://www.nsf.gov/statistics/srvyrecentgrads/" target="_blank">Evidence from the NSF&lt;/a> shows that significantly more than 50% of US students who graduate with an S&amp;amp;E degree end up employed outside of directly S&amp;amp;E related fields. This represents a large percentage of potential consumers of scholarly and professional publications who are not part of a recognised academic or professional institution. SMEs anybody?&lt;/li>
&lt;li>These potential consumers are faced with a bewildering variety of sources for their content. They have to deal with multiple publisher sites with different interfaces and different PPV checkout procedures. And they have to navigate all this without the aid of library finding tools or the professional researcher’s understanding the scholarly journal environment. It is no wonder that they give up hope once they land on our abstract pages and face the gauntlet of another PPV checkout system.&lt;/li>
&lt;/ul>
&lt;p>It seems to me that the industry could provide a single interface and PPV shopping cart interface targeted at allowing people who work outside of traditional subscribing institutions to easily purchase individual article downloads from scholarly publishers. The system would be modelled at least in part by Apple’s iTunes, a system that has been lauded (and denounced) for revolutionising the way in which consumers buy music online. The chief virtues of the iTunes system are often cited as being:&lt;/p>
&lt;ul>
&lt;li>It contains a critical mass of content&lt;/li>
&lt;li>It provides a simple and consistent user interface&lt;/li>
&lt;li>It has a simple and inexpensive pricing model&lt;/li>
&lt;li>It disaggregated content (per song purchases)&lt;/li>
&lt;li>It interfaced transparently with the iPod.&lt;/li>
&lt;/ul>
&lt;p>A scholarly publishing “iPub” system could seek to emulate many of these strengths but not all. Clearly such a system could not impose uniform pricing or dictate pricing, as that would be anti-competitive. The PRC report makes this same point.&lt;/p>
&lt;p>Some, including the PRC report, also claim that the publishing industry has no equivalent of the “iPod” and that this would be a weakness of the system. I don’t agree with this- I think that the “iPod” in this case is currently called “paper.” In the future we will almost certainly migrate to some iPod/Kindle-like device, but as far as fulfilling most of the iPod’s functionality (portable rendering of the content) right now, I suspect paper fits the bill.&lt;/p>
&lt;p>Finally, there is a another oft-expressed concern that such a system might confuse channels for existing audiences and that librarians in particular would be very hostile to such a system. The truth is, I don’t know how librarians would react to such a system. The few I’ve mentioned it to certainly seemed amenable to the idea. Maybe this is where the PRC should do some follow-up research?&lt;/p>
&lt;p>In any case, it seems to me that there is potentially much to be gained by simply providing an easy PPV experience where users don’t have to register with multiple sites and cope with multiple shopping cart applications. Publishers can’t seriously think that they gain competitive advantage through their shopping carts? If not, then why not standardise on a uniform interface that is easily purchased from? Perhaps it doesn’t have to look like iTunes but can instead look like PayPal (PayPub?). Providing a simple mechanism like this might enable the industry to meet the needs of important and often overlooked audiences. I keep wondering if &lt;a href="http://www.copyright.com/" target="_blank">CCC&lt;/a> could help publishers do something here?&lt;/p></description></item><item><title>Crossref is hiring an R&amp;D Developer in Oxford</title><link>https://www.crossref.org/blog/crossref-is-hiring-an-rd-developer-in-oxford/</link><pubDate>Thu, 20 Aug 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/crossref-is-hiring-an-rd-developer-in-oxford/</guid><description>&lt;p>We are looking to hire an R&amp;amp;D Developer in our Oxford offices. We are look for somebody who:&lt;/p>
&lt;ul>
&lt;li>Is passionate about creating tools for online scholarly communication.&lt;/li>
&lt;li>Relishes working with metadata.&lt;/li>
&lt;li>Has experience delivering web-based applications using agile methodologies.&lt;/li>
&lt;li>Wants to learn new skills and work with a variety of programming languages.&lt;/li>
&lt;li>Enjoys working with a small, geographically dispersed team.&lt;/li>
&lt;li>Groks mixed-content model XML.&lt;/li>
&lt;li>Groks RDF.&lt;/li>
&lt;li>Groks REST.&lt;/li>
&lt;li>Has explored MapReduce-based database systems.&lt;/li>
&lt;li>Is expert in one or more popular development language (Java, C, C++, C#).&lt;/li>
&lt;li>Is expert in one or more popular scripting language (Ruby, Python, Javascript).&lt;/li>
&lt;li>Has deployed and maintained Linux/BSD-based systems.&lt;/li>
&lt;li>Understands relational databases (MySQL, Postgres, Oracle).&lt;/li>
&lt;li>Tests first.&lt;/li>
&lt;/ul>
&lt;p>If you are interested, please see the &lt;a href="http://oxford.crossref.org/jobs/rd_developer.html" target="_blank">full job description&lt;/a>. If you are not interested, but know somebody who might be, please let them know about this great opportunity.&lt;/p></description></item><item><title>Strategic Reading</title><link>https://www.crossref.org/blog/strategic-reading/</link><pubDate>Fri, 14 Aug 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/strategic-reading/</guid><description>&lt;p>&lt;a href="https://web.archive.org/web/20171002013342/http://people.ischool.illinois.edu/~renear/renearcv.html" target="_blank">Allen Renear&lt;/a> and &lt;a href="https://web.archive.org/web/20090901021407/http://people.lis.illinois.edu/~clpalmer" target="_blank">Carole Palmer&lt;/a> have just published an article titled “Strategic Reading, Ontologies, and the Future of Scientific Publishing” in the current issue of &lt;em>Science&lt;/em> (&lt;a href="http://dx.doi.org/10.1126/science.1157784" target="_blank">http://dx.doi.org/10.1126/science.1157784&lt;/a>). I’m particularly happy to see this paper published because I actually got to witness the genesis of these ideas in my living room back in 2006. Since then, Allen and Carole’s ideas have profoundly influenced my thinking on the application of technology to scholarly communication.&lt;/p>
&lt;p>Those who have seen me speak at conferences recently will have heard me do an awful lot of ranting about the how publishers and librarians need to help researchers practice the time-honored art of “reading avoidance” (or as Renear and Palmer politely put it- “strategic reading”). I even managed to squeeze this rant into a &lt;a href="http://www.wiley.com/bw/journalnews/newsitem.asp?release=2262" target="_blank">recent interview&lt;/a> I did with Wiley-Blackwell.&lt;/p>
&lt;p>The essence of my argument has been that our industries need not be bamboozled by the technical jargon and messianic hand-waving that typically accompany discussions of new technology trends like “web 2.0”, “text-mining”, “the semantic web”, “micro-blogging”, etc. This is because there is a fairly simple way for us to understand the relative import (or lack thereof) of new technologies to scholarly communication and that is to ask the following question:&lt;/p>
&lt;p>“Can the application of this technology in the realm of scholarly communication help researchers to read less?”&lt;/p>
&lt;p>If the answer is “yes”, then you’d better pay very close attention to it.&lt;/p>
&lt;p>In fact, I’d go so far as to say the history of scholarly publishing can be characterized by the successful adoption of conventions and tools that help researchers read strategically.&lt;/p>
&lt;p>Now I have something to cite when I rant.&lt;/p>
&lt;p>Anyway, congratulations to Allen &amp;amp; Carole.&lt;/p></description></item><item><title>OpenSearch Formats for Review</title><link>https://www.crossref.org/blog/opensearch-formats-for-review/</link><pubDate>Thu, 23 Jul 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/opensearch-formats-for-review/</guid><description>&lt;p>In an &lt;a href="https://www.crossref.org/blog/structured-search-using-prism-elements/">earlier post&lt;/a> I talked about using the PAM (PRISM Aggregator Message) schema for an SRU result set. I have also noted in &lt;a href="https://www.crossref.org/blog/search-web-service">another post&lt;/a> that a Search Web Service could support both &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU&lt;/a> and &lt;a href="http://opensearch.org/" target="_blank">OpenSearch&lt;/a> interfaces. This does then beg the question of what a corresponding OpenSearch result set might look like for such a record.&lt;/p>
&lt;p>Based on the &lt;a href="http://www.opensearch.org/Specifications/OpenSearch/1.1" target="_blank">OpenSearch spec&lt;/a> and also on a new &lt;a href="http://www.oasis-open.org/committees/download.php/33410/atom-extension-for-sru.doc" target="_blank">Atom extension for SRU&lt;/a>, I have contrived to show how a PAM record might be returned in a coomon OpenSearch format. Below I offer some mocked-up examples for each of the following formats for review purposes:&lt;/p>
&lt;ul>
&lt;li>RSS 1.0
&lt;ul>
&lt;li>ATOM
&lt;ul>
&lt;li>
&lt;p>JSON &lt;/ul>
Just click the relevant figure for a text rendering of each result format for the following phrase search:&lt;/p>
&lt;blockquote>
&lt;p>cql.keywords adj “solar eclipse”&lt;/p>
&lt;/blockquote>
&lt;p>In this example we imagine that two records have been requested. (The example formats also include navigational links as per the OpenSearch spec examples.)&lt;/p>
&lt;p>Note that the JSON example closely follows the ATOM schema with a couple of main deviations:&lt;/p>
&lt;ul>
&lt;li>Repeated elements are gathered together in an array (e.g. “entry”, “dc:creator”)
&lt;ul>
&lt;li>
&lt;p>Attributes are broken out alongside their parent elements (e.g. “rel”, “href”)&lt;/ul>
It would be interesting to hear what readers think of these examples - especially the JSON format.&lt;/p>
&lt;table cellpadding="10" border="0">
&lt;tr>
&lt;td>
&lt;a href="http://nurture.nature.com/opensearch/demo/solar2-rss.txt">&lt;img alt="solar2-rss.jpg" border="0" width="159" height="309" src="https://www.crossref.org/wp/blog/images/solar2-rss.jpg" />&lt;/a>
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
&amp;lt;a href=&amp;quot;http://nurture.nature.com/opensearch/demo/solar2-atom.txt&amp;quot;&amp;gt;&amp;lt;img alt=&amp;quot;solar2-atom.jpg&amp;quot; border=&amp;quot;0&amp;quot; width=&amp;quot;159&amp;quot; height=&amp;quot;309&amp;quot; src=&amp;quot;/wp/blog/images/solar2-atom.jpg&amp;quot; /&amp;gt;&amp;lt;/a&amp;gt;
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;a href=&amp;quot;http://nurture.nature.com/opensearch/demo/solar2-json.txt&amp;quot;&amp;gt;&amp;lt;img alt=&amp;quot;solar2-json.jpg&amp;quot; border=&amp;quot;0&amp;quot; width=&amp;quot;159&amp;quot; height=&amp;quot;309&amp;quot; src=&amp;quot;/wp/blog/images/solar2-json.jpg&amp;quot; /&amp;gt;&amp;lt;/a&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;th>
RSS 1.0
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th&amp;gt;
ATOM
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
JSON
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>(Click image to get text format.)&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>OASIS Drafts of SRU 2.0 and CQL 2.0</title><link>https://www.crossref.org/blog/oasis-drafts-of-sru-2.0-and-cql-2.0/</link><pubDate>Wed, 22 Jul 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/oasis-drafts-of-sru-2.0-and-cql-2.0/</guid><description>&lt;p>As posted &lt;a href="http://listserv.loc.gov/cgi-bin/wa?A2=ind0907&amp;amp;#038;L=zng&amp;amp;#038;T=0&amp;amp;#038;P=52" target="_blank">here&lt;/a> on the &lt;a href="https://web.archive.org/web/20130303230855/http://sun8.loc.gov/listarch/zng.html" target="_blank">SRU Implementors&lt;/a> list, the &lt;a href="http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=search-ws" target="_blank">OASIS Search Web Services Technical Committee&lt;/a> has announced the release of drafts of SRU and CQL version 2.0:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://www.oasis-open.org/committees/download.php/33498/sru-2-0-draft.doc" target="_blank">sru-2-0-draft.doc&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://www.oasis-open.org/committees/download.php/33497/cql-2-0-draft.doc" target="_blank">cql-2-0-draft.doc&lt;/a> &lt;/ul>
The Committee is soliciting feedback on these two documents. Comments should be &lt;a href="https://web.archive.org/web/20130303230855/http://sun8.loc.gov/listarch/zng.html" target="_blank">posted to the SRU list&lt;/a> by August 13.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Crossref OpenURL resolver</title><link>https://www.crossref.org/blog/crossref-openurl-resolver/</link><pubDate>Tue, 07 Jul 2009 00:00:00 +0000</pubDate><author>Chuck Koscher</author><guid>https://www.crossref.org/blog/crossref-openurl-resolver/</guid><description>&lt;p>A new version of our OpenURL resolver was deployed July 2 which should handle higher traffic (e.g. we have re-enable the LibX plug-in ) Unfortunately there were a few hick ups with the new version which I believe are now corrected (a character encoding bug and a XML structure translation problem).&lt;/p>
&lt;p>Sorry for any inconvenience.&lt;/p></description></item><item><title>XMP Primer</title><link>https://www.crossref.org/blog/xmp-primer/</link><pubDate>Wed, 10 Jun 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-primer/</guid><description>&lt;p>There’s a new &lt;a href="https://web.archive.org/web/20101124111737/http://www.idealliance.org/filefolder/XMPPrimer.pdf" target="_blank">XMP Primer&lt;/a> (PDF) by Ron Roskiewicz (ed. Dianne Kennedy) available from XMP-Open. This is copyrighted 2008 but I only just saw this now. This is a 43 page document which provides a very gentle introduction to metadata and labelling of media and then introduces XMP into the content lifecycle and talks to the business case for using XMP. The primer covers the following areas:&lt;/p>
&lt;ul>
&lt;li>Introduction to Metadata
&lt;ul>
&lt;li>Introduction to XMP
&lt;ul>
&lt;li>XMP and the Content Lifecycle
&lt;ul>
&lt;li>XMP in Action; Use Cases
&lt;ul>
&lt;li>Additional XMP Resources &lt;/ul>
One small gripe would be that this seems to have been prepared for US letter-sized pages and although is printable on A4 there is the slightest of clippings on the right-hand margin with no real loss of information but it does confer a sense of “incompleteness”. Really there can be little excuse these days for this parochialism. Also, for a document talking up the benefits of using XMP, it’s decidedly odd that it doesn’t make use of XMP itself - or rather there is a default XMP packet in the PDF with no real useful properties such as title, author, or date. Could have been a nice little object lesson in using XMP.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Aligning OpenSearch and SRU</title><link>https://www.crossref.org/blog/aligning-opensearch-and-sru/</link><pubDate>Fri, 05 Jun 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/aligning-opensearch-and-sru/</guid><description>&lt;p>[&lt;strong>Update - 2009.06.07:&lt;/strong> As pointed out by Todd Carpenter of NISO (see comments below) the phrase “&lt;em>SRU by contrast is an initiative to update Z39.50 for the Web&lt;/em>” is inaccurate. I should have said “&lt;em>By contrast SRU is an initiative recognized by ZING (Z39.50 International Next Generation) to bring Z39.50 functionality into the mainstream Web&lt;/em>“.]&lt;/p>
&lt;p>[&lt;strong>Update - 2009.06.08:&lt;/strong> Bizarrely I find in mentioning query languages below that I omitted to mention SQL. I don’t know what that means. Probably just that there’s no Web-based API. And that again it’s tied to a particular technology - RDBMS.]&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/queryType.png">&lt;img alt="queryType.png" src="https://www.crossref.org/wp/blog/images/queryType.png" width="379" height="261" border="0" />&lt;/a>&lt;/p>
&lt;p>(Click image to enlarge.)&lt;/p>
&lt;p>There are two well-known public search APIs for generic Web-based search: &lt;a href="http://www.opensearch.org/" target="_blank">OpenSearch&lt;/a> and &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU&lt;/a>. (Note that the key term here is “generic”, so neither &lt;a href="http://lucene.apache.org/solr/" target="_blank">Solr&lt;/a>/&lt;a href="http://lucene.apache.org/" target="_blank">Lucene&lt;/a> nor &lt;a href="http://www.w3.org/XML/Query/" target="_blank">XQuery&lt;/a> really qualify for that slot. Also, I am concentrating here on “classic” query languages rather than on semantic query languages such as &lt;a href="http://www.w3.org/2009/sparql/wiki/Main_Page" target="_blank">SPARQL&lt;/a>.)&lt;/p>
&lt;p>OpenSearch was created by Amazon’s &lt;a href="https://web.archive.org/web/20090610223844/http://a9.com/" target="_blank">A9.com&lt;/a> and is a cheap and cheerful means to interface to a search service by declaring a template URL and returning a structured XML format. It therefore allows for structured result sets while placing no constraints on the query string. As outlined in my earlier post &lt;a href="https://www.crossref.org/blog/search-web-service">Search Web Service&lt;/a>, there is support for search operation control parameters (pagination, encoding, etc.), but no inroads are made into the query string itself which is regarded as opaque.&lt;/p>
&lt;p>SRU by contrast is an initiative to update &lt;a href="http://www.loc.gov/z3950/agency/" target="_blank">Z39.50&lt;/a> for the Web and is firmly focussed on structured queries and responses. Specifically a query can be expressed in the high-level query language &lt;a href="http://www.loc.gov/standards/sru/specs/cql.html" target="_blank">CQL&lt;/a> which is independent of any underlying implementation. Result records are returned using any declared W3C XML Schema format and are transported within a defined XML wrapper format for SRU. (Note that the &lt;a href="https://www.oasis-open.org/committees/documents.php?wg_abbrev=search-ws" target="_blank">SRU 2.0 draft&lt;/a> provides support for arbitrary result formats based on media type.)&lt;/p>
&lt;p>One can summarize the respective OpenSearch and SRU functionalities as in this table:&lt;/p>
&lt;table border="1" width="50%">
&lt;tr>
&lt;th width="33%" align="left">
Structure
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th width=&amp;quot;33%&amp;quot; align=&amp;quot;center&amp;quot;&amp;gt;
OpenSearch
&amp;lt;/th&amp;gt;
&amp;lt;th width=&amp;quot;33%&amp;quot; align=&amp;quot;center&amp;quot;&amp;gt;
SRU
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
query
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
no
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
results
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
control
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
diagnostics
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
no
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>What I wanted to discuss here was the OpenSearch and SRU interfaces to a &lt;a href="https://www.crossref.org/blog/search-web-service">Search Web Service&lt;/a> such as outlined in my previous post. The &lt;a href="https://www.crossref.org/wp/blog/images/queryType.png">diagram&lt;/a> at top of this post shows query forms for OpenSearch and SRU and associated result types. The Search Web Service is taken to be exposing an SRU interface. It might be simplest to walk through each of the cases.&lt;/p>
&lt;p>(Continues below.)&lt;/p>
&lt;p>&lt;em>Case 1: OpenSearch (Native Client)&lt;/em>&lt;/p>
&lt;p>As noted, OpenSearch uses a URL template (declared in an OpenSearch description document) where recognized parameters are mapped to implementation-specific parameters. The bolded parameter “&lt;strong>query&lt;/strong>” in the figure indicates an OpenSearch parameter “&lt;strong>searchTerms&lt;/strong>” which has been mapped to the Search Web Service parameter “&lt;strong>query&lt;/strong>“,&lt;/p>
&lt;p>As also noted, SRU 2.0 offers support for alternate result formats (other than SRU XML) by allowing a media type (aka mime type) to be passed in an “http:accept” parameter. There is, however, no OpenSearch parameter corresponding to a format selector, so this must be hard coded directly into the URL template with a value of “application/rss+xml” - the standard media type for an RSS feed which is the common result format for OpenSearch.&lt;/p>
&lt;p>(In the diagram I have noted in parentheses that RSS in its RSS 1.0 form is RDF. And that format is a strong candidate for semantic interoperability. An alternate format would be Atom, which could be similarly selected with a value of “application/atom+xml”, but it is difficult to see at this time what advantage Atom confers. It does not conform to the RDF data model but may find better support in code libraries and applications.)&lt;/p>
&lt;p>The third parameter shown for Case 1, is “queryType” which is another new SRU 2.0 parameter. I had noted earlier that an OpenSearch query string could be passed directly through to the Search Web Service and its associated CQL parser. It tuns out that this needs to be analyzed further. (And many thanks to Jonathan Rochkind for useful discussions on this.)&lt;/p>
&lt;p>I had naively assumed that an OpenSearch query string would either be packed as a CQL string or would be a simple text string which could be interpreted as CQL. The latter interpretation (text string) turns out to be true only for a single bare word or for a quoted string - both of which are recognized CQL query strings (i.e. a single search term which has a default index and relationship to that index). It fails, however, for the more general case of unquoted strings. See table below for these cases.&lt;/p>
&lt;table border="1" width="50%">
&lt;tr>
&lt;th width="50%">
Query type
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th width=&amp;quot;50%&amp;quot;&amp;gt;
Query string
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
A. bare word
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;left&amp;quot;&amp;gt;
this
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
B. quoted string
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;left&amp;quot;&amp;gt;
&amp;amp;#8220;this is a query&amp;amp;#8221;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
C. unquoted string
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;left&amp;quot;&amp;gt;
this is a query
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>Case C would fail a CQL parser. So we need to signal to the Search Web Service that this is not a CQL string. And that’s where the “queryType” parameter comes in. If it’s set to “cql” then the query string is to be parsed as CQL, otherwise it must be handled in an alternate fashion. (As of now there is no value set for this parameter that I am aware of so I am using the terms “plain” and “cql” to differentiate.)&lt;/p>
&lt;p>How this should be handled by a CQL aware application is not immediately obvious. My first thought was to allow the application to silently quote such a string but that would change the semantics. It would be better to split the string into separate search clauses for each word and to join the search cluases by a default boolean operator, e.g. “&lt;code>AND&lt;/code>“, so that case C in the table might be interpreted by the application as:&lt;/p>
&lt;p>&lt;blockqoute>``[&lt;strong>Update - 2009.06.07:&lt;/strong> As pointed out by Todd Carpenter of NISO (see comments below) the phrase “&lt;em>SRU by contrast is an initiative to update Z39.50 for the Web&lt;/em>” is inaccurate. I should have said “&lt;em>By contrast SRU is an initiative recognized by ZING (Z39.50 International Next Generation) to bring Z39.50 functionality into the mainstream Web&lt;/em>“.]&lt;/p>
&lt;p>[&lt;strong>Update - 2009.06.08:&lt;/strong> Bizarrely I find in mentioning query languages below that I omitted to mention SQL. I don’t know what that means. Probably just that there’s no Web-based API. And that again it’s tied to a particular technology - RDBMS.]&lt;/p>
&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/queryType.png">&lt;img alt="queryType.png" src="https://www.crossref.org/wp/blog/images/queryType.png" width="379" height="261" border="0" />&lt;/a>&lt;/p>
&lt;p>(Click image to enlarge.)&lt;/p>
&lt;p>There are two well-known public search APIs for generic Web-based search: &lt;a href="http://www.opensearch.org/" target="_blank">OpenSearch&lt;/a> and &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU&lt;/a>. (Note that the key term here is “generic”, so neither &lt;a href="http://lucene.apache.org/solr/" target="_blank">Solr&lt;/a>/&lt;a href="http://lucene.apache.org/" target="_blank">Lucene&lt;/a> nor &lt;a href="http://www.w3.org/XML/Query/" target="_blank">XQuery&lt;/a> really qualify for that slot. Also, I am concentrating here on “classic” query languages rather than on semantic query languages such as &lt;a href="http://www.w3.org/2009/sparql/wiki/Main_Page" target="_blank">SPARQL&lt;/a>.)&lt;/p>
&lt;p>OpenSearch was created by Amazon’s &lt;a href="https://web.archive.org/web/20090610223844/http://a9.com/" target="_blank">A9.com&lt;/a> and is a cheap and cheerful means to interface to a search service by declaring a template URL and returning a structured XML format. It therefore allows for structured result sets while placing no constraints on the query string. As outlined in my earlier post &lt;a href="https://www.crossref.org/blog/search-web-service">Search Web Service&lt;/a>, there is support for search operation control parameters (pagination, encoding, etc.), but no inroads are made into the query string itself which is regarded as opaque.&lt;/p>
&lt;p>SRU by contrast is an initiative to update &lt;a href="http://www.loc.gov/z3950/agency/" target="_blank">Z39.50&lt;/a> for the Web and is firmly focussed on structured queries and responses. Specifically a query can be expressed in the high-level query language &lt;a href="http://www.loc.gov/standards/sru/specs/cql.html" target="_blank">CQL&lt;/a> which is independent of any underlying implementation. Result records are returned using any declared W3C XML Schema format and are transported within a defined XML wrapper format for SRU. (Note that the &lt;a href="https://www.oasis-open.org/committees/documents.php?wg_abbrev=search-ws" target="_blank">SRU 2.0 draft&lt;/a> provides support for arbitrary result formats based on media type.)&lt;/p>
&lt;p>One can summarize the respective OpenSearch and SRU functionalities as in this table:&lt;/p>
&lt;table border="1" width="50%">
&lt;tr>
&lt;th width="33%" align="left">
Structure
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th width=&amp;quot;33%&amp;quot; align=&amp;quot;center&amp;quot;&amp;gt;
OpenSearch
&amp;lt;/th&amp;gt;
&amp;lt;th width=&amp;quot;33%&amp;quot; align=&amp;quot;center&amp;quot;&amp;gt;
SRU
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
query
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
no
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
results
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
control
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
diagnostics
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
no
&amp;lt;/td&amp;gt;
&amp;lt;td align=&amp;quot;center&amp;quot;&amp;gt;
yes
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>What I wanted to discuss here was the OpenSearch and SRU interfaces to a &lt;a href="https://www.crossref.org/blog/search-web-service">Search Web Service&lt;/a> such as outlined in my previous post. The &lt;a href="https://www.crossref.org/wp/blog/images/queryType.png">diagram&lt;/a> at top of this post shows query forms for OpenSearch and SRU and associated result types. The Search Web Service is taken to be exposing an SRU interface. It might be simplest to walk through each of the cases.&lt;/p>
&lt;p>(Continues below.)&lt;/p>
&lt;!--more-->
&lt;p>&lt;em>Case 1: OpenSearch (Native Client)&lt;/em>&lt;/p>
&lt;p>As noted, OpenSearch uses a URL template (declared in an OpenSearch description document) where recognized parameters are mapped to implementation-specific parameters. The bolded parameter “&lt;strong>query&lt;/strong>” in the figure indicates an OpenSearch parameter “&lt;strong>searchTerms&lt;/strong>” which has been mapped to the Search Web Service parameter “&lt;strong>query&lt;/strong>“,&lt;/p>
&lt;p>As also noted, SRU 2.0 offers support for alternate result formats (other than SRU XML) by allowing a media type (aka mime type) to be passed in an “http:accept” parameter. There is, however, no OpenSearch parameter corresponding to a format selector, so this must be hard coded directly into the URL template with a value of “application/rss+xml” - the standard media type for an RSS feed which is the common result format for OpenSearch.&lt;/p>
&lt;p>(In the diagram I have noted in parentheses that RSS in its RSS 1.0 form is RDF. And that format is a strong candidate for semantic interoperability. An alternate format would be Atom, which could be similarly selected with a value of “application/atom+xml”, but it is difficult to see at this time what advantage Atom confers. It does not conform to the RDF data model but may find better support in code libraries and applications.)&lt;/p>
&lt;p>The third parameter shown for Case 1, is “queryType” which is another new SRU 2.0 parameter. I had noted earlier that an OpenSearch query string could be passed directly through to the Search Web Service and its associated CQL parser. It tuns out that this needs to be analyzed further. (And many thanks to Jonathan Rochkind for useful discussions on this.)&lt;/p>
&lt;p>I had naively assumed that an OpenSearch query string would either be packed as a CQL string or would be a simple text string which could be interpreted as CQL. The latter interpretation (text string) turns out to be true only for a single bare word or for a quoted string - both of which are recognized CQL query strings (i.e. a single search term which has a default index and relationship to that index). It fails, however, for the more general case of unquoted strings. See table below for these cases.&lt;/p>
&lt;table border="1" width="50%">
&lt;tr>
&lt;th width="50%">
Query type
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th width=&amp;quot;50%&amp;quot;&amp;gt;
Query string
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
A. bare word
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;left&amp;quot;&amp;gt;
this
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
B. quoted string
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;left&amp;quot;&amp;gt;
&amp;amp;#8220;this is a query&amp;amp;#8221;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
C. unquoted string
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td align=&amp;quot;left&amp;quot;&amp;gt;
this is a query
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table>
&lt;p>Case C would fail a CQL parser. So we need to signal to the Search Web Service that this is not a CQL string. And that’s where the “queryType” parameter comes in. If it’s set to “cql” then the query string is to be parsed as CQL, otherwise it must be handled in an alternate fashion. (As of now there is no value set for this parameter that I am aware of so I am using the terms “plain” and “cql” to differentiate.)&lt;/p>
&lt;p>How this should be handled by a CQL aware application is not immediately obvious. My first thought was to allow the application to silently quote such a string but that would change the semantics. It would be better to split the string into separate search clauses for each word and to join the search cluases by a default boolean operator, e.g. “&lt;code>AND&lt;/code>“, so that case C in the table might be interpreted by the application as:&lt;/p>
&lt;p>&lt;blockqoute>`` &lt;/blockquote>&lt;/p>
&lt;p>Now, of course, we must not expect that a typical OpenSearch implementation would be aware of CQL (or any of the SRU technologies). Instead we can simply indicate in the URL template that the “queryType” is non-CQL, by hard coding “queryType=plain”. The actual URL template which is declared in the OpenSearch description would thus be something like the following (with whitespace added for clarity):&lt;/p>
&lt;pre>&amp;lt;!-- 1. queryType="plain" --&amp;gt;
&amp;lt;Url type="application/rss+xml"
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;template="http://www.example/search?
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;query={&lt;b>searchTerms&lt;/b>}
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;queryType=plain
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;http:accept=application/rss+xml
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;"
/&amp;gt;
&lt;/pre>
&lt;p>This URL template uses one OpenSearch parameter(“searchTerms”) and that is mapped to the SRU parameter “query”. The SRU 2.0 parameters “queryType” and “http:accept” are wired in. This means that a Search Web Service would be aware of the query, would know that it was not CQL (so might invoke a handler), and would be know that a result set in RSS was required.&lt;/p>
&lt;p>&lt;em>Case 2: OpenSearch (CQL-Aware Client)&lt;/em>&lt;/p>
&lt;p>The above case, works for a general OpenSearch client but now is problematic for a CQL-aware client. With the “queryType” set at “plain” there is no opportunity to indicate that a generic CQL string might be passed instead. We certainly wouldn’t want a non-CQL handler to operate on a valid CQL string. We need to vary the SRU 2.0 parameters and within the scope of OpenSearch this can only be done by recognizing the parameters as &lt;a href="https://opensearch.org/blog/introducing-extensions-for-opensearch/" target="_blank">OpenSearch extensions&lt;/a>. Basically, an extension is nothing more than a separately namespaced element or attribute. Recommendation is that the XML namespace would resolve to a specification document detailing the intention and format of the extension.&lt;/p>
&lt;p>The URL template for a CQL-aware OpenSearch description could make use of the “queryType” and “http:accept” parameters as OpenSearch extensions (marked in bold italics in the figure) using a declaration like this:&lt;/p>
&lt;pre>&amp;lt;!-- 2. queryType="cql" --&amp;gt;
&amp;lt;Url type="application/xml"
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;xmlns:sru="http://opensearch.example/sru-extension"
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;template="http://www.example/search?
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;query={&lt;b>searchTerms&lt;/b>}
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;queryType={&lt;b>sru:queryType?&lt;/b>}
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;http:accept={&lt;b>sru:httpAccept?&lt;/b>}
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;"
/&amp;gt;
&lt;/pre>
&lt;p>Note here that both parameters have been specified as being optional. Also the namespace here is pointed at a fictional OpenSearch extension document. (It doesn’t need to point to such a document - could be anything - but it is recommended that there be a specification.)&lt;/p>
&lt;p>I’m not aware of any such OpenSearch extension document for SRU currently existing but would be prepared to contribute to drafting such a document. It seems to me that it would be would be very useful for general OpenSearch/SRU compatibility and probably should detail all the SRU 2.0 parameters for “searchRetrieve”. In fact, that document could be the SRU spec itself, once that was established at a fixed URL. (Whether there should be a specific OpenSearch extension document depends on whether it would be useful to provide OpenSearch implementation details.)&lt;/p>
&lt;p>&lt;em>Case 3: SRU (Native Client)&lt;/em>&lt;/p>
&lt;p>This is easy. We’re on home ground now. The query type is by default CQL, and the result format is SRU XML. The only thing that might be specified is “recordSchema” to require a schema for the result records, if there are alternate schemas supported by the Search Web Service. A default for the result records is anyway supplied.&lt;/p>
&lt;p>&lt;em>Case 4: SRU (Media-Typed Client)&lt;/em>&lt;/p>
&lt;p>Again, we’re on familiar ground. For a media-savvy SRU interface we would need to use the SRU 2.0 parameter “http:accept”. This could be used to override the default SRU XML with an alternate format, e.g. RSS.&lt;/p>
&lt;p>And that’s about it for this review of aligning the OpenSearch and SRU interfaces. It seems that using URL templates and OpenSearch extensions as indicated should allow for an easy OpenSearch interface onto an SRU-based Search Web Service. At a minimum we just need a permanent URL for the SRU 2.0 spec (when finalized). Alternately a separate OpenSearch extension document could be drafted and registered. That would allow for details specific to OpenSearch to be provided, as well as bringing SRU closer into the OpenSearch realm. And such a document could be created now and updated with the URL for the SRU 2.0 spec as it progresses from draft to final.&lt;/p></description></item><item><title>Search Web Service</title><link>https://www.crossref.org/blog/search-web-service/</link><pubDate>Sat, 30 May 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/search-web-service/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/search-web-service.png">&lt;img alt="search-web-service.png" src="https://www.crossref.org/wp/blog/images/search-web-service.png" width="405" height="303" border="0" />&lt;/a>&lt;/p>
&lt;p>(Click image to enlarge graphic.)&lt;/p>
&lt;p>While the &lt;a href="http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=search-ws" target="_blank">OASIS Search Web Services TC&lt;/a> is currently working towards reconciling SRU and OpenSearch, I thought it would be useful to share here a simple graphic outlining how a search web service for structured search might be architected.&lt;/p>
&lt;p>Basically there are two views of this search web service (described in separate XML description files and discoverable through autodiscovery links added to HTML pages):&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://www.opensearch.org/" target="_blank">OpenSearch&lt;/a>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU (Search and Retrieve by URL)&lt;/a>&lt;/ul>
One can see at a glance that there’s more happening down in the SRU layer. The SRU layer implements a heavyweight, robust service which provides a detailed listing of search indexes and index relations in the description document (‘SRU Explain’), is searchable using a standard query grammar - &lt;a href="http://www.loc.gov/standards/sru/specs/cql.html" target="_blank">CQL&lt;/a> (‘Contextual Query Language’), responds with result sets inside a standard XML wrapper and expressed as an XML record set (e.g. PAM) that is validatable using W3C XML Schema, and makes available a full roster of diagnostics.&lt;/p>
&lt;p>By contrast the OpenSearch layer provides a lightweight view onto the search web service in which a simple opaque query string is sent to the server and a simple XML result set returned (usually RSS or Atom). Again a description document is made available (‘OpenSearch Description’) but this is much more coarse grained than the SRU description - e.g. it does not specify query components such as indexes or relations.&lt;/p>
&lt;p>In practice, both views can be provided for by the same search web service. While OpenSearch does not specify any structured query it can make use of a CQL packaged query. That is, a single parameter value for the OpenSearch ‘query’ parameter can be unpacked by a CQL parser to yield a complex search query. The search query does not need to be splattered all over the URL querystring which is already using its parameter set to provide control information for the search (e.g. pagination, encoding and the like).&lt;/p>
&lt;p>And how would this relate to existing platform-hosted search services? Well, such services are usually bound to the host platform and are not intended to support remote applications. A search web service, on the other hand, would be ideally suited to offering direct support for running structured searches on platform-hosted content using off-platform apps.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Structured Search Using PRISM Elements</title><link>https://www.crossref.org/blog/structured-search-using-prism-elements/</link><pubDate>Sat, 30 May 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/structured-search-using-prism-elements/</guid><description>&lt;p>We just registered in the &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU&lt;/a> (Search and Retrieve by URL) search registry the following components:&lt;/p>
&lt;dl>
&lt;dt>&lt;strong>&lt;em>Context Sets&lt;/em>&lt;/strong>&lt;/p>&lt;/dt>
&lt;dd>&lt;ul>
&lt;li>&lt;a href="http://www.loc.gov/standards/sru/cql/contextSets/prism-context-set-v2-0.html" target="_blank">PRISM Context Set version 2.0&lt;/a>
&lt;ul>
&lt;li>
&lt;dl>
&lt;dt>&lt;a href="http://www.loc.gov/standards/sru/cql/contextSets/prism-context-set-v2-1.html" target="_blank">PRISM Context Set version 2.1&lt;/a>&lt;/ul>&lt;/dt>
&lt;dt>&lt;strong>&lt;em>Schemas&lt;/em>&lt;/strong>&lt;/p>&lt;/dt>
&lt;dd>
&lt;ul>
&lt;li>PRISM Aggregator Message Record Schema Version 2.0
&lt;ul>
&lt;li>
&lt;p>PRISM Aggregator Message Record Schema Version 2.1&lt;/ul> &lt;/dl>
This means that an &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">SRU&lt;/a> (Search and Retrieve by URL) search engine that supported one of the PRISM context sets registered above could accept &lt;a href="http://www.loc.gov/standards/sru/specs/cql.html" target="_blank">CQL&lt;/a> (Contextual Query Language) queries such as the following:&lt;/p>
&lt;ol>
&lt;li>&lt;tt>prism.doi = &amp;ldquo;10.1038/nature05398&amp;rdquo;&lt;/tt>
&lt;ul>
&lt;li>&lt;tt>prism.publicationName = &amp;ldquo;Nature&amp;rdquo; and prism.volume = &amp;ldquo;444&amp;rdquo; and prism.number = &amp;ldquo;7119&amp;rdquo; and prism.startingPage = &amp;ldquo;E9&amp;rdquo;&lt;/tt>
&lt;ul>
&lt;li>&lt;tt>dc.identifier = &amp;ldquo;doi:10.1038/nature05398&amp;rdquo;&lt;/tt>
&lt;ul>
&lt;li>&lt;tt>dc.creator = &amp;ldquo;Jones-Smith&amp;rdquo; and prism.publicationName = &amp;ldquo;Nature&amp;rdquo; and prism.publicationDate &amp;gt; &amp;ldquo;2006-01-01&amp;rdquo;&lt;/tt>
&lt;ul>
&lt;li>&lt;tt>dc.title any &amp;ldquo;fractal pollock&amp;rdquo; and prism.publicationName = &amp;ldquo;Nature&amp;rdquo; sortBy prism.publicationDate/sort.descending&lt;/tt>
&lt;ul>
&lt;li>
&lt;p>&lt;tt>&amp;ldquo;fractal anlysis&amp;rdquo; and prism.publicationDate within &amp;ldquo;2005-01-01 2008-12-31&amp;rdquo; sortBy dc.creator/sort.ascending&lt;/tt>&lt;/ol>
(Note that the quotes are only needed above for the DOI strings which contain a “/” character. Otherwise they are optional in the above examples.)&lt;/p>
&lt;p>Any query such as one of the above (here #1) could be sent to the server on a querystring like so:&lt;/p>&lt;/p>
&lt;blockquote>
&lt;p>&lt;tt>?version=1.1&amp;amp;operation=searchRetrieve&amp;amp;query=prism.doi=%2210.1038/nature05398%22&lt;/tt>&lt;/p>
&lt;/blockquote>
&lt;p>and if the server were also equipped to respond with &lt;a href="https://www.idealliance.org/pam" target="_blank">PAM&lt;/a> (PRISM Aggregator Message) format for result records, a response might look like this:&lt;/p>
&lt;img alt="fractal-analysis-pam.jpg" src="https://www.crossref.org/wp/blog/images/fractal-analysis-pam.jpg" width="686" height="450" />
&lt;p>PAM was discussed &lt;a href="https://www.crossref.org/blog/prism-aggregator-message/">here&lt;/a> earlier.&lt;/p>
Such a structured response would provide the metadata elements for applications to build various interfaces into the original article:&lt;/p>
&lt;img alt="fractal-analysis.jpg" src="https://www.crossref.org/wp/blog/images/fractal-analysis.jpg" width="459" height="401" />
&lt;p>We think that these PRISM components (context sets and schemas) will be useful for structured search of scholarly publications.&lt;/p>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/dd>
&lt;/dl>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/dd>
&lt;/dl></description></item><item><title>OAI-ORE: Workshop Slides</title><link>https://www.crossref.org/blog/oai-ore-workshop-slides/</link><pubDate>Tue, 26 May 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/oai-ore-workshop-slides/</guid><description>&lt;div id="__ss_1465963">
&lt;a href="http://www.slideshare.net/hvdsomp/an-overview-of-the-oai-object-reuse-and-exchange-interoperability-framework?type=presentation" title="An Overview of the OAI Object Reuse and Exchange Interoperability Framework">An Overview of the OAI Object Reuse and Exchange Interoperability Framework&lt;/a>&lt;/p>
&lt;div >
View more Microsoft Word documents from &lt;a href="http://www.slideshare.net/hvdsomp">hvdsomp&lt;/a>.
&lt;/div>
&lt;/div>
&lt;p>This is a very slick presentation by Herbert Van de Sompel on &lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a> which he’s due to give today for a &lt;a href="https://web.archive.org/web/20090615233000/http://www.inforum.cz/en/workshop/" target="_blank">workshop&lt;/a> at the &lt;a href="https://web.archive.org/web/20090615233000/http://www.inforum.cz/en/workshop/" target="_blank">INFORUM 2009 15th Conference on Prrofessional Information Resources&lt;/a> in Prague. It’s on the long side at 167 slides but even if you just flip though or sample it selectively you’ll be bound to come away with something.&lt;/p>
&lt;p>Describing aggregations of resources is a subject that really has to be of interest to Crossref publishers.&lt;/p></description></item><item><title>PRISM Aggregator Message</title><link>https://www.crossref.org/blog/prism-aggregator-message/</link><pubDate>Fri, 08 May 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/prism-aggregator-message/</guid><description>&lt;p>The new OAI-PMH interface to Nature.com sports one particular novelty which may well be of interest here: it makes use of the &lt;a href="https://www.idealliance.org/pam/" target="_blank">PRISM Aggregator Message&lt;/a>. (For an announcement of this service see the post on our web publishing blog &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/" target="_blank">Nascent&lt;/a>.)&lt;/p>
&lt;p>As a protocol for the harvesting of metadata records within a digital repository, OAI-PMH records may be expressed in a variety of different metadata formats. For reasons of interoperability a base metadata format (‘Dublin Core’) is mandated for all OAI-PMH implementations. The expectation is that this base format would be augmented by community-specific vocabularies.&lt;/p>
&lt;p>Our natural inclination was to mirror the article descriptions which we already circulate in our RSS feeds and within our HTML pages (as META tags) and PDF files (as XMP packets). In these cases we have used open data models (e.g. RDF) with simple properties cherry-picked from the DC and PRISM namespaces. But OAI-PMH has a special ‘gotcha’ in this regard: any metadata format must allow for W3C XML Schema validation. That is, the properties need to be constrained by an XSD data model. Enter PRISM Aggregator Message (PAM).&lt;/p>
&lt;p>(Continues)&lt;/p>
&lt;p>For the longest time I must confess I did not ‘get’ what PAM was about. PRISM was clearly a metadata vocabulary and yet with PAM there was all this wrangling with content, which as an academic publisher we frankly had no interest in as we already had our own journal article DTD and for interop we were beginning to look at &lt;a href="http://dtd.nlm.nih.gov/" target="_blank">NLM DTD&lt;/a>. And then it dawned on me (albeit slowly) that the PAM DTD is the equivalent to NLM DTD but for trade magazine publishing, where there might not be such a strong practice of XML. And since the release of PRISM 2.0 (February 2008) there was now also an W3C XML Schema defined for PAM. (Note that the latest revision of PRISM 2.1 is about to be published, although the changes there do not have any bearing on this implementation.)&lt;/p>
&lt;p>So, PAM defines PRISM elements to be used with XML content markup. Examining further reveals that within a PAM message there are one or more articles with metadata packaged into a head section, and content (if present) in a body section.&lt;/p>
&lt;img alt="pam-message.png" src="https://www.crossref.org/wp/blog/images/pam-message.png" width="362" height="71" />
&lt;p>Section 4.3 in the PAM 2.0 specification lists the allowable head elements by logical grouping, 11 in all: &lt;em>key elements&lt;/em>, &lt;em>title&lt;/em>, &lt;em>creative origin&lt;/em>, &lt;em>publication&lt;/em>, &lt;em>publication date&lt;/em>, &lt;em>additional article ID&lt;/em>, &lt;em>positional&lt;/em>, &lt;em>topic&lt;/em>, &lt;em>length&lt;/em>, &lt;em>related content&lt;/em>, &lt;em>rights &amp;amp; usage&lt;/em>. Note that not all PRISM elements are supported; in fact only 43 of the 57 PRISM 2.0 elements are supported. Among the missing are ‘&lt;tt>prism:endingPage&lt;/tt>‘. Also only 7 of the 15 DC elements are supported. Nevertheless we found that the bulk of the article descriptions could easily be accommodated within the PAM format. And because this is W3C XML Schema constrained there is an element ordering prescribed, and hence there is an interleaving of DC and PRISM elements.&lt;/p>
&lt;p>The Nature.com OAI-PMH service has two access points:&lt;/p>
&lt;dl>
&lt;dt>&lt;em>User interface:&lt;/em>&lt;/dt>
&lt;dd>&lt;a href="http://www.nature.com/oai" target="_blank">http://www.nature.com/oai&lt;/a>&lt;/dd>
&lt;dt>&lt;em>Service endpoint:&lt;/em>&lt;/dt>
&lt;dd>&lt;a href="http://www.nature.com/oai/request" target="_blank">http://www.nature.com/oai/request&lt;/a>&lt;/dd>
&lt;/dl>
&lt;p>So, to work an example, if we want to get the record for &lt;strong>doi:10.1038/nature01234&lt;/strong> (which has an OAI-PMH identifier of &lt;strong>oai:nature.com:10.1038/nature01234&lt;/strong>) we could use this call to get the description in PAM format:&lt;/p>
&lt;p>&lt;a href="http://www.nature.com/oai/request?verb=GetRecord&amp;amp;#038;identifier=10.1038/nature01234&amp;amp;#038;metadataPrefix=pam" target="_blank">http://www.nature.com/oai/request?verb=GetRecord&amp;#038;identifier=10.1038/nature01234&amp;#038;metadataPrefix=pam&lt;/a>&lt;/p>
&lt;p>(Note that as a convenience for the user we also allow a DOI to be used directly in place of the full OAI-PMH identifier as there is a one-to-one correspondence between the two within our repository. Simplifies cut and paste operations.)&lt;/p>
&lt;p>This returns the following properties (shown in document order and by PAM logical grouping):&lt;/p>
&lt;img alt="pam-elements.jpg" src="https://www.crossref.org/wp/blog/images/pam-elements.jpg" width="462" height="450" />
&lt;p>With PAM we are thus able to replicate in OAI-PMH the same journal article descriptions that we are currently disseminating through other service/content channels.&lt;/p></description></item><item><title>Crossref’s OpenURL query interface</title><link>https://www.crossref.org/blog/crossrefs-openurl-query-interface/</link><pubDate>Wed, 06 May 2009 00:00:00 +0000</pubDate><author>Chuck Koscher</author><guid>https://www.crossref.org/blog/crossrefs-openurl-query-interface/</guid><description>&lt;p>Over the past two weeks we’ve focused on our OpenURL query interface with the goal being to improve its reliability. I’d like to mention some things we’ve done.&lt;/p>
&lt;ol>
&lt;li>
&lt;p>We now require an OpenURL account to use this interface (see &lt;a href="https://apps.crossref.org/requestaccount/" target="_blank">the registration page&lt;/a>) . This account is still free, there are no fixed usage limits, and the terms of use have been greatly simplified.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Resources have been re-arranged dedicating more horse-power to the OpenURL function.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>The OpenURL function is now in our advanced monitoring function which means some lucky staff member will be getting phone calls at 3AM (me included!).&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>I should note that #1 has already reduced inappropriate usage. This also is not the end of planned changes. Crossref has undertaken a major rewrite of parts of our system and this will include the OpenURL interface.&lt;/p>
&lt;p>Chuck&lt;/p></description></item><item><title>OCLC defines requirements for a &amp;#8220;Cooperative Identities Hub&amp;#8221;</title><link>https://www.crossref.org/blog/oclc-defines-requirements-for-a-cooperative-identities-hub/</link><pubDate>Fri, 01 May 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/oclc-defines-requirements-for-a-cooperative-identities-hub/</guid><description>&lt;p>OCLC has &lt;a href="https://web.archive.org/web/20101210071719/http://www.oclc.org/research/publications/library/2009/2009-05.pdf" target="_blank">published a report&lt;/a> (PDF) identifying some requirements for what they call a “Cooperative Identities Hub”. A quick glance through it seems to show that the use cases focus on what we are calling the “Knowledge Discovery” use cases. As I mentioned in my &lt;a href="https://web.archive.org/web/20091225201433/http://network.nature.com/people/mfenner/blog/2009/02/17/interview-with-geoffrey-bilder" target="_blank">interview with Martin Fenner&lt;/a>, there is also a category of “authentication” use cases that I think needs to be addressed by a contributor identifier system. Still, this is a good report that highlights many of the complexities that an identifier system needs to address.&lt;/p></description></item><item><title>What do people want from an author identifier?</title><link>https://www.crossref.org/blog/what-do-people-want-from-an-author-identifier/</link><pubDate>Mon, 27 Apr 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/what-do-people-want-from-an-author-identifier/</guid><description>&lt;p>&lt;span >&lt;a href="https://web.archive.org/web/20090219202623/http://network.nature.com/people/mfenner/profile">Martin Fenner&lt;/a> continues his interest in the subject of author identifiers. He recently &lt;a href="https://web.archive.org/web/20090417062326/http://network.nature.com/people/mfenner/blog/2009/04/13/a-few-questions-about-author-identifiers">posted an online poll&lt;/a> asking people some specific questions about how they would like to see an author identifier implemented.&lt;sup>*&lt;/sup>&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;a href="https://web.archive.org/web/20090429164110/http://network.nature.com/people/mfenner/blog/2009/04/26/a-few-questions-about-author-identifiers-the-answers">The results of the poll&lt;/a> are in and, though the sample was very small, the results are interesting. The responses are both gratifying -there seems to be a general belief that Crossref has a roll to play here- and perplexing -most think the identifier needs to identify other “contributors” to the scholarly communications process- yet there seems to be a preference for the moniker “digital author identifier”. This latter preference is certainly a surprise to us as we had been focusing our efforts on identifying analog authors. The only “digital authors” I know of are &lt;a href="http://pdos.csail.mit.edu/scigen/">this one at at MIT&lt;/a> and possibly &lt;a href="http://news.bbc.co.uk/2/hi/science/nature/7979113.stm">this one at Aberystwyth University.&lt;/a> 😉&lt;/span>&lt;/p>
&lt;p>&lt;span >Anyway, There are some &lt;a href="https://www.facebook.com/e/bb174794-519c-02a7-8a00-9283013298d8/A-few-questions-about-author-identifiers-the">additional reactions&lt;/a> to Martin’s poll on FriendFeed.&lt;/span>&lt;/p>
&lt;p>&lt;span >Finally, I should have blogged about this earlier, but the March issue of &lt;em>Science&lt;/em> included a summary of the initiatives and discussions surrounding the creation of an industry “author identifier” in an article titled “Are You Ready to Become a Number” (&lt;a href="http://dx.doi.org/10.1126/science.323.5922.1662">&lt;a href="http://dx.doi.org/10.1126/science.323.5922.1662" target="_blank">http://dx.doi.org/10.1126/science.323.5922.1662&lt;/a>&lt;/a>).&lt;/span>&lt;/p>
&lt;p>&lt;span >In pointing people at this, I feel like I must make a clarification to the article. In short, I don’t think any of our members would “force” anybody to use an author identifier whether it came from Crossref or from anybody else. Though it is likely that in the interview I used the terms “carrot” and “stick”, in truth publisher’s would, instead of “a stick”, at most wield a &lt;a href="http://en.wikipedia.org/wiki/Nerf">Nerf&lt;/a> bat. Having said that, the essential point remains- even if most major publishers &lt;em>strongly&lt;/em> encouraged all of their authors to use the system, it would take several years before the system had a critical mass of data.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;sup>*&lt;/sup>Note that I deliberately didn’t point CrossTech readers at this poll as it was being conducted because I thought doing so might introduce a Crossref bias.&lt;/span>&lt;/p></description></item><item><title>Introductory Signals</title><link>https://www.crossref.org/blog/introductory-signals/</link><pubDate>Mon, 23 Mar 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/introductory-signals/</guid><description>&lt;p>So while doing some background reading today I realized that legal citations already widely support a form of “&lt;a href="https://www.crossref.org/blog/citation-typing-ontology/">citation typing&lt;/a>” in the form of “&lt;a href="http://en.wikipedia.org/wiki/Citation_signal" target="_blank">Introductory Signals&lt;/a>“. The 10 introductory signals break down as follows…&lt;/p>
&lt;p>In support of an argument:&lt;/p>
&lt;p>   1) [no signal]. (NB that, apparently, this is increasingly deprecated.)&lt;/p>
&lt;p>   2) accord;&lt;/p>
&lt;p>   3) see;&lt;/p>
&lt;p>   4) see also;&lt;/p>
&lt;p>   5) cf.;&lt;/p>
&lt;p>For Comparisons:&lt;/p>
&lt;p>   6) compare … with …;&lt;/p>
&lt;p>For contradiction:&lt;/p>
&lt;p>   7) but see;&lt;/p>
&lt;p>   8) but cf.;&lt;/p>
&lt;p>For background:&lt;/p>
&lt;p>   9) see generally;&lt;/p>
&lt;p>And for examples:&lt;/p>
&lt;p>   10) e.g.&lt;/p>
&lt;p>Clever lawyers.&lt;/p></description></item><item><title>Citation Typing Ontology</title><link>https://www.crossref.org/blog/citation-typing-ontology/</link><pubDate>Fri, 20 Mar 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/citation-typing-ontology/</guid><description>&lt;p>I was happy to read David Shotton’s recent &lt;a href="http://www.ingentaconnect.com/contentone/alpsp/lp/2009/00000022/00000002/art00002" target="_blank">&lt;em>Learned Publishing&lt;/em>&lt;/a> article, &lt;a href="http://dx.doi.org/10.1087/2009202" target="_blank">&lt;em>Semantic Publishing: The Coming Revolution in scientific journal publishing&lt;/em>&lt;/a>, and see that he and his team have drafted a &lt;a href="http://imageweb.zoo.ox.ac.uk/pub/2009/citobase/cito-20090311/cito-content/owldoc/" target="_blank">Citation Typing Ontology&lt;/a>.&lt;sup>*&lt;/sup>&lt;/p>
&lt;p>Anybody who has seen me speak at conferences knows that I often like to proselytize about the concept of the “typed link”, a notion that hypertext pioneer, &lt;a href="https://web.archive.org/web/20090609163002/http://www.workpractice.com/trigg//" target="_blank">Randy Trigg&lt;/a>, discussed extensively &lt;a>in his 1983 &lt;a href="https://web.archive.org/web/20090609163002/http://www.workpractice.com/trigg//thesis-default.html">Ph.D. thesis.&lt;/a>. Basically, Trigg points out something that should be fairly obvious- a citation (i.e. “a link”) is not &lt;em>always&lt;/em> a “vote” in favor of the thing being cited.&lt;br /> In fact, there are all sorts of reasons that an author might want to cite something. They might be elaborating on the item cited, they might be critiquing the item cited, they might even be trying to refute the item cited (For an exhaustive and entertaining survey of the use and abuse of citations in the humanities, &lt;a href="http://en.wikipedia.org/wiki/Anthony_Grafton">Anthony Grafton&lt;/a>‘s, &lt;a href="http://www.amazon.com/Footnote-Curious-History-Anthony-Grafton/dp/0571196012/ref=sr_1_2?ie=UTF8&amp;#038;s=books&amp;#038;qid=1237549279&amp;#038;sr=1-2">The Footnote: A Curious History&lt;/a>, is a rich source of examples)&lt;br /> Unfortunately, the naive assumption that a citation is tantamount to a vote of confidence has become inshrined in everything from the way in which we measure scholarly reputation, to the way in which we &lt;a href="http://www.hefce.ac.uk/Research/ref/">fund universities&lt;/a> and the way in which search engines rank their results. The distorting affect of this assumption is profound. If nothing else, it leads to a perverse situation in which people will often discuss books, articles, and blog postings that they disagree with without actually citing the relevant content, just so that they can avoid inadvertently conferring “&lt;a href="http://en.wikipedia.org/wiki/Whuffie">wuffie&lt;/a>” on the item being discussed. This can’t be right.&lt;br /> Having said that, there has been a half-hearted attempt to introduce a gross level of link typology with the introduction of the &lt;a href="http://en.wikipedia.org/wiki/Nofollow">“nofollow” link attribute&lt;/a>- an initiative started by Google in order to try to address the increasing problem of &lt;a href="http://en.wikipedia.org/wiki/Spamdexing">“Spamdexing”&lt;/a>. But this is a pretty ham-fisted form of link typing- particularly in the way it is implemented by the Wikipedia where Crossref DOI links to formally published scholarly literature have a “nofollow” attribute attached to them but, inexplicably, items with a &lt;a href="http://en.wikipedia.org/wiki/PMID">PMID&lt;/a> are not so hobbled (view the HTML source of &lt;a href="http://en.wikipedia.org/wiki/Prion">this page&lt;/a>, for example). Essentially, this means that, the Wikipedia is a black-hole of reputation. That is, it absorbs reputation (through links too the Wikipedia), but it doesn’t let reputation back out again. Hell, I feel dirty for even linking to it here ;-).&lt;br /> Anyway, scholarly publishers should certainly read Shotton’s article because it is full of good, and practical ideas about what can can be done with today’s technology in order to help us move beyond the “digital incunabula” that the industry is currently churning out. The &lt;a href="https://web.archive.org/web/20090420020704/http://imageweb.zoo.ox.ac.uk/pub/2008/plospaper/latest">sample semantic article&lt;/a> that Shotton’s team created is inspirational and I particularly encourage people to look at &lt;a href="https://web.archive.org/web/20090607084935/http://imageweb.zoo.ox.ac.uk/pub/2008/plospaper/latest/machine/citationinfo.n3">the source file for the ontology-enhanced bibliography&lt;/a> which reveals just how much more useful metadata can be associated with the humble citation.&lt;br /> And now I wonder whether &lt;a href="http://www.citeulike.org/">CiteULike&lt;/a>, &lt;a href="https://web.archive.org/web/20061205061750/http://www.connotea.org/">Connotea&lt;/a>, &lt;a href="http://www.2collab.com/nonLoggedInHomePage;jsessionid=CC0849D76677D585AE1DC3B3139B32A1">2Collab&lt;/a> or &lt;a href="http://www.zotero.org/">Zotero&lt;/a> will consider adding support for the CItation Typing Ontology into their respective services?&lt;br /> * Disclosure:&lt;br /> a) I am on the editorial board of &lt;em>Learned Publishing&lt;/em>&lt;br /> b) Crossref has consulted with David Shotton on the subject of semantically enhancing journal articles&lt;/p>&lt;/p></description></item><item><title>Researcher Identification Primer</title><link>https://www.crossref.org/blog/researcher-identification-primer/</link><pubDate>Wed, 11 Mar 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/researcher-identification-primer/</guid><description>&lt;p>&lt;a href="https://cameronneylon.net/blog/a-specialist-openid-service-to-provide-unique-researcher-ids/" target="_blank">Discussions around “contributor Ids”&lt;/a> (aka “Author ID, Researcher ID, etc.) seem to be becoming quite popular. In &lt;a href="https://web.archive.org/web/20091225201433/http://network.nature.com/people/mfenner/blog/2009/02/17/interview-with-geoffrey-bilder" target="_blank">the interview&lt;/a> that I pointed to in my &lt;a href="https://www.crossref.org/blog/an-interview-about-author-ids/">last post&lt;/a>, I mentioned that Crossref has been talking with a group of researchers who were very interested in creating some sort of authenticated contributor ID as a mechanism for controlling who gets trusted access to sensitive genome-wide aggregate genotype data.&lt;/p>
&lt;p>Well, I’m delighted to say that said group of researchers(at the &lt;a href="http://www.gen2phen.org/" target="_blank">GEN2PHEN&lt;/a> project) have created a “&lt;a href="https://web.archive.org/web/20090418151033/http://www.gen2phen.org/researcher-identification/researcher-identification-primer" target="_blank">Researcher Identification Primer&lt;/a>” website in which they outline the many use-cases and issues around creating a mechanism for unambiguously identifying and/or authenticating researchers. This looks like a great resource and I expect it will serve as a useful focus for further discussion around the issue.&lt;/p></description></item><item><title>An interview about &amp;#8220;Author IDs&amp;#8221;</title><link>https://www.crossref.org/blog/an-interview-about-author-ids/</link><pubDate>Thu, 19 Feb 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/an-interview-about-author-ids/</guid><description>&lt;p>Over the past few months there seems to have been a &lt;a href="https://cameronneylon.net/blog/a-specialist-openid-service-to-provide-unique-researcher-ids/" target="_blank">sharp upturn in general interest&lt;/a> around implementing an “author identifier” system for the scholarly community. This, in turn, has meant that more people have been getting in touch with us about our nascent “Contributor ID” project. The other day, after seeing my comments in the above thread, &lt;a href="https://web.archive.org/web/20090219202623/http://network.nature.com/people/mfenner/profile" target="_blank">Martin Fenner&lt;/a> asked if he could interview me about the issue of author identifiers for &lt;a href="https://web.archive.org/web/20090228185451/http://network.nature.com/people/mfenner/blog" target="_blank">his blog on Nature Networks, Gobbledygook&lt;/a>. I agreed and he &lt;a href="https://web.archive.org/web/20091225201433/http://network.nature.com/people/mfenner/blog/2009/02/17/interview-with-geoffrey-bilder" target="_blank">posted the interview&lt;/a> the other day.&lt;/p>
&lt;p>I warn you ahead of time, I did ramble on a bit and the interview is long. There is a lot of stuff at the beginning about the DOI and it might seem off-topic, but I do think that there is a lot that we can learn from our DOI experiences which would apply to any author identifier. Just be thankful I didn’t start talking about the privacy issues that will inevitably arise from any author identifier system. If I had, the interview would have probably gone on for another six pages ;-).&lt;/p>
&lt;p>Anyway, as most of our membership knows, we have a pilot project underway to explore what it would take to launch a “Crossref Contributor ID” system. We still haven’t concluded whether it makes sense for us to do it, but one thing is clear from the recent discussions we’ve had and that is that, if we don’t do it, somebody else almost certainly will.&lt;/p></description></item><item><title>Real PRISM in the RSS Wilds</title><link>https://www.crossref.org/blog/real-prism-in-the-rss-wilds/</link><pubDate>Thu, 19 Feb 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/real-prism-in-the-rss-wilds/</guid><description>&lt;p>Alf Eaton just &lt;a href="http://hublog.hubmed.org/archives/001818.html" target="_blank">posted&lt;/a> a real nice analysis of &lt;a href="http://www.tictocs.ac.uk/" target="_blank">ticTOCs&lt;/a> RSS feeds. Good to see that almost half of the feeds (46%) are now in RDF and that fully a third (34%) are using PRISM metadata to disclose bibliographic fields.&lt;/p>
&lt;p>The one downside from a Crossref point of view is that these feeds are still using the old PRISM version (1.2) and not the new version (2.0) which was released a year ago and blogged &lt;a href="https://www.crossref.org/blog/prismdoi/">here&lt;/a>. That version supports the elements &lt;strong>prism:doi&lt;/strong> for the bare DOI, as well as &lt;strong>prism:url&lt;/strong> for the DOI proxy server URL.&lt;/p>
&lt;p>There are still some improvements to be made in serving up these feeds (as Alf’s analysis shows for record type), but overall things are looking pretty good. 🙂&lt;/p></description></item><item><title>DOIs in an iPhone application</title><link>https://www.crossref.org/blog/dois-in-an-iphone-application/</link><pubDate>Thu, 12 Feb 2009 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/dois-in-an-iphone-application/</guid><description>&lt;p>Very cool to see Alexander Griekspoor releasing an iPhone version of his award-winning Papers application. A while ago Alex intigrated DOI metadata lookup into the Mac version of papers and now I can get a silly thrill from seeing Crossref DOIs integrated in an iPhone app. Alex has just posted &lt;a href="https://web.archive.org/web/20100317112846/http://mekentosj.com/papers/iphone/" target="_blank">a preview video of the iPhone application&lt;/a> and it includes a cameo appearance by a DOI. Yay.&lt;/p></description></item><item><title>CURIE Syntax 1.0</title><link>https://www.crossref.org/blog/curie-syntax-1.0/</link><pubDate>Mon, 19 Jan 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/curie-syntax-1.0/</guid><description>&lt;p>The W3C has recently (Jan. 16) released &lt;a href="http://www.w3.org/TR/2009/CR-curie-20090116/" target="_blank">CURIE Syntax 1.0&lt;/a> as a Candidate Recommendation and is inviting implementations.&lt;/p>
&lt;p>(Note that I made a fuller post &lt;a href="https://www.crossref.org/blog/curies-a-cure-for-uris/">here&lt;/a> on CURIEs and erroneously confused the &lt;a href="http://www.w3.org/MarkUp/2008/ED-curie-20081023/" target="_blank">Editor’s Draft (Oct. 23, ’08)&lt;/a> as being a Candidate Recommendation. Well, at least it’s got there now.)&lt;/p></description></item><item><title>Standard InChI Defined</title><link>https://www.crossref.org/blog/standard-inchi-defined/</link><pubDate>Sat, 17 Jan 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/standard-inchi-defined/</guid><description>&lt;p>IUPAC has just released &lt;a href="https://web.archive.org/web/20090616040900/http://www.iupac.org/inchi/release102final.html" target="_blank">the final version (1.02)&lt;/a> of its &lt;a href="https://web.archive.org/web/20090206153708/http://iupac.org/inchi/download/index.html" target="_blank">InChI software&lt;/a>, which generates Standard InChIs and Standard InChIKeys. (InChI is the IUPAC International Chemical Identifier.)&lt;/p>
&lt;p>The Standard InChI &lt;em>“removes options for properties such as tautomerism and stereoconfiguration”&lt;/em>, so that a molecule will always generate the same stable identifier - a unique InChI - which facilitates &lt;em>“interoperability/compatibility between large databases/web searching and information exchange”&lt;/em>. Note also that any &lt;em>“shortcomings in Standard InChI may be addressed using non-Standard InChI (currently obtainable using InChI version 1.02beta)”&lt;/em>.&lt;/p>
&lt;p>On a practical level this means that the 27-character length InChIKeys (a hashed form of the InChI), with the following generic form&lt;/p>
&lt;p>&lt;code>AAAAAAAAAAAAAA-BBBBBBBBFV-P&lt;/code>&lt;/p>
&lt;p>can now be readily and reliably generated and will start to be used in search indexing and linking applications.&lt;/p></description></item><item><title>XMP Library for Flash</title><link>https://www.crossref.org/blog/xmp-library-for-flash/</link><pubDate>Fri, 16 Jan 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-library-for-flash/</guid><description>&lt;p>Update about new &lt;a href="https://web.archive.org/web/20191031084320/http://www.adobe.com/devnet/xmp.html" target="_blank">XMP Library&lt;/a> from &lt;a href="http://labs.adobe.com/" target="_blank">Adobe Labs&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“The new Adobe XMP Library for ActionScript is now available for download on Adobe Labs. Adobe Extensible Metadata Platform (XMP) is a labeling technology that allows you to embed data about a file, known as metadata, into the file itself. XMP is an open technology based on RDF and RDF/XML. &lt;strong>With this new library you can read existing XMP metadata from Flash based file formats via the Adobe Flash Player.&lt;/strong>“&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Any volunteers?&lt;/p></description></item><item><title>Poorboy Metadata Hack</title><link>https://www.crossref.org/blog/poorboy-metadata-hack/</link><pubDate>Tue, 06 Jan 2009 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/poorboy-metadata-hack/</guid><description>&lt;p>I was playing around recently and ran across this little metadata hack. At first, I thought somebody was doing something new. But no, nothing so forward apparently. (Heh! 🙂&lt;/p>
&lt;p>I was attempting to grab the response headers from an HTTP request on an article page and was using by default the Perl &lt;a href="http://search.cpan.org/dist/libwww-perl/lib/LWP.pm" target="_blank">LWP&lt;/a> library. For some reason I was getting metadata elements being spewed out as response headers - at least from some of the sites I tested. With some further investigation I tracked this back to LWP itself which parses HTML headers and generates HTTP pseudo-headers using an &lt;code>X-Meta-&lt;/code> style header. (This can be viewed either as a feature of LWP or a bug as &lt;a href="https://web.archive.org/web/20090205050504/http://www.semicomplete.com/blog/geekery/show-headers-in-get-request.html" target="_blank">this article&lt;/a> bemoans.)&lt;/p>
&lt;p>What this means anyway is that I can issue a simple call like this to get the HTML metadata - shown here for &lt;a href="http://dx.doi.org/10.1087/095315108X288947" target="_blank">doi:10.1087/095315108X288947&lt;/a>:&lt;/p>
&lt;p>``I was playing around recently and ran across this little metadata hack. At first, I thought somebody was doing something new. But no, nothing so forward apparently. (Heh! 🙂&lt;/p>
&lt;p>I was attempting to grab the response headers from an HTTP request on an article page and was using by default the Perl &lt;a href="http://search.cpan.org/dist/libwww-perl/lib/LWP.pm" target="_blank">LWP&lt;/a> library. For some reason I was getting metadata elements being spewed out as response headers - at least from some of the sites I tested. With some further investigation I tracked this back to LWP itself which parses HTML headers and generates HTTP pseudo-headers using an &lt;code>X-Meta-&lt;/code> style header. (This can be viewed either as a feature of LWP or a bug as &lt;a href="https://web.archive.org/web/20090205050504/http://www.semicomplete.com/blog/geekery/show-headers-in-get-request.html" target="_blank">this article&lt;/a> bemoans.)&lt;/p>
&lt;p>What this means anyway is that I can issue a simple call like this to get the HTML metadata - shown here for &lt;a href="http://dx.doi.org/10.1087/095315108X288947" target="_blank">doi:10.1087/095315108X288947&lt;/a>:&lt;/p>
&lt;p>``&lt;/p>
&lt;p>This shows a simple (read lazy) means of accessing metadata added as &lt;code>&amp;lt;meta&amp;gt;&lt;/code> tags in HTML headers, such as those we &lt;a href="https://www.crossref.org/blog/natures-metadata-for-web-pages">added&lt;/a> for &lt;em>Nature&lt;/em>. (Of course, machine readable metadata is best added using RDFa as &lt;a href="https://www.crossref.org/blog/machine-readable-are-we-there-yet/">noted&lt;/a> earlier, but does not preclude also adding in &lt;code>&amp;lt;meta&amp;gt;&lt;/code> tags which are also usable with HTML as well as XHTML.)&lt;/p>
&lt;p>(Btw, wouldn’t it be fun if Crossref had a random DOI facility? That would be real handy for testing as well as giving users a feel for what real-life DOIs look like and what lies at the other end of them.)&lt;/p></description></item><item><title>And the DOI is &amp;#8230;</title><link>https://www.crossref.org/blog/and-the-doi-is/</link><pubDate>Mon, 22 Dec 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/and-the-doi-is/</guid><description>&lt;p>Once structured metadata is added to a file then retrieving a given metadata element is usually a doddle. For example, for PDFs with embedded XMP one can use Phil Harvey’s excellent &lt;a href="https://exiftool.org/" target="_blank">Exiftool&lt;/a> utility.&lt;/p>
&lt;p>Exiftool is a Perl library and application which I’ve blogged about &lt;a href="https://www.crossref.org/blog/exiftool/">here&lt;/a> earlier which is available as a ‘&lt;code>.zip&lt;/code>‘ file for Windows (no Perl required) or ‘&lt;code>.dmg&lt;/code>‘ for MacOS. Note that Phil maintains this actively and has done so over the last five years. (And when I say actively I mean just that. I once made the mistake of printing out the change file.)&lt;/p>
&lt;p>If Perl’s not your thing, then there’s a Ruby wrapper gem (&lt;a href="https://exiftool.org/" target="_blank">MiniExiftool&lt;/a>) to access the Exiftool command in trouper OO fashion. Here’s an example Ruby one-liner to get the DOI from a PDF (broken here to meet column width restriction):&lt;/p>
&lt;blockquote>
&lt;p>&lt;code>% ruby -rubygems -e 'require &amp;quot;mini_exiftool&amp;quot;;&amp;lt;br /&amp;gt; &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;puts MiniExiftool.new(&amp;quot;test.pdf&amp;quot;)[&amp;quot;doi&amp;quot;]'&amp;lt;br /&amp;gt; 10.1038/nphoton.2008.200&lt;/code>&lt;/p>
&lt;/blockquote>
&lt;p>Of course, that could also have been run against an image, audio or video file with XMP packet.&lt;/p>
&lt;p>(Makes one wonder vaguely about the feasibility of having a Swiss Army knife type of utility that could read &lt;strong>&lt;em>any&lt;/em>&lt;/strong> file to get the DOI using the embedded XMP, RDFa, RDF, HTML headers, COiNS, etc. Possibly even as last resort fall back to scanning the raw text - if any.)&lt;/p></description></item><item><title>Xmas XMP</title><link>https://www.crossref.org/blog/xmas-xmp/</link><pubDate>Fri, 19 Dec 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmas-xmp/</guid><description>&lt;p>Well, as I &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2008/12/xmp_labelling_for_nature.html" target="_blank">blogged&lt;/a> on our web publishing blog &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/" target="_blank">Nascent&lt;/a> we just went live with XMP labelling on &lt;em>Nature&lt;/em> in yesterday’s double issue. We will be adding XMP to all new issues of &lt;em>Nature&lt;/em> as well as rolling out across all our other titles in the next few weeks and months.&lt;/p>
&lt;p>The screenshots below from Acrobat (&lt;em>File &amp;gt; Properties&lt;/em>, &lt;code>CMD-D&lt;/code> / &lt;code>CTL-D&lt;/code>) show what the user might see both with (bottom-left) and without (top-right) semantic markup.&lt;/p>
&lt;img alt="pdf_props.png" src="https://www.crossref.org/wp/blog/images/pdf_props.png" width="399" height="377" />
&lt;p>As to the actual contents of the metadata record, see &lt;a href="http://lists.w3.org/Archives/Public/semantic-web/2008Dec/0134.html" target="_blank">this sample&lt;/a> I posted to the semantic web list.&lt;/p></description></item><item><title>ORE/POWDER: Remarks on Ratings</title><link>https://www.crossref.org/blog/ore-powder-remarks-on-ratings/</link><pubDate>Sat, 06 Dec 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/ore-powder-remarks-on-ratings/</guid><description>&lt;p>I wanted to make some remarks about the “Ease of use” and “Learn curve” ratings which I gave in the ORE/POWDER &lt;a href="https://web.archive.org/web/20100901074832/http://nurture.nature.com/tony/blogs/crosstech/ore-pwdr.html" target="_blank">comparison table&lt;/a> that I &lt;a href="https://www.crossref.org/blog/describing-resource-sets-ore-vs-powder/">blogged&lt;/a> about here the other day. It may seem that I came out a little harsh on ORE and a little easy on POWDER. I just wanted to rationalize the justification for calling it that way. (By the way, the revised comparison table includes a qualification to those ratings.)&lt;/p>
&lt;p>My primary interest was from the perspective of a data provider rather than a data consumer. What does it take to get a resource description document (“resource map”, “description resource” or “sitemap”) ready for publication?&lt;/p>
&lt;p>(Continues)&lt;/p>
&lt;p>To look at POWDER first, it defines two sets of semantics: an “operational semantics” which is embodied in the simple XML that is intended as the primary publication vehicle, and a “formal semantics” embodied in the RDF/OWL document that would typically be generated by a POWDER processor.&lt;/p>
&lt;p>The operational semantics (XML) document requires minimal RDF understanding (and arguably none at all): it only requires that URI resources be organized into &lt;strong>&lt;iriset>&lt;/strong> groups by pattern matching, and that metadata be attached to those groups using &lt;strong>&lt;descriptorset>&lt;/strong> groups.&lt;/p>
&lt;p>URI patterns are specified using any of the following XML elements for inclusive patterns:&lt;/p>
&lt;pre tabindex="0">&lt;code>&amp;gt; **&amp;lt;includeschemes&amp;gt;**, **&amp;lt;includehosts&amp;gt;**, **&amp;lt;includeexactpaths&amp;gt;**, **&amp;lt;includepathcontains&amp;gt;**, **&amp;lt;includepathstartswith&amp;gt;**, **&amp;lt;includepathendswith&amp;gt;**, **&amp;lt;includeports&amp;gt;**, **&amp;lt;includequerycontains&amp;gt;**, **&amp;lt;includeiripattern&amp;gt;**, **&amp;lt;includeregex&amp;gt;**, **&amp;lt;includeresources&amp;gt;**
&lt;/code>&lt;/pre>&lt;p>and their exclusive counterparts&lt;/p>
&lt;p>&lt;code>**&amp;lt;excludeschemes&amp;gt;**, &amp;amp;#8230;&lt;/code>&lt;/p>
&lt;p>These are turned into corresponding regular expressions by a POWDER processor which then emits RDF/OWL classes using those expressions as property restrictions on set membership. &lt;strong>&lt;em>But a publisher is not required to understand this transformation nor the formal semantics generated from the simple XML document that was authored.&lt;/em>&lt;/strong>&lt;/p>
&lt;p>Now, as to metadata. Resource group descriptors are either free text (tags) or properties from a published namespace. For example, the property &lt;strong>name&lt;/strong> from a namespace &lt;strong>ex:&lt;/strong> would be added in one of two ways, depending on whether it were a simple literal string (“value”, say) or a resource URI:&lt;/p>
&lt;pre tabindex="0">&lt;code>http://example.org/value
&lt;/code>&lt;/pre>&lt;ul>
&lt;li>&lt;strong>&amp;lt;ex:name rdf:resource=”&lt;code>http://example.org/value&lt;/code>“/&amp;gt;&lt;/strong>&lt;/li>
&lt;/ul>
&lt;p>While technically this is RDF/XML it hardly qualifies, I think, as requiring any great knowledge of RDF, more a knowledge of XML namespaces alone would be sufficient.&lt;/p>
&lt;p>And that’s about it – all that is required for publication of a POWDER “description resource” document. (The guidelines for discovery mechanisms of a POWDER document might also need to be consulted.)&lt;/p>
&lt;p>So, on that basis I would judge POWDER to be at most “medium” on the “Learn curve”. However, as soon as the mapping to the formal semantics (POWDER-S) using RDF/OWL is considered, then that learn curve rating would automatically swing to “high”.&lt;/p>
&lt;p>Now, ORE on the other hand is a straightforward RDF application. What does make ORE a bit of a challenge are the following two aspects:&lt;/p>
&lt;pre>&lt;code> 1. concept of named aggregation
* abstract data model - no fixed bindings&amp;lt;/ol&amp;gt;
Well, the first aspect is what ORE is all about &amp;amp;ndash; its USP &amp;amp;ndash; and what it gives us beyond the simpler POWDER approach of merely describing resource bundles. Still, it’s a concept that needs to be grokked. All too easy to take it for granted.
It is the second aspect that may make ORE appear to be &amp;amp;#8220;difficult&amp;amp;#8221;. It does not prescribe a single binding or set of bindings but provides an abstract data model. That means that a prospective user must endeavour to understand something of the model before deploying.
But enough of that. Because who really reads instruction manuals anyway? So to deploy there are user guides available for one standalone document format (RDF/XML), and two carrier document formats (Atom, RDFa). That means right there that the publisher must either embrace RDF/XML or learn how to weave it into an existing document markup. (By the way, it should be remarked that there is an excellent [primer][3] available - as there is also for POWDER - and user guides for each of the formats.)
So that I think warrants the &amp;amp;#8220;high&amp;amp;#8221; rating for ORE on the learn curve, and the corresponding &amp;amp;#8220;low&amp;amp;#8221; ease of use. But that is not to say that the two initiatives are in any competition and that one should be favoured over the other. They serve different purposes. Any yet they may also have compatibilities as the previous [mapping of ORE in POWDER][4] attempts to show. I’ll leave that task for other commentators.
&lt;/code>&lt;/pre></description></item><item><title>Resource Maps Encoded in POWDER</title><link>https://www.crossref.org/blog/resource-maps-encoded-in-powder/</link><pubDate>Fri, 05 Dec 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/resource-maps-encoded-in-powder/</guid><description>&lt;p>Following right on from &lt;a href="https://www.crossref.org/blog/describing-resource-sets-ore-vs-powder/">yesterday’s post&lt;/a> on ORE and POWDER, I’ve attempted to map the worked examples in the &lt;a href="http://www.openarchives.org/ore/1.0/rdfxml" target="_blank">ORE User Guide for RDF/XML&lt;/a> (specifically &lt;a href="http://www.openarchives.org/ore/1.0/rdfxml#Examples" target="_blank">Sect. 3&lt;/a>) to POWDER to show that POWDER can be used to model ORE, see&lt;/p>
&lt;blockquote>
&lt;p>&lt;strong>&lt;a href="https://web.archive.org/web/20100901173559/http://nurture.nature.com/tony/demos/ore-ex/pwdr.htm" target="_blank">Resource Maps Encoded in POWDER&lt;/a>&lt;/strong>&lt;/p>
&lt;/blockquote>
&lt;p>(A full explanation for each example is given in the &lt;a href="http://www.openarchives.org/ore/1.0/rdfxml#Examples" target="_blank">RDF/XML Guide, Sect. 3&lt;/a> which should be consulted.)&lt;/p>
&lt;p>This could just all be sheer doolally or might possibly turn out to have a modicum of instructional value – I don’t know. (It would be real good to get some feedback here.) There are, however, a couple points to note in mapping ORE to POWDER:&lt;/p>
&lt;ol>
&lt;li>The POWDER form is actually more long-winded because it splits the RDF triples into subject and predicate/object divisions, with the first listed within an &lt;strong>iriset&lt;/strong> and the second within a &lt;strong>descriptorset&lt;/strong>. The net effect, however, may be somewhat cleaner since POWDER uses a simple XML format rather than RDF/XML.
&lt;ul>
&lt;li>POWDER only supports simple object types (literals or resources) so the blank nodes in the RDF/XML examples for &lt;strong>dcterms:creator&lt;/strong> cannot be mapped as such. I have chosen here to use either &lt;strong>foaf:name&lt;/strong> or &lt;strong>foaf:page&lt;/strong> value.
&lt;ul>
&lt;li>Likewise, and as far as I am aware, POWDER does not support datatyping but I could be wrong on this. I have thus dropped the datatypes on &lt;strong>dcterms:created&lt;/strong> and &lt;strong>dcterms:modified&lt;/strong>. &lt;/ol>
This is a fairly naïve mapping. POWDER’s real strength comes in defining groups of resources with its powerful pattern matching capabilities, whereas here I am using a named single resource in each &lt;strong>iriset&lt;/strong> through the &lt;strong>includeresource&lt;/strong> element. I think, though, this does show how the abstract ORE data model can be serialized in yet another format.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol></description></item><item><title>Describing Resource Sets: ORE vs POWDER</title><link>https://www.crossref.org/blog/describing-resource-sets-ore-vs-powder/</link><pubDate>Thu, 04 Dec 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/describing-resource-sets-ore-vs-powder/</guid><description>&lt;p>I’ve been reading up on &lt;a href="http://www.w3.org/2007/powder/" target="_blank">POWDER&lt;/a> recently (the W3C Protocol for Web Description Resources) which is currently in last call status (with comments due in tomorrow). This is an effort to describe groups of Web resources and as such has clear similarities to the Open Archives Initiative &lt;a href="http://www.openarchives.org/ore/" target="_blank">ORE&lt;/a> data model, which has been blogged about here before.&lt;/p>
&lt;p>In an attempt to better understand the similarities (and differences) between the two data models, I’ve put up the table which directly compares the two heavyweight contendors OAI-ORE and POWDER and also (unfairly) places them alongside the featherweight &lt;a href="http://www.sitemaps.org/protocol.php" target="_blank">Sitemaps Protocol&lt;/a> for reference.&lt;/p>
&lt;p>This is very much a draft document and I will aim to update the table based on my own further reading and on any feedback that I may get (contributions gratefully received). I’m all too aware that my understanding of the respective data models is painfully limited and I, for one, hope to profit through this exercise. There will be certainly errors which I will aim to fix as soon as I get wind of them. 🙂&lt;/p>
&lt;p>By the way, the ORE work especially is of interest to Crossref members and has obvious synergies with the multiple resolution potential that DOI has long promised but not quite delivered on.&lt;/p></description></item><item><title>CURIEs - A Cure for URIs</title><link>https://www.crossref.org/blog/curies-a-cure-for-uris/</link><pubDate>Wed, 03 Dec 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/curies-a-cure-for-uris/</guid><description>&lt;p>A quick straw poll of a few folks at &lt;a href="https://web.archive.org/web/20081218013716/http://www.online-information.co.uk/index.html" target="_blank">London Online&lt;/a> yesterday revealed that they had not heard of CURIE’s. And there was I thinking that most everybody must have heard of them by now. 🙂 So anyway here’s something brief by way of explanation.&lt;/p>
&lt;p>&lt;strong>&lt;em>&lt;a href="http://www.w3.org/MarkUp/2008/ED-curie-20081023/" target="_blank">CURIE&lt;/a>&lt;/em>&lt;/strong> stands for &lt;strong>&lt;em>Compact URI&lt;/em>&lt;/strong> and does the signal job or rendering long and difficult to read URI strings into something more manageable. (URIs do have the particular gift of being “human transcribable” but in practice their length and the actual characters used in the URI strings tend to muddy things for the reader.) So given that the Web is built upon a bedrock of URIs, anything that then makes URIs easier to handle is going to be an important contributor to our overall ease of interaction with the Web.&lt;/p>
&lt;p>(Continues)&lt;/p>
&lt;p>Ten years back (in February 1998) when XML was first introduced it presented a flat naming system for document markup. For purposes of modularity and markup reuse the XML Namespaces specification released the following year allowed for element and attribute names to be replaced by &lt;strong>expanded&lt;/strong> names in which the hitherto simple names would be replaced by name pairs consisting of a &lt;strong>namespace&lt;/strong> name and a &lt;strong>local&lt;/strong> name. The use of URIs for the namespace name thus opened the doors to assigning globally unique names for XML element/attribute names. As a practical point (both to keep the names short and to deal with URI characters), the notion of a qualified name (or QName) was introduced, whereby the local name would be qualified by a prefix which stood in for the namespace name.&lt;/p>
&lt;p>This was such a successful device that over time it was applied to URIs in general as a mechanism for abbreviation. Especially in RDF/XML schema elements were referenced by QName. And the practice has spilled over into non-XML syntaxes (e.g. the N3 and Turtle RDF grammars which use a “@prefix” directive). But there were problems since the device was grounded in XML the local names were constrained by allowable characters for XML elements and attributes (e.g. names cannot start with a digit character), as well as there being no specification for applying this same device to non-XML grammars.&lt;/p>
&lt;p>CURIE is an initiative to generalize this notion of qualified names for URIs beyond the immediate XML context for naming elements and attributes (which would also allow their use in attribute values), to a more general use in applications beyond XML. The development of CURIE is based upon work done in the definition of &lt;a href="http://www.w3.org/TR/xhtml2" target="_blank">XHTML2&lt;/a>, and upon work done by the &lt;a href="http://www.w3.org/2001/sw/BestPractices/HTML/" target="_blank">RDF-in-HTML Task Force&lt;/a>, a joint task force of the &lt;a href="http://www.w3.org/2001/sw/BestPractices/" target="_blank">Semantic Web Best Practices and Deployment Working Group&lt;/a> and &lt;a href="http://www.w3.org/MarkUp/" target="_blank">XHTML 2 Working Group&lt;/a>. The Editor’s draft &lt;a href="http://www.w3.org/MarkUp/2008/ED-curie-20081023/" target="_blank">CURIE Syntax 1.0&lt;/a> is currently a W3C Candidate Recommendation which is receiving comments through Jan 15, 2009, at which time it is intended to put it forward as a W3C Proposed Recommendation. Meantime, though, the new W3C Recommendation &lt;a href="http://www.w3.org/TR/rdfa-syntax/" target="_blank">RDFa Syntax in XHTML&lt;/a> (published Oct 14, 2008) has a normative section on CURIEs (see &lt;a href="http://www.w3.org/TR/rdfa-syntax/#s_curies" target="_blank">Sect. 7&lt;/a>).&lt;/p>
&lt;p>So, what do CURIEs look like? Taking a simple RDFa example for DOI we might have a fragment such as:&lt;/p>
&lt;pre>&amp;lt;div xmlns:doi="https://doi.org/" xmlns:dcterms="http://purl.org/dc/terms/"&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;lt;div &lt;b>about="doi:10.1038/nature07184"&lt;/b>&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;span &lt;b>property="dcterms:hasPart"&lt;/b> &lt;b>resource="[doi:10.1038/nature07184]"&lt;/b>/&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;lt;/div&amp;gt;
&amp;lt;/div&amp;gt;&lt;/pre>
&lt;p>This would be processed by an RDFa processor to yield the RDF triple (in N3/Turtle):&lt;/p>
&lt;pre>&amp;lt;doi:10.1038/nature07184&amp;gt; dcterms:hasPart &amp;lt;https://doi.org/10.1038/nature07184&amp;gt; .&lt;/pre>
&lt;p>This triple (or fact) says that the resource identified by DOI 10.1038/nature07184 has as a component part (cf. &lt;a href="http://dublincore.org/documents/dcmi-terms/" target="_blank">DCTERMS&lt;/a> vocabulary) the resource identified by &lt;a href="https://doi.org/10.1038/nature07184" target="_blank">https://doi.org/10.1038/nature07184&lt;/a>. (The abstract work identified by the DOI has as a component part the splash page identified by the proxy URL.)&lt;/p>
&lt;p>OK, so what’s going on? The “property” attribute takes a CURIE as value where the prefix “dcterms” is standing in for the XML namespace URI. The “about” and “resource” attributes both take a URI or CURIE as value, but because of any potential confusion a (so-called) “Safe CURIE” must be used which is a CURIE wrapped in brackets. The above example does not use brackets for the “about” attribute and therefore an RDFa processor would read this as being a full URI, i.e. &amp;amp;lt’doi:10.1038/nature07184&amp;gt;, whereas it does use brackets for the “resource” attribute and therefore this would be read as being a (Safe) CURIE, i.e. &lt;a href="https://doi.org/10.1038/nature07184" target="_blank">https://doi.org/10.1038/nature07184&lt;/a>.&lt;/p>
&lt;p>We can turn this around as follows:&lt;/p>
&lt;pre>&amp;lt;div xmlns:doi="https://doi.org/" xmlns:dcterms="http://purl.org/dc/terms/"&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;lt;div &lt;b>about="[doi:10.1038/nature07184]"&lt;/b>&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;span &lt;b>property="dcterms:isPartOf"&lt;/b> &lt;b>resource="doi:10.1038/nature07184"&lt;/b>/&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;lt;/div&amp;gt;
&amp;lt;/div&amp;gt;&lt;/pre>
&lt;p>This would be processed by an RDFa processor to yield the RDF triple (in N3/Turtle):&lt;/p>
&lt;pre>&amp;lt;https://doi.org/10.1038/nature07184&amp;gt; dcterms:isPartOf &amp;lt;doi:10.1038/nature07184&amp;gt; .&lt;/pre>
&lt;p>This triple (or fact) says that the resource identified by &lt;a href="https://doi.org/10.1038/nature07184" target="_blank">https://doi.org/10.1038/nature07184&lt;/a> is a component part (cf. &lt;a href="http://dublincore.org/documents/dcmi-terms/" target="_blank">DCTERMS&lt;/a> vocabulary) of the resource identified by DOI 10.1038/nature07184. (The splash page identified by the proxy URL is a component part of the abstract work identified by the DOI.)&lt;/p>
&lt;p>So what do CURIEs give us? Nothing more than a generic means to be able to make human-friendly statements such as&lt;/p>
&lt;pre>&amp;lt;doi:10.1038/nature07184&amp;gt; dcterms:hasPart doi:10.1038/nature07184 .&lt;/pre>
&lt;p>instead of having to spell it out in full triples form using long-winded URIs:&lt;/p>
&lt;pre>&amp;lt;doi:10.1038/nature07184&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;lt;http://http://purl.org/dc/terms/hasPart&amp;gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;lt;https://doi.org/10.1038/nature07184&amp;gt; .&lt;/pre></description></item><item><title>Ubiquity commands for Crossref services</title><link>https://www.crossref.org/blog/ubiquity-commands-for-crossref-services/</link><pubDate>Wed, 03 Dec 2008 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/ubiquity-commands-for-crossref-services/</guid><description>&lt;p>So the other day &lt;a href="http://baoilleach.blogspot.com/" target="_blank">Noel O’Boyle&lt;/a> made me feel guilty when he pinged me and asked about the possibility using one of the Crossref APIs for creating a &lt;a href="https://wiki.mozilla.org/Labs/Ubiquity" target="_blank">Ubiquity&lt;/a> extension. You see, I had played with the idea myself and had not gotten around to doing much about it. This seemed inexcusable- particularly given how easy it is to build such extensions using the API we developed for the &lt;a href="http://sourceforge.net/projects/crossref-cite/" target="_blank">WordPress and Moveable Type plugins&lt;/a> that we &lt;a href="https://www.crossref.org/blog/crossref-citation-plugin-for-wordpress/">announced&lt;/a> earlier in the year. So I dug up my half-finished code, cleaned it up a bit and have &lt;a href="https://www.crossref.org/labs/ubiquity-plugin/" target="_blank">posted the results.&lt;/a>&lt;/p>
&lt;p>Note that the back-end that supports the plugins has been moved to more stable machines and the index is now being automatically updated with journal and conference proceeding deposits (sorry, no books yet).&lt;/p>
&lt;p>Also note that we are hoping that others will look at the code for the WordPress, Moveable Type and Ubiquity plugins and create more such extensions. If you do, please let us know about them at &lt;a href="mailto:citation-plugin@crossref.org">citation-plugin@crossref.org&lt;/a>.&lt;/p></description></item><item><title>RSS Good Practice Guidelines</title><link>https://www.crossref.org/blog/rss-good-practice-guidelines/</link><pubDate>Mon, 24 Nov 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/rss-good-practice-guidelines/</guid><description>&lt;p>I just wanted to flag up here Lisa Rogers’ recent review article on RSS in FUMSI (the online magazine for information professionals published by Free Pint Ltd)&lt;/p>
&lt;p>&lt;a href="https://web.archive.org/web/20081102075322/http://web.fumsi.com/go/article/share/3356" target="_blank">RSS and Scholarly Journal Tables of Contents: the ticTOCs Project, and Good Practice Guidelines for Publishers&lt;/a>&lt;/p>
&lt;p>Especially of interest is the diagram in Fig. 2 which breaks out the metadata elements that might be encountered in a rich web feed. Worthwhile pointing out that this reflects current practice and that under the item elements one would soon hope to see publishers routinely adding in &lt;strong>prism:doi&lt;/strong> (with the bare DOI as value) and &lt;strong>prism:url&lt;/strong> (with DOI target URL as value) from the PRISM 2.0 vocabulary published earlier this year. Publishers should also be aware of the new PRISM Usage Rights vocabulary which is expected to be published some time in the new year.&lt;/p></description></item><item><title>Machine Readable: Are We There Yet?</title><link>https://www.crossref.org/blog/machine-readable-are-we-there-yet/</link><pubDate>Wed, 19 Nov 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/machine-readable-are-we-there-yet/</guid><description>&lt;p>The guidelines for Crossref publishers (“DOI Name Information and Guidelines” - [PDF, 210K][1]) has this to say in “&lt;em>Sect. 6.3 The response page&lt;/em>” regarding the response page for a DOI:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“A minimal response page must contain a full bibliographic citation displayed to the user. A response page without bibliographic information should never be presented to a user.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>which would seem to be all fine and dandy. But if that user is a machine (or an agent acting for a user) they’ll likely be out of luck as the metadata in the bibliographic citation is generally targeted at human users.&lt;/p>
&lt;p>So here’s a quick and dirty implementation of what a machine readable page could look like using RDFa. (The demo uses Jeni Tennison’s wonderful [rdfQuery][2] plugin which I [blogged][3] about earlier.)&lt;/p>
&lt;p>Clicking the DOI link below will bring up in a sub-window a bibliographic citation which might be found in a typical DOI repsonse page. If you now click the “Read Me” link you should see an alert message which presents the bibliographic metadata as a complete RDF document (in a simple N3 – or Notation3 – format). This document is assembled on the fly by rdfQuery using the RDFa markup embedded in the page.&lt;/p>
&lt;!-- broken links not in wayback machine
&lt;a href="http://nurture.nature.com/tony/demos/rdfa.html" onclick="w = open('http://nurture.nature.com/tony/demos/rdfa.html','myWin','width=600,height=400,top=150,left=150,scrollbars=1, resizable=1');w.focus();return false">&lt;b style="color:#006699">https://doi.org/10.1038/nature05634&lt;/b>&lt;/a> (Click for demo)
-->
&lt;p>See the “View Source” link to list the actual XHTML markup and the RDFa properties which have been added. And note also that some of the properties are partially “hidden” to the human reader, e.g. a publication date is given in year form only whereas the machine record has the date in full, and some of the properties are fully “hidden”: print and electronic ISSNs, issue number, ending page, etc.&lt;/p>
&lt;p>(Continues below.)&lt;/p>
&lt;p>So, what’s new about this? There are already various means of adding metadata to pages using e.g. metadata tags (see [here][4] for an earlier post on this), or COinS objects, or even RDF/XML in comment sections. All of these have their various utilities but are still just early attempts at automation. What makes this new and compelling is that RDFa allows publishers to embed machine readable metadata that can be read as a complete machine description in RDF using pretty much off-the-shelf tools and that this markup is embedded unobtrusively into the content in the proper &lt;strong>&lt;em>context&lt;/em>&lt;/strong>.&lt;/p>
&lt;p>Note that there are some similarities here between embedding an XMP packet (which includes metadata) into an arbitrary binary object, e.g. a PDF file, and embedding RDF into a section of a web page – or perhaps “&lt;em>draping&lt;/em>” the RDF over the document markup would be a better term – so that the metadata travels along with the actual content.&lt;/p>
&lt;p>By the way, the RDFa can be processed to yield valid RDF (as is shown in the demo) and which can also be seen by running the web page through the [RDFa Distiller][5]. (You just need to cut and paste the link of the demo page given above into the Distiller form box.) This will produce RDF in various serializations (N3, XML, Triples) from the RDFa.&lt;/p>
&lt;p>So, is there really any longer any reason &lt;em>not&lt;/em> to have machine readable metadata at the end of the DOI? Are we there yet?&lt;/p>
&lt;p>[1]: &lt;a href="https://doi.org/10.13003/5jchdy" target="_blank">Crossref DOI display guidelines&lt;/a>
[2]: &lt;a href="http://code.google.com/p/rdfquery/wiki/RdfPlugin" target="_blank">http://code.google.com/p/rdfquery/wiki/RdfPlugin&lt;/a>
[3]: /blog/rdfquery
[4]: /blog/natures-metadata-for-web-pages
[5]: &lt;a href="http://www.w3.org/2007/08/pyRdfa/" target="_blank">http://www.w3.org/2007/08/pyRdfa/&lt;/a>&lt;/p></description></item><item><title>rdfQuery</title><link>https://www.crossref.org/blog/rdfquery/</link><pubDate>Mon, 17 Nov 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/rdfquery/</guid><description>&lt;p>Whaddya know? I was just on the point of blogging about the real nice demo given by Jeni Tennison at last week’s &lt;a href="https://web.archive.org/web/20120101062734/http://swig.networkedplanet.com/november2008.html" target="_blank">SWIG UK meeting&lt;/a> at HP Labs in Bristol of &lt;a href="http://code.google.com/p/rdfquery/wiki/RdfPlugin" target="_blank">rdfQuery&lt;/a> (an RDF plugin for &lt;a href="http://jquery.com/" target="_blank">jQuery&lt;/a> - the zip file is &lt;a href="https://web.archive.org/web/*/http://rdfquery.googlecode.com/files/rdfQuery%20v0.2.zip" target="_blank">here&lt;/a>). And there today on her blog I see that she has a full &lt;a href="http://www.jenitennison.com/blog/node/94" target="_blank">writeup&lt;/a> on rdfQuery, so I’ll defer to the expert. :~)&lt;/p>
&lt;p>All I can really add to that is that rdfQuery is a pretty darn cool way to add and manipulate RDFa using jQuery. Does it get any better?&lt;/p>
&lt;p>And now that RDFa is a &lt;a href="http://www.w3.org/TR/rdfa-syntax/" target="_blank">W3C Rec&lt;/a> since last month (see &lt;a href="http://www.w3.org/TR/xhtml-rdfa-primer/" target="_blank">Primer&lt;/a> and &lt;a href="http://www.w3.org/TR/rdfa-syntax/" target="_blank">Syntax&lt;/a>) it will be interesting to see how Crossref members might begin to deploy it on their pages - especially on DOI landing pages.&lt;/p></description></item><item><title>PRISM 2.1</title><link>https://www.crossref.org/blog/prism-2.1/</link><pubDate>Fri, 24 Oct 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/prism-2.1/</guid><description>&lt;p>Yesterday a new &lt;a href="https://web.archive.org/web/20081019002715/http://www.prismstandard.org/" target="_blank">PRISM&lt;/a> spec (v2.1) was released for public comment. (Comment period lasts up to Dec. 3, ’08.)&lt;/p>
&lt;p>Changes are listed in pages 8 and 9 of the Introduction document. Some highlights:&lt;/p>
&lt;ul>
&lt;li>New &lt;em>PRISM Usage Rights&lt;/em> namespace
&lt;ul>
&lt;li>Accordingly usage of &lt;strong>prism:copyright&lt;/strong>, &lt;strong>prism:embargoDate&lt;/strong>, and &lt;strong>prism:expirationDate&lt;/strong> no longer recommended
&lt;ul>
&lt;li>New element &lt;strong>prism:isbn&lt;/strong> introduced for book serials&lt;/ul>
An updated &lt;a href="https://web.archive.org/web/20090310235930/http://nurture.nature.com/rss/modules/mod_prism_04.html" target="_blank">mod_prism&lt;/a> RSS 1.0 module is available which lists all versions of PRISM specs including the forthcoming v2.1 spec. I will see about getting this added now to a more permanent location. Current version of PRISM remains at v2.0. Versions 2.0 and 2.1 are especially of interest to users of Crossref because of their support for &lt;strong>prism:doi&lt;/strong> and &lt;strong>prism:url&lt;/strong> and users should consider upgrading their applications, e.g. RSS feeds.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>XMP Marches On</title><link>https://www.crossref.org/blog/xmp-marches-on/</link><pubDate>Mon, 20 Oct 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-marches-on/</guid><description>&lt;p>For those who may be interested in the progress of XMP, Adobe’s Gunar Penikis has just announced &lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup> two new releases of XMP SDKs: XMP Toolkit 4.4 (with support for new file formats), and FileInfo SDK (for customizing CS4 UIs). More importantly, though, may be the new edition of the XMP spec - see &lt;a href="http://www.adobe.com/devnet/xmp/" target="_blank">here&lt;/a>, which is bumped from a modest 112 page document to a 3-parter at 199 pages.&lt;/p>
&lt;p>Looks to be quite a thorough spec bar one telling particular: there is no version number and no date! The previous version was likewise unnumbered though at least dated as “September 2005”. Btw, I’m not sure of there is any archive of XMP specs being maintained by Adobe. At least, I’m not aware of any page with that information. Perhaps we can refer to our &lt;a href="https://www.crossref.org/blog/now-what-about-xmp/">earlier call&lt;/a> to have XMP turned over to a standards organisation to formalize a public spec.&lt;/p>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>Update Aug 2022: the announcement blog post mentioned above was previously at blogs.adobe.com/gunar/2008/10/new_xmp_sdks_released.html but is no longer live.&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>Yer Basic One-Liner</title><link>https://www.crossref.org/blog/yer-basic-one-liner/</link><pubDate>Tue, 14 Oct 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/yer-basic-one-liner/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/one-line-alert.jpg">&lt;img border="0" alt="one-line-alert-small0.jpg" src="https://www.crossref.org/wp/blog/images/one-line-alert-small0.jpg" width="130" height="150" style="float:right; margin-left=20px" />&lt;/a>&lt;/p>
&lt;p>Here&amp;rsquo;s your basic one-line handle client (all of it) for the browser:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="s2">&amp;#34;10.1038/nature05826&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="kd">function&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">data&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span> &lt;span class="nx">alert&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">helloWorld&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">data&lt;/span>&lt;span class="p">));&lt;/span> &lt;span class="p">});&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Can&amp;rsquo;t see how to make that much shorter (bar tossing spaces). But here&amp;rsquo;s one attempt (shorter though now it&amp;rsquo;s not strictly a one-liner):&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">u&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">u&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="s2">&amp;#34;10.1038/nature05826&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="kd">function&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">_&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span> &lt;span class="nx">alert&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">u&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">helloWorld&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">_&lt;/span>&lt;span class="p">));&lt;/span> &lt;span class="p">});&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Here I&amp;rsquo;ve used two utility convenience methods from the OpenHandle client library:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">handle&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="nx">callback&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="p">[&lt;/span>&lt;span class="nx">server&lt;/span>&lt;span class="p">])&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">helloWorld&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">JSON&lt;/span>&lt;span class="p">)&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>You will though need to include a couple of libraries: &lt;a href="http://archive.is/TF3tq" target="_blank">openhandle.js&lt;/a> and &lt;a href="http://jquery.com/" target="_blank">jquery.js&lt;/a>. (Note that the &lt;span style="font-family: arial, sans-serif; font-size: 8pt; color: #00f; text-decoration: none">getHandleData()&lt;/span> method supplied in the &lt;a href="http://archive.is/TF3tq" target="_blank">openhandle.js&lt;/a> library uses &lt;a href="http://jquery.com/" target="_blank">jQuery&lt;/a>. Feel free to overwrite that.) A complete working document can thus be implemented as:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-html" data-lang="html">&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">html&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">head&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">script&lt;/span> &lt;span class="na">type&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;text/javascript&amp;#34;&lt;/span> &lt;span class="na">src&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;http://jqueryjs.googlecode.com/files/jquery-1.2.6.js&amp;#34;&lt;/span>&lt;span class="p">&amp;gt;&amp;lt;/&lt;/span>&lt;span class="nt">script&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">script&lt;/span> &lt;span class="na">type&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;text/javascript&amp;#34;&lt;/span> &lt;span class="na">src&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;http://openhandle.googlecode.com/files/openhandle-0.2.3.js&amp;#34;&lt;/span>&lt;span class="p">&amp;gt;&amp;lt;/&lt;/span>&lt;span class="nt">script&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">script&lt;/span> &lt;span class="na">type&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;text/javascript&amp;#34;&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">jQuery&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">ready&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="kd">function&lt;/span>&lt;span class="p">()&lt;/span> &lt;span class="p">{&lt;/span> &lt;span class="cm">/* action when body content is loaded */&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">u&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">();&lt;/span> &lt;span class="nx">u&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="s2">&amp;#34;10.1038/nature05826&amp;#34;&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="kd">function&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">_&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span> &lt;span class="nx">alert&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">u&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">helloWorld&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">_&lt;/span>&lt;span class="p">));&lt;/span> &lt;span class="p">});&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">});&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;/&lt;/span>&lt;span class="nt">script&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;/&lt;/span>&lt;span class="nt">head&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">body&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">Boo!
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;/&lt;/span>&lt;span class="nt">body&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;/&lt;/span>&lt;span class="nt">html&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Let me know if this doesn&amp;rsquo;t work for you. I&amp;rsquo;ve tried to test this and seems to function OK but sure as the sun rises I ain&amp;rsquo;t no &lt;a href="http://www.slideshare.net/jeresig/secrets-of-javascript-libraries?src=embed" target="_blank">JS ninja&lt;/a>.&lt;/p>
&lt;p>Of course, we normally want to do more than just dump the values. So, given that it&amp;rsquo;s pretty straightforward to grab and to manipulate a handle&amp;rsquo;s data values over the Web, how can we put this into practice?&lt;/p>
&lt;p>Let&amp;rsquo;s consider a couple of Crossref use cases.&lt;/p>
&lt;p>&lt;em>(Disclaimer: These examples are not intended as being in any way a replacement for the existing Crossref services but merely show how those services could be implemented on the client side. These illustrations will be useful for new bespoke services accessing other data elements that may be registered with the DOI.)&lt;/em>&lt;/p>
&lt;p>&lt;strong>Single Resolution&lt;/strong>&lt;/p>
&lt;p>Here is how one could implement the regular URL redirect service from the client:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">handle&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="s2">&amp;#34;10.1038/nature05826&amp;#34;&lt;/span>&lt;span class="p">;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">callback&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="kd">function&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">json&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">hv&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Handle&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">json&lt;/span>&lt;span class="p">).&lt;/span>&lt;span class="nx">getValuesByType&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="err">‘&lt;/span>&lt;span class="nx">URL&lt;/span>&lt;span class="err">&amp;#39;&lt;/span>&lt;span class="p">)[&lt;/span>&lt;span class="mi">0&lt;/span>&lt;span class="p">];&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">url&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">HandleValue&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">hv&lt;/span>&lt;span class="p">).&lt;/span>&lt;span class="nx">getData&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="c1">// alert(&amp;#34;Redirecting to &amp;#34; + url);
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="c1">&lt;/span> &lt;span class="nb">window&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">location&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">url&lt;/span>&lt;span class="p">;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">};&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">handle&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="nx">callback&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>The &lt;code>getValuesByType(‘URL')[0]&lt;/code> call returns the first handle value of type &amp;lsquo;URL&amp;rsquo;. The next line just parses this value as a handle value object and gets the data field, i.e. the URL itself.&lt;/p>
&lt;p>Note here that this client can show the URL that the user will be redirected to. With normal DOI resolution the resolution takes place on the proxy server (&lt;a href="http://dx.doi.org/" target="_blank">dx.doi.org&lt;/a>) and the URL is not available to the user - until they are so redirected. In fact, the user may never get to see the URL stored in the handle value if this is the head of a redirect chain.&lt;/p>
&lt;p>To recap, Crossref DOIs are not resolved by the user to URLs - rather, they invoke a service on the server which returns a content page.&lt;/p>
&lt;p>&lt;strong>Multiple Resolution&lt;/strong>&lt;/p>
&lt;p>Let&amp;rsquo;s now take a look at a case of Crossref multiple resolution. This code uses the &lt;code>getValues()&lt;/code> method to return all values:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">handle&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="s2">&amp;#34;10.1130/B25510.1&amp;#34;&lt;/span>&lt;span class="p">;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">callback&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="kd">function&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">json&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">s&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="s2">&amp;#34;&amp;#34;&lt;/span>&lt;span class="p">;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">hv&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Handle&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">json&lt;/span>&lt;span class="p">)).&lt;/span>&lt;span class="nx">getValues&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="k">for&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="kd">var&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="mi">0&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">&amp;lt;&lt;/span> &lt;span class="nx">hv&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">length&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span>&lt;span class="o">++&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">v&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">HandleValue&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">hv&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">i&lt;/span>&lt;span class="p">]);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">s&lt;/span> &lt;span class="o">+=&lt;/span> &lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getType&lt;/span>&lt;span class="p">()&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="s2">&amp;#34;: &amp;#34;&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getData&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">alert&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">s&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">};&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">handle&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="nx">callback&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>which yields&lt;/p>
&lt;pre tabindex="0">&lt;code>700050: 200508231619480000
HS_ADMIN: [object Object]
URL.0: http://www.gsajournals.org/gsaonline/?request=get-abstract&amp;amp;doi=10%2E1130%2FB25510%2E1
URL.1: http://bulletin.geoscienceworld.org/cgi/doi/10.1130/B25510.1
CR-LR: &amp;lt;MR&amp;gt;&amp;lt;LI label=&amp;#34;GeoScienceWorld&amp;#34; resource=&amp;#34;URL.1&amp;#34; /&amp;gt;&amp;lt;LI label=&amp;#34;Geological Society of America&amp;#34; resource=&amp;#34;URL.0&amp;#34; /&amp;gt;
&lt;/code>&lt;/pre>&lt;p>Oops! Too much information. This includes types such as &amp;lsquo;700050&amp;rsquo; and &amp;lsquo;HS_ADMIN&amp;rsquo; which are used by the Crossref application, and not intended for the end user. Maybe we should just limit it to the URL types with &lt;code>getValuesByType('URL')&lt;/code>:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="nx">getValuesByType&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="err">‘&lt;/span>&lt;span class="nx">URL&lt;/span>&lt;span class="s1">&amp;#39;):
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s1">
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s1">var handle = &amp;#34;10.1130/B25510.1&amp;#34;;
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s1">var callback = function(json) {
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s1"> var s = &amp;#34;&amp;#34;;
&lt;/span>&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="s1"> var hv = (new OpenHandle.Handle(json)).getValuesByType(‘URL&amp;#39;&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="k">for&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="kd">var&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="mi">0&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">&amp;lt;&lt;/span> &lt;span class="nx">hv&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">length&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span>&lt;span class="o">++&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">v&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">HandleValue&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">hv&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">i&lt;/span>&lt;span class="p">]);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">s&lt;/span> &lt;span class="o">+=&lt;/span> &lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getType&lt;/span>&lt;span class="p">()&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="s2">&amp;#34;: &amp;#34;&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getData&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">alert&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">s&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">};&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Util&lt;/span>&lt;span class="p">().&lt;/span>&lt;span class="nx">getHandleData&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">handle&lt;/span>&lt;span class="p">,&lt;/span> &lt;span class="nx">callback&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>which yields&lt;/p>
&lt;pre tabindex="0">&lt;code>URL.0: http://www.gsajournals.org/gsaonline/?request=get-abstract&amp;amp;doi=10%2E1130%2FB25510%2E1
URL.1: http://bulletin.geoscienceworld.org/cgi/doi/10.1130/B25510.1
&lt;/code>&lt;/pre>&lt;p>_(By the way, the previous example shows the unregulated state of handle types. We have everything but the kitchen sink in this one example:&lt;/p>
&lt;ul>
&lt;li>simple types, both well-known (&amp;lsquo;URL&amp;rsquo;) and opaque (&amp;lsquo;700050&amp;rsquo;)
&lt;ul>
&lt;li>
&lt;p>compound, or namespaced, types with various hierarchy delimiters: dot (&amp;lsquo;URL.0&amp;rsquo;, &amp;lsquo;URL.1&amp;rsquo;), underscore (&amp;lsquo;HS_ADMIN&amp;rsquo;), and hyphen (&amp;lsquo;CR-LR&amp;rsquo;)&lt;/ul>
Well, they&amp;rsquo;re all in there now so we gotta deal with that, but generally one would probably have preferred well-known types and where namespaces are used the usual dot notation as this is a) familiar to programmers, and b) supported by the handle client library code. The underscore is used in the handle RFCs for system types so that can be viewed as a sort of inline namespacing. Seems to be no obvious excuse for hyphens though.)&lt;/i>&lt;/p>
&lt;p>Back to the example we can see that the first URL goes to a Crossref service which we can dispense with since this example is to be run client side. That leaves us with the two actual URL targets. But how to differentiate those for a user choice? That&amp;rsquo;s where that other type &amp;lsquo;CR-LR&amp;rsquo; comes in which provides an XML fragment that relates label to type. There are obviously many ways to support resource labelling - this is just the method used by Crossref.&lt;/p>
&lt;p>Let&amp;rsquo;s parse out the XML fragment for labels and resources and save those in an object keyed on resource:&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">labels&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="p">{};&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">hv_&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Handle&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">json&lt;/span>&lt;span class="p">)).&lt;/span>&lt;span class="nx">getValuesByType&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="err">‘&lt;/span>&lt;span class="nx">CR&lt;/span>&lt;span class="o">-&lt;/span>&lt;span class="nx">LR&lt;/span>&lt;span class="err">&amp;#39;&lt;/span>&lt;span class="p">)[&lt;/span>&lt;span class="mi">0&lt;/span>&lt;span class="p">];&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">v&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">HandleValue&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">hv_&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">xml&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getData&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">li&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">xml&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">match&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="sr">/&amp;lt;li [^\&amp;gt;]* \/&amp;gt;/ig&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="k">for&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="kd">var&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="mi">0&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">&amp;lt;&lt;/span> &lt;span class="nx">li&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">length&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span>&lt;span class="o">++&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">a&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">li&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">i&lt;/span>&lt;span class="p">].&lt;/span>&lt;span class="nx">match&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="err">/label=\&amp;#34;([^\&amp;#34;]+)\&amp;#34;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">resource&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="err">\&lt;/span>&lt;span class="s2">&amp;#34;([^\&amp;#34;]+)\&amp;#34;&lt;/span>&lt;span class="o">/&lt;/span>&lt;span class="nx">i&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">labels&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">a&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="mi">2&lt;/span>&lt;span class="p">]]&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">a&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="mi">1&lt;/span>&lt;span class="p">];&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>Now we&amp;rsquo;ll also need to build a similar object for the URLs:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">urls&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="p">{};&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">hv&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">Handle&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">json&lt;/span>&lt;span class="p">)).&lt;/span>&lt;span class="nx">getValuesByType&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="err">‘&lt;/span>&lt;span class="nx">URL&lt;/span>&lt;span class="err">&amp;#39;&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="k">for&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="kd">var&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="mi">0&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span> &lt;span class="o">&amp;lt;&lt;/span> &lt;span class="nx">hv&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">length&lt;/span>&lt;span class="p">;&lt;/span> &lt;span class="nx">i&lt;/span>&lt;span class="o">++&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="kd">var&lt;/span> &lt;span class="nx">v&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="k">new&lt;/span> &lt;span class="nx">OpenHandle&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">HandleValue&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">hv&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">i&lt;/span>&lt;span class="p">]);&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">urls&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getType&lt;/span>&lt;span class="p">()]&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="nx">v&lt;/span>&lt;span class="p">.&lt;/span>&lt;span class="nx">getData&lt;/span>&lt;span class="p">();&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>And now with both these objects we can build a set of labelled links as:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-javascript" data-lang="javascript">&lt;span class="line">&lt;span class="cl">&lt;span class="kd">var&lt;/span> &lt;span class="nx">s&lt;/span> &lt;span class="o">=&lt;/span> &lt;span class="s2">&amp;#34;&amp;#34;&lt;/span>&lt;span class="p">;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="k">for&lt;/span> &lt;span class="p">(&lt;/span>&lt;span class="nx">item&lt;/span> &lt;span class="k">in&lt;/span> &lt;span class="nx">labels&lt;/span>&lt;span class="p">)&lt;/span> &lt;span class="p">{&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl"> &lt;span class="nx">s&lt;/span> &lt;span class="o">+=&lt;/span> &lt;span class="s2">&amp;#34;&amp;lt;a href=\&amp;#34;&amp;#34;&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="nx">urls&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">item&lt;/span>&lt;span class="p">]&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="s2">&amp;#34;\&amp;#34;&amp;gt;&amp;#34;&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="nx">labels&lt;/span>&lt;span class="p">[&lt;/span>&lt;span class="nx">item&lt;/span>&lt;span class="p">]&lt;/span> &lt;span class="o">+&lt;/span> &lt;span class="s2">&amp;#34;&amp;lt;/a&amp;gt;&amp;#34;&lt;/span>&lt;span class="p">;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">}&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="nx">alert&lt;/span>&lt;span class="p">(&lt;/span>&lt;span class="nx">s&lt;/span>&lt;span class="p">);&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>to yield&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-html" data-lang="html">&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">a&lt;/span> &lt;span class="na">href&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;http://bulletin.geoscienceworld.org/cgi/doi/10.1130/B25510.1&amp;#34;&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>GeoScienceWorld&lt;span class="p">&amp;lt;/&lt;/span>&lt;span class="nt">a&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&lt;span class="p">&amp;lt;&lt;/span>&lt;span class="nt">a&lt;/span> &lt;span class="na">href&lt;/span>&lt;span class="o">=&lt;/span>&lt;span class="s">&amp;#34;http://www.gsajournals.org/gsaonline/?request=get-abstract&amp;amp;doi=10%2E1130%2FB25510%2E1&amp;#34;&lt;/span>&lt;span class="na">gt&lt;/span>&lt;span class="err">;&lt;/span>&lt;span class="na">Geological&lt;/span> &lt;span class="na">Society&lt;/span> &lt;span class="na">of&lt;/span> &lt;span class="na">America&lt;/span>&lt;span class="err">&amp;lt;/&lt;/span>&lt;span class="na">a&lt;/span>&lt;span class="p">&amp;gt;&lt;/span>
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>How to build a page with those labelled links is now a simple exercise. (The actual Crossref service for &lt;a href="http://dx.doi.org/10.1130/B25510.1" target="_blank">doi:10.1130/B25510.1&lt;/a> returns a page with labelled links, logos, and metadata pulled from the Crossref database.)&lt;/p>
&lt;p>&lt;strong>Next Steps&lt;/strong>&lt;/p>
&lt;p>The aim of this work has been to show that getting access to handle data values and manipulating those values in the browser can be fairly straightforward. How additional values get to be added to DOIs (or other handles) and what those values refer to is another matter, but services to access such values do not need to be centralized. User-generated services are also a possibility.&lt;/p></description></item><item><title>OpenHandle JavaScript API</title><link>https://www.crossref.org/blog/openhandle-javascript-api/</link><pubDate>Wed, 08 Oct 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/openhandle-javascript-api/</guid><description>&lt;p>&lt;a href="http://nurture.nature.com/openhandle/docs/openhandle-api.pdf" target="_blank">&lt;img border="0" alt="openhandle-api-small.png" src="https://www.crossref.org/wp/blog/images/openhandle-api-small.png" width="340" height="257" />&lt;/a> (Click figure for &lt;a href="http://nurture.nature.com/openhandle/docs/openhandle-api.pdf" target="_blank">PDF&lt;/a>.)&lt;/p>
&lt;p>I just posted updated versions of the &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> JavaScript Client Library (&lt;a href="https://web.archive.org/web/20160405130901/http://openhandle.googlecode.com/files/openhandle-0.2.2.js" target="_blank">v0.2.2&lt;/a>) and Utilities (&lt;a href="https://web.archive.org/web/20160323010335/http://openhandle.googlecode.com/files/openhandle-utils-0.2.2.js" target="_blank">v0.2.2&lt;/a>) to the project site. Mainly this post is just by way of saying that there’s now a “cheat sheet” for the API (see figure above, click for &lt;a href="http://nurture.nature.com/openhandle/docs/openhandle-api.pdf" target="_blank">PDF&lt;/a>) which will give some idea of scope. The JavaScript API attempts to reflect the Java Client Library API for Handle data structures, and has in excess of 100 methods. A &lt;a href="http://nurture.nature.com/openhandle/lib/CHANGELOG.txt" target="_blank">change log&lt;/a> is available.&lt;/p>
&lt;p>The new API supports:&lt;/p>
&lt;ul>
&lt;li>Single namespace
&lt;ul>
&lt;li>Introspection
&lt;ul>
&lt;li>Unit testing, see &lt;a href="http://nurture.nature.com/openhandle/unit/test-openhandle.html" target="_blank">here&lt;/a>&lt;/ul>
Why is this of interest to Crossref? Well, if DOIs are ever to begin take advantage of their innate &lt;a href="https://www.crossref.org/blog/multiple-resolution/">Multiple Resolution&lt;/a> capabiities there needs to be nimbler means of accessing the data items stored with a DOI. A JavaScript API would allow the data to be manipulated in the browser down by the user and so enable bespoke services. That, at least, is the idea.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Handle Clients #1, #2, #3</title><link>https://www.crossref.org/blog/handle-clients-1-2-3/</link><pubDate>Wed, 01 Oct 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/handle-clients-1-2-3/</guid><description>&lt;img border=0 usemap="#GraffleExport" alt="clients-123.png" src="https://www.crossref.org/wp/blog/images/clients-123-0.png/clients-123.png" width="340" height="294" />
&lt;p>Three alternate clients for viewing a Handle (or DOI): #1 (sky - text), #2 (black - tuples), #3 (white - cards) - the image above is clickable. When Handle clients become JavaScript-able, one really can have it one’s own way. (The JavaScript library is here, the demo service interface here - the code for setting up a new service interface can be got from the &lt;a href="https://github.com/tonyhammond/openhandle" target="_blank">OpenHandle project&lt;/a>.)&lt;/p>
&lt;p>&lt;em>Noted: As of February 2023, most of the links in this blog are not longer available.&lt;/em>&lt;/p></description></item><item><title>The Last Mile</title><link>https://www.crossref.org/blog/the-last-mile/</link><pubDate>Wed, 01 Oct 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/the-last-mile/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/last-mile.png">&lt;img alt="last-mile.png" border="0" src="https://www.crossref.org/wp/blog/images/last-mile.png" width="357" height="252" />&lt;/a>&lt;/p>
&lt;p>The figure above (click to enlarge) is probably self-explanatory but a few words may be in order.&lt;/p>
&lt;p>With no end-to-end delivery of data from the Handle System to the user’s application (browser or reader), getting data out of the Handle System has traditionally meant using the Web (ie. HTTP) as a courier - in effect, this is the “&lt;a href="http://en.wikipedia.org/wiki/Last_mile" target="_blank">last mile&lt;/a>” for Handle data. Typically an upstream (Handle) client provides services to the user. The most well known of these services is the URL redirect service which underpins the Crossref reference linking service. Another hosted service is the web form which displays data stored in the Handle records in a simple HTML table for user browsing. See panel a) in the figure above.&lt;/p>
&lt;p>By contrast, the OpenHandle proposal aims to move data in the Handle record in structured form (JSON or XML) over the Web for downstream processing - either in the user’s browser or on the desktop. See panel b). Advantages are that the Handle data and data structures are moved closer to the user and the services provided are capable of being better targeted and made more relevant. Data mobility as a whole is much improved. The data are accessible using standard Web description and scripting languages. One might almost say (to paraphrase the well-known Java slogan “&lt;a href="http://en.wikipedia.org/wiki/Write_once,_run_anywhere" target="_blank">write once, run anywhere&lt;/a>“) that this is a case of “read once, write anywhere”.&lt;/p></description></item><item><title>Look Ma, No Plugins!</title><link>https://www.crossref.org/blog/look-ma-no-plugins/</link><pubDate>Mon, 22 Sep 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/look-ma-no-plugins/</guid><description>&lt;p>&lt;tt>var f = function (OpenHandleJson) {&lt;br />   var h = new OpenHandle(OpenHandleJson);&lt;br />   var hv = h.getHandleValues();&lt;br />   for (var i = 0; i &amp;lt; hv.length; i++) {     var v = new HandleValue(hv[i]);     if (v.hasType(&amp;lsquo;URL&amp;rsquo;)) {       print(v.getData());     }     else if (v.hasType(&amp;lsquo;HS_ADMIN&amp;rsquo;)) {       var a = new AdminRecord(v.getData());       print(a.getAdminPermissionString())     }   } }&lt;/tt>&lt;/p>
&lt;table border="0" width="100%">
&lt;tr>
&lt;td align="right">
&lt;i>"And that, gentlemen, is how we do that." - Apollo 13&lt;/i>
&lt;/td>
&lt;/tr>
&lt;/table>
&lt;p>Following on from my earlier &lt;a href="https://www.crossref.org/blog/client-handle-demo/">Client Handle Demo&lt;/a> post, this entry is just to mention the availability of a port of (part of) the Handle client library (in Java) to JavaScript: openhandle-0.1.1.js which is being maintained on the &lt;a href="https://web.archive.org/web/20160504131854if_/https://code.google.com/archive/p/openhandle/" target="_blank">OpenHandle&lt;/a> site. The JavaScript module contains methods for three classes: &lt;tt>OpenHandle&lt;/tt>, &lt;tt>HandleValue&lt;/tt> and &lt;tt>AdminRecord&lt;/tt>.&lt;/p>
&lt;p>What does all that mean? It means that Handles and their constituent values and value fields can be accessed directly within **&lt;em>any Web browser&lt;/b>&lt;/em> (using an OpenHandle service) which allows a &lt;strong>&lt;em>dynamic Handle client&lt;/em>&lt;/strong> to be generated and presented within a &lt;strong>&lt;em>user context&lt;/em>&lt;/strong>. No plugins required. The port mirrors the class methods in the standard Java client library for Handle.&lt;/p>
&lt;p>As a demo of this JavaScript module in action, see this &lt;a href="http://nurture.nature.com/openhandle/inspect.html" target="_blank">Inspector&lt;/a> app for a card index view of Handle (and by implication DOI) records.&lt;/p>&lt;/p></description></item><item><title>Crossref is hiring an R&amp;D software engineer</title><link>https://www.crossref.org/blog/crossref-is-hiring-an-rd-software-engineer/</link><pubDate>Thu, 18 Sep 2008 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/crossref-is-hiring-an-rd-software-engineer/</guid><description>&lt;p>Crossref is hiring an R&amp;amp;D software engineer to work in our Oxford office. This is a fantastic opportunity to work on wide range of projects that promise to revolutionize scholarly publishing.&lt;/p></description></item><item><title>Multiple Resolution</title><link>https://www.crossref.org/blog/multiple-resolution/</link><pubDate>Fri, 22 Aug 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/multiple-resolution/</guid><description>&lt;p>I’ve been meaning for some time to write something about DOI and so-called “Multiple Resolution”, which to be honest is the only technology feature of any real interest as concerns DOI. (DOI as a business and social compact for guaranteeing name persistence of Web resources has been an extraordinarily successful venture in the academic publishing world with more than 32m items registered and maintained over eight years of operation but that may not have required any specialized technology. More a consensus to adopt a single location service in the &lt;a href="http://dx.doi.org/" target="_blank">DOI proxy&lt;/a>.)&lt;/p>
&lt;p>Multiple resolution, though. Now, that’s something else. Seems like it should be able to offer a lot of general funkiness and yet it has not been much used up to now. And I have to wonder why.&lt;/p>
&lt;p>(Continues below.)&lt;/p>
&lt;p>I guess we should start out with some definitions: the DOI Handbook, the (draft) ISO standard, and Crossref:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://web.archive.org/web/20090301014307/http://www.doi.org/hb.html" target="_blank">DOI Handbook&lt;/a> - From &lt;a href="https://web.archive.org/web/20080704170549/http://www.doi.org/handbook_2000/resolution.html" target="_blank">Sect. 3.3 Multiple resolution&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Multiple resolution allows one entity to be resolved to multiple other entities; it can be used to embody e.g a parent-children relationship, or any other relationship. … A DOI name can be resolved to an arbitrary number of different points on the Internet: multiple URLs, other DOI names, and other data types.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://www.iso.org/standard/81599.html" target="_blank">ISO CD 26324&lt;/a> - I’ve blogged &lt;a href="https://doi.org/10.64000/sr0bx-a3x54" target="_blank">here&lt;/a> before about the ISO standardization of DOI which is now &lt;a href="https://www.iso.org/standard/81599.html" target="_blank">available&lt;/a> as a Committee Draft. Multiple resolution is specifically mentioned in Sects. 3.2 and 6.2 and discussed in Sect. 6.1. From Sect. 3 “Terms and definitions” we have this definition:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Multiple resolution is the simultaneous return as output of several pieces of current information related to the object, in defined data structures.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>And then Section 6 “Resolution of DOI name” goes on to say this:&lt;/p>
&lt;blockquote>
&lt;p>_“DOI resolution records may include one or more URLs, where the object may be located, and other&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>information provided about the entity to which a DOI name has been assigned, optionally including but not restricted to: names, identifiers, descriptions, types, classifications, locations, times, measurements, and relationships to other entities.”_&lt;/p>
&lt;/blockquote>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://www.crossref.org/" target="_blank">Crossref&lt;/a> - In the help page &lt;a href="https://www.crossref.org/education/content-registration/creating-and-managing-dois/multiple-resolution/">Multiple Resolution Intro&lt;/a> there is this:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“As of May 2008 the Crossref main system will support assigning more than one URL to a single DOI, a concept known as multiple resolution (MR). “&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>The intro goes on to talk about the two pilot forms of multiple resolution service that have been trialled: a) interim page, and b) menu pop-up. The pop-up service is no longer supported. Only the interim page is currently offered as a production service. The help page &lt;a href="https://www.crossref.org/education/content-registration/creating-and-managing-dois/multiple-resolution/">Interim Page multiple resolution overview&lt;/a> leads off thus:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Crossref’s MR service provides an interim page solution which presents a list of link choices to the end user. Each choice represents a location at which the item may be obtained and are commonly services that are co-hosting the content under agreement with the content’s Copyright holder.”&lt;/em>&lt;/ul>&lt;/p>
&lt;/blockquote>
&lt;p>So, there it is. That’s DOI multiple resolution. The real important thing to note is that the official DOI position (IDF, ISO) is invitingly open while both the Crossref implementation and the description of multiple resolution itself is unduly restrictive. Multiple resolution as described by Crossref is essentially the deposition of additional URLs (pointing to copies of the same resource) for alternate routing (for geographical reasons, co-hosting arrangements, etc.) with a service presentation of alternate locations for user selection.&lt;/p>
&lt;p>Multiple resolution proper (as per the DOI Handbook and ISO draft) is the deposition of arbitrary data values and return of same with &lt;em>no particular services implied&lt;/em>. Use cases for multiple resolution include the addition of URLs for referencing different (but related) network objects, e.g. a metadata record, or other resources such as supplementary information, datasets, etc. Deposit of arbitrary data types is not yet catered for by Crossref. There could, I would suggest, at least be some rudimentary provision for depositing vanilla type/value pairs (subject to policy constraints). (There is currently some work under way in defining handle data types but this need not be any showstopper to depositing new data types as any type management system will likely need to evolve over time.)&lt;/p>
&lt;p>An obvious use case for multiple resolution (to me, anyway) would be the registration of a second URL which would point not onto a copy of the resource but onto a public metadata record. (I have earlier posted here about &lt;a href="https://www.crossref.org/blog/exposing-public-data">architectures&lt;/a> and &lt;a href="https://www.crossref.org/blog/exposing-public-data-options/">options&lt;/a> for exposing public data.)&lt;/p>
&lt;p>With more than one data value in a resolution record, the process of resolving such a record is potentially complicated. As the ISO CD says in Sect. 62f:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Resolution requests should be capable of returning all associated values of current information, individual values, or all values of one data type.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>The DOI Handbook itself recognizes the problems that multiple resolution may present.&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“If the DOI name can point to many different possible “resolutions”, how is the choice made between different options? At its simplest, the user may be provided with a list from which to make a manual choice. However, this is not a scalable solution for an increasingly complex and automated environment. The DOI name will increasingly depend on automation of “service requests”, through which users (and, more importantly, users’ application software) can be passed seamlessly from a DOI name to the specific service that they require.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Indeed, services like &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> will make it much easier to programmatically access data stored in the handle record associated with a DOI name. (I have blogged previously about the OpenHandle &lt;a href="https://www.crossref.org/blog/openhandle-google-code-project/">project&lt;/a> and its &lt;a href="https://www.crossref.org/blog/openhandle-languages-support/">languages support&lt;/a>.) Note that presentation of data values to a human user may be a non-issue for mediated services.&lt;/p>
&lt;p>And talking of computer languages it may be amusing to ruminate briefly on their own built-in support for multiple return values. Perhaps unsurprisingly, of the dominant languages Java has no such support as this &lt;a href="http://www.osnews.com/story/20076/Multiple_Return_Values_in_Java" target="_blank">recent post&lt;/a> addresses:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Today was one of those days when I wished Java would support multiple return values… but Java allows you to return only one value either an object or a primitive type.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>By contrast, languages such as Common Lisp do have support for multiple return values. See &lt;a href="http://lambda-the-ultimate.org/node/2833" target="_blank">this post&lt;/a> for some gory details and insights. Interesting also to reflect that as in the world of computing languages where there is a decided tilt towards a mainstream family of languages based on (or derived from) C, there may be dominant protocols at large on the Internet but no single “winner takes all”.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>mod_prism (Updated)</title><link>https://www.crossref.org/blog/mod_prism-updated/</link><pubDate>Thu, 21 Aug 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/mod_prism-updated/</guid><description>&lt;p>I’ve just put up for comment a revised mod_prism (0.3) of the existing mod_prism RSS 1.0 module. This is now updated to the current PRISM version (v2.0) which was released in February ’08 and reissued with Errata in July ’08. The current mod_prism draft is registered &lt;a href="http://purl.org/rss/1.0/modules/" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>The new draft charts all (five) versions of the PRISM specification (v1.0-v2.0) and maps PRISM terms to RSS 1.0 elements. Though not required as such for use of terms within an RSS 1.0 feed, an RSS 1.0 module does allow for easy housekeeping as well as providing usage guidelines and examples for how to use PRISM terms within an RSS 1.0 feed.&lt;/p>
&lt;p>The main interest for Crossref members will be the opportunity to update their current RSS 1.0 feeds to include the new PRISM terms &lt;strong>prism:doi&lt;/strong> and &lt;strong>prism:url&lt;/strong>. I blogged earlier &lt;a href="https://www.crossref.org/blog/prismdoi/">here&lt;/a> about &lt;strong>prism:doi&lt;/strong> as it first appeared. The suggestions I put forward there were subsequently incorporated into the Errata for 2.0 which were published in July and are avaliable as a zip file &lt;a href="https://web.archive.org/web/20081019002715/http://www.prismstandard.org//specifications/2.0/PRISM2.0Errata.zip" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>I would be very interested in receiving any feedback. I guess I should add to the v1.2 example of an RSS item in the draft an example also of a v2.0 RSS item which makes use of both &lt;strong>prism:doi&lt;/strong> and &lt;strong>prism:url&lt;/strong>.&lt;/p></description></item><item><title>Search Web Services - New Committee Drafts</title><link>https://www.crossref.org/blog/search-web-services-new-committee-drafts/</link><pubDate>Tue, 29 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/search-web-services-new-committee-drafts/</guid><description>&lt;p>As posted &lt;a href="http://listserv.loc.gov/cgi-bin/wa?A2=ind0807&amp;amp;#038;L=zng&amp;amp;#038;T=0&amp;amp;#038;P=52" target="_blank">here&lt;/a> on the &lt;a href="https://web.archive.org/web/20130303230855/http://sun8.loc.gov/listarch/zng.html" target="_blank">SRU Implementors&lt;/a> list, the &lt;a href="http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=search-ws" target="_blank">OASIS Search Web Services Technical Committee&lt;/a> has announced the release of five Committee Drafts, informally known as:&lt;/p>
&lt;ol>
&lt;li>&lt;a href="http://docs.oasis-open.org/search-ws/v1.0/apd-V1.0.html" target="_blank">Abstract Protocol Definition  (APD)&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://docs.oasis-open.org/search-ws/v1.0/sru-1-2-V1.0.html" target="_blank">Binding for SRU 1.2&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://docs.oasis-open.org/search-ws/v1.0/binding-for-get-V1.0.html" target="_blank">Auxiliary Binding for HTTP GET&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://docs.oasis-open.org/search-ws/v1.0/cql-1-2-v1.0.html" target="_blank">CQL 1.2&lt;/a>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://docs.oasis-open.org/search-ws/v1.0/opensearch-v1.0.html" target="_blank">Binding for OpenSearch&lt;/a>&lt;/ol>
Links to specific document formats are given at the bottom of the mail. A list of the TC public documents is also available &lt;a href="https://www.oasis-open.org/committees/documents.php?wg_abbrev=search-ws" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>The next phase of work for the TC will be the development of SRU/CQL 2.0, and the Description Language.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol></description></item><item><title>Does Size Matter?</title><link>https://www.crossref.org/blog/does-size-matter/</link><pubDate>Mon, 28 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/does-size-matter/</guid><description>&lt;p>Interesting &lt;a href="http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html" target="_blank">post&lt;/a> from Google, in which they say:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Recently, even our search engineers stopped in awe about just how big the web is these days — when our systems that process links on the web to find new content hit a milestone: 1 trillion (as in 1,000,000,000,000) unique URLs on the web at once!”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Puts Crossref’s 32,639,020 unique DOIs into some kind of perspective: 0.0033%. But nonetheless that trace percentage still seems to me to be reasonably large, especially in view of it forming a persistent and curated set.&lt;/p>
&lt;p>&lt;strong>&lt;em>Update:&lt;/em>&lt;/strong> Talking of Google numbers, &lt;a href="http://royal.pingdom.com/" target="_blank">pingdom&lt;/a> has a post “&lt;a href="http://royal.pingdom.com/?p=276" target="_blank">Map of all Google data center locations&lt;/a>” with maps of US, Europe and World locations.&lt;/p></description></item><item><title>Five Years</title><link>https://www.crossref.org/blog/five-years/</link><pubDate>Mon, 28 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/five-years/</guid><description>&lt;p>Oh wow! A rather remarkable plea &lt;a href="http://lists.w3.org/Archives/Public/public-lod/2008Jul/0120.html" target="_blank">here&lt;/a> from Dan Brickley on the &lt;a href="http://lists.w3.org/Archives/Public/public-lod/" target="_blank">public-lod&lt;/a> mailing list which calls for the registrant of the &lt;a href="http://dbpedia.org/" target="_blank">dbpedia.org&lt;/a> DNS entry to top it up with another 5+ years worth of clocktime. Some quotes:&lt;/p>
&lt;blockquote>
&lt;p>_“The idea of such a cool RDF namespace having only 6 months left on the DNS registration gives me the worries.”&lt;/p>&lt;/p>
&lt;p>“If you could add another 5-10 years to the DNS registration I’d sleep easier at night.”&lt;/p>&lt;/p>
&lt;p>“Let me stress I’m not suggesting that this domain is actually at risk. Just that the not-at-risk-ness isn’t readily evident from a quick look in the DNS.”&lt;/p>&lt;/p>
&lt;p>“Those in the know are probably confident this is all in hand, but as the SW gets bigger I suspect we ought to establish practices such as “vocabularies that seek global adoption should always have 5+ years on their DNS registries”.”_&lt;/p>
&lt;/blockquote>
&lt;p>Yes, and maybe those cool URIs should have &lt;a href="http://en.wikipedia.org/wiki/Kite_mark" target="_blank">kite marks&lt;/a>, too. 😉&lt;/p>
&lt;p>(Btw, for those who may not already know the maximum length of time that &lt;em>any&lt;/em> DNS name may be leased out in a single registration is 10 years, see the &lt;a href="https://web.archive.org/web/20110318010717/http://www.icann.org/en/faq" target="_blank">FAQ&lt;/a> put out by ICANN.)&lt;/p>
&lt;p>So, pity the poor user of a given semantic web application who may not know what the expectancy is behind the nodes in an RDF graph of assertions. Shifting sands, indeed.&lt;/p></description></item><item><title>Knols and Citations</title><link>https://www.crossref.org/blog/knols-and-citations/</link><pubDate>Thu, 24 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/knols-and-citations/</guid><description>&lt;p>So, Google’s &lt;a href="https://web.archive.org/web/20091108072655/http://knol.google.com/k" target="_blank">Knol&lt;/a> is now live (see &lt;a href="http://googleblog.blogspot.com/2008/07/knol-is-open-to-everyone.html" target="_blank">this announcement&lt;/a> on Google’s Blog). There’ll be comment aplenty about the merits of this service and how it compares to other user contributed content sites. But one curious detail struck me. In terms of citeability, compare how a Knol contribution (or “knol”) may be linked to as may be a corresponding entry in Wikipedia (here I’ve chosen the subject “Eclipse”):&lt;/p>
&lt;dl>
&lt;dt>&lt;em>Knol&lt;/em>&lt;/p>&lt;/dt>
&lt;dd>&lt;a href="https://web.archive.org/web/20080730124803/http://knol.google.com/k/jay-pasachoff/eclipse/IDZ0Z-SC/wTLUGw" target="_blank">https://web.archive.org/web/20080730124803/http://knol.google.com/k/jay-pasachoff/eclipse/IDZ0Z-SC/wTLUGw&lt;/a>&lt;/p>
&lt;dl>
&lt;dt>&lt;em>Wikipedia&lt;/em>&lt;/p>&lt;/dt>
&lt;dd>
&lt;p>&lt;a href="http://en.wikipedia.org/wiki/Eclipse" target="_blank">http://en.wikipedia.org/wiki/Eclipse&lt;/a>&lt;/dl>&lt;/p>
&lt;p>The Knol link includes author name, subject, and service gunk, while the Wikipedia link includes only the subject. That makes the Wikipedia link both more readily citeable as well as being to some degree discoverable. I wonder what Google’s intentions, if any, are with respect to the citing of their pages (or “knols”) as authoritative sources of information. They don’t seem to be doing themselves many favours.&lt;/p>
&lt;p>I am minded of &lt;a href="https://web.archive.org/web/20061103051120/http://q6.oclc.org/" target="_blank">this post&lt;/a> on Jeff Young’s &lt;a href="https://web.archive.org/web/20061103051120/http://q6.oclc.org/" target="_blank">Q6&lt;/a> which cites this passage from the HTTP spec (see &lt;a href="http://www.w3.org/Protocols/rfc2616/rfc2616-sec3.html#sec3.2" target="_blank">RFC 2616, Sect. 3.2&lt;/a>):&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“As far as HTTP is concerned, Uniform Resource Identifiers are simply formatted strings which identify-via name, location, or any other characteristic-a resource.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>URIs bearing these so-called “characteristics” are what I would call a service URI in contrast to a name URI (something that I will elaborate on in a separate post). For now, however, I would just note that the Knol URI looks more like a service URI and the Wikipedia URI more like a name URI. I know which URI form I would prefer to cite.&lt;/p>
&lt;/dd>
&lt;/dl>
&lt;/dd>
&lt;/dl></description></item><item><title>Knols and Citations Part II</title><link>https://www.crossref.org/blog/knols-and-citations-part-ii/</link><pubDate>Thu, 24 Jul 2008 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/knols-and-citations-part-ii/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/blog/knols-and-citations/">Tony’s post&lt;/a> highlights Knol’s “service” URIs. Another issue is that many Knol entries have nice long lists of unlinked references. The HTML code behind the references is very sparse.&lt;/p>
&lt;p>Might the DOI be of use in linking out from these references? I think so. Then, of course, there’s the issue of DOIs for Knols…&lt;/p></description></item><item><title>CrossTech By Numbers</title><link>https://www.crossref.org/blog/crosstech-by-numbers/</link><pubDate>Mon, 21 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/crosstech-by-numbers/</guid><description>&lt;p>CrossTech is two years old (less one month) and we have now seen some 145 posts. Breaking the posts down by poster we arrive at the following chart:&lt;/p>
&lt;img alt="crosstech.png" src="https://www.crossref.org/wp/blog/images/crosstech.png" width="477" height="171" />
&lt;p>Note this is not any real attempt at vainglory, more a simple excuse to play with the wonderful &lt;a href="http://code.google.com/apis/chart/" target="_blank">Google Chart API&lt;/a>. Also, above I’ve taken the liberty of putting up an image (.png), although the chart could have been generated on the fly from &lt;a href="https://web.archive.org/web/20090218215119/http://code.google.com/apis/chart" target="_blank">this link&lt;/a> (or &lt;a href="http://tinyurl.com/6k38ra" target="_blank">tinyurl here&lt;/a>).&lt;/p>
&lt;p>What is of interest in the chart is that approximately 3/4 of the posts are by Crossref members (TH, EN, RK) and 1/4 by Crossref staff (EP, GB, AT, CK). Certainly Crossref staffers are doing their bit for this blog. There’s also way too many posts from me. It would be really interesting to see some others’ views or observations per the CrossTech logo legend (&lt;em>“…, collaboration, …”&lt;/em>).&lt;/p>
&lt;p>I guess the real impediment is that one needs to request an account before posting. (Certainly there’s no reason for any member to be shy about requesting an account and posting.) Note that I haven’t considered the number of commentators to the blog which is larger than the number of posters. Also a number of Crossref members are very active with their own blogs. Those blogs with a tech focus could (should?) be scooped up by a &lt;a href="https://web.archive.org/web/20080907094552/http://www.planetplanet.org/" target="_blank">Planet&lt;/a> style aggregator if there would be sufficient interest in maintaining a publishing technology hub.&lt;/p>
&lt;p>One can only hope that the numbers will continue to grow (by direct posts or by aggregations) and that there will be a wider info share over the next couple of years.&lt;/p></description></item><item><title>Library APIs</title><link>https://www.crossref.org/blog/library-apis/</link><pubDate>Mon, 21 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/library-apis/</guid><description>&lt;p>Roy Tennant in a &lt;a href="https://web.archive.org/web/20081201160108/http://lists.webjunction.org/wjlists/xml4lib/2008-July/006059.html" target="_blank">post&lt;/a> to XML4Lib announces a new list of library APIs hosted at&lt;/p>
&lt;blockquote>
&lt;p>&lt;a href="https://web.archive.org/web/20080730080413/http://techessence.info/apis//" target="_blank">https://web.archive.org/web/20080730080413/http://techessence.info/apis//&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;p>A useful rough guide for us publishers to consider as we begin cultivating the multiple access routes into our own content platforms and tending to the “alphabet soup” that taken together comprises our public interfaces.&lt;/p></description></item><item><title>Metadata Matters</title><link>https://www.crossref.org/blog/metadata-matters/</link><pubDate>Mon, 21 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/metadata-matters/</guid><description>&lt;p>Andy Powell has &lt;a href="http://www.slideshare.net/eduservfoundation/does-metadata-matter" target="_blank">published on Slideshare&lt;/a> this talk about metadata - see his &lt;a href="http://efoundations.typepad.com/efoundations/2008/07/does-metadata-m.html" target="_blank">eFoundations post&lt;/a> for notes. It’s 130 slides long and aims&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“to cover a broad sweep of history from library cataloguing, thru the Dublin Core, Web search engines, IEEE LOM, the Semantic Web, arXiv, institutional repositories and more.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Don’t be fooled by the length though. This is a flip through and is a readily accessible overview on the importance of metadata. Slides 86-91 might be of interest here. 😉&lt;/p></description></item><item><title>PRISM Press Release</title><link>https://www.crossref.org/blog/prism-press-release/</link><pubDate>Wed, 09 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/prism-press-release/</guid><description>&lt;p>The &lt;a href="https://web.archive.org/web/20081019002715/http://www.prismstandard.org//" target="_blank">PRISM&lt;/a> metadata standards group issued a &lt;a href="https://web.archive.org/web/20160326011637/http://prismstandard.org/news/2008/PRISM_%20PR070808.pdf" target="_blank">press release&lt;/a> yesterday which covered three points:&lt;/p>
&lt;dl>
&lt;dt>PRISM Cookbook&lt;/p>&lt;/dt>
&lt;dd>The Cookbook provides &lt;em>“a set of practical implementation steps for a chosen set of use cases and provides insights into more sophisticated PRISM capabilities. While PRISM has 3 profiles, the cookbook only addresses the most commonly used profile #1, the well-formed XML profile. All recipes begin with a basic description of the business purpose it fulfills, followed by ingredients (typically a set of PRISM metadata fields or elements), and, closes with a step-by-step implementation method with sample XMLs and illustrative images.”&lt;/em>&lt;/p>
&lt;dl>
&lt;dt>PRISM 2.0 Errata &lt;/p>&lt;/dt>
&lt;dd>
&lt;p>The Errata &lt;em>“addresses a range of issues, from editorial to technical, that have been reported by the PRISM user community.”&lt;/em>&lt;/p>&lt;/p>
&lt;dl>
&lt;dt>PRISM 2.1&lt;/p>&lt;/dt>
&lt;dd>
&lt;p>The next version of the PRISM Specification, PRISM 2.1, is slated for release in late 2008. &lt;em>“This release will address complex rights for multi-platform and global distribution channels.”&lt;/em>&lt;/dl>&lt;/p>
&lt;/dd>
&lt;/dl>
&lt;/dd>
&lt;/dl>
&lt;/dd>
&lt;/dl></description></item><item><title>Now What About XMP?</title><link>https://www.crossref.org/blog/now-what-about-xmp/</link><pubDate>Tue, 08 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/now-what-about-xmp/</guid><description>&lt;p>With PDF now passed over to ISO as keeper of the format (as blogged &lt;a href="https://www.crossref.org/blog/iso-standard-for-pdf/">here&lt;/a> on CrossTech), Kas Thomas (on CMS Watch’s &lt;a href="https://web.archive.org/web/20090703195909/http://www.cmswatch.com/Trends" target="_blank">TrendWatch&lt;/a>) blogs &lt;a href="http://www.cmswatch.com/Trends/1295-PDF-now-has-a-standard-home,-but-whither-XMP?source=RSS" target="_blank">here&lt;/a> that Adobe should now do the right thing by XMP and look to hand that over too in order to establish it as a truly open standard. As he says:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Let’s cut to the chase. If Adobe wants to demonstrate its commitment to openness, it should do for XMP what it has already done for PDF: Put it in the hands of a legitimate standards body. Right now it’s open in name only. “&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>And this:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Adobe is pushing the XMP standard … at Adobe’s pace and in ways that benefit Adobe. (The parallels with PDF are numerous and obvious.) There are lingering technical issues waiting to be solved, however. Issues whose solutions shouldn’t have to be dependent on Adobe’s needs only.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>He’s absolutely bang on. With XMP on the threshold of finally shining through we really could do with Adobe cutting it loose. It’s time to leave home.&lt;/p></description></item><item><title>ISO Standard for PDF</title><link>https://www.crossref.org/blog/iso-standard-for-pdf/</link><pubDate>Thu, 03 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/iso-standard-for-pdf/</guid><description>&lt;p>I blogged &lt;a href="https://www.crossref.org/blog/an-open-pdf/">here&lt;/a> back in Jan. 2007 about Adobe submitting PDF 1.7 for standardization by ISO. From yesterday’s ISO &lt;a href="https://web.archive.org/web/20131005054720/http://www.iso.org/iso/news.htm?refid=Ref1141" target="_blank">press release&lt;/a> this:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“The new standard, &lt;strong>ISO 32000-1, Document management – Portable document format – Part 1: PDF 1.7&lt;/strong>, is based on the PDF version 1.7 developed by Adobe. This International Standard supplies the essential information needed by developers of software that create PDF files (conforming writers), software that reads existing PDF files and interprets their contents for display and interaction (conforming readers), and PDF products that read and/or write PDF files for a variety of other purposes (conforming products).”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Congrats to Adobe Systems!&lt;/p></description></item><item><title>Q6</title><link>https://www.crossref.org/blog/q6/</link><pubDate>Thu, 03 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/q6/</guid><description>&lt;p>For anybody interested in the why’s and wherefore’s of OpenURL, Jeff Young at OCLC has started posting over on his blog Q6: 6 Questions - A simpler way to understand OpenURL 1.0: Who, What, Where, When, Why, and How (note: no longer available online). He’s already amassing quite a collection of thought provoking posts. His latest is The Potential of OpenURL (note: no longer available online), from which:&lt;/p>
&lt;p>&lt;em>OpenURL has effectively cornered the niche market where Referrers need to be decoupled from Resolvers.&lt;/em>&lt;/p>
&lt;p>Blog has UML diags, definitions, musings, etc. - something for everybody. Definitely worth checking out.&lt;/p></description></item><item><title>Client Handle Demo</title><link>https://www.crossref.org/blog/client-handle-demo/</link><pubDate>Tue, 01 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/client-handle-demo/</guid><description>&lt;p>This test form shows handle value data being processed by JavaScript &lt;strong>&lt;em>in the browser&lt;/em>&lt;/strong> using an &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> service. This is different from the handle &lt;a href="http://www.handle.net/" target="_blank">proxy server&lt;/a> which processes the handle data on the server - the data here is processed by the client.&lt;/p>
&lt;p>Enter a handle and the standard OpenHandle “Hello World” document is printed. Other processing could equally be applied to the handle values. (Note that the form may not work in web-based feed readers.)&lt;/p>
&lt;div id="response">
&lt;/div></description></item><item><title>Exposing Public Data: Options</title><link>https://www.crossref.org/blog/exposing-public-data-options/</link><pubDate>Tue, 01 Jul 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/exposing-public-data-options/</guid><description>&lt;p>This is a follow-on to an &lt;a href="https://www.crossref.org/blog/exposing-public-data">earlier post&lt;/a> which set out the lie of the land as regards DOI services and data for DOIs registered with Crossref. That post differentiated between a native DOI resolution through a public DOI service which acts upon the &lt;em>“associated values held in the DOI resolution record”&lt;/em> (per &lt;a href="https://www.iso.org/standard/81599.html" target="_blank">ISO CD 26324&lt;/a>) and other related DOI protected and/or private services which merely use the DOI as a key into non-public database offering.&lt;/p>
&lt;p>Following the service architecture outlined in that post, options for exposing public data appear as follows:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Private Service &lt;ol type="a">&lt;/p>
&lt;li>
Publisher hosted – Publisher private service&lt;/ol> &lt;li>
Protected Service &lt;ol type="a">
&lt;li>
Crossref hosted – Industry protected service &lt;li>
Crossref routed – Publisher private service&lt;/ol> &lt;li>
Public Service &lt;ol type="a">
&lt;li>
Handle System (DOI handle) – Global public service (native DOI service) &lt;li>
Handle System (DOI ‘buddy’ handle) – Publisher public service&lt;/ol> &lt;/ol> &lt;p>
(Continues below.)
&lt;/p>
&lt;pre>&lt;code> &amp;lt;p&amp;gt;
&lt;/code>&lt;/pre>
&lt;/li>
&lt;/ol>
&lt;p>&lt;br /> Option #1 would make public data available through a private service at a publisher host based on the DOI. This places certain constraints on service discovery and persistence. Autodiscovery links can be placed into Web pages, but there is no opportunity to ‘embed’ the services into the DOI itself, and hence these cannot be considered native DOI services. Without a published API (and hence some degree of commitment from the publisher) the service access points (and possibly the services, too) are fragile.&lt;br /> Option #2 would require Crossref to develop a service which would either a) deliver some public data on behalf of the publisher, or b) route requests through to a bespoke publisher service. Both options would require development at Crossref and an upload mechanism for the publisher to pass along data or service address. Both options would be offered as a new member service and would thus likely be subject to membership policy arrangements. One should consider that there would be some restrictions on service operation. One possible restriction might be that this would be a one-time service registration at Crossref and that any additional services would need to be added at the publisher end.&lt;br /> Option #3 uses the existing &lt;a href="http://www.handle.net/">Handle System&lt;/a> infrastructure and provides a public read service. There are two possibilities: a) add a record (or records) to the DOI handle, or b) add records to a DOI ‘buddy’ handle under publisher control. Both require further explanation:&lt;br /> Option #3a would require Crossref consent. Unless these records (handle values) were registered by Crossref there would be concerns over interoperability. That and security concerns would almost certainly require that Crossref writes the record. But this would then need to be developed as per Option #2 above. And if a mechanism were put in place it could be restrictive in practice, e.g. not allowing additional records to be inserted (as already noted in Option #2).&lt;br /> Option #3b requires no prior Crossref consent. It is an option available to publishers who run a handle server. This can best be viewed as deploying a platform (a DOI ‘buddy’ handle) for hosting service access points with an intention to upload into the DOI handle (effectively Option #3a) as common public services are developed. In short, a public service incubator. Meantime the platform provides for an independent deployment and multiple services can be added as required. An uplink from a so-called DOI ‘buddy’ handle to the DOI handle would be maintained, and also as Crossref allows a down link from the DOI handle to the DOI ‘buddy’ handle (a ‘see also’ type link) could be established thus pairing off these two handles. (Of course, additional values whether held in the DOI resolution record or especially in an associated DOI ‘buddy’ record would be subject to common typing constraints for semantic interoperability.)&lt;br /> My personal feeling is that public data is best exposed via a public resolution record with no strings attached. That is the surest way to guarantee both data persistence and accessibility.
&lt;/p>&lt;/p></description></item><item><title>The Thing About DOI</title><link>https://www.crossref.org/blog/the-thing-about-doi/</link><pubDate>Mon, 30 Jun 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/the-thing-about-doi/</guid><description>&lt;p>With Library of Congress sometime back (Feb. ’08) &lt;a href="http://catalogablog.blogspot.com/2008/02/lccn-permalink.html" target="_blank">announcing&lt;/a> &lt;a href="http://lccn.loc.gov/" target="_blank">LCCN Permalinks&lt;/a> and NLM also (Mar. ’08) introducing &lt;a href="https://www.nlm.nih.gov/pubs/techbull/ma08/ma08_simplified_web_links.html" target="_blank">simplified web links&lt;/a> with its PubMed identifier one might be forgiven for wondering what is the essential difference between a DOI name and these (and other) seemingly like-minded identifiers from a purely web point of view. Both these identifiers can be accessed through very simple URL structures:&lt;/p>
&lt;p>With Library of Congress sometime back (Feb. ’08) &lt;a href="http://catalogablog.blogspot.com/2008/02/lccn-permalink.html" target="_blank">announcing&lt;/a> &lt;a href="http://lccn.loc.gov/" target="_blank">LCCN Permalinks&lt;/a> and NLM also (Mar. ’08) introducing &lt;a href="http://www.nlm.nih.gov/pubs/techbull/ma08/ma08_simplified_web_links.html" target="_blank">simplified web links&lt;/a> with its PubMed identifier one might be forgiven for wondering what is the essential difference between a DOI name and these (and other) seemingly like-minded identifiers from a purely web point of view. Both these identifiers can be accessed through very simple URL structures:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://lccn.loc.gov/2003556443" target="_blank">https://lccn.loc.gov/2003556443&lt;/a>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/16481614" target="_blank">http://www.ncbi.nlm.nih.gov/pubmed/16481614&lt;/a> (although &lt;a href="https://web.archive.org/web/20090106151604/http://pubmed.com/1386390" target="_blank">https://web.archive.org/web/20090106151604/http://pubmed.com/1386390&lt;/a> also works as noted &lt;a href="http://shelved.blogspot.com/2008/04/pubmed-sends-out-few-new-blooms.html" target="_blank">here&lt;/a>)&lt;/ul>
And the DOI itself can be resolved using an equally simple URL structure:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://dx.doi.org/10.1000/1" target="_blank">http://dx.doi.org/10.1000/1&lt;/a>&lt;/ul>
So, why does DOI not just present itself as a simple database number which is accessed through a simple web link and have done with it, e.g. a page for the object named by the DOI “10.1000/1” is retrieved from the DOI proxy server at &lt;a href="http://dx.doi.org/" target="_blank">http://dx.doi.org/&lt;/a>?&lt;/p>
&lt;p>Essentially the typical DOI link presents an elementary web-based URL which performs a useful redirect service. What is different about this and, say a &lt;a href="http://purl.org/" target="_blank">PURL&lt;/a>, which offers a similar redirect service? What’s the big deal?&lt;/p>
&lt;p>(Continues below.)&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>Well, the thing about DOI is that it is built upon a directory service - the &lt;a href="http://www.handle.net/" target="_blank">Handle System&lt;/a> - and can be accessed either through native directory calls or more likely through standard web interfaces. From a web point of view we are usually interested in the latter. Differently from a simple lookup and/or redirect service which has a fixed entry point on the Web, the DOI can be serviced at &lt;em>any&lt;/em> DOI service access point on the Internet. There are potentially multiple entry points which can be hosted by different organisations with separate IP addresses and/or DNS names.&lt;/p>
&lt;pre>&lt;code> For example, the [DOI proxy][8] (described [here][9]) is just _one instance_ of such a service. Others could equally exist. And, in fact, they do. The following handle web services will also take the DOI and do the business:
* &amp;lt;http://hdl.handle.net/10.1000/1&amp;gt;
* &amp;lt;http://hdl.nature.com/10.1000/1&amp;gt;&amp;lt;/ul&amp;gt;
With handle we have in essence a redirect to a redirect. Or in the case of a web service, a redirect (from HTTP to HDL) to a redirect (from HDL to HDL) to a redirect (from HDL to HTTP). That is, switch down from the web interface to the native handle layer, route the call from this local handle sever (via the global handle server) to the DOI handle server, fetch the URL stored with the DOI and switch back to the Web at that location.
But there’s more. The standard URL redirect is just _one_ example of a DOI service. But multiple services can also be provided for the DOI. Currently the DOI travels light and is bound to the minimum of useful data, essentially just the URL for a splash page in the case of many Crossref DOIs. But it could also carry pointers to structured information or to relationships with other objects.
As yet, the DOI is a fledgling in terms of realizing its true potential as a seasoned actor that can play out many roles - assume many guises. A queen bee, in effect, with a hive of worker bees servicing it. It is not joined at the hip with any particular web service as might be commonly understood with the current simple redirect service. It offers much more.
It is, however, true that both for reasons of link persistency and in order to maintain link ranking with search crawlers that a preferred web entry point is via the [DOI proxy][8]. It just doesn’t have to be that way - that’s all. Hard linking is something we are beginning to unlearn and instead we are taking our first steps towards embracing service-mediated links such as OpenURL and DOI can both offer.
&lt;/code>&lt;/pre></description></item><item><title>Handle System Workshop</title><link>https://www.crossref.org/blog/handle-system-workshop/</link><pubDate>Fri, 20 Jun 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/handle-system-workshop/</guid><description>&lt;img alt="charlemagne.jpg" src="https://www.crossref.org/wp/blog/images/charlemagne.jpg" width="275" height="134" />
&lt;p>I was invited to speak at the &lt;a href="http://www.handle.net/workshop_08/" target="_blank">Handle System Workshop&lt;/a> which was run back to back with an &lt;a href="https://www-old.doi.org/doi_presentations/members_meeting_2008/index.html" target="_blank">IDF Open Meeting&lt;/a> earlier this week in Brussels and hosted at the Office for Official Publications of the European Union. (Location was in the Charlemagne Building, at left in image, within the rather impressive meeting room Jean Durieux, at right.)&lt;/p>
&lt;p>My talk (‘&lt;a href="http://www.handle.net/workshop_08/presentations/Hammond_Handle08.ppt" target="_blank">A Distributed Metadata Architecture&lt;/a>‘) was focussed on how &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> and &lt;a href="http://www.adobe.com/products/xmp/" target="_blank">XMP&lt;/a> could be leveraged to manage dispersed media assets. (The &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> work makes the Handle and DOI systems more readily acessible to applications.)&lt;/p>
&lt;p>Other speakers were Norman Paskin (IDF), Gordon Dunsire (Centre for Digital Library Research, University of Strathclyde), Brian Green (Editeur), Jill Cousins (European Digital Library Foundation), Jan Brase (TIB, Germany), Larry Lannom (CNRI), Ed Pentz (Crossref), Nigel Ward (Link Affiliates), and Dan Broeder (CLARIN/MPG).&lt;/p>
&lt;p>The agendas for the two meetings are posted &lt;a href="https://www-old.doi.org/doi_presentations/members_meeting_2008/index.html" target="_blank">here&lt;/a> (DOI) and &lt;a href="http://www.handle.net/workshop_08/" target="_blank">here&lt;/a> (Handle).&lt;/p></description></item><item><title>PubMed Central Links to Publisher Full Text</title><link>https://www.crossref.org/blog/pubmed-central-links-to-publisher-full-text/</link><pubDate>Thu, 12 Jun 2008 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/pubmed-central-links-to-publisher-full-text/</guid><description>&lt;p>A Crossref Member Briefing is available that explains how &lt;a href="http://www.pubmedcentral.nih.gov/" target="_blank">PubMed Central (PMC)&lt;/a> links to publisher full text, how PMC uses DOIs and how PMC &lt;em>should&lt;/em> be using DOIs. The briefing is entitled &lt;a href="http://www.crossref.org/pdfs/pmc-briefing-june2008.pdf" target="_blank">“Linking to Publisher Full Text from PubMed Central” (PDF 85k)&lt;/a>.&lt;/p>
&lt;p>Crossref considers it very important the PMC uses DOIs as the main means to link to the publisher version of record for an article and we are recommending that publishers try to convince PMC to use DOIs in an automated way. Almost all of the PMC articles contain DOIs but they aren’t linked. This seems like a waste considering that publishers have invested a lot in Crossref and DOIs as unique identifiers and persistent links.&lt;/p>
&lt;p>This issue will be of interest to anyone who publishers journal articles that are the result of NIH funding and fall under the &lt;a href="http://publicaccess.nih.gov/" target="_blank">NIH Public Access Policy&lt;/a>.&lt;/p></description></item><item><title>Robots: One Standard Fits All</title><link>https://www.crossref.org/blog/robots-one-standard-fits-all/</link><pubDate>Wed, 04 Jun 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/robots-one-standard-fits-all/</guid><description>&lt;p>Interesting &lt;a href="https://web.archive.org/web/20080630064024/http://www.ysearchblog.com/archives/000587.html" target="_blank">post&lt;/a> from Yahoo! Search’s Director of Product Management, Priyank Garg, “&lt;em>One Standard Fits All: Robots Exclusion Protocol for Yahoo!, Google and Microsoft&lt;/em>“. Interesting also for what it doesn’t talk about. No mention here of &lt;a href="https://web.archive.org/web/20080514202201/http://the-acap.org/" target="_blank">ACAP&lt;/a>.&lt;/p></description></item><item><title>Exposing Public Data</title><link>https://www.crossref.org/blog/exposing-public-data/</link><pubDate>Sat, 31 May 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/exposing-public-data/</guid><description>&lt;p>As the range of public services (e.g. RSS) offered by publishers has matured this gives rise to the question: How can they expose their public data so that a user may discover them? Especially, with DOI there is now in place a persistence link infrastructure for accessing primary content. How can publishers leverage that infrastructure to advantage?&lt;/p>
&lt;p>Anyway, I offer this figure as to how I see the current lie of the land as regards DOI services and data.&lt;/p>
&lt;table>
&lt;tr>
&lt;td>
&lt;img alt="doi-services.jpg" src="https://www.crossref.org/wp/blog/images/doi-services.jpg" width="482" height="383" />
&lt;/td>
&lt;/tr>
&lt;tr>
&lt;td>
&lt;i>Legend - Current DOI service architecture showing data repositories, service access points, and open/closed data domains.&lt;/i>
&lt;/td>
&lt;/tr>
&lt;/table>
&lt;p>The figure above shows the three data repositories and service access points in the current DOI services architecture. At right and bottom of the figure are the two types of service (&lt;strong>public services&lt;/strong> and &lt;strong>private services&lt;/strong>) that together are instrumental in getting a user from a DOI-based link (on a third-party site) to the correct page of content (from the primary content provider). (Note that a fourth, private data repository – the institutional repository – comes into play when OpenURL user context-sensitive linking is added.)&lt;/p>
&lt;p>At left of the figure are services operated by Crossref on its own metadata database which support a) publisher lookups of DOI, and b) third-party metadata services (DOI-to-metadata and metadata-to-DOI conversions). These might best be labelled &lt;strong>protected services&lt;/strong> since they are not freely available: the first is open to members at a cost, while the second is free but to associated organisations only – members, affiliates, etc.&lt;/p>
&lt;p>The term &lt;strong>open data&lt;/strong> is used here in the sense implied by the current W3C SWEO LOD (&lt;a href="http://esw.w3.org/topic/SweoIG/TaskForces/CommunityProjects/LinkingOpenData" target="_blank">Linking Open Data&lt;/a>) Project. Open data is public data unencumbered by any access restrictions. By contrast, &lt;strong>closed data&lt;/strong> is data that has some access restrictions placed on it – even data that is open to affiliates. (This is not an issue that LOD addresses directly, although it is implied that data is globally ‘open’, i.e. public.)&lt;/p>
&lt;p>The current DOI service architecture thus breaks down as:&lt;/p>
&lt;ul>
&lt;li>Native DOI services – resolving the DOI token
&lt;ul>
&lt;li>Public – DOI Proxy Server (‘dx.doi.org’)&lt;/ul>
&lt;ul>
&lt;li>Related DOI services – using the DOI token
&lt;ul>
&lt;li>Protected – Crossref
&lt;ul>
&lt;li>
&lt;p>Private – Publisher&lt;/ul> &lt;/ul>
Note that a DOI is ‘resolved’ into state data registered with it, or as &lt;a href="https://www.iso.org/standard/81599.html" target="_blank">ISO CD 26324&lt;/a> puts it: &lt;em>“Resolution is the process of submitting a specific DOI name to the DOI system and receiving in return the associated values held in the DOI resolution record for one or more types of data relating to the object identified by that DOI name.”&lt;/em>&lt;/p>
&lt;p>So, how might publishers best leverage this DOI service architecture to expose their public data?&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Dark Side of the DOI</title><link>https://www.crossref.org/blog/dark-side-of-the-doi/</link><pubDate>Thu, 29 May 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/dark-side-of-the-doi/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/openhandle_p5_radial.jpg" onclick="window.open('/wp/blog/images/openhandle_p5_radial.jpg','popup','width=902,height=603,scrollbars=no,resizable=no,toolbar=no,directories=no,location=no,menubar=no,status=no,left=0,top=0'); return false">&lt;img alt="openhandle_p5_radial.jpg" src="https://www.crossref.org/wp/blog/images/openhandle_p5_radial.jpg" width="450" height="300" />&lt;/a>&lt;/p>
&lt;p>(Click to enlarge.)&lt;/p>
&lt;p>For infotainment only (and because it’s a pretty printing). Glimpse into the dark world of DOI. Here, the handle contents for &lt;a href="http://dx.doi.org/doi:10.1038/nature06930" target="_blank">doi:10.1038/nature06930&lt;/a> exposed as a standard &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> ‘Hello World’ document. Browser image courtesy of &lt;a href="http://ejohn.org/blog/processingjs/" target="_blank">Processing.js&lt;/a> and &lt;a href="http://www.mozilla.com/en-US/firefox/all-rc.html" target="_blank">Firefox 3 RC1&lt;/a>.&lt;/p></description></item><item><title>Referencing OpenURL</title><link>https://www.crossref.org/blog/referencing-openurl/</link><pubDate>Thu, 29 May 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/referencing-openurl/</guid><description>&lt;p>So, why is it just so difficult to reference OpenURL?&lt;/p>
&lt;p>Apart from the standard itself (hardly intended for human consumption - see abstract page &lt;a href="http://www.niso.org/kst/reports/standards?step=2&amp;amp;#038;project_key=d5320409c5160be4697dc046613f71b9a773cd9e" target="_blank">here&lt;/a> and PDF and don’t even think to look at those links - they weren’t meant to be cited!), seems that the best reference is to the &lt;a href="http://en.wikipedia.org/wiki/OpenURL" target="_blank">Wikipedia page&lt;/a>. There is the OpenURL Registry page at &lt;a href="http://openurl.info/registry" target="_blank">http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets&lt;/a> but this is just a workshop. Not much there beyond the OpenURL registered items. (And why does the page seem uncertain as to whether it’s a “repository” or a “registry”? Is there no difference between those terms?) The only other links are to a mix of HTML and PDF resources. (There really should be a health warning on links to PDFs - they are just not browser friendly documents.) And, I do have to wonder at this: the registry page has a link to the unofficial 0.1 version but not to the 1.0 standard. Er, why? And don’t even try this link: &lt;a href="http://openurl.info/" target="_blank">http://openurl.info/&lt;/a>. Not much info there.&lt;/p>
&lt;p>Where else to go? The &lt;a href="http://niso.org/" target="_blank">NISO site&lt;/a> allows a search on “openurl” which returns links to the standard and to other related documents.&lt;/p>
&lt;p>And then there’s the community site &lt;a href="https://web.archive.org/web/20091027024029/http://openurl.code4lib.org/" target="_blank">https://web.archive.org/web/20091027024029/http://openurl.code4lib.org/&lt;/a> targeted at developers and its &lt;a href="https://web.archive.org/web/20111104051428/http://openurl.code4lib.org/aggregator" target="_blank">Planet OpenURL&lt;/a> which is a useful source for current awareness.&lt;/p>
&lt;p>Me, I’m sticking with the Wikipedia page as the best reference for OpenURL. How odd that OpenURL aimed at improving linking on the Web should not have it’s own simple access point. Thank heavens at least that DOI has a single reference point: &lt;a href="http://doi.org/" target="_blank">http://doi.org/&lt;/a>.&lt;/p></description></item><item><title>Tombstone</title><link>https://www.crossref.org/blog/tombstone/</link><pubDate>Fri, 23 May 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/tombstone/</guid><description>&lt;p>So, the big guns have decided that &lt;a href="http://www.oasis-open.org/committees/xri/" target="_blank">XRI&lt;/a> is out. In a &lt;a href="http://lists.w3.org/Archives/Public/www-tag/2008May/0078" target="_blank">message&lt;/a> from the TAG yesterday, variously noted as being “categorical” (Andy Powell, &lt;a href="http://efoundations.typepad.com/efoundations/2008/05/w3c-technical-a.html" target="_blank">eFoundations&lt;/a>) and a “proclamation” (Edd Dumbill, &lt;a href="http://www.oreillynet.com/xml/blog/2008/05/xris_bad_uris_good.html" target="_blank">XML.com&lt;/a>), the co-chairs (Tim Berners-Lee and Stuart Williams) had this to say:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“We are not satisfied that XRIs provide functionality not readily available from http: URIs. Accordingly the TAG recommends against taking the XRI specifications forward, or supporting the use of XRIs as identifiers in other specifications.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Alas, poor XRI. But what might this also mean for other URI schemes (note the reference above to “http: URIs)? Well, the message starts out with this:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“In The Architecture of the World Wide Web &lt;a href="http://www.oasis-open.org/committees/xri/" target="_blank">1&lt;/a> the TAG sets out the reasons why http: URIs are the foundation of the value proposition for the Web, and should be used for naming on the Web. “&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Now I’m not sure that this is quite what &lt;a href="http://www.w3.org/TR/webarch/" target="_blank">AWWW&lt;/a> actually says. I don’t find it to be that insistent that “&lt;em>http” URIs … should be used for naming on the Web&lt;/em>” but I would need to read it more carefully. Certainly, “http: URIs” fit the bill and are top of the class. But there is also a general recognition that other schemes than “http:” do exist.&lt;/p>
&lt;p>Interesting times anyway with a “winner takes all” approach to identification. I wonder what this all means for DOI.&lt;/p></description></item><item><title>Metadata Reuse Policies</title><link>https://www.crossref.org/blog/metadata-reuse-policies/</link><pubDate>Tue, 20 May 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/metadata-reuse-policies/</guid><description>&lt;p>Following on from yesterday’s &lt;a href="https://www.crossref.org/blog/natures-metadata-for-web-pages">post&lt;/a> about making metadata available on our Web pages, I wanted to ask here about “metadata reuse policies”. Does anybody have a clue as to what might constitute a best practice in this area? I’m specifically interested in license terms, rather than how those terms would be encoded or carried. Increasingly we are finding more channels to distribute metadata (RSS, HTML, OAI-PMH, etc.) but don’t yet have any clear statement for our customers as to how they might reuse that data.&lt;/p>
&lt;p>Time to put the caveats aside and focus on the actuals.&lt;/p></description></item><item><title>Nature’s Metadata for Web Pages</title><link>https://www.crossref.org/blog/natures-metadata-for-web-pages/</link><pubDate>Mon, 19 May 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/natures-metadata-for-web-pages/</guid><description>&lt;p>Well, we may not be the first but wanted anyway to report that Nature has now embedded metadata (&lt;a href="http://www.w3.org/TR/html4/struct/global.html#h-7.4.4" target="_blank">HTML meta tags&lt;/a>) into all its newly published pages including full text, abstracts and landing pages (all bar four titles which are currently being worked on). Metadata coverage extends back through the Nature archives (and depth of coverage varies depending on title). This conforms to the W3C’s Guideline 13.2 in the &lt;a href="http://www.w3.org/TR/1999/WAI-WEBCONTENT-19990505/#gl-facilitate-navigation" target="_blank">Web Content Accessibility Guidelines 1.0&lt;/a> which exhorts content publishers to “provide metadata to add semantic information to pages and sites”.&lt;/p>
&lt;p>Metadata is provided in both DC and PRISM formats as well as in Google’s own bespoke metadata format. This generally follows the &lt;a href="http://dublincore.org/documents/dc-html/" target="_blank">DCMI recommendation&lt;/a> “&lt;em>Expressing Dublin Core metadata using HTML/XHTML meta and link elements&lt;/em>, and the earlier &lt;a href="http://www.ietf.org/rfc/rfc2731.txt" target="_blank">RFC 2731&lt;/a> “&lt;em>Encoding Dublin Core Metadata in HTML”&lt;/em>. (Note that schema name is normalized to lowercase.) Some notes:&lt;/p>
&lt;ul>
&lt;li>The DOI is included in the “&lt;code>dc.identifier&lt;/code>” term in URI form which is the Crossref recommendation for citing DOI.
&lt;ul>
&lt;li>We could consider adding also “&lt;code>prism.doi&lt;/code>” for disclosing the native DOI form. This requires the PRISM namespace declaration to be bumped to v2.0. We might consider synchronizing this change with our RSS feeds which are currently pegged at v1.2, although note that the RSS module &lt;a href="https://web.archive.org/web/20080726155717/http://www.prismstandard.org/resources/mod_prism.html" target="_blank">mod_prism&lt;/a> currently applies only to PRISM v1.2.
&lt;ul>
&lt;li>We could then also add in a “&lt;code>prism.url&lt;/code>” term to link back (through the DOI proxy server) to the content site. The namespace issue listed above still holds.
&lt;ul>
&lt;li>The “&lt;code>citation_&lt;/code>” terms are not anchored in any published namespace which does make this term set problematic in application reuse. It would be useful to be able to reference a namespace (e.g. “&lt;code>rel=&amp;quot;schema.gs&amp;quot; href=&amp;quot;...&amp;quot;&lt;/code>“) for these terms and to cite them as e.g. “&lt;code>gs.citation_title&lt;/code>“. &lt;/ul>
The HTML metadata sets from an example &lt;a href="http://dx.doi.org/10.1038/nature06925" target="_blank">landing page&lt;/a> are presented below.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>If you view the page source you should see something like the text below. (Note that you may have to scroll past whitespace which is emitted by the HTML template generator.)&lt;/p>
&lt;pre>&lt;code> &amp;lt;pre&amp;gt;&amp;amp;lt;link title=&amp;quot;schema(DC)&amp;quot; rel=&amp;quot;schema.dc&amp;quot; href=&amp;quot;http://purl.org/dc/elements/1.1/&amp;quot; /&amp;amp;gt;
&lt;/code>&lt;/pre>
&lt;p>&amp;lt;meta name=&amp;ldquo;dc.publisher&amp;rdquo; content=&amp;ldquo;Nature Publishing Group&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;dc.language&amp;rdquo; content=&amp;ldquo;en&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;dc.rights&amp;rdquo; content=&amp;quot;© 2008 Nature Publishing Group&amp;quot; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;dc.title&amp;rdquo; content=&amp;ldquo;Crystal structure of squid rhodopsin&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;dc.creator&amp;rdquo; content=&amp;ldquo;Midori Murakami&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;dc.creator&amp;rdquo; content=&amp;ldquo;Tsutomu Kouyama&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;dc.identifier&amp;rdquo; content=&amp;ldquo;doi:10.1038/nature06925&amp;rdquo; /&amp;gt;
&amp;lt;link title=&amp;ldquo;schema(PRISM)&amp;rdquo; rel=&amp;ldquo;schema.prism&amp;rdquo; href=&amp;ldquo;&lt;a href="https://web.archive.org/web/20080516191035/http://www.prismstandard.org//namespaces/1.2/basic/%22" target="_blank">https://web.archive.org/web/20080516191035/http://www.prismstandard.org//namespaces/1.2/basic/"&lt;/a> /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.copyright&amp;rdquo; content=&amp;rdquo;© 2008 Nature Publishing Group&amp;quot; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.rightsAgent&amp;rdquo; content=&amp;ldquo;&lt;a href="mailto:permissions@nature.com">permissions@nature.com&lt;/a>&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.publicationName&amp;rdquo; content=&amp;ldquo;Nature&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.issn&amp;rdquo; content=&amp;ldquo;0028-0836&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.eIssn&amp;rdquo; content=&amp;ldquo;1476-4687&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.volume&amp;rdquo; content=&amp;ldquo;453&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.number&amp;rdquo; content=&amp;ldquo;7193&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.startingPage&amp;rdquo; content=&amp;ldquo;363&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;prism.endingPage&amp;rdquo; content=&amp;ldquo;367&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_journal_title&amp;rdquo; content=&amp;ldquo;Nature&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_publisher&amp;rdquo; content=&amp;ldquo;Nature Publishing Group&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_authors&amp;rdquo; content=&amp;ldquo;Midori Murakami, Tsutomu Kouyama&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_title&amp;rdquo; content=&amp;ldquo;Crystal structure of squid rhodopsin&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_volume&amp;rdquo; content=&amp;ldquo;453&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_issue&amp;rdquo; content=&amp;ldquo;7193&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_firstpage&amp;rdquo; content=&amp;ldquo;363&amp;rdquo; /&amp;gt;
&amp;lt;meta name=&amp;ldquo;citation_doi&amp;rdquo; content=&amp;ldquo;doi:10.1038/nature06925&amp;rdquo; /&amp;gt;
&lt;/pre>&lt;/p>
&lt;pre>&lt;code> While it is not expected that search engines will index these terms directly and that no direct SEO is intended, we think there is enough value for applications to make use of these terms. The terms are reasonably accessible to simple scripts, etc. Note that even in [RFC 2731][4] (published in 1999) there is a Perl script listed in Section 9 which allows the metadata name/value pairs to be easily pulled out. Running this over the example page yields the following output:
&amp;lt;pre&amp;gt;@(urc;
&lt;/code>&lt;/pre>
&lt;p>@|MISSING ELEMENT NAME; text/css
@|MISSING ELEMENT NAME; text/html; charset=iso-8859-1
@|robots; noarchive
@|keywords; Nature, science, science news, biology, physics, genetics, astronomy, astrophysics, quantum physics, evolution, evolutionary biology, geophysics, climate change, earth science, materials science, interdisciplinary science, science policy, medicine, systems biology, genomics, transcriptomics, palaeobiology, ecology, molecular biology, cancer, immunology, pharmacology, development, developmental biology, structural biology, biochemistry, bioinformatics, computational biology, nanotechnology, proteomics, metabolomics, biotechnology, drug discovery, environmental science, life, marine biology, medical research, neuroscience, neurobiology, functional genomics, molecular interactions, RNA, DNA, cell cycle, signal transduction, cell signalling.
@|description; Nature is the international weekly journal of science: a magazine style journal that publishes full-length research papers in all disciplines of science, as well as News and Views, reviews, news, features, commentaries, web focuses and more, covering all branches of science and how science impacts upon all aspects of society and life.
@|dc.publisher; Nature Publishing Group
@|dc.language; en
@|dc.rights; #169; 2008 Nature Publishing Group
@|dc.title; Crystal structure of squid rhodopsin
@|dc.creator; Midori Murakami
@|dc.creator; Tsutomu Kouyama
@|dc.identifier; doi:10.1038/nature06925
@|prism.copyright; © 2008 Nature Publishing Group
@|prism.rightsAgent; &lt;a href="mailto:permissions@nature.com">permissions@nature.com&lt;/a>
@|prism.publicationName; Nature
@|prism.issn; 0028-0836
@|prism.eIssn; 1476-4687
@|prism.volume; 453
@|prism.number; 7193
@|prism.startingPage; 363
@|prism.endingPage; 367
@|citation_journal_title; Nature
@|citation_publisher; Nature Publishing Group
@|citation_authors; Midori Murakami, Tsutomu Kouyama
@|citation_title; Crystal structure of squid rhodopsin
@|citation_volume; 453
@|citation_issue; 7193
@|citation_firstpage; 363
@|citation_doi; doi:10.1038/nature06925
@)urc;
&lt;/pre>&lt;/p></description></item><item><title>DOIs and PubMed Central - why no links?</title><link>https://www.crossref.org/blog/dois-and-pubmed-central-why-no-links/</link><pubDate>Wed, 14 May 2008 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/dois-and-pubmed-central-why-no-links/</guid><description>&lt;p>Further to my previous post &lt;a href="https://www.crossref.org/blog/nih-mandate-and-pmcids/">“NIH Mandate and PMCIDs”&lt;/a> we’ve been looking into linking to articles on publishers’ sites from PubMed Central (PMC). There are a couple of ways this happens currently (see details below) but these are complicated and will lead to broken links and more difficulty for PMC and publishers in managing the links. Crossref is going to be putting together a briefing note for its members on this soon.&lt;/p>
&lt;p>The main issue we are raising with PMC, and that we will encourage publishers to raise too, is &lt;strong>why doesn’t PMC just automatically link DOIs?&lt;/strong> Most of the articles in PMC have DOIs so this would require very little effort from PMC and &lt;strong>no&lt;/strong> effort from publishers and would give readers a perisistent link to the publisher’s version of an article.&lt;/p>
&lt;p>Current PMC linking methods. 1) Links on &lt;a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2377029" target="_blank">Author Manuscripts&lt;/a> in PMC are pulled in from PubMed’s &lt;a href="http://www.ncbi.nlm.nih.gov/projects/linkout/" target="_blank">LinkOut service&lt;/a> which requires the publisher to register with PubMed and provide linking files. The DOI can be specified as the linking mechanism via LinkOut.&lt;/p>
&lt;ol start="2">
&lt;li>For &lt;a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2254226&amp;amp;#038;rendertype=abstract" target="_blank">final version of articles&lt;/a> in PMC the journal image at the top of the page can be &lt;a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2248749&amp;amp;#038;rendertype=abstract" target="_blank">linked to the journal homepage&lt;/a> or can have a &lt;a href="http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=2235871&amp;amp;#038;rendertype=abstract" target="_blank">“this article” link&lt;/a> to the publisher’s site. The publisher has to sign up with PMC for specifying the header graphic and the links. &lt;a href="https://web.archive.org/web/20080916065531/http://www.pubmedcentral.nih.gov/pmcdoc/pubsetup.doc" target="_blank">https://web.archive.org/web/20080916065531/http://www.pubmedcentral.nih.gov/pmcdoc/pubsetup.doc (word doc)&lt;/a> say “The static base (&lt;a href="http://www.biomedcentral.com/" target="_blank">http://www.biomedcentral.com/&lt;/a>) of the URLs for this link comes from the HTML template. PMC then dynamically completes the URL by adding an issn/vol/page. ” and then says that any item in the XML (such as the DOI) can be used.&lt;/li>
&lt;/ol>
&lt;p>Both of the approaches outlined above require extra work and will be difficult for smaller publishers. In addition, the links will be fragile by not being based on DOIs. Publishers can specify that DOIs can be used but it isn’t easy. We’d like to leverage the resources that publishers have already put into the DOI system but automatically making the DOIs active links - it would be very easy.&lt;/p></description></item><item><title>OpenHandle: Languages Support</title><link>https://www.crossref.org/blog/openhandle-languages-support/</link><pubDate>Mon, 21 Apr 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/openhandle-languages-support/</guid><description>&lt;p>Following up the earlier &lt;a href="https://www.crossref.org/blog/openhandle-google-code-project/">post&lt;/a> on &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a>, there are now a number of language examples which have been contributed to the project. The diagram below shows the OpenHandle service in schematic with various languages support. Briefly, OpenHandle aims to provide a web services interface to the Handle System to simplify access to the data stored for a given Handle.&lt;/p>
&lt;p>(Note that the diagram is an HTML imagemap and all elements are “clickable”.)&lt;/p>
&lt;p>&lt;em>Note: this diagram is no longer available online as of 2023. We show the code here for reference.&lt;/em>&lt;/p>
&lt;pre tabindex="0">&lt;code>&amp;lt;map name=&amp;#34;GraffleExport&amp;#34;&amp;gt;
&amp;lt;area shape=poly coords=&amp;#34;302,133,273,117,244,133,266,149,261,157,274,150,302,133&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeLisp&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;359,93,330,77,302,93,324,109,318,117,332,110,359,93&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeFSharp&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;186,93,157,77,129,93,151,109,145,117,159,110,186,93&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeAppleScript&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;244,93,215,77,186,93,208,109,203,117,217,110,244,93&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeCSharp&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;244,174,215,157,186,174,208,189,203,197,217,190,244,174&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodePython&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;244,133,215,117,186,133,208,149,203,157,217,150,244,133&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeJavaScript&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;186,174,157,157,129,174,151,189,145,197,159,190,186,174&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodePhp&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;302,93,273,77,244,93,266,109,261,117,274,110,302,93&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeErlang&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;359,133,330,117,302,133,324,149,318,157,332,150,359,133&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodePerl&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;302,174,273,157,244,174,266,189,261,197,274,190,302,174&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeRuby&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;359,174,330,157,302,174,324,189,318,197,332,190,359,174&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeSmalltalk&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;186,133,157,117,129,133,151,149,145,157,159,150,186,133&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/wiki/OpenHandleCodeJava&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;255,237,255,260,266,260,244,272,222,260,233,260,233,237,222,237,244,225,266,237,255,237&amp;#34; href=&amp;#34;http://www.ietf.org/rfc/rfc2616.txt&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;174,527,174,495,196,491,217,495,217,527,196,531,174,527&amp;#34; href=&amp;#34;http://hdl.handle.net/&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;270,527,270,495,292,491,314,495,314,527,292,531,270,527&amp;#34; href=&amp;#34;http://hdl.handle.net/&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;152,268,210,268,210,300,152,304,152,268&amp;#34; href=&amp;#34;http://nascent.nature.com/openhandle/handle?id=10100/nature&amp;amp;#038;mimetype=text/plain&amp;amp;#038;format=rdf&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;201,307,258,307,258,339,201,343,201,307&amp;#34; href=&amp;#34;http://nascent.nature.com/openhandle/handle?id=10100/nature&amp;amp;#038;mimetype=text/plain&amp;amp;#038;format=n3&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;267,297,325,297,325,329,267,333,267,297&amp;#34; href=&amp;#34;http://nascent.nature.com/openhandle/handle?id=10100/nature&amp;amp;#038;mimetype=text/plain&amp;amp;#038;format=json&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;255,426,255,450,266,450,244,461,222,450,233,450,233,426,222,426,244,414,266,426,255,426&amp;#34; href=&amp;#34;http://www.ietf.org/rfc/rfc3651.txt&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;262,355,262,388,226,388,226,355,262,355&amp;#34; href=&amp;#34;http://nascent.nature.com/openhandle/handle?id=10100/nature&amp;amp;#038;mimetype=text/plain&amp;amp;#038;format=rdf&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;277,223,292,208,320,208,344,220,330,235,301,235,277,223&amp;#34; href=&amp;#34;http://nascent.nature.com/openhandle/handle?id=10.1000/1&amp;amp;#038;mimetype=text/plain&amp;amp;#038;format=json&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;148,244,162,229,191,229,215,241,201,256,172,256,148,244&amp;#34; href=&amp;#34;http://nascent.nature.com/openhandle/handle?id=10100/nature&amp;amp;#038;mimetype=text/plain&amp;amp;#038;format=json&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;222,507,222,475,244,471,266,475,266,507,244,511,222,507&amp;#34; href=&amp;#34;http://hdl.handle.net/&amp;#34;&amp;gt; &amp;lt;area shape=poly coords=&amp;#34;49,215,89,102,191,62,305,79,401,143,393,282,315,350,198,363,105,317,49,215,120,481,140,501,178,501,199,474,186,442,152,436,122,447,120,481,49,215,57,569,79,571,89,547,71,534,53,548,57,569,49,215,57,569&amp;#34; href=&amp;#34;http://code.google.com/p/openhandle/&amp;#34;&amp;gt;
&amp;lt;/map&amp;gt;
&lt;/code>&lt;/pre></description></item><item><title>NIH Mandate and PMCIDs</title><link>https://www.crossref.org/blog/nih-mandate-and-pmcids/</link><pubDate>Tue, 15 Apr 2008 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/nih-mandate-and-pmcids/</guid><description>&lt;p>The &lt;a href="http://publicaccess.nih.gov/" target="_blank">NIH Public Access Policy&lt;/a> says “When citing their NIH-funded articles in NIH applications, proposals or progress reports, authors must include the PubMed Central reference number for each article” and the &lt;a href="http://publicaccess.nih.gov/FAQ.htm#c6" target="_blank">FAQ&lt;/a> provides some examples of this:&lt;/p>
&lt;p>Examples:&lt;/p>
&lt;p>Varmus H, Klausner R, Zerhouni E, Acharya T, Daar A, Singer P. 2003. PUBLIC HEALTH: Grand Challenges in Global Health. Science 302(5644): 398-399. PMCID: 243493&lt;/p>
&lt;p>Zerhouni, EA. (2003) A New Vision for the National Institutes of Health. Journal of Biomedicine and Biotechnology (3), 159-160. PMCID: 400215&lt;/p>
&lt;p>It’s interesting to note that on PMC itself both the [The &lt;a href="http://publicaccess.nih.gov/" target="_blank">NIH Public Access Policy&lt;/a> says “When citing their NIH-funded articles in NIH applications, proposals or progress reports, authors must include the PubMed Central reference number for each article” and the &lt;a href="http://publicaccess.nih.gov/FAQ.htm#c6" target="_blank">FAQ&lt;/a> provides some examples of this:&lt;/p>
&lt;p>Examples:&lt;/p>
&lt;p>Varmus H, Klausner R, Zerhouni E, Acharya T, Daar A, Singer P. 2003. PUBLIC HEALTH: Grand Challenges in Global Health. Science 302(5644): 398-399. PMCID: 243493&lt;/p>
&lt;p>Zerhouni, EA. (2003) A New Vision for the National Institutes of Health. Journal of Biomedicine and Biotechnology (3), 159-160. PMCID: 400215&lt;/p>
&lt;p>It’s interesting to note that on PMC itself both the][3] - but the DOI isn’t linked. Two things occur to me - 1) should Crossref map DOIs to PMCIDs and vice versa and make PMCIDs available in it’s query interfaces and 2) shouldn’t publishers ask that the PMC copy of the article link back to the publisher version? It would be very easy with the DOI.&lt;/p></description></item><item><title>Word Add-in for Scholarly Authoring and Publishing</title><link>https://www.crossref.org/blog/word-add-in-for-scholarly-authoring-and-publishing/</link><pubDate>Wed, 26 Mar 2008 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/word-add-in-for-scholarly-authoring-and-publishing/</guid><description>&lt;p>Last week Pablo Fernicola sent me email announcing that Microsoft have finally released a beta of their Word plugin for marking-up manuscripts with the NLM DTD. I say “finally” because we’ve know this was on the way and have been pretty excited to see it. We once even hoped that MS might be able to show the plug-in at the &lt;a href="http://www.alpsp.org.uk/ngen_public/article.asp?id=335&amp;amp;#038;did=47&amp;amp;#038;aid=1244&amp;amp;#038;st=&amp;amp;#038;oaid=-1" target="_blank">ALPSP session on the NLM DTD&lt;/a>, but we couldn’t quite manage it.&lt;/p>
&lt;p>The plugin is targeted at production/editorial staff, but, of course, it will be interesting to see if any of this work can be pushed back to the author. I won’t hold my breath on the latter score, but it will be fun to watch.&lt;/p>
&lt;p>One thing I would note is that the NLM DTD can also be used in the humanities and social sciences, so, frankly, I think they should market it more broadly.&lt;/p>
&lt;p>Anyway- the plugin can be &lt;a href="http://www.microsoft.com/downloads/details.aspx?FamilyId=09C55527-0759-4D6D-AE02-51E90131997E&amp;amp;#038;displaylang=en" target="_blank">downloaded&lt;/a> from the Microsoft site.&lt;/p>
&lt;p>And Pablo has setup a &lt;a href="https://web.archive.org/web/20080725223420/http://blogs.msdn.com/exscientia/archive/2008/03/20/Technology-Preview-Launch.aspx" target="_blank">blog where testers can discuss&lt;/a> the add-in.&lt;/p>
&lt;p>And there is also an &lt;a href="https://web.archive.org/web/20080411085902/http://www.microsoft.com/mscorp/tc/scholarly-publishing.mspx" target="_blank">entry for the project&lt;/a> on the Microsoft Research site (an interesting place to peruse, if you have a moment).&lt;/p>
&lt;p>Congatulations to Pablo and his team.&lt;/p></description></item><item><title>OpenHandle: Google Code Project</title><link>https://www.crossref.org/blog/openhandle-google-code-project/</link><pubDate>Fri, 07 Mar 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/openhandle-google-code-project/</guid><description>&lt;p>Just announced on the &lt;a href="http://www.handle.net/mail-archive/handle-info/msg00254.html" target="_blank">handle-info&lt;/a> and &lt;a href="http://lists.w3.org/Archives/Public/semantic-web/2008Mar/0054.html" target="_blank">semantic-web&lt;/a> mailing lists is the &lt;a href="http://code.google.com/p/openhandle/" target="_blank">OpenHandle&lt;/a> project on Google Code. This may be of some interest to the DOI community as it allows the handle record underpinning the DOI to be exposed in various common text-based serializations to make the data stored within the records more accessible to Web applications. Initial serializations include RDF/XML, RDF/N3, and JSON.&lt;/p>
&lt;p>We’d be very interested in receiving feedback on this project - either on this blog or over on the &lt;a href="https://code.google.com/archive/p/openhandle/" target="_blank">project wiki&lt;/a>.&lt;/p></description></item><item><title>Object Reuse and Exchange</title><link>https://www.crossref.org/blog/object-reuse-and-exchange/</link><pubDate>Wed, 05 Mar 2008 00:00:00 +0000</pubDate><author>Chuck Koscher</author><guid>https://www.crossref.org/blog/object-reuse-and-exchange/</guid><description>&lt;p>On March 3rd the Open Archives Initiative held a roll out meeting of the first alpha release of the ORE specification (&lt;a href="http://www.openarchives.org/ore/" target="_blank">http://www.openarchives.org/ore/&lt;/a>) . According to Herbert Van de Sompel a beta release is planned for late March / early April and a 1.0 release targeted for September. The presentations focused on the aggregation concepts behind ORE and described an ATOM based implementation. ORE is the second project from the OAI but unlike its sibling PMH it is not exclusively a repository technology. ORE provides machine readable manifests for related Web resources in any context. For instance, DOI landing pages (aka splash page) are human readable resources containing links to any number of resources related to the work identified by the DOI. An ORE instance for the DOI (called a Rem or resource map) would describe the same set of resources in a machine friendly format. A standardized form of redirection understood by the DOI proxy would yield the Rem instead of normal page e.g.&lt;/p>
&lt;p>&lt;code>http://dx.doi.org/10.5555/abcd?type=rem&lt;/code>
which could be useful for crawlers.&lt;/p>
&lt;p>A second roll out meeting is planned during the Sparc-08 workshops in early April.&lt;/p></description></item><item><title>ISO/CD 26324 (DOI)</title><link>https://www.crossref.org/blog/iso/cd-26324-doi/</link><pubDate>Fri, 22 Feb 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/iso/cd-26324-doi/</guid><description>&lt;p>Following on from my &lt;a href="https://www.crossref.org/blog/prismdoi/">previous post&lt;/a> about &lt;tt>prism:doi&lt;/tt> I didn’t mention, or reference, the ongoing ISO work on DOI, Indeed I hadn’t realized that the DOI site now has a &lt;a href="http://doi.org/about_the_doi.html#standards" target="_blank">status update&lt;/a> on the ISO work:&lt;/p>
&lt;blockquote>
&lt;p>_“The DOI® System is currently being standardised through ISO. It is expected that the process will be finalised during 2008. In December 2007 the Working Group for this project approved a final draft as a Committee Draft (standard for voting) which is now being processed by ISO. Copies of the Committee Draft (&lt;a href="http://doi.org/ISO_Standard/sc9n475.pdf" target="_blank">SC9N475&lt;/a>) and an accompanying explanatory document detailing issues dealt with during the standards process (&lt;a href="http://doi.org/ISO_Standard/sc9n474.pdf" target="_blank">SC9N474&lt;/a>) are provided here for information.&lt;/p>&lt;/p>
&lt;p>Committee Draft 26324 is subject to ISO’s copyright and is for information only to those interested in the project; it may not be re-distributed. This is currently undergoing the formal ISO voting process; the deadline for comments on CD 26324 from TC46/SC9’s national bodies is April 25, 2008: please contact your national member of ISO TC46/SC9 if you would like it contribute to comments on this draft standard. Other documents for the ISO DOI Working Group are available on a &lt;a href="https://web.archive.org/web/20070610161109/http://www.lac-bac.gc.ca/iso/tc46sc9/wg7/index.html" target="_blank">DOI Project Register&lt;/a>.”_&lt;/p>
&lt;/blockquote></description></item><item><title>prism:doi</title><link>https://www.crossref.org/blog/prismdoi/</link><pubDate>Fri, 22 Feb 2008 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/prismdoi/</guid><description>&lt;p>The new &lt;a href="https://idealliance.org/workflow-innovations-publishers-requirement-for-industry-standard-metadata-prism/" target="_blank">PRISM&lt;/a> spec (v. 2.0) was published this week, see the &lt;a href="https://web.archive.org/web/20160326011620/http://www.prismstandard.org//news/2008/PRISM_%20PR021908.pdf" target="_blank">press release&lt;/a>. (Downloads are available &lt;a href="https://web.archive.org/web/20080828062500/http://www.prismstandard.org/specifications/" target="_blank">here&lt;/a>.)&lt;/p>
&lt;p>This is a significant development as there is support for XMP profiles, to complement the existing XML and RDF/XML profiles. And, as PRISM is one of the major vocabularies being used by publishers, I would urge you all to go take a look at it and to consider upgrading your applications to using it.&lt;/p>
&lt;p>&lt;strong>One caveat.&lt;/strong> There’s a new element &lt;code>&amp;lt;tt&amp;gt;prism:doi&amp;lt;/tt&amp;gt;&lt;/code> (PRISM Namespace, 4.2.13) which sits alongside another new element &lt;code>&amp;lt;tt&amp;gt;prism:url&amp;lt;/tt&amp;gt;&lt;/code> (PRISM Namespace, 4.2.55). Unfortunately the &lt;code>&amp;lt;tt&amp;gt;prism:doi&amp;lt;/tt&amp;gt;&lt;/code> element is shown to take DOI proxy URL as its value - and not the DOI string itself, e.g.&lt;/p>
&lt;ul>
&lt;li>Model #1&lt;br>
&lt;code>&amp;lt;prism:doi rdf:resource=”http://dx.doi.org/10.1030/03054”/&amp;gt;&lt;/code>&lt;/li>
&lt;li>Model #2&lt;br>
&lt;code>&amp;lt;prism:doi&amp;gt;http://dx.doi.org/10.1030/03054&amp;lt;/prism:doi&amp;gt;&lt;/code>&lt;/li>
&lt;/ul>
&lt;p>This seems to me to just plain wrong. The DOI in itself is not a URL (or URI) - although can, and should, be represented in URI form when used in Web contexts (i.e. pretty much most of the time). As a literal it should be used in its native form as specified in &lt;a href="https://web.archive.org/web/20150923223621/http://www.niso.org/apps/group_public/download.php/14689/z39-84-2005_r2010.pdf" target="_blank"> ANSI/NISO Z39.84 - 2005 Syntax for the Digital Object Identifier&lt;/a>. This would only satisfy Model #2 above.&lt;/p>
&lt;p>To satisfy Model #1 above a URI form for DOI would be required. And this is &lt;strong>not&lt;/strong> the service URI denoted by the proxy. It would either have to be:&lt;/p>
&lt;ul>
&lt;li>Model #1 - Registered URI Form&lt;br>
&lt;code>&amp;lt;prism:doi rdf:resource=”info:doi/10.1030/03054”/&amp;gt;&lt;/code>
* Model #1 - Unregistered URI Form&lt;br>
&lt;code>&amp;lt;prism:doi rdf:resource=”doi:10.1030/03054”/&amp;gt;&lt;/code>&lt;/li>
&lt;/ul>
&lt;p>Any comments? Some guidelines from Crossref would be useful - although maybe further discussion is required. It is, of course, a constant bugbear that “doi:” remains an unregistered URI scheme.&lt;/p></description></item><item><title>Added XML format parameter to Crossref’s OpenURL resolver</title><link>https://www.crossref.org/blog/added-xml-format-parameter-to-crossrefs-openurl-resolver/</link><pubDate>Wed, 13 Feb 2008 00:00:00 +0000</pubDate><author>Chuck Koscher</author><guid>https://www.crossref.org/blog/added-xml-format-parameter-to-crossrefs-openurl-resolver/</guid><description>&lt;p>From the beginning our OpenURL resolver has had a non standard feature of returning metadata in response to a request instead of redirecting to the referrent. This feature returned one of our older XML formats which is a bit limited as to the fields it contains.&lt;/p>
&lt;p>Sometime after our resolver was deployed we introduced a more verbose XML format for DOI metadata called ‘UNIXREF”. This was always available to regular queries against the Crossref system but was never introduced to the OpenURL resolver (for no particular reason).&lt;/p>
&lt;p>We’ve since learned that some user’s are relying on the OpenURL’s metadata feature to build proper references in situations where they have a DOI and that the older XML format is insufficient. Therefor I’ve added a ‘format’ parameter to our OpenURL resolver which allows one to request the more verbose UNIXREF. (see &lt;a href="http://www.crossref.org/openurl" target="_blank">www.crossref.org/openurl&lt;/a>)&lt;/p>
&lt;p>As always please feel free to contact us regarding new features or changes to existing features that might be helpful.&lt;/p>
&lt;p>Regards,&lt;/p>
&lt;p>Chuck&lt;/p></description></item><item><title>Crossref Citation Plugin (for WordPress)</title><link>https://www.crossref.org/blog/crossref-citation-plugin-for-wordpress/</link><pubDate>Sat, 09 Feb 2008 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/crossref-citation-plugin-for-wordpress/</guid><description>&lt;p>OK, after a number of delays due to everything from indexing slowness to router problems, I’m happy to say that the first public beta of our &lt;a href="http://wordpress.org/" target="_blank">WordPress&lt;/a> citation plugin is available for &lt;a href="https://sourceforge.net/projects/crossref-cite/" target="_blank">download via SourceForge&lt;/a>. A &lt;a href="http://www.movabletype.org/" target="_blank">Movable Type&lt;/a> version is in the works.&lt;/p>
&lt;p>And congratulations to Trey at OpenHelix who became laudably impatient, &lt;a href="https://web.archive.org/web/20080216002622/http://www.openhelix.com/blog/?p=128" target="_blank">found the SourceForge entry for the plugin&lt;/a> back on February 8th and seems to have been testing it since. He has a nice description of how it works (along with screenshots), so I won’t repeat the effort here.&lt;/p>
&lt;p>Having said that, I do include the text of the README after the jump. Please have a look at it before you install, because it might save you some mystification.&lt;/p>
&lt;h2 id="description">Description&lt;/h2>
&lt;p>A WordPress plugin that allows you to search Crossref metadata using citations or partial citations. When you find the reference that you want, insert the formatted and DOI-linked citation into your blog posting along with supporting &lt;a href="https://web.archive.org/web/20090927174724/http://ocoins.info/" target="_blank">COINs&lt;/a> metadata. The plugin supports both a long citation format and a short (op. cit.) format.&lt;/p>
&lt;h2 id="warnings-caveats-and-weasel-words">Warnings, Caveats and Weasel Words&lt;/h2>
&lt;p>Please note the following about this plugin:&lt;/p>
&lt;ol>
&lt;li>We are releasing this as a test. It is running on R&amp;amp;D equipment in a non-production environment and so it may disappear without warning or perform erratically. If it isn’t working for some reason, come back later and try again. If it seems to be broken for a prolonged period of time, then please report the problem to us via sourceforge.&lt;/li>
&lt;li>There is currently a 20 item limit on the number of hits returned per query. This might seem arbitrary and stingy, but please remember- we are not trying to create a fully blown search engine- we’re just trying to create a citation lookup service. Of course, if, after looking at how the service is used, it looks like we need to up this limit, we will.&lt;/li>
&lt;li>If you look in the plugin options (or at the code), you will see that the system includes an API key. At the moment we have no restrictions on use of this service, but have included this in case we need to protect the system from abuse.&lt;/li>
&lt;li>The bulk of the functionality we have developed is actually at the back-end. This plugin is just a lightweight interface to that back-end. You can examine the guts of the plugin in order to easily figure out how to create similar functionality for your favorite blog platform, wiki, etc. If you do create something, please let us know. We’d love to see what people are building.&lt;/li>
&lt;li>We are continuing to experiment with the metadata search function in order to increase its accuracy and flexibility. Again, this might result in seemingly inconsistent behavior. Did we mention that this is a test?&lt;/li>
&lt;li>Please note that this API is not meant for bulk harvesting of Crossref metadata. If you need such facilities, then please look at our web site for information about our metadata services.&lt;/li>
&lt;li>The data currently behind the plugin is *just* a December 2007 snapshot of our our complete journal article metadata. We have not added books or proceedings yet. We will do so soon and we will start updating the metadata weekly.&lt;/li>
&lt;/ol>
&lt;p>We welcome your ideas for tools that we can provide to help researchers. Please, please, please send comments, requests, queries and ideas to us at:&lt;/p>
&lt;p>&lt;a href="mailto:citation-plugin@crossref.org">citation-plugin@crossref.org&lt;/a>&lt;/p></description></item><item><title>CLADDIER Final Report</title><link>https://www.crossref.org/blog/claddier-final-report/</link><pubDate>Tue, 15 Jan 2008 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/claddier-final-report/</guid><description>&lt;p>I just ran across the final report from the &lt;a href="http://www.ukoln.ac.uk/repositories/digirep/index/CLADDIER" target="_blank">CLADDIER project.&lt;/a> CLADDIER comes from the &lt;a href="http://www.jisc.ac.uk/" target="_blank">JISC&lt;/a> and stands for “CITATION, LOCATION, And DEPOSITION IN DISCIPLINE &amp;amp; INSTITUTIONAL REPOSITORIES”. I suspect JISC has an entire department dedicated to creating impossible acronyms (the JISC Acronym Preparation Executive?)&lt;/p>
&lt;p>Anyhoo- the report describes a distributed citation location and updating service based on the &lt;a href="http://en.wikipedia.org/wiki/Linkback" target="_blank">linkback&lt;/a> mechanism that is widely used in the blogging community.&lt;/p>
&lt;p>I think this is an interesting approach and is one that I talked about &lt;a href="http://www.uksg.org/sites/uksg.org/files/PresentationBilder.pdf" target="_blank">briefly&lt;/a> (PDF) at the &lt;a href="https://web.archive.org/web/20080512153431/http://www.uksg.org/events/measure" target="_blank">UKSG’s Measure for Measure seminar&lt;/a> last June. I think that, like most proponents of p2p distributed architectures, they massively underestimate the problem of trust in the network. They fully knowledge the problem of linkback spam, but their hand-wavy-solution(tm) of using whitelists just means the system effectively becomes semi-centralized again (you have to have trusted keepers of the whitelists).&lt;/p>
&lt;p>And of course I was mildly exasperated by the report’s characterization of one of the perceived “disadvantages” of the Crossref architectural model being a :&lt;/p>
&lt;blockquote>
&lt;p>“Centralised service hosting a large persistent store – with the need for a (possibly commercial) business model to justify providing the service.”&lt;/p>
&lt;/blockquote>
&lt;p>Though DOI registries like &lt;a href="http://www.bowker.com/" target="_blank">Bowker&lt;/a> and &lt;a href="http://www.doi.nielsenbookdata.co.uk" target="_blank">Nielsen Bookdata&lt;/a> are commercial, Crossref, the organisation that services the industry that the JISC is concerned with, is *not* a commercial service.&lt;/p>
&lt;p>Also if you replaced the phrase “justify providing” with the word “sustain”, the sentence wouldn’t sound like such a “disadvantage.”&lt;/p>
&lt;p>But aside from these quibbles, the report makes an interesting (if technical) read.&lt;/p></description></item><item><title>BISG Paper on Identifying Digital Book Content</title><link>https://www.crossref.org/blog/bisg-paper-on-identifying-digital-book-content/</link><pubDate>Mon, 14 Jan 2008 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/bisg-paper-on-identifying-digital-book-content/</guid><description>&lt;p>BISG and BIC have published a discussion paper called “The identification of digital book content” - &lt;a href="https://web.archive.org/web/20090920075334/http://www.bisg.org/docs/DigitalIdentifiers_07Jan08.pdf" target="_blank">https://web.archive.org/web/20090920075334/http://www.bisg.org/docs/DigitalIdentifiers_07Jan08.pdf&lt;/a>. The paper discusses ISBN, ISTC and DOI amongst other things and makes a series of recommendations which basically say to consider applying DOI, ISBN and ISTC to digital book content. The paper highlights in a positive way that DOI and ISBN are different but can work together (the idea of the “actionable ISBN” and aiding discovery of content). However, it doesn’t go into much depth on any of the issues or really explain how all these identifiers would work together and the critical role that metadata plays.&lt;/p>
&lt;p>Nevertheless it’s great that the paper has been put forward as a discussion document - Crossref plans to respond and be part of the ongoing discussion in this area.&lt;/p></description></item><item><title>On Google Knol</title><link>https://www.crossref.org/blog/on-google-knol/</link><pubDate>Fri, 14 Dec 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/on-google-knol/</guid><description>&lt;p>The recently discussed (announced?) &lt;a href="https://web.archive.org/web/20210926222403/https://googleblog.blogspot.com/2007/12/encouraging-people-to-contribute.html" target="_blank">Google Knol&lt;/a> project could make Google Scholar look like a tiny blip in the the scholarly publishing landscape.&lt;/p>
&lt;p>I love the comment an authority:&lt;/p>
&lt;p>“Books have authors’ names right on the cover, news articles have bylines, scientific articles always have authors — but somehow the web evolved without a strong standard to keep authors names highlighted. We believe that knowing who wrote what will significantly help users make better use of web content.”&lt;/p>
&lt;p>And so I suppose this means they are assigning author identifiers….&lt;/p></description></item><item><title>Zotero and the IA</title><link>https://www.crossref.org/blog/zotero-and-the-ia/</link><pubDate>Fri, 14 Dec 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/zotero-and-the-ia/</guid><description>&lt;p>Dan Cohen at Zotero reports (&lt;a href="http://www.dancohen.org/2007/12/12/zotero-and-the-internet-archive-join-forces" target="_blank">Zotero and the Internet Archive Join Forces&lt;/a>) on a very interesting tie up that will allow researchers using Zotero to deposit content in the Internet Archive and have OCR done on scanned material for free under a two year Mellon grant. Each piece of content will be given a “permanent URI that includes a time and date stamp in addition to the URL” ( would Handle or DOI add value here?) and be part of Zotero Commons (things can also be kept private within a group).&lt;/p>
&lt;p>Zotero Commons is related to but different from Nature Precedings and WebCite in that it’s intended focus is on public domain stuff on researchers hard drives rather than someone else’s material or website that is cited (WebCite) or preprints, datasets, technical reports that are given at least an initial screening (Nature Precedings).&lt;/p></description></item><item><title>STM Innovations 2007</title><link>https://www.crossref.org/blog/stm-innovations-2007/</link><pubDate>Mon, 10 Dec 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/stm-innovations-2007/</guid><description>&lt;p>After a busy Online Information conference, Friday was the STM Innovations Meeting in London (presentations not online yet). There was a very nice selection of tea which helped get the morning off to a good start.&lt;/p>
&lt;p>Patricia Seybold kicked off with a review of Web 2.0 that mentioned lots of sites and some good case studies:&lt;/p>
&lt;p>Alexander Street Press (&lt;a href="https://alexanderstreet.com/" target="_blank">https://alexanderstreet.com/&lt;/a>) - user tags combined with a taxonomy.&lt;/p>
&lt;p>Slideshare (&lt;a href="http://www.slideshare.net" target="_blank">http://www.slideshare.net&lt;/a>) - share presentations&lt;/p>
&lt;p>Threadless (&lt;a href="http://www.threadless.com/" target="_blank">http://www.threadless.com/&lt;/a>) - design and vote on t-shirts&lt;/p>
&lt;p>The most interesting parts of the talk were the case studies of how National Instruments and Staples have built a vibrant community of customers. Staples invited top purchasers on the their site to create product categories and sales went up 30% and now they use the categorization in physical stores and customer reviews from the web are used in stores.&lt;/p>
&lt;p>NI has a whole suite of tools that allow customers to build products and get their jobs done (using NI products and services).&lt;/p>
&lt;p>Five steps to Web 2.0 success –&lt;/p>
&lt;ol>
&lt;li>
&lt;p>Focus on findability&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Solicit sutomers’ reviews, ratings and opinions&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Empower users to classify and organize content&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Nurture community, social networks, communities of practice&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Get lead users to strut their stuff, using your IP to build their IP&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>The most useful part came in the questions when Geoffrey Bilder asked about &lt;a href="http://en.wikipedia.org/wiki/Astroturfing" target="_blank">“astroturfing”&lt;/a> - this is a problem for Web 2.0. Interestingly, the NI and Staples examples are closed communities and other sites have to have moderators to try and track this stuff down. Often you don’t hear about these types of issues amid the web 2.0 boosterism.&lt;/p>
&lt;p>Joris van Rossum gave an very good overview of Scirus’ wiki-based Topic Pages (&lt;a href="https://web.archive.org/web/20071231210906/http://topics.scirus.com/" target="_blank">https://web.archive.org/web/20071231210906/http://topics.scirus.com/&lt;/a>). It’s interesting to see the creative way Elsevier is experimenting. Joris said that it is Elsevier’s vision that wiki forms a promising topic-centered platform for informal collaboration and the sharing of highly relevant info within STM in addition to the traditional peer-reviewed system. There is a critical issuem though - will researchers go to publishers for this type of thing or will they self-organize using inexpensive tools? The danger here is that publishers will do their own thing leading to a replay of the portal craze in the late 90s.&lt;/p>
&lt;p>Geoffrey Bilder gave a very good talk entitled “Anonymous Bosh: Attribution in a Mashed-up World” about trust and CrossReg (contributor ID).&lt;/p>
&lt;p>Simon Willison gave a very good explanation and update on OpenID. Some resources for more information - &lt;a href="http://openid.net" target="_blank">http://openid.net&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://openid.net/developers/how-connect-works/" target="_blank">http://www.openidenabled.com&lt;/a>&lt;/p>
&lt;p>&lt;a href="https://web.archive.org/web/20070715235636/http://simonwillison.net/tags/openid/" target="_blank">https://web.archive.org/web/20070715235636/http://simonwillison.net/tags/openid/&lt;/a>&lt;/p>
&lt;p>Mark Bide wrapped things up with an update on ACAP (&lt;a href="https://web.archive.org/web/20071019045302/http://www.the-acap.org/" target="_blank">https://web.archive.org/web/20071019045302/http://www.the-acap.org/&lt;/a>)- “an evolving, open, royalty-free standard for expression of permissions in machine readable form” - that was launched in November. Will the search engines pay any attention?&lt;/p>
&lt;p>Overall, the day was very thought provoking.&lt;/p></description></item><item><title>Search Web Services Document</title><link>https://www.crossref.org/blog/search-web-services-document/</link><pubDate>Fri, 09 Nov 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/search-web-services-document/</guid><description>&lt;p>The OASIS Search Web Services TC has just put out the following document for public review (Nov 7- Dec 7, 2007):&lt;/p>
&lt;blockquote>
&lt;p>_Search Web Services v1.0 Discussion Document&lt;/p>&lt;/p>
&lt;ul>
&lt;li>Editable Source: &lt;a href="http://docs.oasis-open.org/search-ws/v1.0/DiscussionDocument.doc" target="_blank">http://docs.oasis-open.org/search-ws/v1.0/DiscussionDocument.doc&lt;/a>
&lt;ul>
&lt;li>PDF: &lt;a href="http://docs.oasis-open.org/search-ws/v1.0/DiscussionDocument.pdf" target="_blank">http://docs.oasis-open.org/search-ws/v1.0/DiscussionDocument.pdf&lt;/a>
&lt;ul>
&lt;li>
&lt;p>HTML: &lt;a href="http://docs.oasis-open.org/search-ws/v1.0/DiscussionDocument.html" target="_blank">http://docs.oasis-open.org/search-ws/v1.0/DiscussionDocument.html&lt;/a> &lt;/ul>
&lt;/i>&lt;/blockquote>&lt;/p>
&lt;p>From the OASIS announcement:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“This document: “Search Web Services Version 1.0 - Discussion Document - 2 November 2007”, was prepared by the OASIS Search Web Services TC as a strawman proposal, for public review, intended to generate discussion and interest. It has no official status; it is not a Committee Draft. The specification is based on the SRU (Search Retrieve via URL) specification which can be found at &lt;a href="http://www.loc.gov/standards/sru/" target="_blank">http://www.loc.gov/standards/sru/&lt;/a>. It is expected that this standard, when published, will deviate from SRU. How much it will deviate cannot be predicted at this time. The fact that the SRU spec is used as a starting point for development should not be cause for concern that this might be an effort to rubberstamp or fasttrack SRU. The committee hopes to preserve the useful features of SRU, eliminate those that are not considered useful, and add features that are not in SRU but are considered useful. “&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/blockquote></description></item><item><title>DC in (X)HTML Meta/Links</title><link>https://www.crossref.org/blog/dc-in-xhtml-meta/links/</link><pubDate>Tue, 06 Nov 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/dc-in-xhtml-meta/links/</guid><description>&lt;p>This &lt;a href="http://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind0711&amp;amp;#038;L=dc-general&amp;amp;#038;T=0&amp;amp;#038;F=&amp;amp;#038;S=&amp;amp;#038;X=1DEA157B9F8232DF23&amp;amp;#038;Y=t.hammond%40nature.com&amp;amp;#038;P=969" target="_blank">message&lt;/a> posted out yesterday on the dc-general list (with following extract) may be of interest:&lt;/p>
&lt;blockquote>
&lt;p>_“Public Comment on encoding specifications for Dublin Core metadata in HTML and XHTML&lt;/p>&lt;/p>
&lt;p>2007-11-05, Public Comment is being held from 5 November through 3 December 2007 on the DCMI Proposed Recommendation, “Expressing Dublin Core metadata using HTML/XHTML meta and link elements” &lt;a href="http://dublincore.org/documents/2007/11/05/dc-html/" target="_blank">&amp;laquo;http://dublincore.org/documents/2007/11/05/dc-html/&amp;raquo;&lt;/a> by Pete Johnston and Andy Powell. Interested members of the public are invited to post comments to the DC-ARCHITECTURE mailing list &lt;a href="http://www.jiscmail.ac.uk/lists/dc-architecture.html" target="_blank">&amp;laquo;http://www.jiscmail.ac.uk/lists/dc-architecture.html&amp;raquo;&lt;/a> , including “[DC-HTML Public Comment]” in the subject line. Depending on comments received, the specification may be finalized after the comment period as a DCMI Recommendation.”&lt;/i>&lt;/blockquote>&lt;/p>
&lt;/blockquote></description></item><item><title>STIX Fonts in Beta</title><link>https://www.crossref.org/blog/stix-fonts-in-beta/</link><pubDate>Tue, 06 Nov 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/stix-fonts-in-beta/</guid><description>&lt;p>Well, Howard already &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2007/11/stix_fonts_go_beta.html" target="_blank">blogged on Nascent&lt;/a> last week about the &lt;a href="http://www.stixfonts.org/" target="_blank">STIX fonts&lt;/a> (Scientific and Technical Information Exchange) being launched and now freely available in beta. And today the STM Association also have &lt;a href="https://web.archive.org/web/20080725054716/http://www.stm-assoc.org/home/stix-fonts-project-completes-design-phase.html" target="_blank">blogged&lt;/a> this milestone mark. So, just for the record, I’m noting here on CrossTech those links for easy retrieval. As Howard says:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“I recommend all publishers download the fonts from the STIX web site at &lt;a href="http://www.stixfonts.org/" target="_blank">www.stixfonts.org&lt;/a> today.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>(And for those who want to see more of Howard, he can be found in interview &lt;a href="http://www.scribemedia.org/2007/03/28/howard-ratner/" target="_blank">here&lt;/a> on the SIIA Executive FaceTime Webcast Series. 🙂&lt;/p></description></item><item><title>DCMI Identifiers Community</title><link>https://www.crossref.org/blog/dcmi-identifiers-community/</link><pubDate>Wed, 17 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/dcmi-identifiers-community/</guid><description>&lt;p>Another DCMI invitation. And a list. Lovely.&lt;/p>
&lt;p>See &lt;a href="http://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind0710&amp;amp;#038;L=dc-general&amp;amp;#038;T=0&amp;amp;#038;F=&amp;amp;#038;S=&amp;amp;#038;P=1223" target="_blank">this message&lt;/a> (copied below) from Douglas Campbell, National Library of New Zealand, to the &lt;a href="http://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=dc-general" target="_blank">dc-general&lt;/a> mailing list.&lt;/p>
&lt;p>(Continues)&lt;/p>
&lt;blockquote>
&lt;p>_“Hi all,&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>I would like to alert members of this list to the new DCMI Identifiers Community established at the recent Dublin Core Metadata Initiative (DCMI) Advisory Board meeting in Singapore. It is moderated by Douglas Campbell (National Library of New Zealand).&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>The community is a forum for individuals and organisations with an interest in the design and use of identifiers in metadata. It also serves as a liaison channel for those involved in identifier efforts in other domains.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>There was a lot of interest in identifiers at the recent DCMI conference. Identifiers are fundamental to the Web and for managing digital content, but most of us don’t know where to begin in designing and assigning them. The level of confusion can be seen in the number of meetings and workshops held just about identifiers. DCMI is in a unique position to bring together the thinking (and doing) around identifiers from multiple domains.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>I would like to encourage you to share your identifier efforts and thinking amongst the DCMI community on our Identifiers wiki at:&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="http://dublincore.org/identifierswiki" target="_blank">http://dublincore.org/identifierswiki&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>You can join the community by signing up to our JISCMAIL list, linked from our community homepage at:&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="http://www.dublincore.org/groups/identifiers/" target="_blank">http://www.dublincore.org/groups/identifiers/&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>or by going direct to jiscmail:&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="http://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=dc-identifiers&amp;amp;#038;A=1" target="_blank">http://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=dc-identifiers&amp;#038;A=1&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Thanx,&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Douglas”_&lt;/p>
&lt;/blockquote></description></item><item><title>Hybrid</title><link>https://www.crossref.org/blog/hybrid/</link><pubDate>Wed, 17 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/hybrid/</guid><description>&lt;p>So, back on the old XMP tack. The simple vision from the XMP spec is that XMP packets are embedded in media files and transported along with them - and as such are relatively self-contained units, see Fig 1.&lt;/p>
&lt;img alt="Hybrid - A.jpg" src="https://www.crossref.org/wp/blog/images/Hybrid%20-%20A.jpg" width="316" height="176" />
&lt;p>&lt;em>Fig. 1 - Media files with fully encapsulated descriptions.&lt;/em>&lt;/p>
&lt;p>But this is too simple. Some preliminary considerations lead us to to see why we might want to reference additional (i.e. external) sources of metadata from the original packet:&lt;/p>
&lt;dl>
&lt;dt>&lt;em>PDFs&lt;/em>&lt;/dt>
&lt;dd>PDFs are tightly structured and as such it can be difficult to write a new packet, or to update an existing packet. One solution &lt;a href="https://www.crossref.org/blog/i-want-my-xmp/">proposed earlier&lt;/a> is to embed a minimal packet which could then reference a more complete description in a standalone packet. (And in turn this standalone packet could reference additional sources of metadata.)&lt;/dd>
&lt;dt>&lt;em>Images&lt;/em>&lt;/dt>
&lt;dd>While considerably simpler to write into web-delivery image formats (e.g. JPEG, GIF, PNG), it is the case that metadata pertinent to the image only is likely to be embedded. Also, of interest is the work from which the image is derived which is most likely to be presented externally to the image as a standalone document. (And in turn this standalone packet could reference additional sources of metadata.)&lt;/dd>
&lt;/dl>
&lt;p>(Continues)&lt;/p>
&lt;p>Thus the two cases - PDF documents and images - are not dissimilar. Fig. 2 shows a “wall-to-wall” XMP architecture whereby the standalone metadata documents for the work and for additional sources are expressed in XMP.&lt;/p>
&lt;img alt="Hybrid - B.jpg" src="https://www.crossref.org/wp/blog/images/Hybrid%20-%20B.jpg" width="314" height="191" />
&lt;p>&lt;em>Fig. 2 - XMP “wall-to-wall” architecture.&lt;/em>&lt;/p>
&lt;p>Fig. 3 presents a variant on this theme whereby additional sources are presented as generic RDF/XML. (In the most general case only RDF need be assumed, the serialization being a matter of choice.)&lt;/p>
&lt;img alt="Hybrid - C.jpg" src="https://www.crossref.org/wp/blog/images/Hybrid%20-%20C.jpg" width="303" height="192" />
&lt;p>&lt;em>Fig. 3 - XMP authority metadata with references to generic RDF/XML&lt;/em>&lt;/p>
&lt;p>And finally, Fig. 4 shows the most extreme case whereby XMP is used merely to “bootstrap” RDF descriptions for media objects. The XMP is used to embed a minimal description into the media file with references to a fuller work description and to additional sources which are presented as generic RDF/XML. That is, the metadata descriptions use generic RDF/XML exclusively and only resort to the idiomatic RDF/XML employed by XMP for embedding descriptions into binary structures.&lt;/p>
&lt;img alt="Hybrid - D.jpg" src="https://www.crossref.org/wp/blog/images/Hybrid%20-%20D.jpg" width="303" height="191" />
&lt;p>&lt;em>Fig. 4 - XMP “bootstrap” only - metadata descriptions proper are generic RDF/XML.&lt;/em>&lt;/p>
&lt;p>If I were to choose I might opt for the scenario presented in Fig. 3, but the scenarios in both Figs. 2 and 4 leave room for thought. Such a hybrid solution may be a means to bridge two different concerns:&lt;/p>
&lt;ul>
&lt;li>Generic RDF/XML for unconstrained descriptions.
&lt;ul>
&lt;li>Idiomatic RDF/XML (aka XMP) for embedding the head of a metadata trail. &lt;/ul>
I’m not sure that I see the XMP spec loosening up any time soon to accommodate generic RDF/XML. Nor, likewise is XMP likely to be provided (or even tolerated) down the metadata trail. And the metadata is not going to be fully encapsulated within a media file. The media file will merely encapsulate the head of the metadata trail.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>NLM Blog Citation Guidelines</title><link>https://www.crossref.org/blog/nlm-blog-citation-guidelines/</link><pubDate>Mon, 15 Oct 2007 00:00:00 +0000</pubDate><author>Geoffrey Bilder</author><guid>https://www.crossref.org/blog/nlm-blog-citation-guidelines/</guid><description>&lt;p>&lt;a href="http://www.boingboing.net/2007/10/12/howto-cite-blogs-in.html" target="_blank">I’ve just returned from Frankfurt Book fair and noticed that there has been some recent&lt;/a> in the &lt;a href="http://www.ncbi.nlm.nih.gov/books/bookres.fcgi/citmed/frontpage.html" target="_blank">The NLM Style Guide for Authors, Editors and Publishers&lt;/a> recommendations concerning &lt;a href="http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=citmed.section.61024" target="_blank">citing blogs&lt;/a>.&lt;/p>
&lt;p>Which reminds me of an issue that has periodically been raised here at Crossref- should we be doing something to try and provide a service for reliably citing more ephemeral content such as blogs, wikis, etc.?&lt;/p>
&lt;p>Personally, I cringe when I see people include plain old URLs (POUs?) in citations. What’s the point? They are almost guaranteed to &lt;a href="http://en.wikipedia.org/wiki/Link_rot" target="_blank">fail to resolve&lt;/a> after a few years. In citing them, you are hardly helping to preserve the scholarly record. You might as well just record the metadata associated with the content.&lt;/p>
&lt;p>So why don’t we simply allow individuals to assign DOIs to their content?&lt;/p>
&lt;p>As Chuck Koscher says, “Crossref DOIs are only as persistent as Crossref staff.” Crossref depends on its ability to chase down and berate member publishers when they fail to update their DOI records. Its hard enough doing this with publishers, so just imagine what it would be like trying to chase down individuals. In short, it just wouldn’t scale.&lt;/p>
&lt;p>But what if we provided a different service for more informal content? Recently we have been in talking with Gunther Eysenbach, the creator of the very cool &lt;a href="http://www.webcitation.org/" target="_blank">WebCite&lt;/a> service about whether Crossref could/should operate a citation caching service for ephemera.&lt;/p>
&lt;p>As I said, I think WebCite is wonderful, but I do see a few problems with it in its current incarnation.&lt;/p>
&lt;p>The first is that, the way it works now, it seems to effectively leech usage statistics away from the source of the content. If I have a blog entry that gets cited frequently, I certainly don’t want all the links (and their associated &lt;a href="http://en.wikipedia.org/wiki/Google_juice" target="_blank">Google-juice&lt;/a>) redirected away from my blog. As long as my blog is working, I want traffic coming to my copy of the content, not some cached copy of the content (gee- the same problem publishers face, no?). I would also, ideally, like that traffic to continue to come to to my blog if I move hosting providers, platforms (WordPress, Moveable Type) , blog conglomerates (Gawker, Weblogs, Inc.), etc.&lt;/p>
&lt;p>The second issue I have with WebCite is simpler. I don’t really fancy having to actually recreate and run a web-caching infrastructure when there is already a &lt;a href="http://www.archive.org/index.php" target="_blank">formidable one&lt;/a> in existence.&lt;/p>
&lt;p>So what if we ran a service for individuals that worked like this:&lt;/p>
&lt;ol>
&lt;li>
&lt;p>For a fee, you can assign DOIs to your ephemeral, CC-licensed content.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>When you assign a DOI to an item of content (or update an existing DOI), we will immediately archive said content with the Internet Archive (who, incidentally, &lt;a href="http://www.archive-it.org/" target="_blank">charges for this service&lt;/a>)&lt;/p>
&lt;/li>
&lt;li>
&lt;p>We will direct those DOIs to your web site as long as you are both:&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Paying the fee&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Updating your URLs to point to the correct content&lt;/p>
&lt;/li>
&lt;li>
&lt;p>If you fail in either “a” or “b”, we will then redirect said DOIs to the cached version of the content on the Internet Archive (after having warned you repeatedly via automated e-mail).&lt;/p>
&lt;/li>
&lt;/ol>
&lt;p>(Note, as an aside, that we could in theory provide a similar dark-archive service for publishers with non free content using something like JStore as the archive)&lt;/p>
&lt;p>This approach would help to ensure that a blogger’s version of content was always linked to as long it was available. It would also preserve the “persistence” of Crossref DOIs by making sure that we could always resolve the DOI even if we were not able to get the owner of said DOI to update it.&lt;/p>
&lt;p>So back to the NLM guidelines… On the one hand, I’m delighted to see that the NLM has issued guidelines on citing blogs. It seems glaringly obvious that informal (and ephemeral) content such as blogs and wikis are increasingly becoming vital parts of the scholarly record. On the other hand, it also seems to me that recommending that somebody “cite” with a broken pointer (i.e. a URL) to content verges on tokenism. This isn’t the NLM’s fault- there just isn’t a reliable mechanism for citing informal content in a manner that ensures you can then retrieve and look at said content in the future.&lt;/p>
&lt;p>And this is no longer a problem confined to the Scholarly/Professional publishing space. As Jon Udell has occasionally &lt;a href="http://blog.jonudell.net/2007/01/29/the-persistent-blogosphere/" target="_blank">pointed out,&lt;/a> citation is increasingly an important currency for *any* professional writer on the web. It seems to me that a system for reliably citing blogs and wikis would benefit many communities. I could easily see commercial hosted Blog services (Blogger, WordPress) offering a “Cached-DOI” feature as a premium service to their clients.&lt;/p>
&lt;p>So what do you think? What am I missing? is this something we should be looking at?&lt;/p></description></item><item><title>OpenDocument Adds RDF</title><link>https://www.crossref.org/blog/opendocument-adds-rdf/</link><pubDate>Sun, 14 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/opendocument-adds-rdf/</guid><description>&lt;p>Bruce D’Arcus left a comment &lt;a href="https://www.crossref.org/blog/metadata-for-the-record/">here&lt;/a> in which he linked to post of his: “&lt;a href="https://web.archive.org/web/20071117090331/http://netapps.muohio.edu/blogs/darcusb/darcusb/archives/2007/10/13/opendocuments-new-metadata-system" target="_blank">OpenDocument’s New Metadata System&lt;/a>“. Not everybody reads comments so I’m repeating it here. His post is worth reading on two counts:&lt;/p>
&lt;ol>
&lt;li>He talks about the new metadata functionality for OpenDocument 1.2 which uses &lt;em>generic&lt;/em> RDF. As he says:&lt;/li>
&lt;/ol>
&lt;pre>&lt;code>&amp;gt; _&amp;amp;#8220;Unlike Microsoft’s custom schema support, we provide this through the standard model of RDF. What this means is that implementors can provide a generic metadata API in their applications, based on an open standard, most likely just using off-the-shelf code libraries.&amp;amp;#8221;_
This is great. It means that description is left up to the user rather than being restricted by any vendor limitation. (Ideally we would like to see the same for XMP. But Adobe is unlikely to budge because of the legacy code base and documents. It’s a wonder that Adobe still wants XMP to breathe.)
* He cites a wonderful passage from Rob Weir of IBM (something which I had been considering to blog but too late now) about the changing shape of documents. Can only say, go read [Bruce’s post][2] and then [Rob’s post][3]. But anyway a spoiler here:
&amp;gt; _&amp;amp;#8220;The concept of a document as being a single storage of data that lives in a single place, entire, self-contained and complete is nearing an end. A document is a stream, a thread in space and time, connected to other documents, containing other documents, contained in other documents, in multiple layers of meaning and in multiple dimensions.&amp;amp;#8221;_&amp;lt;/ol&amp;gt;
I think the ODF initiative is fantastic and wish that Adobe could follow suit. However, I do still hold out something for XMP. After all, nobody else AFAICT is doing anything remotely similar for multimedia. Where’s the W3C and co. when you really need them? (Oh yeah, [faffing][4] about the new [Semantic Web logo][5]. 😉
&lt;/code>&lt;/pre></description></item><item><title>I Want My XMP</title><link>https://www.crossref.org/blog/i-want-my-xmp/</link><pubDate>Sat, 13 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/i-want-my-xmp/</guid><description>&lt;p>Now, assuming &lt;a href="http://www.adobe.com/products/xmp/" target="_blank">XMP&lt;/a> is a good idea - and I think on balance it is (as blogged earlier), why are we not seeing any metadata published in scholarly media files? The only drawbacks that occur to me are:&lt;/p>
&lt;ol>
&lt;li>Hard to write - it’s too damn difficult, no tools support, etc.
&lt;ul>
&lt;li>
&lt;p>Hard to model - rigid, “simple” XMP data model, both complicates and constrains the RDF data model &lt;/ol>
Well, I don’t really believe that 1) is too difficult to overcome. A little focus and ingenuity should do the trick. I do, however, think 2) is just a crazy straitjacket that Adobe is forcing us all to wear but if we have to live with that then so be it. Better in Bedlam than without. (RSS 1.0 wasn’t so much better but allowed us to do some useful things. And that came from the RDF community itself.) We could argue this till the cows come home but I don’t see any chance of any change any time soon.&lt;/p>
&lt;p>(Continues)&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ol>
&lt;p>So, putting the RDF issue aside for the moment (as if RDF didn’t have problems of its own - XML, URI, etc.) let’s just look at the options for writing the stuff. (Btw, I’m not referencing any tools or toolkits. This is just in the round.) There are various means of publishing metadata in XMP:&lt;/p>
&lt;pre>&lt;code> **Sidecar**
: XMP can be produced as standalone files - see [XMP Specification, (Sept. ’05)][3], p. 36. (These are called &amp;amp;#8220;sidecar&amp;amp;#8221; files if the file has the same name as the main document and is in the same directory.) The only things needed to produce these files are a text editor and a good grasp of the XMP serialization. A template will do for that. The main problem with a standalone file is that it does not travel with the media file and so risks being left behind.
Worth a note here. Not standalone as such but the [Mars][4] format (the draft XML formalization for PDF) discloses its metadata in an independent XMP file &amp;amp;#8220;metadata.xml&amp;amp;#8221; under the &amp;amp;#8220;META-INF/&amp;amp;#8221; directory. For distribution the whole directory structure is packaged up as a zip file and so the XMP is embedded in a &amp;amp;#8220;.mars&amp;amp;#8221; file, but accessed directly from the zip file or from the unpackaged directory the XMP can be manipulated just like any other XML document.
**Embedded**
: This is the normal means of distributing XMP - embedded within the media file. Some graphics formats are essentially linear (JPEG, PNG, GIF) and it is relatively straightforward to add in an XMP packet. Other formats (PDF, TIFF) have internal cross-referencing and are more difficult to deal with.
**Embedded + Sidecar**
: One possible method for dealing with the difficulty of writing XMP is to note that some media (especially PDFs) already have embedded XMP packets. As noted earlier, much if not all of the metadata in these XMP packets will be workflow-related and thus dispensible for final-form products where authority work-related metadata is desired. These packets may, or may not, be writeable and thus include additional padding whitespace. Even for read-only packets there is much (if not all) that can be discarded and also sometimes unnecesary bulk (e.g. default namespace declarations which are never used). _The bottom line is that any legacy XMP packet may typically be 2-3K in size and, just as in transplanting a cell nucleus, the XMP packet innards can be deftly substituted with a minimal XMP packet content, say 1K in size, which would be guaranteed to fit with suitable padding._ A packet that size would be sufficient to provide at minimum for a DOI and for a reference to additional metadata, e.g. a more complete standalone XMP packet. The two forms can coexist.
The third way option here allows embedding a minimal XMP packet into &amp;amp;#8220;difficult&amp;amp;#8221; packaging structures while pointing out to a fully-formed XMP packet. The &amp;amp;#8220;simple&amp;amp;#8221; packaging structures may both include a fully-formed XMP packet while also possibly referencing extended metadata sources as per my previous post [here][4].
&lt;/code>&lt;/pre></description></item><item><title>Metadata - For the Record</title><link>https://www.crossref.org/blog/metadata-for-the-record/</link><pubDate>Sat, 13 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/metadata-for-the-record/</guid><description>&lt;p>Interesting post from &lt;a href="http://adobemax2007.com/na/speakers/listing/#penikisgunar" target="_blank">Gunar Penikis&lt;/a> of Adobe entitled “Permanent Metadata” Oct. ’04). &lt;sup id="fnref:1">&lt;a href="#fn:1" class="footnote-ref" role="doc-noteref">1&lt;/a>&lt;/sup>.&lt;/p>
&lt;p>He talks about the the issues of embedding metadata in media and comes up with this:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“It may be the case that metadata in the file evolves to become a “cache of convenience” with the authoritative information living on a web service. The web service model is designed to provide the authentication and permissions needed. The link between the two provided by unique IDs. In fact, unique IDs are already created by Adobe applications and stored in the XMP - that is what the XMP Media Management properties are all about.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>An intriguing idea. Of course, Gunar’s (and Adobe’s) preoccupations with metadata revolve mainly around document workflow whereas, at least as things stand currently, scholarly publisher concerns are mainly with the dissemination of media in final form. Hence some differences in thinking:&lt;/p>
&lt;dl>
&lt;dt>&lt;strong>Subject&lt;/strong>&lt;/dt>
&lt;dd>As just noted Adobe are more interested in workflow than in work. Scholarly articles are rich in descriptive metadata about the work itself and have a well-developed ctation model. Academic interest is in the intellectual content rather than the vehicle used to carry and preserve that content - the file format.&lt;/dd>
&lt;dt>&lt;strong>Unique IDs&lt;/strong>&lt;/dt>
&lt;dd>&lt;dl>
&lt;dt>Workflow IDs are UUIDs which identify specific instances and expressions, but do not identify the abstract work. UUIDs provide a unique identifier but there is no central registry for such identifiers, hence they cannot be “looked up”. Crossref publishers should be concerned to associate closely the DOI for the underlying work with a given media file. That’s the identifier that this community is actively promoting.&lt;/p>&lt;/dt>
&lt;dd>
&lt;dl>
&lt;dt>&lt;strong>Read/Write&lt;/strong>&lt;/dt>
&lt;dd>Because of the focus on workflow, the XMP specification recommends that XMP packets be “writeable”, that is that they be marked as “writeable” and that they include padding whitespace which can accommodate updates without changing packet size. Publishers distributing final form documents are more likely to want to distribute “read-only” metadata which is authoritative and which describes the work, rather than the document format and workflow. Of course, this should not preclude additional sources of metadata which may be added “by reference” rather than “by value”. That is, a pointer to a web page (or service) may be sufficient to relate additional publisher terms and user annotations instead of embedding them directly in the file for various reasons: a) file integrity, b) limiting growth of file size, c) term authority, d) dynamic production (in forward time), and e) multiple sources.&lt;/dl>&lt;/dd>
&lt;/dl>
&lt;/dd>
&lt;/dl>
&lt;/dd>
&lt;/dl>
&lt;div class="footnotes" role="doc-endnotes">
&lt;hr>
&lt;ol>
&lt;li id="fn:1">
&lt;p>Update Aug 2022: the blog post mentioned below was previously at blogs.adobe.com/gunar/2007/10/permanent_metadata.html but is no longer live.&amp;#160;&lt;a href="#fnref:1" class="footnote-backref" role="doc-backlink">&amp;#x21a9;&amp;#xfe0e;&lt;/a>&lt;/p>
&lt;/li>
&lt;/ol>
&lt;/div></description></item><item><title>DataNet</title><link>https://www.crossref.org/blog/datanet/</link><pubDate>Fri, 12 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/datanet/</guid><description>&lt;p>Last week, my colleague Ian Mulvany &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2007/10/datanet_a_call_for_proposals.html" target="_blank">posted&lt;/a> on &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/" target="_blank">Nascent&lt;/a> an entry about NSF’s recent call for proposals on DataNet (aka “A Sustainable Digital Data Preservation and Access Network”). &lt;a href="http://blogs.lib.berkeley.edu/shimenawa.php" target="_blank">Peter Brantley&lt;/a>, of &lt;a href="http://www.diglib.org/" target="_blank">DLF&lt;/a>, has set up a public group &lt;a href="https://web.archive.org/web/20071020041521/http://network.nature.com/group/datanet" target="_blank">DataNet&lt;/a> on &lt;a href="https://web.archive.org/web/20071002225513/http://network.nature.com./" target="_blank">Nature Network&lt;/a> where all are welcome to join in the discussion on what NSF effectively are viewing as the challenge of dealing with “big data”. As Ian notes in a mail to me:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“It seems that for a fully integrated flow of data then publisher involvement is going to be required, and it is clear from the proposal that the NSF are also interested in rights management or at negotiating that issue.”&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>OTMI Applied - Means More Search Hits</title><link>https://www.crossref.org/blog/otmi-applied-means-more-search-hits/</link><pubDate>Tue, 09 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/otmi-applied-means-more-search-hits/</guid><description>&lt;p>&lt;a href="https://www.crossref.org/wp/blog/images/otmi-twease.png/otmi-twease-window-alpha.png">&lt;img alt="otmi-twease-window-alpha.png" src="https://www.crossref.org/wp/blog/images/otmi-twease.png/otmi-twease-window-alpha.png" width="214" height="217" />&lt;/a>&lt;/p>
&lt;p>(Click image to enlarge.)&lt;/p>
&lt;p>Following up on previous posts on &lt;a href="http://opentextmining.org/" target="_blank">OTMI&lt;/a> (the proposal from NPG for scholarly publishers to syndicate their full text to drive text-mining applications), &lt;a href="https://web.archive.org/web/20071022163355/http://physiology.med.cornell.edu/faculty/campagne/" target="_blank">Fabien Campagne&lt;/a> from Cornell, a long-time OTMI supporter, has created an &lt;a href="https://web.archive.org/web/20080703194218/http://otmi.twease.org/otmi/app" target="_blank">OTMI-driven search engine&lt;/a> (based on his &lt;a href="https://web.archive.org/web/20080213031456/http://www.twease.org/medline/app" target="_blank">Twease&lt;/a> work). This may be the first publicly accessible OTMI-based service. It currently only contains NPG content from the OTMI archive online - some 2 years worth of &lt;em>Nature&lt;/em> and four other titles. (When will we begin to see other publishers on board?)&lt;/p>
&lt;p>What’s happening here? Well, &lt;a href="https://web.archive.org/web/20080213031456/http://www.twease.org/medline/app" target="_blank">Twease&lt;/a> is a web-based front-end to searching Medline abstracts. As such, a search will retrieve a set of results labeled by PMID and list all lines in the abstract where a match occurs. By contrast, with &lt;a href="https://web.archive.org/web/20080213031456/http://www.twease.org/medline/app" target="_blank">Twease-OTMI&lt;/a> a search is run over the article full text and a will retrieve &lt;strong>&lt;em>all&lt;/em>&lt;/strong> text “snippets” (for Nature we use sentences, although other units of text are possible) which match. See the figure above where the top three results are all labeled by the same DOI and show text matches from various points &lt;strong>&lt;em>within&lt;/em>&lt;/strong> the document.&lt;/p>
&lt;p>This shows that a far superior search match rate is possible using the article full text (as distributed in OTMI format) where text integrity as publishable asset is not compromised.&lt;/p></description></item><item><title>Mars Bar</title><link>https://www.crossref.org/blog/mars-bar/</link><pubDate>Mon, 08 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/mars-bar/</guid><description>&lt;p>Just noticed that there is now (as of last month) a blog for &lt;a href="https://blog.adobe.com/" target="_blank">Mars&lt;/a> (“Mars: Comments on PDF, Acrobat, XML, and the Mars file format”). See this from the initial post:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“The &lt;a href="https://web.archive.org/web/20071027131726/http://labs.adobe.com/technologies/mars/" target="_blank">Mars Project&lt;/a> at &lt;a href="http://www.adobe.com/" target="_blank">Adobe&lt;/a> is aimed at creating an XML representation for PDF documents. We use a component-based model for representing different aspects of the document and we use the Universal Container Format (a Zip-based packaging format) to hold the pieces. Mars uses XML to represent the individual components where that makes sense, but otherwise uses industry standard formats to represent other components. Examples of these include Fonts (we use OpenType), Images (PNG, GIF, JPEG, JPEG2000), Color (ICC Color Profiles), etc.. We use SVG to represent page content, which fits as both an XML format and an industry standard.”&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>Scholarly DC</title><link>https://www.crossref.org/blog/scholarly-dc/</link><pubDate>Fri, 05 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/scholarly-dc/</guid><description>&lt;p>This &lt;a href="http://www.jiscmail.ac.uk/cgi-bin/webadmin?A2=ind0710&amp;amp;#038;L=dc-general&amp;amp;#038;T=0&amp;amp;#038;F=&amp;amp;#038;S=&amp;amp;#038;P=459" target="_blank">This&lt;/a> was just sent out to the &lt;a href="http://www.jiscmail.ac.uk/archives/dc-general.html" target="_blank">DC-GENERAL&lt;/a> mailing list about the new &lt;a href="http://dublincore.org/groups/scholar/" target="_blank">DCMI Community for Scholarly Communications&lt;/a>. As Julie Allinson says:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“The aim of the group is to provide a central place for individuals and organisations to exchange information, knowledge and general discussion on issues relating to using Dublin Core for describing items of ‘scholarly communications’, be they research papers, conference presentations, images, data objects. With digital repositories of scholarly materials increasingly being established across the world, this group would like to offer a home for exploring the metadata issues faced.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>There’s also a &lt;a href="http://www.jiscmail.ac.uk/lists/DC-SCHOLAR.html" target="_blank">DC-SCHOLAR&lt;/a> mailing list (subscribe &lt;a href="http://www.jiscmail.ac.uk/cgi-bin/wa.exe?SUBED1=dc-scholar&amp;amp;#038;A=1" target="_blank">here&lt;/a>). Not too much there yet, but it may be useful to track - or even to participate. 🙂&lt;/p></description></item><item><title>The Names Project</title><link>https://www.crossref.org/blog/the-names-project/</link><pubDate>Fri, 05 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/the-names-project/</guid><description>&lt;p>Was reminded to blog about this after reading &lt;a href="http://orweblog.oclc.org/archives/001445.html" target="_blank">Lorcan’s post&lt;/a> on the &lt;a href="https://web.archive.org/web/20071013215645/http://names.mimas.ac.uk/" target="_blank">Names Project&lt;/a> being run by JISC. From the blurb:&lt;/p>
&lt;blockquote>
&lt;p>_“The project is going to scope the requirements of UK institutional and subject repositories for a service that will reliably and uniquely identify names of individuals and institutions.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>It will then go on to develop a prototype service which will test the various processes involved. This will include determining the data format, setting up an appropriate database, mapping data from different sources, populating the database with records and testing the use of the data.”_&lt;/p>
&lt;/blockquote>
&lt;p>One immediate project tangible is the &lt;a href="https://web.archive.org/web/20120119133351/http://names.mimas.ac.uk/documents/Names_landscape_report_1Oct2007.pdf" target="_blank">landscape report&lt;/a> (‘A review of the current landscape in relation to a proposed Name Authority Service for UK repositories of research outputs’) which summarizes some current initiatives in author identification from a UK perspective, including &lt;em>inter alia&lt;/em> Elsevier’s &lt;a href="http://help.elsevier.com/app/answers/detail/a_id/2845/p/8150/c/8430" target="_blank">Scopus Author Identifier&lt;/a>.&lt;/p></description></item><item><title>InChIKey</title><link>https://www.crossref.org/blog/inchikey/</link><pubDate>Tue, 02 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/inchikey/</guid><description>&lt;p>The &lt;a href="http://www.iupac.org/inchi/" target="_blank">InChI&lt;/a> (International Chemical Identifier from IUPAC) has been blogged earlier &lt;a href="https://www.crossref.org/categories/inchi">here&lt;/a>. RSC have especially taken this on board in their Project Prospect and now routinely syndicate InChI identifiers in their RSS feeds as blogged &lt;a href="https://www.crossref.org/blog/rscs-project-prospect-v1.1/">here&lt;/a>.&lt;/p>
&lt;p>As reported variously last month (see &lt;a href="https://web.archive.org/web/20071014205908/http://fiehnlab.ucdavis.edu/staff/kind/InChIKey" target="_blank">here&lt;/a> for one such review) IUPAC have now &lt;a href="https://web.archive.org/web/20071030202540/http://www.iupac.org/inchi/release102.html" target="_blank">released&lt;/a> a new (1.02beta) version of their software which allows hashed versions (fixed length 25-character) of the InChI, so-called InChIKey’s, to be generated which are much more search engine friendly. Compare a regular InChI identifier:&lt;/p>
&lt;p>&lt;tt>&lt;br /> InChI=1/C49H70N14O11/c1-26(2)39(61-42(67)33(12-8-18-55&lt;br /> -49(52)53)57-41(66)32(50)23-38(51)65)45(70)58-34(20-29-1&lt;br /> 4-16-31(64)17-15-29)43(68)62-40(27(3)4)46(71)59-35(22-30&lt;br /> -24-54-25-56-30)47(72)63-19-9-13-37(63)44(69)60-36(48(7&lt;br /> 3)74)21-28-10-6-5-7-11-28/h5-7,10-11,14-17,24-27,32-3&lt;br /> 7,39-40,64H,8-9,12-13,18-23,50H2,1-4H3,(H2,51,65)(H,54,56&lt;br /> )(H,57,66)(H,58,70)(H,59,71)(H,60,69)(H,61,67)(H,62,68)(H,73,74)&lt;br /> (H4,52,53,55)/f/h56-62,73H,51-53H2&lt;br /> &lt;/tt>&lt;/p>
&lt;p>with its InChIKey counterpart:&lt;/p>
&lt;p>&lt;tt>&lt;br /> InChIKey=JYPVVOOBQVVUQV-UHFFFAOYAR&lt;br /> &lt;/tt>&lt;/p>
&lt;p>That’s some saving.&lt;/p></description></item><item><title>Oh No, Not You Again!</title><link>https://www.crossref.org/blog/oh-no-not-you-again/</link><pubDate>Tue, 02 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/oh-no-not-you-again/</guid><description>&lt;p>Oh dear. Yesterday’s post “Using ISO URNs” was way off the mark. I don’t know. I thought that walk after lunch had cleared my mind. But apparently not. I guess I was fixing on eyeballing the result in RDF/N3 rather than the logic to arrive at that result.&lt;/p>
&lt;p>(Continues.)&lt;/p>
&lt;p>There are three namespace cases (and I was only wrong in two out of the three, I think):&lt;/p>
&lt;ol>
&lt;li>“pdf:”&lt;/li>
&lt;/ol>
&lt;p>I was originally going to suggest the use of “data:” for the PDF information dictionary terms here but then lunged at using an HTTP URI (the URI of &lt;a href="http://www.adobe.com/devnet/pdf/pdf_reference.html" target="_blank">the page&lt;/a> for the PDF Reference manual on the Adobe site) for regular orthodox conformancy and good churchgoing:&lt;/p>
&lt;pre>@prefix pdf: &amp;lt;http://www.adobe.com/devnet/pdf/pdf_reference.html&amp;gt; .
&lt;/pre>
&lt;p>This was wrong on two counts:&lt;/p>
&lt;p>a) Afaik no such use for this URI as a namespace has ever been made by Adobe. And it is in the gift of the DNS tenant (elsewhere called “owner”) to mint URIs under that namespace and to ascribe meanings to those URIs.&lt;/p>
&lt;p>b) Also the URI is not best suited to a role as namespace URI since RDF namespaces typically end in “/” or “#” to make the division between namespace and term clearer. (In XML it doesn’t make a blind bit of difference as XML namespaces are just a scoping mechanism.) So to have a property URI as&lt;/p>
&lt;pre>http://www.adobe.com/devnet/pdf/pdf_reference.htmlAuthor
&lt;/pre>
&lt;p>does the job but looks pretty rough and more importantly precludes (at least, complicates) the possibility of dereferencing the URI to return a page with human or machine readable semantics. Better in RDF terms is one of the following:&lt;/p>
&lt;pre>a) http://www.adobe.com/devnet/pdf/pdf_reference/Author
b) http://www.adobe.com/devnet/pdf/pdf_reference#Author
c) http://www.adobe.com/devnet/pdf/pdf_reference.html#Author
&lt;/pre>
&lt;p>In the absence of any published namespace from Adobe for these terms, I think it would have been more prudent to fall back on “data:” URIs. So&lt;/p>
&lt;pre>@prefix pdf: &amp;lt;data:,&amp;gt; .
&lt;/pre>
&lt;p>leading to&lt;/p>
&lt;pre>data:,Author
data:,CreationDate
data:,Creator
etc.
&lt;/pre>
&lt;p>This is correct (afaict) and merely provides a URI representation for bare strings.&lt;/p>
&lt;p>Had we wanted to relate those terms to the PDF Reference we might have tried something like:&lt;/p>
&lt;pre>data:,PDF%20Reference:Author
data:,PDF%20Reference:CreationDate
data:,PDF%20Reference:Creator
etc.
&lt;/pre>
&lt;p>And if we had wanted to make those truly secondary RDF resources related to a primary RDF resource for the “namespace” we could have attempted something like:&lt;/p>
&lt;pre>data:,PDF%20Reference#Author
data:,PDF%20Reference#CreationDate
data:,PDF%20Reference#Creator
etc.
&lt;/pre>
&lt;p>Note though that the “data:” specification is not clear about the implications of using “#”. (Is it allowed, or isn;t it?) We must suspect that it is not allowed, but see &lt;a href="http://lists.w3.org/Archives/Public/www-style/2005May/0036.html" target="_blank">this mail&lt;/a> from Chris Lilley (W3C) which is most insightful.&lt;/p>
&lt;ol start="2">
&lt;li>“pdfx:”&lt;/li>
&lt;/ol>
&lt;p>The example was just for demo purposes, but (as per 1a above) it is incumbent on the namespace authority (here ISO) to publish a URI for the term to be used. Anyhow, the namespace URI I cited&lt;/p>
&lt;pre>@prefix pdfx: &amp;lt;urn:iso:std:iso-iec:15930:-1:2001&amp;gt; .
&lt;/pre>
&lt;p>would not have been correct and would have led to these mangled URIs:&lt;/p>
&lt;pre>urn:iso:std:iso-iec:15930:-1:2001GTS_PDFXVersion
urn:iso:std:iso-iec:15930:-1:2001GTS_PDFXConformance
&lt;/pre>
&lt;p>It should have been something closer to&lt;/p>
&lt;pre>@prefix pdfx: &amp;lt;urn:iso:std:iso-iec:15930:-1:2001:&amp;gt; .
&lt;/pre>
&lt;p>leading to&lt;/p>
&lt;pre>urn:iso:std:iso-iec:15930:-1:2001:GTS_PDFXVersion
urn:iso:std:iso-iec:15930:-1:2001:GTS_PDFXConformance
&lt;/pre>
&lt;ol start="3">
&lt;li>“_usr:”&lt;/li>
&lt;/ol>
&lt;p>This was the one correct call in yesterday’s post.&lt;/p>
&lt;pre>@prefix _usr: &amp;lt;data:,&amp;gt; .
&lt;/pre>
&lt;p>The only problem here would be to differentiate these terms from the terms listed in the PDF Reference manual, although the PDF information dictionary makes no such distinction itself.&lt;/p>
&lt;p>To sum up, perhaps the best way of rendering the PDF information dictionary keys in RDF would be to use “data:” URIs for all (i.e. a methodology for URI-ifying strings) and to bear in mind that at some point ISO might publish URNs for the PDF/X mandated keys: ‘&lt;tt>GTS_PDFXVersion&lt;/tt>‘ and ‘&lt;tt>GTS_PDFXConformance&lt;/tt>‘. So,&lt;/p>
&lt;pre># document infodict (object 58: 476983):
@prefix: pdfx: &amp;lt;data:,&amp;gt; .
@prefix: pdf: &amp;lt;data:,&amp;gt; .
@prefix: _usr: &amp;lt;data:,&amp;gt; .
&amp;lt;> _usr:Apag_PDFX_Checkup "1.3";
pdf:Author "Scott B. Tully";
pdf:CreationDate "D:20020320135641Z";
pdf:Creator "Unknown";
pdfx:GTS_PDFXConformance "PDF/X-1a:2001";
pdfx:GTS_PDFXVersion "PDF/X-1:2001";
pdf:Keywords "PDF/X-1";
pdf:ModDate "D:20041014121049+10'00'";
pdf:Producer "Acrobat Distiller 4.05 for Macintosh";
pdf:Subject "A document from our PDF archive. ";
pdf:Title "Tully Talk November 2001";
pdf:Trapped "False" .
&lt;/pre></description></item><item><title>Using ISO URNs</title><link>https://www.crossref.org/blog/using-iso-urns/</link><pubDate>Mon, 01 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/using-iso-urns/</guid><description>&lt;p>(&lt;strong>Update - 2007.10.02:&lt;/strong> Just realized that there were some serious flaws in the post below regarding publication and form of namespace URIs which I’ve now addressed in a subsequent post &lt;a href="https://www.crossref.org/blog/oh-no-not-you-again/">here&lt;/a>.)&lt;/p>
&lt;p>By way of experimenting with a use case for ISO URNs, below is a listing of the document metadata for an arbitrary PDF. (You can judge for yourselves whether the metadata disclosed here is sufficient to describe the document.) Here, the metadata is taken from the information dictionary and from the document metadata stream (XMP packet).&lt;/p>
&lt;p>The metadata is expressed in RDF/N3. That may not be a surprise for the XMP packet which is serialized in RDF/XML, as it’s just a hop, skip and a jump to render it as RDF/N3 with properties taken from schema whose namespaces are identified by URI. What may be more unusual is to see the document information dictionary metadata (the “normal” metadata in a PDF) rendered as RDF/N3 since the information dictionary is not nodelled on RDF, not expressed in XML, and not namespaced. Here, in addition to the trusty HTTP URI scheme, I’ve made use of two particular URI schemes: “&lt;a href="https://tools.ietf.org/html/draft-goodwin-iso-urn-02.html" target="_blank">iso:&lt;/a>” URN namespaces, and “&lt;a href="http://www.ietf.org/rfc/rfc2397.txt" target="_blank">data:&lt;/a>” URIs.&lt;/p>
&lt;p>(Continues.)&lt;/p>
&lt;p>As far as I am aware, there is no formal identifier for entries in the document information dictionary as specified by the &lt;a href="http://www.adobe.com/devnet/pdf/pdf_reference.html" target="_blank">PDF Reference&lt;/a> from Adobe Systems, so it may be appropriate to use the HTTP URI for the Adobe homepage for the PDF Reference manual, from which specific editions are available.&lt;/p>
&lt;p>For the PDF/X keys which are specified in the ISO standard ISO 15930-1 2001, I have used an ISO URN. (I don’t expect this to be correct in all details but it should give some idea of how it might be used. It may be that the URI should express the term itself, rather than the document from which the term was defined.) And finally, for the one additional user-supplied key here I have made use of a “data:” URI with no body (i.e. I’m speechless). One could have provided some text within the body of the “data:” URI if one wanted to differentiate between alternate user keys or to otherwise annotate these keys.&lt;/p>
&lt;p>Note that the prefixes used in the information dictionary and in the metadata stream are unrelated, as are the mappings of property elements to schemas.&lt;/p>
&lt;p>Well, that’s all really just for fun but it may show two things: 1) how a general description might be described with RDF and how general properties can be mapped to URIs (with possibly limited machine utility), and 2) how an ISO URN might be used.&lt;/p>
&lt;pre># document infodict (object 58: 476983):
@prefix: pdfx: &amp;lt;urn:iso:std:iso-iec:15930:-1:2001&amp;gt; .
@prefix: pdf: &amp;lt;http://www.adobe.com/devnet/pdf/pdf_reference.html&amp;gt; .
@prefix: _usr: &amp;lt;data:,&amp;gt; .
&amp;lt;> _usr:Apag_PDFX_Checkup "1.3";
pdf:Author "Scott B. Tully";
pdf:CreationDate "D:20020320135641Z";
pdf:Creator "Unknown";
pdfx:GTS_PDFXConformance "PDF/X-1a:2001";
pdfx:GTS_PDFXVersion "PDF/X-1:2001";
pdf:Keywords "PDF/X-1";
pdf:ModDate "D:20041014121049+10'00'";
pdf:Producer "Acrobat Distiller 4.05 for Macintosh";
pdf:Subject "A document from our PDF archive. ";
pdf:Title "Tully Talk November 2001";
pdf:Trapped "False" .
# document metadata stream (object 41: 472418):
@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
@prefix pdf: &amp;lt;http://ns.adobe.com/pdf/1.3/&amp;gt; .
@prefix pdfx: &amp;lt;http://ns.adobe.com/pdfx/1.3/&amp;gt; .
@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpMM: &amp;lt;http://ns.adobe.com/xap/1.0/mm/&amp;gt; .
&amp;lt;> pdf:Keywords "PDF/X-1";
pdf:Producer "Acrobat Distiller 4.05 for Macintosh";
pdfx:Apag_PDFX_Checkup "1.3";
pdfx:GTS_PDFXConformance "PDF/X-1a:2001";
pdfx:GTS_PDFXVersion "PDF/X-1:2001";
xmp:CreateDate "2002-03-20T13:56:41Z";
xmp:CreatorTool "Unknown";
xmp:MetadataDate "2004-10-14T12:10:49+10:00";
xmp:ModifyDate "2004-10-14T12:10:49+10:00";
xmpMM:DocumentID "uuid:bd7ae9a1-1110-43c0-8e84-632f2dbb55ab";
dc:creator [
a rdf:Seq;
rdf:_1 "Scott B. Tully" ];
dc:description [
a rdf:Alt;
rdf:_1 "A document from our PDF archive. "@x-default ];
dc:format "application/pdf";
dc:title [
a rdf:Alt;
rdf:_1 "Tully Talk November 2001"@x-default ] .
&lt;/pre></description></item><item><title>Whole Lotta ID</title><link>https://www.crossref.org/blog/whole-lotta-id/</link><pubDate>Mon, 01 Oct 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/whole-lotta-id/</guid><description>&lt;p>ISO has registered with the IANA a URN namespace identifier (“iso:”) for ISO persistent resources. From the Internet-Draft:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“This URN NID is intended for use for the identification of persistent resources published by the ISO standards body (including documents, document metadata, extracted resources such as standard schemata and standard value sets, and other resources).”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>The toplevel grammar rules (ABNF) give some indication of scope:&lt;/p>
&lt;blockquote>
&lt;pre>NSS = std-nss
&lt;/blockquote>
&lt;p>std-nss = &amp;ldquo;std:&amp;rdquo; docidentifier *supplement *docelement [addition]&lt;/pre>&lt;/p>
&lt;p>Just wanted to quote here one of the funkier examples cited in the document:&lt;/p>
&lt;blockquote>
&lt;p>&lt;tt>urn:iso:std:iso:9999:-1:ed-1:v1-amd1.v1:en,fr:amd:2:v2:en:clause:3.1,a.2-b.9&lt;/tt>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;em>“refers to (sub)clauses 3.1 and A.2 to B.9 in the corrected version of Amendment 2, in English, which amends the document comprising the 1st version of edition 1 of ISO 9999-1 incorporating the 1st version of Amendment 1, in English/French (bilingual document)”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Wow! That’s some ID. That’s something else.&lt;/p>
&lt;p>As far as DOI is concerned there is nothing obvious to be learned. It is interesting to see such a level of granularity supported though. And since all these documents issue from a central publisher they can be prescriptive about the identifier syntax. Something which cannot be mandated for the many Crossref publishers with their own commercial arrangements. Hence DOI is generally agnostic about suffix strings.&lt;/p>
&lt;p>Seems to be a little confusion about the registration though. The NID was approved Jan. 15, ’07 by the IESG and the &lt;a href="http://www.iana.org/assignments/urn-namespaces" target="_blank">IANA Registry of URN Namespaces&lt;/a> (last updated Aug. 22, ’07) lists the namespace “iso” with the provisional (unnumbered) RFC labelled “RFC-goodwin-iso-urn-01.txt” (being the -01 draft). However, the IETF I-D Tracker reports &lt;a href="https://datatracker.ietf.org/idtracker/draft-goodwin-iso-urn/" target="_blank">this status&lt;/a> for draft-goodwin-iso-urn, which shows that a new I-D (an -02 draft) was submitted in Sept. 7, ’07:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“A Uniform Resource Name (URN) Namespace for the International organisation for Standardization (ISO), &lt;a href="https://tools.ietf.org/html/draft-goodwin-iso-urn-02.html" target="_blank">draft-goodwin-iso-urn-02.txt&lt;/a>“&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>Authors in Context?</title><link>https://www.crossref.org/blog/authors-in-context/</link><pubDate>Sun, 30 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/authors-in-context/</guid><description>&lt;p>On the subject of author IDs (a subject Crossref is interested in and on which held a meeting earlier this year, as blogged about &lt;a href="https://www.crossref.org/blog/crossref-author-id-meeting/">here&lt;/a>), this post by Karen Coyle “&lt;a href="http://kcoyle.blogspot.com/2007/09/name-authority-control-aka-name.html" target="_blank">Name authority control, aka name identification&lt;/a>” may be worth a read. She starts off with this:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Libraries do something they call “name authority control”. For most people in IT, this would be called “assigning unique identifiers to names.” Identifying authors is considered one of the essential aspects of library cataloging, and it isn’t done in any other bibliographic environment, as far as I know.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>and concludes thus:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Perhaps the days of looking at lists of authors’ names is over. Maybe users need to see a cloud of authors connected to topic areas in which they have published, or related to books titles or institutional affiliations. In this time of author abundance, names are not meaningful without some context.”&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>XMP-Ville</title><link>https://www.crossref.org/blog/xmp-ville/</link><pubDate>Tue, 25 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-ville/</guid><description>&lt;p>Been so busy looking into the technical details of XMP that I almost forgot to check out the current landcsape. Luckily I chanced on &lt;a href="https://web.archive.org/web/20071002100600/http://www.roguebutterfly.com/ArticlesbyRonRoszkiewicz.htm" target="_blank">these articles&lt;/a> by Ron Roszkiewicz for &lt;a href="https://web.archive.org/web/20071008012411/http://www.seyboldreports.com/" target="_blank">The Seybold Report&lt;/a> (and apologies for lifting the title of this post from his last). The articles about XMP are well worth reading and chart the painful progress made to date:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://web.archive.org/web/20080706141520/http://roguebutterfly.com/documents/TSR-0817_p18-19.pdf" target="_blank">The Brief Tortured Life of XMP&lt;/a> (July ’05)
&lt;ul>
&lt;li>[Thought Leaders Hammer out Metadata Standard] (April ’07)
&lt;ul>
&lt;li>
&lt;p>[Metadata Persistence and “Save for Web…”] (July ’07) &lt;/ul>
From the earlier characterization of XMP as “underachieving teenager” Roszkiewicz is cautiously optimistic that IDEAlliance’s &lt;a href="https://web.archive.org/web/20080509100140/http://www.xmp-open.org/" target="_blank">XMP Open&lt;/a> initiative (an initiative to advance XMP as an open industry specification) will help outreach and foster adoption of this fledgling technology.&lt;/p>
&lt;p>(Continues.)&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;p>There has been some activity here. Following on from an industry open day event last year:&lt;/p>
&lt;pre>&lt;code> * [IDEAlliance XMP Open Day][5], New York, March ’06&amp;lt;/ul&amp;gt;
there have been two metadata summits earlier this year co-sponsored by Adobe Systems and IDEAlliance:
* [Metadata Directions in Advertising and Branding][6], San Francisco, January ’07
* [Content Metadata Summit 1.1][7] New York, March ’07&amp;lt;/ul&amp;gt;
Promising bestirrings. (And also with the recent public airing of the PRISM 2.0 draft with its support for XMP which was reviewed at the PRISM WG F2F last week for publication as a standard.) But generally the state of XMP-Ville at this time is rather sleepy. There’s not much by way of news on the [XMP Open][8] website. At least promise, if no promises.
Back to the articles. The really interesting thing of note (to me at any rate) in Roszkiewicz’s review of the last summit is the almost total absence of any mention of the Web. It is as if XMP users (both consumers and providers) would be content to play within the walled garden of the CS3 product portfolio. I don’t get that. The Web changes everything.
Although XMP maps its native data model to RDF (and RDF is an inherently open technology allowing arbitrary schemas to be mixed at will), XMP betrays its application roots by seeming to want to impose some kind of veto on the schemas to be used. Or rather, how they are to be used. It also seems to be all fussed up by centralized notions such as a cross-mapping schema registry. (As if that were part of its remit.)
As Roszkiewicz notes:
&amp;gt; _&amp;amp;#8220;The consortia [IDEAlliance and the stakeholders] will have ownership responsibility for name space registry, cross-map definition and support, standards group outreach and coordination, compliance certification and logo and the “XMP Open” brand.&amp;amp;#8221;_
And elsewhere:
&amp;gt; _&amp;amp;#8220;So while the standard for XMP might be defined, the data that will be fed into files is not, for want of an IDEAlliance-like standards management body to filter and rationalize the many [schema] into a few.&amp;amp;#8221;_
And then more worryingly, this:
&amp;gt; _&amp;amp;#8220;That schema should be managed by a government agency such as the Library of Congress which could manage the dictionaries and schema, certify them, register the namespace and provide a centralized location to distribute them.&amp;amp;#8221;_
Well, I don’t see what this matters to the core technology of XMP which is just a specification for the sneaking in of an XML document into arbitrary media files. And the use of RDF/XML would seem to be a further indication that XMP is to be independent of the schema used. The use of both RDF plus XML technologies should allow XMP to present itself as a framework or &amp;amp;#8220;platform&amp;amp;#8221; for metadata exchange and to get out of the way of what is actually carried by the XMP packets. App neutrality, if you will.
Again the notion of Web as just an alternate channel is apparent in the third of the articles where Roszkiewicz talks about the Device Central tool which allows a user of a CS3 product to &amp;amp;#8220;Save for Web or Devices&amp;amp;#8230;&amp;amp;#8221;. This article talks about the clumsy handling of metadata in such device saves, whereby the packet may be abbreviated - and metadata terms dropped - when printing to small footprint devices. Not a feature to be retained for too long, I would hope.
So, where are we currently with XMP? According to Roszkiewicz:
&amp;gt; _&amp;amp;#8220;As the developer of a suite of applications that relies on XMP as the vehicle for managing metadata, Adobe has too much invested in its development to allow any substantive changes by outsiders. So “open” primarily will mean open to suggestions, with an official channel in place to process them.&amp;amp;#8221;_
And as to that channel?
&amp;gt; _&amp;amp;#8220;As the principal conduit to Adobe for changes to XMP, IDEAlliance will act as a gateway and support organisation to the user community - a role for which it is well-suited. &amp;amp;#8230; As a sponsor-supported, not-for-profit organisation, IDEAlliance can serve as a credible buffer for Adobe to the user community and synchronize and standardize third-party development efforts.&amp;amp;#8221;_
And goes on:
&amp;gt; _&amp;amp;#8220;The principal unanswered questions at this point are: Will the stakeholders represent all of the key industries; will Adobe provide timely support for considering user input and updating the XMP Toolkit; and can Adobe, IDEAlliance and IDEAlliance workgroups manage all of the responsibilities that will fall upon them when the deal is struck. The hand-over doesn’t seem to have taken place yet, and we are still examining the scope and feasibility of the proposal.&amp;amp;#8221;_
It seems to me that Adobe is the party girl, IDEAlliance is the special guest, and Crossref publishers are the neighbourly gatecrashers who want to play with the toys. And not perhaps too nicely neither. I just hope that the toys aren’t taken away from us. They’re too much fun.
Ironic really that we’re on the outside of this since scholarly publishers have a very clearcut grasp of what to do with metadata and a ready application in terms of citation linking. XMP is worth it.
&lt;/code>&lt;/pre></description></item><item><title>The Name’s The Thing</title><link>https://www.crossref.org/blog/the-names-the-thing/</link><pubDate>Thu, 20 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/the-names-the-thing/</guid><description>&lt;p>I’m always curious about names and where they come from and what they mean. Hence, my interest was aroused with the constant references to “XAP” in XMP. As the &lt;a href="https://web.archive.org/web/20210811233806/https://www.adobe.com/devnet/xmp.html" target="_blank">XMP Specification&lt;/a> (Sept. 2005) says:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“NOTE: The string “XAP” or “xap” appears in some namespaces, keywords, and related names in this document and in stored XMP data. It reflects an early internal code name for XMP; the names have been preserved for compatibility purposes.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Actually, it occurs in most of the core namespaces: XAP, rather than XMP.&lt;/p>
&lt;p>(Continues.)&lt;/p>
&lt;p>An earlier XMP Specification from 2001 (v. 1.5 - and see &lt;a href="https://www.crossref.org/blog/w5m0mpcehihzreszntczkc9d">here&lt;/a> for an earlier post of mine about XMP’s missing version numbers, and &lt;a href="https://www.crossref.org/blog/xmp-some-other-gripes/">here&lt;/a> about Adobe’s lack of archiving for XMP specifications) says almost the same thing:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“NOTE: Many namespaces, keywords, and related names in this document are prefaced with the string “XAP”, which was an early internal code name for XMP metadata. Because the Acrobat 5.0 product shipped using those names and keywords, they were retained for compatibility purposes.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>So, there’s no indication in either of these specifications as to what the original name signified.&lt;/p>
&lt;p>But then I turned up &lt;a href="http://support.adobe.com/devsup/devsup.nsf/docs/51840.htm" target="_blank">this issue&lt;/a> in the Adobe Developer Knowledgebase:&lt;/p>
&lt;blockquote>
&lt;p>_“Known Issue: The metadate framework name was changed from XAP to XMP&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Summary&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>XAP (Extensible Authoring Publishing) was an early internal code name for XMP (Extensible Metadata Platform).&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Issue&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Why are many namespaces, keywords, data structures, and related names in the documents and XMP toolkit code prefaced with the string “XAP” rather than “XMP”?&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Solution&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>XAP (Extensible Authoring and Publishing) was an early internal code name for XMP (Extensible Metadata Platform) metadata. Because Acrobat 5.0 used those names, they were retained for compatibility purposes. XMP is the formal name used the framework specification.”_&lt;/p>
&lt;/blockquote>
&lt;p>Aha! Now it’s all clear. And now I’m also wondering if this original name still reflects Adobe’s thinking on the purpose of XMP that it be primarily an authoring utility rather than a workflow utility. That is, is Adobe’s XMP more geared to individual authors of Adobe’s Creative Suite products entering in metadata by hand as part of the authoring act, rather than as a batch entry process within an automated publishing workflow? The emphasis that Adobe put on &lt;a href="http://www.adobe.com/products/xmp/custompanel.html" target="_blank">Custom File Info panels&lt;/a> for their CS products would seem to foster the view that Adobe see XMP as an interactive authoring device for adding metadata. But what about the publishers and their workflows? The SDK is a rather poor effort at garnering any widespread support of XMP within the publishing industry.&lt;/p></description></item><item><title>ACAP - Any chance of success?</title><link>https://www.crossref.org/blog/acap-any-chance-of-success/</link><pubDate>Wed, 19 Sep 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/acap-any-chance-of-success/</guid><description>&lt;p>ACAP has released &lt;a href="https://web.archive.org/web/20071007123940/http://www.the-acap.org/project-documents.php" target="_blank">some documents&lt;/a> outlining the use cases they will be testing and some proposed changes to the Robots Exclusion Protocol (REP) - both robots.txt and META tags. There are some very practical proposals here to improve search engine indexing. However, the only search engine publicly participating in the project is &lt;a href="http://www.exalead.com/" target="_blank">http://www.exalead.com/&lt;/a> (which according to Alexa attracted 0.0043% of global internet visits over the last three months). The main docs are “ACAP pilot Summary use cases being tested”, “ACAP Technical Framework - Robots Exclusion Protocol - strawman proposals Part 1”, “ACAP Technical Framework - Robots Exclusion Protocol - strawman proposals Part 2”, “ACAP Technical Framework - Usage Definitions - draft for pilot testing”.&lt;/p>
&lt;p>What would cause other search engines to recognize the ACAP protocols rather than ignore them? A lot of publishers implementing this and requiring search engines to recognize it to index content could put pressure on the engines. Maybe.&lt;/p></description></item><item><title>Style Guides Recommend DOI strings</title><link>https://www.crossref.org/blog/style-guides-recommend-doi-strings/</link><pubDate>Wed, 19 Sep 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/style-guides-recommend-doi-strings/</guid><description>&lt;p>A couple of recent posts - from &lt;a href="http://jeffline.jefferson.edu/SML/reference/reftips/?p=19" target="_blank">A couple of recent posts - from&lt;/a> at Jefferson University and &lt;a href="https://web.archive.org/web/20080412044026/http://forfaculty.wordpress.com/2007/09/17/apas-new-recommendations-for-citing-e-journals/" target="_blank">IFST at Univ of Delaware&lt;/a>- note that the AMA and APA style guides now recommend using a DOI, if one is assigned, in a journal article citation.&lt;/p>
&lt;p>A citation in the APA style with a DOI would be:&lt;/p>
&lt;blockquote>
&lt;p>Conley, D., Pfeiffera, K. M., &amp;amp; Velez, M. (2007). Explaining sibling differences in achievement and behavioral outcomes: The importance of within- and between-family factors. Social Science Research36(3), 1087-1104. doi:10.1016/j.ssresearch.2006.09.002&lt;/p>
&lt;/blockquote>
&lt;p>In the AMA style a reference would be:&lt;/p>
&lt;blockquote>
&lt;p>Kitajima TS, Kawashima SA, Watanabe Y. The conserved kinetochore protein shugoshin protects centromeric cohesion during meiosis. Nature. 2004;427(6974):510-517. doi:10.1038/nature02312&lt;/p>
&lt;/blockquote>
&lt;p>This is great news. I haven’t looked at the full style guides but it’s not clear if information is given about linking DOIs via &lt;a href="http://dx.doi.org/" target="_blank">http://dx.doi.org/&lt;/a>&lt;/p>
&lt;p>Information on the APA Style Guide is available - &lt;a href="http://apastyle.apa.org/" target="_blank">http://apastyle.apa.org/&lt;/a> with &lt;a href="http://apastyle.apa.org/elecmedia.html" target="_blank">specific info on electronic references, URLs and DOIs&lt;/a> and here is the &lt;a href="http://www.amamanualofstyle.com/" target="_blank">AMA info&lt;/a>.&lt;/p>
&lt;p>This raises the existential question of a DOI as a URI. Is&lt;/p>
&lt;blockquote>
&lt;p>Conley, D., Pfeiffera, K. M., &amp;amp; Velez, M. (2007). Explaining sibling differences in achievement and behavioral outcomes: The importance of within- and between-family factors. Social Science Research36(3), 1087-1104. doi:10.1016/j.ssresearch.2006.09.002 &lt;a href="http://dx.doi.org/10.1016/j.ssresearch.2006.09.002" target="_blank">http://dx.doi.org/10.1016/j.ssresearch.2006.09.002&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;p>unnecessary or redundant?&lt;/p></description></item><item><title>Chapter 9 - The Closed Book</title><link>https://www.crossref.org/blog/chapter-9-the-closed-book/</link><pubDate>Sat, 15 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/chapter-9-the-closed-book/</guid><description>&lt;p>Hadn’t really noticed before but was fairly gobsmacked by this notice I just saw on the DOI® Handbook:&lt;/p>
&lt;blockquote>
&lt;p>&lt;strong>**Please note that Chapter 9, Operating Procedures is for Registration Agency personnel only.**&lt;/strong>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>DOI® Handbook&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>doi:10.1000/182&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="http://www.doi.org/hb.html" target="_blank">http://www.doi.org/hb.html&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;p>And, indeed, the Handbook’s TOC only reconfirms this:&lt;/p>
&lt;blockquote>
&lt;p>9 Operating procedures*&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>*The RA password is required for viewing Chapter 9.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.1 Registering a DOI name with associated metadata&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.2 Prefix assignment&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.3 Transferring DOI names from one Registrant to another&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.4 Handle System® policies and procedures&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.4.1 Overview&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.4.2 Policies and Procedures&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.4.3 Requirements for Administrators of Resolution Services&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.4.4 Protocols and Interfaces&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>9.5 DOI® System error messages&lt;/p>
&lt;/blockquote>
&lt;p>That’s spooky. A book with a hidden chapter. I &lt;strong>really&lt;/strong> don’t like that at all. Especially on a book aiming to provide general information and guidance. Seems to be that if that information needs to be kept private to RA’s then it has no business rubbing shoulders with public information. I would suggest that the material be opened up or else moved out. Makes me feel so second class.&lt;/p></description></item><item><title>Custom Panel for CC</title><link>https://www.crossref.org/blog/custom-panel-for-cc/</link><pubDate>Sat, 15 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/custom-panel-for-cc/</guid><description>&lt;p>&lt;a href="http://creativecommons.org/" target="_blank">Creative Commons&lt;/a> now have a custom panel for adding CC licenses using Adobe apps - see &lt;a href="http://creativecommons.org/weblog/entry/7648" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>Interesting on two counts:&lt;/p>
&lt;ul>
&lt;li>Machine readable licenses
&lt;ul>
&lt;li>XMP metadata&lt;/ul>
But I still think that batch solutions for adding XMP metadata are really required for publishing workflows. And ideally there should be support for adding arbitrary XMP packets if we’re going to have truly rich metadata. I rather fear the constraints that custom panels place upon the publisher.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Last Orders Please!</title><link>https://www.crossref.org/blog/last-orders-please/</link><pubDate>Thu, 13 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/last-orders-please/</guid><description>&lt;p>Public comment period on the &lt;a href="https://web.archive.org/web/20070929195327/http://www.prismstandard.org/" target="_blank">PRISM 2.0&lt;/a> draft ends Saturday (Sept. 15) ahead of next week’s WG meeting to review feedback and finalize the spec.&lt;/p>
&lt;p>(I put in some comments about XMP already. Hope they got that.)&lt;/p></description></item><item><title>Marking up DOI</title><link>https://www.crossref.org/blog/marking-up-doi/</link><pubDate>Tue, 11 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/marking-up-doi/</guid><description>&lt;p>(&lt;strong>Update - 2007.09.15:&lt;/strong> Clean forgot to add in the &lt;tt>rdf:&lt;/tt> namespace to the examples for &lt;tt>xmp:Identifier&lt;/tt> in this post. I’ve now added in that namespace to the markup fragments listed. Also added in a comment &lt;a href="https://www.crossref.org/blog/marking-up-doi">here&lt;/a> which shows the example in RDF/XML for those who may prefer that over RDF/N3.)&lt;/p>
&lt;p>So, as a preliminary to reviewing how a fuller metadata description of a Crossref resource may best be fitted into an XMP packet for embedding into a PDF, let’s just consider how a DOI can be embedded into XMP. And since it’s so much clearer to read let’s just conduct this analysis using RDF/N3. (Life is too short to be spent reading RDF/XML or C++ code. :~)&lt;/p>
&lt;p>(And further to Chris Shillum’s comment [(&lt;strong>Update - 2007.09.15:&lt;/strong> Clean forgot to add in the &lt;tt>rdf:&lt;/tt> namespace to the examples for &lt;tt>xmp:Identifier&lt;/tt> in this post. I’ve now added in that namespace to the markup fragments listed. Also added in a comment &lt;a href="https://www.crossref.org/blog/marking-up-doi">here&lt;/a> which shows the example in RDF/XML for those who may prefer that over RDF/N3.)&lt;/p>
&lt;p>So, as a preliminary to reviewing how a fuller metadata description of a Crossref resource may best be fitted into an XMP packet for embedding into a PDF, let’s just consider how a DOI can be embedded into XMP. And since it’s so much clearer to read let’s just conduct this analysis using RDF/N3. (Life is too short to be spent reading RDF/XML or C++ code. :~)&lt;/p>
&lt;p>(And further to Chris Shillum’s comment]&lt;a href="https://www.crossref.org/blog/metadata-in-pdf-2.-use-cases#comment-51907">2&lt;/a> on my earlier post &lt;a href="https://www.crossref.org/blog/metadata-in-pdf-2.-use-cases">Metadata in PDF: 2. Use Cases&lt;/a> where he notes that Elsevier are looking to upgrade their markup of DOI in PDF to use XMP, I’m really hoping that Elsevier may have something to bring to the party and share with us. A consensus rendering of DOI within XMP is going to be of benefit to all.)&lt;/p>
&lt;p>(Continues.)&lt;/p>
&lt;p>Within an XMP packet our first idea might be to include the DOI using the Dublin Core (DC) schema element &lt;tt>dc:identifier&lt;/tt> in minimalist fashion:&lt;/p>
&lt;pre>@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
&amp;lt;&amp;gt; dc:identifier "10.1038/nrg2158" .
&lt;/pre>
&lt;p>This simply says that the current document (denoted by the empty URI “&lt;tt>&amp;lt;&amp;gt;&lt;/tt>“) has a string property &lt;tt>&amp;ldquo;10.1038/nrg2158&amp;rdquo;&lt;/tt> which is of type &lt;tt>identifier&lt;/tt> from the &lt;tt>dc&lt;/tt> (or Dublin Core) schema which is identified by the URI &lt;tt>&lt;a href="http://purl.org/dc/elements/1.1/" target="_blank">http://purl.org/dc/elements/1.1/&lt;/a>&lt;/tt>.&lt;/p>
&lt;p>Now, since this is just a DOI and the wider public cannot be expected to know about DOIs, it would surely be better to present the DOI in URI form (&lt;tt>doi:&lt;/tt>) as&lt;/p>
&lt;pre>@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
&amp;lt;&amp;gt; dc:identifier "doi:10.1038/nrg2158" .
&lt;/pre>
&lt;p>or, using a registered URI form (&lt;tt>info:&lt;/tt>) as&lt;/p>
&lt;pre>@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
&amp;lt;&amp;gt; dc:identifier "info:doi/10.1038/nrg2158" .
&lt;/pre>
&lt;p>Aside: This shows up a limitation of XMP where the DC schema property value for &lt;tt>dc:identifier&lt;/tt> is fixed as type &lt;strong>&lt;tt>Text&lt;/tt>&lt;/strong>. The natural way to express the above in RDF/N3 would be as:&lt;/p>
&lt;pre>@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
&amp;lt;&amp;gt; dc:identifier &amp;lt;info:doi/10.1038/nrg2158&amp;gt; .
&lt;/pre>
&lt;p>which says that the value is a URI (type &lt;strong>&lt;tt>URI&lt;/tt>&lt;/strong> in XMP terms), not a string (type &lt;strong>&lt;tt>Text&lt;/tt>&lt;/strong> in XMP terms). We either have to flout the XMP specification or else live with this restriction. We’ll opt for the latter for now.&lt;/p>
&lt;p>But, the XMP Spec deprecates the use of &lt;tt>dc:identifier&lt;/tt> since the context is not specific. (Note that that’s what was just discussed above. The limitation is built into XMP which builds on RDF but does not fully endorse the RDF world view.) Instead the XMP Spec recommends using &lt;tt>xmp:Identifier&lt;/tt> since the context can be set using a qualified property as:&lt;/p>
&lt;pre>@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpidq: &amp;lt;http://ns.adobe.com/xmp/Identifier/qual/1.0/&amp;gt; .
&amp;lt;&amp;gt; xmp:Identifier [
a rdf:Bag;
rdf:_1 [
xmpidq:Scheme "DOI";
rdf:value "10.1038/nrg2158" ] ] .
&lt;/pre>
&lt;p>This says the string &lt;tt>&amp;ldquo;10.1038/nrg2158&amp;rdquo;&lt;/tt> belongs to the scheme &lt;tt>&amp;ldquo;DOI&amp;rdquo;&lt;/tt>.&lt;/p>
&lt;p>Here we have used the scheme “DOI” and, as noted above, for wider recognition it would be better to employ one of the URI forms, e.g.&lt;/p>
&lt;pre>@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpidq: &amp;lt;http://ns.adobe.com/xmp/Identifier/qual/1.0/&amp;gt; .
&amp;lt;&amp;gt; xmp:Identifier [
a rdf:Bag;
rdf:_1 [
xmpidq:Scheme "URI";
rdf:value "doi:10.1038/nrg2158" ] ] .
&lt;/pre>
&lt;p>This says the string &lt;tt>&amp;ldquo;doi:10.1038/nrg2158&amp;rdquo;&lt;/tt>belongs to the scheme &lt;tt>&amp;ldquo;URI&amp;rdquo;&lt;/tt>.&lt;/p>
&lt;p>But this is the unregistered URI form (&lt;tt>doi:&lt;/tt>), so should we be using instead the registered form (&lt;tt>info:&lt;/tt>)? Well, turns out that this construct for &lt;tt>xmp:Identifier&lt;/tt> is an &lt;tt>rdf:Bag&lt;/tt> so we can include more than one term. How about using this construct then:&lt;/p>
&lt;pre>@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpidq: &amp;lt;http://ns.adobe.com/xmp/Identifier/qual/1.0/&amp;gt; .
&amp;lt;&amp;gt; xmp:Identifier [
a rdf:Bag;
rdf:_1 [
xmpidq:Scheme "URI";
rdf:value "info:doi/10.1038/nrg2158" ];
rdf:_2 [
xmpidq:Scheme "URI";
rdf:value "doi:10.1038/nrg2158" ] ] .
&lt;/pre>
&lt;p>Now we’ve got both forms, which is fair enough since these are equivalent. In RDF terms we can make the statement that:&lt;/p>
&lt;blockquote>
&lt;pre>doi:10.1038/nrg2158 owl:sameAs info:doi10.1038/nrg2158 .&lt;/pre>
&lt;/blockquote>
&lt;p>which asserts that the two URIs are equivalent and that they reference the same resource.&lt;/p>
&lt;p>So, what if we want to include a native DOI without the URI garb? We can easily do that:&lt;/p>
&lt;pre>@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpidq: &amp;lt;http://ns.adobe.com/xmp/Identifier/qual/1.0/&amp;gt; .
&amp;lt;&amp;gt; xmp:Identifier [
a rdf:Bag;
rdf:_1 [
xmpidq:Scheme "URI";
rdf:value "info:doi/10.1038/nrg2158" ];
rdf:_2 [
xmpidq:Scheme "URI";
rdf:value "doi:10.1038/nrg2158" ];
rdf:_3 [
xmpidq:Scheme "DOI";
rdf:value "10.1038/nrg2158" ] ] .
&lt;/pre>
&lt;p>OK, that takes care of the XMP direction to use &lt;tt>xmp:Identifier&lt;/tt>, but, while deprecated by XMP, we note that back in the real world folks will be looking at the DC elements which is the schema with the greatest purchase. So, why not also add in a &lt;tt>dc:identifier&lt;/tt> element such as would be used typically for DOI in citations. How about this:&lt;/p>
&lt;pre>@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpidq: &amp;lt;http://ns.adobe.com/xmp/Identifier/qual/1.0/&amp;gt; .
&amp;lt;&amp;gt; xmp:Identifier [
a rdf:Bag;
rdf:_1 [
xmpidq:Scheme "URI";
rdf:value "info:doi/10.1038/nrg2158" ];
rdf:_2 [
xmpidq:Scheme "URI";
rdf:value "doi:10.1038/nrg2158" ];
rdf:_3 [
xmpidq:Scheme "DOI";
rdf:value "10.1038/nrg2158" ] ];
dc:identifier "doi:10.1038/nrg2158" .
&lt;/pre>
&lt;p>Right, so we’ve taken care of the identfiers. But maybe there’s something missing? There’s no link to the DOI proxy. For widest applicability we should not assume prior knowledge of the DOI system. Perhaps we could include this link using the property &lt;tt>dc:relation&lt;/tt>? Seems feasible though would really like to get some feedback on this. Any ideas?&lt;/p>
&lt;p>So here, then, is a fairly full and complete expression of DOI within the XMP packet.&lt;/p>
&lt;pre>@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpidq: &amp;lt;http://ns.adobe.com/xmp/Identifier/qual/1.0/&amp;gt; .
&amp;lt;&amp;gt; xmp:Identifier [
a rdf:Bag;
rdf:_1 [
xmpidq:Scheme "URI";
rdf:value "info:doi/10.1038/nrg2158" ];
rdf:_2 [
xmpidq:Scheme "URI";
rdf:value "doi:10.1038/nrg2158" ];
rdf:_3 [
xmpidq:Scheme "DOI";
rdf:value "10.1038/nrg2158" ] ];
dc:identifier "doi:10.1038/nrg2158";
dc:relation "http://dx.doi.org/10.1038/nrg2158" .
&lt;/pre>
&lt;p>Ta-da!&lt;/p>
&lt;p>(Of course, this is all premised on having freedom in writing out the XMP packet. If one is dependent on commercial applications to write out the packet then things may be different. Actually, they will be very different. They may not even be workable.)&lt;/p>
&lt;p>Feedback would be very welcome.&lt;/p></description></item><item><title>The Second Wave</title><link>https://www.crossref.org/blog/the-second-wave/</link><pubDate>Tue, 11 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/the-second-wave/</guid><description>&lt;p>You might have been wondering why I’ve been banging on about &lt;a href="http://www.adobe.com/products/xmp/" target="_blank">XMP&lt;/a> here. Why the emphasis on one vendor technology on a blog focussed on an industry linking solution? Well, this post is an attempt to answer that.&lt;/p>
&lt;p>Four years ago we at &lt;a href="http://www.nature.com/" target="_blank">Nature Publishing Group&lt;/a>, along with a select few early adopters, started up our RSS news feeds. We chose to use &lt;a href="http://web.resource.org/rss/1.0/" target="_blank">RSS 1.0&lt;/a> as the platform of choice which allowed us to embed a rich metadata term set using multiple schemas - especially Dublin Core and PRISM. We evangelized this much at the time and published documents on &lt;a href="http://www.xml.com/pub/a/2003/07/23/rssone.html" target="_blank">XML.com&lt;/a> (Jul. ’03) and in &lt;a href="http://www.dlib.org/dlib/december04/hammond/12hammond.html" target="_blank">D-Lib Magazine&lt;/a> (Dec. ’04) as well as speaking about this at various meetings and blogging about it. Since that time many more publishers have come on board and now provide RSS routinely, many of them choosing to enrich their feeds with metadata.&lt;/p>
&lt;p>Well, RSS can be seen in hindsight as being the &lt;strong>&lt;em>First Wave&lt;/em>&lt;/strong> of projecting a web presence beyond the content platform using standard markup formats. With this embedded metadata a publisher can expand their web footprint and allow users to link back to their content server.&lt;/p>
&lt;p>Now, XMP with its potential for embedding metadata in rich media can be seen as a &lt;strong>&lt;em>Second Wave&lt;/em>&lt;/strong>. Media assets distributed over the network can now carry along their own metadata and identity which can be leveraged by third-party applications to provide interesting new functionalities and link-back capability. Again a projection of web presence.&lt;/p>
&lt;p>(Continues.)&lt;/p>
&lt;p>XMP has much in common with RSS 1.0. They are both profiles of RDF/XML. They are both flawed in certain respects because of self-imposed limitations. But they both build on a robust and open data model for the web (RDF) and are reasonably open, at least they are extensible. One (RSS 1.0) was defined in an open process by committee, the other is an open (i.e published) specification provided by a vendor.&lt;/p>
&lt;p>From our point of view both specifications are sufficiently advanced to be immediately useful. I’m not sure how one could interact with the further development of either specification. RSS 1.0 is essentially frozen with Atom being posed as a successor technology, although Atom does not conform to the RDF model. (The upshot is that an RSS 1.0 feed can be consumed completely by an RDF-aware application, while an Atom feed would need to be pre-processed before any RDF “goodness” could be gleaned from it.) By contrast, XMP is a vendor-defined technology and alive, if not perhaps kicking. I am unaware of any process to formally contribute to the XMP development apart from shouting from the terraces. None the less, both technologies are usable as is.&lt;/p>
&lt;p>It is curious that no consistent packaging (and delivery) of metadata has yet been achieved with HTML, the original web interface. The HTML &lt;tt>&lt;title>&lt;/tt> and &lt;tt>&lt;meta>&lt;/tt> elements are employed by publishers with various degrees of consistency. There are also RDF islands that can be embedded within HTML comments (as used e.g. by &lt;a href="http://creativecommons.org/" target="_blank">CC licenses&lt;/a>). And then there are &lt;a href="https://web.archive.org/web/20090927174724/http://ocoins.info/" target="_blank">COinS&lt;/a> objects. But it’s all a bit of a mish-mash to date. Certainly, I don’t recall seeing any guidelines from Crossref as to how machine readable metadata (even markup for the DOI itself) may be embedded within HTML pages, rather than on HTML pages for human readers.&lt;/p>
&lt;p>This lack of uniform metadata deployment for HTML pages could be something to do with context. With RSS and XMP we are dealing with remote objects, whereas with HTML we are generally accessing this directly on the content server and so have a semantic context. It could be though that metadata delivery from HTML pages will finally be more uniformly available with the further development of standards such as &lt;a href="http://microformats.org/" target="_blank">microformats&lt;/a> and especially &lt;a href="http://www.w3.org/2006/07/SWD/RDFa/syntax/" target="_blank">RDFa&lt;/a>, &lt;a href="http://www.w3.org/2004/01/rdxh/spec" target="_blank">GRDDL&lt;/a>, etc. It is also interesting to note that an XMP packet could just as easily be embedded within the HTML page, and if this technology were to be adopted more widely for embedding in other media assets then why not consider the same technology for ordinary web pages?&lt;/p>
&lt;p>I can’t help feeling though that XMP has a lot of promise and is very timely. There are only three real obstacles: creating XMP packets, writing them and reading them. To my mind, once one has a good grasp of XMP then creating the packets can be done with common tools. The same, more or less, for reading the packets. I have shown earlier that this is readily achievable. The only major block is writing the packets into media files although there is support for create/write (if patchy) by open source libraries, as well as there being support (perhaps limited) from products for create/write. But, anyway, it’s certainly do-able.&lt;/p></description></item><item><title>W5M0MpCehiHzreSzNTczkc9d</title><link>https://www.crossref.org/blog/w5m0mpcehihzreszntczkc9d/</link><pubDate>Mon, 10 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/w5m0mpcehihzreszntczkc9d/</guid><description>&lt;p>What on earth can this string mean: ‘W5M0MpCehiHzreSzNTczkc9d’? This occurs in the XMP packet header:&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;tt>&lt;?xpacket begin='' id='W5M0MpCehiHzreSzNTczkc9d'?>&lt;/tt>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>Well from the XMP Specification (September 2005) which is available &lt;a href="https://web.archive.org/web/20170830043306/http://wwwimages.adobe.com/content/dam/Adobe/en/devnet/xmp/pdfs/XMPSpecificationPart3.pdf">here&lt;/a> (PDF) there is this text:&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“The required id attribute must follow begin. For all packets defined by this version of the syntax, the value of id is the following string: W5M0MpCehiHzreSzNTczkc9d”&lt;/i>&lt;/span>&lt;/p>
&lt;p>&lt;span >(See: 3 XMP Storage Model / XMP Packet Wrapper / Header / Attribute: id)&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >OK, so it’s no big deal to cut and paste that string, it’s just mighty curious why this cryptic key is needed in an open specification, especially since (contrary to what might be implied by the text) it doesn’t seem to vary with version. (Or hasn’t yet, at any rate - more below.)&lt;/span>&lt;/p>
&lt;span >
&lt;/span>
&lt;p>&lt;span >Right, so now we get down to it. Just what is the version number of the current XMP Specification anyways? I couldn’t for the life of me find one. (Note that I am talking about the XMP Specification itself and not the XMP Toolkit which is versioned at 4.1.1.) I am assuming that I have the latest version, else I really don’t know where else to look. This link&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;a href="http://www.adobe.com/products/xmp/">&lt;a href="http://www.adobe.com/products/xmp/" target="_blank">http://www.adobe.com/products/xmp/&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >leads me to&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;a href="http://www.adobe.com/devnet/xmp/">&lt;a href="http://www.adobe.com/devnet/xmp/" target="_blank">http://www.adobe.com/devnet/xmp/&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >which leads me to&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;a href=" https://web.archive.org/web/20210811233806/https://www.adobe.com/devnet/xmp.html">&lt;a href="https://web.archive.org/web/20210811233806/https://www.adobe.com/devnet/xmp.html" target="_blank">https://web.archive.org/web/20210811233806/https://www.adobe.com/devnet/xmp.html&lt;/a>&lt;/a>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >which by the way is also the same version that ships with the SDK.&lt;/span>&lt;/p>
&lt;p>&lt;span >I do know that there was a Version 1.5 published in September 14, 2001. (You can see that this is a fairly slow changing technology - the published spec is from 2 years back, and an earlier - the earlier? - version is from 6 years back). Note that this version has a version number (1.5) but still uses the same XMP packer header ‘id’ attribute.&lt;/span>&lt;/p>
&lt;p>&lt;span >No good, by the way, peeking inside the XMP of the XMP Spec either. Here’s a dump (using the DumpMainXMP utility with the SDK):&lt;/span>&lt;/p>
&lt;pre>&lt;span >% xmpd xmp_spec.xmp
 
 
// -----------------------------------
// Dumping main XMP for xmp_spec.xmp :
 
File info : format = " ", handler flags = 00000260
Packet info : offset = 0, length = 4051
 
Initial XMP from xmp_spec.xmp
Dumping XMPMeta object "" (0x0)
 
http://ns.adobe.com/pdf/1.3/ pdf: (0x80000000 : schema)
pdf:Producer = "Acrobat Distiller 7.0 (Windows)"
pdf:Copyright = "2005 Adobe Systems Inc."
pdf:Keywords = "XMP metadata schema XML RDF"
 
http://ns.adobe.com/xap/1.0/ xap: (0x80000000 : schema)
xap:CreateDate = "2005-09-23T15:19:07Z"
xap:ModifyDate = "2005-09-23T15:19:07Z"
xap:CreatorTool = "FrameMaker 7.1"
 
http://purl.org/dc/elements/1.1/ dc: (0x80000000 : schema)
dc:description (0x1E00 : isLangAlt isAlt isOrdered isArray)
[1] = "XMP metadata specification" (0x50 : hasLang hasQual)
? xml:lang = "x-default" (0x20 : isQual)
dc:creator (0x600 : isOrdered isArray)
[1] = "Adobe Developer Technologies"
dc:title (0x1E00 : isLangAlt isAlt isOrdered isArray)
[1] = "Extensible Metadata Platform (XMP) Specification" (0x50 : hasLang hasQual)
? xml:lang = "x-default" (0x20 : isQual)
dc:format = "application/pdf"
 
http://ns.adobe.com/pdfx/1.3/ pdfx: (0x80000000 : schema)
pdfx:Copyright = "2005 Adobe Systems Inc."
 
http://ns.adobe.com/xap/1.0/mm/ xapMM: (0x80000000 : schema)
xapMM:InstanceID = "uuid:99b91701-a78b-4652-84e5-6bccaeb7534e"
xapMM:DocumentID = "uuid:374ea24b-3931-4b83-944d-5b9daa42277e"
&lt;/span>&lt;/pre>
&lt;p>&lt;span >or in more readable form (courtesy of ‘&lt;a href="http://www.w3.org/2000/10/swap/doc/cwm">cwm&lt;/a>‘):&lt;/span>&lt;/p>
&lt;pre>&lt;span >% xmp2n3q docs/XMP-Specification.pdf
#Processed by Id: cwm.py,v 1.164 2004/10/28 17:41:59 timbl Exp
# using base file:/Users/tony/Sources/Build/XMP-SDK/
 
# Notation3 generation by
# notation3.py,v 1.166 2004/10/28 17:41:59 timbl Exp
 
# Base was: file:/Users/tony/Sources/Build/XMP-SDK/
 
@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
@prefix pdf: &amp;lt;http://ns.adobe.com/pdf/1.3/&amp;gt; .
@prefix pdfx: &amp;lt;http://ns.adobe.com/pdfx/1.3/&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpMM: &amp;lt;http://ns.adobe.com/xap/1.0/mm/&amp;gt; .
 
&amp;lt;&amp;gt; pdf:Copyright "2005 Adobe Systems Inc.";
pdf:Keywords "XMP metadata schema XML RDF";
pdf:Producer "Acrobat Distiller 7.0 (Windows)";
pdfx:Copyright "2005 Adobe Systems Inc.";
xmp:CreateDate "2005-09-23T15:19:07Z";
xmp:CreatorTool "FrameMaker 7.1";
xmp:ModifyDate "2005-09-23T15:19:07Z";
xmpMM:DocumentID "uuid:374ea24b-3931-4b83-944d-5b9daa42277e";
xmpMM:InstanceID "uuid:99b91701-a78b-4652-84e5-6bccaeb7534e";
dc:creator [
a rdf:Seq;
rdf:_1 "Adobe Developer Technologies" ];
dc:description [
a rdf:Alt;
rdf:_1 "XMP metadata specification"@x-default ];
dc:format "application/pdf";
dc:title [
a rdf:Alt;
rdf:_1 "Extensible Metadata Platform (XMP) Specification"@x-default ] .
 
#ENDS
&lt;/span>&lt;/pre>
&lt;p>&lt;span >So, just what then is the version number of the XMP Specification which the id string ‘W5M0MpCehiHzreSzNTczkc9d’ is marking?&lt;/span>&lt;/p></description></item><item><title>XMP - Some Other Gripes</title><link>https://www.crossref.org/blog/xmp-some-other-gripes/</link><pubDate>Mon, 10 Sep 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-some-other-gripes/</guid><description>&lt;p>Following on from the missing XMP Specification version number discussed in the &lt;a href="https://www.crossref.org/blog/w5m0mpcehihzreszntczkc9d">previous post&lt;/a> here below are listed some miscellaneous gripes I’ve got with XMP (on what otherwise is a very promising technology). I would be more than happy to be proved wrong on any of these points.&lt;/p>
&lt;p>&lt;em>1. XMP version history and archive&lt;/em>&lt;/p>
&lt;p>There doesn’t appear to be any XMP version history or archive hosted by Adobe as far as I can tell.&lt;/p>
&lt;p>&lt;em>2. Unpublished schemas&lt;/em>&lt;/p>
&lt;p>Also there is nothing published - outside the XMP Spec itself - on the core schemas used by XMP. There’s nothing to be gleaned from the namespace URIs used. The Adobe namespaces, e.g.&lt;/p>
&lt;blockquote>
&lt;p>&lt;a href="http://www.adobe.com/products/xmp/" target="_blank">&lt;a href="https://web.archive.org/web/20070929102516/http://www.adobe.com/products/xmp/" target="_blank">https://web.archive.org/web/20070929102516/http://www.adobe.com/products/xmp/&lt;/a> (listed in XMP Spec)&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="http://www.adobe.com/products/xmp/" target="_blank">&lt;a href="https://web.archive.org/web/20070929102516/http://www.adobe.com/products/xmp/" target="_blank">https://web.archive.org/web/20070929102516/http://www.adobe.com/products/xmp/&lt;/a> (not listed in XMP Spec)&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;p>seem to all resolve to this page&lt;/p>
&lt;blockquote>
&lt;p>&lt;a href="http://www.adobe.com/products/xmp/" target="_blank">http://www.adobe.com/products/xmp/&lt;/a>.&lt;/p>
&lt;/blockquote>
&lt;p>So, that can leave us with undocumented terms (e.g. ‘&lt;tt>xmpMM:Manifest&lt;/tt>‘ used by Adobe InDesign CS2 4.0.5) from documented schemas and also undocumented schemas (e.g. ‘&lt;tt>pdfx&lt;/tt>‘).&lt;/p>
&lt;p>&lt;em>3. UUID&lt;/em>&lt;/p>
&lt;p>Note also that many Adobe apps do not use the URN syntax for ‘&lt;tt>uuid:&lt;/tt>‘. The XMP Spec also has this to say:&lt;/p>
&lt;blockquote>
&lt;p>_“There is no formal standard for URIs that are based on an abstract UUID. The following proposal may be relevant:&lt;/p>&lt;/p>
&lt;p>&lt;a href="https://datatracker.ietf.org/doc/rfc4122/" target="_blank">https://datatracker.ietf.org/doc/rfc4122/&lt;/a>;&lt;/i>&lt;/p>
&lt;p>(see: 3 XMP Storage Model / Serializing XMP / rdf:Description elements / rdf:about attribute)”&lt;/i>&lt;/blockquote>&lt;/p>
&lt;p>I guess the XMP Spec (Sept. ’05) had just been bedded down more or less when the URN namespace for ‘&lt;tt>uuid:&lt;/tt>‘ was published as &lt;a href="http://www.ietf.org/rfc/rfc4122.txt" target="_blank">RFC 4122&lt;/a> in July ’05.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;em>4. RDF/XML serialization&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>The biggie.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>XMP schemas specify fixed property value types in RDF/XML, i.e. they specify a fixed profile of RDF/XML instead of generic RDF/XML. This has been commented on recently by &lt;a href="http://lists.w3.org/Archives/Public/semantic-web/2007Sep/0007.html" target="_blank">myself&lt;/a> on the &lt;a href="http://lists.w3.org/Archives/Public/semantic-web/" target="_blank">semantic-web&lt;/a> list, and also &lt;a href="http://lists.w3.org/Archives/Public/semantic-web/2007Sep/0008.html" target="_blank">here&lt;/a> by Bruce D’Arcus speaking about OpenDocument, and &lt;a href="http://lists.w3.org/Archives/Public/semantic-web/2007Sep/0027.html" target="_blank">here&lt;/a> by Mike Linksvayer speaking for CC.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>This profiling of RDF/XML leads to real problems. For example, Adobe have defined a Dublin Core (DC) schema which lists the property value types that DC values can assume in an XMP serialization. Meantime, the &lt;a href="https://web.archive.org/web/20070910031332/http://www.prismstandard.org/" target="_blank">PRISM 2.0&lt;/a> draft spec defines an incompatible mapping of DC terms to XMP property values. Since both schemas would make use of the same DC namespace (though PRISM haven’t actually specified a DC namespace for use with XMP but do use elsewhere the regular DC namespace) this isn’t going to work. I did supply some feedback on this to the PRISM WG but have heard nothing back from them. So, PRISM XMP looks uncertain at this time. Which, for us, must be a shame.&lt;/p>
&lt;/blockquote></description></item><item><title>connecting things: bioGUID, iSpiders and DOI</title><link>https://www.crossref.org/blog/connecting-things-bioguid-ispiders-and-doi/</link><pubDate>Fri, 07 Sep 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/connecting-things-bioguid-ispiders-and-doi/</guid><description>&lt;p>David Shorthouse and Rod Page have developed some great tools for linking references by tying together a number of services and using the Crossref OpenURL interface amongst other things. See David’s post - &lt;a href="http://ispiders.blogspot.com/2007/08/gimme-that-scientific-paper-part-iii.html" target="_blank">Gimme That Scientific Paper Part III&lt;/a> and Rod’s post on OpenURL and using ParaTools - “&lt;a href="http://iphylo.blogspot.com/2007/05/openurl-and-spiders.html" target="_blank">OpenURL and Spiders&lt;/a>“.&lt;/p>
&lt;p>Unfortunately our planned changes to the Crossref OpenURL interface (the 100 queries per day limit in particular) caused some concern for David (“&lt;a href="http://ispiders.blogspot.com/2007/09/crossref-takes-step-back.html" target="_blank">Crossref Takes a Step Back&lt;/a>“) - but make sure you read the comments to see my response!&lt;/p>
&lt;p>We decided to drop the 100 per day query limit for the OpenURL interface and there will be no charges for non-commercial use of the interface - &lt;a href="https://apps.crossref.org/requestaccount/" target="_blank">https://apps.crossref.org/requestaccount/&lt;/a>&lt;/p>
&lt;p>We want to encourage innovative uses of Crossref services and disseminate DOIs as effectively as possible so we appreciate feedback and encourage the type of development David and Rod are doing. It will be interesting to see if what they are doing has wider applicability. Maybe Crossref could host a webpage to point to tools like this and encourage more development?&lt;/p></description></item><item><title>Stop Press</title><link>https://www.crossref.org/blog/stop-press/</link><pubDate>Tue, 28 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/stop-press/</guid><description>&lt;p>Boy, was I ever so wrong! Contrary to what I said in yesterday’s post, the new PRISM 2.0 spec &lt;strong>&lt;em>does&lt;/em>&lt;/strong> support XMP value type mappings for its terms. See the table below which lists the PRISM basic vocabulary terms and the XMP value types.&lt;/p>
&lt;p>Many thanks to Dianne Kennedy and the rest of the PRISM Working Group for having added this support to PRISM 2.0.&lt;/p>
&lt;table border="1" cellpadding="3">
&lt;tr>
&lt;th>
Section
&lt;/th>
&lt;pre>&lt;code>&amp;lt;th&amp;gt;
PRISM Term
&amp;lt;/th&amp;gt;
&amp;lt;th&amp;gt;
XMP Value Type
&amp;lt;/th&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.1
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:alternateTitle
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.2
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:byteCount
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Integer&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.3
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:channel
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.4
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:complianceProfile
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Choice: &amp;amp;#8220;one&amp;amp;#8221;, &amp;amp;#8220;two&amp;amp;#8221;, &amp;amp;#8220;three&amp;amp;#8221;&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.5
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:copyright
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.6
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:corporateEntity
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.7
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:coverDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.8
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:coverDisplayDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.9
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:creationDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.10
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:distributor
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.11
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:edition
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.12
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:eIssn
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.13
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:embargoDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.14
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:endingPage
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.15
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:event
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.16
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:expirationDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.17
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:hasAlternative
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.18
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:hasCorrection
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.19
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:hasTranslation
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.20
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:industry
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.21
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:isCorrectionOf
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.22
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:issn
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.23
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:issueIdentifier
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.24
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:issueName
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.25
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:isTranslationOf
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.26
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:killDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.27
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:location
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.28
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:modificationDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.29
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:number
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.30
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:object
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.31
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:origin
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Choice: &amp;amp;#8220;email&amp;amp;#8221;, &amp;amp;#8220;mobile&amp;amp;#8221;, &amp;amp;#8220;broadcast&amp;amp;#8221;, &amp;amp;#8220;web&amp;amp;#8221;, &amp;amp;#8220;print&amp;amp;#8221;, &amp;amp;#8220;recordableMedia&amp;amp;#8221;, &amp;amp;#8220;other&amp;amp;#8221;&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.32
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:organisation
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.33
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:pageRange
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.34
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:person
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.35
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:postDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.36.
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:publicationDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.37
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:publicationName
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.38
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:receptionDate
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Date&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.39
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:rightsAgent
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.40
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:section
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.41
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:startingPage
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.42
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:subsection1
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.43
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:subsection2
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.44
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:subsection3
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.45
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:subsection4
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;bag Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.46
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:teaser
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.47
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:versionIdentifier
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.48
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:volume
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Text&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;tr>
&lt;td>
4.2.49
&lt;/td>
&lt;pre>&lt;code>&amp;lt;td&amp;gt;
prism:wordCount
&amp;lt;/td&amp;gt;
&amp;lt;td&amp;gt;
&amp;lt;b&amp;gt;Integer&amp;lt;/b&amp;gt;
&amp;lt;/td&amp;gt;
&lt;/code>&lt;/pre>
&lt;/tr>
&lt;/table></description></item><item><title>ExifTool</title><link>https://www.crossref.org/blog/exiftool/</link><pubDate>Mon, 27 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/exiftool/</guid><description>&lt;p>&lt;span >(&lt;b>Update - 2007.08.28:&lt;/b> I inadvertently missed out the term names in the last example of XMP as RDF/N3 with QNames and have now added these in. Also - a biggie - I said that PRISM had no XMP schema defined. This is actually wrong and as I blogged &lt;a href="https://www.crossref.org/blog/stop-press/">here&lt;/a> today, the new PRISM 2.0 spec does indeed have a mapping of PRISM terms to XMP value types. Should actually have read the spec instead of just blogging about it earlier &lt;a href="https://www.crossref.org/blog/prism-2.0/">here&lt;/a>. :~)&lt;/span>&lt;/p>
&lt;p>&lt;span >Having previously stooped to an extremely crass hack for pulling out a document information dictionary from PDFs (for which no apologies are sufficient but it does often work) I feel I should make some kind of amends and mention the wonderful &lt;a href="http://www.sno.phy.queensu.ca/~phil/exiftool/">ExifTool&lt;/a> by Phil Harvey for reading and writing metadata to media files. This is both a Perl library and command-line application (so it’s cross-platform - a Windows .exe and Mac OS .dmg are also provided.) Besides handling EXIF tags in image files this veritable swissknife of metadata inspectors can also read PDFs for the information dictionary and the document XMP packet. And moreover, intriguingly, can dump the raw (document) XMP packet.&lt;/span>&lt;/p>
&lt;p>&lt;span >I’m still experimenting with it. There’s quite a number of features to explore. But some preliminary finds are listed below.&lt;/span>&lt;/p>
&lt;p>&lt;span >Taking one of our standard (metadata poor) PDFs we get this dump:&lt;/span>&lt;/p>
&lt;pre class="">% exiftool nature05428.pdf
ExifTool Version Number : 6.95
File Name : nature05428.pdf
Directory : .
File Size : 367 kB
File Modification Date/Time : 2007:07:26 14:01:23
File Type : PDF
MIME Type : application/pdf
Page Count : 3
Producer : Acrobat Distiller 6.0.1 (Windows)
Mod Date : 2006:12:19 15:03:23+08:00
Creation Date : 2006:12:18 16:57:58+08:00
Creator : 3B2 Total Publishing System 7.51n/W
Creator Tool : 3B2 Total Publishing System 7.51n/W
Modify Date : 2006:12:19 15:03:23+08:00
Create Date : 2006:12:18 16:57:58+08:00
Metadata Date : 2006:12:19 15:03:23+08:00
Document ID : uuid:f598740b-ad11-41c5-a49e-7caffea783f0
Format : application/pdf
Title : untitled
&lt;/pre>
&lt;p>&lt;span >By way of comparison, if we take a demo (metadata rich) PDF with added descriptive DC and PRISM metadata terms, we then get this dump:&lt;/span>&lt;/p>
&lt;pre>% exiftool 445037a.pdf
ExifTool Version Number : 6.95
File Name : 445037a.pdf
Directory : .
File Size : 265 kB
File Modification Date/Time : 2007:07:26 16:18:17
File Type : PDF
MIME Type : application/pdf
Page Count : 1
Creator Tool : InDesign: pictwpstops filter 1.0
Metadata Date : 2006:12:22 12:10:07Z
Document ID : uuid:4cd39128-2c8e-41c0-9cad-eea2a1fdb64f
Identifier : doi:10.1038/445037a
Description : doi:10.1038/445037a
Source : Nature 445, 37 (2007)
Date : 2007:01:04
Format : application/pdf
Publisher : Nature Publishing Group
Language : en
Rights : © 2007 Nature Publishing Group
Publication Name : Nature
Issn : 0028-0836
E Issn : 1476-4679
Publication Date : 2007-01-04
Copyright : © 2007 Nature Publishing Group
Rights Agent : permissions@nature.com
Volume : 445
Number : 7123
Starting Page : 37
Ending Page : 37
Section : News and Views
Modify Date : 2006:12:22 12:10:07Z
Create Date : 2006:12:22 11:46:18Z
Title : 4.1 N&amp;V NS NEW.indd
Trapped : False
Creator : InDesign: pictwpstops filter 1.0
GTS PDFX Version : PDF/X-1:2001
GTS PDFX Conformance : PDF/X-1a:2001
Author : x
Producer : Acrobat Distiller 6.0.1 for Macintosh
&lt;/pre>
&lt;p>&lt;span >Note that the DC and PRISM terms are encoded as my earlier examples and do not take account of a) how DC is defined as an XMP schema (i.e. the XMP value types for the seperate terms), or b) how PRISM &lt;i>might&lt;/i> (because it isn’t yet) be defined as an XMP schema. Nor are identifier considerations fully taken into account. Nonetheless this gives more than an idea of what things could look like.&lt;/span>&lt;/p>
&lt;p>&lt;span >Now, with ExifTool it is also possible to list out the terms by group, e.g.&lt;/span>&lt;/p>
&lt;pre>% exiftool -g1 445037a.pdf
---- ExifTool ----
ExifTool Version Number : 6.95
---- File ----
File Name : 445037a.pdf
Directory : .
File Size : 265 kB
File Modification Date/Time : 2007:07:26 16:18:17
File Type : PDF
MIME Type : application/pdf
---- PDF ----
Page Count : 1
Modify Date : 2006:12:22 12:10:07Z
Create Date : 2006:12:22 11:46:18Z
Title : 4.1 N&amp;V NS NEW.indd
Trapped : False
Creator : InDesign: pictwpstops filter 1.0
GTS PDFX Version : PDF/X-1:2001
GTS PDFX Conformance : PDF/X-1a:2001
Author : x
Producer : Acrobat Distiller 6.0.1 for Macintosh
---- XMP-xmp ----
Creator Tool : InDesign: pictwpstops filter 1.0
Metadata Date : 2006:12:22 12:10:07Z
---- XMP-xmpMM ----
Document ID : uuid:4cd39128-2c8e-41c0-9cad-eea2a1fdb64f
---- XMP-dc ----
Identifier : doi:10.1038/445037a
Description : doi:10.1038/445037a
Source : Nature 445, 37 (2007)
Date : 2007:01:04
Format : application/pdf
Publisher : Nature Publishing Group
Language : en
Rights : © 2007 Nature Publishing Group
---- XMP-prism ----
Publication Name : Nature
Issn : 0028-0836
E Issn : 1476-4679
Publication Date : 2007-01-04
Copyright : © 2007 Nature Publishing Group
Rights Agent : permissions@nature.com
Volume : 445
Number : 7123
Starting Page : 37
Ending Page : 37
Section : News and Views
&lt;/pre>
&lt;p>&lt;span >Going back to the first example we can extract the (document) XMP packet as:&lt;/span>&lt;/p>
&lt;pre class="">% exiftool -xmp -b nature05428.pdf
&amp;lt;?xpacket begin='' id='W5M0MpCehiHzreSzNTczkc9d' bytes='1753'?&amp;gt;
&amp;lt;rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'
xmlns:iX='http://ns.adobe.com/iX/1.0/'&amp;gt;
&amp;lt;rdf:Description about='uuid:3d686cee-18e6-483c-b1c9-e128e9f0d009'
xmlns='http://ns.adobe.com/pdf/1.3/'
xmlns:pdf='http://ns.adobe.com/pdf/1.3/'&amp;gt;
&amp;lt;pdf:Producer&amp;gt;Acrobat Distiller 6.0.1 (Windows)&amp;lt;/pdf:Producer&amp;gt;
&amp;lt;pdf:ModDate&amp;gt;2006-12-19T15:03:23+08:00&amp;lt;/pdf:ModDate&amp;gt;
&amp;lt;pdf:CreationDate&amp;gt;2006-12-18T16:57:58+08:00&amp;lt;/pdf:CreationDate&amp;gt;
&amp;lt;pdf:Title&amp;gt;untitled&amp;lt;/pdf:Title&amp;gt;
&amp;lt;pdf:Creator&amp;gt;3B2 Total Publishing System 7.51n/W&amp;lt;/pdf:Creator&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;lt;rdf:Description about='uuid:3d686cee-18e6-483c-b1c9-e128e9f0d009'
xmlns='http://ns.adobe.com/xap/1.0/'
xmlns:xap='http://ns.adobe.com/xap/1.0/'&amp;gt;
&amp;lt;xap:CreatorTool&amp;gt;3B2 Total Publishing System 7.51n/W&amp;lt;/xap:CreatorTool&amp;gt;
&amp;lt;xap:ModifyDate&amp;gt;2006-12-19T15:03:23+08:00&amp;lt;/xap:ModifyDate&amp;gt;
&amp;lt;xap:CreateDate&amp;gt;2006-12-18T16:57:58+08:00&amp;lt;/xap:CreateDate&amp;gt;
&amp;lt;xap:Format&amp;gt;application/pdf&amp;lt;/xap:Format&amp;gt;
&amp;lt;xap:Title&amp;gt;
&amp;lt;rdf:Alt&amp;gt;
&amp;lt;rdf:li xml:lang='x-default'&amp;gt;untitled&amp;lt;/rdf:li&amp;gt;
&amp;lt;/rdf:Alt&amp;gt;
&amp;lt;/xap:Title&amp;gt;
&amp;lt;xap:MetadataDate&amp;gt;2006-12-19T15:03:23+08:00&amp;lt;/xap:MetadataDate&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;lt;rdf:Description about='uuid:3d686cee-18e6-483c-b1c9-e128e9f0d009'
xmlns='http://ns.adobe.com/xap/1.0/mm/'
xmlns:xapMM='http://ns.adobe.com/xap/1.0/mm/'&amp;gt;
&amp;lt;xapMM:DocumentID&amp;gt;uuid:f598740b-ad11-41c5-a49e-7caffea783f0&amp;lt;/xapMM:DocumentID&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;lt;rdf:Description about='uuid:3d686cee-18e6-483c-b1c9-e128e9f0d009'
xmlns='http://purl.org/dc/elements/1.1/'
xmlns:dc='http://purl.org/dc/elements/1.1/'&amp;gt;
&amp;lt;dc:format&amp;gt;application/pdf&amp;lt;/dc:format&amp;gt;
&amp;lt;dc:title&amp;gt;untitled&amp;lt;/dc:title&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;lt;/rdf:RDF&amp;gt;
&amp;lt;?xpacket end='r'?&amp;gt;%
&lt;/pre>
&lt;p>&lt;span >Note that this PDF also included XMP packets for illustrations but the tool extracted the main, or document, XMP packet.&lt;/span>&lt;/p>
&lt;p>&lt;span >And now that it’s easier to extract the metadata one can look to do something more interesting. For example, if one has &lt;a href="http://www.w3.org/2000/10/swap/doc/cwm">cwm&lt;/a> installed (Tim BL’s Closed World Machine for semweb dabblings - a Python application, so again cross-platform) one can pipe the XMP packet into cwm as RDF/XML, verify it as valid RDF and read out in another format, e.g. RDF/N3. For the above example we can so this as follows.&lt;/span>&lt;/p>
&lt;p>&lt;span >But let me first define a pipeline to extract the XMP, a couple filters to strip out processing instructions (includes the open and close bracketing &amp;lt;?xpacket&amp;gt; XMP PI’s as well as an undocumented - legacy? - &amp;lt;?adobe&amp;gt; Adobe PI), and then fed into cwm as RDF/XML and read out as RDF/N3. (Note that instead of ExifTool to extract the XMP another tool could have been used, e.g. something based on the sample apps shipped with the Adobe XMP SDK, or something bespoke.)&lt;/span>&lt;/p>
&lt;pre>% alias get_n3
exiftool -xmp -b !$ | grep -v "&amp;lt;?" | grep -v xmpmeta | cwm --rdf --n3
&lt;/pre>
&lt;p>&lt;span >We can then simply request to get the metadata from this PDF in RDF/N3 format:&lt;/span>&lt;/p>
&lt;pre class="">% get_n3 nature05428.pdf
#Processed by Id: cwm.py,v 1.164 2004/10/28 17:41:59 timbl Exp
# using base file:/Users/tony/Xcode/xmp/dev/
# Notation3 generation by
# notation3.py,v 1.166 2004/10/28 17:41:59 timbl Exp
# Base was: file:/Users/tony/Xcode/xmp/dev/
@prefix rdf: &amp;lt;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;gt; .
&amp;lt;uuid:3d686cee-18e6-483c-b1c9-e128e9f0d009&amp;gt; &amp;lt;http://ns.adobe.com/pdf/1.3/CreationDate&amp;gt; "2006-12-18T16:57:58+08:00";
&amp;lt;http://ns.adobe.com/pdf/1.3/Creator&amp;gt; "3B2 Total Publishing System 7.51n/W";
&amp;lt;http://ns.adobe.com/pdf/1.3/ModDate&amp;gt; "2006-12-19T15:03:23+08:00";
&amp;lt;http://ns.adobe.com/pdf/1.3/Producer&amp;gt; "Acrobat Distiller 6.0.1 (Windows)";
&amp;lt;http://ns.adobe.com/pdf/1.3/Title&amp;gt; "untitled";
&amp;lt;http://ns.adobe.com/xap/1.0/CreateDate&amp;gt; "2006-12-18T16:57:58+08:00";
&amp;lt;http://ns.adobe.com/xap/1.0/CreatorTool&amp;gt; "3B2 Total Publishing System 7.51n/W";
&amp;lt;http://ns.adobe.com/xap/1.0/Format&amp;gt; "application/pdf";
&amp;lt;http://ns.adobe.com/xap/1.0/MetadataDate&amp;gt; "2006-12-19T15:03:23+08:00";
&amp;lt;http://ns.adobe.com/xap/1.0/ModifyDate&amp;gt; "2006-12-19T15:03:23+08:00";
&amp;lt;http://ns.adobe.com/xap/1.0/Title&amp;gt; [
a rdf:Alt;
rdf:_1 "untitled"@x-default ];
&amp;lt;http://ns.adobe.com/xap/1.0/mm/DocumentID&amp;gt; "uuid:f598740b-ad11-41c5-a49e-7caffea783f0";
&amp;lt;http://purl.org/dc/elements/1.1/format&amp;gt; "application/pdf";
&amp;lt;http://purl.org/dc/elements/1.1/title&amp;gt; "untitled" .
#ENDS
&lt;/pre>
&lt;p>&lt;span >Or writing that out again with QNames for readability (and dropping the UUID as RDF subject as recommended by latest XMP spec) we have:&lt;/span>&lt;/p>
&lt;pre class="">#Processed by Id: cwm.py,v 1.164 2004/10/28 17:41:59 timbl Exp
# using base file:/Users/tony/Xcode/xmp/dev/
# Notation3 generation by
# notation3.py,v 1.166 2004/10/28 17:41:59 timbl Exp
# Base was: file:/Users/tony/Xcode/xmp/dev/
@prefix dc: &amp;lt;http://purl.org/dc/elements/1.1/&amp;gt; .
@prefix pdf: &amp;lt;http://ns.adobe.com/pdf/1.3/&amp;gt; .
@prefix xmp: &amp;lt;http://ns.adobe.com/xap/1.0/&amp;gt; .
@prefix xmpMM: &amp;lt;http://ns.adobe.com/xap/1.0/mm/&amp;gt; .
&amp;lt;&amp;gt; pdf:CreationDate "2006-12-18T16:57:58+08:00";
pdf:Creator "3B2 Total Publishing System 7.51n/W";
pdf:ModDate "2006-12-19T15:03:23+08:00";
pdf:Producer "Acrobat Distiller 6.0.1 (Windows)";
pdf:Title "untitled";
xmp:CreateDate "2006-12-18T16:57:58+08:00";
xmp:CreatorTool "3B2 Total Publishing System 7.51n/W";
xmp:Format "application/pdf";
xmp:MetadataDate "2006-12-19T15:03:23+08:00";
xmp:ModifyDate "2006-12-19T15:03:23+08:00";
xmp:Title [
a rdf:Alt;
rdf:_1 "untitled"@x-default ];
xmpMM:DocumentID "uuid:f598740b-ad11-41c5-a49e-7caffea783f0";
dc:format "application/pdf";
dc:title "untitled" .
#ENDS
&lt;/pre>
&lt;p>&lt;span >Now just imagine that there were something a little more interesting in there. Like a DOI. Like descriptive metadata, perhaps. 🙂&lt;/span>&lt;/p></description></item><item><title>pdfa.org</title><link>https://www.crossref.org/blog/pdfa.org/</link><pubDate>Thu, 23 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/pdfa.org/</guid><description>&lt;p>Following on from yesterday’s post I just came across this very useful source of information on PDF/A: the &lt;a href="http://pdfa.org/" target="_blank">PDF/A Conformance Center&lt;/a>. This provides links to resources such as this whitepaper &lt;a href="http://pdfa.org/" target="_blank">PDF/A - A new Standard for Long-Term Archiving&lt;/a>, and a number of technical notes, especially &lt;a href="http://pdfa.org/" target="_blank">Metadata and PDF/A-1&lt;/a>(also available as a &lt;a href="http://pdfa.org/" target="_blank">PDF&lt;/a>). (This latter corrects some errors in the ISO standard which are to be redressed in a forthcoming Technical Corrigendum later this year.)&lt;/p>
&lt;p>The site also links to the standard, to a FAQ, to &lt;a href="http://pdfa.org/" target="_blank">PDF/A products&lt;/a> and to news and events. There’s also an &lt;a href="http://pdfa.org/" target="_blank">RSS feed&lt;/a> and a &lt;a href="https://www.pdfa.org/discussion-forums/" target="_blank">discussion forum&lt;/a>.&lt;/p>
&lt;p>Still difficult to find examples of PDF/A though (the discussion forum doesn’t throw up too much on that score) although at least the Technical Note linked to above is a PDF/A-1 document as can be seen from this XMP description:&lt;/p>
&lt;pre>&amp;lt;rdf:Description rdf:about=""
xmlns:pdfaid="http://www.aiim.org/pdfa/ns/id/"&amp;gt;
&amp;lt;pdfaid:part&amp;gt;1&amp;lt;/pdfaid:part&amp;gt;
&amp;lt;pdfaid:conformance&amp;gt;A&amp;lt;/pdfaid:conformance&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&lt;/pre>
&lt;p>As noted before, PDF/A may be more (and less) than Crossref publishers require at this time, but nonetheless it is certainly a useful yardstick as regards embedding metadata within a PDF and is anyway a technology worth tracking in its own right.&lt;/p></description></item><item><title>Weird Scenes Inside the Gold Mine</title><link>https://www.crossref.org/blog/weird-scenes-inside-the-gold-mine/</link><pubDate>Wed, 22 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/weird-scenes-inside-the-gold-mine/</guid><description>&lt;p>&lt;span >So, following up on my recent posts here on Metadata in PDFs (&lt;a href="https://www.crossref.org/blog/metadata-in-pdf-1.-strategies/">Strategies&lt;/a>, &lt;a href="https://www.crossref.org/blog/metadata-in-pdf-2.-use-cases/">Use Cases&lt;/a>, &lt;a href="https://www.crossref.org/blog/metadata-in-pdf-3.-deployment/">Deployment&lt;/a>), I finally came across PDF/A and PDF/X, two ISO standardized subsets of PDF. the former (&lt;a href="http://www.iso.org/iso/catalogue_detail?csnumber=38920">ISO 19005-1:2005&lt;/a>) for archiving and the latter (&lt;a href="http://www.iso.org/iso/catalogue_detail?csnumber=34607">ISO 15929:2002&lt;/a>, &lt;a href="http://www.iso.org/iso/catalogue_detail?csnumber=29061">ISO 15930-1:2001&lt;/a>, etc.) for prepress digital data exchange.&lt;/span>&lt;/p>
&lt;p>&lt;span>Both formats share some common ground such as minimizing surprises between producer and consumer and keeping things open and predictable. But my interest here is specifically in metadata and to see what guidance these standards might provide us. Not unsurprisingly, metadata is a key issue for PDF/A, less so for PDF/X. I’ll discuss PDF/X briefly but the bulk of this post is focussed on PDF/A. See below.&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>&lt;i>PDF/X&lt;/i>&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >The main reference I am using here is the “Application Notes for PDF/X Standards” cited below [PDF/X 2]. There are two key sections which deal with metadata in PDF/X: “2.3 Identification and conformance”, and “2.20 Document identification and metadata”.&lt;/span>&lt;/p>
&lt;p>&lt;span >Section 2.3 states that a conforming PDF/X file has the key “&lt;tt>/GTS_PDFXVersion&lt;/tt>” in the document information dictionary, and (depending on version) may or may not have the key “&lt;tt>/GTS_PDFXConformance&lt;/tt>“.&lt;/span>&lt;/p>
&lt;p>&lt;span >Section 2.20 then talks about inclusion of a document ID within the document trailer to ensure correct identification of the file. It then goes on specifically to say:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Additionally, the use of the PDF version 1.4 Metadata key is allowed. Note that although information placed using this mechanism may be beneficial to production processes, any reader that is not PDF version 1.4 compliant may ignore this information.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >That is, PDF/X requires the use of a document information dictionary with the key “&lt;tt>/GTS_PDFXVersion&lt;/tt>” (and as version demands also the key “&lt;tt>/GTS_PDFXConformance&lt;/tt>“) to signal conformance. It is lukewarm, though with regard to the inclusion of XMP metadata (as would be indicated by the “&lt;tt>/Metadata&lt;/tt>” key in the document catalog).&lt;/span>&lt;/p>
&lt;p>&lt;span >&lt;b>&lt;i>PDF/A&lt;/i>&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >The main reference I’m using here is the “ISO DIS 19005-1:2005” draft cited below [PDF/A, 1].&lt;/span>&lt;/p>
&lt;p>&lt;span >Completely differently from PDF/X, PDF/A puts all its attention on the XMP metadata, while at the same time acknowledging that the document information dictionary may be used. Note 1 in Section 6.7.3 notes that:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“Since a document information dictionary is allowed within a conforming file, it is possible for a single file to be both PDF/A-1 and PDF/X [12, 13] conformant.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >The non-normative Annex B also has this to say:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“Use of non-XMP metadata at the file level is strongly discouraged as there is no assurance that such metadata can be preserved in accordance with this specification. In cases where non-XMP metadata is present, the preference is to convert it to XMP, embed it in the file, and describe the conversion in the xmpMM:History property.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >It’s not fully clear here whether “file level” is intended to be the same as “document level”. But note that this anyway is from a non-normative section and does not reflect the actual normative wording used in the standard (Section 6.7.3) which allows the use of the document information dictionary.&lt;/span>&lt;/p>
&lt;p>&lt;span >The key section for our purposes in the standard is “6.7 Metadata”.&lt;/span>&lt;/p>
&lt;p>&lt;span >Section “6.7.2 Properties” says:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“The document catalog dictionary of a conforming file shall contain the Metadata key. The metadata stream that forms the value of that key shall conform to XMP Specification. All metadata properties pertaining to a file that are embedded in that file, except for document information dictionary entries that have no analogue in predefined XMP schemas as defined in 6.7.3, shall be in the form of one or more XMP packets as defined by XMP Specification, 3. Metadata properties shall be specified in predefined XMP schemas or in one or more extension schemas that comply with XMP requirements. Metadata object stream dictionaries shall not contain the Filter key.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >This is quite something. Not only is PDF/A fully supportive of XMP (even if Adobe sometimes appear to be less than enthusiastic) it actually requires it. Further it says that the XMP packets shall be human readable (well, apart from the small matter of XML, that is :).&lt;/span>&lt;/p>
&lt;p>&lt;span >Section “6.7.3 Document information dictionary” then goes on to say:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“A document information dictionary may appear within a conforming file. If it does appear, then all of its entries that have analogous properties in predefined XMP schemas, as defined by Table 1, shall also be embedded in the file in XMP form with equivalent values. Any document information dictionary entry not listed in Table 1 shall not be embedded using a predefined XMP schema property.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >This says that the primary source of metadata will be the XMP packet and that, as far as possible, metadata properties in the document information dictionary will be mapped directly to the XMP packet as specified and will not cause any conflict.&lt;/span>&lt;/p>
&lt;p>&lt;span >I’m not quite sure how to read the last sentence. Does that mean that is one were to use an “&lt;tt>/Identifier&lt;/tt>” key in the document information dictionary then one couldn’t map it as “&lt;tt>dc:identifier&lt;/tt>“, say, in the XMP. I think that would be OK. My read is that it precludes the use of a predefined term within the information dictionary, so one couldn’t have something like “&lt;tt>dc:identifier&lt;/tt>” in the information dictionary.&lt;/span>&lt;/p>
&lt;p>&lt;span >Note also that the one quirky mapping in Table 1 which arises from the need to sync the information dictionary entries with the XMP properties is this:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“If the dc:creator property is present in XMP metadata then it shall be represented by an ordered Text array of length one whose single entry shall consist of one or more names. The value of dc:creator and the document information dictionary Author entry shall be equivalent.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >This means that:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“The document information dictionary entry:&lt;br /> &lt;/i>&lt;/span>&lt;/p>
&lt;pre>&lt;span >/Author (Peter, Paul, and Mary)&lt;/span>&lt;/pre>
&lt;p>&lt;span > &lt;/span>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;span >is equivalent to the XMP property:&lt;/span>&lt;/p>
&lt;pre>&lt;span >&amp;lt;dc:creator&amp;gt;
&lt;/blockquote>
&lt;p>&amp;lt;rdf:Seq&amp;gt;
&amp;lt;rdf.:li&amp;gt;Peter, Paul, and Mary&amp;lt;/rdf:li&amp;gt;
&amp;lt;/rdf:Seq&amp;gt;
&amp;lt;/dc:creator&amp;gt;
&lt;/span>&lt;/pre>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >“&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Weird, or what? Well, of course, I see the rationale, but …&lt;/span>&lt;/p>
&lt;p>&lt;span >The remaining sections of interest here are “6.7.6 File identifiers” which says that:&lt;/span>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“A conforming file should have one or more metadata properties to characterize, categorize, and otherwise identify the file. This part of ISO 19005 does not mandate any specific identification scheme. Identifiers may be externally based, such as an International Standard Book Number (ISBN) or a Digital Object Identifier (DOI), or internally based, such as a Globally Unique Identifier/Universally Unique Identifier (GUID/UUID) or another designation assigned during workflow operations.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >Hmm, not that DOI is a file identifier necessarily. And certainly not in the Crossref usage where is denotes a work rather than a manifestation.&lt;/span>&lt;/p>
&lt;p>&lt;span >Section “6.7.8 Extension schemas” talks about the need to rigorously declare any extension (undefined) schema with the following PDF/A extension schema description schema properties:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;span >pdfaSchema:schema&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span >pdfaSchema:namespaceURI&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span >pdfaSchema:prefix&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span >pdfaSchema:property&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span >pdfaSchema:valueType&lt;/span>&lt;span >I think this means that were PRISM terms to be used the extension schema terms would need to be defined.&lt;/span>&lt;/p>
&lt;p>&lt;span >And finally, the section “6.7.11 Version and conformance level identification” says that:&lt;/span>&lt;/p>&lt;/p>
&lt;blockquote>
&lt;p>&lt;span >&lt;i>“The PDF/A version and conformance level of a file shall be specified using the PDF/A Identification extension schema defined in this clause.”&lt;/i>&lt;/span>&lt;/p>
&lt;/blockquote>
&lt;p>&lt;span >This uses the PDF/A identification schema properties:&lt;/span>&lt;/p>
&lt;ul>
&lt;li>&lt;span >pdfaid:part&lt;/span>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;li>
&lt;p>&lt;span >pdfaid:amd&lt;/span>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;span >pdfaid:conformance&lt;/span>&lt;span >&lt;b>&lt;i>Summary&lt;/i>&lt;/b>&lt;/span>&lt;/p>
&lt;p>&lt;span >What does this all mean? Main lessons are to be learned from PDF/A which endorses (well, actually mandates) the use of XMP. Moreover, it requires that the document information dictionary and the XMP packet be in sync. Why it signals conformance through the XMP packet rather than through the information dictionary (as does PDF/X) is a mystery. Or at least not specify a means to also signal conformance through the information dictionary. The latter is readily get-at-able. A very crude hack to extract a PDF information dictionary can be as simple as&lt;/span>&lt;/p>&lt;/p>
&lt;pre>&lt;span >% strings &amp;lt;filename.pdf&amp;gt; | grep "/Producer"
&lt;/li>
&lt;/ul>
&lt;p>&lt;/span>&lt;/pre>&lt;/p>
&lt;pre>&lt;code>&amp;lt;span &amp;gt;or some other likely key. That will usually pull a line containing the full dictionary. The XMP packet is much harder to extract and then you’re still left with XML to parse.&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;My gut feeling is that both mechanisms should be required (and sync’ed). And it’s hard not to see the DOI being required in both sections. Leads to considerations on which schemas/terms to use and how to render the DOI. I am biased and would prefer to see it rendered in URI form, i.e. in an inclusive rather than an exclusive representation. DOI is special - but not that special. Other identifiers are also useful.&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;As per my &amp;lt;a href=&amp;quot;/blog/metadata-in-pdf-1.-strategies/&amp;quot;&amp;gt;earlier post&amp;lt;/a&amp;gt;, I could imagine that both DC and PRISM terms could be added to an XMP packet. I’m not sure whether there is any real interest at this time to follow the PDF/A specification or rather to be informed by it. There seems to be a lot of overhead and I’m still looking to meet up with some examples (either in the wild or fabricated) to see what it might look like in practice.&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;Interested as always in others’ views.&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;b&amp;gt;&amp;lt;i&amp;gt;References&amp;lt;/i&amp;gt;&amp;lt;/b&amp;gt;&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;So, note that these are ISO documents and as such are available for purchase from the &amp;lt;a href=&amp;quot;https://web.archive.org/web/20070614003151/http://www.iso.org/iso/en/prods-services/ISOstore/store.html&amp;quot;&amp;gt;ISO Store&amp;lt;/a&amp;gt;. (The citations above are linked to the relevant ISO Store pages.)&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;See also this recent post (August 1, 2007) by Rick Jelliffe on XML.com: &amp;lt;a href=&amp;quot;http://www.oreillynet.com/xml/blog/2007/08/where_to_get_iso_standards_on.html&amp;quot;&amp;gt;Where to get ISO Standards on the Internet free&amp;lt;/a&amp;gt;.&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;There appear to be three main sources of information for these technologies: the ISO standards, application notes and FAQs. NPES (The Association for Suppliers of Printing, Publishing and Converting Technologies) hosts pages with relevant links - see &amp;lt;a href=&amp;quot;https://web.archive.org/web/20050504132522/http://www.npes.org/standards/&amp;quot;&amp;gt;here&amp;lt;/a&amp;gt;.&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;Below are listed specific links to freely available documentation that may be useful. Note that I have not purchased the ISO standards but have made use of an ISO DIS (draft international standard) for PDF/A and Application Notes for PDF/X by CGATS. (As yet there are no links to Application Notes for PDF/A.)&amp;lt;/span&amp;gt;
&amp;lt;span &amp;gt;&amp;lt;a href=&amp;quot;http://www.npes.org/standards/toolspdfx.html&amp;quot;&amp;gt;PDF/X&amp;lt;/a&amp;gt;&amp;lt;/span&amp;gt;
1. &amp;lt;span &amp;gt;(No Draft International Standard found.)&amp;lt;/span&amp;gt;
&lt;/code>&lt;/pre>
&lt;ul>
&lt;li>&lt;span >&lt;a href="">Application Notes for PDF/X Standards Version 3&lt;/a>, September 2002, CGATS&lt;/span>
Application Notes for PDF/X Standards Version 4 (PDF/X-1a:2003, PDF/X-2:2003 &amp;amp; PDF/X-3:2003)&lt;/a>, September 2006 , CGATS&lt;/li>
&lt;li>&lt;span >&lt;a href="http://www.npes.org/LinkClick.aspx?fileticket=qStx8zxAyHA%3d&amp;tabid=158&amp;mid=669">Frequently Asked Questions&lt;/a>, November 2005, Martin Bailey, Chair, ISO/TC130/WG2/TF2 (PDF/X)&lt;/span>&lt;span >&lt;a href="http://www.npes.org/standards/toolspdfa.html">PDF/A&lt;/a>&lt;/span>
&lt;ol>
&lt;li>&lt;span >&lt;a href="https://web.archive.org/web/20060426194815/http://www.archivists.org.au/pubs/ISO_DIS_19005-1.pdf">Draft International Standard ISO/DIS 19005-1&lt;/a>, ISO/TC171/SC2, Document management— Electronic document file format for long-term preservation — Part 1: Use of PDF 1.4 (PDF/A-1)&lt;/span>&lt;/li>
&lt;/ol>
&lt;/li>
&lt;li>&lt;span >(No Application Notes for PDF/A available yet.)&lt;/span>&lt;/li>
&lt;li>&lt;span >&lt;a href="https://web.archive.org/web/20070107093446/http://www.npes.org/standards/Tools/19005-1_FAQ.pdf">Frequently Asked Questions (FAQs), ISO 19005-1:2005, PDF/A-1&lt;/a>, July 2006, PDF/A Joint Working Group &lt;/span>&lt;/li>
&lt;/ul></description></item><item><title>New SRU (1.2) Website</title><link>https://www.crossref.org/blog/new-sru-1.2-website/</link><pubDate>Wed, 08 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/new-sru-1.2-website/</guid><description>&lt;p>From Ray Denenberg’s post to the &lt;a href="https://web.archive.org/web/20070813010703/http://www.loc.gov/standards/sru/community/listserv.html" target="_blank">SRU Listserv&lt;/a> yesterday:&lt;/p>
&lt;blockquote>
&lt;p>_“The new SRU web site is now up: &lt;a href="http://www.loc.gov/sru/" target="_blank">http://www.loc.gov/sru/&lt;/a>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>It is completely reorganized and reflects the version 1.2 specifications.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>(It also includes version 1.1 specifications, but is oriented to version&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>1.2.)&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>…&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>There is an official 1.1 archive under the new site,&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="https://web.archive.org/web/20080724063403/http://www.loc.gov/sru/sru1-1archive/" target="_blank">https://web.archive.org/web/20080724063403/http://www.loc.gov/sru/sru1-1archive/&lt;/a>. And note also, that the new spec incorporates both version 1.1 and 1.2 (anything specific to version 1.1 is annotated as such).”_&lt;/p>
&lt;/blockquote>
&lt;p>Interested to learn if any Crossref publishers are currently implementing SRU.&lt;/p></description></item><item><title>Handle Plugin: Some Notes</title><link>https://www.crossref.org/blog/handle-plugin-some-notes/</link><pubDate>Thu, 02 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/handle-plugin-some-notes/</guid><description>&lt;p>The first thing to note is that this demo (the Acrobat plugin) is an application. And that comes with its own baggage, i.e. this is a Windows only plugin and is targeted at Acrobat Reader 8. On a wider purview the application merely bridges an identifier embedded in the media file and the handle record filed against that identifier and delivers some relevant functionality. The data (or metadata) declared in the PDF and in the associated handle if rich enough and structured openly can also be used by other applications. I think this is a key point worth bearing in mind, that the demo besides showing off new functionalities is also demonstrating how data (or metadata) can be embedded at the respective endpoints (PDF, handle).&lt;/p>
&lt;p>Some initial observations follow below.&lt;/p>
&lt;p>&lt;strong>&lt;em>Install problems&lt;/em>&lt;/strong>&lt;/p>
&lt;p>As noted in my previous post I had to haul out the old HP laptop and engage in a dialog with our IT folks to get both Acrobat Reader 8 and the plugin installed as I did not have admin privileges on my own machine. Wasn’t pretty but they were kind.&lt;/p>
&lt;p>&lt;strong>&lt;em>Useability&lt;/em>&lt;/strong>&lt;/p>
&lt;p>I don’t know what’s happening here but from our network it seems as if the first attempts to contact the handle server are timing out and the handle client in the plugin is failing over to an alternate route (HTTP?). So, the plugin doesn’t work as expected since the user has to wait an untenably long time (somewhere between 60s and 90s). Of course, if a certain network access policy is required that would need to be specified and implemented by institutions for their users.&lt;/p>
&lt;p>I used both Firefox and Internet Explorer browsers and ran into occasional Acrobat plugin crashes which would lock up the browser. Due to the severe network access problems noted above I wasn’t able to rigorously test this further apart from to note that it was “buggy”.&lt;/p>
&lt;p>&lt;strong>&lt;em>Functionality&lt;/em>&lt;/strong>&lt;/p>
&lt;p>I tested most of the demo cases, but was hampered by the useability restrictions noted above. I didn’t see the “Related Links” or get the “Collections” to work but did see all the other cases and tried the buttons provided.&lt;/p>
&lt;p>One thing of note is that the Crossref metadata record was spoofed and returned from a stored data file rather than an active query to Crossref. A real query would have been been interesting to guage the impact of network latency, although the lookup point is made by hardwiring a response.&lt;/p>
&lt;p>&lt;strong>&lt;em>PDF Metadata&lt;/em>&lt;/strong>&lt;/p>
&lt;p>OK, so the document DOI is embedded in the PDF both in the document information dictionary and in the (document) metadata stream within an XMP packet. This is great although I do have some specific comments about how the DOI is actually disclosed. See my &lt;a href="https://www.crossref.org/blog/metadata-in-pdf-2.-use-cases">Metadata in PDF: 2. Use Cases&lt;/a> post for details.&lt;/p>
&lt;p>&lt;strong>&lt;em>Handle Data&lt;/em>&lt;/strong>&lt;/p>
&lt;p>Handle types are generally a matter for the handle administrators to oversee, although the unregulated use of new types is not going to help foster interoperability between handle applications. In passing I note that the handles used in this demo&lt;/p>
&lt;pre>10.5555/pdftest-collection
10.5555/pdftest-collection-item1
10.5555/pdftest-collection-item2
10.5555/pdftest-collection-item3
10.5555/pdftest-crossref
10.5555/pdftest-kernelmetadata
10.5555/pdftest-multires
10.5555/pdftest-rights
10.5555/pdftest-version
&lt;/pre>
&lt;p>make use of the following handle types (periods and underscores used as below)&lt;/p>
&lt;pre>COLLECTION
COLLECTION_ITEM
HS_ADMIN
HS_MODIFIED
HDL_MD
HDL.RIGHTS
HDL.XREF
&lt;/pre>
&lt;p>There is some degree of variability here which presumably will be managed better with a central handle type registry.&lt;/p>
&lt;p>&lt;strong>&lt;em>DOI/Handle&lt;/em>&lt;/strong>&lt;/p>
&lt;p>And lastly, this demo raises questions again about DOI and handle boundaries. From a handle viewpoint a DOI is nothing more than a branded handle, whereas from a DOI viewpoint a DOI is a specific handle profile with governance and policies, and its own service portfolio. The two terms should not be used interchangeably which I fear is where some of the demo details would lead us. As a very crude analogy (and with apologies to Bob Kahn) but I would see the relationship between DOI and handle as not being dissimilar from that between TCP and IP.&lt;/p></description></item><item><title>Metadata in PDF: 3. Deployment</title><link>https://www.crossref.org/blog/metadata-in-pdf-3.-deployment/</link><pubDate>Thu, 02 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/metadata-in-pdf-3.-deployment/</guid><description>&lt;p>So, assuming we know the form of the metadata we wish to add to our PDFs (or else to comply with if there is already a set of guidelines, or some industry initiative in effect) how can we realize this? And, on the flip side, how can we make it easier for consumers to extract metadata we have embedded in our PDFs.&lt;/p>
&lt;p>Below are some considerations on deploying metadata in PDFs and consumer access.&lt;/p>
&lt;p>&lt;strong>&lt;em>Write New&lt;/em>&lt;/strong>&lt;/p>
&lt;p>Obviously the best option would be to speak to one’s suppliers and to get metadata added to the PDF at create time. This leads to questions such as:&lt;/p>
&lt;ul>
&lt;li>What metadata do we have available in the workflow process? Do we have the full set we wish to write, or just a subset?
&lt;ul>
&lt;li>Do we include metadata in the document information dictionary, or in the document metadata stream, or both?
&lt;ul>
&lt;li>
&lt;p>OK, so we’ve decided to (also) include an XMP packet. So, now do we make that XMP packet read only or write? That is, do we allow the possibility of further edits by adding in trailing whitespace and marking it as “write”? &lt;/ul>
&lt;strong>&lt;em>Write Update&lt;/em>&lt;/strong>&lt;/p>
&lt;p>What possibilities exist for updating legacy PDF archives?&lt;/p>
&lt;p>The cleanest means of updating a PDF is in-place edits. This maintains the number of PDF objects together with their lengths and byte offests. Specifically we are interested in metadata objects. There isn’t too much one can do with the document information dictionary apart from overwriting a field value or substituting a field. This is something that may be possible on a “one off” basis only. On the other hand, XMP packets are ripe for updating if they are set in “write” mode and have trailing whitespace. This can be used to supplement the metadata already contained in the packet.&lt;/p>
&lt;p>There is some “wiggle” room, however, even in read-only XMP packets which have no trailing whitespace. Some XMP packets may include unused default namespace declarations and/or empty elements. These could be safely stripped and used for more positive purposes. This may not be enough to write in a full metadata set, but could be enough to squeeze in the DOI.&lt;/p>
&lt;p>The usual way to update a PDF file is to append new objects. This means that a replacement document information dictionary and (document) metadata stream can be provided without worrying about shoe-horning the data into any leftover space in the original objects.&lt;/p>
&lt;p>And this would be just fine, but for the small matter of Linerarized PDFs. These are widely deployed as web friendly PDFs ready for byte serving and are written out in a strictly determined ordering. (See Appendix F, “Linearized PDF” in the &lt;a href="http://www.adobe.com/devnet/pdf/pdf_reference.html" target="_blank">PDF Reference Manual&lt;/a>.) The manual does, however, say (Section F.4.6, “Accessing an Updated File”) this about updating a Linearized PDF:&lt;/p>
&lt;blockquote>
&lt;p>_“As stated earlier, if a Linearized PDF file subsequently has an incremental update appended to it, the linearization and hints are no longer valid. Actually, this is not necessarily true, but the viewer application must do some additional work to validate&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>the information.&lt;/p>&lt;/p>
&lt;p>…&lt;/p>
&lt;p>For a PDF file that has received only a small update, this approach may be worthwhile. Accessing the file this way is quicker than accessing it without hints or retrieving the entire file before displaying any of it.”&lt;/i>&lt;/blockquote>&lt;/p>
&lt;p>This may warrant some further investigation.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;strong>&lt;em>Read&lt;/em>&lt;/strong>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Now for consumers, how can publishers help users to read the metadata embedded in a file? The document information dictionary is reasobaly accessible and is in the clear. It probably would not provide for much in terms of metadata but should anyway hopefully contain the DOI.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>The XMP SDK is still far too unwieldy for wide use. Things would be much improved if there were even some &lt;a href="http://www.swig.org/" target="_blank">SWIG wrappers&lt;/a> for more popular languages such as Perl, Python, Ruby, etc. around the C++ code. The other thing to bear in mind is that the XMP SDK is dealing with generalities such as constructing and parsing XMP objects for reading and updating in a range of binary files. A consumer metadata app would only be interested in extracting the RDF/XML from the PDF. This can then be dealt with as appropriate to the application. Another problem concerns multiple XMP packets occurring in the same PDF, only one of them being the main (or document) XMP packet. This may be a non-problem in that all the RDF/XML could be extracted and the main XMP packet would be identifiable through the metadata it provided.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>I suggest the best way to really help consumers is to go ahead and embed metadata in the first place, then there would be a clear impetus for extracting it. Even if a fuller metadata set is not being considered at this time, then at least the DOI should be considered for embedding in the PDF as a “hook” for further services. The handle plugin is a really good example of just such a downstream application.&lt;/p>
&lt;/blockquote>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>PRISM 2.0</title><link>https://www.crossref.org/blog/prism-2.0/</link><pubDate>Thu, 02 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/prism-2.0/</guid><description>&lt;p>Only just caught up with this but the PRISM 2.0 draft is now available (since July 12) for public comment. See this posted by Dianne Kennedy:&lt;/p>
&lt;blockquote>
&lt;p>_“Just a note to let you know that PRISM 2.0 has just been posted at &lt;a href="https://web.archive.org/web/20070929195327/http://www.prismstandard.org/" target="_blank">www.prismstandard.org&lt;/a> . This is the first major revision to PRISM. We have incorporated new elements to support online content and have expanded and revised our controlled vocabularies. In addition we have added a profile to support PRISM in an XMP environment.&lt;/p>&lt;/p>
&lt;p>We invite you to review the new specification (in 6 documents organized by namespace) and provide your comments before September 15. Please just email comments and questions to me, &lt;a href="mailto:dkennedy@idealliance.org">dkennedy@idealliance.org&lt;/a>. “&lt;/i>&lt;/blockquote>&lt;/p>
&lt;/blockquote></description></item><item><title>Metadata in PDF: 1. Strategies</title><link>https://www.crossref.org/blog/metadata-in-pdf-1.-strategies/</link><pubDate>Wed, 01 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/metadata-in-pdf-1.-strategies/</guid><description>&lt;p>Emboldened by my own researches, by the recent handle plugin announcement from CNRI (on which, more in a follow-on post), and by Alexander Griekspoor’s comment to my earlier post, I thought I’d write a more extensive piece about embedding metadata in PDF with a view to the following:&lt;/p>
&lt;ul>
&lt;li>Discover what other publishers are currently doing&lt;/li>
&lt;li>Stimulate discussions between content providers and/or consumers&lt;/li>
&lt;li>Lay groundwork for a Crossref best practice guidelines&lt;/li>
&lt;/ul>
&lt;p>Why should Crossref be interested? Well, at minimum to embed the DOI along with the digital asset would seem to be inherently “a good thing”. (And, in fact, this is precisely the approach that CNRI have taken for their plugin demos. I’ll look later at what they actually did and consider whether that is a model that Crossref publishers might usefully follow.)&lt;/p>
&lt;p>Why include the DOI as an explicit piece of metadata rather than have it included by virtue of its appearance in a content section? The main reason is that it is then unambiguously accessible. Content sections in PDFs are typically filtered and sometimes encrypted), whereas metadata is usually plain text and moreover is marked up as to field type.&lt;/p>
&lt;p>Another question concerns whether to add in the identifier alone, or to embed a full metadata set. Why not just embed the identifier and visit Crossref for the metadata? This is feasible in some cases although it does involve an extra network trip, requires an application to service the identifier and is obviously not workable in offline contexts. Seems like a “no-brainer” to include a fuller description from the outset. Note that publishers frequently make some of this information available anyway in other metadata delivery channels, e.g. RSS feeds.&lt;/p>
&lt;p>There are two (complementary) approaches to embedding &lt;em>document-level&lt;/em> metadata in a PDF:&lt;/p>
&lt;ul>
&lt;li>&lt;strong>&lt;em>A - Document Information Dictionary&lt;/em>&lt;/strong>
This is an optional object (a dictionary) referenced from the PDF trailer dictionary. Example:&lt;/li>
&lt;/ul>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-1" data-lang="1">&lt;span class="line">&lt;span class="cl">&amp;lt;&amp;lt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Title ( PostScript Language Reference, Third Edition )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Author ( Adobe Systems Incorporated )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Creator ( Adobe FrameMaker 5.5.3 for Power Macintosh&amp;amp;reg; )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Producer ( Acrobat Distiller 3.01 for Power Macintosh )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/CreationDate ( D:19970915110347-08&amp;#39;00&amp;#39; )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/ModDate ( D:19990209153925-08&amp;#39;00&amp;#39; )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;gt;&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">endobj
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;ul>
&lt;li>&lt;strong>&lt;em>B - (Document) Metadata Stream&lt;/em>&lt;/strong>
This is an optional object (a stream) referenced from the document catalog, itself referenced from the PDF trailer dictionary. Example:&lt;/li>
&lt;/ul>
&lt;pre tabindex="0">&lt;code>2 0 obj
&amp;lt;&amp;lt;
/Type /Metadata
/Subtype /XML
/Length 1706
&amp;gt;&amp;gt;
stream
&amp;lt;?xpacket begin=&amp;#39;&amp;#39; id=&amp;#39;W5M0MpCehiHzreSzNTczkc9d&amp;#39;?&amp;gt;
&amp;lt;!-- RDF/XML goes here --&amp;gt;
&amp;lt;?xpacket end=&amp;#39;w&amp;#39;?
&amp;gt;
endstream
endobj
&lt;/code>&lt;/pre>&lt;p>Both approaches usually make the embedded metadata in the PDF available in the clear, whereas content is frequently filtered and sometimes encrypted. (Note that the information dictionary is always in the clear, while the metadata stream can be filtered and rendered unreadable although in practice this tends not to be filtered.)&lt;/p>
&lt;p>Below I examine both approaches and see how they can be used to encode the kind of metadata that scholarly publishers are accustomed to.&lt;/p>
&lt;p>&lt;strong>&lt;em>A - Document Information Dictionary&lt;/em>&lt;/strong>&lt;/p>
&lt;p>Note that keys in the document information dictionary divide equally between the logical document description (non-asterisked keys) and the physical asset description (asterisked keys):&lt;/p>
&lt;pre tabindex="0">&lt;code>Title
Author
Subject
Keywords
&amp;amp;nbsp;
* Creator
* Producer
* CreationDate
* ModDate
* Trapped
&lt;/code>&lt;/pre>&lt;p>This is the complete listing of keys in the PDF specification, although foreign keys are allowed (and ignored).&lt;/p>
&lt;p>What is missing here is any document identifier and/or any other descriptive metadata. From a Crossref point of view the identifier (the DOI) is a “hook” into the metadata record and so at minimum this could usefully be added. The question then is how? Either the identifier can be squeezed into one of the existing fields (“Title”, “Author”, “Subject”, “Keywords”) or else a new foreign key could be created.&lt;/p>
&lt;p>IMO if an existing keyword is used then I would opt for “Subject” or “Keywords”, and probably the former. If, on the other hand, a new foreign key were to be created I would choose something generic and (in keeping with the other terms) use something like “Identifier” (rather than, say, “DOI”).&lt;/p>
&lt;p>Of preference, I think I would go for the latter (“Identifier”) but if one wanted to make this more robust one could think of also adding in a known term (e.g. “Subject” or “Keywords”). So, to include metadata for the news article “Cosmology: Ripples of early starlight” printed in &lt;em>Nature&lt;/em> magazine &lt;em>Nature 445, 37 (2007): doi:10.1038/445037a&lt;/em>, we might include the following terms in the document information dictionary as:&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-1" data-lang="1">&lt;span class="line">&lt;span class="cl">&amp;lt;&amp;lt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Title ( Cosmology: Ripples of early starlight )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Author ( Craig J. Hogan )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Subject ( doi:10.1038/445037a )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Keywords ( cosmology infrared protogalaxy starlight )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;b&amp;gt;/Identifier ( doi:10.1038/445037a )&amp;lt;/b&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Creator ( ... )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Producer ( ... )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/CreationDate ( ... )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/ModDate ( ... )
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;gt;&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">endobj
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>where the bolded term represents a foreign key/value pair.&lt;/p>
&lt;p>Note: This (including the DOI in the “Subject” field) is a fix intended to get the DOI listed by Adobe apps which would not otherwise recognize the foreign key “Identifier”.&lt;/p>
&lt;p>Since it is not really feasible to include separate enumerated fields within the information dictionary (although it could be done), one might also consider including a descriptive citation field as a foreign key, e.g., something like:&lt;/p>
&lt;pre tabindex="0">&lt;code>/Source (Nature 445, 37 \(2007\))
&lt;/code>&lt;/pre>&lt;p>Aternatively that might better be presented as the “Subject” along with the DOI. Which would then limit the number of foreign keys to one (“Identifier”).&lt;/p>
&lt;p>&lt;strong>&lt;em>B - (Document) Metadata Stream&lt;/em>&lt;/strong>&lt;/p>
&lt;p>The metadata stream with its use of XMP packets (wrapping RDF/XML instances) is a much more flexible approach to embedding metadata and allows multiple schemas to be used. As noted in my &lt;a href="https://www.crossref.org/blog/xmp-first-hacks/">previous post&lt;/a> here on XMP, PDFs with XMP packets mostly use media-specific terms and schemas, although there is also a token showing of DC. From a descriptive metadata point of view we would more likely make use of DC and PRISM for our schemas.&lt;/p>
&lt;p>Reprising the example from the &lt;a href="https://www.crossref.org/blog/xmp-first-hacks/">previous post&lt;/a> (and again using citation example listed above) this would mean we may be inclined to include the following terms for a scholarly work (here in RDF/N3 for readability):&lt;/p>
&lt;pre tabindex="0">&lt;code class="language-dc:creator" data-lang="dc:creator">dc:title &amp;#34;Cosmology: Ripples of early starlight&amp;#34; ;
dc:identifier &amp;#34;doi:10.1038/445037a&amp;#34; ;
dc:source &amp;#34;Nature 445, 37 (2007)&amp;#34; ;
dc:date &amp;#34;2007-01-04&amp;#34; ;
dc:format &amp;#34;application/pdf&amp;#34; ;
dc:publisher &amp;#34;Nature Publishing Group&amp;#34; ;
dc:language &amp;#34;en&amp;#34; ;
dc:rights &amp;#34;© 2007 Nature Publishing Group&amp;#34; ;
&amp;amp;nbsp;
prism:publicationName &amp;#34;Nature&amp;#34; ;
prism:issn &amp;#34;0028-0836&amp;#34; ;
prism:eIssn &amp;#34;1476-4679&amp;#34; ;
prism:publicationDate &amp;#34;2007-01-04&amp;#34; ;
prism:copyright &amp;#34;© 2007 Nature Publishing Group&amp;#34; ;
prism:rightsAgent &amp;#34;permissions@nature.com&amp;#34; ;
prism:volume &amp;#34;445&amp;#34; ;
prism:number &amp;#34;7123&amp;#34; ;
prism:startingPage &amp;#34;37&amp;#34; ;
prism:endingPage &amp;#34;37&amp;#34; ;
prism:section &amp;#34;News and Views&amp;#34; ;
&lt;/code>&lt;/pre>&lt;p>This would look something like the following as an XMP packet within a PDF metadata stream (the RDF now being serialized as RDF/XML):&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" class="chroma">&lt;code class="language-2" data-lang="2">&lt;span class="line">&lt;span class="cl">&amp;lt;&amp;lt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Type /Metadata
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Subtype /XML
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">/Length 1706
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;gt;&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">stream
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;?xpacket begin=&amp;#39;&amp;#39; id=&amp;#39;W5M0MpCehiHzreSzNTczkc9d&amp;#39;?&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;rdf:RDF xmlns:rdf=&amp;#34;http://www.w3.org/1999/02/22-rdf-syntax-ns#&amp;#34;&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;rdf:Description rdf:about=&amp;#34;&amp;#34;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">xmlns:dc=http://purl.org/dc/elements/1.1/&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:creator&amp;gt;Craig J. Hogan&amp;lt;/dc:creator&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:title&amp;gt;Cosmology: Ripples of early starlight&amp;lt;/dc:title&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:identifier&amp;gt;doi:10.1038/445037a&amp;lt;/dc:identifier&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:source&amp;gt;Nature 445, 37 (2007)&amp;lt;/dc:source&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:date&amp;gt;2007-01-04&amp;lt;/dc:date&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:format&amp;gt;application/pdf&amp;lt;/dc:format&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:publisher&amp;gt;Nature Publishing Group&amp;lt;/dc:publisher&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:language&amp;gt;en&amp;lt;dc:language&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;dc:rights&amp;gt;© 2007 Nature Publishing Group&amp;lt;/dc:rights&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;/rdf:Description&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;amp;nbsp;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;rdf:Description rdf:about=&amp;#34;&amp;#34;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">xmlns:prism=[https://web.archive.org/web/20140228105237/http://prismstandard.org/namespaces/1.2/basic/](https://web.archive.org/web/20140228105237/http://prismstandard.org/namespaces/1.2/basic/)&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:publicationName&amp;gt;Nature&amp;lt;/prism:publicationName&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:issn&amp;gt;0028-0836&amp;lt;/prism:issn&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:eIssn&amp;gt;1476-4679&amp;lt;/prism:eIssn&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:publicationDate&amp;gt;2007-01-04&amp;lt;/prism:publicationDate&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:copyright&amp;gt;© 2007 Nature Publishing Group&amp;lt;/prism:copyright&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:rightsAgent&amp;gt;permissions@nature.com&amp;lt;/prism:rightsAgent&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:volume&amp;gt;445&amp;lt;/prism:volume&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:number&amp;gt;7123&amp;lt;/prism:number&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:startingPage&amp;gt;37&amp;lt;/prism:startingPage&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:endingPage&amp;gt;37&amp;lt;/prism:endingPage&amp;amp;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;prism:section&amp;gt;News and Views&amp;lt;/prism:section&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;/rdf:Description&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">&amp;lt;?xpacket end=&amp;#39;w&amp;#39;?&amp;gt;
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">endstream
&lt;/span>&lt;/span>&lt;span class="line">&lt;span class="cl">endobj
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;p>&lt;strong>&lt;em>References&lt;/em>&lt;/strong>&lt;/p>
&lt;p>Some useful references are:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>Adobe® Portable Document Format, Version 1.7, November 2006 (see &lt;a href="http://www.adobe.com/devnet/pdf/pdf_reference.html" target="_blank">http://www.adobe.com/devnet/pdf/pdf_reference.html&lt;/a>).&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Adobe® XMP Specification, September 2005 (see &lt;a href="http://partners.adobe.com/public/developer/en/xmp/sdk/XMPspecification.pdf" target="_blank">http://partners.adobe.com/public/developer/en/xmp/sdk/XMPspecification.pdf&lt;/a>&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Embedding XMP Metadata in Application Files, September 2001 (see &lt;a href="http://xml.coverpages.org/XMP-Embedding.pdf" target="_blank">http://xml.coverpages.org/XMP-Embedding.pdf&lt;/a>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Note a): See Section 10.2, “Metadata” in Ref. &lt;a href="https://www.crossref.org/blog/xmp-first-hacks/">1&lt;/a>.&lt;/p>
&lt;p>Note b): Ref. [3] is a fairly brief draft which covers both the Information Dictionary and Metadata Dictionary (XMP) approaches. There is an Adobe-hosted update to this document from June 2002 but that only discusses the XMP approach.&lt;/p></description></item><item><title>Metadata in PDF: 2. Use Cases</title><link>https://www.crossref.org/blog/metadata-in-pdf-2.-use-cases/</link><pubDate>Wed, 01 Aug 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/metadata-in-pdf-2.-use-cases/</guid><description>&lt;p>Well, this is likely to be a fairly brief post as I’m not aware of many use cases of metadata in PDFs from scholarly publishers. Certainly, I can say for &lt;em>Nature&lt;/em> that we haven’t done much in this direction yet although are now beginning to look into this.&lt;/p>
&lt;p>I’ll discuss a couple cases found in the wild but invite comment as to others’ practices. Let me start though with the CNRI handle plugin demo for Acrobat which I blogged &lt;a href="https://www.crossref.org/blog/handle-acrobat-reader-plugin/">here&lt;/a>.&lt;/p>
&lt;p>&lt;strong>&lt;em>Handle Plugin&lt;/em>&lt;/strong>&lt;/p>
&lt;p>First off, the handle plugin PDF samples do include an embedded (test) DOI in both the document information dictionary&lt;/p>
&lt;pre>5 0 obj
&amp;lt;&amp;lt;
/CreationDate (D:20070614140125-04'00')
/Author (Simon)
/Creator (PScript5.dll Version 5.2.2)
/Producer (Acrobat Distiller 8.1.0 \(Windows\))
/ModDate (D:20070614140240-04'00')
&lt;b>/HDL (10.5555/pdftest-crossref)&lt;/b>
/Title (Microsoft Word - crossref-rev.doc)
&amp;gt;&amp;gt;
endobj
&lt;/pre>
&lt;p>and in the (document) metadata stream&lt;/p>
&lt;pre>&amp;lt;rdf:Description rdf:about="" xmlns:pdfx="http://ns.adobe.com/pdfx/1.3/"&amp;gt;
&lt;b>&amp;lt;pdfx:HDL&amp;gt;10.5555/pdftest-crossref&amp;lt;/pdfx:HDL&amp;gt;&lt;/b>
&amp;lt;/rdf:Description&amp;gt;
&lt;/pre>
&lt;p>Bar any fuller disclosure of metadata terms at large (and one of the demo cases makes use of DOI to retrieve metadata form Crossref) this is excellent. I would, however, quibble with the use of “HDL” as a foreign key for the information dictionary. I realize this is just a test but the term “HDL” (or “DOI”, for that’s what it really is) is somewhat specific and a more general term such as “Identifier” would probably have more mileage, e.g.&lt;/p>
&lt;pre>5 0 obj
&amp;lt;&amp;lt;
...
&lt;b>/Identifier (doi:10.5555/pdftest-crossref)&lt;/b>
...
&amp;gt;&amp;gt;
endobj
&lt;/pre>
&lt;p>In the second example from the metadata dictionary I don’t think the term “HDL” from the PDF extension schema “pdfx” is very helpful. (Is that namespace actually defined anywhere?) From a descriptive metadata viewpoint a more usual schema such as DC would have wider coverage. So again the second example would be better rendered as&lt;/p>
&lt;pre>&amp;lt;rdf:Description rdf:about="" xmlns:dc="http://purl.org/dc/elements/1.1/"&amp;gt;
&lt;b>&amp;lt;dc:identifier&amp;gt;doi:10.5555/pdftest-crossref&amp;lt;/dc:identifier&amp;gt;&lt;/b>
&amp;lt;/rdf:Description&amp;gt;
&lt;/pre>
&lt;p>or, alternately,&lt;/p>
&lt;pre>&amp;lt;rdf:Description rdf:about="" xmlns:dc="http://purl.org/dc/elements/1.1/"&amp;gt;
&lt;b>&amp;lt;dc:identifier&amp;gt;info:hdl/10.5555/pdftest-crossref&amp;lt;/dc:identifier&amp;gt;&lt;/b>
&amp;lt;/rdf:Description&amp;gt;
&lt;/pre>
&lt;p>&lt;strong>&lt;em>Elsevier&lt;/em>&lt;/strong>&lt;/p>
&lt;p>Well, we have Alexander Griekspoor’s &lt;a href="https://www.crossref.org/blog/xmp-first-hacks/">comment earlier&lt;/a> that Elsevier are including the DOI in their PDFs. I don’t know how consistently this is being done but I’ve checked a couple sample articles and it would seem that they have embedded the DOI (here from &lt;em>Cancer Cell, doi:0.1016/j.ccr.2007.06.004&lt;/em>) in the title element which shows up in the information dictionary as&lt;/p>
&lt;pre>361 0 obj
&amp;lt;&amp;lt;
/Producer (Adobe LiveCycle PDFG 7.2)
/Creator (Elsevier)
/Author ()
/Keywords ()
&lt;b>/Title (doi:10.1016/j.ccr.2007.06.004)&lt;/b>
/ModDate (D:20070630031637+05'30')
/Subject ()
/CreationDate (D:00000101000000Z)
&amp;gt;&amp;gt;
endobj
&lt;/pre>
&lt;p>and in the (document) metadata dictionary as&lt;/p>
&lt;pre>365 0 obj
&amp;lt;&amp;lt;
/Type /Metadata
/Subtype /XML
/Length 1526
&amp;gt;&amp;gt;
stream
&amp;lt;?xpacket begin='' id='W5M0MpCehiHzreSzNTczkc9d' bytes='1526'?&amp;gt;
&amp;nbsp;
&amp;lt;rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'
xmlns:iX='http://ns.adobe.com/iX/1.0/'&amp;gt;
&amp;nbsp;
&amp;lt;rdf:Description about=''
xmlns='http://ns.adobe.com/pdf/1.3/'
xmlns:pdf='http://ns.adobe.com/pdf/1.3/'&amp;gt;
&amp;lt;pdf:Producer&amp;gt;Adobe LiveCycle PDFG 7.2&amp;lt;/pdf:Producer&amp;gt;
&amp;lt;pdf:ModDate&amp;gt;2007-06-30T03:16:37+05:30&amp;lt;/pdf:ModDate&amp;gt;
&lt;b>&amp;lt;pdf:Title&amp;gt;doi:10.1016/j.ccr.2007.06.004&amp;lt;/pdf:Title&amp;gt;&lt;/b>
&amp;lt;pdf:Creator&amp;gt;Elsevier&amp;lt;/pdf:Creator&amp;gt;
&amp;lt;pdf:Author&amp;gt;&amp;lt;/pdf:Author&amp;gt;
&amp;lt;pdf:Keywords&amp;gt;&amp;lt;/pdf:Keywords&amp;gt;
&amp;lt;pdf:Subject&amp;gt;&amp;lt;/pdf:Subject&amp;gt;
&amp;lt;pdf:CreationDate&amp;gt;0-01-01T00:00:00Z&amp;lt;/pdf:CreationDate&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;nbsp;
&amp;lt;rdf:Description about=''
xmlns='http://ns.adobe.com/xap/1.0/'
xmlns:xap='http://ns.adobe.com/xap/1.0/'&amp;gt;
&amp;lt;xap:CreatorTool>Elsevier&amp;lt;/xap:CreatorTool&amp;gt;
&amp;lt;xap:ModifyDate>2007-06-30T03:16:37+05:30&amp;lt;/xap:ModifyDate&amp;gt;
&amp;lt;xap:Title&amp;gt;
&amp;lt;rdf:Alt&amp;gt;
&lt;b>&amp;lt;rdf:li xml:lang='x-default'>doi:10.1016/j.ccr.2007.06.004&amp;lt;/rdf:li&amp;gt;&lt;/b>
&amp;lt;/rdf:Alt&amp;gt;
&amp;lt;/xap:Title&amp;gt;
&amp;lt;xap:Author&amp;gt;&amp;lt;/xap:Author&amp;gt;
&amp;lt;xap:Description&amp;gt;
&amp;lt;rdf:Alt&amp;gt;
&amp;lt;rdf:li xml:lang='x-default'/&amp;gt;
&amp;lt;/rdf:Alt&amp;gt;
&amp;lt;/xap:Description&amp;gt;
&amp;lt;xap:CreateDate&amp;gt;0-01-01T00:00:00Z&amp;lt;/xap:CreateDate&amp;gt;
&amp;lt;xap:MetadataDate>2007-06-30T03:16:37+05:30&amp;lt;/xap:MetadataDate&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;nbsp;
&amp;lt;rdf:Description about=''
xmlns='http://purl.org/dc/elements/1.1/'
xmlns:dc='http://purl.org/dc/elements/1.1/'&amp;gt;
&lt;b>&amp;lt;dc:title>doi:10.1016/j.ccr.2007.06.004&amp;lt;/dc:title&amp;gt;&lt;/b>
&amp;lt;dc:creator/&amp;gt;
&amp;lt;dc:description/&amp;gt;
&amp;lt;/rdf:Description&amp;gt;
&amp;nbsp;
&amp;lt;/rdf:RDF>
&amp;lt;?xpacket end='r'?&amp;gt;
endstream
endobj
&lt;/pre>
&lt;p>Kudos anyway to Elsevier for emebedding this piece of information in their PDFs (if indeed it is a general practice). This has the merit of being picked up by Adobe apps and displayed in e.g. Reader. Also third party apps can pull this and use this to retrieve the metadata record from Crossref.&lt;/p>
&lt;p>The only downside is that technically this seems to be a kludge to satisfy Adobe apps and is not the correct field for filing this information. I would have thought that some other information dictionary field (e.g. “Subject”) would be a better kludge, and then reserve the “Title” and “Author” fields for their proper purposes. The RDF/XML title fields would appear to be inherited from the “Title” field in the information dictionary. It’s a bit of a shame really because the DOI is embedded - it’s just in the wrong place(s). (OK, so that’s still way better, maybe, than not providing this information at all.)&lt;/p>
&lt;p>Hopefully, with more examples to mull over and experiences to learn from we can arrive at a much better and more systematic way of including the DOI, and other key metadata fields, within a PDF so that this information can be gleaned easily and unambiguously by third party apps.&lt;/p></description></item><item><title>Handle Acrobat Reader Plugin</title><link>https://www.crossref.org/blog/handle-acrobat-reader-plugin/</link><pubDate>Tue, 31 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/handle-acrobat-reader-plugin/</guid><description>&lt;p>Just announced on the &lt;a href="http://www.handle.net/mailman/listinfo/handle-info" target="_blank">handle-info&lt;/a> list is a new plugin from CNRI for Acrobat Reader - see &lt;a href="http://www.handle.net/hs-tools/adobe/" target="_blank">here&lt;/a>. The announcement says:&lt;/p>
&lt;blockquote>
&lt;p>_“It is intended to demonstrate the utility of embedding a identifying&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>handle in a PDF document.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>…&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p> &lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>A set of demonstration documents, each with an embedded identifying&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>handle, is packaged with the plug-in to show potential uses. To make&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>productive use of this technology, a given industry or community of&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>users would have to agree on one or more specific applications and&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>populate the relevant handle records accordingly.”_&lt;/p>
&lt;/blockquote>
&lt;p>Two immediate comments:&lt;/p>
&lt;ul>
&lt;li>This is a Windows-only plugin (realized that right after hitting the download button and seeing the ‘.exe’ file) and also needs admin rights to install. (So I solved the first hurdle and am trying to clear the second hurdle. Lockdown is not an uncommon practice for enterprise or institutional computers.)&lt;br>
(&lt;strong>Update:&lt;/strong> Actually, I think I got this wrong. I need admin privileges to install Adobe Acrobat 8. Still scuppered, though. Can’t even see the sample PDF files.)
&lt;ul>
&lt;li>The plugin seems to be aimed at the user rather than at the user agent and thus is necessarily limited in scope, i.e. it needs a human driver. (Ideally content providers would embed metadata within media files using structured markup techniques which would be readily accessible to any downstream app which could leverage this data transparently to provide enhanced user services.) &lt;/ul>
Anyway, I’ll add something more when I can get it installed. I think this tool could be a useful addition to publishing toolkits but also that content providers could do much more for consumers by disclosing metadata for their digital assets in a neutral, structured form.&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>URI Template Republished</title><link>https://www.crossref.org/blog/uri-template-republished/</link><pubDate>Sat, 28 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/uri-template-republished/</guid><description>&lt;p>Well, it all went very quiet for a while but glad to see that the &lt;a href="https://github.com/jcgregorio/uri-templates/blob/master/draft-gregorio-uritemplate-01.txt" target="_blank">URI Template Internet-Draft&lt;/a> has just been republished:&lt;/p>
&lt;blockquote>
&lt;p>_“A New Internet-Draft is available from the on-line Internet-Drafts&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>directories.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Title : URI Template&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Author(s) : J. Gregorio, et al.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Filename : draft-gregorio-uritemplate-01.txt&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Pages : 9&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Date : 2007-7-23&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>URI Templates are strings that can be transformed into URIs after&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>embedded variables are substituted. This document defines the&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>syntax and processing of URI Templates.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>A URL for this Internet-Draft is:&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;a href="https://github.com/jcgregorio/uri-templates/blob/master/draft-gregorio-uritemplate-01.txt" target="_blank">https://github.com/jcgregorio/uri-templates/blob/master/draft-gregorio-uritemplate-01.txt&lt;/a>”&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>_&lt;/p>
&lt;/blockquote>
&lt;p>URI templates should be a very useful publishing tool. Templates are already used by technologies such as OpenSearch - see &lt;a href="https://www.opensearch.org/" target="_blank">here&lt;/a>.&lt;/p></description></item><item><title>XMP: First Hacks</title><link>https://www.crossref.org/blog/xmp-first-hacks/</link><pubDate>Fri, 27 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-first-hacks/</guid><description>&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>(&lt;code>&amp;lt;b&amp;gt;&lt;/code>Update - 2007.07.28:&lt;code>&amp;lt;/b&amp;gt;&lt;/code> I meant to reference in this entry Pierre Lindenbaum’s post back in May &lt;code>&amp;lt;a href=&amp;quot;http://plindenbaum.blogspot.com/2007/05/is-there-any-xmp-in-scientific-pdf-no.html&amp;quot;&amp;gt;&lt;/code>Is there any XMP in scientific pdf ? (No)&lt;code>&amp;lt;/a&amp;gt;&lt;/code>, which btw also references Roderic Page’s post on &lt;code>&amp;lt;a href=&amp;quot;http://iphylo.blogspot.com/2007/05/xmp.html&amp;quot;&amp;gt;&lt;/code>XMP&lt;code>&amp;lt;/a&amp;gt;&lt;/code> but forgot to add in the links in my haste to scoot off. Well, truth is we still can’t answer Pierre in the affirmative but at least we can take the first steps towards rectifying this.)&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>I’ve been revisiting Adobe’s &lt;code>&amp;lt;a href=&amp;quot;http://www.adobe.com/products/xmp/&amp;quot;&amp;gt;&lt;/code>XMP&lt;code>&amp;lt;/a&amp;gt;&lt;/code> just recently. (I blogged &lt;code>&amp;lt;a href=&amp;quot;/blog/xmp-capabilities-extended//&amp;quot;&amp;gt;&lt;/code>here&lt;code>&amp;lt;/a&amp;gt;&lt;/code> about the new &lt;code>&amp;lt;a href=&amp;quot;http://www.adobe.com/devnet/xmp/&amp;quot;&amp;gt;&lt;/code>XMP Toolkit 4.1&lt;code>&amp;lt;/a&amp;gt;&lt;/code> back in March.)&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>I wanted to share some of my early experiences. First off, after a couple of previous attempts which got pushed aside due to other projects, I managed to compile the libraries and the sample apps that ship with the C++ SDK under Xcode on the Mac. I also needed to compile &lt;code>&amp;lt;a href=&amp;quot;https://libexpat.github.io/&amp;quot;&amp;gt;&lt;/code>Expat&lt;code>&amp;lt;/a&amp;gt;&lt;/code> first which doesn’t ship with the distribution.&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>OK, so far, so good. What this basically leaves one with is a couple of XMP dump utilities (&lt;code>&amp;lt;i&amp;gt;&lt;/code>DumpMainXMP&lt;code>&amp;lt;/i&amp;gt;&lt;/code> and &lt;code>&amp;lt;i&amp;gt;&lt;/code>DumpScannedXMP&lt;code>&amp;lt;/i&amp;gt;&lt;/code>) and two others (&lt;code>&amp;lt;i&amp;gt;&lt;/code>XMPCoreCoverage&lt;code>&amp;lt;/i&amp;gt;&lt;/code> and &lt;code>&amp;lt;i&amp;gt;&lt;/code>XMPFilesCoverage&lt;code>&amp;lt;/i&amp;gt;&lt;/code>) which is a good start anyways for exploring. And turns out that our PDFs already have some workflow metadata in them. This is encouraging because the SDK allows apps to read and update existing XMP packets from files, though not to write new packets into files (as far as I understand).&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>I thought I would take this opportunity anyway to:&lt;/p>
&lt;ol>
&lt;li>&lt;code>&amp;lt;span &amp;gt;&lt;/code>See what XMP metadata terms we might consider adding&lt;/li>
&lt;/ol>
&lt;ul>
&lt;li>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>Try and add these to existing XMP packets&lt;code>&amp;lt;span &amp;gt;&lt;/code>Ugly details are presented below, but by updating the XMP packet metadata in one of our PDFs (&lt;code>&amp;lt;i&amp;gt;&lt;/code>Nature 445, 37 (2007), C.J. Hogan&lt;code>&amp;lt;/i&amp;gt;&lt;/code>) we can teach Acrobat Reader to read - see the “before” (&lt;code>&amp;lt;a href=&amp;quot;https://web.archive.org/web/20130815224916/http://nurture.nature.com/&amp;quot;&amp;gt;&lt;/code>PDF here&lt;code>&amp;lt;/a&amp;gt;&lt;/code>) and “after” (&lt;code>&amp;lt;a href=&amp;quot;https://web.archive.org/web/20130815224916/http://nurture.nature.com/&amp;quot;&amp;gt;&lt;/code>PDF here&lt;code>&amp;lt;/a&amp;gt;&lt;/code>) screenshots in the figure.&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&amp;lt;img src=&amp;quot;/wp/blog/images/acrobats.png&amp;quot; alt=&amp;quot;acrobats.png&amp;quot; width=&amp;quot;583&amp;quot; height=&amp;quot;466&amp;quot; /&amp;gt;&lt;/code>&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>Of course, this is really about much more than getting Adobe apps to read/write metadata. It’s about using XMP as a standard platform for embedding metadata in digital assets for &lt;code>&amp;lt;i&amp;gt;&lt;/code>third-party apps&lt;code>&amp;lt;/i&amp;gt;&lt;/code> to read/write. If we can put ID3 tags into our podcasts then why not XMP packets into other media?&lt;code>&amp;lt;/p&amp;gt;&lt;/code>&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>First a brief digression on XMP packets, which look essentially like this:&lt;/p>
&lt;pre class="">&lt;?xpacket begin="..." id="..."?>
&amp;lt;x:xmpmeta xmlns:x="adobe:ns:meta/"&amp;gt;
&amp;lt;rdf:RDF xmlns:rdf="..." xmlns:...&amp;gt;
...
&amp;lt;/rdf:RDF&amp;gt;
&amp;lt;/x:xmpmeta&amp;gt;
... XML whitespace as padding ...
&amp;lt;?xpacket end="w"?&amp;gt;
&amp;lt;rdf:RDF&amp;gt;" element which is optionally wrapped by an "`&lt;tt>`&amp;lt;x:xmpmeta&amp;gt;`&lt;/tt>`" element. This XML fragment with trailing XML whitespace is topped and tailed by "`&lt;tt>`&amp;lt;?xpacket&amp;gt;`&lt;/tt>`" processing instructions with "`&lt;tt>`begin`&lt;/tt>`" and "`&lt;tt>`end`&lt;/tt>`" attributes, respectively.
The RDF supported is a simple profile of RDF with only certain constructs recognized: scalars, arrays, structures. It is not a means to embed arbitrary RDF/XML structures. But I'll pass on that for now. At first blush it's at least suitable to get a simple dictionary of key/value terms written in, and more besides.
The XMP metadata from the `&lt;a href="https://web.archive.org/web/20130815224916/http://nurture.nature.com/">`PDF file`&lt;/a>` listed above looks as follows in RDF/N3 (which is a more chipper serialization of RDF than is RDF/XML):`&lt;/pre>`
&lt;pre>&lt;code>`&amp;lt;pre&amp;gt;`&amp;amp;lt;uuid:...&amp;amp;gt;
&lt;/code>&lt;/pre>
&lt;p>dc:creator &amp;ldquo;x&amp;rdquo; ;
dc:format &amp;ldquo;application/pdf&amp;rdquo; ;
dc:title &amp;ldquo;19.7 N&amp;amp;V.indd NEW.indd&amp;rdquo;@x-default ;
 
pdf:GTS_PDFXConformance &amp;ldquo;PDF/X-1a:2001&amp;rdquo; ;
pdf:GTS_PDFXVersion &amp;ldquo;PDF/X-1:2001&amp;rdquo; ;
pdf:Producer &amp;ldquo;Acrobat Distiller 6.0.1 for Macintosh&amp;rdquo; ;
pdf:Trapped &amp;ldquo;False&amp;rdquo; ;
 
pdfx:GTS_PDFXConformance &amp;ldquo;PDF/X-1a:2001&amp;rdquo; ;
pdfx:GTS_PDFXVersion &amp;ldquo;PDF/X-1:2001&amp;rdquo; ;
 
xap:CreateDate &amp;ldquo;2007-07-16T09:25:20+01:00&amp;rdquo; ;
xap:CreatorTool &amp;ldquo;InDesign: pictwpstops filter 1.0&amp;rdquo; ;
xap:MetadataDate &amp;ldquo;2007-07-16T11:40:21+01:00&amp;rdquo; ;
xap:ModifyDate &amp;ldquo;2007-07-16T11:40:21+01:00&amp;rdquo; ;
 
xapMM:DocumentID &amp;ldquo;uuid:be3a9be5-4e3a-4b66-a50b-26f0a0bfc89d&amp;rdquo; ;
xapMM:InstanceID &amp;ldquo;uuid:73dcd021-d40a-4cb7-a99b-44f8e90624f4&amp;rdquo; .
&lt;code>&amp;lt;/pre&amp;gt;&lt;/code>&lt;/p>
&lt;pre>&lt;code>`&amp;lt;span &amp;gt;`(`&amp;lt;b&amp;gt;`Note:`&amp;lt;/b&amp;gt;` I’ve omitted namespaces here and dropped some of the structuring info that was present on the &amp;amp;#8220;`&amp;lt;tt&amp;gt;`dc:creator`&amp;lt;/tt&amp;gt;`&amp;amp;#8221; and &amp;amp;#8220;`&amp;lt;tt&amp;gt;`dc:title`&amp;lt;/tt&amp;gt;`&amp;amp;#8221; elements thus leaving all values as simple strings. Back to that in a bit. )
`&amp;lt;span &amp;gt;`What this says is simply that all these properies expressed in key/value pairs apply to the current document denoted by the resource identifier &amp;amp;#8220;`&amp;lt;tt&amp;gt;`[uuid:...](uuid:...)`&amp;lt;/tt&amp;gt;`&amp;amp;#8220;, and terms are taken from the schemas indicated by the prefixes. So, for example, the term &amp;amp;#8220;`&amp;lt;tt&amp;gt;`creator`&amp;lt;/tt&amp;gt;`&amp;amp;#8221; from the schema referenced by the placeholder &amp;amp;#8220;`&amp;lt;tt&amp;gt;`dc`&amp;lt;/tt&amp;gt;`&amp;amp;#8221; (there is a namespace URI for this but I haven’t shown it here) has the value &amp;amp;#8220;`&amp;lt;tt&amp;gt;`x`&amp;lt;/tt&amp;gt;`&amp;amp;#8221; for this document, and so on.
`&amp;lt;span &amp;gt;`So, salting away the media- and XMP-specific metadata, we are left with the following work metadata in our main XMP packet.
`&amp;lt;pre&amp;gt;&amp;lt;span &amp;gt;`&amp;amp;lt;uuid:...&amp;amp;gt;
&lt;/code>&lt;/pre>
&lt;p>dc:creator &amp;ldquo;x&amp;rdquo; ;
dc:format &amp;ldquo;application/pdf&amp;rdquo; ;
dc:title &amp;ldquo;19.7 N&amp;amp;V.indd NEW.indd&amp;rdquo;@x-default ;
&lt;code>&amp;lt;/pre&amp;gt;&lt;/code>&lt;/p>
&lt;pre>&lt;code>`&amp;lt;span &amp;gt;`Not wildly impressive, i must admit. Ideally we would like to pump this up with a fuller descriptive and rights metadata set such as we routinely syndicate with our web feeds. This would make use of both DC and PRISM vocabularies. In RDF/N3 we might expect to see something like:
`&amp;lt;pre&amp;gt;&amp;lt;span &amp;gt;`&amp;amp;lt;uuid:...&amp;amp;gt;
&lt;/code>&lt;/pre>
&lt;p>dc:creator &amp;ldquo;Craig J. Hogan&amp;rdquo; ;
dc:title &amp;ldquo;Cosmology: Ripples of early starlight&amp;rdquo; ;
dc:identifier &amp;ldquo;doi:10.1038/445037a&amp;rdquo; ;
dc:description &amp;ldquo;doi:10.1038/445037a&amp;rdquo; ;
dc:source &amp;ldquo;Nature 445, 37 (2007)&amp;rdquo; ;
dc:date &amp;ldquo;2007-01-04&amp;rdquo; ;
dc:format &amp;ldquo;application/pdf&amp;rdquo; ;
dc:publisher &amp;ldquo;Nature Publishing Group&amp;rdquo; ;
dc:language &amp;ldquo;en&amp;rdquo; ;
dc:rights &amp;ldquo;© 2007 Nature Publishing Group&amp;rdquo; ;
 
prism:publicationName &amp;ldquo;Nature&amp;rdquo; ;
prism:issn &amp;ldquo;0028-0836&amp;rdquo; ;
prism:eIssn &amp;ldquo;1476-4679&amp;rdquo; ;
prism:publicationDate &amp;ldquo;2007-01-04&amp;rdquo; ;
prism:copyright &amp;ldquo;© 2007 Nature Publishing Group&amp;rdquo; ;
prism:rightsAgent &amp;ldquo;&lt;a href="mailto:permissions@nature.com">permissions@nature.com&lt;/a>&amp;rdquo; ;
prism:volume &amp;ldquo;445&amp;rdquo; ;
prism:number &amp;ldquo;7123&amp;rdquo; ;
prism:startingPage &amp;ldquo;37&amp;rdquo; ;
prism:endingPage &amp;ldquo;37&amp;rdquo; ;
prism:section &amp;ldquo;News and Views&amp;rdquo; ;
&lt;code>&amp;lt;/pre&amp;gt;&lt;/code>&lt;/p>
&lt;pre>&lt;code>`&amp;lt;span &amp;gt;`So, taking this RDF and doing a quick and dirty substitution of it for the existing DC description in the PDF XMP packet (i.e. more or less &amp;amp;#8220;lobotomizing&amp;amp;#8221; the PDF) we then get an updated XMP packet which can be dumped with the `&amp;lt;i&amp;gt;`DumpMainXMP`&amp;lt;/i&amp;gt;` utility as (with some schemas removed):
`&amp;lt;pre&amp;gt;&amp;lt;span &amp;gt;`// ----------------------------------
&lt;/code>&lt;/pre>
&lt;p>// Dumping main XMP for 445037a.pdf :
 
File info : format = &amp;quot; &amp;ldquo;, handler flags = 00000260
Packet info : offset = 267225, length = 3651
 
Initial XMP from 445037a.pdf
Dumping XMPMeta object &amp;quot;&amp;rdquo; (0x0)
 
&amp;hellip;
 
&lt;a href="http://purl.org/dc/elements/1.1/" target="_blank">http://purl.org/dc/elements/1.1/&lt;/a> dc: (0x80000000 : schema)
dc:rights (0x1E00 : isLangAlt isAlt isOrdered isArray)
[1] = &amp;quot; 2007 Nature Publishing Group&amp;quot; (0x50 : hasLang hasQual)
? xml:lang = &amp;ldquo;x-default&amp;rdquo; (0x20 : isQual)
dc:language (0x200 : isArray)
[1] = &amp;ldquo;en&amp;rdquo;
dc:publisher (0x200 : isArray)
[1] = &amp;ldquo;Nature Publishing Group&amp;rdquo;
dc:format = &amp;ldquo;application/pdf&amp;rdquo;
dc:date (0x600 : isOrdered isArray)
[1] = &amp;ldquo;2007-01-04&amp;rdquo;
dc:source = &amp;ldquo;Nature 445, 37 (2007)&amp;rdquo;
dc:description (0x1E00 : isLangAlt isAlt isOrdered isArray)
[1] = &amp;ldquo;doi:10.1038/445037a&amp;rdquo; (0x50 : hasLang hasQual)
? xml:lang = &amp;ldquo;x-default&amp;rdquo; (0x20 : isQual)
dc:identifier = &amp;ldquo;doi:10.1038/445037a&amp;rdquo;
dc:title (0x1E00 : isLangAlt isAlt isOrdered isArray)
[1] = &amp;ldquo;Cosmology: Ripples of early starlight&amp;rdquo; (0x50 : hasLang hasQual)
? xml:lang = &amp;ldquo;x-default&amp;rdquo; (0x20 : isQual)
dc:creator (0x600 : isOrdered isArray)
[1] = &amp;ldquo;Craig J. Hogan&amp;rdquo;
 
&lt;a href="https://web.archive.org/web/20211021092941/http://prismstandard.org/namespaces/1.2/basic/" target="_blank">https://web.archive.org/web/20211021092941/http://prismstandard.org/namespaces/1.2/basic/&lt;/a> prism: (0x80000000 : schema)
prism:section = &amp;ldquo;News and Views&amp;rdquo;
prism:endingPage = &amp;ldquo;37&amp;rdquo;
prism:startingPage = &amp;ldquo;37&amp;rdquo;
prism:number = &amp;ldquo;7123&amp;rdquo;
prism:volume = &amp;ldquo;445&amp;rdquo;
prism:rightsAgent = &amp;ldquo;&lt;a href="mailto:permissions@nature.com">permissions@nature.com&lt;/a>&amp;rdquo;
prism:copyright = &amp;quot; 2007 Nature Publishing Group&amp;quot;
prism:publicationDate = &amp;ldquo;2007-01-04&amp;rdquo;
prism:eIssn = &amp;ldquo;1476-4679&amp;rdquo;
prism:issn = &amp;ldquo;0028-0836&amp;rdquo;
prism:publicationName = &amp;ldquo;Nature&amp;rdquo;
&lt;code>&amp;lt;/pre&amp;gt;&lt;/code>&lt;/p>
&lt;pre>&lt;code>`&amp;lt;span &amp;gt;`Full dumps of the &amp;amp;#8220;before&amp;amp;#8221; and &amp;amp;#8220;after&amp;amp;#8221; PDFs are available here:
*`&amp;lt;span &amp;gt;&amp;lt;i&amp;gt;`DumpMainXMP`&amp;lt;/i&amp;gt;` - `&amp;lt;a href=&amp;quot;https://web.archive.org/web/20080821103510/http://nurture.nature.com/tony/xmp/445037a.xmpp.0&amp;quot;&amp;gt;`before`&amp;lt;/a&amp;gt;` and `&amp;lt;a href=&amp;quot;https://web.archive.org/web/20080821103719/http://nurture.nature.com/tony/xmp/445037a.xmpp.1&amp;quot;&amp;gt;`after`&amp;lt;/a&amp;gt;`
`&amp;lt;pre&amp;gt;&amp;lt;/pre&amp;gt;`
&lt;/code>&lt;/pre>
&lt;ul>
&lt;li>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&amp;lt;i&amp;gt;&lt;/code>DumpScannedXMP&lt;code>&amp;lt;/i&amp;gt;&lt;/code> - &lt;code>&amp;lt;a href=&amp;quot;https://web.archive.org/web/20080821103510/http://nurture.nature.com/tony/xmp/445037a.xmpp.0&amp;quot;&amp;gt;&lt;/code>before&lt;code>&amp;lt;/a&amp;gt;&lt;/code> and &lt;code>&amp;lt;a href=&amp;quot;https://web.archive.org/web/20080821103719/http://nurture.nature.com/tony/xmp/445037a.xmpp.1&amp;quot;&amp;gt;&lt;/code>after&lt;code>&amp;lt;/a&amp;gt;&amp;lt;span &amp;gt;&lt;/code>Note also that in the dump above some of the DC terms are interpreted by the XMP toolkit to have structured formats, i.e. are recognized as array members, and have language and ordering attributes. This seems to be an artefact of the toolkit as the RDF did not specify these structurings. Note also that the PRISM values were not similarly interpreted as the PRISM schema is not registered with the toolkit.&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>Obviously, there’s much more to be learned yet. I’ll post an update to this later, but meantime it would be very interesting to get feedback from others on experiences they may have with XMP or any opinions they may want to share. I think it all looks very promising although tools are somewhat restricted.&lt;/p>
&lt;/li>
&lt;/ul></description></item><item><title>Publishing Linked Data</title><link>https://www.crossref.org/blog/publishing-linked-data/</link><pubDate>Thu, 19 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/publishing-linked-data/</guid><description>&lt;p>With these words:&lt;/p>
&lt;blockquote>
&lt;p>_“There was quite some interest in Linked Data at this year’s World Wide&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Web Conference (WWW2007). Therefore, Richard Cyganiak, Tom Heath and I&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>decided to write a tutorial about how to publish Linked Data on the&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Web, so that interested people can find all relevant information, best&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>practices and references in a single place.”_&lt;/p>
&lt;/blockquote>
&lt;p>Chris Bizer announces this draft &lt;a href="https://web.archive.org/web/20120315113002/http://www4.wiwiss.fu-berlin.de/bizer/pub/LinkedDataTutorial/" target="_blank">How to Publish Linked Data on the Web&lt;/a>. It’s a bright and breezy tutorial and useful (to me, anyway) for disclosing a couple of links:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://www.w3.org/2001/tag/findings" target="_blank">Findings of the W3C TAG&lt;/a>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://www.w3.org/DesignIssues/LinkedData.html" target="_blank">Linked Data - Design Issues&lt;/a> &lt;/ul>
The tutorial is unsurprisingly orthodox in its advocacy for all things HTTP and goes on to say:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“In the context of Linked Data, we restrict ourselves to using HTTP URIs only and avoid other URI schemes such as URNs and DOIs.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>But this only relates back to Berners-Lee’s piece on Linked Data referenced above in which he says:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“The second rule, to use HTTP URIs, is also widely understood. The only deviation has been, since the web started, a constant tendency for people to invent new URI schemes (and sub-schemes within the urn: scheme) such as LSIDs and handles and XRIs and DOIs and so on, for various reasons. Typically, these involve not wanting to commit to the established Domain Name System (DNS) for delegation of authority but to construct something under separate control. Sometimes it has to do with not understanding that HTTP URIs are names (not addresses) and that HTTP name lookup is a complex, powerful and evolving set of standards. This issue discussed at length elsewhere, and time does not allow us to delve into it here.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Hmm. Does make one wonder where the concept of URI ever arose. Surely the nascent WWW application should have mandated the exclusive use of HTTP identifiers? Seems that this concept snuck up on us somehow and we now have to put it back into the box. Pandora, indeed!&lt;/p>
&lt;p>Back to the tutorial there are some unorthodox terms or at least I had not heard of them before. Contrasted with the defined term &lt;strong>information resources&lt;/strong> (from &lt;a href="http://www.w3.org/TR/webarch/" target="_blank">AWWW&lt;/a>) is the undefined term “non-information resources”. Further on, there’s a distinction made between two types of RDF triple: “literal triples” and “RDF links”. I hadn’t heard of either of these terms before although they are presented as if they were in common usage. The tutorial then goes on to deprecate the use of certain RDF features because it makes it “easier for clients”. So, I guess that the full expressivity of RDF is either not required or the world of “linked data” is not quite so large as it would like to be.&lt;/p>
&lt;p>And later on, there’s this puzzling injunction:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“You should only define terms that are not already defined within well-known vocabularies. In particular this means not defining completely new vocabularies from scratch, but instead extending existing vocabularies to represent your data as required.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Am I wrong, or is there something of a Catch 22 there? To extend an arbitrary vocabulary I would need to be the namespace authority - to be the “URI owner” in W3C speak. But I can’t be the authority for all namespaces/vocabularies because by the intent of the above they would likely be just the one (true?) vocabulary which I may or may not be the authority for. I thought the intent of the RDF model and XML namespaces was that terms could be applied from disparate vocabularies to the description at hand.&lt;/p>
&lt;p>Anyways, I am not trying to knock the draft. It’s something of a curate’s egg, that’s true, but I am genuinely looking forward to reading it through and would encourage others to have a look at it too.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>PURL Redux</title><link>https://www.crossref.org/blog/purl-redux/</link><pubDate>Thu, 12 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/purl-redux/</guid><description>&lt;p>Seems that there’s life in the old dog yet. :~) See &lt;a href="http://outgoing.typepad.com/outgoing/2007/07/purl2.html" target="_blank">this post&lt;/a> about PURL from Thom Hickey, OCLC, This extract:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>OCLC has contracted with Zepheira to reimplement the PURL code which has become a bit out of date over the years. The new code will be in written in Java and released under the Apache 2.0 license.&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>BioNLP 2007</title><link>https://www.crossref.org/blog/bionlp-2007/</link><pubDate>Tue, 10 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/bionlp-2007/</guid><description>&lt;p>Just posted on Nascent a &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2007/07/otmi_at_bionlp_2007.html" target="_blank">brief account&lt;/a> of a presentation I gave recently on &lt;a href="http://opentextmining.org/" target="_blank">OTMI&lt;/a> at &lt;a href="http://ufal.mff.cuni.cz/acl2007/workshops/program/index.php/ws05" target="_blank">BioNLP 2007&lt;/a>. The post lists some of the feedback I received. We are very interested to get further comments so do feel free to contribute comments either directly to the post, privately to &lt;a href="mailto:otmi@nature.com">otmi@nature.com&lt;/a>, or publicly to &lt;a href="mailto:otmi-discuss@crossref.org">otmi-discuss@crossref.org&lt;/a>. And then there’s always the OTMI wiki available for comment at &lt;a href="http://opentextmining.org/" target="_blank">http://opentextmining.org/&lt;/a>.&lt;/p>
&lt;p>It is important to note that OTMI is not a universal panacea but rather an attempt at bridging the gap between publisher and researcher. We are attempting to provide a framework to enable scholarly publishers to disclose full text for machine processing purposes without compromising their normal publishing obligations.&lt;/p></description></item><item><title>IBM Article on PRISM</title><link>https://www.crossref.org/blog/ibm-article-on-prism/</link><pubDate>Tue, 10 Jul 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/ibm-article-on-prism/</guid><description>&lt;p>Nice entry article on PRISM &lt;a href="http://www.ibm.com/developerworks/xml/library/x-think13.html" target="_blank">here&lt;/a> by Uche Ogbuji, Fourthought Inc. on IBM’s DeveloperWorks.&lt;/p></description></item><item><title>Oh, shiny!</title><link>https://www.crossref.org/blog/oh-shiny/</link><pubDate>Mon, 02 Jul 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/oh-shiny/</guid><description>&lt;p>The other day Ed and I visited the &lt;a href="http://www.oecd.org/" target="_blank">OECD&lt;/a> to talk about all things e-publishig. At the end of our our meeting, Toby Green, the OECD’s head of publishing, handed all 30+ meeting attendees a copy of their well-known &lt;a href="https://www.oecd.org/en/publications/oecd-factbook-2015-2016_factbook-2015-en.html" target="_blank">OECD Factbook&lt;/a>- on a USB stick.&lt;/p>
&lt;p>Before you dismiss this as a gimick- note that organisations like the OECD get a lot of political and marketing mileage with “leave behinds”- print copies of their key reports, conference proceedings and reference works. While researchers might prefer electronic versions of the publications for their day-to-day work, print versions of the same publications seemed to continue to play a critical role as an “awareness tool.” I know that, for this very reason, several NGO/IGOs that I’ve spoken to have despaired of ever ramping down their print operations.&lt;/p>
&lt;p>I think that the OECD might have figured out a solution to this dilemma. It’s difficult to describe how viscerally satisfying it was to receive one of these Factbook USB-sticks. From the way in which the other meeting attendees swarmed around Toby as he was handing them out, I think that they might have had the same reaction.&lt;/p>
&lt;p>As we headed back to London on the Eurostar, I almost immediately popped the USB stick into my laptop and started browsing through the Factbook, much as I would have thumbed through a print version of the same (although -truth be told- I would have been tempted to conveniently “forget” the print version in order to not have to shlep it from Paris back to Oxford).&lt;/p>
&lt;p>In short, I think the system works. Kudos to the OECD for a simple, inexpensive and creative experiment in e-publishing.&lt;/p></description></item><item><title>OASIS Announces Search Web Services TC</title><link>https://www.crossref.org/blog/oasis-announces-search-web-services-tc/</link><pubDate>Fri, 15 Jun 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/oasis-announces-search-web-services-tc/</guid><description>&lt;p>&lt;a href="https://www.oasis-open.org/" target="_blank">OASIS&lt;/a> has just &lt;a href="http://lists.oasis-open.org/archives/tc-announce/200706/msg00008.html" target="_blank">announced&lt;/a> a technical committee for standardising search services. This from the Call for Participation:&lt;/p>
&lt;blockquote>
&lt;p>_&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>b. Purpose&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>To define Search and Retrieval Web Services, combining various current and&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>ongoing web service activities.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Within recent years there has been a growth in activity in the development of&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>web service definitions for search and retrieval applications. These include&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>SRU, a web service based in part on the NISO/ISO Search and Retrieval standards;&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>the Amazon OpenSearch, which defines a means of describing and automating search&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>web forms; as well as many proprietary definitions (e.g. the Google and MSN&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Search APIs). There are also a number of activities for defining abstract search&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>APIs that can be mapped onto multiple implementations either within native code&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>or onto remote procedural calls and web services, such as ZOOM (Z39.50 Object&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Oriented Model); SQI (Simple Query Interface), an IEEE standard developed for&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>searching and retrieval in the IMS (Instructional Management Systems) space; and&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>OSIDs (Open Service Interface Definitions from the Open Knowledge Initiative.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>While abstract APIs would be out of scope, these would inform the work to&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>increase interoperability and compatibility.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>_&lt;/p>
&lt;/blockquote></description></item><item><title>IDF Open Meeting: Innovative uses of the DOI system</title><link>https://www.crossref.org/blog/idf-open-meeting-innovative-uses-of-the-doi-system/</link><pubDate>Fri, 08 Jun 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/idf-open-meeting-innovative-uses-of-the-doi-system/</guid><description>&lt;p>Please see the details of the IDF Annual Meeting and a related Handle System Workshop in Washington, DC on June 21 which may be of interest - &lt;a href="http://www.crossref.org/crweblog/2007/06/international_doi_foundation_a.html" target="_blank">http://www.crossref.org/crweblog/2007/06/international_doi_foundation_a.html&lt;/a>&lt;/p></description></item><item><title>Resource Maps</title><link>https://www.crossref.org/blog/resource-maps/</link><pubDate>Tue, 05 Jun 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/resource-maps/</guid><description>&lt;img alt="nyc1.jpg" src="https://www.crossref.org/wp/blog/images/nyc1.jpg" width="272" height="204" />
&lt;p>Last week we had a second face-to-face of the &lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a> (Open Archives Initiative – Object Reuse and Exchange) Technical Committee in New York, the meeting being hosted courtesy of Google. (Hence the snap here taken from the terrace of Google’s canteen with its gorgeous view of midtown Manhattan. And the food’s not too shabby either. ;~)&lt;/p>
&lt;p>The main input to the meeting was this discussion document: &lt;a href="http://www.openarchives.org/ore/documents/CompoundObjects-200705.html" target="_blank">Compound Information Objects: The OAI-ORE Perspective&lt;/a>. This document we feel has now reached a level of maturity that we wanted to share with a wider audience. We invite feedback either directly at &lt;a href="mailto:ore@openarchives.org.">ore@openarchives.org&lt;/a> or indirectly via &lt;a href="mailto:t.hammond@nature.com">yours truly&lt;/a>.&lt;/p>
&lt;p>The document attempts to describe the problem domain - that of describing a scholarly publication as an aggregation of resources on the Web - and to put that squarely into the Web architecture context. What the initiative is seeking to provide is machine descriptions of those resources and their relationships, something that we are inclining to call “resource maps” and as underpinning we are making use of the notion of “named graphs” from ongoing semantic web research. Essentially these resource maps are machine-readable descriptions of participating resources (in a scholarly object - both core resources and related resources) and the relationships between those resources, the whole set of assertions about those resources being named (i.e. having a URI as identifier) and having provenance information attached, e.g. publisher, date of publication, version information (still under discussion). It is envisaged that these compound object descriptions may be available in a variety of serializations from a published, object-specific URL (i.e. a good old-fashioned Web address) but some honest-to-goodness XML serialization is a likely to be one of the candidates. No surprises here, then.&lt;/p>
&lt;p>Below is a schematic from the paper which shows the publication of a resource map (or named graph) corresponding to the compound object which logically represents a scholarly publication. For those objects of immediate interest to Crossref these would likely be identified with DOI’s although there is no restriction in OAI-ORE on the identifier to be used - other than it be a URI.&lt;/p>
&lt;img alt="named_graph.png" src="https://www.crossref.org/wp/blog/images/named_graph.png" width="460" height="160" />
&lt;p>&lt;strong>Update:&lt;/strong> For a couple posts from some other members of the ORE TC see &lt;a href="https://web.archive.org/web/20160404104320/http://dltj.org/article/thoughts-on-compound-documents/" target="_blank">here&lt;/a> (Peter Murray, OhioLINK) and &lt;a href="https://web.archive.org/web/20160410040938/http://efoundations.typepad.com/efoundations/2007/06/refining_ore.html" target="_blank">here&lt;/a> (Pete Johnston, Eduserv).&lt;/p></description></item><item><title>RSC’s Project Prospect v1.1</title><link>https://www.crossref.org/blog/rscs-project-prospect-v1.1/</link><pubDate>Thu, 31 May 2007 00:00:00 +0000</pubDate><author>rkidd</author><guid>https://www.crossref.org/blog/rscs-project-prospect-v1.1/</guid><description>&lt;p>We updated our &lt;a href="https://web.archive.org/web/20070401173200/http://www.rsc.org/Publishing/Journals/ProjectProspect/index.asp" target="_blank">Project Prospect&lt;/a> articles today to release v1.1, with a pile of look &amp;amp; feel improvements to the HTML views and links. The most interesting technical addition is the launch of our enhanced RSS feeds, where we have updated our &lt;a href="https://pubs.rsc.org/en/EAlerts/RssFeed" target="_blank">existing feeds&lt;/a> for enhanced articles. These now include ontology terms and primary compounds both visually (as text terms and 2D images) and within the RDF - using the OBO in OWL representation and the info:inchi specification mentioned here by Tony only a few weeks ago.&lt;/p>
&lt;p>The enhanced entries will soon become more common as we concentrate our enhancements on our Advance Articles, but the current example below from our &lt;a href="https://pubs.rsc.org/en/ealerts/rssfeed" target="_blank">Photochemical and Photobiological Sciences feed&lt;/a> is lovely. RDF code after the jump - just as beautiful to the parents…&lt;/p>
&lt;img alt="ProspectRSS.jpg" src="https://www.crossref.org/wp/blog/images/ProspectRSS.jpg" width="395" height="517" />
&lt;p>So the RDF code for the OBO terms and InChIs looks like this:&lt;/p>
&lt;pre tabindex="0">&lt;code>&amp;lt;tt&amp;gt;&amp;lt;br /&amp;gt; &amp;lt;rdf:li&amp;gt;&amp;lt;br /&amp;gt; &amp;lt;content:item rdf:about=&amp;#34;info:inchi/InChI=1/C20H28O/c1-16(8-6-9-17(2)13-15-21)11-12-19-18(3)10-7-14-20(19,4)5/h6,8-9,11-13,15H,7,10,14H2,1-5H3/b9-6-,12-11+,16-8+,17-13+&amp;#34;/&amp;gt;&amp;lt;br /&amp;gt; &amp;lt;/rdf:li&amp;gt;&amp;lt;br /&amp;gt; &amp;lt;rdf:li&amp;gt;&amp;lt;content:item rdf:about=&amp;#34;http://purl.org/obo/owl/CL#CL:0000210&amp;#34;/&amp;gt;&amp;lt;br /&amp;gt; &amp;lt;/rdf:li&amp;gt;&amp;lt;br /&amp;gt; &amp;lt;/tt&amp;gt;
&lt;/code>&lt;/pre>&lt;p>We now have over five hundred 2007 articles enhanced, so we’ve brought the majority back into controlled access. There are always &lt;a href="https://web.archive.org/web/20081004073354/http://www.rsc.org/Publishing/Journals/ProjectProspect/Examples.asp" target="_blank">examples&lt;/a> from each journal freely available.&lt;/p></description></item><item><title>OAI-ORE Presentation at OAI5</title><link>https://www.crossref.org/blog/oai-ore-presentation-at-oai5/</link><pubDate>Wed, 02 May 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/oai-ore-presentation-at-oai5/</guid><description>&lt;img alt="oai-ore-1.jpg" src="https://www.crossref.org/wp/blog/images/oai-ore-1.jpg" width="308" height="241" />
&lt;p>I posted &lt;a href="https://www.crossref.org/blog/digital-objects/">here&lt;/a> about an initial meeting of the &lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a> Technical WG back in January. ORE is the “Object Reuse and Exchange” initiative which is aiming to provide a formalism for describing scholarly works as complete units (or packages) of information on the Web using resource maps which would be available from public access points. From a DOI perspective this work is intimately connected with multiple resolution. For further updates on this work, see &lt;a href="https://indico.cern.ch/event/5710/contributions/1212289/attachments/988175/1405153/ore-oai5-hvds.pdf" target="_blank">here&lt;/a> for a presentation by Herbert Van de Sompel on OAI-ORE at the OAI5 Workshop (5th Workshop on Innovations in Scholarly Communication) held a couple weeks back at CERN, Geneva, Switzerland.&lt;/p>
&lt;p>The presentation gives an insight regarding the problem domain in which ORE operates, and in the evolving thinking regarding potential solutions. The presentation was recorded on video and is available for both streaming and download (&lt;a href="https://web.archive.org/web/20070709065314/http://indico.cern.ch/" target="_blank">slides&lt;/a>, &lt;a href="https://web.archive.org/web/20070709065314/http://indico.cern.ch/" target="_blank">streaming video&lt;/a>, &lt;a href="https://web.archive.org/web/20070709065314/http://indico.cern.ch/" target="_blank">video download&lt;/a>).&lt;/p>
&lt;p>Note that Michael Nelson of Old Dominion University also presented on behalf of the ORE effort at the recent &lt;a href="https://web.archive.org/web/20110106125135/http://www.cni.org/tfms/2007a.spring/abstracts/PB-update-lagoze.html" target="_blank">CNI Task Force Meeting&lt;/a> and at the &lt;a href="https://web.archive.org/web/20070604163358/http://www.diglib.org/forums/spring2007/spring2007abstracts.htm" target="_blank">DLF Forum&lt;/a>.&lt;/p></description></item><item><title>A Modest Proposal</title><link>https://www.crossref.org/blog/a-modest-proposal/</link><pubDate>Wed, 11 Apr 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/a-modest-proposal/</guid><description>&lt;p>Was just reminded (thanks, Tim) of the possibility of using a special tag in bookmarking services to tag links to documents of interest to a given community. I think this is a fairly well-established practice. Note that e.g. the &lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a> project is using &lt;a href="https://web.archive.org/web/20061205061750/http://www.connotea.org/" target="_blank">Connotea&lt;/a> to bookmark pages of interest and tagging them “&lt;strong>oaiore&lt;/strong>” which can then be easily retrieved using the link &lt;a href="http://web.archive.org/web/20160402182544/http://www.connotea.org/" target="_blank">http://web.archive.org/web/20160402182544/http://www.connotea.org/&lt;/a>.&lt;/p>
&lt;p>I would suggest that Crossref members might like to consider using the tag “&lt;strong>crosstech&lt;/strong>” in bookmarking pages about publishing technology, so that the following links might be used to retrieve documents of interest to this readership:&lt;/p>
&lt;ul>
&lt;li>del.icio.us - &amp;lt;https://web.archive.org/web/20071206033322/https://del.icio.us/
&lt;ul>
&lt;li>CiteULike - &lt;a href="http://www.citeulike.org/tag/crosstech" target="_blank">http://www.citeulike.org/tag/crosstech&lt;/a>
&lt;ul>
&lt;li>Connotea - &lt;a href="http://web.archive.org/web/20160402182544/http://www.connotea.org/" target="_blank">http://web.archive.org/web/20160402182544/http://www.connotea.org/&lt;/a>
&lt;ul>
&lt;li>etc. &lt;/ul>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Citing Data Sets</title><link>https://www.crossref.org/blog/citing-data-sets/</link><pubDate>Fri, 30 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/citing-data-sets/</guid><description>&lt;p>This &lt;a href="http://www.dlib.org/dlib/march07/altman/03altman.html" target="_blank">D-Lib paper&lt;/a> by Altman and King looks interesting: &lt;em>“A Proposed Standard for the Scholarly Citation of Quantitative Data”&lt;/em>. (And thanks to &lt;a href="http://public.lanl.gov/herbertv/" target="_blank">Herbert Van de Sompel&lt;/a> for drawing attention to the paper.) Gist of it (Sect. 3) is&lt;/p>
&lt;blockquote>
&lt;p>_“We propose that citations to numerical data include, at a minimum, six required components. The first three components are traditional, directly paralleling print documents. … Thus, we add three components using modern technology, each of which is designed to persist even when the technology changes: a unique global identifier, a universal numeric fingerprint, and a bridge service. They are also designed to take advantage of the digital form of quantitative data.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>An example of a complete citation, using this minimal version of the proposed standards, is as follows:&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>**Micah Altman; Karin MacDonald; Michael P. McDonald, 2005, “Computer Use in Redistricting”,&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>hdl:1902.1/AMXGCNKCLU UNF:3:J0PkMygLPfIyT1E/8xO/EA==&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;code>http://id.thedata.org/hdl%3A1902.1%2FAMXGCNKCLU&lt;/code>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>“_&lt;/p>
&lt;/blockquote>
&lt;p>So the abbreviated citation (author, date, title, unique ID) is supplemented by a &lt;a href="https://web.archive.org/web/20061006030921/http://cran.r-project.org/src/contrib/Descriptions/UNF.html" target="_blank">UNF&lt;/a> which fingerprints the data. UNFs would appear to be a sort of super MD5 in providing a signature of the data content independent of the data serialization to a filestore.&lt;/p>
&lt;blockquote>
&lt;p>_“Thus, we add as the fifth component a Universal Numeric Fingerprint or UNF. The UNF is a short, fixed-length string of numbers and characters that summarize all the content in the data set, such that a change in any part of the data would produce a completely different UNF. A UNF works by first translating the data into a canonical form with fixed degrees of numerical precision and then applies a cryptographic hash function to produce the short string. The advantage of canonicalization is that UNFs (but not raw hash functions) are format-independent: they keep the same value even if the data set is moved between software programs, file storage systems, compression schemes, operating systems, or hardware platforms.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>…&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Finally, since most web browsers do not currently recognize global unique identifiers directly (i.e., without typing them into a web form), we add as the sixth and final component of the citation standard a bridge service, which is designed to make this task easier in the medium term.”_&lt;/p>
&lt;/blockquote>
&lt;p>Certainly looks promising. I’m not sure if there’s any other contestants in this arena.&lt;/p></description></item><item><title>Crossref Forward Linking Webinar</title><link>https://www.crossref.org/blog/crossref-forward-linking-webinar/</link><pubDate>Thu, 29 Mar 2007 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/crossref-forward-linking-webinar/</guid><description>&lt;p>The next Crossref Forward Linking Webinar is coming on Monday April 30th , 2007 at 12:00pm.&lt;/p>
&lt;p>Registration is now available: [The next Crossref Forward Linking Webinar is coming on Monday April 30th , 2007 at 12:00pm.&lt;/p>
&lt;p>Registration is now available:]&lt;a href="https://www.crossref.org/webinars" target="_blank">1&lt;/a>&lt;/p>
&lt;p>Agenda is coming soon.&lt;/p></description></item><item><title>Markup for DOIs</title><link>https://www.crossref.org/blog/markup-for-dois/</link><pubDate>Thu, 29 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/markup-for-dois/</guid><description>&lt;p>Following up on his earlier &lt;a href="http://allmyeye.blogspot.com/2007/03/persistent-linking-web-crawlers-and.html" target="_blank">post&lt;/a> (which was also blogged to CrossTech &lt;a href="https://www.crossref.org/blog/indexing-urls/">here&lt;/a>), Leigh Dodds is now [Following up on his earlier &lt;a href="http://allmyeye.blogspot.com/2007/03/persistent-linking-web-crawlers-and.html" target="_blank">post&lt;/a> (which was also blogged to CrossTech &lt;a href="https://www.crossref.org/blog/indexing-urls/">here&lt;/a>), Leigh Dodds is now]&lt;a href="http://allmyeye.blogspot.com/2007/03/persistent-links-in-bookmarks.html" target="_blank">3&lt;/a> the possibility of using machine-readable auto-discovery type links for DOIs of the form&lt;/p>
&lt;p>&lt;tt>&lt;br /> &lt;link rel="bookmark" title="DOI" href="http://dx.doi.org/10.1000/1"/>&lt;br /> &lt;/tt>&lt;/p>
&lt;p>These &lt;code>LINK&lt;/code> tags are placed in the document &lt;code>HEAD&lt;/code> section and could be used by crawlers and agents to recognize the work represented by the current document. This sounds like a great idea and we’d like to hear feedback on it.&lt;/p>
&lt;p>Concurrently at Nature we have also been considering how best to mark up in a machine-readable way DOIs appearing &lt;em>within&lt;/em> a document page &lt;code>BODY&lt;/code>. Current thinking is to do something along the following lines:&lt;/p>
&lt;p>&lt;tt>&lt;br /> &lt;a href="http://dx.doi.org/10.1038/nprot.2007.43">&lt;br /> &lt;abbr title="Digital Object Identifier">doi&lt;/abbr>:&lt;br /> &lt;abbr class="uri" id="doi" title="info:doi/10.1038/nprot.2007.43">10.1038/nprot.2007.43&lt;/abbr>&lt;br /> &lt;/a>&lt;br /> &lt;/tt>&lt;/p>
&lt;p>which allows the DOI to be presented in the preferred Crossref citation format (&lt;code>doi:10.1038/nprot.2007.43&lt;/code>), to be hyperlinked to the handle proxy server (&lt;code>&amp;lt;a href=&amp;quot;http://dx.doi.org/10.1038/nprot.2007.43&amp;quot;&amp;gt;http://dx.doi.org/10.1038/nprot.2007.43&amp;lt;/a&amp;gt;&lt;/code>), and to refer to a validly registered URI form for the DOI (&lt;code>info:doi/10.1038/nprot.2007.43&lt;/code>). Again, we would be real interested to hear any opinions on this proposal for inline DOI markup as well as on Leigh’s proposal for document-level DOI markup.&lt;/p>
&lt;p>(Oh, and btw many congrats to Leigh on his recent &lt;a href="https://web.archive.org/web/20081014005906/http://eyetoeye.ingenta.com/publisher/issue18/news-dodds.htm" target="_blank">promotion&lt;/a> to CTO, Ingenta.)&lt;/p></description></item><item><title>Publishing 2.0</title><link>https://www.crossref.org/blog/publishing-2.0/</link><pubDate>Thu, 29 Mar 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/publishing-2.0/</guid><description>&lt;p>XML:UK is holding a one-day conference entitled titled “&lt;a href="https://web.archive.org/web/20070630093957/http://www.xmluk.org/publishing20407.htm" target="_blank">Publishing 2.0&lt;/a>” at &lt;a href="http://www.bletchleypark.org.uk/" target="_blank">Bletchley Park&lt;/a> on Wednesday 25th April 2007. Bletchley Park was the location of the United Kingdom’s main codebreaking establishment during the Second World War and is now a museum (and has a train station!). The event will examine some of the more cutting-edge applications of XML technology to publishing. With keynotes by Sean McGrath and Kate Warlock and a series of must-see presentations, this will be the place to be on the last Wednesday in April.&lt;/p></description></item><item><title>Welcome to &amp;#8220;Otmi-discuss&amp;#8221;</title><link>https://www.crossref.org/blog/welcome-to-otmi-discuss/</link><pubDate>Fri, 23 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/welcome-to-otmi-discuss/</guid><description>&lt;p>Just a quick note to mention that we’ve now set up a new mailing list &lt;a href="https://web.archive.org/web/20070503161800/http://mailserv.crossref.org:8800/mailman/listinfo/otmi-discuss" target="_blank">otmi-discuss@crossref.org&lt;/a> for public discussion of &lt;a href="http:/opentextmining.org/" target="_blank">OTMI&lt;/a> - the Open Text Mining Interface proposed by Nature. See the list information page &lt;a href="https://web.archive.org/web/20070503161800/http://mailserv.crossref.org:8800/mailman/listinfo/otmi-discuss" target="_blank">here&lt;/a> for details on subscribing to the list and to access the mail archives.&lt;/p>
&lt;p>And many thanks to the Crossref folks for hosting this for us!&lt;/p></description></item><item><title>XMP Capabilities Extended</title><link>https://www.crossref.org/blog/xmp-capabilities-extended/</link><pubDate>Thu, 22 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/xmp-capabilities-extended/</guid><description>&lt;p>This &lt;a href="https://web.archive.org/web/20070908031425/http://blogs.adobe.com/creativesolutionspr/2007/03/adobe_extends_xmp_capabilities.html" target="_blank">post&lt;/a> on Adobe’s Creative Solutions PR blog may be worth a gander:&lt;/p>
&lt;blockquote>
&lt;p>_“This new update, the Adobe XMP 4.1, provides new libraries for developers to read, write and update XMP in popular image, document and video file formats including: JPEG, PSD, TIFF, AVI, WAV, MPEG, MP3, MOV, INDD, PS, EPS and PNG. In addition, the rewritten XMP 4.1 libraries have been optimized into two major components, the XMP Core and the XMP Files.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>The XMP Core enables the parsing, manipulating and serializing of XMP data, and the XMP Files enables the reading, rewriting, and injecting serialized XMP into the multiple file formats. The XMP Files can be thought of as a “file I/O” component for reading and writing the metadata that is manipulated by the XMP Core component.&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>Supported development environments for Adobe’s XMP 4.1 are: XCode 2.3 for Macintosh universal binaries, Visual Studio 2005 (VC8) for Windows, and Eclipse 3.x on any available platform. The XMP Core is available as C++ and Java sources with project files for the Macintosh, Windows and Linux platform. A Java version of XMP Files is under consideration for a future update.”_&lt;/p>
&lt;/blockquote>
&lt;p>And now I just read that last sentence again: &lt;strong>&lt;em>“A Java version of XMP Files is under consideration for a future update.”&lt;/em>&lt;/strong> So, how hard do they really want to make uptake of XMP be? Am surprised they’re even still considering offering full Java support, and not offering also anything in the way of support for glue languages such as Perl, Python, or Ruby.&lt;/p>
&lt;p>Which leads to the question: Is anybody here using XMP and had any success to relate or lessons for the rest of us?&lt;/p></description></item><item><title>SIIA Executive FaceTime Webcast Series</title><link>https://www.crossref.org/blog/siia-executive-facetime-webcast-series/</link><pubDate>Wed, 21 Mar 2007 00:00:00 +0000</pubDate><author>Anna Tolwinska</author><guid>https://www.crossref.org/blog/siia-executive-facetime-webcast-series/</guid><description>&lt;p>We thought that this program might interest our CrossTech bloggers.&lt;/p>
&lt;p>Howard Ratner, Chief Technology Officer, Executive Vice-President at Nature Publishing Group is on the agenda.&lt;/p>
&lt;p>More information is available at: &lt;a href="https://web.archive.org/web/20070322234448/http://www.siia.net/" target="_blank">https://web.archive.org/web/20070322234448/http://www.siia.net/&lt;/a>.&lt;/p>
&lt;p>SIIA Executive FaceTime Webcast Series&lt;/p>
&lt;p>Howard Ratner, EVP/CTO, Nature Publishing Group&lt;/p>
&lt;p>Wednesday, March 28, 2007&lt;/p>
&lt;p>12:00PM – 1:30PM EST&lt;/p>
&lt;p>The SIIA is pleased to announce that Howard Ratner of Nature Publishing Group will be our guest for the upcoming Executive FaceTime. This live webcast series features one-on-one conversations between leading industry executives and host Hal Espo. Participation is encouraged, the web audience is invited to submit questions posed through the host. Past guests include Tad Smith, CEO of Reed Business Information and L. Gordon Crovitz, EVP of Dow Jones &amp;amp; Company. Registration is free to SIIA members and non-members alike; to participate, you must register by the end of the day on Tuesday, March 27th.&lt;/p>
&lt;p>Howard Ratner&lt;/p>
&lt;p>Howard Ratner is Chief Technology Officer, Executive Vice-President, for the Nature Publishing Group. Based in New York, Howard is in charge of NY operations and has global responsibilities for Production and Manufacturing, Web Development, Web Services, Content Services, and Information Technology across all NPG products. Howard’s prior positions include Director, Electronic Publishing &amp;amp; Production for Springer-Verlag New York, as well as the North American Manager for LINK, and a member of the production staff at John Wiley &amp;amp; Sons. He also serves on the Crossref board, PubMed Central, CORDS and LOCKSS advisory committees, and is a former chair for both the AAP/PSP DOI subcommittee and the DOI-X project.&lt;/p>
&lt;p>Hal Espo&lt;/p>
&lt;p>Hal Espo is President of Contextual Connections, LLC, a NYC-based consultancy which focuses exclusively in the digital services arena, including digital content, distribution, and applications. Hal has more than 25 years experience as an operating executive as well as a business and product development professional in the electronic information industry. He served as Chief Operating Officer of Index Stock Imagery, Inc., a web-based commercial stock photography and illustration vendor, and previously was the Chief Operating Officer at CORSEARCH, Inc., a trademark research firm serving Fortune 500 companies and law firms.&lt;/p></description></item><item><title>Agile Descriptions</title><link>https://www.crossref.org/blog/agile-descriptions/</link><pubDate>Tue, 20 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/agile-descriptions/</guid><description>&lt;p>Apologies to blog yet another of my posts to Nascent, this time on &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2007/03/agile_descriptions_new.html" target="_blank">Agile Descriptions&lt;/a> - a talk I gave the week before last before the LC Future of Bibliographic Control WG. (Don’t worry - I shan’t be making it a habit of this.) But certain aspects of the talk (powerpoint is &lt;a href="https://web.archive.org/web/20070630200452/http://nurture.nature.com/tony/ppt/agile-descriptions.ppt" target="_blank">here&lt;/a>) may be interesting to this readership, in particular the slides on microformats and how these are tentatively being deployed on &lt;a href="https://web.archive.org/web/20070609090207/http://network.nature.com/" target="_blank">Nature Network&lt;/a>, and also a detailed anatomy of &lt;a href="http://opentextmining.org/" target="_blank">OTMI&lt;/a> files.&lt;/p></description></item><item><title>New-Look Web Feeds from Nature</title><link>https://www.crossref.org/blog/new-look-web-feeds-from-nature/</link><pubDate>Thu, 15 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/new-look-web-feeds-from-nature/</guid><description>&lt;p>I just posted &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2007/03/nature_web_feeds_a_new_look.html" target="_blank">this entry&lt;/a> on &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/" target="_blank">Nascent&lt;/a>, Nature’s Web Publishing blog, about Nature’s new look for web feeds which essentially boils down to our using the RSS 1.0 ‘mod_content’ module to add in a rich content description for human consumption to complement our long-standing commitment to machine-readable descriptions. We are thus able to deliver full citation details in our RSS feeds as XHTML in CDATA sections for humans and as DC/PRISM properties for machines, the whole encoded in our feed format of choice - RSS 1.0. Note also that we declared our intention to publish parallel feeds in Atom which again will carry both human- and machine-readable citations. Further details on the RSS 1.0/Atom paired feeds will be posted here in the near future.&lt;/p>
&lt;p>Perhaps of special note we have added in the DOI in our descriptions in standard Crossref citation format and linked it to the DX resolver.&lt;/p></description></item><item><title>Indexing URLs</title><link>https://www.crossref.org/blog/indexing-urls/</link><pubDate>Thu, 08 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/indexing-urls/</guid><description>&lt;p>Leigh Dodds proposes &lt;a href="http://allmyeye.blogspot.com/2007/03/persistent-linking-web-crawlers-and.html" target="_blank">in this post&lt;/a> some solutions to persistent linking using web crawlers and social bookmarking.&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“When I use del.icio.us, CiteULike, or Connotea or other social bookmarking service, I end up bookmarking the URL of the site I’m currently using. Its this specific URL that goes into their database and associated with user-assigned tags, etc.&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;em>…&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;blockquote>
&lt;p>&lt;em>A more generally applicable approach to addressing this issue, one that is not specific to academic publishing, would be to include, in each article page, embedded metadata that indicates the preferred bookmark link. The DOI could again be pressed into service as the preferred bookmarking link.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>He’s inviting feedback. I’d certainly like to hear what others may think of these suggestions.&lt;/p></description></item><item><title>eprintweb.org</title><link>https://www.crossref.org/blog/eprintweb.org/</link><pubDate>Fri, 02 Mar 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/eprintweb.org/</guid><description>&lt;p>IOP has created an instance of the arXiv repository called eprintweb.org at &lt;a href="https://web.archive.org/web/20130803071935/http://eprintweb.org/S/" target="_blank">https://web.archive.org/web/20130803071935/http://eprintweb.org/S/&lt;/a>. What’s the difference from arXiv? From the eprinteweb.org site - “We have focused on your experience as a user, and have addressed issues of navigation, searching, personalization and presentation, in order to enhance that experience. We have also introduced reference linking across the entire content, and enhanced searching on all key fields, including institutional address.”&lt;/p>
&lt;p>The site looks very good and it’s interesting to see a publisher developing a service directly engaging with a repository.&lt;/p>
&lt;p>Some interesting points to note: There are DOI links to published articles - &lt;code>http://www.eprintweb.org/S/article/astro-ph/0603001&lt;/code> - which IOP gets from Crossref. References in the preprints are also linked - &lt;code>http://www.eprintweb.org/S/article/astro-ph/0603001/refs&lt;/code>&lt;/p>
&lt;p>Crossref will soon be making available an author/title only query for repositories to use to find DOIs for published papers when the preprint doesn’t have the full citation. Many authors don’t go back to their preprints to update the reference to the published version but the new Crossref query will enable the repositories to do this automatically.&lt;/p></description></item><item><title>Open Content</title><link>https://www.crossref.org/blog/open-content/</link><pubDate>Fri, 02 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/open-content/</guid><description>&lt;p>In light of my &lt;a href="https://www.crossref.org/blog/otmi-an-update/">earlier post&lt;/a> on OTMI, the mail copied below from Sebastian Hammer at &lt;a href="https://web.archive.org/web/20061205050055/http://indexdata.com//" target="_blank">Index Data&lt;/a> about open content may be of interest. They are looking to compile a listing of web sources of open content - see &lt;a href="https://www.indexdata.com/resources/open-content/" target="_blank">this page&lt;/a> for further details.&lt;/p>
&lt;p>(Via &lt;a href="https://web.archive.org/web/20070430002213/http://lists.webjunction.org/mailman/listinfo/xml4lib" target="_blank">XML4lib&lt;/a> and other lists.)&lt;/p>
&lt;p>_“Hi All,&lt;/p>
&lt;p>(apologies for any cross-posting)&lt;/p>
&lt;p>At Index Data, we have long felt that there were really interesting&lt;/p>
&lt;p>sources of open content out there that was not being utilized as well as&lt;/p>
&lt;p>it could be because it was hidden away in websites. We’re a software&lt;/p>
&lt;p>company specializing in information retrieval applications, so&lt;/p>
&lt;p>eventually we asked ourselves, ‘what could we all do with this stuff if&lt;/p>
&lt;p>it were exposed using our favorite open standards’.&lt;/p>
&lt;p>We thought it was worth finding out, so we have set up processes to&lt;/p>
&lt;p>regularly retrieve indexes of major open content resources, and make&lt;/p>
&lt;p>them available using SRU and Z39.50. We’ve started with the Open Content&lt;/p>
&lt;p>Alliance and Project Gutenberg (two quite different approaches to&lt;/p>
&lt;p>producing free eBooks), Wikipedia, the Open Directory Project, and&lt;/p>
&lt;p>OAIster. More is on the way.&lt;/p>
&lt;p>Connection information and more details are available at&lt;/p>
&lt;p>&lt;a href="https://web.archive.org/web/20070325152849/http://indexdata.com//opencontent/" target="_blank">https://web.archive.org/web/20070325152849/http://indexdata.com//opencontent/&lt;/a>.&lt;/p>
&lt;p>The kind of metadata you can get from these sources varies. The Open&lt;/p>
&lt;p>Content Alliance captures MARC records along with the scanned books,&lt;/p>
&lt;p>which makes for excellent metadata. Many of the others produce some&lt;/p>
&lt;p>variation of DublinCore. Our service, through either Z39.50 or SRU/W,&lt;/p>
&lt;p>exposes both MARC (or MARCXML) and DublinCore in XML for all sources.&lt;/p>
&lt;p>We’ve created a new mailing list to help inform people of changes to the&lt;/p>
&lt;p>services, new resources available, etc. Signup at&lt;/p>
&lt;p>&lt;a href="http://lists.indexdata.dk/cgi-bin/mailman/listinfo/oclist/" target="_blank">http://lists.indexdata.dk/cgi-bin/mailman/listinfo/oclist/&lt;/a> .&lt;/p>
&lt;p>We sincerely hope you will find these resources exciting and useful.&lt;/p>
&lt;p>Feel free to get in touch if you have questions or input.&lt;/p>
&lt;p>-Sebastian&lt;/p>
&lt;p>—&lt;/p>
&lt;p>Sebastian Hammer, Index Data&lt;/p>
&lt;p>&lt;a href="mailto:quinn@indexdata.com">quinn@indexdata.com&lt;/a> &lt;a href="https://www.indexdata.com" target="_blank">www.indexdata.com&lt;/a>&lt;/p>
&lt;p>Ph: (603) 209-6853 Fax: (866) 383-4485”_&lt;/p></description></item><item><title>OTMI - An Update</title><link>https://www.crossref.org/blog/otmi-an-update/</link><pubDate>Fri, 02 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/otmi-an-update/</guid><description>&lt;p>We’ve just posted an &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/2007/02/open_text_mining_interface_upd.html" target="_blank">update about OTMI&lt;/a> (the Open Text Mining Interface) on our Web Publishing blog &lt;a href="https://web.archive.org/web/20070815000000*/http://blogs.nature.com/wp/nascent/" target="_blank">Nascent&lt;/a>. This post details the following changes:&lt;/p>
&lt;ul>
&lt;li>Contact email - &lt;a href="mailto:otmi@nature.com">otmi@nature.com&lt;/a>
&lt;ul>
&lt;li>Wiki - &lt;a href="http://opentextmining.org/" target="_blank">http://opentextmining.org/&lt;/a>
&lt;ul>
&lt;li>
&lt;p>Repository - &lt;a href="https://web.archive.org/web/20090706181310/http://www.nature.com/otmi/journals.opml" target="_blank">https://web.archive.org/web/20090706181310/http://www.nature.com/otmi/journals.opml&lt;/a> &lt;/ul>
The OTMI content repository currently provides two years’ worth of full text across five of our titles:&lt;/p>
&lt;ul>
&lt;li>&lt;a href="http://www.nature.com/nature" target="_blank">&lt;em>Nature&lt;/em>&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://www.nature.com/nature" target="_blank">&lt;em>Nature Genetics&lt;/em>&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://www.nature.com/nature" target="_blank">&lt;em>Nature Reviews Drug Discovery&lt;/em>&lt;/a>
&lt;ul>
&lt;li>&lt;a href="http://www.nature.com/nature" target="_blank">&lt;em>Nature Structural &amp;amp; Molecular Biology&lt;/em>&lt;/a>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="http://www.nature.com/nature" target="_blank">&lt;em>The Pharmacogenomics Journal&lt;/em>&lt;/a> &lt;/ul>
See the &lt;a href="http://opentextmining.org/" target="_blank">wiki&lt;/a> for draft technical specs and for a sample script to generate the OTMI files. And feel free to add to the wiki on existing pages or create new pages as required.&lt;/p>
&lt;p>We’re very much looking forward to any feedback you may have on what we consider to be a very exciting new initiative for scholarly publishers.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Sir TimBL’s Testimony</title><link>https://www.crossref.org/blog/sir-timbls-testimony/</link><pubDate>Fri, 02 Mar 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/sir-timbls-testimony/</guid><description>&lt;p>Just in case anybody may not have seen this, &lt;a href="http://dig.csail.mit.edu/2007/03/01-ushouse-future-of-the-web.html" target="_blank">here&lt;/a>‘s the testimony of Sir Tim Berners-Lee yesterday before a House of Representatives Subcommittee on Telecommunications and the Internet. Required reading.&lt;/p>
&lt;p>(Via this &lt;a href="https://web.archive.org/web/20070307171557/http://www.savetheinternet.com/blog/2007/03/01/world-wide-web-inventor-says-net-neutrality-a-top-priority-for-congress/" target="_blank">post&lt;/a> yesterday in the &lt;a href="http://www.savetheinternet.com/blog/" target="_blank">Save the Internet&lt;/a> blog.)&lt;/p></description></item><item><title>&amp;#8220;Spinning Around&amp;#8221;</title><link>https://www.crossref.org/blog/spinning-around/</link><pubDate>Fri, 23 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/spinning-around/</guid><description>&lt;p>There’s a great exposition of &lt;a href="http://www.ifla.org/VII/s13/frbr/frbr.htm" target="_blank">FRBR&lt;/a> (the Functional Requirements for Bibliographic Records model “&lt;em>&lt;strong>work -&amp;gt; expression -&amp;gt; manifestation -&amp;gt; item&lt;/strong>&lt;/em>“) in &lt;a href="https://web.archive.org/web/20071004111040/http://www.frbr.org/2007/02/22/de-revolutionibus" target="_blank">this post&lt;/a> from &lt;a href="http://www.frbr.org/" target="_blank">The FRBR Blog&lt;/a> on &lt;a href="http://en.wikipedia.org/wiki/De_revolutionibus_orbium_coelestium" target="_blank">De Revolutionibus&lt;/a> as described in &lt;em>The Book Nobody Read: Chasing the Revolutions of Nicolaus Copernicus&lt;/em> by Owen Gingerich. See post for the background and &lt;a href="https://web.archive.org/web/20070705095508/http://www.frbr.org/files/frbrevolutionibus.png" target="_blank">here (103 KB PNG)&lt;/a> for a map of the FRBR relationships.&lt;/p>
&lt;p>(Yes, and a twinkly star in the title too. ;~)&lt;/p></description></item><item><title>Kay Sera Sera</title><link>https://www.crossref.org/blog/kay-sera-sera/</link><pubDate>Tue, 20 Feb 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/kay-sera-sera/</guid><description>&lt;p>Not specifically publishing-related, but here is a fun &lt;strike>rant &lt;/strike>&lt;a href="http://www.cioinsight.com/print_article2/0,1217,a=200162,00.asp" target="_blank">interview&lt;/a> with &lt;a href="http://en.wikipedia.org/wiki/Alan_Kay" target="_blank">Alan Kay&lt;/a> titled &lt;em>The PC Must Be Revamped—Now.&lt;/em>&lt;/p>
&lt;p>My favorite bit…&lt;/p>
&lt;p>“…in the last few years I’ve been asking computer scientists and programmers whether they’ve ever typed E-N-G-E-L-B-A-R-T into Google-and none of them have. I don’t think you could find a physicist who has not gone back and tried to find out what Newton actually did. It’s unimaginable. Yet the computing profession acts as if there isn’t anything to learn from the past, so most people haven’t gone back and referenced what Engelbart thought. ”&lt;/p></description></item><item><title>&amp;#8220;We’re sorry&amp;#8230;&amp;#8221;</title><link>https://www.crossref.org/blog/%238220were-sorry%238230%238221/</link><pubDate>Mon, 19 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/%238220were-sorry%238230%238221/</guid><description>&lt;p>&lt;strong>Update:&lt;/strong> All apologies to Google. Apparently this was a problem at our end which our IT folks are currently investigating. (And I thought it was just me. 🙂&lt;/p>
&lt;p>Just managed to get this page:&lt;/p>
&lt;p>_“Google Error&lt;/p>
&lt;p>We’re sorry…&lt;/p>
&lt;p>… but your query looks similar to automated requests from a computer virus or spyware application. To protect our users, we can’t process your request right now.&lt;/p>
&lt;p>We’ll restore your access as quickly as possible, so try again soon. In the meantime, if you suspect that your computer or network has been infected, you might want to run a virus checker or spyware remover to make sure that your systems are free of viruses and other spurious software.&lt;/p>
&lt;p>We apologize for the inconvenience, and hope we’ll see you again on Google.&lt;/p>
&lt;p>To continue searching, please type the characters you see below:”_&lt;/p>
&lt;p>And my search request?&lt;/p>
&lt;p>&lt;tt>ark&lt;/tt>&lt;/p>
&lt;p>(Actual query is &lt;a href="http://www.google.com/sorry/?continue=http://www.google.com/search%3Fclient%3Dfirefox-a%26rls%3Dorg.mozilla%253Aen-US%253Aofficial%26channel%3Ds%26hl%3Den%26q%3Dark%26btnG%3DGoogle%2BSearch" target="_blank">here&lt;/a> as argument to the &lt;tt>continue&lt;/tt> parameter.)&lt;/p>
&lt;p>Was hoping to find results related to the &lt;a href="http://www.ietf.org/internet-drafts/draft-kunze-ark-12.txt" target="_blank">The ARK Persistent Identifier Scheme&lt;/a>. Maybe I missed something but I’m not impressed.&lt;/p></description></item><item><title>At Last! URIs for InChI</title><link>https://www.crossref.org/blog/at-last-uris-for-inchi/</link><pubDate>Mon, 19 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/at-last-uris-for-inchi/</guid><description>&lt;p>The &lt;a href="http://info-uri.info/" target="_blank">info registry&lt;/a> has now added in the &lt;a href="http://iupac.org/inchi/" target="_blank">InChI&lt;/a> namespace (see registry entry &lt;a href="http://info-uri.info/registry/OAIHandler?verb=GetRecord&amp;amp;#038;metadataPrefix=reg&amp;amp;#038;identifier=info:inchi/" target="_blank">here&lt;/a>) which now means that chemical compounds identified by InChIs (&lt;a href="http://iupac.org/" target="_blank">IUPAC&lt;/a>‘s International Chemical Identifiers) are expressible in URI form and thus amenable to many Web-based description technologies that use URI as the means to identify objects, e.g. XLink, RDF, etc. As an example, the InChI identifier for &lt;a href="http://en.wikipedia.org/wiki/Naphthalene" target="_blank">naphthalene&lt;/a> is&lt;/p>
&lt;p>InChI=1/C10H8/c1-2-6-10-8-4-3-7-9(10)5-1/h1-8H&lt;/p>
&lt;p>and can now be legitimately expressed in URI form as&lt;/p>
&lt;p>info:inchi/InChI=1/C10H8/c1-2-6-10-8-4-3-7-9(10)5-1/h1-8H&lt;/p>
&lt;p>The info URI scheme exists to support legacy namespaces get a leg up onto the Web. Registered namespaces include PubMed identifiers, DOIs, handles, ADS bibcodes, etc. Increasingly we’ll be expecting to see identifiers (both new and old) represented in a common form - URI.&lt;/p></description></item><item><title>Stick this in your pipe&amp;#8230;</title><link>https://www.crossref.org/blog/stick-this-in-your-pipe/</link><pubDate>Mon, 19 Feb 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/stick-this-in-your-pipe/</guid><description>&lt;p>&lt;a href="http://www2.blogger.com/profile/12568103499947976875" target="_blank">Rob Cornelius&lt;/a> has a &lt;a href="http://allmyeye.blogspot.com/index.html" target="_blank">practical little demo&lt;/a> of using Yahoo! pipes against some Ingenta feeds.&lt;/p>
&lt;p>Like Tony, I keep experiencing speed/stability problems while accessing pipes so I haven’t yet become a &lt;a href="http://www.researchbuzz.org/wp/2007/02/12/yahoo-ruins-my-life-with-yahoo-pipes/" target="_blank">crack-pipes-head&lt;/a>.&lt;/p></description></item><item><title>OpenURL Podcast</title><link>https://www.crossref.org/blog/openurl-podcast/</link><pubDate>Sat, 17 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/openurl-podcast/</guid><description>&lt;p>Jon Udell interviews &lt;a href="https://web.archive.org/web/20070114212828/http://curtis.med.yale.edu/dchud/" target="_blank">Dan Chudnov&lt;/a> about &lt;a href="https://web.archive.org/web/20070206165948/http://www.niso.org/standards/standard_detail.cfm?std_id=783" target="_blank">OpenURL&lt;/a>, see his &lt;a href="http://blog.jonudell.net/2007/02/16/a-conversation-with-dan-chudnov-about-openurl-context-sensitive-linking-and-digital-archiving/" target="_blank">blog entry&lt;/a>: “A conversation with Dan Chudnov about OpenURL, context-sensitive linking, and digital archiving”. The podcast of the interview is available &lt;a href="http://jonudell.net/podcast/ju_chudnov.mp3" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>Interesting to see these kind of subjects beginning to be covered by a respected technology writer like Jon. As he says in his post:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“I have ventured into this confusing landscape because I think that the issues that libraries and academic publishers are wrestling with — persistent long-term storage, permanent URLs, reliable citation indexing and analysis — are ones that will matter to many businesses and individuals. As we project our corporate, professional, and personal identities onto the web, we’ll start to see that the long-term stability of those projections is valuable and worth paying for.”&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>OpenDocument 1.1 is OASIS Standard</title><link>https://www.crossref.org/blog/opendocument-1.1-is-oasis-standard/</link><pubDate>Thu, 15 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/opendocument-1.1-is-oasis-standard/</guid><description>&lt;p>From the OASIS &lt;a href="https://web.archive.org/web/20070219093602/https://www.oasis-open.org/news/oasis-news-2007-02-14.php" target="_blank">Press Release&lt;/a>:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Boston, MA, USA; 13 February 2007 — OASIS, the international standards consortium, today announced that its members have approved version 1.1 of the Open Document Format for Office Applications (OpenDocument) as an OASIS Standard, a status that signifies the highest level of ratification.”&lt;/em>&lt;/p>
&lt;/blockquote></description></item><item><title>Crossref Author ID meeting</title><link>https://www.crossref.org/blog/crossref-author-id-meeting/</link><pubDate>Wed, 14 Feb 2007 00:00:00 +0000</pubDate><author>Amy Brand</author><guid>https://www.crossref.org/blog/crossref-author-id-meeting/</guid><description>&lt;p>February 5, 2007, Washington DC Crossref invited a number of people to attend an information gathering session on the topic of Author IDs. The purpose of the meeting was to determine:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>About whether there is an industry need for a central or federated contributor id registry;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>whether Crossref should have a role in creating such a registry;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>how to proceed in a way that builds upon existing systems and standards.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>&lt;strong>In attendance:&lt;/strong> Jeff Baer, CSA; Judith Barnsby, IOPP; Geoff Bilder, Crossref; Amy Brand, Crossref; David Brown, British Library; Richard Cave, PLoS (remote); Bill Carden, ScholarOne; Gregg Gordon, SSRN; Gerry Grenier, IEEE; Michael Healy, BISG (remote); Helen Henderson, Ringgold; Thomas Hickey, OCLC (remote); Terry Hulburt, IOPP; Tim Ingoldsby, AIP; Ruth Jones, Britsh Library; Marl Land, Parity; Dave Martinson, ACS; Georgios Papadapoulos, Atypon (with two colleagues); Jim Pringle, Thomson; Chris Rosin, Parity; Tim Ryan, Wiley; Philippa Scoones, Blackwell; Chris Shillum, Elsevier; Neil Smalheiser, UIC (remote); Barbara Tillett, LoC; Vetle Torvik, UIC (remote); Charles Trowbridge, ACS; Amanda Ward, Nature (remote); Stu Weibel, OCLC (remote); David Williamson, LoC;&lt;/p>
&lt;p>&lt;strong>Notes&lt;/strong> Amy Brand opened the meeting and welcomed attendees. She said the goal of the meeting was really nothing more than to launch a discussion on a topic of author identifiers and hear from participants re their views and experiences on unique identifiers for individuals — be they authors, contributors, or otherwise. We went around the table and everyone introduced themselves. Amy then introduced Geoff Bilder as moderator of the meeting. Geoffrey Bilder said that Crossref’s members had indicated that they would like Crossref to explore whether it could play a role in creating an author identification system. The members feel that an “author DOI” scheme would help them with production and editorial issues. They also recognize that such a scheme could fuel numerous downstream applications. Geoff apologized for sounding like Rumsfeld and said, we know that there is a lot that we don’t know, but we don’t know exactly what we don’t know. We have just started this project and we wanted to get some feedback from various groups concerned with scholarly publishing in order to understand what people would like to see in regards to author identification schemes and what initiatives/efforts we need to be aware of. He commented that the currently assembled group failed to include the open web community, and their input would be important too as this project develops. The meeting then turned to short project summaries from others.&lt;/p>
&lt;p>&lt;strong>Project Summaries&lt;/strong> Jim Pringle gave a short PPT presentation (attached) and reported that Thomson first started creating its own author ids in 2000, in relation to the launch of its Highly Cited service. The focus for Thomson in this area has been on author disambiguation. Jim said that the focus for Crossref in this area would be a system that could respond to the question “who are you and what have you written”; he also raised concern about matters of author privacy.&lt;/p>
&lt;p>Michael Healy then discussed the &lt;a href="https://web.archive.org/web/20070611181723/http://collectionscanada.ca/iso/tc46sc9/27729.htm" target="_blank">International Standard Party Identifier&lt;/a> (ISPI). ISO TC 46/SC 9 is developing ISPI as a new international identification system for the parties (persons and corporate bodies) involved in the creation and production of content entities. Work on the ISPI project began in August 2006 when the New Work Item proposal was approved by the member bodies of ISO TC 46/SC 9. The first meeting of the ISPI project group was held at CISAC’s offices in Paris on September 12, 2006. This project has strong representation the library sector, RRO’s, booksellers, music and film/TV industries represented as well. Mr. René Lloret Linares from CISAC (International Confederation of Societies of Authors and Composers) chairs the group; until now CISAC has been using a proprietary id scheme and would like to move to use of an open standard to identify all contributors and creators. Michael was asked whether membership in the project group was open, and he replied that anyone can attend meetings as observers but that voting is restricted to those nominated by their own national standards organisation. Chris Shillum then asked the group to think about developed use cases for the publishing industry, and how they differ from potential ISPI applications.&lt;/p>
&lt;p>Helen Henderson reported on the &lt;a href="https://web.archive.org/web/20071212133049/https://web.archive.org/web/20060904075439/http://www.journalsupplychain.com/" target="_blank">Journals Supply Chain project&lt;/a>, a pilot that aims to discover whether the creation of a standard, commonly used identifier for Institutions (customer ids) will be beneficial to parties involved in the journal supply chain. The pilot models interactions between each party — library, publisher, agent. 35 publishers are participating thus far. Helen also said there is a clear need for sub-institutional level ids. Helen also pointed out the value of associating author and institutional ids. On the topic of institutions, Tim Ingoldsby pointed out that both academic and corporate institutions are important. Chris Rosin said Parity is working on author merger and disambiguation as core use cases of author ids for its publisher clients. In particular, they have developed automated merging of instances into profiles, proceeding with conservative bias on what constitutes a match/merge. Parity is also looking at applying author cv’s onto profiles. This will require contributors to participate, and they will need to make it as easy as possible for contributors. Chris said that authentication, trust, and privacy are key considerations; even collecting public information in one place raises privacy issues. Judith Barnsby pointed out that the UK has stronger data protection rules than the US, re privacy. Discussion among the group at this point in the meeting resulted in identifying two different areas in author id assignment — (1) ongoing assignment, (2) retroactive assignment. Geoff said this distinction was useful for Crossref, who could more easily address ongoing assignment via publishers working directly with authors.&lt;/p>
&lt;p>Neil Smalheiser, a neuroscientist at UIC, reported on the &lt;a href="http://arrowsmith.psych.uic.edu/arrowsmith_uic/index.html" target="_blank">Arrowsmith Project&lt;/a>, a statistical model based on multiple features of the Medline database. The goal of the model is to predict the probability that any two papers are written by the same person. The project’s “Authority” tool weighs criteria such as researcher affiliation, co-author names, journal title, and medical subject headings to identify the papers most likely written by a target author. For details: arrowsmith.psych.uic.edu/arrowsmith_uic/index.html &lt;a href="http://arrowsmith.psych.uic.edu/arrowsmith_uic/index.html" target="_blank">http://arrowsmith.psych.uic.edu/arrowsmith_uic/index.html&lt;/a>&lt;/p>
&lt;p>David Williamson of LoC said he was working on name authority files, using ONIX metadata. Barbara Tillet of LoC spoke about authority files and related efforts in library world, which uses the control number, one type of unique id. She reported that IFLA (International Federation of Library Associations) has a group working on how to share authority numbers, which has actually been in discussion since the 1970s; there is to be an IFLA-IPA meeting in April 2007. The library community is eager to share what it knows and what it has developed this far. Barbara suggested that use of Dublin Core format here may be the best way to go. Different communities will no doubt need different ids. What is needed in the library community is an international, multi-lingual solution, based on unicode, connecting regional authority files. Publishers will want to take advantage of library author-ity files for retrospective identifications.&lt;/p>
&lt;p>Thomas Hickey of OCLC mentioned the &lt;a href="https://www.worldcat.org/identities/" target="_blank">WorldCat Identity service&lt;/a>, which summarizes information for 20 million authors searchable in WorldCat. Gerry Grenier reported that IEEE was about to implement its own author disambiguation and id system, and he offered that this metadata could be fed into a Crossref system. Different participants had different views on whether the goal here should be a “light and non-centralized” (or federated) approach versus a centralized registry with one place to link authors across all publishers, versus a hybrid — centralized source to handout unique id, but publisher data could be distributed. There could also be a network of registration agencies working in a federated system. Different participants also had different views on Crossref’s role. Several publishers at the meeting supported Crossref’s role, especially in the STM space, whereas there was concern raised among some parties about whether Crossref was an appropriate choice for a system that will need to be “available everywhere to everybody”, and others re-iterated the importance of giving the academic community a voice in the development of such a service Discussion then turned to use cases — the question being, what problems would having an author id help you solve in your organisation?&lt;/p>
&lt;p>&lt;strong>USE CASES ARTICULATED AT MEETING:&lt;/strong>&lt;/p>
&lt;ul>
&lt;li>
&lt;p>For RROs, known use case is to facilitate distribution of monies owed to authors;;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>for booksellers, disambiguation in search;;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>to understand the provenance of documents;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>search — to find works for particular person; self presentation — how can I effectively present myself and my work to the world?;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>cross-walks — associating various life sciences ids, such as PubChem;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>identity of society members;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>identity of research funding institutions;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>disambiguation and attribution;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>linking authors and institutions;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>for enhancing peer review system — need unique ids to share information with various departments;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>to better know the value of our authors — for activities such as peer review, tracking stats on authors, article downloads, and individualized or personalized services;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>with a central registry, author only has one place they have to update their information;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>authors will want the information to be portable when they move from inst to another — “where is Jeff Smith now?” is one such question;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>to associate connected authors with one another;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>to aggregate info on where (what institution) research is being done on a particular topic;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>privacy can be enhanced with author DOIs;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>sharing info from library to library;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>cluster all the works of a particular person for search purposes;&lt;/p>
&lt;/li>
&lt;li>
&lt;p>stats about authors — “how many times has this author tried and been rejected from Nature?” for instance.&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>**NEXT STEPS: Please watch the CrossTech blog for ongoing discussion **&lt;/p></description></item><item><title>Microsoft to Support OpenID</title><link>https://www.crossref.org/blog/microsoft-to-support-openid/</link><pubDate>Thu, 08 Feb 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/microsoft-to-support-openid/</guid><description>&lt;p>Kim Cameron, Microsoft’s Identity Czar and member of the &lt;a href="https://web.archive.org/web/20070826193937/http://www.identitygang.org/" target="_blank">Identity Gang&lt;/a>, &lt;a href="http://www.identityblog.com/?p=668" target="_blank">comments on&lt;/a> Microsoft’s announcement that they will support &lt;a href="http://openid.net/" target="_blank">OpenID&lt;/a>. Another sign that &lt;a href="http://en.wikipedia.org/wiki/Federated_identity" target="_blank">federated identity&lt;/a> schemes are gaining traction and OpenID is likely to emerge as a standard the publishers are going to want to grapple with soon.&lt;/p>
&lt;p>This follows Doc Searl’s comments on the notion of “&lt;a href="http://www.linuxjournal.com/node/1000180" target="_blank">Creator Relationship Management&lt;/a>” where he speculates that the techniques being used in federated identity schemes and the Creative Commons can be combined to create a new “silo-free” value chain amongst creators, producers and distributors.&lt;/p></description></item><item><title>Remixing RSS</title><link>https://www.crossref.org/blog/remixing-rss/</link><pubDate>Thu, 08 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/remixing-rss/</guid><description>&lt;p>Niall Kennedy has a &lt;a href="http://www.niallkennedy.com/blog/archives/2007/02/yahoo-pipes.html" target="_blank">post&lt;/a> about the newly released &lt;a href="https://www.pipes.digital/" target="_blank">Yahoo! Pipes&lt;/a>. As he says:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Yahoo! Pipes lets any Yahoo! registered user enter a set of data inputs and filter their results. You might splice a feed of your latest bookmarks on del.icio.us with the latest posts from your blog and your latest photographs posted to Flickr.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>He also warns about possible implications for web publishers:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“Yahoo! Pipes makes it easy to remove advertising from feeds or otherwise reformat your content.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Note: As yet, I have not been able to access the site. Interested to learn if anybody else has and what their experiences have been.&lt;/p></description></item><item><title>RSS Validator in the Spotlight</title><link>https://www.crossref.org/blog/rss-validator-in-the-spotlight/</link><pubDate>Thu, 08 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/rss-validator-in-the-spotlight/</guid><description>&lt;p>Sam Ruby &lt;a href="http://www.intertwingly.net/blog/2007/02/07/Validating-the-Validators" target="_blank">responds&lt;/a> to Brian Kelly’s &lt;a href="http://ukwebfocus.wordpress.com/2007/02/07/validators-dont-always-work/" target="_blank">post&lt;/a> about the &lt;a href="http://feedvalidator.org/" target="_blank">RSS Validator&lt;/a> and its treatment of RSS 1.0, or rather, RSS 1.0 modules. As Ruby notes:&lt;/p>
&lt;blockquote>
&lt;p>&lt;em>“There is no question that RSS 1.0 is widely deployed. &lt;a href="http://web.resource.org/rss/1.0/" target="_blank">RSS 1.0&lt;/a> has a &lt;a href="http://www.intertwingly.net/slides/2003/rssQuickSummary.html" target="_blank">minimal&lt;/a> core. The validation for that core is pretty solid.”&lt;/em>&lt;/p>
&lt;/blockquote>
&lt;p>Not sure if I’d seen that &lt;a href="http://www.intertwingly.net/slides/2003/rssQuickSummary.html" target="_blank">RSS comparison table&lt;/a> before, but it is reassuring. (Oh, and see the really simple case off to the right. 😉&lt;/p>
&lt;p>Good point, anyway about contributing test cases. I guess we should really submit a PRISM test case. And yes, the Validator is somewhat buggy as some recent testing confirms. On which more later.&lt;/p></description></item><item><title>SearchULike</title><link>https://www.crossref.org/blog/searchulike/</link><pubDate>Mon, 05 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/searchulike/</guid><description>&lt;p>Nelson Minar has a short &lt;a href="http://www.somebits.com/weblog/tech/bad/googleSearchHistory.html" target="_blank">post&lt;/a> on Google’s &lt;a href="http://www.google.com/searchhistory" target="_blank">Search History&lt;/a> ‘feature’ and how it can be used to enhance your search experience. I guess that should be SearchULike.&lt;/p></description></item><item><title>What’s My Link?</title><link>https://www.crossref.org/blog/whats-my-link/</link><pubDate>Mon, 05 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/whats-my-link/</guid><description>&lt;p>Simon Willison has a great piece &lt;a href="https://web.archive.org/web/20070205131629/http://simonwillison.net/2007/Feb/4/urls/" target="_blank">here&lt;/a> about disambiguating URLs. Best practice on creating and publishing URLs is obviously something of interest to any publisher. See this excerpt from Simon’s post:&lt;/p>
&lt;p>_“Here’s a random example, plucked from today’s del.icio.us popular. convinceme.net is a new online debating site (tag clouds, gradient fills, rounded corners). It’s listed in del.icio.us a total of four times!&lt;/p>
&lt;ul>
&lt;li>
&lt;p>&lt;a href="https://web.archive.org/web/20070203050251/http://www.convinceme.net/" target="_blank">https://web.archive.org/web/20070203050251/http://www.convinceme.net/&lt;/a> has 36 saves&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://web.archive.org/web/20070202182238/http://www.convinceme.net/index.php" target="_blank">https://web.archive.org/web/20070202182238/http://www.convinceme.net/index.php&lt;/a> has 148 saves&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://web.archive.org/web/20070203050251/http://www.convinceme.net/" target="_blank">https://web.archive.org/web/20070203050251/http://www.convinceme.net/&lt;/a> has 211 saves&lt;/p>
&lt;/li>
&lt;li>
&lt;p>&lt;a href="https://web.archive.org/web/20070202182238/http://www.convinceme.net/index.php" target="_blank">https://web.archive.org/web/20070202182238/http://www.convinceme.net/index.php&lt;/a> has 38 saves&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Combined that’s 433 saves; much more impressive, and more likely to end up at the top of a social sharing sites.”_&lt;/p></description></item><item><title>comments and trackbacks</title><link>https://www.crossref.org/blog/comments-and-trackbacks/</link><pubDate>Fri, 02 Feb 2007 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/comments-and-trackbacks/</guid><description>&lt;p>Due to spam the comments and trackbacks were turned off on the blog since last week. Comments can be moderated so they have now been turned back on. Glad to see postings picking up.&lt;/p></description></item><item><title>Hooray!</title><link>https://www.crossref.org/blog/hooray/</link><pubDate>Fri, 02 Feb 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/hooray/</guid><description>&lt;p>Somebody is both reading (and recommending) this blog - see Lorcan’s post &lt;a href="http://orweblog.oclc.org/archives/001257.html" target="_blank">here&lt;/a>. Just my opinion but would be really good to see more librarians following this in order to arrive at better consensus.&lt;/p></description></item><item><title>RSC launches semantic enrichment of journal articles</title><link>https://www.crossref.org/blog/rsc-launches-semantic-enrichment-of-journal-articles/</link><pubDate>Thu, 01 Feb 2007 00:00:00 +0000</pubDate><author>rkidd</author><guid>https://www.crossref.org/blog/rsc-launches-semantic-enrichment-of-journal-articles/</guid><description>&lt;p>The RSC has gone live today with the results of &lt;a href="https://web.archive.org/web/20070812060042/http://www.rsc.org/Publishing/Journals/ProjectProspect/index.asp" target="_blank">Project Prospect&lt;/a>, introducing semantic enrichment of journal articles across all our titles. I’m pretty sure we’re the first primary research publisher to do anything of this scope.&lt;/p>
&lt;p>We’re identifying chemical compounds and providing synonyms, InChIs (IUPAC’s Chemical Identifier), downloadable CML (Chemical Markup Language), SMILES strings and 2D images for these compounds. In terms of subject area we’re marking up terms from the IUPAC Gold Book, and also Open Biomedical Ontology terms from the Gene, Cell, and Sequence Ontologies. All this stuff is currently available from an enhanced HTML view, with the additional information and links to related articles accessed via highlights in the article and popups.&lt;/p>
&lt;p>The mark-up tools have been developed together with UK academics based at the Unilever Centre of Molecular Informatics and the Computing Laboratory at Cambridge University.&lt;/p>
&lt;p>At launch we have about 100 articles from our 2007 publications, with the enhanced views currently free-to-air. Feel free to take a look.&lt;/p></description></item><item><title>Digital Objects</title><link>https://www.crossref.org/blog/digital-objects/</link><pubDate>Tue, 30 Jan 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/digital-objects/</guid><description>&lt;p>A couple weeks back there was a meeting of the &lt;a href="http://www.openarchives.org/" target="_blank">Open Archive Initiative&lt;/a>‘s Object Reuse and Exchange (&lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a>) Technical Committee hosted in the Butler Library at Columbia University, New York.&lt;/p>
&lt;img alt="DSC00027.JPG" src="" width="204" height="153" />
&lt;p>Lorcan Dempsey of OCLC blogs &lt;a href="http://orweblog.oclc.org/archives/001254.html" target="_blank">here&lt;/a> on the &lt;a href="http://www.openarchives.org/ore/documents/OAI-ORE-TC-Meeting-200701.pdf" target="_blank">report&lt;/a> (PDF format) that was generated from that meeting. As does Pete Johnston of Eduserv &lt;a href="http://efoundations.typepad.com/efoundations/2007/01/more_ore.html" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>&lt;strong>Background:&lt;/strong>&lt;/p>
&lt;p>&lt;a href="http://www.openarchives.org/ore/" target="_blank">OAI-ORE&lt;/a> is being positioned as a companion activity to the more familiar &lt;a href="http://www.openarchives.org/pmh/" target="_blank">OAI-PMH&lt;/a> protocol for metadata harvesting. OAI-ORE relates to the expression and exchange of digital objects across repositories rather than just the exchange of metadata about those objects.&lt;/p>
&lt;p>The basic problem is that scholarly communication deals in units which are compound resulting from a complex of documents and/or datasets expressed in multiple formats, versions, relationships, etc. The underlying web architecture provides a fairly simple model of resources (identified with URIs) which are interconnected and can be interacted with by retrieving representations of those resources. In practice, this usually results in unique URIs (and thus resources) for each representation - think of one URI for an HTML document, another for a PDF document of the same work, and yet new URIs for those same document formats for a new version of the work. Clearly, all these representations (or documents) are related, and more importantly relate to a single underlying “work”. Web architecture as generally practiced does not provide ready mechanisms to aggregate (and compartmentalize) related documents and datasets.&lt;/p>
&lt;p>My fairly simple mental picture is that the web landscape is rather like the early universe in which energy (and matter) is distributed uniformly and there is little local “intelligence” which is gradually built up through time by matter formation and aggregations of this matter leading to the more familiar “clumpy” universe with its recognizable galaxies, stars and other objects. This “clumpiness” is precisely what we are missing in the scholarly web.&lt;/p></description></item><item><title>An Open PDF?</title><link>https://www.crossref.org/blog/an-open-pdf/</link><pubDate>Mon, 29 Jan 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/an-open-pdf/</guid><description>&lt;p>Adobe announces today the following:&lt;/p>
&lt;p>&lt;em>“SAN JOSE, Calif. — Jan. 29, 2007 — Adobe Systems Incorporated (Nasdaq:ADBE) today announced that it intends to release the full Portable Document Format (PDF) 1.7 specification to AIIM, the Enterprise Content Management Association, for the purpose of publication by the International organisation for Standardization (ISO).”&lt;/em>&lt;/p>
&lt;p>The full press release is &lt;a href="https://web.archive.org/web/20070202072839/http://www.adobe.com/aboutadobe/pressroom/pressreleases/200701/012907OpenPDFAIIM.html" target="_blank">here&lt;/a>.&lt;/p>
&lt;p>(Via &lt;a href="http://www.tkachenko.com/blog/archives/000657.html" target="_blank">Oleg Tkachenko’s Blog&lt;/a>.)&lt;/p></description></item><item><title>Jon Udell and DOIs</title><link>https://www.crossref.org/blog/jon-udell-and-dois/</link><pubDate>Mon, 29 Jan 2007 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/jon-udell-and-dois/</guid><description>&lt;p>Not to get too self-referential here, but it was very cool to see that Tony Hammond has managed to get &lt;a href="http://blog.jonudell.net/2007/01/29/the-persistent-blogosphere/" target="_blank">Not to get too self-referential here, but it was very cool to see that Tony Hammond has managed to get&lt;/a> This based on a &lt;a href="http://blog.jonudell.net/2007/01/26/a-conversation-with-tony-hammond-about-digital-object-identifiers/" target="_blank">podcast&lt;/a> interview with Tony posted on January 26th.&lt;/p></description></item><item><title>W3C Recs for XML - Eight of &amp;#8216;Em!</title><link>https://www.crossref.org/blog/w3c-recs-for-xml-eight-of-em/</link><pubDate>Thu, 25 Jan 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/w3c-recs-for-xml-eight-of-em/</guid><description>&lt;p>Although most folks will already know about this it still seems significant enough to blog the arrival of XQuery 1.0, XSLT 2.0, and XPath 2.0. See the &lt;a href="http://www.w3.org/2007/01/qt-pressrelease" target="_blank">W3C Press Release&lt;/a>.&lt;/p></description></item><item><title>Use of PRISM in RSS</title><link>https://www.crossref.org/blog/use-of-prism-in-rss/</link><pubDate>Tue, 23 Jan 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/use-of-prism-in-rss/</guid><description>&lt;p>Was rooting around for some information and stumbled across this page which may be of interest:&lt;/p>
&lt;p>&lt;a href="http://googlereader.blogspot.com/2006/08/namespaced-extensions-in-feeds.html" target="_blank">http://googlereader.blogspot.com/2006/08/namespaced-extensions-in-feeds.html&lt;/a>&lt;/p>
&lt;p>Namespaced Extensions in Feeds&lt;/p>
&lt;p>Thursday, August 03, 2006&lt;/p>
&lt;p>posted by Mihai Parparita&lt;/p>
&lt;p>&lt;em>“I wrote a small MapReduce program to go over our BigTable and get the top 50 namespaces based on the number of feeds that use them.”&lt;/em>&lt;table border=0 cellpadding="5">&lt;/p>
&lt;/table>
&lt;p>Seems quite an impressive percentage for PRISM.&lt;/p></description></item><item><title>What’s in a URI?</title><link>https://www.crossref.org/blog/whats-in-a-uri/</link><pubDate>Mon, 08 Jan 2007 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/whats-in-a-uri/</guid><description>&lt;p>First off, a Happy New Year to all!&lt;/p>
&lt;p>A &lt;a href="https://web.archive.org/web/20130903193049/https://utils.its.caltech.edu/pipermail/openurl/2007-January/000376.html" target="_blank">post of mine&lt;/a> to the &lt;a href="https://web.archive.org/web/20130903202546/https://utils.its.caltech.edu/mailman/listinfo/openurl" target="_blank">OpenURL list&lt;/a> may possibly be of interest. Following up the recent W3C TAG (Technical Architecture Group) Finding on “The Use of Metadata in URIs” I pointed out that the TAG do not seem to be aware of OpenURL: which is both a standard prescription for including metadata in URI strings and a US information standard to boot.&lt;/p></description></item><item><title>Google offer on journal archives&amp;#8230;</title><link>https://www.crossref.org/blog/google-offer-on-journal-archiv-1/</link><pubDate>Mon, 18 Dec 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/google-offer-on-journal-archiv-1/</guid><description>&lt;p>Peter Suber &lt;a href="http://www.earlham.edu/~peters/fos/2006_12_17_fosblogarchive.html#116637929327063772" target="_blank">reports&lt;/a> on his Open Access News that Google is offering to digitize journal backfiles. The full text articles are available as images and for free hosted by Google. The deal is non-exclusive and publishers retain copyright (but many backfiles will be out of copyright) but Google will not supply the publisher with the electronic files - so non-exclusive means that the publisher or someone else could digitize the back-year records too (but how to recover the costs when it’s all free in Google?).&lt;/p>
&lt;p>Dorothea Salo (&lt;a href="https://www.crossref.org/blog/speaking-of-stm-innovations/">recent STM Innovations speaker&lt;/a>) over at Caveat Lector provides an excellent &lt;a href="https://web.archive.org/web/20080725070901/http://cavlec.yarinareth.net/archives/2006/12/17/control-your-bits/" target="_blank">review of the Google offer&lt;/a> with some good advice for publishers (“always control your bits”).&lt;/p></description></item><item><title>Exhibit A</title><link>https://www.crossref.org/blog/exhibit-a/</link><pubDate>Tue, 12 Dec 2006 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/exhibit-a/</guid><description>&lt;p>MIT’s Simile project has just released &lt;a href="http://simile.mit.edu/exhibit/" target="_blank">Exhibit&lt;/a>, a ” lightweight structured data publishing framework.” Read that as “an easy-to-use mashup creation tool.” I have heard that &lt;a href="http://www.ldodds.com/blog/" target="_blank">Leigh&lt;/a> has already started experimenting with it. I look forward to a writeup soon…&lt;/p></description></item><item><title>Speaking of STM Innovations</title><link>https://www.crossref.org/blog/speaking-of-stm-innovations/</link><pubDate>Tue, 12 Dec 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/speaking-of-stm-innovations/</guid><description>&lt;p>The STM Innovations meeting on December 7th in London was excellent. Leigh Dodds &lt;a href="http://www.ldodds.com/blog/archives/000303.html" target="_blank">has a short summary&lt;/a> of the day on his blog. Interestingly, I can’t find anything about the conference on the STM website.&lt;/p></description></item><item><title>Zotero - next generation research tool?</title><link>https://www.crossref.org/blog/zotero-next-generation-research-tool/</link><pubDate>Tue, 12 Dec 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/zotero-next-generation-research-tool/</guid><description>&lt;p>&lt;a href="http://www.zotero.org/" target="_blank">1&lt;/a> was mentioned at the STM Innovations talk in London and it’s worth taking a look. It’s billed as the next generation of bibliographic management software - End Note but a lot more included. DOIs should be incorporated into this tool - I couldn’t find any mention of Crossref or DOIs.&lt;/p></description></item><item><title>And Just Relax</title><link>https://www.crossref.org/blog/and-just-relax/</link><pubDate>Tue, 28 Nov 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/and-just-relax/</guid><description>&lt;p>Nice piece of advocacy &lt;a href="http://www.tbray.org/ongoing/When/200x/2006/11/27/Choose-Relax" target="_blank">here&lt;/a> by Tim Bray for &lt;a href="http://relaxng.org/" target="_blank">RELAX&lt;/a>. High time to see someone standing up for RELAX - a much friendlier XML schema language.&lt;/p></description></item><item><title>Journal Supply Chain Efficiency Improvement Pilot</title><link>https://www.crossref.org/blog/journal-supply-chain-efficienc/</link><pubDate>Thu, 12 Oct 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/journal-supply-chain-efficienc/</guid><description>&lt;p>This project - &lt;a href="https://web.archive.org/web/20061004011422/www.journalsupplychain.com/" target="_blank">https://web.archive.org/web/20061004011422/www.journalsupplychain.com/&lt;/a> - (which needs a new name or clever acronym) has released a &lt;a href="https://web.archive.org/web/20060904075439/http://www.journalsupplychain.com/press_files/JSCEI%20Pilot%20mid-year%20report%20external%2027Sep06.pdf" target="_blank">Mid Year Report&lt;/a>. The pilot is being extended into 2007 and there is clearly value for publishers in having an unique ID for institutions at the licensing unit level. Ringgold, one of the project partners, has a great database with a validated hierarchy of institutions from consortia down to departments - I had a demo at Frankfurt. The report has some info on benefits for publishers and on possible business models. I think a central, neutral registry of unique IDs would be a real benefit to the industry.&lt;/p>
&lt;p>From the report:&lt;/p>
&lt;p>“Publishers&lt;/p>
&lt;p>Certainly publishers are already using an institutional identifier internally with major&lt;/p>
&lt;p>marketing and customer communication benefits. The main areas where the proposed&lt;/p>
&lt;p>identifier could add value to the communication between the publisher and customer&lt;/p>
&lt;p>should be in areas such as:&lt;/p>
&lt;p>• accurate COUNTER usage reports&lt;/p>
&lt;p>• institutional renewals being unrecognized as such and therefore appearing as&lt;/p>
&lt;p>new subscriptions&lt;/p>
&lt;p>• easier ability to track institutional end-users of consolidated subscriptions&lt;/p>
&lt;p>(especially those where the agent does not deliver orders via ICEDIS&lt;/p>
&lt;p>structured FTP with Type 2 addresses incorporated in the complete record)”&lt;/p>
&lt;p>On business models:&lt;/p>
&lt;p>“A sensible business model would have those that receive the most economic benefit&lt;/p>
&lt;p>from a respective service providing a respective level of funding to support costs. It is&lt;/p>
&lt;p>clear that publishers are the primary beneficiaries of the institutional identifier, with&lt;/p>
&lt;p>clear benefits, thereby suggesting they should bear the proportionate cost. Ultimately&lt;/p>
&lt;p>the subscriber pays anyway; economies are reflected in reduced cost to the subscriber&lt;/p>
&lt;p>in a competitive market.&lt;/p>
&lt;p>Other participants would see service improvements, but not the same clear benefits. It&lt;/p>
&lt;p>would therefore be reasonable to ask the publishers to bear the major cost of the&lt;/p>
&lt;p>establishment of such an identifier, and to a certain extent they have already done so&lt;/p>
&lt;p>by subscribing selectively to Ringgold’s existing auditing and database services.&lt;/p>
&lt;p>The various and relevant business revenue streams might be reflected as follows:&lt;/p>
&lt;p>• Free service: limited search only, with number of searches per day restricted,&lt;/p>
&lt;p>possibility of searchers to edit or input information using a “response form”&lt;/p>
&lt;p>designed for such purposes&lt;/p>
&lt;p>• Basic subscription: unlimited search access to the database&lt;/p>
&lt;p>• Database license for hosting services: download of standard selected metadata&lt;/p>
&lt;p>• Database license for publishers: access for download of selected metadata, and&lt;/p>
&lt;p>automatic receipt of alerts for changes”&lt;/p></description></item><item><title>Ruby Makes A-List</title><link>https://www.crossref.org/blog/ruby-makes-alist/</link><pubDate>Thu, 12 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/ruby-makes-alist/</guid><description>&lt;p>Um, well. Seems according to &lt;a href="http://www.oreillynet.com/ruby/blog/2006/10/ruby_declared_mainstream.html" target="_blank">O’Reilly Ruby&lt;/a> that Ruby is now a mainstream language.&lt;/p>
&lt;p>&lt;em>“The &lt;a href="http://www.ruby-lang.org/" target="_blank">Ruby programming language&lt;/a> just made the A-list on the &lt;a href="http://www.tiobe.com" target="_blank">TIOBE Programming Community Index&lt;/a>, and Ruby is now listed as a mainstream programming language. For the past three or four years Ruby has consistently placed in the high 20’s in this index, but is now placed as the 13th most popular programming language!”&lt;/em>&lt;/p>
&lt;p>(No language wars, but I am, I will confess, a big admirer - for some time.)&lt;/p></description></item><item><title>STIX and Stones</title><link>https://www.crossref.org/blog/stix-and-stones/</link><pubDate>Thu, 05 Oct 2006 00:00:00 +0000</pubDate><author>Crossref</author><guid>https://www.crossref.org/blog/stix-and-stones/</guid><description>&lt;p>The &lt;a href="http://www.stixfonts.org/" target="_blank">STIX Fonts&lt;/a> project funded by six major publishers to develop a comprehensive font set for STM publishing has completed its development phase and is about to move into beta testing (planned to commence in late October). Participation is open to all publishers - so now is the time to get involved to ensure your needs are met by this significant activity.&lt;/p></description></item><item><title>AdsML</title><link>https://www.crossref.org/blog/adsml/</link><pubDate>Tue, 03 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/adsml/</guid><description>&lt;p>A new version of the AdsML Framework 2.0, Release 8 from the &lt;a href="https://web.archive.org/web/20061004090802/http://195.52.248.218/WebSite/adsml.nsf/HTML/Index.html" target="_blank">AdsML Consortium&lt;/a> is now available for download from &lt;a href="http://www.adsml.org/2006/announcements/adsml-framework-2-0-release-8-issued/" target="_blank">http://www.adsml.org/2006/announcements/adsml-framework-2-0-release-8-issued/&lt;/a>.&lt;/p>
&lt;p>Below is an extract from the “Vision” document which outlines the broad goals of AdsML.&lt;/p>
&lt;p>_“2 The Vision of AdsML&lt;/p>
&lt;p>According to its Charter document, the mission of the AdsML Consortium is 3-&lt;/p>
&lt;p>fold:&lt;/p>
&lt;p>• to create an internationally-adopted set of specifications and associated&lt;/p>
&lt;p>business processes for the electronic exchange of business information and&lt;/p>
&lt;p>content for advertising&lt;/p>
&lt;p>• to simplify and accelerate business interactions&lt;/p>
&lt;p>• to facilitate use across multiple media in both current and future&lt;/p>
&lt;p>environments.&lt;/p>
&lt;p>This dry, somewhat technical statement masks the simplicity and power of what&lt;/p>
&lt;p>the AdsML Consortium aims to do. Stated informally, AdsML’s vision is to tie&lt;/p>
&lt;p>together all of the parties involved in producing, booking, distributing&lt;/p>
&lt;p>and publishing an ad as if they all used the same software system – but&lt;/p>
&lt;p>without actually requiring everyone to switch to a different software system or&lt;/p>
&lt;p>vendor.”_&lt;/p></description></item><item><title>Blogs, Well Duh!</title><link>https://www.crossref.org/blog/blogs-well-duh/</link><pubDate>Tue, 03 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/blogs-well-duh/</guid><description>&lt;p>Steve Rubel has a reponse &lt;a href="http://www.micropersuasion.com/2006/10/duh_of_course_c.html" target="_blank">here&lt;/a> to Lexis-Nexis’ &lt;a href="http://news.com.com/2100-1025_3-6121778.html?part=rss&amp;amp;#038;tag=6121778&amp;amp;#038;subj=news" target="_blank">survey&lt;/a> on consumers preferred outlets for breaking news and their rubbishing of blogs as a credible publishing forum. It’s something called, er, the &lt;a href="http://en.wikipedia.org/wiki/Long_tail" target="_blank">Long Tail&lt;/a> by Chris Anderson at &lt;em>Wired Magazine&lt;/em>.&lt;/p></description></item><item><title>Couple Web Feeds to Note</title><link>https://www.crossref.org/blog/couple-web-feeds-to-note/</link><pubDate>Tue, 03 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/couple-web-feeds-to-note/</guid><description>&lt;p>Sorry to be somewhat backwards, but just in case any folks didn’t already know there’s a couple new feeds set up recently (or at least they’re newish to me 🙂&lt;/p>
&lt;ul>
&lt;li>&lt;a href="https://web.archive.org/web/20060923073323/http://www.stm-assoc.org/home/rss.xml" target="_blank">News from STM&lt;/a> (from the &lt;a href="http://www.stm-assoc.org/" target="_blank">STM Association&lt;/a>)
&lt;ul>
&lt;li>&lt;a href="http://efoundations.typepad.com/" target="_blank">eFoundations&lt;/a> (from Andy Powell and Pete Johnston at &lt;a href="https://web.archive.org/web/20061002052838/http://www.eduserv.org.uk/foundation/" target="_blank">Eduserv Foundation&lt;/a> in the UK) &lt;/ul>&lt;/li>
&lt;/ul>
&lt;/li>
&lt;/ul></description></item><item><title>Science Commons</title><link>https://www.crossref.org/blog/towards-a-science-commons/</link><pubDate>Tue, 03 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/towards-a-science-commons/</guid><description>&lt;p>Peter Murray-Rust posts on the SPARC-OpenData mailing list about a Commons for Science Conference (Oct. 3/4 in DC). The meeting is invitation-only but the papers are online (see &lt;a href="https://web.archive.org/web/20061121055622/http://www.spatial.maine.edu/icfs" target="_blank">here&lt;/a>) and there should be public reports. The meeting underlines the importance of Open Data. There’s a brief abstract below.&lt;/p>
&lt;p>_“The sciences depend on access to and use of factual data. Powered by&lt;/p>
&lt;p>developments in electronic storage and computational capability,&lt;/p>
&lt;p>scientific inquiry today is becoming more data-intensive in almost&lt;/p>
&lt;p>every discipline. Whether the field is meteorology, genomics,&lt;/p>
&lt;p>medicine, ecology, or high-energy physics, modern research depends on&lt;/p>
&lt;p>the availability of multiple databases, drawn from multiple public&lt;/p>
&lt;p>and private sources; and the ability of those diverse databases to be&lt;/p>
&lt;p>searched, recombined, and processed.”_&lt;/p></description></item><item><title>CrossTech</title><link>https://www.crossref.org/blog/crosstech/</link><pubDate>Mon, 02 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/crosstech/</guid><description>&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>Just a couple comments about CrossTech:&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>1. Shouldn’t it (or couldn’t it) be linked to from the Crossref home page? (This is a public read list after all and so should be made more widely available.) Maybe at some point could be announced on some lists of interest.&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>2. Would be very nice to (at least) have a count of membership. I would also like to canvas opinions about making names of the membership public. What do others think about this?&lt;/p>
&lt;p>&lt;code>&amp;lt;span &amp;gt;&lt;/code>At the end of the day though this facility needs to be driven, otherwise it will end up being just another pier over the water (i.e. a ‘disappointed bridge’ And sorry for cribbing again from JAJ).&lt;/p></description></item><item><title>Wiley Does RSS, Too!</title><link>https://www.crossref.org/blog/wiley-does-rss-too/</link><pubDate>Mon, 02 Oct 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/wiley-does-rss-too/</guid><description>&lt;p>&lt;a href="http://rafaelsidi.blogspot.com/2006/10/rss-feeds-in-wiley-journals.html" target="_blank">This post&lt;/a> blogged by Rafael Sidi at &lt;a href="http://www.ei.org/" target="_blank">EEI&lt;/a>. Wiley are now dishing out RSS feeds. And moreover from a cursory inspection (see e.g. here for the &lt;em>American Journal of Human Biology&lt;/em>) it seems like they are putting out RSS 1.0 (RDF) and DC/PRISM metadata. Don’t know if there’s anyone from Wiley who can comment on this. But this really is the best news. (Now, who else can we get to join the party. 😉&lt;/p>
&lt;p>[&lt;em>Editor&amp;rsquo;s update: Link to Wiley was broken and removed. January 2021&lt;/em>]&lt;/p></description></item><item><title>ACAP - (Automated Content Access Protocol)</title><link>https://www.crossref.org/blog/acap-automated-content-access/</link><pubDate>Fri, 29 Sep 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/acap-automated-content-access/</guid><description>&lt;p>The World Association of Newspapers is developing ACAP - see the &lt;a href="https://web.archive.org/web/20070612185117/http://www.wan-press.org/article11943.html" target="_blank">press release&lt;/a> which will be machine readable rights information that search engines would read and act on in an automated way. Rightscom is working on the project and the IPA and EPC (European Publishers Council) are involved.&lt;/p>
&lt;p>Publishers presenting a united front to search engines is a good thing but I’m somewhat skeptical about how such a system would work without being overly complicated. However, the idea of getting more information to the search engines when they are crawling sites is a good idea but what will the publishers say to the search engines? If you get much above crawl/don’t crawl then you need a bilateral agreement that has to be negotiated.&lt;/p></description></item><item><title>PRISM Use Cases</title><link>https://www.crossref.org/blog/prism-use-cases/</link><pubDate>Mon, 25 Sep 2006 00:00:00 +0000</pubDate><author>Tony Hammond</author><guid>https://www.crossref.org/blog/prism-use-cases/</guid><description>&lt;p>At last week’s PRISM Face to Face meeting at Time Inc. (NY), Linda Burman raised the question of how (STM) publishers were using PRISM beyond RSS. I gave a brief presentation of how we at Nature were using PRISM: RSS (well you all know about that), Connotea (our social bookmarking tool), SRU (Search/Retrieve by URL), and OTMI (Open Text Mining Interface - which we’ll shortly be making available for wider comment). Be interested to learn if anyone else is using PRISM in other ways.&lt;/p></description></item><item><title>password control</title><link>https://www.crossref.org/blog/password-control-1/</link><pubDate>Mon, 11 Sep 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/password-control-1/</guid><description>&lt;p>We’ve taken the top level access control off the site. This means that anyone can read the blog but posting will be limited to those with an account (Crossref members and invited participants). This will make it possible to include the CrossTech feed in your regular RSS reader/aggregator. We’ll soon be posting some general terms and conditions for this blog and also sending a message to all Crossref members about joining so we should see membership (and activity) pick up.&lt;/p></description></item><item><title>Embedding standardized metadata in HTML</title><link>https://www.crossref.org/blog/post/</link><pubDate>Tue, 05 Sep 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/post/</guid><description>&lt;p>On the iSpecies blog Rod Page &lt;a href="http://ispecies.blogspot.com/2006/08/extracting-dois.html" target="_blank">describes how he extracts DOIs&lt;/a> from Google Scholar results - he does use the &lt;a href="https://www.crossref.org/education/retrieve-metadata/openurl/" target="_blank">Crossref OpenURL interface&lt;/a> and &lt;a href="https://web.archive.org/web/20061205061750/http://www.connotea.org/" target="_blank">Connotea&lt;/a> to get DOIs too. He also says “DOIs are pretty cool” which is good!&lt;/p>
&lt;p>On another blog post to SemAnt Page &lt;a href="http://semant.blogspot.com/2006/08/lsids-and-dois-for-ant-and-other.html" target="_blank">describes how he uses LSIDs and DOIs for Ant literature&lt;/a>.&lt;/p>
&lt;p>It seems that there is more and more of this type of use of the DOI so its great we have the OpenURL interface. Could the type of stuff that Page is doing be helped by publishers embedding metadata in their HTML pages? This could include licensing info and information for search engine crawlers.&lt;/p>
&lt;p>Ingenta and BMC embed metadata (are there others?) - here is a snippet from a BMC article -&lt;/p>
&lt;pre>&amp;lt;cc:Work rdf:about="http://www.biomedcentral.com/1471-2148/3/16"&amp;gt;
&amp;lt;cc:license rdf:resource="http://creativecommons.org/licenses/by/2.0/"/&amp;gt;
&amp;lt;/cc:Work&amp;gt;
&amp;lt;cc:License rdf:about="http://creativecommons.org/licenses/by/2.0/"&amp;gt;
&amp;lt;cc:permits rdf:resource="http://web.resource.org/cc/Reproduction"/&amp;gt;
&amp;lt;cc:permits rdf:resource="http://web.resource.org/cc/Distribution"/&amp;gt;
&amp;lt;cc:requires rdf:resource="http://web.resource.org/cc/Notice"/&amp;gt;
&amp;lt;cc:requires rdf:resource="http://web.resource.org/cc/Attribution"/&amp;gt;
&amp;lt;cc:permits rdf:resource="http://web.resource.org/cc/DerivativeWorks"/&amp;gt;
&amp;lt;/cc:License&amp;gt;
&amp;lt;item rdf:about="http://www.biomedcentral.com/1471-2148/3/16"&amp;gt;
&amp;lt;title&amp;gt;Inter-familial relationships of the shorebirds (Aves: Charadriiformes) based on nuclear DNA sequence data&amp;lt;/title&amp;gt;
&amp;lt;dc:title&amp;gt;Inter-familial relationships of the shorebirds (Aves: Charadriiformes) based on nuclear DNA sequence data&amp;lt;/dc:title&amp;gt;
&amp;lt;dc:creator&amp;gt;Ericson, Per GP&amp;lt;/dc:creator&amp;gt;
&amp;lt;dc:creator&amp;gt;Envall, Ida&amp;lt;/dc:creator&amp;gt;
&amp;lt;dc:creator&amp;gt;Irestedt, Martin&amp;lt;/dc:creator&amp;gt;
&amp;lt;dc:creator&amp;gt;Norman, Janette A&amp;lt;/dc:creator&amp;gt;
&amp;lt;dc:identifier&amp;gt;info:doi/10.1186/1471-2148-3-16&amp;lt;/dc:identifier&amp;gt;
&amp;lt;dc:identifier&amp;gt;info:pmid/12875664&amp;lt;/dc:identifier&amp;gt;
&amp;lt;dc:source&amp;gt;BMC Evolutionary Biology 2003, 3:16&amp;lt;/dc:source&amp;gt;
&amp;lt;dc:date&amp;gt;2003-07-23&amp;lt;/dc:date&amp;gt;
&amp;lt;prism:publicationName&amp;gt;BMC Evolutionary Biology&amp;lt;/prism:publicationName&amp;gt;
&amp;lt;prism:publicationDate&amp;gt;2003-07-23&amp;lt;/prism:publicationDate&amp;gt;
&amp;lt;prism:volume&amp;gt;3&amp;lt;/prism:volume&amp;gt;
&amp;lt;prism:number&amp;gt;1&amp;lt;/prism:number&amp;gt;
&amp;lt;prism:section&amp;gt;Research article&amp;lt;/prism:section&amp;gt;
&amp;lt;prism:startingPage&amp;gt;16&amp;lt;/prism:startingPage&amp;gt;
&amp;lt;/item&amp;gt;
&amp;lt;/rdf:RDF&amp;gt;
&lt;/pre></description></item><item><title>password control</title><link>https://www.crossref.org/blog/password-control/</link><pubDate>Tue, 29 Aug 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/password-control/</guid><description>&lt;p>Hi,&lt;/p>
&lt;p>At the moment a username and password is needed to read the CrossTech blog in addition to needing an account to post entries. However, it may be better to take off the access control to read the blog - this would mean that services like Technorati and Google could index the blog, which they can’t do at the moment and posting to the blog would be public.&lt;/p>
&lt;p>As people come on to the list maybe the first thing to comment on is whether we should take off the access control to read the blog. What to people think?&lt;/p></description></item><item><title>SEMANTIC WEB: GOOGLE HAS THE ANSWERS, BUT NOT THE QUESTIONS</title><link>https://www.crossref.org/blog/semantic-web-google-has-the-an-1/</link><pubDate>Tue, 22 Aug 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/semantic-web-google-has-the-an-1/</guid><description>&lt;p>Posted by special permission from EPS &lt;!--broken link www.epsltd.com.-->&lt;/p>
&lt;p>EPS INSIGHTS :: 01/08/2006&lt;/p>
&lt;p>SEMANTIC WEB: GOOGLE HAS THE ANSWERS, BUT NOT THE QUESTIONS&lt;/p>
&lt;ul>
&lt;li>The Google v. Semantic Web discussion at the AAAI (American Association for Artificial Intelligence) featured plenty of confrontation and even some rational argument, but it may chiefly be remembered as the day when Google responded to the challenge of semantic web thinking by saying that the semantic web movement did not matter - thereby demonstrating that it did.&lt;/li>
&lt;/ul>
&lt;p>by David Worlock, Chairman&lt;/p>
&lt;p>And we thought that the real battle this year was between net neutrality and the network owners. Or between those who think that click fraud crucially undermines Google, and those who think it doesn’t matter. We were wrong. July’s “Thrilla in Manila” was the discussion between Tim Berners-Lee and the Google Director of Search, Peter Norvig, at the Boston AAAI meeting. And it is an important moment because Berners-Lee’s assertion that the last semantic web building blocks are moving into place comes at exactly the time when Google seems anxious to diminish semantic web searching. It is a good guess that the latter results from a stimulus dictated by threat. A world where keyword searching was reduced to ground floor in a building of many storeys where it may even be an advantage to be a new market entrant with no history is a world where Google would have to progressively re-invent itself. And what is more difficult, in the recent history of these things, than a company created by a technology re-inventing itself in terms of a new technology?&lt;/p>
&lt;p>So Google’s Boston blows were first of all aimed at the reality test. Like STM publishers pointing to the unlikelihood of academic researchers adding metadata to articles for repository filing, Google pointed to user and webmaster incompetence as the chief reason why semantic interoperability was doomed to a long, slow and painful generative process. If users cannot configure a server or write HTML, how can they understand all this stuff? And then suppliers would slow it down by trying to make it proprietary. And then, machine to machine interoperability would encourage deception (obviously the click fraud business is hurting). The answer to the Semantic Web, from a Google stance, thus appears to be: very interesting, but not very soon.&lt;/p>
&lt;p>Dancing like a bee and stinging like a butterfly, Tim Berners-Lee clearly had the answers to this. The reason why the semantic web appears threatening to those who have entrenched tenancies in search is probably because it is going quicker than expected. His original ‘layer cake’ diagram, a feature on the conference circuit for five years, could now be completed at all levels. RDF as a data language is now well-established (think of RSS). Ontologies, mostly in narrow vertical domains, are moving into place, though there may be issues about relating them to each other. Query and rules languages now populate the other layers, with one of the former, SPARQL, emerging this year as a W3C candidate recommendation (6 April 2006). In a real sense this is the missing link which makes the Semantic Web a viable proposition, and at the same time joins it to the popular hubbub around Web 2.0. If part of the latter dream is data sourcing from a wide variety of service entities to create new web environments from composite content, then SPARQL sitting on top of RDF looks closest to realising that idea. In an important note in O’Reilly XML.com (SPARQL: Web 2.0 Meet the Semantic Web; 16 September 2005), Kendall Clark wrote “Imagine having one query language, and one client, that lets you arbitrarily slice the data of Flickr, del.icio.us, Google, and your three other favourite Web 2.0 sites, all FOAF files, all of the RSS 1.0 feeds (and, eventually, I suspect, all Atom 1.0 feeds) plus MusicBrainz etc”.&lt;/p>
&lt;p>Imagining that might well impel you into the ring with Tim Berners-Lee. If Google has to be re-invented, the process of recognition of change has to be slowed. Denying the speedy reality of the semantic web becomes essential while furious R&amp;amp;D takes place. And content and information service providers are not just spectators of this, but participants too.&lt;/p>
&lt;p>© Electronic Publishing Services&lt;/p>
&lt;!-- links broken, not in wayback machine
>From the EPS archive
&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;&amp;#8212;-
Topix.net: semantic web building block? EPS Insights, 31 March 2005 ::
&lt;http://www.epsltd.com/accessArticles.asp?articleType=1&amp;#038;updateNoteID=1557>
Spotlight on . RDF and semantic web, imi, June 2006 ::
http://www.epsltd.com/accessArticles.asp?articleType=2&amp;#038;articleID=384&amp;#038;imiID=8
1
Semantic Web: another milestone reached, EPS Insights, 27 February 2004 ::
http://www.epsltd.com/accessArticles.asp?articleType=1&amp;#038;updateNoteID=1191 -->
&lt;p>Related links&lt;/p>
&lt;p>————————————-&lt;/p>
&lt;p>Google :: &lt;a href="http://www.google.com" target="_blank">http://www.google.com&lt;/a>&lt;/p>
&lt;p>AAAI :: &lt;a href="http://www.aaai.org" target="_blank">http://www.aaai.org&lt;/a>&lt;/p>
&lt;p>Kendall Clark - SPARQL: Web 2.0 Meet the Semantic Web ::&lt;/p>
&lt;p>[http://www.oreillynet.com/xml/blog/2005/09/sparql_web_20_meet_the_semanti.ht&lt;/p>
&lt;p>ml]&lt;a href="http://www.oreillynet.com/xml/blog/2005/09/sparql_web_20_meet_the_semanti.ht" target="_blank">1&lt;/a>&lt;/p>
&lt;p>W3C :: &lt;a href="http://www.w3.org" target="_blank">http://www.w3.org&lt;/a>&lt;/p>
&lt;p>Flickr :: &lt;a href="http://www.flickr.com" target="_blank">http://www.flickr.com&lt;/a>&lt;/p>
&lt;p>FOAF :: &lt;a href="http://www.foaf-project.org/" target="_blank">http://www.foaf-project.org/&lt;/a>&lt;/p>
&lt;p>MusicBrainz :: &lt;a href="http://musicbrainz.org/" target="_blank">http://musicbrainz.org/&lt;/a>&lt;/p>
&lt;p>————————————-&lt;/p></description></item><item><title>Welcome to CrossTech</title><link>https://www.crossref.org/blog/welcome-to-crosstech/</link><pubDate>Tue, 22 Aug 2006 00:00:00 +0000</pubDate><author>Ed Pentz</author><guid>https://www.crossref.org/blog/welcome-to-crosstech/</guid><description>&lt;p>Welcome to CrossTech, a new access-controlled blog to discuss developments in the online scholarly publishing world. Crossref’s mission is to foster dialogue and information sharing among publishers to enable innovation and collaboration. In order to do things collaboratively, publishers need to share information and communicate in an appropriate manner that takes into account anti-trust and competitive issues. The online publishing world changes quickly and many developments are driven by organisations outside of scholarly publishing so CrossTech provides publishers a “protected” space to discuss issues.&lt;/p>
&lt;p>Nature Publishing Group’s Xanadu blog is the model for CrossTech. Our hope is that CrossTech will build on the idea of Xanadu.&lt;/p>
&lt;p>CrossTech Objectives: To provide a neutral forum where participants can post and discuss technical issues, link to relevant items on the Internet, make others aware of important developments and share and learn from each others’ experiences. CrossTech will promote collaboration and innovation among publishers in an appropriate manner taking account of anti-trust and competitive issues.&lt;/p>
&lt;p>The main goals of CrossTech are:&lt;/p>
&lt;ul>
&lt;li>
&lt;p>To provide a common forum for discussing new publishing technologies&lt;/p>
&lt;/li>
&lt;li>
&lt;p>To develop a publisher technology community&lt;/p>
&lt;/li>
&lt;li>
&lt;p>To determine common directions for key publishing technologies&lt;/p>
&lt;/li>
&lt;li>
&lt;p>To foster best practices - and decide the best route to codify or standardize those practices&lt;/p>
&lt;/li>
&lt;li>
&lt;p>To share experiences&lt;/p>
&lt;/li>
&lt;li>
&lt;p>To act as an alerting mechanism for publishers to learn of relevant, new technology developments&lt;/p>
&lt;/li>
&lt;/ul>
&lt;p>Please let us know if you would like to participate. A username and password will be needed to read, post and comment. To obtain a username and password to post and comment, please email Anna Tolwinska &lt;a href="mailto:annat@crossref.org">annat@crossref.org&lt;/a>.&lt;/p>
&lt;p>We look forward to having you participate!&lt;/p></description></item></channel></rss>