2025 June 17
Evolving the preprint evaluation world with Sciety
This post is based on an interview with the Sciety team at eLife.
As the range of public services (e.g. RSS) offered by publishers has matured, a question arises: how can publishers expose their public data so that users can discover it? In particular, DOI now provides a persistent link infrastructure for accessing primary content. How can publishers use that infrastructure to their advantage?
Anyway, I offer this figure as to how I see the current lie of the land as regards DOI services and data.
For infotainment only (and because it’s a pretty printing). Glimpse into the dark world of DOI. Here, the handle contents for doi:10.1038/nature06930 exposed as a standard OpenHandle “Hello World” document. Browser image courtesy of Processing.js and Firefox 3 RC1.
So, why is it just so difficult to reference OpenURL?
Apart from the standard itself (hardly intended for human consumption - see abstract page here and PDF and don’t even think to look at those links - they weren’t meant to be cited!), seems that the best reference is to the Wikipedia page. There is the OpenURL Registry page at http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets but this is just a workshop. Not much there beyond the OpenURL registered items. (And why does the page seem uncertain as to whether it’s a “repository” or a “registry”? Is there no difference between those terms?) The only other links are to a mix of HTML and PDF resources. (There really should be a health warning on links to PDFs - they are just not browser friendly documents.) And, I do have to wonder at this: the registry page has a link to the unofficial 0.1 version but not to the 1.0 standard. Er, why? And don’t even try this link: http://openurl.info/. Not much info there.
So, the big guns have decided that XRI is out. In a message from the TAG yesterday, variously noted as being “categorical” (Andy Powell, eFoundations) and a “proclamation” (Edd Dumbill, XML.com), the co-chairs (Tim Berners-Lee and Stuart Williams) had this to say:
“We are not satisfied that XRIs provide functionality not readily available from http: URIs. Accordingly the TAG recommends against taking the XRI specifications forward, or supporting the use of XRIs as identifiers in other specifications.”
Following on from yesterday’s post about making metadata available on our Web pages, I wanted to ask here about “metadata reuse policies”. Does anybody have a clue as to what might constitute a best practice in this area? I’m specifically interested in license terms, rather than how those terms would be encoded or carried. Increasingly we are finding more channels to distribute metadata (RSS, HTML, OAI-PMH, etc.) but don’t yet have any clear statement for our customers as to how they might reuse that data.
Well, we may not be the first but wanted anyway to report that Nature has now embedded metadata (HTML meta tags) into all its newly published pages including full text, abstracts and landing pages (all bar four titles which are currently being worked on). Metadata coverage extends back through the Nature archives (and depth of coverage varies depending on title). This conforms to the W3C’s Guideline 13.2 in the Web Content Accessibility Guidelines 1.0 which exhorts content publishers to “provide metadata to add semantic information to pages and sites”.
Metadata is provided in both DC and PRISM formats as well as in Google’s own bespoke metadata format. This generally follows the DCMI recommendation “Expressing Dublin Core metadata using HTML/XHTML meta and link elements”, and the earlier RFC 2731 “Encoding Dublin Core Metadata in HTML”. (Note that schema name is normalized to lowercase.) Some notes:
dc.identifier - term in URI form which is the Crossref recommendation for citing DOI.
prism.doi - for disclosing the native DOI form. This requires the PRISM namespace declaration to be bumped to v2.0. We might consider synchronizing this change with our RSS feeds which are currently pegged at v1.2, although note that the RSS module mod_prism currently applies only to PRISM v1.2.
prism.url - term to link back (through the DOI proxy server) to the content site. The namespace issue listed above still holds.
citation_ - terms are not anchored in any published namespace which does make this term set problematic in application reuse. It would be useful to be able to reference a namespace (e.g. “rel="schema.gs" href="..."”) for these terms and to cite them as e.g. “gs.citation_title”.
Further to my previous post “NIH Mandate and PMCIDs” we’ve been looking into linking to articles on publishers’ sites from PubMed Central (PMC). There are a couple of ways this happens currently (see details below) but these are complicated and will lead to broken links and more difficulty for PMC and publishers in managing the links. Crossref is going to be putting together a briefing note for its members on this soon.
The main issue we are raising with PMC, and that we will encourage publishers to raise too, is why doesn’t PMC just automatically link DOIs? Most of the articles in PMC have DOIs, so this would require very little effort from PMC and no effort from publishers, and would give readers a persistent link to the publisher’s version of an article.
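To give a feel for how little work automatic DOI linking involves, here is a minimal Python sketch. It is not how PMC or Crossref does it. The proxy address https://doi.org/, the loose DOI_PATTERN regular expression, and the function names doi_to_url and link_dois are all assumptions made for illustration.

```python
import re
from html import escape
from urllib.parse import quote

# Assumption: links go through the public DOI proxy server.
DOI_PROXY = "https://doi.org/"

# Assumption: a loose pattern for DOIs in running text, not an official grammar.
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/[^\s\"<>]+")


def doi_to_url(doi: str) -> str:
    """Turn a native or 'doi:'-prefixed DOI into a link through the proxy."""
    doi = doi.strip()
    if doi.lower().startswith("doi:"):
        doi = doi[4:]
    # Percent-encode anything unsafe in a URL path, but keep the slash.
    return DOI_PROXY + quote(doi, safe="/")


def link_dois(text: str) -> str:
    """Wrap each DOI found in the text in an HTML anchor to the proxy."""

    def repl(match: re.Match) -> str:
        # Sentence punctuation right after a DOI is not part of the DOI.
        doi = match.group(0).rstrip(".,;)")
        trailing = match.group(0)[len(doi):]
        return f'<a href="{doi_to_url(doi)}">{escape(doi)}</a>{trailing}'

    return DOI_PATTERN.sub(repl, text)


if __name__ == "__main__":
    print(doi_to_url("doi:10.1038/nature06930"))
    print(link_dois("See 10.1038/nature06930."))
```

The pattern will miss or over-match some real DOIs, so a production linker would more likely start from the DOI already held in the article metadata than from free text.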
Following up the earlier post on OpenHandle, there are now a number of language examples which have been contributed to the project. The diagram below shows the OpenHandle service in schematic, with support for various languages. Briefly, OpenHandle aims to provide a web services interface to the Handle System to simplify access to the data stored for a given Handle.
(Note that the diagram is an HTML imagemap and all elements are “clickable”.)
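For a flavour of what “access to the data stored for a given Handle” looks like from a client, here is a small Python sketch. It does not use OpenHandle’s own interface. Instead it assumes the JSON shape returned by the public Handle proxy REST endpoint (HANDLE_API below). The sample response is illustrative only, with placeholder values rather than real handle data, and the function names are hypothetical.

```python
import json

# Assumption: the public Handle proxy REST endpoint, not OpenHandle's own API.
HANDLE_API = "https://hdl.handle.net/api/handles/"

# Illustrative response only: the values are placeholders, not real handle data.
SAMPLE_RESPONSE = """
{
  "responseCode": 1,
  "handle": "10.1038/nature06930",
  "values": [
    {"index": 1, "type": "URL",
     "data": {"format": "string", "value": "https://example.org/landing-page"}},
    {"index": 2, "type": "EMAIL",
     "data": {"format": "string", "value": "admin@example.org"}}
  ]
}
"""


def api_url(handle: str) -> str:
    """Build the lookup URL for a handle (assumed endpoint layout)."""
    return HANDLE_API + handle


def handle_values(response_text: str) -> dict:
    """Group the data values of a handle record by their type."""
    record = json.loads(response_text)
    # Response code 1 signals success in the Handle protocol.
    if record.get("responseCode") != 1:
        raise ValueError(f"handle lookup failed: {record.get('responseCode')}")
    by_type = {}
    for value in record.get("values", []):
        by_type.setdefault(value["type"], []).append(value["data"]["value"])
    return by_type


if __name__ == "__main__":
    print(api_url("10.1038/nature06930"))
    print(handle_values(SAMPLE_RESPONSE))
```

Fetching the live record would mean passing the body returned from api_url(...) to handle_values; the sketch sticks to a canned response so it runs without network access.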
The NIH Public Access Policy says “When citing their NIH-funded articles in NIH applications, proposals or progress reports, authors must include the PubMed Central reference number for each article” and the FAQ provides some examples of this:
Examples:
Varmus H, Klausner R, Zerhouni E, Acharya T, Daar A, Singer P. 2003. PUBLIC HEALTH: Grand Challenges in Global Health. Science 302(5644): 398-399. PMCID: 243493
Zerhouni, EA. (2003) A New Vision for the National Institutes of Health. Journal of Biomedicine and Biotechnology (3), 159-160. PMCID: 400215
Highlighting our community in Colombia
2025 June 05