Does a CrossRef DOI identify a "work?"
Tony's recent thread on making DOIs play nicely in a linked data world has raised an issue I've meant to discuss here for some time- a lot of the thread is predicated on the idea that CrossRef DOIs are applied at the abstract "work" level. Indeed, that it what it currently says in our guidelines. Unfortunately, this is a case where theory, practice and documentation all diverge.
When the CrossRef linking system was developed it was focused primarily on facilitating persistent linking amongst journals and conference proceedings. The system was quickly adapted to handle books and more recently to handle working papers, technical reports, standards and “components”- a catchall term used to refer to everything from individual article images to database records.
In practice the content outside of the core journals and conference proceedings has accounted for relatively low volume. However, we expect that over the next few years this will change and that books and databases will increasingly drive the future growth in CrossRef’s citation linking services. Interestingly, these content types all share characteristics that make them substantially different from the journals and conference proceedings that we have hitherto focused on.
Both books and databases introduce new challenges to technology and policies of our citation linking service. The challenges revolved around two areas:
- Structure: Both books and databases can have complex structures and the publishers of this content are likely to require granular identification of these content substructures along with a mechanism for documenting the relationship between these substructures (e.g. this section is part of this chapter which is part of this monograph which is part of this series)
- Versioning: Unlike typical journals and conference proceedings, books and database records sometimes change over time.
When confronted with the issues of structure and versioning publishers are often tempted to take shortcuts and decide to simply assign DOIs at the highest level structure and to the “work” instead of a particular “manifestation” or version of that work. Indeed, section 5.5 of CrossRef's DOI Name Information and Guidelines recommends this. But this approach could have a negative impact on the integrity of the scholarly citation record that CrossRef is attempting to maintain.
Fundamentally, CrossRef DOIs are aimed at providing a persistent online citation infrastructure for scholarly and professional publishers. Consequently, decisions about where to apply CrossRef DOIs should be guided by common expectations about the way in which citations work. Citations are typically used to credit ideas or provide evidence. A reader follows a citation in order to obtain more detail or to verify that an author is accurately representing the item cited. A rule of thumb is that a reader has a reasonable expectation that when they follow a citation, they will be taken to what the author saw when creating the citation. Any divergent behavior could result in the reader concluding that the author was misrepresenting the item cited. A further implication of this is that any changes to content that are likely to effect the crediting or interpretation of the content should result in that changed content getting a new CrossRef DOI.
Typically, this means that CrossRef DOIs should be probably assigned at the expression level and different expressions should be assigned different CrossRef DOIs. This is because assigning a CrossRef DOI at the higher "work" level is generally not granular enough to guarantee that a reader following the citation will see what the author saw when creating the citation. For example, one translation of a work might be substantially different from another translation of the same work. Similarly a draft version of a work might be substantially different from the final published version of the work. In each case, resolving a citation to a different expression of the work than the expression that was originally cited might result in the reader interpreting the content differently than the citing author.
In general, different "equivalent manifestations" of the same work can safely be assigned the same CrossRef DOI. So, for instance, the HTML formatted version an article and the PDF formatted version of an article can almost always be assigned the same CrossRef DOI. Any differences between the two are unlikely to affect the crediting of, or reader's interpretation of, the work. But sometimes it is even possible that different manifestations of an expression will differ enough to merit different CrossRef DOIs. For instance, a semantically enhanced version of an article might require new crediting (e.g. the parties responsible for adding the semantic information) and the resulting semantic enhancement may conceivably alter the reader's interpretation of the article.
Unfortunately, there is no hard and fast rule about where and when to assign new CrossRef DOIs. Instead there is only a guideline, namely:
"Assign new CrossRef DOIs to content in a way that will ensure that a reader following the citation will see something as close to what the original author cited as is possible."
The implications of this to publishers are important, especially when they are assigning DOIs to protean content types. For instance, it may mean that:
- Book publishers should be expected to keep old editions of books available for link resolution purposes.
- Publishers of content that can change rapidly (e.g. by the second) should provide facilities for creating frozen, archived snapshots of content for citation purposes.
- All publishers of protean content should issue guidelines instructing researchers on when it is appropriate to cite a work, manifestation or version.
CrossRef needs to actively consider these issues as publishers start assigning CrossRef DOIs to more dynamic types of content. Minimally, we should be able to provide publishers with recommendations on how to make dynamic content citable. We may even want to consider enshrining certain types of behavior in our terms and conditions so as to ensure the future integrity of the scholarly citation record.
In short, we need to update our guidelines.

Comments
I think the origin of crossref doi representing as abstract "work" came out of an emphasis that equates the print version of an article with the electronic version. Your example of the semantically enhanced article comes close to breaking that equivalency (any digital version of print is to some extent semantically enhanced.) and I think you ought to re-examine the implications.
Posted by: Eric Hellman | February 11, 2010 12:40 PM
@eric I agree with you that "any digital version of print is to some extent semantically enhanced". I suppose my overall point is that we need to stop trying to define a hard-and-fast rule that ties the CrossRef DOI to a particular layer of FRBR and instead think about what we are trying to do in citing something. And the rule-of-thumb I'm advocating is nothing more than a variation on Bateson's "difference that makes a difference". Some semantic enhancements make a difference. Some don't. This will be an editorial judgement call, which I expect will make some geeks (including me) very twitchy because it can not really be codified.
Posted by: Geoffrey Bilder | February 12, 2010 5:44 AM
Geoff, I would see that as being the baseline case. DOIs (even CrossRef DOIs) have a more generic role to play in managing and organizing content online.
The view you present above (reference linking) and the concerns you bring up (structuring, versioning) might tend to position CrossRef as essentially nothing more than a global CMS for scholarly publishing. In that view the primary objects are manifestations and would in Web architecture terms be Information Resources. So, for example, two 'equivalent' manifestations (e.g. HTML and PDF) will be managed by the same DOI and can be accessed using regular HTTP content negotiation.
And I can see how versioning concerns (and dynamic content) could lead you even further down that way of thinking.
(Aside: Btw, versioning and dynamic content immediately brings to mind Memento as a candidate technology which essentially utilizes time-based content negotiation.)
On the other hand, object structuring especially seems to drive us IMO in the opposite direction. Here we are principally concerned with a structure that contains substructre which might be recognized as an entity in its own right, i.e. a work within a work.
A instructive working example might be an image within an article. The abstract article (the work) has an identifier - the DOI - and the image (also a work) has its own identifier - the DOI. Now let's say our image has two formats PNG and JPEG. No problem we have a single identifier and use content negotiation to mediate the formats. Now let's say we focus on just one format (JPEG, say) and consider two or three display sizes plus a thumbnail version. Are we going to assign seperate DOIs for each sizing? No, of course not. They will still be managed by a single DOI which will be a work (or possibly expression) identifier - but it will nonetheless be an abstract Non-Information Resource.
Similar considerations can be applied to other component types. Substructure will likely be dealt with using some level of asbtraction. And the same is likely true of versioning too.
It is evidently a more scalable solution to deal with this kind of an object oriented approach (with a single public identifier plus accessor methods, e.g content negotiation, etc) than using separate public identifiers with their own metadata records for individual components and types.
You say that:
Yes - indeed. But this also begs the question of how (or who) CrossRef will consult to arrive at a more informed set of guidelines. I presume that this will be taken to the Technical Working Group for its input?
Posted by: Tony Hammond | February 12, 2010 10:43 AM
@tony
I think the "baseline case" is the "de facto case." CrossRef DOIs were originally implemented to facilitate linking of citations (Hence the name of CrossRef's parent organization "Publishers International Linking Association")
I am not as steeped in the history of CrossRef guidelines as some, but it seems to me that use of the word "work" at the time was probably not in the FRBR sense and had more to do (as Eric said) with the notion of equating print with electronic.
I am not at all sure what you mean by "global CMS" but it certainly doesn't sound like anything I would advocate. For one thing, we don't manage *any* content. At best we manage citation pointers to content.
And, as I said above, I'm not sure we should be trying to tie the CrossRef DOI to a particular FRBR layer. Sometimes one cites things at the work level, sometimes at the expression level and sometimes at the manifestation level. I suspect that we are more often concerned about the expression level, but that could change if manifestations start to become too disimilar.
So, as you say, in the "baseline case" you might be able to use content negotiation to access either the PDF or HTML representation of an article. But first we need a mechanism to distinguish between when we are getting the thing pointed to by the CrossRef DOI (the PDF , HTML, etc.) as opposed to "something about the thing" (e.g. the landing page, metadata record, etc.). This might not seem like a particularly ambitious approach- but it seems to me to have the advantage of being pragmatic and roughly aligned with current practice.
And, of course, I think we need to figure out how to best map relationships between citable objects- but that is where I think we might want to draw a pragmatic line. What you seem to be advocating above is that we get into tracking compound documents in their entirety. I'm not sure what business we have keeping track of image thumbnails or other display-related detritus of the publication process. You say CrossRef DOIs have "more generic role to play in managing and organizing content online". That truly does sound like the asset management function of a CMS and that really seems to be stretching CrossRef's mandate.
As we try to figure out how to make the CrossRef DOI play better in the linked data world, we will, as always, consult with our members (including the TWG), knowledgeable people in the industry, etc. As is the case with all of our major projects, we will also form a special working group. And, of course, anything that we do will ultimately be approved by the PILA board.
Posted by: Geoffrey Bilder | February 12, 2010 12:21 PM
Just to toss in another thing to think about. Suppose we have an author id (i.e. ORCID) and we want to list the articles by the author. You'd think of it as an error if two sort-of-the-same versions of an article were listed separately. Following this line of thought, you're led to the notion of having stronger typing of crossref doi's.
Posted by: Eric Hellman | February 12, 2010 2:54 PM
Contemplating changing the guidelines has impacts on the fee structure as well, I think. From the CrossRef Publisher Fees page:
Posted by: Peter Murray | February 12, 2010 8:30 PM
Even beyond ejournal/ebook distinctions, there continues to be very little differentiation in the industry between the linking and standards requirements for of different classes of ebooks. Reference ebooks and databases have fallen to the lowest rung of the “books and databases" discussion ladder. And, from Paratext’s standpoint, it is easy to see that lack of engagement with discovery, cross search, and linking of reference works has resulted in declining use of reference in libraries.
Anyone who is intimately familiar with reference will recognize that is essential it is for publishers to assign DOIs that enable durable linking at the article/entry level. This is the only way to facilitate cross-searching and/or discovery in ways that support successful reference research.
For those less reference-minded, this is because the title tells very little about what lies within a reference work. Full-text search does not overcome the problem because it does not interpret context precisely enough to give a researcher what they need at the top of the results list—too much is missed.
For most reference publishers, assigning backfile DOIs at the article or entry level will involve hundreds of thousands of assignments. To the best of my knowledge, the DOI pricing structure is oriented to title-level DOIs. So, the pricing structure breaks down when we get to this volume.
For e-reference materials to function well, publishers should quickly commit to making DOI assignments. However, the business model will need to help them get there. Few publishers facing declining usage will be able to rationalize a significant infrastructure. Would it be simple enough to find a middle-ground with one pricing tier at the title level and fractional pricing for DOIs assigned to smaller chunks?
Posted by: Alix Vance | February 15, 2010 12:34 AM
@eric
Basically, I agree that getting undue credit just because you have sort-of-the-same content identified by separate DOIs could be a problem. I say "could", because this depends on the context, doesn't it? Listing the several major editions of the monograph you have authored might be a good thing whereas listing every instance in which your dataset has changed might not. In both cases, for citation purposes you would want to assign separate DOIs- but whether you list them separately in your publication list depends on the situation.
I also think you might be presuming that the ORCID system would depend exclusively on the fully automatic assignment of publications to authors via DOIs. The current vision of the ORCID system is to combine externally-asserted claims (which might be automated or not) and self-asserted claims and to clearly distinguish between the types of assertions. This will allow third parties (ORCID has no intention of getting into the metrics business. Neither does CrossRef.) to make judgements about what to count and how.
And so we get back to this issue of "judgement." The publisher is in a fairly good position to make a judgment as to whether the difference between A & B is likely to make a difference in the crediting or interpretation of the content (important for citation) However, publishers are not necessarily in a good position to judge how (or even whether) the difference between A & B is important in the context of promotion, grant funding, etc.
So finally, do we need "typed DOIs" to make CrossRef DOIs more tractable for linked data applications? I certainly think we need something, but I'm not sure yet whether it is "typing"- at least not in FRBRish way. But perhaps that is not what you meant?
Posted by: Geoffrey Bilder | February 15, 2010 5:49 AM
@alix
Great to see you here. We've actually recently published new guidelines for assigning DOIs to books including reference works:
http://www.crossref.org/06members/best_practices_for_books.html
And we've also just revised our fee structure in order to encourage more granular assignment of DOIs within books.
http://www.crossref.org/02publishers/20pub_fees.html#_2010_deposit
In this latter case, we were particularly interested in making sure what we did would work for reference work publishers (and we had quite a few of those on the working group).
So we are very much focused on increasing the use of DOIs in books, If these revised fees and guidelines don't address your concerns, then we'd be interested in talking to you directly.
Posted by: Geoffrey Bilder | February 15, 2010 6:00 AM
@peter
Good point, but aside from wording, I actually think the issues around fees would be largely the same. When content changes to such a degree that it could effect the crediting or interpretation of the work, there is also generally a way to provide distinguishing metadata that goes along with it.
Posted by: Geoffrey Bilder | February 15, 2010 6:28 AM
@geoff
I do think we need to separate out a) the issue of "type" of DOI (i.e. is it an Information Reource, or not), from that of b) when to assign a new DOI based on change in content. As you have suggested the second issue b) must generally be left to the publisher to determine, but the first issue a) is something that really needs to be defined at a higher level in order to provide a common semantics.
(Btw, I don't really like the word "type". Seems a bit weak when we are really considering the difference between "Abstract" and "Concrete". But if it serves then it'll do.)
Posted by: Tony Hammond | February 15, 2010 8:10 AM