« October 2008 | Main | December 2008 »

November 24, 2008

RSS Good Practice Guidelines

I just wanted to flag up here Lisa Rogers' recent review article on RSS in FUMSI (the online magazine for information professionals published by Free Pint Ltd)

RSS and Scholarly Journal Tables of Contents: the ticTOCs Project, and Good Practice Guidelines for Publishers

Especially of interest is the diagram in Fig. 2 which breaks out the metadata elements that might be encountered in a rich web feed. Worthwhile pointing out that this reflects current practice and that under the item elements one would soon hope to see publishers routinely adding in prism:doi (with the bare DOI as value) and prism:url (with DOI target URL as value) from the PRISM 2.0 vocabulary published earlier this year. Publishers should also be aware of the new PRISM Usage Rights vocabulary which is expected to be published some time in the new year.

November 19, 2008

Machine Readable: Are We There Yet?

The guidelines for CrossRef publishers ("DOI Name Information and Guidelines" - PDF, 210K) has this to say in "Sect. 6.3 The response page" regarding the response page for a DOI:

"A minimal response page must contain a full bibliographic citation displayed to the user. A response page without bibliographic information should never be presented to a user."
which would seem to be all fine and dandy. But if that user is a machine (or an agent acting for a user) they'll likely be out of luck as the metadata in the bibliographic citation is generally targeted at human users.

So here's a quick and dirty implementation of what a machine readable page could look like using RDFa. (The demo uses Jeni Tennison's wonderful rdfQuery plugin which I blogged about earlier.)

Clicking the DOI link below will bring up in a sub-window a bibliographic citation which might be found in a typical DOI repsonse page. If you now click the "Read Me" link you should see an alert message which presents the bibliographic metadata as a complete RDF document (in a simple N3 – or Notation3 – format). This document is assembled on the fly by rdfQuery using the RDFa markup embedded in the page.

doi:10.1038/nature05634 (Click for demo)

See the "View Source" link to list the actual XHTML markup and the RDFa properties which have been added. And note also that some of the properties are partially "hidden" to the human reader, e.g. a publication date is given in year form only whereas the machine record has the date in full, and some of the properties are fully "hidden": print and electronic ISSNs, issue number, ending page, etc.

(Continues below.)

So, what's new about this? There are already various means of adding metadata to pages using e.g. metadata tags (see here for an earlier post on this), or COinS objects, or even RDF/XML in comment sections. All of these have their various utilities but are still just early attempts at automation. What makes this new and compelling is that RDFa allows publishers to embed machine readable metadata that can be read as a complete machine description in RDF using pretty much off-the-shelf tools and that this markup is embedded unobtrusively into the content in the proper context.

Note that there are some similarities here between embedding an XMP packet (which includes metadata) into an arbitrary binary object, e.g. a PDF file, and embedding RDF into a section of a web page – or perhaps "draping" the RDF over the document markup would be a better term – so that the metadata travels along with the actual content.

By the way, the RDFa can be processed to yield valid RDF (as is shown in the demo) and which can also be seen by running the web page through the RDFa Distiller. (You just need to cut and paste the link of the demo page given above into the Distiller form box.) This will produce RDF in various serializations (N3, XML, Triples) from the RDFa.

So, is there really any longer any reason not to have machine readable metadata at the end of the DOI? Are we there yet?

November 17, 2008

rdfQuery

Whaddya know? I was just on the point of blogging about the real nice demo given by Jeni Tennison at last week's SWIG UK meeting at HP Labs in Bristol of rdfQuery (an RDF plugin for jQuery - the zip file is here). And there today on her blog I see that she has a full writeup on rdfQuery, so I'll defer to the expert. :~)

All I can really add to that is that rdfQuery is a pretty darn cool way to add and manipulate RDFa using jQuery. Does it get any better?

And now that RDFa is a W3C Rec since last month (see Primer and Syntax) it will be interesting to see how CrossRef members might begin to deploy it on their pages - especially on DOI landing pages.