« December 2006 | Main | February 2007 »

January 30, 2007

Digital Objects

A couple weeks back there was a meeting of the Open Archive Initiative's Object Reuse and Exchange (OAI-ORE) Technical Committee hosted in the Butler Library at Columbia University, New York.

DSC00027.JPG

Lorcan Dempsey of OCLC blogs here on the report (PDF format) that was generated from that meeting. As does Pete Johnston of Eduserv here.

Background:

OAI-ORE is being positioned as a companion activity to the more familiar OAI-PMH protocol for metadata harvesting. OAI-ORE relates to the expression and exchange of digital objects across repositories rather than just the exchange of metadata about those objects.

The basic problem is that scholarly communication deals in units which are compound resulting from a complex of documents and/or datasets expressed in multiple formats, versions, relationships, etc. The underlying web architecture provides a fairly simple model of resources (identified with URIs) which are interconnected and can be interacted with by retrieving representations of those resources. In practice, this usually results in unique URIs (and thus resources) for each representation - think of one URI for an HTML document, another for a PDF document of the same work, and yet new URIs for those same document formats for a new version of the work. Clearly, all these representations (or documents) are related, and more importantly relate to a single underlying "work". Web architecture as generally practiced does not provide ready mechanisms to aggregate (and compartmentalize) related documents and datasets.

My fairly simple mental picture is that the web landscape is rather like the early universe in which energy (and matter) is distributed uniformly and there is little local "intelligence" which is gradually built up through time by matter formation and aggregations of this matter leading to the more familiar "clumpy" universe with its recognizable galaxies, stars and other objects. This "clumpiness" is precisely what we are missing in the scholarly web.

January 29, 2007

Jon Udell and DOIs

Not to get too self-referential here, but it was very cool to see that Tony Hammond has managed to get Jon Udell interested in DOIs. This based on a podcast interview with Tony posted on January 26th.

An Open PDF?

Adobe announces today the following:

"SAN JOSE, Calif. — Jan. 29, 2007 — Adobe Systems Incorporated (Nasdaq:ADBE) today announced that it intends to release the full Portable Document Format (PDF) 1.7 specification to AIIM, the Enterprise Content Management Association, for the purpose of publication by the International Organization for Standardization (ISO)."

The full press release is here.

(Via Oleg Tkachenko's Blog.)

January 25, 2007

W3C Recs for XML - Eight of 'Em!

Although most folks will already know about this it still seems significant enough to blog the arrival of XQuery 1.0, XSLT 2.0, and XPath 2.0. See the W3C Press Release.

January 23, 2007

Use of PRISM in RSS

Was rooting around for some information and stumbled across this page which may be of interest:

http://googlereader.blogspot.com/2006/08/namespaced-extensions-in-feeds.html
Namespaced Extensions in Feeds
Thursday, August 03, 2006
posted by Mihai Parparita

“I wrote a small MapReduce program to go over our BigTable and get the top 50 namespaces based on the number of feeds that use them.”

% of FeedsNamespaceURI
29.36%Dublin Corehttp://purl.org/dc/elements/1.1/
0.21%PRISMhttp://prismstandard.org/namespaces/1.2/basic/

Seems quite an impressive percentage for PRISM.

January 08, 2007

What's in a URI?

First off, a Happy New Year to all!

A post of mine to the OpenURL list may possibly be of interest. Following up the recent W3C TAG (Technical Architecture Group) Finding on "The Use of Metadata in URIs" I pointed out that the TAG do not seem to be aware of OpenURL: which is both a standard prescription for including metadata in URI strings and a US information standard to boot.