<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
   <channel>
      <title>CrossTech</title>
      <link>http://www.crossref.org/CrossTech/</link>
      <description></description>
      <language>en</language>
      <copyright>Copyright 2009</copyright>
      <lastBuildDate>Wed, 10 Jun 2009 03:31:40 -0500</lastBuildDate>
      <generator>http://www.sixapart.com/movabletype/?v=3.2</generator>
      <docs>http://blogs.law.harvard.edu/tech/rss</docs> 

            <item>
         <title>XMP Primer</title>
         <description><![CDATA[<p><a href="http://www.idealliance.org/filefolder/XMPPrimer.pdf" border="0"><img alt="xmp-primer.jpg" src="http://www.crossref.org/CrossTech/images/xmp-primer.jpg" width="158" height="204" style="float:left;margin:0 15px 0 0;"/></a></p>

<p>There's a new <a href="http://www.idealliance.org/filefolder/XMPPrimer.pdf">XMP Primer</a> (PDF) by Ron Roskiewicz (ed. Dianne Kennedy) available from <a href="http://www.xmp-open.org">XMP-Open</a>. This is copyrighted 2008 but I only just saw this now. This is a 43 page document which provides a very gentle introduction to metadata and labelling of media and then introduces XMP into the content lifecycle and talks to the business case for using XMP. The primer covers the following areas:<ul><li>Introduction to Metadata<br />
<li>Introduction to XMP<br />
<li>XMP and the Content Lifecycle<br />
<li>XMP in Action; Use Cases<br />
<li>Additional XMP Resources<br />
</ul></p>

<p>One small gripe would be that this seems to have been prepared for US letter-sized pages and although is printable on A4 there is the slightest of clippings on the right-hand margin with no real loss of information but it does confer a sense of "incompleteness". Really there can be little excuse these days for this parochialism. Also, for a document talking up the benefits of using XMP, it's decidedly odd that it doesn't make use of XMP itself - or rather there is a default XMP packet in the PDF with no real useful properties such as title, author, or date. Could have been a nice little object lesson in using XMP.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/06/xmp_primer.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/06/xmp_primer.html</guid>
         <category>XMP</category>
         <pubDate>Wed, 10 Jun 2009 03:31:40 -0500</pubDate>
      </item>
            <item>
         <title>Aligning OpenSearch and SRU</title>
         <description><![CDATA[<p>[<b>Update - 2009.06.07:</b> As pointed out by Todd Carpenter of NISO (see comments below) the phrase "<i>SRU by contrast is an initiative to update Z39.50 for the Web</i>" is inaccurate. I should have said "<i>By contrast SRU is an initiative recognized by ZING (Z39.50 International Next Generation) to bring Z39.50 functionality into the mainstream Web</i>".]</p>

<p>[<b>Update - 2009.06.08:</b> Bizarrely I find in mentioning query languages below that I omitted to mention SQL. I don't know what that means. Probably just that there's no Web-based API. And that again it's tied to a particular technology - RDBMS.]</p>

<p><a href="http://www.crossref.org/CrossTech/images/queryType.png"><img alt="queryType.png" src="http://www.crossref.org/CrossTech/images/queryType.png" width="379" height="261" border="0" /></a><br />
(Click image to enlarge.)</p>

<p>There are two well-known public search APIs for generic Web-based search: <a href="http://www.opensearch.org/">OpenSearch</a> and <a href="http://www.loc.gov/standards/sru/">SRU</a>. (Note that the key term here is "generic", so neither <a href="http://lucene.apache.org/solr/">Solr</a>/<a href="http://lucene.apache.org/">Lucene</a> nor <a href="http://www.w3.org/XML/Query/">XQuery</a> really qualify for that slot. Also, I am concentrating here on "classic" query languages rather than on semantic query languages such as <a href="http://www.w3.org/2009/sparql/wiki/Main_Page">SPARQL</a>.)</p>

<p>OpenSearch was created by Amazon's <a href="http://a9.com/">A9.com</a> and is a cheap and cheerful means to interface to a search service by declaring a template URL and returning a structured XML format. It therefore allows for structured result sets while placing no constraints on the query string. As outlined in my earlier post <a href="http://www.crossref.org/CrossTech/2009/05/search_web_service.html">Search Web Service</a>, there is support for search operation control parameters (pagination, encoding, etc.), but no inroads are made into the query string itself which is regarded as opaque.</p>

<p>SRU by contrast is an initiative to update <a href="http://www.loc.gov/z3950/agency/">Z39.50</a> for the Web and is firmly focussed on structured queries and responses. Specifically a query can be expressed in the high-level query language <a href="http://www.loc.gov/standards/sru/specs/cql.html">CQL</a> which is independent of any underlying implementation. Result records are returned using any declared W3C XML Schema format and are transported within a defined XML wrapper format for SRU. (Note that the <a href="http://www.oasis-open.org/committees/documents.php?wg_abbrev=search-ws">SRU 2.0 draft</a> provides support for arbitrary result formats based on media type.)</p>

<p>One can summarize the respective OpenSearch and SRU functionalities as in this table:</p>

<table border="1" width="50%">
<tr>
<th width="33%" align="left">Structure</th>
<th width="33%" align="center">OpenSearch</th>
<th width="33%" align="center">SRU</th>
</tr>
<tr>
<td>query</td>
<td align="center">no</td>
<td align="center">yes</td>
</tr>
<tr>
<td>results</td>
<td align="center">yes</td>
<td align="center">yes</td>
</tr>
<tr>
<td>control</td>
<td align="center">yes</td>
<td align="center">yes</td>
</tr>
<tr>
<td>diagnostics</td>
<td align="center">no</td>
<td align="center">yes</td>
</tr>
</table>

<p>What I wanted to discuss here was the OpenSearch and SRU interfaces to a <a href="http://www.crossref.org/CrossTech/2009/05/search_web_service.html">Search Web Service</a> such as outlined in my previous post. The <a href="http://www.crossref.org/CrossTech/images/queryType.png">diagram</a> at top of this post shows query forms for OpenSearch and SRU and associated result types. The Search Web Service is taken to be exposing an SRU interface. It might be simplest to walk through each of the cases.</p>

<p>(Continues below.) </p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/06/aligning_opensearch_and_sru.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/06/aligning_opensearch_and_sru.html</guid>
         <category>Search</category>
         <pubDate>Fri, 05 Jun 2009 03:44:28 -0500</pubDate>
      </item>
            <item>
         <title>Search Web Service</title>
         <description><![CDATA[<p><a href="http://www.crossref.org/CrossTech/images/search-web-service.png"><img alt="search-web-service.png" src="http://www.crossref.org/CrossTech/images/search-web-service.png" width="405" height="303" border="0" /></a><br />
(Click image to enlarge graphic.)</p>

<p>While the <a href="http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=search-ws">OASIS Search Web Services TC</a> is currently working towards reconciling SRU and OpenSearch, I thought it would be useful to share here a simple graphic outlining how a search web service for structured search might be architected.</p>

<p>Basically there are two views of this search web service (described in separate XML description files and discoverable through autodiscovery links added to HTML pages):<ul><li><a href="http://www.opensearch.org/">OpenSearch</a><li><a href="http://www.loc.gov/standards/sru/">SRU (Search and Retrieve by URL)</a></ul>One can see at a glance that there's more happening down in the SRU layer. The SRU layer implements a heavyweight, robust service which provides a detailed listing of search indexes and index relations in the description document ('SRU Explain'), is searchable using a standard query grammar - <a href="http://www.loc.gov/standards/sru/specs/cql.html">CQL</a> ('Contextual Query Language'),  responds with result sets inside a standard XML wrapper and expressed as an XML record set (e.g. PAM) that is validatable using W3C XML Schema, and makes available a full roster of diagnostics.</p>

<p>By contrast the OpenSearch layer provides a lightweight view onto the search web service in which a simple opaque query string is sent to the server and a simple XML result set returned (usually RSS or Atom). Again a description document is made available ('OpenSearch Description') but this is much more coarse grained than the SRU description - e.g. it does not specify query components such as indexes or relations.</p>

<p>In practice, both views can be provided for by the same search web service. While OpenSearch does not specify any structured query it can make use of a CQL packaged query. That is, a single parameter value for the OpenSearch 'query' parameter can be unpacked by a CQL parser to yield a complex search query. The search query does not need to be splattered all over the URL querystring which is already using its parameter set to provide control information for the search (e.g. pagination, encoding and the like). </p>

<p>And how would this relate to existing platform-hosted search services? Well, such services are usually  bound to the host platform and are not intended to support remote applications. A search web service, on the other hand, would be ideally suited to offering direct support for running structured searches on platform-hosted content using off-platform apps.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/05/search_web_service.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/05/search_web_service.html</guid>
         <category>Search</category>
         <pubDate>Sat, 30 May 2009 09:51:23 -0500</pubDate>
      </item>
            <item>
         <title>Structured Search Using PRISM Elements</title>
         <description><![CDATA[<p>We just registered in the <a href="http://www.loc.gov/standards/sru/">SRU</a> (Search and Retrieve by URL) search registry the following components:<dl><dt><b><i>Context Sets</i></b><dd><ul><li><a href="http://www.loc.gov/standards/sru/resources/prism-context-set-v2-0.html">PRISM Context Set version 2.0</a><br />
<li><a href="http://www.loc.gov/standards/sru/resources/prism-context-set-v2-1.html">PRISM Context Set version 2.1</a></ul><dt><b><i>Schemas</i></b><dd><ul><li><a href="http://www.prismstandard.org/schemas/pam/2.0/pam.xsd">PRISM Aggregator Message Record Schema Version 2.0</a><br />
<li><a href="http://www.prismstandard.org/schemas/pam/2.1/pam.xsd">PRISM Aggregator Message Record Schema Version 2.1</a></ul></dl>This means that an <a href="http://www.loc.gov/standards/sru/">SRU</a> (Search and Retrieve by URL)  search engine that supported one of the PRISM context sets registered above could accept <a href="http://www.loc.gov/standards/sru/specs/cql.html">CQL</a> (Contextual Query Language) queries such as the following:<br />
<ol><li><tt>prism.doi = "10.1038/nature05398"</tt><br />
<li><tt>prism.publicationName = "Nature" and prism.volume = "444" and prism.number = "7119" and prism.startingPage = "E9"</tt><br />
<li><tt>dc.identifier = "doi:10.1038/nature05398"</tt><br />
<li><tt>dc.creator = "Jones-Smith" and prism.publicationName = "Nature" and prism.publicationDate > "2006-01-01"</tt><br />
<li><tt>dc.title any "fractal pollock" and prism.publicationName = "Nature" sortBy prism.publicationDate/sort.descending</tt><br />
<li><tt>"fractal anlysis" and prism.publicationDate within "2005-01-01 2008-12-31" sortBy dc.creator/sort.ascending</tt></ol>(Note that the quotes are only needed above for the DOI strings which contain a "/" character. Otherwise they are optional in the above examples.)<br />
<p/>Any query such as one of the above (here #1) could be sent to the server on a querystring like so:<br />
<blockquote><tt>?version=1.1&operation=searchRetrieve&query=prism.doi=%2210.1038/nature05398%22</tt></blockquote>and if the server were also equipped to respond with <a href="http://www.idealliance.org/industry_resources/intelligent_content_informed_workflow/about_the_prism_aggregator_message">PAM</a> (PRISM Aggregator Message)  format for result records, a response might look like this:<br />
<p/><img alt="fractal-analysis-pam.jpg" src="http://www.crossref.org/CrossTech/images/fractal-analysis-pam.jpg" width="686" height="450" /><br />
PAM was discussed <a href="http://www.crossref.org/CrossTech/2009/05/post_2.html">here</a> earlier.<br />
<p/><br />
Such a structured response would provide the metadata elements for applications to build various interfaces into the original article:<br />
<img alt="fractal-analysis.jpg" src="http://www.crossref.org/CrossTech/images/fractal-analysis.jpg" width="459" height="401" /><br />
We think that these PRISM components (context sets and schemas) will be useful for structured search of scholarly publications.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/05/structured_search_using_prism.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/05/structured_search_using_prism.html</guid>
         <category>Search</category>
         <pubDate>Sat, 30 May 2009 07:23:48 -0500</pubDate>
      </item>
            <item>
         <title>OAI-ORE: Workshop Slides </title>
         <description><![CDATA[<div style="width:425px;text-align:left" id="__ss_1465963"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" href="http://www.slideshare.net/hvdsomp/an-overview-of-the-oai-object-reuse-and-exchange-interoperability-framework?type=presentation" title="An Overview of the OAI Object Reuse and Exchange Interoperability Framework">An Overview of the OAI Object Reuse and Exchange Interoperability Framework</a><object style="margin:0px" width="425" height="355"><param name="movie" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=oreinforum2009hvds-090520125748-phpapp02&stripped_title=an-overview-of-the-oai-object-reuse-and-exchange-interoperability-framework" /><param name="allowFullScreen" value="true"/><param name="allowScriptAccess" value="always"/><embed src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=oreinforum2009hvds-090520125748-phpapp02&stripped_title=an-overview-of-the-oai-object-reuse-and-exchange-interoperability-framework" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="355"></embed></object><div style="font-size:11px;font-family:tahoma,arial;height:26px;padding-top:2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">Microsoft Word documents</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/hvdsomp">hvdsomp</a>.</div></div>

<p>This is a very slick presentation by Herbert Van de Sompel on <a href="http://www.openarchives.org/ore/">OAI-ORE</a> which he's due to give today for a <a href="http://www.inforum.cz/en/workshop/#12">workshop</a> at the <a href="http://www.inforum.cz/en/programme/">INFORUM 2009 15th Conference on Prrofessional Information Resources</a> in Prague. It's on the long side at 167 slides but even if you just flip though or sample it selectively you'll be bound to come away with something.</p>

<p>Describing aggregations of resources is a subject that really has to be of interest to CrossRef publishers.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/05/oaiore_workshop_slides.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/05/oaiore_workshop_slides.html</guid>
         <category>Interoperability</category>
         <pubDate>Tue, 26 May 2009 04:24:36 -0500</pubDate>
      </item>
            <item>
         <title>PRISM Aggregator Message</title>
         <description><![CDATA[<p>The new OAI-PMH interface to Nature.com sports one particular novelty which may well be of interest here: it makes use of the <a href="http://www.idealliance.org/industry_resources/intelligent_content_informed_workflow/about_the_prism_aggregator_message">PRISM Aggregator Message</a>. (For an announcement of this service see the post on our web publishing blog <a href="http://blogs.nature.com/wp/nascent/">Nascent</a>.) </p>

<p>As a protocol for the harvesting of metadata records within a digital repository, OAI-PMH records may be expressed in a variety of different metadata formats. For reasons of interoperability a base metadata format ('Dublin Core') is mandated for all OAI-PMH implementations. The expectation is that this base format would be augmented by community-specific vocabularies.</p>

<p>Our natural inclination was to mirror the article descriptions which we already circulate in our RSS feeds and within our HTML pages (as META tags) and PDF files (as XMP packets).  In these cases we have used open data models (e.g. RDF) with simple properties cherry-picked from the DC and PRISM namespaces. But OAI-PMH has a special 'gotcha' in this regard: any metadata format must allow for W3C XML Schema validation. That is, the properties need to be constrained by an XSD data model. Enter PRISM Aggregator Message (PAM).</p>

<p>(Continues)</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/05/post_2.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/05/post_2.html</guid>
         <category>Interoperability</category>
         <pubDate>Fri, 08 May 2009 03:37:38 -0500</pubDate>
      </item>
            <item>
         <title>CrossRef&apos;s OpenURL query interface</title>
         <description><![CDATA[<p>Over the past two weeks we've focused on our OpenURL query interface with the goal being to improve its reliability. I'd like to mention some things  we've done.</p>

<p>1) We now require an OpenURL account to use this interface (see <a href="http://www.crossref.org/requestaccount/"> the registration page</a>) . This account is still free, there are no fixed usage limits, and the terms of use have been greatly simplified.  </p>

<p>2) Resources have been re-arranged dedicating more horse-power to the OpenURL function.</p>

<p>3) The OpenURL function is now in our advanced monitoring function which means some lucky staff member will be getting phone calls at 3AM (me included!).</p>

<p>I should note that  #1 has already reduced inappropriate usage. This also is not the end of planned changes. CrossRef has undertaken a major rewrite of parts of our system and this will include the OpenURL interface.</p>

<p>Chuck</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/05/crossrefs_openurl_query_interf.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/05/crossrefs_openurl_query_interf.html</guid>
         <category></category>
         <pubDate>Wed, 06 May 2009 13:15:26 -0500</pubDate>
      </item>
            <item>
         <title>OCLC defines requirements for a &quot;Cooperative Identities Hub&quot;</title>
         <description><![CDATA[<p>OCLC has <a href="http://www.oclc.org/programs/publications/reports/2009-05.pdf">published a report</a> (PDF) identifying some requirements for what they call a "Cooperative Identities Hub". A quick glance through it seems to show that the use cases focus on what we are calling the "Knowledge Discovery" use cases. As I mentioned in my <a href="http://network.nature.com/people/mfenner/blog/2009/02/17/interview-with-geoffrey-bilder">interview with Martin Fenner</a>, there is also a category of "authentication" use cases that I think needs to be addressed by a contributor identifier system. Still, this is a good report that highlights many of the complexities that an identifier system needs to address. </p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/05/oclc_defines_requirements_for.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/05/oclc_defines_requirements_for.html</guid>
         <category>Author Identifiers</category>
         <pubDate>Fri, 01 May 2009 03:24:00 -0500</pubDate>
      </item>
            <item>
         <title>What do people want from an author identifier?</title>
         <description><![CDATA[<p><a href="http://network.nature.com/people/mfenner/profile">Martin Fenner</a> continues his interest in the subject of author identifiers. He recently <a href="http://network.nature.com/people/mfenner/blog/2009/04/13/a-few-questions-about-author-identifiers">posted an online poll</a> asking people some specific questions about how they would like to see an author identifier implemented.<sup>*</sup></p>

<p><a href="http://network.nature.com/people/mfenner/blog/2009/04/26/a-few-questions-about-author-identifiers-the-answers">The results of the poll</a> are in and, though the sample was very small, the results are interesting. The responses are both gratifying -there seems to be a general belief that CrossRef has a roll to play here- and perplexing -most think the identifier needs to identify other "contributors" to the scholarly communications process- yet there seems to be a preference for the moniker "digital author identifier". This latter preference is certainly a surprise to us as we had been focusing our efforts on identifying analog authors. The only "digital authors" I know of are <a href="http://pdos.csail.mit.edu/scigen/">this one at at MIT</a> and possibly <a href="http://news.bbc.co.uk/2/hi/science/nature/7979113.stm">this one at Aberystwyth University.</a> ;-)</p>

<p>Anyway, There are some <a href="http://friendfeed.com/e/bb174794-519c-02a7-8a00-9283013298d8/A-few-questions-about-author-identifiers-the/">additional reactions</a> to Martin's poll on FriendFeed.</p>

<p>Finally, I should have blogged about this earlier, but the March issue of <em>Science</em> included a summary of the initiatives and discussions surrounding the creation of an industry "author identifier" in  an article titled “Are You Ready to Become a Number” (<a href="http://dx.doi.org/10.1126/science.323.5922.1662">http://dx.doi.org/10.1126/science.323.5922.1662</a>).</p>

<p>In pointing people at this, I feel like I must make a clarification to the article. In short, I don't think any of our publisher members would "force" anybody  to use an author identifier whether it came from CrossRef or from anybody  else. Though it is likely that in the interview I used the terms "carrot" and "stick",  in truth publisher's would, instead of "a stick",  at most wield a <a href="http://en.wikipedia.org/wiki/Nerf">Nerf</a> bat. Having said that,  the essential point remains- even if  most major publishers *strongly* encouraged all of their authors to use the system, it  would take several years before the system had a critical mass of data. </p>

<p><br />
<sup>*</sup>Note that I deliberately didn't point CrossTech readers at this poll as it was being conducted because I thought doing so might introduce a CrossRef bias.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/04/what_do_people_want_from_an_au.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/04/what_do_people_want_from_an_au.html</guid>
         <category>Author Identifiers</category>
         <pubDate>Mon, 27 Apr 2009 03:24:38 -0500</pubDate>
      </item>
            <item>
         <title>Introductory Signals</title>
         <description><![CDATA[<p>So while doing some background reading today I realized that legal citations already widely support a form of "<a href="http://www.crossref.org/CrossTech/2009/03/citation_typing_ontology.html">citation typing</a>" in the form of "<a href="http://en.wikipedia.org/wiki/Citation_signal">Introductory Signals</a>". The 10 introductory signals break down as follows...</p>

<p>In support of an argument:<br />
&nbsp;&nbsp;&nbsp;1) [no signal]. (NB that, apparently, this is increasingly deprecated.) <br />
&nbsp;&nbsp;&nbsp;2) accord;<br />
&nbsp;&nbsp;&nbsp;3) see;<br />
&nbsp;&nbsp;&nbsp;4) see also;<br />
&nbsp;&nbsp;&nbsp;5) cf.;</p>

<p>For Comparisons:<br />
&nbsp;&nbsp;&nbsp;6) compare ... with ...;</p>

<p>For contradiction:<br />
&nbsp;&nbsp;&nbsp;7) but see;<br />
&nbsp;&nbsp;&nbsp;8) but cf.;</p>

<p>For background:<br />
&nbsp;&nbsp;&nbsp;9) see generally;</p>

<p>And for examples:<br />
&nbsp;&nbsp;&nbsp;10) e.g.</p>

<p>Clever lawyers.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/03/introductory_signals.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/03/introductory_signals.html</guid>
         <category></category>
         <pubDate>Mon, 23 Mar 2009 06:42:08 -0500</pubDate>
      </item>
            <item>
         <title>Citation Typing Ontology</title>
         <description><![CDATA[<p>I was happy to read David Shotton's recent <a href="http://www.ingentaconnect.com/content/alpsp/lp"><em>Learned Publishing</em></a> article, <a href="http://dx.doi.org/10.1087/2009202"><em>Semantic Publishing: The Coming Revolution in scientific journal publishing</em></a>, and see that he and his team have drafted a <a href="http://imageweb.zoo.ox.ac.uk/pub/2009/citobase/cito-20090311/cito-content/owldoc/">Citation Typing Ontology</a>.<sup>*</sup></p>

<p>Anybody who has seen me speak at conferences knows that I often like to proselytize about the concept of the "typed link", a notion that hypertext pioneer, <a href="http://www.workpractice.com/trigg/">Randy Trigg</a>, discussed <a href="http://www.workpractice.com/trigg/thesis-chap4.html">extensively<a> in his 1983 <a href="http://www.workpractice.com/trigg/thesis-default.html">Ph.D. thesis.</a>.  Basically, Trigg points out something that should be fairly obvious- a citation (i.e. "a link") is not <em>always</em> a "vote" in favor of the thing being cited.</p>

<p>In fact, there are all sorts of reasons that an author might want to cite something. They might be elaborating on the item cited, they might be critiquing the item cited, they might even be trying to refute the item cited (For an exhaustive and entertaining survey of the use and abuse of citations in the humanities, <a href="http://en.wikipedia.org/wiki/Anthony_Grafton">Anthony Grafton</a>'s, <a href="http://www.amazon.com/Footnote-Curious-History-Anthony-Grafton/dp/0571196012/ref=sr_1_2?ie=UTF8&s=books&qid=1237549279&sr=1-2">The Footnote: A Curious History</a>, is a rich source of examples)</p>

<p>Unfortunately, the naive assumption that a citation is tantamount to a vote of confidence has become inshrined in everything from the way in which we <a href="http://www.thomsonreuters.com/business_units/scientific/free/essays/impactfactor/">measure scholarly reputation</a>, to the way in which we <a href="http://www.hefce.ac.uk/Research/ref/">fund universities</a> and the way in which search engines rank their results. The distorting affect of this assumption is profound. If nothing else, it leads to a perverse situation in which people will often discuss books, articles, and blog postings  that they disagree with without actually citing the relevant content, just so that they can avoid inadvertently conferring "<a href="http://en.wikipedia.org/wiki/Whuffie">wuffie</a>" on the item being discussed. This can't be right.</p>

<p>Having said that, there has been a half-hearted attempt to introduce a gross level of link typology with the introduction of the <a href="http://en.wikipedia.org/wiki/Nofollow">"nofollow" link attribute</a>- an initiative started by Google  in order to try to address the increasing problem of <a href="http://en.wikipedia.org/wiki/Spamdexing">"Spamdexing"</a>. But this is a pretty ham-fisted form of link typing- particularly in the way it is implemented by the Wikipedia where CrossRef DOI links to formally published scholarly literature have a "nofollow" attribute attached to them but, inexplicably, items with a <a href="http://en.wikipedia.org/wiki/PMID">PMID</a> are not so hobbled (view the HTML source of <a href="http://en.wikipedia.org/wiki/Prion">this page</a>, for example). Essentially, this means that, the Wikipedia is a black-hole of reputation. That is, it absorbs reputation (through links too the Wikipedia), but it doesn't let reputation back out again. Hell, I feel dirty for even linking to it here ;-).</p>

<p>Anyway, scholarly publishers should certainly read Shotton's article because it is full of good, and practical ideas about what can can be done with today's technology in order to help us move beyond the "digital incunabula" that the industry is currently churning out. The <a href="http://imageweb.zoo.ox.ac.uk/pub/2008/plospaper/latest/">sample semantic article</a> that Shotton's team created is inspirational and I particularly encourage people to look at <a href="http://imageweb.zoo.ox.ac.uk/pub/2008/plospaper/latest/machine/citationinfo.n3">the source file for the ontology-enhanced bibliography</a> which reveals just how much more useful metadata can be associated with the humble citation.</p>

<p>And now I wonder whether <a href="http://www.citeulike.org/">CiteULike</a>, <a href="http://www.connotea.org/">Connotea</a>, <a href="http://www.2collab.com/nonLoggedInHomePage;jsessionid=CC0849D76677D585AE1DC3B3139B32A1">2Collab</a> or <a href="http://www.zotero.org/">Zotero</a> will consider adding support for the CItation Typing Ontology into their respective services?</p>

<p><br />
* Disclosure:</p>

<p>a) I am on the editorial board of <em>Learned Publishing</em><br />
b) CrossRef has consulted with David Shotton on the subject of semantically enhancing journal articles </p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/03/citation_typing_ontology.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/03/citation_typing_ontology.html</guid>
         <category></category>
         <pubDate>Fri, 20 Mar 2009 06:30:36 -0500</pubDate>
      </item>
            <item>
         <title>Researcher Identification Primer</title>
         <description><![CDATA[<p><a href="http://friendfeed.com/e/c1fd00ec-15f9-d894-4ea9-4ffeaac5ae28/A-specialist-OpenID-service-to-provide-unique">Discussions around "contributor Ids"</a> (aka "Author ID, Researcher ID, etc.) seem to be becoming quite popular. In <a href="http://network.nature.com/people/mfenner/blog/2009/02/17/interview-with-geoffrey-bilder">the interview</a> that I pointed to in my <a href="http://www.crossref.org/CrossTech/2009/02/an_interview_about_author_ids.html">last post</a>, I mentioned that CrossRef has been talking with a group of researchers who were very interested in creating some sort of authenticated contributor ID as a mechanism for controlling who gets trusted access to sensitive genome-wide aggregate genotype data.</p>

<p>Well, I'm delighted to say that said group of researchers(at the <a href="http://www.gen2phen.org/">GEN2PHEN</a> project) have created a "<a href="http://www.gen2phen.org/researcher-identification/researcher-identification-primer">Researcher Identification Primer</a>" website in which they outline the many use-cases and issues around creating a mechanism for unambiguously identifying and/or authenticating researchers. This looks like a great resource and I expect it will serve as a useful focus for further discussion around the issue. </p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/03/researcher_identification_prim.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/03/researcher_identification_prim.html</guid>
         <category></category>
         <pubDate>Wed, 11 Mar 2009 07:24:16 -0500</pubDate>
      </item>
            <item>
         <title>An interview about &quot;Author IDs&quot;</title>
         <description><![CDATA[<p>Over the past few months there seems to have been a <a href="http://friendfeed.com/e/c1fd00ec-15f9-d894-4ea9-4ffeaac5ae28/A-specialist-OpenID-service-to-provide-unique">sharp upturn in general interest</a> around implementing an "author identifier" system for the scholarly community. This, in turn, has meant that more people have been getting in touch with us about our nascent "Contributor ID" project. The other day, after seeing my comments in the above thread,  <a href="http://network.nature.com/people/mfenner/profile">Martin Fenner</a>  asked if he could interview me about the issue of author identifiers for <a href="http://network.nature.com/people/mfenner/blog">his blog on Nature Networks, Gobbledygook</a>. I agreed and he <a href="http://network.nature.com/people/mfenner/blog/2009/02/17/interview-with-geoffrey-bilder">posted the interview</a> the other day.</p>

<p>I warn you ahead of time, I did ramble on a bit and the interview is long. There is a lot of stuff at the beginning about the DOI and it might seem off-topic, but I do think that there is a lot that we can learn from our DOI experiences which would apply to any author identifier. Just be thankful I didn't start talking about the privacy issues that will inevitably arise from any author identifier system. If I had, the interview would have probably gone on for another six pages ;-).</p>

<p>Anyway, as most of our membership knows, we have a pilot project underway to explore what it would take to launch a "CrossRef Contributor ID" system. We still haven't concluded whether it makes sense for us to do it, but one thing is clear from the recent discussions we've had and that is that, if we don't do it, somebody else almost certainly will.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/02/an_interview_about_author_ids.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/02/an_interview_about_author_ids.html</guid>
         <category>Identifiers</category>
         <pubDate>Thu, 19 Feb 2009 10:39:05 -0500</pubDate>
      </item>
            <item>
         <title>Real PRISM in the RSS Wilds</title>
         <description><![CDATA[<p>Alf Eaton just <a href="http://hublog.hubmed.org/archives/001818.html">posted</a> a real nice analysis of <a href="http://www.tictocs.ac.uk/">ticTOCs</a> RSS feeds. Good to see that almost half of the feeds (46%) are now in RDF and that fully a third (34%) are using PRISM metadata to disclose bibliographic fields. </p>

<p>The one downside from a CrossRef point of view is that these feeds are still using the old PRISM version (1.2) and not the new version (2.0) which was released a year ago and blogged  <a href="http://www.crossref.org/CrossTech/2008/02/prismdoi.html">here</a>. That version supports the elements <b>prism:doi</b> for the bare DOI, as well as <b>prism:url</b> for the DOI proxy server URL.</p>

<p>There are still some improvements to be made in serving up these feeds (as Alf's analysis shows for content type), but overall things are looking pretty good. :)<br />
</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/02/real_prism_in_the_rss_wilds.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/02/real_prism_in_the_rss_wilds.html</guid>
         <category>RSS</category>
         <pubDate>Thu, 19 Feb 2009 08:48:05 -0500</pubDate>
      </item>
            <item>
         <title>DOIs in an iPhone application</title>
         <description><![CDATA[<p>Very cool to see Alexander Griekspoor  releasing an iPhone version of his award-winning Papers application.  A while ago Alex intigrated DOI metadata lookup into the Mac version of papers and now I can  get a silly thrill from seeing CrossRef DOIs integrated in an iPhone app. Alex has just posted <a href="http://mekentosj.com/papers/iphone/">a preview video of the iPhone application</a> and it includes a cameo appearance by a DOI. Yay.</p>]]></description>
         <link>http://www.crossref.org/CrossTech/2009/02/dois_in_an_iphone_application.html</link>
         <guid>http://www.crossref.org/CrossTech/2009/02/dois_in_an_iphone_application.html</guid>
         <category>Linking</category>
         <pubDate>Thu, 12 Feb 2009 09:25:05 -0500</pubDate>
      </item>
      
   </channel>
</rss>
