DataCite Summer Meeting Report
DataCite held its first Summer Meeting in Hannover, Germany from June 7-8 and it was very successful. DataCite is a new international consortium of national libraries and data centers set up to "establish easier access to scientific research data on the Internet, increase acceptance of research data as legitimate, citable contributions to the scientific record, and to support data archiving that will permit results to be verified and re-purposed for future study."
In addition, DataCite is DOI Registration Agency and in the words of Adam Farquhar of the British Library and President of DataCite - "Datacite is to data centres and datasets as CrossRef is to publishers and research articles." It's important to note that CrossRef covers a lot more than just research articles (books, conference proceedings, technical reports, standards, datasets). DataCite and CrossRef are serving different communities so scholarly publishers who are members of CrossRef will still register datasets with CrossRef.
It's great to see DataCite getting going and using CrossRef as a model for its development. There are a wide range of issues confronting data centers and repositories that host raw scientific data and datasets. These organizations are developing best practices and trying to get datasets to be treated in a similar way to research articles so that researchers get credit for them, they are discoverable, they are citeable and that they interlink with peer-reviewed scholarly articles.
There many types of repositories and they vary greatly by subject discipline. From the scholarly journal publishing perspective it's interesting to see the repositories starting to act more like traditional publishers by considering things like editorial and peer review, versioning, assigning unique IDs, creating authoritative metadata and agreeing citation formats.
Some of the projects or organizations giving presentations at the meeting were DataONE which is working with Dryad, DataVerse from Harvard University and PANGAEA. PANGAEA and Elsevier have a project that shows interlinking between articles and datasets. A PANGAEA dataset citation, which includes the published article, is:
Delaney, ML (1989): Mg/Ca and Sr/Ca in calcites and estimated distribution coefficients in marine sediments (Table 2). doi:10.1594/PANGAEA.723370,
Supplement to: Delaney, Margaret Lois (1989): Temporal changes in interstitial water chemistry and calcite recrystallization in marine sediments. Earth and Planetary Science Letters, 95(1-2), 23-37, doi:10.1016/0012-821X(89)90165-9
If you look at the dataset record in PANGAEA there is a CrossRef DOI link to the article and on ScienceDirect in the lower right there is a "Supplementary Data" DataCite DOI link to PANGAEA.
CrossRef will be working with DataCite on a general, scaleable solution to interlinking CrossRef member content and DataCite participants' content.