The Future of Data: Plans for a UC Davis workshop

I have been working with several of the faculty of UC Davis to prepare a workshop for our department to (1) help make our researchers aware of the new requirements for NSF proposals and journal submissions with regard to data archiving, (2) To explore what new opportunities this creates and what tools exist that will help our researchers take maximum advantage of it, and (3) to engage our research community in the discussion of future standards for sharing, standardizing, rewarding and funding the archiving of data.

I’m very pleased that we have an excellent panel of supportive faculty:

  • Don Strong, Editor-in-Chief of Ecology
  • Marcel Holyoak, Editor-in-Chief, Ecology Letters
  • Jonathan Eisen, Editor-in-Chief, PLoS Biology
  • Alan Hastings, founding Editor-in-Chief, Theoretical Ecology
  • Robert Hijmans, Asst Professor of Environmental Science & Policy and expert in software infrastructure for reproducible research.
  • Trisha Cruse Director of UC Curation Center at the University of California Digital Library

We are currently working out the format of the workshop.  The current draft consists of three 1.5hr sessions, progressing through the three main goals above in an interactive format.

1) Why Archive?

(Weds March 2nd 2:30-4p, LSA 1022)
  • New requirements from funders and publishers.
  • Recommended practice: What’s ideal, what’s acceptable, what isn’t?
  • Types/levels of archiving.  Why repositories?
  • Funding data repositories.
  • Access embargoes.  Citing data/getting credit.
  • Panel Q&A

2) Leveraging data archives: logistics

(Thurs March 3nd 2:30-4p, PES 3001)
  • The major repositories.
  • Discovering data.
  • Citing data.  Metadata, DOIs, formats, persistent data
  • Sharing/archiving code?
  • Linked data, ontologies & Semantic web.
  • Tools for reproducible research.
  • Panel Q&A

3) Looking ahead: new scale, new science?

(Fri March 4nd 2:30-4p, PES 3001)
  • Challenges and Opportunities (i.e. Science issue,Science re: Ecology)
  • Integrated databases, standards for data
  • Dryad “publication model” vs DataONE model, etc
  • Reproducible research
  • UC3’s “data publication” concept?
  • Future of credit: Will you be scooped or will you be famous?
  • What roll we can/should have in shaping future standards, expectations & requirements
  • Panel Q&A

Resources

See the recent discussion of these issues in Science, Nature, TREE, Am Nat, and elsewhere.   I have started assembling a list of resources in my December notes: _Thoughts on the New Policies for Data Archiving at NSF and in Common Journals. _In addition, there’s more detail available on the Dryad Wiki and from Dryad here:

Some great information also available from UC3:

In the next month or so we are going to roll out a rule based tool that will ask the researcher a series of questions based on the funding organization that are applying to and will generate a Data Management Plan.

  • Merritt Repository services,  launched in late summer 2010.

  • EZID Identifier services:   obtain and manage long-term identifiers (DOI, ARK) for their digital content.  Can assign identifiers to anything: scientific datasets, technical reports, audio files, digital photographs, and non-digital objects as well.  This spring the objects with DOIs  will appear in Web of Science. (Annual charge unlimited number of DOIs: $75/user, $375/group, or $1,125 for whole university)

  • Consultation services: Individual consultation, also more workshops  on data citation, data sharing, and data rights this Fall.

We have been actively thinking about data and building services that support the management, preservation, and publication of data.  We are just about to embark on a project that will create an Excel add-in that will make it easy to archive, share, and publish data embedded in Excel.  We are also actively involved in the DataONE project.  We recently presented a paper at the Beyond the PDF workshop at UCSD that outlines some of our ideas surrounding data publication.

Your Input?

What would you like to see?  Structure, organization, topics?

[iframe https://friendfeed.com/science-2-0/e68945dc/i-m-planning-workshop-at-ucdavis-on-future-of?embed=1 480 400]