To search content in this manual only, enter your query above. To search for content in the entire CyVerse wiki, enter your query at the top right.
__________________

DATA COMMONS USER MANUAL
 

 

 

 

 

 

Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 7 Next »

The Data Commons publishes data to our own repository at  datacommons.cyverse.org as well as external repositories. All data published to Data Commons Curated Data receive a permanent identifier (PID) in the form of a DOI (Digital Object Identifier) or ARK (Archival Resource Key) and are expected to be stable and permanent. Data published to the Cyverse Community Contributed folder do not have PIDs, and may be changed or  removed at any time. The sections below provide more information on each type of data publication available through CyVerse. For more details on the range of data sharing options in CyVerse, see the CyVerse Data Policy.

Publishing Data Commons Curated Data

Data publication to Data Commons Curated Data a service offered for datasets that are intended to be stable and permanent. For Curated Data, the Data Commons provides landing pages, permanent DOIs or ARKs and the requirement to include an open data license. Permanent identifiers allow data to have a stable location on the web so that other users can always find them, along with the information that makes them understandable, citable, and reusable.  An open data license is important to allow others to reuse your data, but it does not exclude users from the obligation to correctly cite your data.

For more information about whether or not a Data Commons Curated Data is right for your dataset, see the Permanent Identifier FAQs page and the Data Commons Policy.

When you are ready to publish, see Requesting a Permanent Identifier in the Data Commons.

Publishing CyVerse Community Contributed Data

Community Contributed Data folders are available for evolving datasets that individuals or communities want to make available as quickly as possible for research and reuse. Community Contributed Data are intended for datasets that are growing or changing frequently or that may not need long-term preservation. Data can transition from Community Contributed Data to Data Commons Curated Data by requesting a DOI or ARK.

For more information on the policies that apply to Community Contributed Data, see the Data Commons Policy.

To request a public folder in Community Contributed Data, please use this form.

Publishing to external repositories

SRA pipeline: Data Commons enables CyVerse users to make submissions to the NCBI Sequence Read Archive directly. Submissions instructions include compressed sequenced files (FASTQ.gz, SFF.gz, and BAM.gz) and an XML metadata file, organized into a submission package.

WGS pipeline and TSA: Coming soon 

  • No labels