The Data Commons publishes data to our own repository at datacommons.cyverse.org as well as external repositories. All data published to Data Commons Curated Data receive a permanent identifier (PID) in the form of a DOI (Digital Object Identifier) or ARK (Archival Resource Key) and are expected to be stable and permanent. Data published to the Cyverse Community Contributed folder do not have PIDs, and may change or be removed at any time. The sections below provide more information on each kind of data publication available through CyVerse. For more details on the range of data sharing options in CyVerse, see the CyVerse Data Policy.
Publishing Data Commons Curated Data
Data publication to Data Commons Curated Data a service offered for datasets that are intended to be stable and permanent. For Curated Data, the Data Commons provides landing pages, permanent DOIs or ARKs and the requirement to include an open data license. Permanent identifiers allow data to have a stable location on the web so that other users can always find them, along with the information that makes them understandable, citable, and reusable. An open data license is important to allow others to reuse your data, but it does not exclude users from the obligation to correctly cite your data.
For more information about whether or not a Data Commons Curated Data is right for your dataset, see the Permanent Identifier FAQs page and the Data Commons Policy.
When you are ready to publish, see Requesting a Permanent Identifier in the Data Commons.
Publishing CyVerse Community Contributed Data
Community Contributed Data folders are available for evolving datasets that individuals or communities want to make available as quickly as possible for research and reuse. Community Contributed Data are intended for datasets that are growing or changing frequently or that may not need long-term preservation. Data can transition from Community Contributed Data to Data Commons Curated Data by requesting a DOI or ARK.
Publishing to external repositories
SRA pipeline: Data Commons enables CyVerse users to make submissions to the NCBI Sequence Read Archive directly. Submissions instructions include compressed sequenced files (FASTQ.gz, SFF.gz, and BAM.gz) and an XML metadata file, organized into a submission package.
WGS pipeline and TSA: Coming soon