To search content in this manual only, enter your query above. To search for content in the entire CyVerse wiki, enter your query at the top right.
__________________

DATA COMMONS USER MANUAL
 

 

 


 

 

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This page is updated approximately once per quarter.

Projects under development 

Data Commons Website - datacommons.cyverse.org and dc.cyverse.org

...

  • Project creation (timeline uknown)
    • Step 1:
      • Ability to create a project
      • Ability to add metadata to a project
      • Ability to manage people, data, apps, tools, analyses, and allocations within a project
    • Step 2:
      • Ability to create datasets within a project
      • Ability to publish datasets from within a project, to DCR or external repositories

Submission to NCBI TSA (Transcriptome Assembly) 

Status: Planning

Whole Genome Shotgun (WGS) projects are genome assemblies of incomplete genomes or incomplete chromosomes of prokaryotes or eukaryotes that are generally being sequenced by a whole genome shotgun strategy. WGS projects may be annotated, but annotation is not required. We are adapting the CyVerse SRA submission pipeline (see below) to submit sequences to WGS.

  •   Similar to SRA submission features. Users must create a BioProject and BioSample and apply appropriate metadata before submission

Ontology-based Metadata Management (OMM)

Image Modified

First available: Q2 2016

A big hurdle to using adequate metadata is that scientists often have to supply different sets of metadata for different uses of the same file such as getting a DOI, submitting to NCBI, or publishing to Dryad. Often the different fields contain the same information, but with slightly different labels, meaning that researchers have to enter the same data multiple times, fuss with formatting, and often end up getting frustrated and giving up. Another hurdle for getting scientist to use metadata is the large amount of effort required for limited value in return.

...

 

  • On hold until after site visit.
  • See JIRA epic DS-114

Re-organization of Community Data

 

Policies

Completed projects

These features are operational and under maintenance. As new user needs arise, further development of these features may take place.

CyVerse Data Policies

  • A new  Data Commons User Agreement and Data Policy were released 04/2017.
  • Data policy was updated 08/2017 at the request of the Science team. Users can now request up to 10 TB without executive team review. 

 

Submission to external repositories

Submission to NCBI SRA repository

First available: Q2 2015

CyVerse users can now submit data to SRA through the DE. The process is documented on the NCBI Sequence Read Archive (SRA) Submission (Workflow Tutorial) wiki page.

 

Submission to NCBI WGS repository

 

Status: In First available: Q2 2017, in beta testing

 

Whole Genome Shotgun (WGS) projects are genome assemblies of incomplete genomes or incomplete chromosomes of prokaryotes or eukaryotes that are generally being sequenced by a whole genome shotgun strategy. WGS projects may be annotated, but annotation is not required. We are adapting the CyVerse SRA submission pipeline (see below) to submit sequences to WGS.

  •  

...

  •   Similar to SRA submission features. Users must create a BioProject and BioSample and apply appropriate metadata before submission

 

Submission to NCBI TSA (Transcriptome Assembly) 

 

Status: Planning


Future projects

Improved data discovery and reporting

...