
Requesting DOIs

CyVerse Curated Data in the Data Commons contains files that have been assigned a permanent identifier (DOI or ARK). These files are secure, stable, and unchangeable, which makes them well suited to data reuse and data citation. The CyVerse Curated Data site can store very large datasets that are difficult to transfer, upload, and download across different computers and platforms. CyVerse Curated Data is accessible to CyVerse's suite of large-scale computational analysis resources, allowing users to analyze, manage, and publish new results.

If you are interested in obtaining a permanent identifier for one or more datasets, please read the Permanent Identifier FAQs and then answer the questions on the page, Is Data Commons Curated Data right for my data?

Your public dataset represents your research. The Data Commons team wants to help you publish complete and well-documented datasets by providing tools, guidelines, and permanent identifiers, so that your research is presented clearly and thoroughly and others can learn about your project and reuse it. However, you are responsible for the contents and presentation of your dataset.

Requesting a DOI

Step 1: Before you begin, review the related pages

Step 2: Organize the dataset in the CyVerse Data Store

There are several steps to properly organizing your dataset. These include determining what data to include, how many identifiers to request, how to organize the data into folders, and creating the ReadMe file and data inventory.
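
As a rough illustration of the data inventory step, the sketch below lists every file under a local copy of the dataset folder, with its relative path and size, and can write the result to a CSV file. The column names, the CSV format, and the function names `build_inventory` and `write_inventory` are assumptions made for this example; they are not a format required by CyVerse.

```python
import csv
from pathlib import Path


def build_inventory(root):
    """Return sorted (relative_path, size_in_bytes) rows for every file under root."""
    root = Path(root)
    rows = []
    for path in root.rglob("*"):
        if path.is_file():
            # Use forward slashes so the listing reads the same on every platform.
            rows.append((path.relative_to(root).as_posix(), path.stat().st_size))
    return sorted(rows)


def write_inventory(root, out_csv):
    """Write the inventory to out_csv with a header row; return the file count."""
    rows = build_inventory(root)
    with open(out_csv, "w", newline="") as handle:
        writer = csv.writer(handle)
        writer.writerow(["path", "size_bytes"])  # hypothetical column names
        writer.writerows(rows)
    return len(rows)
```

Calling `write_inventory("my_dataset", "inventory.csv")` on a local folder named `my_dataset` (a placeholder) would produce one row per file, which can then be reviewed alongside the ReadMe file.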


Step 3: Add metadata to your folder

  1. In the Data window, click the checkbox next to the folder.
  2. Select Metadata > Edit / View Metadata.
    1. Alternatively, you can choose Edit / View Metadata from the three-dot menu next to the folder.
  3. Click + Select Template and choose the DOI Request / Datacite metadata template.
  4. Complete the required fields (marked with an asterisk) and as many of the optional fields as possible.
  5. Save the template. For more information, including how to apply metadata in bulk, see Using Metadata in the DE.
  6. You may add any additional metadata that is appropriate. We encourage the use of additional metadata to make your data better understood and more discoverable.
  7. We also encourage the use of metadata on subfolders and individual files in your datasets.
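
Before opening the template, it can help to check a draft of your metadata for gaps. The sketch below is a minimal example of such a check. The attribute names in `REQUIRED_FIELDS` are an assumption loosely based on the mandatory properties of the DataCite schema; the fields marked with an asterisk in the DE template are the authoritative list, and the function name `missing_required` is hypothetical.

```python
# Hypothetical list of required attributes; the asterisked fields in the
# DOI Request / Datacite template in the DE are the authoritative source.
REQUIRED_FIELDS = ("creator", "title", "publisher", "publicationYear", "resourceType")


def missing_required(metadata, required=REQUIRED_FIELDS):
    """Return the required attribute names that are absent or blank in metadata."""
    missing = []
    for name in required:
        value = metadata.get(name)
        # Treat None and whitespace-only values as not filled in.
        if value is None or not str(value).strip():
            missing.append(name)
    return missing


draft = {"title": "Example dataset", "creator": "A. Researcher", "publisher": " "}
print(missing_required(draft))
```

An empty list means every attribute in the assumed list has a value. It does not mean the curator will accept the values.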

Step 4: Submit the request for the DOI

  1. In the Data window, click the checkbox next to the folder.
  2. Select Metadata > Request DOI.

  3. After verifying you have read the manual (this page), click I need a DOI. You will receive a verification email that your request has been received and a notification will be listed in the Notifications list in the DE.

  4. At this point, your folder will move to a new location under Community Data/commons_repo/staging.

Step 5: Wait for CyVerse validation checks

After you submit your request, a CyVerse Curated Data curator begins validating your dataset, its metadata, and its overall organization. Validation is based on the required DOI metadata, the folder-naming conventions, and the dataset's potential utility to the CyVerse and larger scientific community, not on the quality of your data.

  • If the curator determines that the dataset is adequately organized and the DataCite metadata are accurate, they will provide a DOI, and you will be notified of the DOI and the location of its corresponding landing page in the Community Data > commons_repo > curated folder in the DE.
  • If the curator determines that minor changes are needed, they may make those changes themselves.
  • If the curator determines that substantive changes are needed, they will contact you with required changes.
  • If the curator determines that your dataset is not appropriate for the Curated Data site, you will be notified.

To check the status of your request, click Notifications at the top right of the DE screen. For more information on using notifications in the DE, see Viewing and Deleting Notifications.

Requesting an ARK

As of April 2018, CyVerse no longer offers ARKs.

Step 6: Published data

  • Once your DOI request is approved, the dataset will move to the Community Data > commons_repo > curated folder in the DE.
  • Datasets with DOIs are listed under "CyVerse Curated".
  • The DOI (in the form of a URL) will point directly to the landing page for your dataset.
  • You will no longer be able to edit your metadata once it is published.
    • For any changes or corrections, contact
    • Most changes will require issuing a new version of the dataset, with a new DOI, so it is important to make sure your data and metadata are all correct before publishing them.
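
To illustrate what "the DOI in the form of a URL" means, the sketch below turns a bare DOI into a link through the public `https://doi.org/` resolver. The regular expression is a loose sanity check rather than a full DOI grammar, the function name `doi_to_url` is hypothetical, and the DOI used in the example is a made-up placeholder, not a real CyVerse dataset.

```python
import re

# Loose shape check: "10.", a 4-9 digit registrant code, "/", then a suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


def doi_to_url(doi):
    """Return the https://doi.org/ URL for a DOI given in a common notation."""
    doi = doi.strip()
    # Strip prefixes that people often paste along with the DOI itself.
    for prefix in ("https://doi.org/", "http://doi.org/", "doi:"):
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"does not look like a DOI: {doi!r}")
    return "https://doi.org/" + doi


print(doi_to_url("doi:10.1234/example.5678"))  # placeholder DOI
```

Using the URL form in papers and on social media means readers always land on the dataset's landing page.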

Getting your dataset noticed

Metadata, the description of your data, is key to getting your dataset noticed on the web. Search engines and bibliographic aggregators index the metadata that you create to obtain a DOI. Thus, it is important that you do the following:

  • Make sure the metadata is complete.
  • Include descriptive terms about the science and themes involved in your research. These can go in the "Subject" attribute, but you can also create additional metadata specific to your dataset.
  • Include methods used to generate the dataset in the "Description" attribute, and in more detail in a Readme file.
  • Include terms such as precise keywords in the "Subject" attribute.
  • Describe the dataset for a broader audience so that they understand your research.
  • If you or your team members have an ORCID iD, make sure to include it when you edit the authors' names in the metadata.
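
A mistyped ORCID iD is easy to miss. The sketch below checks the 16-character form of an iD (four hyphen-separated groups) and its final check character, which ORCID computes with the ISO 7064 MOD 11-2 algorithm. The function names are hypothetical, and the check only confirms that an iD is well formed, not that it belongs to the author you intend.

```python
import re

ORCID_PATTERN = re.compile(r"^\d{4}-\d{4}-\d{4}-\d{3}[\dX]$")


def orcid_check_digit(base_digits):
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid):
    """Return True if orcid has the expected shape and a matching check character."""
    orcid = orcid.strip().upper()
    if not ORCID_PATTERN.match(orcid):
        return False
    digits = orcid.replace("-", "")
    return orcid_check_digit(digits[:15]) == digits[15]


# 0000-0002-1825-0097 is the sample iD ORCID uses in its own documentation.
print(is_valid_orcid("0000-0002-1825-0097"))
```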

Publicizing your dataset

There are several ways to publicize your dataset:

  • Consider using social media to share the DOI of your dataset, and tag CyVerse.
  • If you have an interesting story about your data, contact us, and we may be able to share it through CyVerse outreach.
  • If you have developed a tool or workflow to analyze your data in CyVerse, consider presenting it as part of our Focus Forums webinar series.