The applications listed here are available for use in the Discovery Environment and are documented in: Discovery Environment Manual.

Discovery Environment Applications List

The box below searches only this space.
To search the entire iPlant wiki, enter your query in the box at the upper right.

Maintenance: Tues, 28 Jan 2020

ACCESS TO OR USAGE OF THE FOLLOWING SERVICES WILL BE UNAVAILABLE OR DISRUPTED:

Discovery Environment         8:00am to 5:00pm MST
The Discovery Environment will be unavailable while patches and updates are applied.
        ** Currently running analyses will be terminated. Please plan accordingly.

Data Store                    8:00am to 5:00pm MST
The Data Store will be unavailable during the maintenance period.
 
Data Commons                  8:00am to 5:00pm MST
The Data Commons will be unavailable during the maintenance period.
 
Atmosphere and Cloud Services 8:00am to 5:00pm MST
Marana Cloud: Atmosphere instances in the Marana Cloud will be operational; however, you will not be able to use the Data Store within your instance, and you may not be able to access the Atmosphere web interface.
 
User Portal                   8:00am to 5:00pm MST
The User Portal, http://user.cyverse.org, will be unavailable while we perform maintenance and updates.
 
Agave/Science API             8:00am to 5:00pm MST
The Agave/Science API will be unavailable during this maintenance period.
 
DNA Subway                    8:00am to 5:00pm MST
DNA Subway will be unavailable during this maintenance period.
 
The following services will NOT be affected by the maintenance: CyVerse Wiki and JIRA

Keep up to date with our maintenance schedules on the CyVerse public calendar
http://www.cyverse.org/maintenance-calendar
Check your local timezone here https://bit.ly/36iVOkX 
 
Please contact support@cyverse.org for any questions, or concerns.

 

 

 

 

 

Skip to end of metadata
Go to start of metadata

 MUSCLE

MUSCLE performs DNA or amino acid multiple sequence alignment.

MUSCLE stands for MUltiple Sequence Comparison by Log- Expectation. MUSCLE may achieve both better average accuracy and better speed than ClustalW2 or T-Coffee, depending on the chosen options.

Icon

This documentation corresponds to the DE app named "MUSCLE-hpc-3.8.31", which runs on the Stampede supercomputer. See MUSCLE for documentation of the app "Muscle-3.8.31", which runs on the UA HTCondor cluster. These two apps can be used interchangeably.  Wait times may be shorter for Muscle-3.8.31.

Quick Start

  • To use MUSCLE, upload your data in FASTA format.
  • Resources: documentation

Test Data

All files are located in the Community Data directory of the iPlant Discovery Environment at the following path:

Community Data > iplantcollaborative > example_data > muscle-3.8.31

Input File(s)

  •  Use muscle_in.fa (FASTA file) as a test input file.
  • MUSCLE takes only FASTA files as input. 
  • MUSCLE allows lines of any length in a FASTA file. (Some programs limit lines to e.g., 80 characters).
  • MUSCLE does not support comments in FASTA files.
  • White space characters (blanks and tabs) are discarded if found in sequence data, but many other tools do not allow this and the practice is not recommended.

Parameters Used in App

When the app is run in the Discovery Environment, use the following parameters with the above input file(s) to get the output provided in the next section below

  • output format: Leave as default format (FASTA)
  • output name: Leave as default name (aligned.fa)

For information on adding command line parameters, please see the MUSCLE documentation: http://www.drive5.com/muscle

Output File(s)

  • MUSCLE generates multiple sequence alignments in a variety of output formats.
  • If you used the input and parameters suggested in this example, you should get the follow as output:
    • aligned.fa

Tool Source for App

References

MUSCLE v3.8.31 by Robert C. Edgar

http://www.drive5.com/muscle

Please cite:

Edgar, R.C. (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput
  Nucleic Acids Res. 32(5):1792-1797 [Link to PubMed].

Edgar, R.C. (2004) MUSCLE: a multiple sequence alignment method with reduced time and space complexity
  BMC Bioinformatics, (5) 113 [Link to PubMed].