Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

General Workflow:

  • Transfer data to Cyverse Discovery Environment
  • Gunzip your files
  • FastQC
  • Hisat2 (index and align)
  • HTSeq Count
  • DeSeq2

...

 

Info
titleExample Data

Example data to be used with this tutorial can be found here: Community Data -> iplantcollaborative -> example_data -> Mouse_RNAseq_DESeq2

 

How to Transfer your files to Cyverse Discovery Environment (DE)

There are multiple ways to transfer your files (either locally from your computer or from the UAGC data storage space). Below are the most common methods. If you are unfamiliar with the command line, I would suggest using Cyberduck. If you are familiar with the command line, I would suggest using iRODs. 

...

  1. Click on the "Add" button
  2. A directory will open and you can search for the fastq.gz files you want to unzip
  3. Click on "OK" and it will load the files you chose
  4. The "Names" box will load all the files you inputed into the box
  5. I keep the settings as the default: If you want to change the setting, read the manual of GUNZIP here:Uncompress files with gunzip 1.6-2
  6. Click "Launch Analysis" once you are ready for the computer to start the analysis. (Repeat steps 1-4 as necessary until you have all the files you want loaded)

...

  1. In your analyses folder, click on "HTSeq-count-0.6.1" 
  2. You will see "paired.sorted.XXXX.txt" files for your counts . This will be your count matrix

Example of Count Matrix for UT_top5M_1.sorted

Image Added

DESeq2

Now that we know the quantity of each transcript in each sample we need to compare those quantities between samples. For this we will use DESeq2.

...

  1. Move all your paired.sorted.XXX.txt files to one folder for the easiest analysis
  2. Select all your paired.sorted.XXX.txt files for your input (you can not do 27 pairwise at the same time, select smaller sets of samples for your pairwise comparisons)
  3. Need to create a target file: Refer to https://pods.iplantcollaborative.org/wiki/pages/viewpage.action?pageId=28115144b look below for image
  4. Select:
    1. Reference Biological condition: probably whatever the name is for your untreated/control samples
    2. Everything else: as default

Image Added

Output File Types:

  1. In your analyses folder, click on "Deseq2(multifactorial pairwise comparison)" 
  2. You will see "XXX.complete.txt" which has all the genes and your pairwise comparison
  3. You will see "XXX.up.txt" are genes that are upregulated in your pairwise comoparison
  4. You will see "XXX.down.txt" are genes that are downregulated in your pairwise comparison

...