5-5 Update

A number of my python scripts were changing the format of my fasta files making it impossible for biopython to read them in subsequent steps. I decided to use perl to do the sequence parsing as I found a good framework and was able to modify it for my purposes. I used the SPYDER python IDE and the quick install for biopython, DWIM Perl for Windows and this open-source NCBI tool. The original code I modified is commented out in lines 242-262, and my modification is in 262-274.

Here is the uploaded final powerpoint.

Here is a list of the files used:

Original laccases seqs w/o fragments:

laccase.fasta

high REDOX sequences:

High_E0_matched.fasta

Secreted high REDOX laccases:

T-matched.fasta

and here is a link to my github. The program is entitled fasta_seq_regex.