5-5 Update

A number of my python scripts were changing the format of my fasta files making it impossible for biopython to read them in subsequent steps.  I decided to use perl to do the sequence parsing as I found a good framework and was able to modify it for my purposes.   I used the SPYDER python IDE and the quick install for biopythonDWIM Perl for Windows and this open-source NCBI tool.  The original code I modified is commented out in lines 242-262, and my modification is in 262-274.   

Here is the uploaded final powerpoint.

 

Here is a list of the files used: 

Original laccases seqs w/o fragments:

laccase.fasta

high REDOX sequences:

High_E0_matched.fasta

Secreted high REDOX laccases:

T-matched.fasta

T-matched.fasta

and here is a link to my github. The program is entitled fasta_seq_regex.