Malverso Week 3

From LMU BioDB 2015
Revision as of 01:00, 22 September 2015 by Malverso (Talk | contribs) (answered reading frames for the (actual) -1 strand.)

Jump to: navigation, search

The Genetic Code, By Computer

I used putty.exe and logged in to my account on my.cs.lmu.edu in order to access prokaryote.txt.

Complement of a Strand

At first, I tried using the command cat prokaryote.txt | sed “s/atcg/tagc/g”, which was incorrect. I revisited my notes to see that it is actually:

cat prokaryote.txt | sed "y/atgc/tacg/" 

Reading Frames

+1

Using prokaryote.txt again, I used sed “y/t/u/” to replace all the t's with u's. I then re-read Introduction to the Command Line to find that sed –f <file with rules> is the technique I should use to use to harness the helpfullness of the genetic_code.sed file.

I tried cat prokaryote.txt | sed “y/t/u/” | sed –f genetic-code.sed, but when I checked that answer on the ExPASy Translate Tool it was wrong. Then I tried adding a space between each set of three letters with sed “s/…/& /g” which produced the correct answer but with some left over bases on the end.I added sed “s/[augc]//” to get rid of the left over bases, and then sed "s/ //g" to get rid of the spaces between the letters. This looked like:

cat prokaryote.txt | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"

which had the output:

STIFQ-VRWPKKTILNLKRCLIPCSAYNPAASSAGGIL

+2

For this strand, the only change I had to make in my code was to get rid of the very first character of prokaryote.txt. At first, I added the command sed "s/^[agtc]//g", but realized that it also worked to use sed "s/^.//g". I chose the second choice which looked like:

cat prokaryote.txt | sed "s/^.//g" | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"

which had the output:

LLYFNRYDGQRRQY-T-NVA-YHVPRITQPPVPLAAF- 

+3

All I did was add a "." to the sed command to delete the front two letters instead of just one:

cat prokaryote.txt | sed "s/^..//g" | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"

which proved to be successful:

YYISIGTMAKEDNIELETLPNTMFRV-PSRQFRWRHFN

-1

At first I thought all I had to do was add rev prokaryote.txt to the beginning of the previous three. I was mistaken. I realized that I needed to first change the bases to their complements, which I did with the code I wrote for the first homework question. My code:

cat prokaryote.txt | rev prokaryote.txt | sed “y/agtc/tacg/” | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[aguc]//g" | sed "s/ //g"

which produced:

VKMPPAELAAGLYAEHGIRQRFKFNIVFFGHRTY-NIV



Team Page

Heavy Metal HaterZ

Assignments

Individual Journal Entries

Shared Journal Entries