Difference between revisions of "Malverso Week 3"

From LMU BioDB 2015
Jump to: navigation, search
(answered reading frames for +3 strand (and fixed the signs of the first two questions))
(answered reading frames for the (actual) -1 strand.)
Line 13: Line 13:
 
I tried cat prokaryote.txt | sed “y/t/u/” | sed –f genetic-code.sed, but when I checked that answer on the ExPASy Translate Tool  it was wrong. Then I tried adding a space between each set of three letters with sed “s/…/& /g” which produced the correct answer but with some left over bases on the end.I added sed “s/[augc]//” to get rid of the left over bases, and then sed "s/ //g" to get rid of the spaces between the letters.  
 
I tried cat prokaryote.txt | sed “y/t/u/” | sed –f genetic-code.sed, but when I checked that answer on the ExPASy Translate Tool  it was wrong. Then I tried adding a space between each set of three letters with sed “s/…/& /g” which produced the correct answer but with some left over bases on the end.I added sed “s/[augc]//” to get rid of the left over bases, and then sed "s/ //g" to get rid of the spaces between the letters.  
 
This looked like:
 
This looked like:
  cat prokaryote.txt | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[atgc]//g" | sed "s/ //g"
+
  cat prokaryote.txt | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"
 
which had the output:
 
which had the output:
 
  STIFQ-VRWPKKTILNLKRCLIPCSAYNPAASSAGGIL
 
  STIFQ-VRWPKKTILNLKRCLIPCSAYNPAASSAGGIL
Line 28: Line 28:
 
which proved to be successful:
 
which proved to be successful:
 
  YYISIGTMAKEDNIELETLPNTMFRV-PSRQFRWRHFN
 
  YYISIGTMAKEDNIELETLPNTMFRV-PSRQFRWRHFN
 +
 +
====-1====
 +
At first I thought all I had to do was add rev prokaryote.txt to the beginning of the previous three. I was mistaken. I realized that I needed to first change the bases to their complements, which I did with the code I wrote for the first homework question. My code:
 +
cat prokaryote.txt | rev prokaryote.txt | sed “y/agtc/tacg/” | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[aguc]//g" | sed "s/ //g"
 +
which produced:
 +
VKMPPAELAAGLYAEHGIRQRFKFNIVFFGHRTY-NIV
 +
  
  
 
{{Template:Malverso}}
 
{{Template:Malverso}}

Revision as of 01:00, 22 September 2015

The Genetic Code, By Computer

I used putty.exe and logged in to my account on my.cs.lmu.edu in order to access prokaryote.txt.

Complement of a Strand

At first, I tried using the command cat prokaryote.txt | sed “s/atcg/tagc/g”, which was incorrect. I revisited my notes to see that it is actually:

cat prokaryote.txt | sed "y/atgc/tacg/" 

Reading Frames

+1

Using prokaryote.txt again, I used sed “y/t/u/” to replace all the t's with u's. I then re-read Introduction to the Command Line to find that sed –f <file with rules> is the technique I should use to use to harness the helpfullness of the genetic_code.sed file.

I tried cat prokaryote.txt | sed “y/t/u/” | sed –f genetic-code.sed, but when I checked that answer on the ExPASy Translate Tool it was wrong. Then I tried adding a space between each set of three letters with sed “s/…/& /g” which produced the correct answer but with some left over bases on the end.I added sed “s/[augc]//” to get rid of the left over bases, and then sed "s/ //g" to get rid of the spaces between the letters. This looked like:

cat prokaryote.txt | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"

which had the output:

STIFQ-VRWPKKTILNLKRCLIPCSAYNPAASSAGGIL

+2

For this strand, the only change I had to make in my code was to get rid of the very first character of prokaryote.txt. At first, I added the command sed "s/^[agtc]//g", but realized that it also worked to use sed "s/^.//g". I chose the second choice which looked like:

cat prokaryote.txt | sed "s/^.//g" | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"

which had the output:

LLYFNRYDGQRRQY-T-NVA-YHVPRITQPPVPLAAF- 

+3

All I did was add a "." to the sed command to delete the front two letters instead of just one:

cat prokaryote.txt | sed "s/^..//g" | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[augc]//g" | sed "s/ //g"

which proved to be successful:

YYISIGTMAKEDNIELETLPNTMFRV-PSRQFRWRHFN

-1

At first I thought all I had to do was add rev prokaryote.txt to the beginning of the previous three. I was mistaken. I realized that I needed to first change the bases to their complements, which I did with the code I wrote for the first homework question. My code:

cat prokaryote.txt | rev prokaryote.txt | sed “y/agtc/tacg/” | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed | sed "s/[aguc]//g" | sed "s/ //g"

which produced:

VKMPPAELAAGLYAEHGIRQRFKFNIVFFGHRTY-NIV



Team Page

Heavy Metal HaterZ

Assignments

Individual Journal Entries

Shared Journal Entries