Difference between revisions of "Kmeilak Week 3"

From LMU BioDB 2013
Jump to: navigation, search
(Reading Frames)
Line 17: Line 17:
 
===Reading Frames===
 
===Reading Frames===
  
*+1 strand: cat prokaryote.txt | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
+
*+1 strand:
 +
cat prokaryote.txt | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
  
 +
*+2 strand:
 +
cat prokaryote.txt | sed "s/^.//g" | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
  
 +
*+3 strand:
 +
cat prokaryote.txt | sed "s/^..//g" | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
  
 +
*-1 strand:
 +
cat prokaryote.txt | sed "y/atcg/uagc/" | rev | sed "s/.../& /g" | sed -f genetic-code.sed
  
 +
*-2 strand:
 +
cat prokaryote.txt | sed "y/atcg/uagc/" | sed "s^.//g" | rev | sed "s/.../& /g" | sed -f genetic-code.sed
 +
 +
*-3 strand:
 +
cat prokaryote.txt | sed "y/atcg/uagc/" | sed "s^..//g" | rev | sed "s/.../& /g" | sed -f genetic-code.sed
  
 
==XMLPipeDB Match Practice==
 
==XMLPipeDB Match Practice==

Revision as of 04:11, 13 September 2013

Contents

Week 2 (corrected)

Kmeilak Week 2

Where's your stuff?

Link to html: [file:///C:/Users/Kevin/Desktop/Week%203%20homework.html link to Kevin's html]

The Genetic Code by Computer

Complement of a Strand

The way to return a strand's complement is by inputting the following

cat prokaryote.txt | sed "y/atcg/tagc/"

Reading Frames

  • +1 strand:
cat prokaryote.txt | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
  • +2 strand:
cat prokaryote.txt | sed "s/^.//g" | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
  • +3 strand:
cat prokaryote.txt | sed "s/^..//g" | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
  • -1 strand:
cat prokaryote.txt | sed "y/atcg/uagc/" | rev | sed "s/.../& /g" | sed -f genetic-code.sed
  • -2 strand:
cat prokaryote.txt | sed "y/atcg/uagc/" | sed "s^.//g" | rev | sed "s/.../& /g" | sed -f genetic-code.sed
  • -3 strand:
cat prokaryote.txt | sed "y/atcg/uagc/" | sed "s^..//g" | rev | sed "s/.../& /g" | sed -f genetic-code.sed

XMLPipeDB Match Practice

1. The Match command that tallies the occurrences of the pattern GO:000916. in the 493.P_falciparum.xml file is

  • There are 2 unique matches
  • the first unique match (go:0009165) appears twice, and the second unique match (go:0009168) appears once
  • The pattern GO:000916. represents

2. The Match command that tallies the occurrences of the pattern \"James.*\" in the 493.P_falciparum.xml file is

  • There are 2 unique matches
  • The first unique match ("james k.d.") appears 8238 times and the second unique match ("james a.a.") appears 1 time
  • The pattern \"James.*\" represents

3. ATG Results

  • The answer Match gave me was that ATG appeared 830101 times
  • grep/wc said that ATG appeared 502410 times
  • Yes, these responses do make sense. Match looks only for the pattern ATG, but grep/wc searches for the number of lines of code where the sequence appears, which means any lines with multiple appearances of ATG are counted as a single result.
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox