Difference between revisions of "Kmeilak Week 3"
From LMU BioDB 2013
(→Week 2) |
|||
Line 6: | Line 6: | ||
Link to html: [file:///C:/Users/Kevin/Desktop/Week%203%20homework.html link to Kevin's html] | Link to html: [file:///C:/Users/Kevin/Desktop/Week%203%20homework.html link to Kevin's html] | ||
+ | |||
+ | ==The Genetic Code by Computer== | ||
+ | |||
+ | ===Complement of a Strand=== | ||
+ | |||
+ | The way to return a strand's complement is by inputting the following | ||
+ | |||
+ | cat prokaryote.txt | sed "y/atcg/tagc/" | ||
+ | |||
+ | ===Reading Frames=== | ||
+ | |||
+ | *+1 strand: cat prokaryote.txt | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed | ||
+ | |||
+ | |||
+ | |||
+ | |||
==XMLPipeDB Match Practice== | ==XMLPipeDB Match Practice== |
Revision as of 04:06, 13 September 2013
Contents |
Week 2 (corrected)
Where's your stuff?
Link to html: [file:///C:/Users/Kevin/Desktop/Week%203%20homework.html link to Kevin's html]
The Genetic Code by Computer
Complement of a Strand
The way to return a strand's complement is by inputting the following
cat prokaryote.txt | sed "y/atcg/tagc/"
Reading Frames
- +1 strand: cat prokaryote.txt | sed "s/t/u/g" | sed "s/.../& /g" | sed -f genetic-code.sed
XMLPipeDB Match Practice
1. The Match command that tallies the occurrences of the pattern GO:000916. in the 493.P_falciparum.xml file is
- There are 2 unique matches
- the first unique match (go:0009165) appears twice, and the second unique match (go:0009168) appears once
- The pattern GO:000916. represents
2. The Match command that tallies the occurrences of the pattern \"James.*\" in the 493.P_falciparum.xml file is
- There are 2 unique matches
- The first unique match ("james k.d.") appears 8238 times and the second unique match ("james a.a.") appears 1 time
- The pattern \"James.*\" represents
3. ATG Results
- The answer Match gave me was that ATG appeared 830101 times
- grep/wc said that ATG appeared 502410 times
- Yes, these responses do make sense. Match looks only for the pattern ATG, but grep/wc searches for the number of lines of code where the sequence appears, which means any lines with multiple appearances of ATG are counted as a single result.