Difference between revisions of "Kevin Wyllie Week 3"

Revision as of 20:21, 20 September 2015

These pipes yield the following amino acid sequences (shown on right):
- -1 Nter- V K M P P A E L A A G L Y A E H G I R Q R F K F N I V F F G H R T Y -Cter (shown in red)
- -2 Nter- L K C R Q R N W R L G Y T R N M V L G N V S S S I L S S L A I V P I E I -Cter (shown in green)
- -3 No polypeptide - first codon is STOP. (shown in blue)

cat 493.P_falciparum.xml | java -jar xmlpipedb-match-1.1.1.jar "GO:000[567]"

There are three unique matches (the maximum possible for this command).
- GO:0005 occurred 1,371 times. GO:0007 occurred 113 times.
- GO:0006 occurred 1,100 times.
- GO:0007 occurred 113 times.

grep "GO:0007" 493.P_falciparum.xml

Looking at the text found on the same lines as this pattern, it appears to be the first few characters of a gene ID. Based on prior knowledge, it also may have something to do with gene ontology, as I have seen "GO" as an acronym for that term before.

cat "493.P_falciparum.xml" | java -jar xmlpipedb-match-1.1.1.jar "\"Yu.*\""

There are three unique matches.
- "yu b." occurred one time.
- "yu k." occurred 228 times.
- "yu m." occurred one time.
A grep command for this pattern brings up lines such as:

<person name="Yu K."/>

So these may be names of biologists, perhaps those who were responsible for the discovery of a given gene.

@@ Line 44: / Line 44: @@
 [[Image:Kwscreenshot3.jpg|right|thumb]]
 * These pipes yield the following amino acid sequences (shown on right):
-** '''+1''' Nter- S T I F Q -Cter (shown in green)
+** '''+1''' Nter- S T I F Q -Cter (shown in red)
-** '''+2''' Nter- L L Y F N R Y D G Q R R Q Y -Cter (shown in red)
+** '''+2''' Nter- L L Y F N R Y D G Q R R Q Y -Cter (shown in green)
 ** '''+3''' Nter- Y Y I S I G T M A K E D N I E L E T L P N T M F R V -Cter (shown in blue)
@@ Line 71: / Line 71: @@
 [[Image:Kwscreenshot4.jpg|right|thumb]]
 * These pipes yield the following amino acid sequences (shown on right):
-** '''-1''' Nter- V K M P P A E L A A G L Y A E H G I R Q R F K F N I V F F G H R T Y -Cter (shown in green)
+** '''-1''' Nter- V K M P P A E L A A G L Y A E H G I R Q R F K F N I V F F G H R T Y -Cter (shown in red)
-** '''-2''' Nter- L K C R Q R N W R L G Y T R N M V L G N V S S S I L S S L A I V P I E I -Cter (shown in red)
+** '''-2''' Nter- L K C R Q R N W R L G Y T R N M V L G N V S S S I L S S L A I V P I E I -Cter (shown in green)
 ** '''-3''' No polypeptide - first codon is STOP. (shown in blue)
 ===XMLPipeDB Match Practice===