Difference between revisions of "Kevin Wyllie Week 3"

Revision as of 18:22, 21 September 2015

These pipes yield the following amino acid sequences (shown on right):
- -1 Nter- V K M P P A E L A A G L Y A E H G I R Q R F K F N I V F F G H R T Y -Cter (shown in red)
- -2 Nter- L K C R Q R N W R L G Y T R N M V L G N V S S S I L S S L A I V P I E I -Cter (shown in green)
- -3 No polypeptide - first codon is STOP. (shown in blue)


cat 493.P_falciparum.xml | java -jar xmlpipedb-match-1.1.1.jar "GO:000[567]"

There are three unique matches (the maximum possible for this command).
- GO:0005 occurred 1,371 times.
- GO:0006 occurred 1,100 times.
- GO:0007 occurred 113 times.
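
As a rough cross-check of these tallies, the same kind of count can be sketched with standard tools. The character class `[567]` matches exactly one of three digits, which is why three unique matches is the ceiling. The sample text below is made up for illustration and is not taken from 493.P_falciparum.xml; `grep -o` is a GNU/BSD extension rather than a POSIX option.

```shell
# Hypothetical sample text; NOT taken from 493.P_falciparum.xml.
sample='GO:0005634 GO:0006412
GO:0005737
GO:0007165
GO:0008150'

# -o prints every match on its own line; sort | uniq -c tallies each distinct match.
go_counts=$(printf '%s\n' "$sample" | grep -o "GO:000[567]" | sort | uniq -c | awk '{print $2 "=" $1}')

echo "$go_counts"
```

On this sample the tally lists GO:0005 twice and GO:0006 and GO:0007 once each; the last line is ignored because 8 is outside the character class.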


grep "GO:0007" 493.P_falciparum.xml

Judging by the text on the lines where this pattern appears, "GO:0007" seems to be the first few characters of a gene ID. It may also relate to gene ontology, since I have seen "GO" used as an abbreviation for that term before.


cat "493.P_falciparum.xml" | java -jar xmlpipedb-match-1.1.1.jar "\"Yu.*\""

There are three unique matches.
- "yu b." occurred one time.
- "yu k." occurred 228 times.
- "yu m." occurred one time.
A grep command for this pattern brings up lines such as:

<person name="Yu K."/>

So these may be names of biologists, perhaps those who were responsible for the discovery of a given gene.
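
A minimal sketch of listing such names with grep alone, under two assumptions of mine: the lowercase step only mimics how the match output above is displayed, and `[^"]*` is swapped in for `.*` so that a match cannot run past the closing quote if a line carries more than one quoted value. Apart from the `<person name="Yu K."/>` line quoted above, the sample lines are hypothetical.

```shell
# Sample lines; only the "Yu K." line is quoted from the grep output above,
# the others are made up for illustration.
sample='<person name="Yu K."/>
<person name="Yu B."/>
<person name="Yu K."/>
<person name="Smith J."/>'

# Extract each quoted value starting with Yu, lowercase it, and keep one copy of each.
unique_names=$(printf '%s\n' "$sample" | grep -o '"Yu[^"]*"' | tr '[:upper:]' '[:lower:]' | sort -u)

echo "$unique_names"
```

Here the repeated "Yu K." collapses into a single entry, so two unique names are listed.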


To count occurrences of "ATG":
- The match function finds 830,101 matches in hs_ref_GRCh37_chr19.fa (shown on right, in green).
- Piping grep into wc finds 502,410 lines, 502,410 words and 35,671,048 characters (shown on right, in red).
- The discrepancy arises because the two tools count different things. The Match function counts every occurrence of the pattern, while grep outputs each whole line that contains the pattern at least once. The numbers that wc returns therefore describe the lines in which "ATG" is found, not the "ATG" occurrences themselves, and a line containing several occurrences is still counted only once.
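
The difference between counting lines and counting occurrences can be reproduced on a tiny, made-up sample (not the real hs_ref_GRCh37_chr19.fa). This is only a sketch: `grep -c` counts matching lines, while `grep -o`, a GNU/BSD extension rather than a POSIX option, prints each match on its own line so that `wc -l` counts occurrences.

```shell
# Hypothetical three-line sample standing in for the FASTA file.
sample='ATGATGCC
CCCCCCCC
GGATGGGG'

# Lines that contain the pattern at least once (what grep piped to wc counts).
line_count=$(printf '%s\n' "$sample" | grep -c "ATG")

# Individual occurrences: one output line per match, then count those lines.
match_count=$(printf '%s\n' "$sample" | grep -o "ATG" | wc -l | tr -d ' ')

echo "lines: $line_count, matches: $match_count"
```

The first sample line holds two occurrences but counts as one line, so the sketch reports 2 lines and 3 matches, the same direction of discrepancy as in the numbers above.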

Revision as of 18:21, 21 September 2015 by Kwyllie (Typo fix.)
Revision as of 18:22, 21 September 2015 by Kwyllie (Attempted formatting fix.)
Line 41:

cat prokaryote.txt | '''sed "s/^..//g"''' | sed "s/.../& /g" | sed "y/t/u/" | sed -f genetic-code.sed

+ (five blank lines added in the newer revision)

[[Image:Kwscreenshot3.jpg|right|thumb]]
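
The first three sed steps of the pipeline above can be traced on a short, made-up sequence (not the contents of prokaryote.txt). The final `sed -f genetic-code.sed` step is left out of this sketch because the contents of that file are not shown here.

```shell
# Hypothetical input sequence for illustration only.
seq="ctatgaaagggtaa"

# 1) drop the first two bases to shift the reading frame,
# 2) insert a space after every three bases to split into codons,
# 3) transliterate t to u to turn DNA letters into RNA letters.
frame3=$(printf '%s\n' "$seq" | sed "s/^..//g" | sed "s/.../& /g" | sed "y/t/u/")

echo "$frame3"
```

Note that the second step leaves a trailing space after the last codon, since `& ` appends a space to every complete triplet.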