Lena's Notes
From LMU BioDB 2013
consenus sequence: codons that like code for somethin particular.
brackets code for "or"
consensus sequnce for -10 site: [ct]at[at]at
-10 site could be... cataat tattat cattat tatat
hairpin sequence is aaaaggt you should see gcctttt, because t=u (mrna) and u binds with g
vibrio cholerae id: VC_#### VC_[0-9][0-9][0-9][0-9]
java -jar xmlpipedb-match-1.1.1.jar "VC_[0-9][0-9][0-9][0-9]" <uniprot-46.VC_cholerae_ATCC_39315.xml
select count (*) from genenametype where type = 'ordered locus' and value ~ 'VC_(A|)[0-9][0-9][0-9][0-9]';