Difference between revisions of "FAS2 Week 3"

From LMU BioDB 2019
Jump to navigation Jump to search
(DNA Sequence: DNA SEQUENCE FROM REFERENCE)
(Acknowledgements)
 
(15 intermediate revisions by 2 users not shown)
Line 1: Line 1:
 
==Summary==
 
==Summary==
 +
FAS2 is a gene that encodes for the alpha subunit of the fatty acid synthase. This proteins is particularly important as fatty acids are one of the most crucial molecules in a cell, involved in nearly every facet of a cell's function and metabolism. The gene is located on Chromosome XVI from positions 108652 to 114315 bp. It coordinates the binding of six beta subunits of the complex, which are encoded by the FAS1 gene. The alpha subunit contains 1894 amino acids.
  
 
==Gene Info==
 
==Gene Info==
Line 20: Line 21:
  
 
===DNA Sequence===
 
===DNA Sequence===
<code>
+
<pre>
 
ATGAAGCCGGAAGTTGAGCAAGAATTAGCTCATATTTTGCTAACTGAATTGTTAGCTTAT
 
ATGAAGCCGGAAGTTGAGCAAGAATTAGCTCATATTTTGCTAACTGAATTGTTAGCTTAT
 
CAATTTGCCTCTCCTGTGAGATGGATTGAAACTCAAGATGTTTTTTTGAAGGATTTTAAC
 
CAATTTGCCTCTCCTGTGAGATGGATTGAAACTCAAGATGTTTTTTTGAAGGATTTTAAC
Line 116: Line 117:
 
GAAGCTGGTGTTACCGATGTGAAGGTATCTATTTCTCACGATGACCTCCAAGCTGTCGCG
 
GAAGCTGGTGTTACCGATGTGAAGGTATCTATTTCTCACGATGACCTCCAAGCTGTCGCG
 
GTCGCCGTTTCTACTAAGAAATAG
 
GTCGCCGTTTCTACTAAGAAATAG
</code>
+
</pre>
  
>FAS2 YPL231W SGDID:S000006152, chrXVI:108652..114315
+
===Protein Sequence===
 
+
<pre>
ATGAAGCCGGAAGTTGAGCAAGAATTAGCTCATATTTTGCTAACTGAATTGTTAGCTTAT
+
MKPEVEQELAHILLTELLAYQFASPVRWIETQDVFLKDFNTERVVEIGPSPTLAGMAQRT
CAATTTGCCTCTCCTGTGAGATGGATTGAAACTCAAGATGTTTTTTTGAAGGATTTTAAC
+
LKNKYESYDAALSLHREILCYSKDAKEIYYTPDPSELAAKEEPAKEEAPAPTPAASAPAP
ACTGAAAGGGTTGTTGAAATCGGTCCTTCTCCAACTTTGGCTGGGATGGCTCAAAGAACC
+
AAAAPAPVAAAAPAAAAAEIADEPVKASLLLHVLVAHKLKKSLDSIPMSKTIKDLVGGKS
TTGAAGAATAAATACGAATCTTACGATGCTGCTCTGTCTTTACATAGAGAAATCTTATGC
+
TVQNEILGDLGKEFGTTPEKPEETPLEELAETFQDTFSGALGKQSSSLLSRLISSKMPGG
TATTCGAAGGATGCCAAAGAGATTTATTATACCCCAGATCCATCCGAACTAGCTGCAAAG
+
FTITVARKYLQTRWGLPSGRQDGVLLVALSNEPAARLGSEADAKAFLDSMAQKYASIVGV
GAAGAGCCCGCTAAGGAAGAAGCTCCTGCTCCAACTCCAGCTGCTAGTGCTCCTGCTCCT
+
DLSSAASASGAAGAGAAAGAAMIDAGALEEITKDHKVLARQQLQVLARYLKMDLDNGERK
GCAGCAGCAGCCCCAGCTCCCGTCGCGGCAGCAGCCCCAGCTGCAGCAGCTGCTGAGATT
+
FLKEKDTVAELQAQLDYLNAELGEFFVNGVATSFSRKKARTFDSSWNWAKQSLLSLYFEI
GCCGATGAACCTGTCAAGGCTTCCCTATTGTTGCACGTTTTGGTTGCTCACAAGTTGAAG
+
IHGVLKNVDREVVSEAINIMNRSNDALIKFMEYHISNTDETKGENYQLVKTLGEQLIENC
AAGTCGTTAGATTCCATTCCAATGTCCAAGACAATCAAAGACTTGGTCGGTGGTAAATCT
+
KQVLDVDPVYKDVAKPTGPKTAIDKNGNITYSEEPREKVRKLSQYVQEMALGGPITKESQ
ACAGTCCAAAATGAAATTTTGGGTGATTTAGGTAAAGAATTTGGTACTACTCCTGAAAAA
+
PTIEEDLTRVYKAISAQADKQDISSSTRVEFEKLYSDLMKFLESSKEIDPSQTTQLAGMD
CCAGAAGAAACTCCATTAGAAGAATTGGCAGAAACTTTCCAAGATACCTTCTCTGGAGCA
+
VEDALDKDSTKEVASLPNKSTISKTVSSTIPRETIPFLHLRKKTPAGDWKYDRQLSSLFL
TTGGGTAAGCAATCTTCCTCGTTATTATCAAGATTAATCTCATCTAAGATGCCTGGTGGG
+
DGLEKAAFNGVTFKDKYVLITGAGKGSIGAEVLQGLLQGGAKVVVTTSRFSKQVTDYYQS
TTTACTATTACTGTCGCTAGAAAATACTTACAAACTCGCTGGGGACTACCATCTGGTAGA
+
IYAKYGAKGSTLIVVPFNQGSKQDVEALIEFIYDTEKNGGLGWDLDAIIPFAAIPEQGIE
CAAGATGGTGTCCTTTTGGTAGCTTTATCTAACGAGCCTGCTGCTCGTCTAGGTTCTGAA
+
LEHIDSKSEFAHRIMLTNILRMMGCVKKQKSARGIETRPAQVILPMSPNHGTFGGDGMYS
GCTGATGCCAAGGCTTTCTTGGACTCCATGGCTCAAAAATACGCTTCCATTGTTGGTGTT
+
ESKLSLETLFNRWHSESWANQLTVCGAIIGWTRGTGLMSANNIIAEGIEKMGVRTFSQKE
GACTTATCATCAGCTGCTAGCGCTAGTGGTGCTGCCGGTGCAGGTGCTGCTGCCGGTGCA
+
MAFNLLGLLTPEVVELCQKSPVMADLNGGLQFVPELKEFTAKLRKELVETSEVRKAVSIE
GCTATGATCGATGCTGGCGCTCTGGAAGAAATAACCAAAGACCACAAGGTTTTGGCGCGT
+
TALEHKVVNGNSADAAYAQVEIQPRANIQLDFPELKPYKQVKQIAPAELEGLLDLERVIV
CAACAACTGCAAGTATTGGCTCGTTATCTAAAAATGGACTTGGATAACGGTGAAAGAAAG
+
VTGFAEVGPWGSARTRWEMEAFGEFSLEGCVEMAWIMGFISYHNGNLKGRPYTGWVDSKT
TTCTTGAAAGAAAAGGACACTGTTGCTGAACTTCAAGCTCAGTTGGATTACTTGAATGCC
+
KEPVDDKDVKAKYETSILEHSGIRLIEPELFNGYNPEKKEMIQEVIVEEDLEPFEASKET
GAATTAGGTGAATTCTTTGTTAACGGTGTTGCTACTTCTTTCTCTAGAAAAAAGGCCAGA
+
AEQFKHQHGDKVDIFEIPETGEYSVKLLKGATLYIPKALRFDRLVAGQIPTGWNAKTYGI
ACCTTCGATTCTTCCTGGAACTGGGCTAAACAATCTTTATTATCATTATACTTTGAGATA
+
SDDIISQVDPITLFVLVSVVEAFIASGITDPYEMYKYVHVSEVGNCSGSGMGGVSALRGM
ATTCATGGTGTCTTGAAAAACGTTGATAGAGAGGTTGTTAGTGAAGCTATCAATATCATG
+
FKDRFKDEPVQNDILQESFINTMSAWVNMLLISSSGPIKTPVGACATSVESVDIGVETIL
AACAGATCTAACGATGCTTTGATTAAATTCATGGAATACCATATCTCTAACACTGATGAA
+
SGKARICIVGGYDDFQEEGSFEFGNMKATSNTLEEFEHGRTPAEMSRPATTTRNGFMEAQ
ACAAAAGGTGAAAACTATCAATTGGTTAAAACTCTTGGTGAGCAGTTGATTGAAAACTGT
+
GAGIQIIMQADLALKMGVPIYGIVAMAATATDKIGRSVPAPGKGILTTAREHHSSVKYAS
AAACAAGTTTTGGATGTTGATCCAGTTTACAAAGATGTTGCTAAGCCTACCGGTCCAAAA
+
PNLNMKYRKRQLVTREAQIKDWVENELEALKLEAEEIPSEDQNEFLLERTREIHNEAESQ
ACTGCTATTGACAAGAACGGTAACATTACATACTCAGAAGAGCCAAGAGAAAAGGTTAGG
+
LRAAQQQWGNDFYKRDPRIAPLRGALATYGLTIDDLGVASFHGTSTKANDKNESATINEM
AAATTATCTCAATACGTACAAGAAATGGCCCTTGGTGGTCCAATCACCAAAGAATCTCAA
+
MKHLGRSEGNPVIGVFQKFLTGHPKGAAGAWMMNGALQILNSGIIPGNRNADNVDKILEQ
CCTACTATTGAAGAGGATTTGACTCGTGTTTACAAGGCAATCAGTGCTCAAGCTGATAAA
+
FEYVLYPSKTLKTDGVRAVSITSFGFGQKGGQAIVVHPDYLYGAITEDRYNEYVAKVSAR
CAAGATATTTCCAGCTCCACCAGGGTTGAATTTGAAAAACTATATAGTGATTTGATGAAG
+
EKSAYKFFHNGMIYNKLFVSKEHAPYTDELEEDVYLDPLARVSKDKKSGSLTFNSKNIQS
TTCTTGGAAAGCTCCAAAGAAATCGATCCTTCTCAAACAACCCAATTGGCCGGTATGGAT
+
KDSYINANTIETAKMIENMTKEKVSNGGVGVDVELITSINVENDTFIERNFTPQEIEYCS
GTTGAGGATGCTTTGGACAAAGATTCCACCAAAGAAGTTGCTTCTTTGCCAAACAAATCT
+
AQPSVQSSFAGTWSAKEAVFKSLGVKSLGGGAALKDIEIVRVNKNAPAVELHGNAKKAAE
ACCATTTCTAAGACGGTATCTTCAACTATTCCAAGAGAAACTATTCCGTTCTTACATTTG
+
EAGVTDVKVSISHDDLQAVAVAVSTKK
AGAAAGAAGACTCCTGCCGGAGATTGGAAATATGACCGCCAATTGTCTTCTCTTTTCTTA
+
</pre>
GATGGTTTAGAAAAGGCTGCCTTCAACGGTGTCACCTTCAAGGACAAATACGTCTTGATC
 
ACTGGTGCTGGTAAGGGTTCTATTGGTGCTGAAGTCTTGCAAGGTTTGTTACAAGGTGGT
 
GCTAAGGTTGTTGTTACCACCTCTCGTTTCTCTAAGCAAGTTACAGACTACTACCAATCC
 
ATTTACGCCAAATATGGTGCTAAGGGTTCTACTTTGATTGTTGTTCCATTCAACCAAGGT
 
TCTAAGCAAGACGTTGAAGCTTTGATTGAATTTATCTACGACACTGAAAAGAATGGTGGT
 
TTAGGTTGGGATCTAGATGCTATTATTCCATTCGCGGCCATTCCAGAACAAGGTATTGAA
 
TTAGAACATATTGATTCTAAGTCTGAATTTGCTCATAGAATCATGTTGACCAATATCTTA
 
AGAATGATGGGTTGTGTCAAGAAGCAAAAATCTGCAAGAGGTATTGAAACAAGACCAGCT
 
CAAGTCATTCTACCAATGTCTCCAAACCATGGTACTTTCGGTGGTGATGGTATGTATTCA
 
GAATCCAAGTTGTCTTTGGAAACTTTGTTCAACAGATGGCACTCTGAATCCTGGGCCAAT
 
CAATTAACCGTTTGCGGTGCTATTATTGGTTGGACTAGAGGTACTGGTTTAATGAGCGCT
 
AATAACATCATTGCTGAAGGCATTGAAAAGATGGGTGTTCGTACTTTCTCTCAAAAGGAA
 
ATGGCTTTCAACTTATTGGGTCTATTGACTCCAGAAGTCGTAGAATTGTGCCAAAAATCA
 
CCTGTTATGGCTGACTTGAATGGTGGTTTGCAATTTGTTCCTGAATTGAAGGAATTCACT
 
GCTAAATTGCGTAAAGAGTTGGTTGAAACTTCTGAAGTTAGAAAGGCAGTTTCCATCGAA
 
ACTGCTTTGGAGCATAAGGTTGTCAATGGCAATAGCGCTGATGCTGCATATGCTCAAGTC
 
GAAATTCAACCAAGAGCTAACATTCAACTGGACTTCCCAGAATTGAAACCATACAAACAG
 
GTTAAACAAATTGCTCCCGCTGAGCTTGAAGGTTTGTTGGATTTGGAAAGAGTTATTGTA
 
GTTACCGGTTTTGCTGAAGTCGGCCCATGGGGTTCGGCCAGAACAAGATGGGAAATGGAA
 
GCTTTTGGTGAATTTTCGTTGGAAGGTTGCGTTGAAATGGCCTGGATTATGGGCTTCATT
 
TCATACCATAACGGTAATTTGAAGGGTCGTCCATACACTGGTTGGGTTGATTCCAAAACA
 
AAAGAACCAGTTGATGACAAGGACGTTAAGGCCAAGTATGAAACATCAATCCTAGAACAC
 
AGTGGTATCAGATTGATCGAACCAGAGTTATTCAATGGTTACAACCCAGAAAAGAAGGAA
 
ATGATTCAAGAAGTCATTGTCGAAGAAGACTTGGAACCATTTGAGGCTTCGAAGGAAACT
 
GCCGAACAATTTAAACACCAACATGGTGACAAAGTGGATATCTTCGAAATCCCAGAAACA
 
GGAGAGTACTCTGTTAAGTTACTAAAGGGTGCCACTTTATACATTCCAAAGGCTTTGAGA
 
TTTGACCGTTTGGTTGCAGGTCAAATTCCAACTGGTTGGAATGCTAAGACTTATGGTATC
 
TCTGATGATATCATTTCTCAGGTTGACCCAATCACATTATTCGTTTTGGTCTCTGTTGTG
 
GAAGCATTTATTGCATCTGGTATCACCGACCCATACGAAATGTACAAATACGTACATGTT
 
TCTGAGGTTGGTAACTGTTCTGGTTCTGGTATGGGTGGTGTTTCTGCCTTACGTGGTATG
 
TTTAAGGACCGTTTCAAGGATGAGCCTGTCCAAAATGATATTTTACAAGAATCATTTATC
 
AACACCATGTCCGCTTGGGTTAATATGTTGTTGATTTCCTCATCTGGTCCAATCAAGACA
 
CCTGTTGGTGCCTGTGCCACATCCGTGGAATCTGTTGACATTGGTGTAGAAACCATCTTG
 
TCTGGTAAGGCTAGAATCTGTATTGTCGGTGGTTACGATGATTTCCAAGAAGAAGGCTCC
 
TTTGAGTTCGGTAACATGAAGGCCACTTCCAACACTTTGGAAGAATTTGAACATGGTCGT
 
ACCCCAGCGGAAATGTCCAGACCTGCCACCACTACCCGTAACGGTTTTATGGAAGCTCAA
 
GGTGCTGGTATTCAAATCATCATGCAAGCTGATTTAGCTTTGAAGATGGGTGTGCCAATT
 
TACGGTATTGTTGCCATGGCTGCTACCGCCACCGATAAGATTGGTAGATCTGTGCCAGCT
 
CCAGGTAAGGGTATTTTAACCACTGCTCGTGAACACCACTCCAGTGTTAAGTATGCTTCA
 
CCAAACTTGAACATGAAGTACAGAAAGCGCCAATTGGTTACTCGTGAAGCTCAGATTAAA
 
GATTGGGTAGAAAACGAATTGGAAGCTTTGAAGTTGGAGGCCGAAGAAATTCCAAGCGAA
 
GACCAAAACGAGTTCTTACTTGAACGTACCAGAGAAATCCACAACGAAGCTGAAAGTCAA
 
TTGAGAGCTGCACAACAACAATGGGGTAACGACTTCTACAAGAGGGACCCACGTATTGCT
 
CCATTGAGAGGAGCACTGGCTACTTACGGTTTAACTATTGATGACTTGGGTGTCGCTTCA
 
TTCCACGGTACATCCACAAAGGCTAATGACAAGAACGAATCTGCCACAATTAATGAAATG
 
ATGAAGCATTTGGGTAGATCTGAAGGTAATCCCGTCATTGGTGTTTTCCAAAAGTTCTTG
 
ACTGGTCATCCAAAGGGTGCTGCTGGTGCATGGATGATGAATGGTGCTTTGCAAATTCTA
 
AACAGTGGTATTATTCCAGGTAACCGTAACGCTGATAACGTGGATAAGATCTTGGAGCAA
 
TTTGAATACGTCTTGTACCCATCCAAGACTTTAAAGACCGACGGTGTCAGAGCCGTGTCC
 
ATCACTTCTTTCGGTTTTGGTCAAAAGGGTGGTCAAGCTATTGTGGTTCATCCAGACTAC
 
TTATACGGTGCTATCACTGAAGACAGATACAACGAGTATGTCGCCAAGGTTAGTGCCAGA
 
GAGAAAAGTGCCTACAAATTCTTCCATAATGGTATGATCTACAACAAGTTGTTCGTAAGT
 
AAAGAGCATGCTCCATACACTGATGAATTGGAAGAGGATGTTTACTTGGACCCATTAGCC
 
CGTGTATCTAAGGATAAGAAATCAGGCTCCTTGACTTTCAACTCTAAAAACATCCAAAGC
 
AAGGACAGTTACATCAATGCTAACACCATTGAAACTGCCAAGATGATTGAAAACATGACC
 
AAGGAGAAAGTCTCTAACGGTGGCGTCGGTGTAGATGTTGAATTAATCACTAGCATCAAC
 
GTTGAAAATGATACTTTTATCGAGCGCAATTTCACCCCGCAAGAAATAGAGTACTGCAGC
 
GCGCAGCCTAGTGTGCAAAGCTCTTTCGCTGGGACATGGTCCGCCAAAGAGGCTGTTTTC
 
AAGTCCTTAGGCGTCAAGTCCTTAGGCGGTGGTGCTGCATTGAAAGACATCGAAATCGTA
 
CGCGTTAACAAAAACGCTCCAGCCGTTGAACTGCACGGTAACGCCAAAAAGGCTGCCGAA
 
GAAGCTGGTGTTACCGATGTGAAGGTATCTATTTCTCACGATGACCTCCAAGCTGTCGCG
 
GTCGCCGTTTCTACTAAGAAATAG
 
  
===Protein Sequence===
+
[[Image:Protein_Sequence.PNG | thumb | Protein Sequence (Frame 1 is the Protein Sequence)]]
  
 
===Gene Function===
 
===Gene Function===
 +
This gene encodes the alpha subunit of the proteins fatty acid synthetase. The complex of the alpha subunit with the beta subunit (encoded by [https://www.yeastgenome.org/locus/S000001665 FAS1]) are responsible for the synthesis of 16- and 18-carbon long fatty acid chains from acetyl-CoA, malonyl-CoA, and NADPH (Cherry et al., 2007). The product of this reaction is palmitic or stearic acid (Lomakin et al., 2007). This is then transferred to other organelles and further modified before being delivered to its final destination, whether the lipid membrane or some other destination.
  
 
===Differences Between Databases===
 
===Differences Between Databases===
 +
SGD seems to be the only database that contains gene expression data. However, I do not seem to find any major differences in content between the databases; the only difference that I could find was that the NCBI Gene database specified more Gene Ontology terms for our gene than any of the other databases. The NCBI Gene database seems to contain far less detailed Information about the gene, however it does provide the necessary information. Furthermore, it seems that many of these databases actually derive information from each other, as I can search for the gene in all the databases using any of the unique identifier codes from any of the databases. While SGD seems to contain far more information than the other databases, it seems that UniProt actually contains far more detailed information; however, UniProt does not seem to be manually curated as SGD is because much of the information that it displays it cites from other databases.
  
 
===Why This Gene?===
 
===Why This Gene?===
Line 226: Line 167:
  
 
===Gene Image===
 
===Gene Image===
 +
 +
[[File:Firstglance_animation_1.gif]]
  
 
==Acknowledgements==
 
==Acknowledgements==
Line 231: Line 174:
  
 
Except for what is noted above, this individual journal entry was completed by me and not copied from another source.
 
Except for what is noted above, this individual journal entry was completed by me and not copied from another source.
 +
 +
[[User:Msamdars|Msamdars]] ([[User talk:Msamdars|talk]]) 21:52, 18 September 2019 (PDT)
  
 
==References==
 
==References==
Line 238: Line 183:
 
# Brown, G. R., Hem, V., Katz, K. S., Ovetsky, M., Wallin, C., Ermolaeva, O., ... & Murphy, T. D. (2014). Gene: a gene-centered information resource at NCBI. ''Nucleic Acids Research'', 43(D1), D36-D42.
 
# Brown, G. R., Hem, V., Katz, K. S., Ovetsky, M., Wallin, C., Ermolaeva, O., ... & Murphy, T. D. (2014). Gene: a gene-centered information resource at NCBI. ''Nucleic Acids Research'', 43(D1), D36-D42.
 
# Researchers identify the yeast genes behind rose and honeyed flavors in beer and wine (2017, November 7) retrieved 16 September 2019 from https://phys.org/news/2017-11-yeast-genes-rose- honeyed-flavors.html
 
# Researchers identify the yeast genes behind rose and honeyed flavors in beer and wine (2017, November 7) retrieved 16 September 2019 from https://phys.org/news/2017-11-yeast-genes-rose- honeyed-flavors.html
 +
# Lomakin, I. B., Xiong, Y., & Steitz, T. A. (2007). The crystal structure of yeast fatty acid synthase, a cellular machine with eight active sites working together. Cell, 129(2), 319-332.
 
# de Carvalho, B. T., Holt, S., Souffriau, B., Brandão, R. L., Foulquié-Moreno, M. R., & Thevelein, J. M. (2017). Identification of novel alleles conferring superior production of rose flavor phenylethyl acetate using polygenic analysis in yeast. ''MBio'', 8(6), e01173-17.
 
# de Carvalho, B. T., Holt, S., Souffriau, B., Brandão, R. L., Foulquié-Moreno, M. R., & Thevelein, J. M. (2017). Identification of novel alleles conferring superior production of rose flavor phenylethyl acetate using polygenic analysis in yeast. ''MBio'', 8(6), e01173-17.

Latest revision as of 21:52, 18 September 2019

Summary

FAS2 is a gene that encodes for the alpha subunit of the fatty acid synthase. This proteins is particularly important as fatty acids are one of the most crucial molecules in a cell, involved in nearly every facet of a cell's function and metabolism. The gene is located on Chromosome XVI from positions 108652 to 114315 bp. It coordinates the binding of six beta subunits of the complex, which are encoded by the FAS1 gene. The alpha subunit contains 1894 amino acids.

Gene Info

Name

Standard Name: FAS2

Systematic Name: YPL231W

Name Description: Fatty Acid Synthetase

Gene ID

SGD ID: S000006152

NCBI ID: 855845

Ensembl ID: YPL231W

Uniprot ID: P19097

DNA Sequence

ATGAAGCCGGAAGTTGAGCAAGAATTAGCTCATATTTTGCTAACTGAATTGTTAGCTTAT
CAATTTGCCTCTCCTGTGAGATGGATTGAAACTCAAGATGTTTTTTTGAAGGATTTTAAC
ACTGAAAGGGTTGTTGAAATCGGTCCTTCTCCAACTTTGGCTGGGATGGCTCAAAGAACC
TTGAAGAATAAATACGAATCTTACGATGCTGCTCTGTCTTTACATAGAGAAATCTTATGC
TATTCGAAGGATGCCAAAGAGATTTATTATACCCCAGATCCATCCGAACTAGCTGCAAAG
GAAGAGCCCGCTAAGGAAGAAGCTCCTGCTCCAACTCCAGCTGCTAGTGCTCCTGCTCCT
GCAGCAGCAGCCCCAGCTCCCGTCGCGGCAGCAGCCCCAGCTGCAGCAGCTGCTGAGATT
GCCGATGAACCTGTCAAGGCTTCCCTATTGTTGCACGTTTTGGTTGCTCACAAGTTGAAG
AAGTCGTTAGATTCCATTCCAATGTCCAAGACAATCAAAGACTTGGTCGGTGGTAAATCT
ACAGTCCAAAATGAAATTTTGGGTGATTTAGGTAAAGAATTTGGTACTACTCCTGAAAAA
CCAGAAGAAACTCCATTAGAAGAATTGGCAGAAACTTTCCAAGATACCTTCTCTGGAGCA
TTGGGTAAGCAATCTTCCTCGTTATTATCAAGATTAATCTCATCTAAGATGCCTGGTGGG
TTTACTATTACTGTCGCTAGAAAATACTTACAAACTCGCTGGGGACTACCATCTGGTAGA
CAAGATGGTGTCCTTTTGGTAGCTTTATCTAACGAGCCTGCTGCTCGTCTAGGTTCTGAA
GCTGATGCCAAGGCTTTCTTGGACTCCATGGCTCAAAAATACGCTTCCATTGTTGGTGTT
GACTTATCATCAGCTGCTAGCGCTAGTGGTGCTGCCGGTGCAGGTGCTGCTGCCGGTGCA
GCTATGATCGATGCTGGCGCTCTGGAAGAAATAACCAAAGACCACAAGGTTTTGGCGCGT
CAACAACTGCAAGTATTGGCTCGTTATCTAAAAATGGACTTGGATAACGGTGAAAGAAAG
TTCTTGAAAGAAAAGGACACTGTTGCTGAACTTCAAGCTCAGTTGGATTACTTGAATGCC
GAATTAGGTGAATTCTTTGTTAACGGTGTTGCTACTTCTTTCTCTAGAAAAAAGGCCAGA
ACCTTCGATTCTTCCTGGAACTGGGCTAAACAATCTTTATTATCATTATACTTTGAGATA
ATTCATGGTGTCTTGAAAAACGTTGATAGAGAGGTTGTTAGTGAAGCTATCAATATCATG
AACAGATCTAACGATGCTTTGATTAAATTCATGGAATACCATATCTCTAACACTGATGAA
ACAAAAGGTGAAAACTATCAATTGGTTAAAACTCTTGGTGAGCAGTTGATTGAAAACTGT
AAACAAGTTTTGGATGTTGATCCAGTTTACAAAGATGTTGCTAAGCCTACCGGTCCAAAA
ACTGCTATTGACAAGAACGGTAACATTACATACTCAGAAGAGCCAAGAGAAAAGGTTAGG
AAATTATCTCAATACGTACAAGAAATGGCCCTTGGTGGTCCAATCACCAAAGAATCTCAA
CCTACTATTGAAGAGGATTTGACTCGTGTTTACAAGGCAATCAGTGCTCAAGCTGATAAA
CAAGATATTTCCAGCTCCACCAGGGTTGAATTTGAAAAACTATATAGTGATTTGATGAAG
TTCTTGGAAAGCTCCAAAGAAATCGATCCTTCTCAAACAACCCAATTGGCCGGTATGGAT
GTTGAGGATGCTTTGGACAAAGATTCCACCAAAGAAGTTGCTTCTTTGCCAAACAAATCT
ACCATTTCTAAGACGGTATCTTCAACTATTCCAAGAGAAACTATTCCGTTCTTACATTTG
AGAAAGAAGACTCCTGCCGGAGATTGGAAATATGACCGCCAATTGTCTTCTCTTTTCTTA
GATGGTTTAGAAAAGGCTGCCTTCAACGGTGTCACCTTCAAGGACAAATACGTCTTGATC
ACTGGTGCTGGTAAGGGTTCTATTGGTGCTGAAGTCTTGCAAGGTTTGTTACAAGGTGGT
GCTAAGGTTGTTGTTACCACCTCTCGTTTCTCTAAGCAAGTTACAGACTACTACCAATCC
ATTTACGCCAAATATGGTGCTAAGGGTTCTACTTTGATTGTTGTTCCATTCAACCAAGGT
TCTAAGCAAGACGTTGAAGCTTTGATTGAATTTATCTACGACACTGAAAAGAATGGTGGT
TTAGGTTGGGATCTAGATGCTATTATTCCATTCGCGGCCATTCCAGAACAAGGTATTGAA
TTAGAACATATTGATTCTAAGTCTGAATTTGCTCATAGAATCATGTTGACCAATATCTTA
AGAATGATGGGTTGTGTCAAGAAGCAAAAATCTGCAAGAGGTATTGAAACAAGACCAGCT
CAAGTCATTCTACCAATGTCTCCAAACCATGGTACTTTCGGTGGTGATGGTATGTATTCA
GAATCCAAGTTGTCTTTGGAAACTTTGTTCAACAGATGGCACTCTGAATCCTGGGCCAAT
CAATTAACCGTTTGCGGTGCTATTATTGGTTGGACTAGAGGTACTGGTTTAATGAGCGCT
AATAACATCATTGCTGAAGGCATTGAAAAGATGGGTGTTCGTACTTTCTCTCAAAAGGAA
ATGGCTTTCAACTTATTGGGTCTATTGACTCCAGAAGTCGTAGAATTGTGCCAAAAATCA
CCTGTTATGGCTGACTTGAATGGTGGTTTGCAATTTGTTCCTGAATTGAAGGAATTCACT
GCTAAATTGCGTAAAGAGTTGGTTGAAACTTCTGAAGTTAGAAAGGCAGTTTCCATCGAA
ACTGCTTTGGAGCATAAGGTTGTCAATGGCAATAGCGCTGATGCTGCATATGCTCAAGTC
GAAATTCAACCAAGAGCTAACATTCAACTGGACTTCCCAGAATTGAAACCATACAAACAG
GTTAAACAAATTGCTCCCGCTGAGCTTGAAGGTTTGTTGGATTTGGAAAGAGTTATTGTA
GTTACCGGTTTTGCTGAAGTCGGCCCATGGGGTTCGGCCAGAACAAGATGGGAAATGGAA
GCTTTTGGTGAATTTTCGTTGGAAGGTTGCGTTGAAATGGCCTGGATTATGGGCTTCATT
TCATACCATAACGGTAATTTGAAGGGTCGTCCATACACTGGTTGGGTTGATTCCAAAACA
AAAGAACCAGTTGATGACAAGGACGTTAAGGCCAAGTATGAAACATCAATCCTAGAACAC
AGTGGTATCAGATTGATCGAACCAGAGTTATTCAATGGTTACAACCCAGAAAAGAAGGAA
ATGATTCAAGAAGTCATTGTCGAAGAAGACTTGGAACCATTTGAGGCTTCGAAGGAAACT
GCCGAACAATTTAAACACCAACATGGTGACAAAGTGGATATCTTCGAAATCCCAGAAACA
GGAGAGTACTCTGTTAAGTTACTAAAGGGTGCCACTTTATACATTCCAAAGGCTTTGAGA
TTTGACCGTTTGGTTGCAGGTCAAATTCCAACTGGTTGGAATGCTAAGACTTATGGTATC
TCTGATGATATCATTTCTCAGGTTGACCCAATCACATTATTCGTTTTGGTCTCTGTTGTG
GAAGCATTTATTGCATCTGGTATCACCGACCCATACGAAATGTACAAATACGTACATGTT
TCTGAGGTTGGTAACTGTTCTGGTTCTGGTATGGGTGGTGTTTCTGCCTTACGTGGTATG
TTTAAGGACCGTTTCAAGGATGAGCCTGTCCAAAATGATATTTTACAAGAATCATTTATC
AACACCATGTCCGCTTGGGTTAATATGTTGTTGATTTCCTCATCTGGTCCAATCAAGACA
CCTGTTGGTGCCTGTGCCACATCCGTGGAATCTGTTGACATTGGTGTAGAAACCATCTTG
TCTGGTAAGGCTAGAATCTGTATTGTCGGTGGTTACGATGATTTCCAAGAAGAAGGCTCC
TTTGAGTTCGGTAACATGAAGGCCACTTCCAACACTTTGGAAGAATTTGAACATGGTCGT
ACCCCAGCGGAAATGTCCAGACCTGCCACCACTACCCGTAACGGTTTTATGGAAGCTCAA
GGTGCTGGTATTCAAATCATCATGCAAGCTGATTTAGCTTTGAAGATGGGTGTGCCAATT
TACGGTATTGTTGCCATGGCTGCTACCGCCACCGATAAGATTGGTAGATCTGTGCCAGCT
CCAGGTAAGGGTATTTTAACCACTGCTCGTGAACACCACTCCAGTGTTAAGTATGCTTCA
CCAAACTTGAACATGAAGTACAGAAAGCGCCAATTGGTTACTCGTGAAGCTCAGATTAAA
GATTGGGTAGAAAACGAATTGGAAGCTTTGAAGTTGGAGGCCGAAGAAATTCCAAGCGAA
GACCAAAACGAGTTCTTACTTGAACGTACCAGAGAAATCCACAACGAAGCTGAAAGTCAA
TTGAGAGCTGCACAACAACAATGGGGTAACGACTTCTACAAGAGGGACCCACGTATTGCT
CCATTGAGAGGAGCACTGGCTACTTACGGTTTAACTATTGATGACTTGGGTGTCGCTTCA
TTCCACGGTACATCCACAAAGGCTAATGACAAGAACGAATCTGCCACAATTAATGAAATG
ATGAAGCATTTGGGTAGATCTGAAGGTAATCCCGTCATTGGTGTTTTCCAAAAGTTCTTG
ACTGGTCATCCAAAGGGTGCTGCTGGTGCATGGATGATGAATGGTGCTTTGCAAATTCTA
AACAGTGGTATTATTCCAGGTAACCGTAACGCTGATAACGTGGATAAGATCTTGGAGCAA
TTTGAATACGTCTTGTACCCATCCAAGACTTTAAAGACCGACGGTGTCAGAGCCGTGTCC
ATCACTTCTTTCGGTTTTGGTCAAAAGGGTGGTCAAGCTATTGTGGTTCATCCAGACTAC
TTATACGGTGCTATCACTGAAGACAGATACAACGAGTATGTCGCCAAGGTTAGTGCCAGA
GAGAAAAGTGCCTACAAATTCTTCCATAATGGTATGATCTACAACAAGTTGTTCGTAAGT
AAAGAGCATGCTCCATACACTGATGAATTGGAAGAGGATGTTTACTTGGACCCATTAGCC
CGTGTATCTAAGGATAAGAAATCAGGCTCCTTGACTTTCAACTCTAAAAACATCCAAAGC
AAGGACAGTTACATCAATGCTAACACCATTGAAACTGCCAAGATGATTGAAAACATGACC
AAGGAGAAAGTCTCTAACGGTGGCGTCGGTGTAGATGTTGAATTAATCACTAGCATCAAC
GTTGAAAATGATACTTTTATCGAGCGCAATTTCACCCCGCAAGAAATAGAGTACTGCAGC
GCGCAGCCTAGTGTGCAAAGCTCTTTCGCTGGGACATGGTCCGCCAAAGAGGCTGTTTTC
AAGTCCTTAGGCGTCAAGTCCTTAGGCGGTGGTGCTGCATTGAAAGACATCGAAATCGTA
CGCGTTAACAAAAACGCTCCAGCCGTTGAACTGCACGGTAACGCCAAAAAGGCTGCCGAA
GAAGCTGGTGTTACCGATGTGAAGGTATCTATTTCTCACGATGACCTCCAAGCTGTCGCG
GTCGCCGTTTCTACTAAGAAATAG

Protein Sequence

MKPEVEQELAHILLTELLAYQFASPVRWIETQDVFLKDFNTERVVEIGPSPTLAGMAQRT
LKNKYESYDAALSLHREILCYSKDAKEIYYTPDPSELAAKEEPAKEEAPAPTPAASAPAP
AAAAPAPVAAAAPAAAAAEIADEPVKASLLLHVLVAHKLKKSLDSIPMSKTIKDLVGGKS
TVQNEILGDLGKEFGTTPEKPEETPLEELAETFQDTFSGALGKQSSSLLSRLISSKMPGG
FTITVARKYLQTRWGLPSGRQDGVLLVALSNEPAARLGSEADAKAFLDSMAQKYASIVGV
DLSSAASASGAAGAGAAAGAAMIDAGALEEITKDHKVLARQQLQVLARYLKMDLDNGERK
FLKEKDTVAELQAQLDYLNAELGEFFVNGVATSFSRKKARTFDSSWNWAKQSLLSLYFEI
IHGVLKNVDREVVSEAINIMNRSNDALIKFMEYHISNTDETKGENYQLVKTLGEQLIENC
KQVLDVDPVYKDVAKPTGPKTAIDKNGNITYSEEPREKVRKLSQYVQEMALGGPITKESQ
PTIEEDLTRVYKAISAQADKQDISSSTRVEFEKLYSDLMKFLESSKEIDPSQTTQLAGMD
VEDALDKDSTKEVASLPNKSTISKTVSSTIPRETIPFLHLRKKTPAGDWKYDRQLSSLFL
DGLEKAAFNGVTFKDKYVLITGAGKGSIGAEVLQGLLQGGAKVVVTTSRFSKQVTDYYQS
IYAKYGAKGSTLIVVPFNQGSKQDVEALIEFIYDTEKNGGLGWDLDAIIPFAAIPEQGIE
LEHIDSKSEFAHRIMLTNILRMMGCVKKQKSARGIETRPAQVILPMSPNHGTFGGDGMYS
ESKLSLETLFNRWHSESWANQLTVCGAIIGWTRGTGLMSANNIIAEGIEKMGVRTFSQKE
MAFNLLGLLTPEVVELCQKSPVMADLNGGLQFVPELKEFTAKLRKELVETSEVRKAVSIE
TALEHKVVNGNSADAAYAQVEIQPRANIQLDFPELKPYKQVKQIAPAELEGLLDLERVIV
VTGFAEVGPWGSARTRWEMEAFGEFSLEGCVEMAWIMGFISYHNGNLKGRPYTGWVDSKT
KEPVDDKDVKAKYETSILEHSGIRLIEPELFNGYNPEKKEMIQEVIVEEDLEPFEASKET
AEQFKHQHGDKVDIFEIPETGEYSVKLLKGATLYIPKALRFDRLVAGQIPTGWNAKTYGI
SDDIISQVDPITLFVLVSVVEAFIASGITDPYEMYKYVHVSEVGNCSGSGMGGVSALRGM
FKDRFKDEPVQNDILQESFINTMSAWVNMLLISSSGPIKTPVGACATSVESVDIGVETIL
SGKARICIVGGYDDFQEEGSFEFGNMKATSNTLEEFEHGRTPAEMSRPATTTRNGFMEAQ
GAGIQIIMQADLALKMGVPIYGIVAMAATATDKIGRSVPAPGKGILTTAREHHSSVKYAS
PNLNMKYRKRQLVTREAQIKDWVENELEALKLEAEEIPSEDQNEFLLERTREIHNEAESQ
LRAAQQQWGNDFYKRDPRIAPLRGALATYGLTIDDLGVASFHGTSTKANDKNESATINEM
MKHLGRSEGNPVIGVFQKFLTGHPKGAAGAWMMNGALQILNSGIIPGNRNADNVDKILEQ
FEYVLYPSKTLKTDGVRAVSITSFGFGQKGGQAIVVHPDYLYGAITEDRYNEYVAKVSAR
EKSAYKFFHNGMIYNKLFVSKEHAPYTDELEEDVYLDPLARVSKDKKSGSLTFNSKNIQS
KDSYINANTIETAKMIENMTKEKVSNGGVGVDVELITSINVENDTFIERNFTPQEIEYCS
AQPSVQSSFAGTWSAKEAVFKSLGVKSLGGGAALKDIEIVRVNKNAPAVELHGNAKKAAE
EAGVTDVKVSISHDDLQAVAVAVSTKK
Protein Sequence (Frame 1 is the Protein Sequence)

Gene Function

This gene encodes the alpha subunit of the proteins fatty acid synthetase. The complex of the alpha subunit with the beta subunit (encoded by FAS1) are responsible for the synthesis of 16- and 18-carbon long fatty acid chains from acetyl-CoA, malonyl-CoA, and NADPH (Cherry et al., 2007). The product of this reaction is palmitic or stearic acid (Lomakin et al., 2007). This is then transferred to other organelles and further modified before being delivered to its final destination, whether the lipid membrane or some other destination.

Differences Between Databases

SGD seems to be the only database that contains gene expression data. However, I do not seem to find any major differences in content between the databases; the only difference that I could find was that the NCBI Gene database specified more Gene Ontology terms for our gene than any of the other databases. The NCBI Gene database seems to contain far less detailed Information about the gene, however it does provide the necessary information. Furthermore, it seems that many of these databases actually derive information from each other, as I can search for the gene in all the databases using any of the unique identifier codes from any of the databases. While SGD seems to contain far more information than the other databases, it seems that UniProt actually contains far more detailed information; however, UniProt does not seem to be manually curated as SGD is because much of the information that it displays it cites from other databases.

Why This Gene?

We chose this gene after reading an article that summarized the findings of an interesting study on yeast genes that impart flavor in beer and wine during the brewing process. This particular gene was one of two genes that were identified as producing higher levels of a compound called phenylethyl acetate, which imparts a flavor reminiscent of rose or honey (de Carvalho et al., 2017). FAS2 was one of those two genes; it is responsible for encoding the alpha subunit of fatty acid synthase, an enzyme that is unsurprisingly involved in fatty acid synthesis. We thought it would be interesting to explore this gene as it is also essential for the synthesis of essential fatty acids, such as palmitic acid, which is the most common fatty acid present in eukaryotes.

Gene Image

Firstglance animation 1.gif

Acknowledgements

This week, the two partners for this project are Mihir Samdarshi and Jonar Cowan. Both partners worked with each other outside of class to assist each other in finding and synthesizing the necessary information required to complete this assignment.

Except for what is noted above, this individual journal entry was completed by me and not copied from another source.

Msamdars (talk) 21:52, 18 September 2019 (PDT)

References

  1. Cherry, J. M., Hong, E. L., Amundsen, C., Balakrishnan, R., Binkley, G., Chan, E. T., ... & Fisk, D. G. (2011). Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Research, 40(D1), D700-D705.
  2. Cunningham, F., Amode, M. R., Barrell, D., Beal, K., Billis, K., Brent, S., ... & Gil, L. (2014). Ensembl 2015. Nucleic Acids Research, 43(D1), D662-D669.
  3. The UniProt Consortium. (2017). UniProt: the universal protein knowledgebase. Nucleic Acids Research, 45(D1), D158–D169, https://doi.org/10.1093/nar/gkw1099
  4. Brown, G. R., Hem, V., Katz, K. S., Ovetsky, M., Wallin, C., Ermolaeva, O., ... & Murphy, T. D. (2014). Gene: a gene-centered information resource at NCBI. Nucleic Acids Research, 43(D1), D36-D42.
  5. Researchers identify the yeast genes behind rose and honeyed flavors in beer and wine (2017, November 7) retrieved 16 September 2019 from https://phys.org/news/2017-11-yeast-genes-rose- honeyed-flavors.html
  6. Lomakin, I. B., Xiong, Y., & Steitz, T. A. (2007). The crystal structure of yeast fatty acid synthase, a cellular machine with eight active sites working together. Cell, 129(2), 319-332.
  7. de Carvalho, B. T., Holt, S., Souffriau, B., Brandão, R. L., Foulquié-Moreno, M. R., & Thevelein, J. M. (2017). Identification of novel alleles conferring superior production of rose flavor phenylethyl acetate using polygenic analysis in yeast. MBio, 8(6), e01173-17.