Difference between revisions of "Vkuehn Week 11"

From LMU BioDB 2013

Revision as of 05:23, 12 November 2013


Journal Club Preparation: Leishmania Major

Genome Reference Paper: The Genome of the Kinetoplastid Parasite, Leishmania major (Reference Genome)

10 Biological Terms

  1. Polycistronic: A single mRNA encoding several different polypeptide chains http://medical-dictionary.thefreedictionary.com/Polycistronic
  2. Metacyclic forms: Produced in an intermediate host, and infective for the definitive host; said of the infective stages of trypanosomes http://medical-dictionary.thefreedictionary.com/Metacyclic
  3. Aneuploid: Having an abnormal number of chromosomes not an exact multiple of the haploid number, as contrasted with abnormal numbers of complete haploid sets of chromosomes, such as diploid or triploid, etc. http://medical-dictionary.thefreedictionary.com/Aneuploid
  4. "Repeat-repeat": (Repeated tandem repeats) Copies of DNA sequences which lie adjacent to each other in the same orientation or in the opposite direction to each other. http://www.nlm.nih.gov/cgi/mesh/2011/MB_cgi?mode=&term=Tandem+Repeat
  5. Subtelomeric sequence: Segments of DNA between telomeric caps and chromatin. http://encyclopedia.thefreedictionary.com/Subtelomeric
  6. Macrophage migration inhibition factor (MIF): A protein believed to be involved in immune response http://encyclopedia.thefreedictionary.com/MIF
  7. Tautomerase activity: An enzyme catalyzing the interconversion of tautomers (Tautomer: structural isomers that differ only in the position of a hydrogen atom or proton) http://medical-dictionary.thefreedictionary.com/tautomerase
  8. Tandem arrays: The existence of two or more identical DNA sequences in series, i.e., end-to-end. http://www.fao.org/docrep/003/X3910E/X3910E23.htm
  9. Sphingolipids: Any of a group of lipids, such as sphingomyelins or cerebrosides, that yield sphingosine or its derivatives upon hydrolysis. http://dictionary.reference.com/browse/Sphingolipid
  10. Serine peptidases: Serine proteolytic enzymes that catalyze the hydrolysis of peptide linkages; they comprise the exopeptidases and endopeptidases http://medical-dictionary.thefreedictionary.com/peptidases

Article Outline

Introduction

  • It is important to study the genome of Leishmania major because of the various human diseases that this parasite is capable of causing. Infection by a Leishmania parasite can lead to a number of diseases. Annually there are 2 million cases in 88 tropical and subtropical countries.
  • How it infects:
    1. Parasite transmitted by sand flies as proliferative promastigote
    2. Differentiate into nondividing forms before inoculation into vertebrate host
    3. Host macrophages phagocytose the metacyclics --> these differentiate into amastigotes (which proliferate in the phagolysosome)
    4. Leads to host macrophage lysis and infection of other macrophages
    5. Outcome of infection depends on species, host immune system and host genetics
  • The genome is interesting to look at because its mechanism of regulating transcription is atypical for eukaryotes
    Leishmania major is considered an "Old World Leishmania" species, meaning it contains 36 chromosome pairs. There are approximately 30 Leishmania species whose gene order is highly conserved.
    Ways in which it differs:
    1. Organization of protein coding genes: long, strand-specific polycistronic clusters
    2. No transcription factors
  • This article determined the genome sequence of Leishmania major on a chromosome-by-chromosome basis. It presents the structure and content in terms of molecular processes such as:
    • chromatin remodeling
    • transcription
    • RNA processing
    • translation
    • posttranslational modification
    • protein turnover
    It also discusses developmental processes essential at the host-parasite interface

Genome Structure and Content

  • 32,816,678 base pairs obtained by shotgun sequencing of large-insert clones and purified chromosomal DNA
  • Genome is partially aneuploid
  • L. major sequence analysis yielded 911 RNA genes, 39 pseudogenes, 8272 protein coding genes
  • L. major telomeres are distinct from those of the other Tritryps and have a heterogeneous structure
  • The ends of Leishmania major chromosomes have a tripartite "repeat-repeat" structure
  • "Leishmania-restricted" genes: responsible for metabolic differences from T. brucei and T. cruzi; found randomly distributed in the genome
  • Two genes of interest: LmjF33.1740 and LmjF33.1750
    • Because resulting proteins contain macrophage migration inhibition factor (MIF)
    • Homologues found in other Leishmania species
    • L. major MIFs are thought to retain tautomerase activity, but do not have oxidoreductase activity.
      This is interesting because it shows similarities to eukaryotic proteins but also ties the genes to bacteria
    • Suggests that L. major MIFs could use eukaryotic similarities to modulate host macrophage response and help them survive in the host
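
The gene counts and genome size listed above can be combined into a quick sanity check. This is only a sketch using the numbers in this outline; the variable names are illustrative, and the "average spacing" is a rough figure (genome length divided by gene count), not a value reported in the paper.

```python
# Numbers taken from the outline above.
GENOME_BP = 32_816_678
RNA_GENES = 911
PSEUDOGENES = 39
PROTEIN_CODING = 8272

# Total annotated features in the three categories.
total_genes = RNA_GENES + PSEUDOGENES + PROTEIN_CODING

# Rough average spacing: base pairs of genome per protein-coding gene.
bp_per_protein_gene = GENOME_BP / PROTEIN_CODING

print(f"Total annotated genes: {total_genes}")
print(f"About {bp_per_protein_gene:.0f} bp of genome per protein-coding gene")
```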

RNA Genes

  • RNAs participate in many cellular processes:
    RNA replication, splicing, RNA processing and modification, translation, translation regulation, protein translocation across membranes
  • Differences in organization of RNA genes in genomes of L. major and the other trypanosomes.
    All 3 tritryp genomes have different numbers of genes and location differs as well.

Chromatin Remodeling

  • Trypanosomatids have multiple copies of 4 core histone genes
    Histones package chromosomal DNA into nucleosomes in eukaryotes and also regulate access to it by the RNA polymerase transcription complexes.
  • Most genes are clustered in discrete single tandem arrays. L. major is different in this sense because these gene types occur in 2 or more separate loci, which is not the case for the other tritryps.
  • Some variants in histone complexes in L. major may play roles in:
    gene silencing, gene expression, DNA repair, and centromere function
  • Tritryp parasites have the typical chromatin remodeling activities of eukaryotes, but also have some significant differences.

Transcription

  • Little is known about the mechanisms of transcription initiation and few promoters have been analyzed in trypanosomatids
  • The chromosome is characterized by the unique arrangement of directional gene clusters:
    • Polycistronic transcription by RNA polymerase II initiates bidirectionally within divergent strand-switch regions
    • Terminates within convergent strand-switch regions
  • Tritryps have conserved protein subunits. The difference between the species is that in L. major many of the homologues for RNA polymerase specific subunits are absent.
  • Few potential homologues of RNA polymerase II basal transcription factors were found in L. major that were present in other eukaryotes.
  • Findings suggest that tritryp gene expression is determined primarily by posttranscriptional control mechanisms.

RNA Processing

  • Tritryp RNA processing is distinctive because the site of polyadenylation is determined by trans-splicing of downstream mRNA
  • Identified many putative tritryp splicing regulatory proteins and proteins implicated in alternative splicing. These suggest that regulation of splicing may have arisen early in eukaryotic evolution
  • There is an absence of an RNA polymerase II C-terminal domain which may have a distinct functional role in transcription
  • Degradation of mRNAs in regulating gene expression is similar to the process in mammals (the exosome plays a dominant role)
  • The number of RNA recognition motifs (RRMs) is similar in Tritryps and yeast proteins

Translation and co-/posttranslational modification

  • Major components of the translational machinery found in L. major are also found in other lower eukaryotes
  • There is a higher number of potential translation factors in Tritryps which suggests that there is a high degree of specialization
  • Most protein modification within tritryps involves usual eukaryotic processes. But there are some essential modifications in L. major:
    glycosylphosphatidylinositol anchor addition, acylation, and prenylation
    all facilitate membrane attachment and/or protein-protein interactions
  • Enzymes that catalyze these modifications may be promising drug targets

Surface Molecules

  • Surface molecules of Leishmania are important because of their role in the infectious cycle in the host.
    Many of the anchored proteins contain similar posttranslational modifications but vary in other ways both within the Leishmania species and between Tritryps
  • Many of the functions of the identified genes have not been determined
  • Genes that result in nucleotide sugar transporters and their roles have been found to be unique in L. major
  • Sphingolipids: essential membrane components in eukaryotic cells, contribute to intracellular function
    • Primary sphingolipid in Tritryps is IPC --> could be a drug target because of its role in intracellular function

Proteolysis

  • Some peptidase protein-coding genes have been found to be virulence factors in Tritryps
    Potential vaccine and drug targets
  • No representatives of mammalian peptidase inhibitors were found
    But they have ICPs (inhibitors of cysteine peptidases) that mammals lack, suggesting these play an important role in host-parasite interaction
  • Tritryps also contain inhibitors of serine peptidases (ISPs) that are normally only found in bacteria
    ISPs also likely play an important role in host-parasite interactions

Concluding Remarks

  • Comparing genomic sequences of tritryps helps gain insight into possible locations for drug targeting
  • Its similarities and divergences in genome organization and replication relative to both bacterial and eukaryotic cells also provide information regarding eukaryotic evolution
  • The availability of the entire L. major genome and the subsequent analysis of the protein-coding genes is important in further researching their role in virulence
  • This brings up possibilities for drug intervention and a better understanding of the mechanisms of the parasites' entrance into the host macrophage and its disease pathology

Model Organism Database

http://tritrypdb.org/tritrypdb/

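  1. What types of data can be found in the database (sequence, structures, annotations, etc.); is it a primary or “meta” database? The data found mostly consist of sequences, with some protein prediction as well. It is a “meta” database. Is it curated electronically, manually [in-house], or manually [community]?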
  2. What individual or organization maintains the database?
  3. What is their funding source(s)?
  4. Is there a license agreement or any restrictions on access to the database?
  5. How often is the database updated?
  6. Are there links to other databases?
  7. Can the information be downloaded?
    • In what file formats?
  8. Evaluate the “user-friendliness” of the database.
    • Is the Web site well-organized?
    • Does it have a help section or tutorial?
    • Run a sample query. Do the results make sense?
  9. What is the format (regular expression) of the main type of gene ID for this species? (for example, for Vibrio cholerae it was VC#### or VC_####).
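
As a starting point for question 9, the Vibrio cholerae example can be written as an actual regular expression. The sketch below does that in Python. The second pattern is only an assumption inferred from the two IDs mentioned in the outline (LmjF33.1740 and LmjF33.1750); it has not been checked against TriTrypDB, and the names `VC_ID`, `LMJF_ID`, and `is_gene_id` are made up for illustration.

```python
import re

# "VC#### or VC_####": the letters VC, an optional underscore, four digits.
VC_ID = re.compile(r"VC_?\d{4}")

# ASSUMPTION: guessed from LmjF33.1740 and LmjF33.1750 only.
# "LmjF", a two-digit chromosome number, a dot, then four digits.
# The real database format may allow other forms.
LMJF_ID = re.compile(r"LmjF\d{2}\.\d{4}")


def is_gene_id(pattern, text):
    """Return True only if the whole string matches the pattern."""
    return pattern.fullmatch(text) is not None


for candidate in ["VC0001", "VC_1234", "VC12", "LmjF33.1740"]:
    print(candidate, is_gene_id(VC_ID, candidate), is_gene_id(LMJF_ID, candidate))
```

Using `fullmatch` rather than `search` keeps longer strings that merely contain an ID from being accepted.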