Difference between revisions of "Troque Week 11"
From LMU BioDB 2015
m (→Preparation for Journal Club on Your Species: Added another phrase to part 1) |
m (→Preparation for Journal Club on Your Species: Added another phrase to part 1) |
||
Line 18: | Line 18: | ||
#* '''amplicons''': a term for any small, replicating dna fragment. (Source: http://www.biology-online.org/dictionary/Amplicon) | #* '''amplicons''': a term for any small, replicating dna fragment. (Source: http://www.biology-online.org/dictionary/Amplicon) | ||
#* '''sequence finishing''': the manual enhancement of assembled shotgun sequence data to improve regions of low quality or close gaps in the sequence. Shotgun sequencing data can be whole genome derived or from clone‐based entities such as bacterial artificial chromosomes (BACs) or other plasmid based vectors such as fosmids. Sequence finishing can be achieved using a combination of laboratory and computational techniques to produce a highly accurate and complete deoxyribonucleic acid (DNA) sequence. (Source: http://www.els.net/WileyCDA/ElsArticle/refId-a0005389.html) | #* '''sequence finishing''': the manual enhancement of assembled shotgun sequence data to improve regions of low quality or close gaps in the sequence. Shotgun sequencing data can be whole genome derived or from clone‐based entities such as bacterial artificial chromosomes (BACs) or other plasmid based vectors such as fosmids. Sequence finishing can be achieved using a combination of laboratory and computational techniques to produce a highly accurate and complete deoxyribonucleic acid (DNA) sequence. (Source: http://www.els.net/WileyCDA/ElsArticle/refId-a0005389.html) | ||
+ | #* ''' primer walking''': the consecutive design of new primers to extend into unknown sequence. It takes ~4 days for each primer "step", assuming you submit the samples to the UM Sequencing Core for sequencing (~2 days) and you order primers from typical oligo vendors (~2 days). The longer you need to "walk", obviously the longer this process will take. For just a few kb, this is the best method. Longer sequences may require random-fragment approaches ("shotgun" sequencing)(Source: https://seqcore.brcf.med.umich.edu/doc/dnaseq/primerwalking.html) | ||
# Write an outline of the article. The length should be a minimum of the equivalent of 2 pages of standard 8 1/2 by 11 inch paper (you can use the "Print Preview" option in your browser to see the length). Your outline can be in any form you choose, but you should utilize the wiki syntax of headers and either numbered or bulleted lists to create it. The text of the outline does not have to be complete sentences, but it should answer the questions listed below and have enough information so that others can follow it. However, your outline should be in YOUR OWN WORDS, not copied straight from the article. | # Write an outline of the article. The length should be a minimum of the equivalent of 2 pages of standard 8 1/2 by 11 inch paper (you can use the "Print Preview" option in your browser to see the length). Your outline can be in any form you choose, but you should utilize the wiki syntax of headers and either numbered or bulleted lists to create it. The text of the outline does not have to be complete sentences, but it should answer the questions listed below and have enough information so that others can follow it. However, your outline should be in YOUR OWN WORDS, not copied straight from the article. | ||
#* What is the importance or significance of this work (i.e., your species)? | #* What is the importance or significance of this work (i.e., your species)? |
Revision as of 06:16, 13 November 2015
Contents
Preparation for Journal Club on Your Species
Your team will split into two halves for journal club presentations that will take place in class on Tuesday, November 17 and Tuesday, November 24. The Coder and Quality Assurance person will present the genome paper for your species and the GenMAPP Users will present the microarray paper for your species. You will decide within your team who will present on which day. Please edit the schedule on the Main Page to show who is presenting on which day.
In preparation for your journal club presentation, you will each individually complete the following assignment on your individual journal page.
- Make a list of at least 10 biological terms for which you did not know the definitions when you first read the article. Define each of the terms. You can use the glossary in any molecular biology, cell biology, or genetics text book as a source for definitions, or you can use one of many available online biological dictionaries. Cite your sources for the definitions by providing the proper citation (for a book) or the URL to the page with the definition for online sources. Each definition must have it's own URL citation.
- Gram-negative: Of, or relating to the group of bacteria that take the color of the counterstain (i.e. pink) under the microscope following gram staining. (Source: http://www.biology-online.org/dictionary/Gram-negative)
- orthologous: Any gene that can be found in two or more different species that can be traced back to the same common ancestor.. (Source: http://www.biology-online.org/dictionary/Orthologous_Gene)
- mucopurulent: containing both mucus and pus. (Source: http://www.biology-online.org/dictionary/Mucopurulent)
- etiologic: Causing a disease or a pathological condition. (Source: http://www.biology-online.org/dictionary/Etiological)
- enteric: Pertaining to the inside of, or by way of, or originating from, the intestines; intestinal. (Source: http://www.biology-online.org/dictionary/Enteric)
- enteroinvasive E. coli.: a type of pathogenic bacteria whose infection causes a syndrome that is identical to Shigellosis, with profuse diarrhea and high fever. EIEC are highly invasive, and they use adhesin proteins to bind to and enter intestinal cells. (Source: https://en.wikipedia.org/wiki/Enteroinvasive_Escherichia_coli)
- virulence: The degree or ability of a pathogenic organism to cause disease. (Source: http://www.biology-online.org/dictionary/Virulence)
- enterohemorrhagic: that which causes hemorrhage within the intestines. (Source: https://www.wordnik.com/words/enterohemorrhagic)
- contigs: group of clones representing overlapping regions of the genome. (Source: http://www.biology-online.org/dictionary/Contig)
- amplicons: a term for any small, replicating dna fragment. (Source: http://www.biology-online.org/dictionary/Amplicon)
- sequence finishing: the manual enhancement of assembled shotgun sequence data to improve regions of low quality or close gaps in the sequence. Shotgun sequencing data can be whole genome derived or from clone‐based entities such as bacterial artificial chromosomes (BACs) or other plasmid based vectors such as fosmids. Sequence finishing can be achieved using a combination of laboratory and computational techniques to produce a highly accurate and complete deoxyribonucleic acid (DNA) sequence. (Source: http://www.els.net/WileyCDA/ElsArticle/refId-a0005389.html)
- primer walking: the consecutive design of new primers to extend into unknown sequence. It takes ~4 days for each primer "step", assuming you submit the samples to the UM Sequencing Core for sequencing (~2 days) and you order primers from typical oligo vendors (~2 days). The longer you need to "walk", obviously the longer this process will take. For just a few kb, this is the best method. Longer sequences may require random-fragment approaches ("shotgun" sequencing)(Source: https://seqcore.brcf.med.umich.edu/doc/dnaseq/primerwalking.html)
- Write an outline of the article. The length should be a minimum of the equivalent of 2 pages of standard 8 1/2 by 11 inch paper (you can use the "Print Preview" option in your browser to see the length). Your outline can be in any form you choose, but you should utilize the wiki syntax of headers and either numbered or bulleted lists to create it. The text of the outline does not have to be complete sentences, but it should answer the questions listed below and have enough information so that others can follow it. However, your outline should be in YOUR OWN WORDS, not copied straight from the article.
- What is the importance or significance of this work (i.e., your species)?
- What were the methods used in the study?
- Shigella flexneri strain 301 (abbreviated Sf301), which we sequenced, was isolated from a patient with severe acute clinical manifestations of shigellosis in the Changping District, Beijing, in 1984, and has since been used as a reference strain for S.flexneri in China. The strain was routinely grown at 37°C overnight on tryptic soy agar containing 0.01% Congo red. Red colonies were inoculated into tryptic soy broth and grown to stationary phase at 37°C for isolating plasmid and chromosomal DNAs. The plasmid and the chromosomal libraries were separately constructed using pBluescript KS(–) (Strategene) as vectors. Approximately 48 000 clones were sequenced from both ends using the big‐dye kit (ABI) and ABI377 or ABI3700 automated sequencers, giving rise to 10 times coverage of the genome. Sequences were assembled initially using the phred/phrap program (12) when the sequence coverage was ∼4‐fold over the estimated size of the genome. The program was run with optimized parameters and the quality score was set to ≥20. Further assembly was carried out repeatedly using the same program when more sequences were obtained. When 100 500 sequences were assembled into 318 contigs, the Consed program was used for sequence finishing (13). Gaps among contigs were closed either by primer walking on selected clones, which were identified by analysis on the forward and the reversed links between contigs using the perl/Tk algorithm, or by sequencing the DNA amplicons generated by polymerase chain reaction (PCR). Glimmer 2.0, a program that searches for protein coding regions, was used to identify those ORFs possessing more than 30 consecutive codons (14). Overlapping and closely clustered ORFs were manually inspected. Predicted polypeptide sequences were used to search the non‐redundant protein database with BLASTP, and the clusters of orthologous groups of proteins (COGs) database was used to identify families to which predicted proteins are related (15). Mobile elements and repetitive sequences were identified using pair‐wise comparison. tRNA sequences were identified by the program tRNAscan‐SE (16). Repetitive regions were defined as those that have at least 200 bp with the significance of e–10 by BLASTN against the Sf301 genome itself and known IS databases. Sequence annotation and graphs of the circular and linear genomic maps were prepared using a newly developed Perl‐Script tool kit (available at ftp://ftp.chgb.org.cn/pub/). Whole genomic comparison with E.coli K12 MG1655 ( accession no. U00096) and O157 EDL933 (accession no. AE00517H) was performed using the GenomeComp program (J.Yang, J.Wang, Q.Jin, Y.Shen, Z.Yao and R.Chen, manuscript in preparation). The accession numbers for Sf301 chromosome and plasmid pCP301 are AE005674 and AF386526, respectively, in GenBank.
- Briefly state the result shown in each of the figures and tables.
- How do the results of this study compare to the results of previous studies (See Discussion).
- For the genome paper (Coder and QA only): in addition to the journal article, please find and review the Model Organism Database (MOD) for your species similarly to what you did to review your assigned database for the NAR assignment. In particular, make sure to answer the following:
- In order to find our database, we first had to search for our model organism from UniProt. I typed in the search bar at the top the phrase "shigella flexneri 2a 301" since this is the organism we are observing. Once the results showed up, I then copied one of the genes into the clipboard, googled "shigella flexneri genome database" and pasted the gene name into some of the database that were yielded in the Google search. Some of the viable databases that I found can be located here and here.
- What types of data can be found in the database (sequence, structures, annotations, etc.); is it a primary or “meta” database; is it curated electronically, manually [in-house], or manually [community])?
- What individual or organization maintains the database?
- What is their funding source(s)?
- Is there a license agreement or any restrictions on access to the database?
- How often is the database updated?
- Are there links to other databases?
- Can the information be downloaded?
- In what file formats?
- Evaluate the “user-friendliness” of the database.
- Is the Web site well-organized?
- Does it have a help section or tutorial?
- Run a sample query. Do the results make sense?
- What is the format (regular expression) of the main type of gene ID for this species (the "ordered locus name" ID)? (for example, for Vibrio cholerae it was VC#### or VC_####).
- What is the importance or significance of this work (i.e., your species)?
Assignment Links
Weekly Assignments
- Week 1
- Week 2
- Week 3
- Week 4
- Week 5
- Week 6
- Week 7
- Week 8
- Week 9
- Week 10
- Week 11
- Week 12
- No Week 13 Assignment
- Week 14
- Week 15
Individual Journal Entries
- Week 1 - This is technically the user page.
- Week 2
- Week 3
- Week 4
- Week 5
- Week 6
- Week 7
- Week 8
- Week 9
- Week 10
- Week 11
- Week 12
- No Week 13 Assignment
- Week 14
- Week 15