Difference between revisions of "Week 5"

From LMU BioDB 2013
Jump to: navigation, search
(NAR Exercise and Presentation: Clear out buddy assignments.)
(Add under construction marker.)
Line 1: Line 1:
 +
{{Under Construction}}
 +
 
'''This journal entry is due on Friday, September 27, at midnight PDT.''' ''(Thursday night/Friday morning)''
 
'''This journal entry is due on Friday, September 27, at midnight PDT.''' ''(Thursday night/Friday morning)''
  

Revision as of 18:04, 16 July 2013

Under Construction

The content in this page has not been finalized and is still subject to change. Use the current information at your own risk.

This journal entry is due on Friday, September 27, at midnight PDT. (Thursday night/Friday morning)

Individual Journal Assignment

  • Store this journal entry as "username Week 5" (i.e., this is the text to place between the square brackets when you link to this page).
  • Link from your user page to this Assignment page.
  • Link to your journal entry from your user page.
  • Link back from your journal entry to your user page.
  • Don't forget to add the "Journal Entry" category to the end of your wiki page.
    • Note: you can easily fulfill all of these links by adding them to your template and then using your template on your journal entry.

SWISS-PROT/UniProt Exercise

For this exercise, you will read and follow the links in Chapter 4: Using Protein and Specialized Sequence Databases of the book Bioinformatics for Dummies.

  • Since the publication of this book in 2003, the SWISS-PROT database has become the UniProt Knowledgebase. The underlying data are the same, but the scope and user interface for the database have been updated. Thus, some of the exact instructions of the chapter have to be changed to reflect the change to UniProt. These changes are noted below by page number.
  • Page 123:
    1. The URL for the SWISS-PROT/UniProt server is http://www.expasy.org/sprot/.
    2. The Quick Search field is now found at the upper right of the page.
    3. Choose "UniProtKB" from the drop-down menu (it is the default), and click the "GO" button.
      • Alternately, you can go directly to http://www.uniprot.org, the search field is in the top middle of the page.
    • The information described in subsequent pages can all be found, but will be in a different order on the page. There is a set of navigation links near the top of the page to help you jump to each section.
  • General information about the entry (bottom of page 123):
    • This information is found under the header "Entry information" and is near the bottom of the web page, instead of the top.
  • Name and origin of the protein (page 124) is near the top of the page.
  • The References (page 126) are near the middle of the page.
  • The Comments (page 126) is now known as "General annotation (comments)".
  • The Cross-Refernces (page 128) are even more extensive and are organized by sub-categories of databases.
    • In particular, click on a sample cross-reference link for each of the following databases, and for each, state what type of information is found there:
      • EMBL
      • InterPro
      • PDB
      • Pfam
      • RefSeq
      • GeneID
  • The Keywords (page 130) are now found listed under "Ontologies".
  • The Features (page 131) are now listed as "Sequence annotation (Features)".
  • In the section "Finding Out More about Your Protein" (page 135-139), some of the databases are defunct, highlighting how biological databases are a moving target (this book was first published in 2003).
  • A new feature of the UniProt interface is that you can view the data in several different formats. Click on the buttons on the top-right of the page to view the data as:
    • TXT: flat file text data, the original format of the SWISS-PROT data (even before it was put in a relational database)
    • XML: text data structured with tags (like you praacticed with for last week's assignment)
    • RDF/XML: a semantic web format
    • GFF: a specialized format for genomic information
    • FASTA: a basic text format for sequence information
  • Write a one-paragraph summary of what you have learned about the human EGFR protein from this exercise.
  • Reflect and answer the following questions:
    1. What was the purpose of this exercise?
    2. What did I learn from this exercise?
    3. What did I not understand (yet) about this exercise?

Additional UniProt Resources

NAR Exercise and Presentation

Each year, the journal Nucleic Acids Research (NAR) devotes the first issue in January to biological databases. The goal of this assignment is to dive into the deep end of the pool and experience the breadth and depth of biological databases available on the Web:

For this exercise, you will work with an assigned buddy. Choose a database from this issue and answer the following questions about that database. Each pair should choose a different database to profile. So, to claim your first choice, go to the Class Journal Week 5 page and stake your claim to a database. When you are choosing your database, look at the other students' entries to make sure you are not doing the same one. The buddy assignments are:

  • To be determined

On your wiki...

For your assignment, create a new wiki page to profile your database and link to it from the Class Journal Week 5 page. These pages will be a resource for the class as we move forward with this unit of the course.

Read the article about the database from the Nucleic Acids Research journal and then go online to the database itself. When you answer the questions below, provide a hyperlink to the page that you got the information from.

  1. What database did you access? (link to the home page of the database)
  2. What is the purpose of the database?
  3. What biological information does it contain?
  4. What species are covered in the database?
  5. What biological questions can it be used to answer?
  6. What type (or types) of database is it (sequence, structure model organism, or specialty [what?]; primary or “meta”; curated electronically, manually [in-house], manually [community])?
  7. What individual or organization maintains the database?
  8. What is their funding source(s)?
  9. Is there a license agreement or any restrictions on access to the database?
  10. How often is the database updated?
  11. Are there links to other databases?
  12. Can the information be downloaded?
    • In what file formats?
  13. Evaluate the “user-friendliness” of the database.
    • Is the Web site well-organized?
    • Does it have a help section or tutorial?
    • Run a sample query. Do the results make sense?

Some Definitions

  • Electronic curation occurs when someone writes a program to add information to a database record from another database.
  • Manual curation occurs when a human reviews the information being added to a record to validate it as true.
    • In-house is when the human works for the database organization.
    • Community is when the database allows members of the scientific community that don't work for the database organization to add information to the record.

For your PowerPoint presentation...

Each group will prepare and give a 10 minute PowerPoint presentation based on their chosen database.

  • Please follow the Presentation Guidelines for how to format your slides.
  • You will need to prepare ~10 slides (assume 1 slide per minute of presentation).
  • You need to present the information you gathered about your database that you listed in your wiki above, but organized as a presentation.
  • You may give a live demo of the database if you wish, but practice carefully so that you can do the presentation in 10 minutes.
    • Alternately, you may choose to show screen shots instead of the live demo.
  • Your PowerPoint slides must be uploaded to the wiki page you created for your database, by the normal wiki deadline, midnight Sunday/Monday.
    • You can update your slides before your presentation, but we will be grading the ones you upload by the deadline.
  • Your presentation (both the slides and the oral presentation) will be evalutated by the instructors using the guidelines shown here.
  • Your presentation will also be evaluated by your fellow classmates (anonymously) who will answer the following questions:
    1. What is the speaker's take-home message (one short sentence)?
    2. What are the best points about the presentation's content, organization, clarity of visuals, and presentation style? Please give at least 2 specific examples.
    3. What points need improvement? How would you improve them? Please give at least 2 specific examples.

Shared Journal Assignment

  • Store your journal entry in the shared Class Journal Week 5 page. If this page does not exist yet, go ahead and create it (congratulations on getting in first :) )
  • Link to your journal entry from your user page.
  • Link back from the journal entry to your user page.
    • NOTE: you can easily fulfill the links part of these instructions by adding them to your template and using the template on your user page.
  • Sign your portion of the journal with the standard wiki signature shortcut (~~~~).
  • Add the "Journal Entry" and "Shared" categories to the end of the wiki page (if someone has not already done so).

Reflect

After completing the both exercises, answer the following questions on the shared Class Journal Week 5 page:

  1. What was the most beneficial aspect of working with a buddy on this assignment (other than what you answered last week)?
  2. What was the most challenging aspect of working with a buddy on this assignment (other than what you answered last week)?
  3. What was most interesting to you in this week's exercise (SWISS-PROT/UniProt or NAR)? Why?
  4. What was least interesting? Why?
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox