Week 5

From LMU BioDB 2015
Revision as of 21:21, 24 September 2015 by Kdahlquist (Talk | contribs) (fixed link to class journal)

Jump to: navigation, search

This week's assignment is divided into three parts:

  1. Review of an article for the 2016 Nucleic Acids Research annual database issue: due on Tuesday, October 6, at midnight PDT. (Monday night/Tuesday morning), 22 points.
  2. Group oral presentation about the database you reviewed: due on Thursday, October 8, at midnight PDT.' (Wednesday night/Thursday morning), 25 points.
  3. Your shared journal entry on the Class Journal Week 5 page: due on Thursday, October 8, at midnight PDT.' (Wednesday night/Thursday morning), 3 points.

This page is under construction.

Overview

The purpose of this assignment is:

Individual Journal Assignment

  • Store this journal entry as "username Week 5" (i.e., this is the text to place between the square brackets when you link to this page).
  • Link from your user page to this Assignment page.
  • Link to your journal entry from your user page.
  • Link back from your journal entry to your user page.
  • Don't forget to add the "Journal Entry" category to the end of your wiki page.
    • Note: you can easily fulfill all of these links by adding them to your template and then using your template on your journal entry.
  • For your assignment this week, you will keep an electronic laboratory notebook on your individual journal entry page for this week. An electronic laboratory notebook records all the manipulations you perform on the data and the answers to the questions throughout the protocol. Like a paper lab notebook found in a wet lab, it should contain enough information so that you or someone else could reproduce what you did using only the information from the notebook.

Review of a Database Article from NAR and Presentation

Each year, the journal Nucleic Acids Research (NAR) devotes the first issue in January to biological databases. The goal of this assignment is to dive into the deep end of the pool and experience the breadth and depth of biological databases available on the Web:

For this exercise, you will work with an assigned group of four students. The groups are:

Review of a Database Article from NAR

For your assignment, create a new wiki page to profile your database. There will be one page per group; both partners will contribute to the same page.

  • Link to your database page from the Class Journal Week 5 page. These pages will be a resource for the class as we move forward with this unit of the course.
  • Link to your database page from your user page.
  • Link from your database page to the Class Journal Week 5 page.
  • Link from your database page to your user pages.

Read the article about the database from the Nucleic Acids Research journal and then go online to the database itself. When you answer the questions below, provide a hyperlink to the page that you got the information from.

  1. What database did you access? (link to the home page of the database)
  2. What is the purpose of the database?
  3. What biological information does it contain?
  4. What species are covered in the database?
  5. What biological questions can it be used to answer?
  6. What type (or types) of database is it (sequence, structure model organism, or specialty [what?]; primary or “meta”; curated electronically, manually [in-house], manually [community])?
  7. What individual or organization maintains the database?
  8. What is their funding source(s)?
  9. Is there a license agreement or any restrictions on access to the database?
  10. How often is the database updated? When was the last update?
  11. Are there links to other databases?
  12. Can the information be downloaded?
    • In what file formats?
  13. Evaluate the “user-friendliness” of the database.
    • Is the Web site well-organized?
    • Does it have a help section or tutorial?
    • Run a sample query. Do the results make sense?

Some Definitions

  • Electronic curation occurs when someone writes a program to add information to a database record from another database.
  • Manual curation occurs when a human reviews the information being added to a record to validate it as true.
    • In-house is when the human works for the database organization.
    • Community is when the database allows members of the scientific community that don't work for the database organization to add information to the record.

PowerPoint Presentation

Each group will prepare and give a 10-15 minute PowerPoint presentation based on their assigned database.

  • All four groups will present in class on Thursday, October 8.
  • Please follow the Presentation Guidelines for how to format your slides.
  • You will need to prepare ~10-15 slides (assume 1 slide per minute of presentation).
  • You need to present the information you gathered about your database that you listed in your wiki above, but organized as a presentation.
  • You may give a live demo of the database if you wish, but practice carefully so that you can do the presentation in 15 minutes.
    • Alternately, you may choose to show screen shots instead of the live demo.
  • Your PowerPoint slides must be uploaded to the wiki page you created for your database, by midnight Monday/Tuesday, even if your group is scheduled to present on Thursday.
    • You can update your slides before your presentation, but we will be grading the ones you upload by the deadline.
  • Your presentation (both the slides and the oral presentation) will be evalutated by the instructors using the guidelines shown here.
  • Your presentation will also be evaluated by your fellow classmates (anonymously) who will answer the following questions:
    1. What is the speaker's take-home message (one short sentence)?
    2. What are the best points about the presentation's content, organization, clarity of visuals, and presentation style? Please give at least 2 specific examples.
    3. What points need improvement? How would you improve them? Please give at least 2 specific examples.

Shared Journal Assignment

  • Store your journal entry in the shared Class Journal Week 5 page. If this page does not exist yet, go ahead and create it (congratulations on getting in first :) )
  • Link to your journal entry from your user page.
  • Link back from the journal entry to your user page.
    • NOTE: you can easily fulfill the links part of these instructions by adding them to your template and using the template on your user page.
  • Sign your portion of the journal with the standard wiki signature shortcut (~~~~).
  • Add the "Journal Entry" and "Shared" categories to the end of the wiki page (if someone has not already done so).

Reflect

The following is a list of core competencies for scientific data literacy. After completing the all of the exercises in this assignment, answer the following questions on the shared Class Journal Week 5 page:

  1. Which of these core competencies (if any) were you familiar with before taking this class? How did you become familiar with them?
  2. Which of these core competencies (if any) did you gain a deeper understanding of by doing this exercise? What about the exercise taught you about them?
  3. Which of these core competencies (if any) do you want to know more about? Why?

Scientific Data Literacy Core Competencies

  1. Databases and Data Formats
    • Understand how to query relational databases, and be familiar with data types and formats for the discipline.
  2. Discovery and Acquisition of Data
    • Locate and utilize disciplinary data repositories, and identify appropriate data sources
  3. Data Management and Organization
    • Understand the lifecycle of data, and use data management plans to track subsets of processed data.
  4. Data Conversion and Interoperability
    • Migrate data from one format to another, and understand the benefits of standard data formats.
  5. Quality Assurance
    • Use metadata and screening procedures to recognize artifacts, incompletion, or corruption of data sets.
  6. Metadata
    • Interpret metadata from external sources, and annotate data so it can be used by external users.
  7. Data Curation and Re-use
    • Recognize the role of curation throughout the data lifecycle in its value in effective reuse of data.
  8. Cultures of Practice
    • Know the practices, values, and norms of discipline as they relate to managing, sharing, and curating data.
  9. Data Preservation
    • Understand the technology, resource, and organizational components of preserving data.
  10. Data Analysis
    • Understand the basic analysis tools of their discipline including workflow management tools.
  11. Data Visualization
    • Use visualization tools of discipline, and understand the advantages of the different types of visualization.
  12. Ethics, including citation of data
    • Understand intellectual property, privacy, and the ethos of the discipline around sharing and citing data.