Eyanosch Week 4

From LMU BioDB 2015
Revision as of 22:55, 22 September 2015 by Eyanosch (Talk | contribs) (Notes taken during Dr. Dhalquist's class lecture)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
Human genes

about 50% repeated bp's

about 5% code for proteins

45% are regulatory (differentiation)

3 million bp's = human genome size

Roughly ~ 20,296 genes (protein coding genes)

humans have about 100,000 different proteins in our cells


Open reading frame and doing translation = area coded for a certain gene

Bioinformatics

Database analytical tools - available access to lots of data

  • information
    • representation, organization, manipulation, distribution, maintenance

transcends all science, interdisciplinary

check out Data life cycle (plan, collect, assure, describe, preserver, discover, integrate, analyze)

  • Key Concepts
    • ID's = identifiers
    • Record = entry in a database
    • searching a database is executing a query
    • Different databases use different file formats


Pertinent types for this class -Sequence -3d structure -Model organism databases -etc.

Major databases to note

  1. NCBI: National Center for Biotechnology Information
    • GenBank
    • Gene
    • Pubmed
  2. uniprot (formerly SWISS-PROT