Difference between revisions of "Quality Assurance"
From LMU BioDB 2015
(Initial transcription of this page.) |
(Provide overview and milestone names.) |
||
Line 1: | Line 1: | ||
{{Gene Database Project Links}} | {{Gene Database Project Links}} | ||
+ | |||
+ | The Quality Assurance team member is the resident expert on species ID systems and formats. He or she should be proficient with XMLPipeDB Match, SQL queries in PostgreSQL, Microsoft Excel, and Microsoft Access to navigate through the data and find missing IDs, discrepancies, sanity checks, etc. | ||
== Guild Members == | == Guild Members == | ||
Line 10: | Line 12: | ||
== Milestones == | == Milestones == | ||
− | === Milestone 1 === | + | === Milestone 1: Initial Database Export === |
# (with Coders) Get a full import-export cycle done. | # (with Coders) Get a full import-export cycle done. | ||
Line 22: | Line 24: | ||
#* The OrderedLocusNames for your species | #* The OrderedLocusNames for your species | ||
− | === Milestone 2 === | + | === Milestone 2: ID Pattern Definition and Verification === |
# Characterize regular expression patterns to detect the IDs ([[How Do I Count Thee? Let Me Count The Ways|for filtering then counting]]). | # Characterize regular expression patterns to detect the IDs ([[How Do I Count Thee? Let Me Count The Ways|for filtering then counting]]). | ||
Line 32: | Line 34: | ||
# Document/log all work done, problems encountered, and how they were resolved. | # Document/log all work done, problems encountered, and how they were resolved. | ||
− | === Milestone 3 === | + | === Milestone 3: Tally Engine Configuration === |
Along with your Coder, customize the Tally Engine setup for your species as specified in [[Coder#Customize_the_IDs_that_the_Tally_Engine_Counts|these coder steps]]. You will want to add, at the very least, the ''ordered locus'' IDs for your species. | Along with your Coder, customize the Tally Engine setup for your species as specified in [[Coder#Customize_the_IDs_that_the_Tally_Engine_Counts|these coder steps]]. You will want to add, at the very least, the ''ordered locus'' IDs for your species. | ||
− | === Milestone 4 === | + | === Milestone 4: Final Documentation === |
# Document the relational database schema for the gene database. | # Document the relational database schema for the gene database. |
Revision as of 00:39, 2 November 2015
Gene Database Project Links | |||||||
---|---|---|---|---|---|---|---|
Overview | Deliverables | Reference Format | Guilds | Project Manager | GenMAPP User | Quality Assurance | Coder |
Teams | Heavy Metal HaterZ | The Class Whoopers | GÉNialOMICS | Oregon Trail Survivors |
The Quality Assurance team member is the resident expert on species ID systems and formats. He or she should be proficient with XMLPipeDB Match, SQL queries in PostgreSQL, Microsoft Excel, and Microsoft Access to navigate through the data and find missing IDs, discrepancies, sanity checks, etc.
Contents
Guild Members
- Species 1:
- Species 2:
- Species 3:
- Species 4:
Milestones
Milestone 1: Initial Database Export
- (with Coders) Get a full import-export cycle done.
- (with Coders) Decide on a file/version management scheme/system.
- Learn the ID systems:
- Systems that are the same for each species (hint: guild members help each other out by posting the relevant information on this page)
- UniProt
- RefSeq
- GeneID (EntrezGene from NCBI)
- GO
- The OrderedLocusNames for your species
- Systems that are the same for each species (hint: guild members help each other out by posting the relevant information on this page)
Milestone 2: ID Pattern Definition and Verification
- Characterize regular expression patterns to detect the IDs (for filtering then counting).
- XMLPipeDB Match utility
- Direct SQL queries in PostgreSQL
- For example, the Vibrio IDs were of the form
VC####
orVC_####
; how would you express that in Match or as an SQL query? - Table inspection/filtering/sorting in Microsoft Access
- If needed, side-by-side sorted comparisons in Microsoft Excel (as described here)
- Document/log all work done, problems encountered, and how they were resolved.
Milestone 3: Tally Engine Configuration
Along with your Coder, customize the Tally Engine setup for your species as specified in these coder steps. You will want to add, at the very least, the ordered locus IDs for your species.
Milestone 4: Final Documentation
- Document the relational database schema for the gene database.
- Create the ReadMe with comparisons to MOD for your species.
Gene Database Project Links | |||||||
---|---|---|---|---|---|---|---|
Overview | Deliverables | Reference Format | Guilds | Project Manager | GenMAPP User | Quality Assurance | Coder |
Teams | Heavy Metal HaterZ | The Class Whoopers | GÉNialOMICS | Oregon Trail Survivors |