Difference between revisions of "Ajvree Week 13"
From LMU BioDB 2013
(→Week 13) |
(→11/21/13: additional process info) |
||
Line 72: | Line 72: | ||
*downloaded Taurus's version of gmbuilder to redo tally engine counting | *downloaded Taurus's version of gmbuilder to redo tally engine counting | ||
*used export 3 files instead of previous export 1 files | *used export 3 files instead of previous export 1 files | ||
+ | *connected to avreelan database in pgadminIII, inserted new gmbuilder files | ||
+ | *opened new version of gmbuilder/tally engine | ||
*XML file import took: 2.02 min | *XML file import took: 2.02 min | ||
*OBO-XML file import took: | *OBO-XML file import took: | ||
*GOA file import took: | *GOA file import took: | ||
*Results: | *Results: | ||
− | |||
==Links== | ==Links== | ||
{{Team ATK}} | {{Team ATK}} |
Revision as of 18:01, 21 November 2013
Contents |
Week 12 Information
Export Counting: -open Access, open file (remember to change all to all files) TIGR4 file:
ids used: SP_####
orderedlocusnames count total: 2126 entries
R6 file:
orderedlocusnames count total: 2115 entries
ids used: SPG_####
G54 file:
ids used: SPG_####
orderedlocusnames count total: 2115 entries
After finding identical results for R6 and G54 files, realized that R6 file was actually for the G54 strain. Checked a few times (reopened files multiple times to confirm)
First try on Tally Engine for TIGR4:
XML Count:
orderedlocus: 2127
refseq: 2106
Database Count:
ordered locus: 3831
refseq: 3403
Week 13
Tally Engine:
- created new database in pgadmin III
- in sql, opened gmbuilder.sql
- ran query, database tables were inserted
- went in to tally engine and imported files
- Xml import took 5.41 min
- GOA import took 0.07 min
- unzipped go-xml file
- OBO-XML import time: 19.92 min
- additional gene ontology information was processed, this took 14.96 min
- ran tally, came up with error
- refreshed gmbuilder and tried again successfully
XMLpipedb Match
- downloaded program from sourceforge
- opened cmd program
- cd Downloads file
- moved xmlmatch jar file to download folder
- used match to look for pattern SP_[0-9][0-9][0-9][0-9]
- Total unique matches: 2126
- almost identical to tally engine results of 2127, minus one result
OriginalRowCounts
- Looked at TIGR4 gdb file and benchmard VD file for table similarities/differences
- seemed to have same tables/same information
- took screenshots of both, included here:
- Note: a few of the rows are missing in the benchmark screenshot- could not fit all of them on screen.
SQL
- used following query to search for matches:
- select count(*) from genenametype where type = 'ordered locus' and value ~ 'SP_[0-9][0-9][0-9][0-9]';
- Result given was 2126
11/21/13
Tally Engine for Export 3
- downloaded Taurus's version of gmbuilder to redo tally engine counting
- used export 3 files instead of previous export 1 files
- connected to avreelan database in pgadminIII, inserted new gmbuilder files
- opened new version of gmbuilder/tally engine
- XML file import took: 2.02 min
- OBO-XML file import took:
- GOA file import took:
- Results:
Links
Alina's User Page | Kevin's User Page | Tauras's User Page |
Biological Databases Class Page | Gene Database Project | Gene Database Project Report Guidelines |
- Import Export Cycle 1: tATK Export One: TIGR4 Testing Report
- Import Export Cycle 2: tATK E2: TIGR4 Testing Report
- Import Export Cycle 3: tATK E3: TIGR4 Testing Report
- Import Export Cycle 4: tATK E4: TIGR4 Testing Report
Project Roles: | Project Manager | Coder | GenMAPP User | Quality Assurance |