Difference between revisions of "Data Analysis"
Jump to navigation
Jump to search
Kdahlquist (talk | contribs) (→Milestone 3: Complete Microarray Data Analysis: start filling out steps) |
Kdahlquist (talk | contribs) (→Milestone 3: Complete Microarray Data Analysis: note about column header) |
||
Line 32: | Line 32: | ||
# Organize the data in a worksheet in an Excel workbook so that: | # Organize the data in a worksheet in an Excel workbook so that: | ||
#* ID is in the first column | #* ID is in the first column | ||
− | #* Data columns are to the right, in increasing chronological order | + | #* Data columns are to the right, in increasing chronological order, using the column header pattern you created |
#* Replicates are grouped together | #* Replicates are grouped together | ||
# Perform an ANOVA analysis of the data. | # Perform an ANOVA analysis of the data. |
Revision as of 14:30, 18 November 2019
Final Project Links | |||||||
---|---|---|---|---|---|---|---|
Overview | Deliverables | Guilds | Project Manager | Quality Assurance | Data Analysis | Coder/Designer | |
Teams | FunGals | Sulfiknights | Skinny Genes |
The role of the Data Analyst will be to apply the data analysis pipeline that you learned by analyzing the Dahlquist Lab microarray dataset to complete the analysis of a different published yeast timecourse microarray dataset. The Data Analysts are the end-users of the project, ultimately determining whether the work of the coder/designer and quality assurance members is useful to them.
Contents
Guild Members
- Ivy, Marcus
- Emma, Kaitlyn
- Aby, David
Milestones
The milestones do not necessarily correspond to particular weeks; instead they are sets of tasks grouped together.
Milestone 1: Annotated Bibliography
- The Data Analysts will work with their teams to develop an annotated bibliography of papers relating to their team's assigned paper.
Milestone 2: Journal Club Presentation
- The Data Analysts will work with their teams to create and deliver a Journal Club presentation about to their team's assigned paper.
Milestone 3: Complete Microarray Data Analysis
- Download and examine the microarray dataset, comparing it to the samples and experiment described in your journal club article.
- Make a "sample-data relationship table" that lists all of the samples (microarray chips), noting the treatment, time point, and replicate number.
- Come up with consistent column headers that summarize this information
- For example, the Dahlquist Lab microarray data used strain_LogFC_timepoint-replicate number, as in wt_LogFC_t15-1.
- Come up with consistent column headers that summarize this information
- Organize the data in a worksheet in an Excel workbook so that:
- ID is in the first column
- Data columns are to the right, in increasing chronological order, using the column header pattern you created
- Replicates are grouped together
- Perform an ANOVA analysis of the data.
- Cluster the data with stem.
- Use YEASTRACT to generate a candidate gene regulatory network.
- Create an input workbook for GRNmap based on a Microsoft Access database that the Coder/Designer and QA's make.
- Run GRNmap and interpret data.
- As the end-user of the Access database, the Data Analysts will provide feedback to the QAs and Coder/Designer about the usability of database.
Final Project Links | |||||||
---|---|---|---|---|---|---|---|
Overview | Deliverables | Guilds | Project Manager | Quality Assurance | Data Analysis | Coder/Designer | |
Teams | FunGals | Sulfiknights | Skinny Genes |