Sulfiknights DA Week 12/13

From LMU BioDB 2019
Revision as of 18:02, 25 November 2019 by Imacarae (talk | contribs) (References: added reference)
Jump to navigation Jump to search
Sulfiknight Links
BIOL Databases Main Page Sulfiknights: Project Overview Page Final Project Deliverables Requirements Sulfiknights: Final Project Deliverables Members Project Manager & Quality Assurance: Naomi Tesfaiohannes Quality Assurance: Joey Nimmers-Minor Data Analysis: Ivy-Quynh Macaraeg & Marcus Avila Designer: DeLisa Madere
Assignment Pages Week 11 Week 12/13 Week 15

Template:Sulfiknights

Purpose

Methods & Results

Sample to data relationship table:

  • GSE6068_setA_family - swt vs nswt 1mM @ 1 hr (6 replicates); 15 min, 30 min, 18 hrs (3 replicates each)
  • GSE6129_set0_family - swt vs nswt 1mM @ 1hr; 6 replicates
  • GSE6129_set1_family - swt vs nswt .2mM @ 1 hr, 15 min, 30 min, 18 hr; 3 replicates each
  • GSE6129_set2_family - sYAP4 vs swt .2mM AND 1 mM each @ 1 hr, 3 replicates each
  • GSE6129_set3_family - sMET4 vs swt .2mM AND 1 mM each @ 1 hr, 3 replicates each

Organizing the Data

  1. Downloaded data from Thorsen et al.
  2. Changed "Name" to "Standard_ID" in column C.
  3. Changed column headers in each sheet:
    • swt = stressed wild type, nswt= nonstressed wild type
    • 1mM, 0.2mM = concentration of As(III) at which the cell was exposed
    • 1h, 15m, 30m, 18h = time point (h = hours, m = minutes)
    • rn = replicate number (n)
  4. Inserted MasterIndex in "GSE6068_setA_family"sheet.

Conducting the ANOVA

  • All data analysis was conducted on data on the "GSE6068_setA_family" sheet.
  • ANOVA procedure was based on the methods from Week 8.
  1. A new sheet called "swtVnwt_1mM_ANOVA" was created and all data from Columns A-R was copied and pasted into this sheet.
  2. Columns "swtVnwt_AvgLogFC_1mM_15m" - "swtVnwt_AvgLogFC_1mM_1080m" for each time point 15m, 30m, 60m, and 1080m were created and contain the equation the average of each replicate for each time point.
  3. Column "swtVnwt_ss_HO" was created containing the equation SUMSQ(D2:R2).
  4. Sanity Check: 1068/4785 p-values are less than 0.05

Data & Files

Thorsen Data

Conclusion

Acknowledgments

References