Difference between revisions of "Data Analysts Week 13"
(→Charlotte and Katie's Data Analyst Journal: adding acknowledgements and references) |
(→Acknowledgements: adding page link) |
||
Line 32: | Line 32: | ||
==Acknowledgements== | ==Acknowledgements== | ||
− | This procedure was adapted from the Data Analysis page Milestone protocols, | + | This procedure was adapted from the Data Analysis page Milestone protocols, linked here: [[Data Analysis]] |
==References== | ==References== | ||
LMU BioDB 2024. (2024). Week 13. Retrieved April 17, 2024 from https://xmlpipedb.cs.lmu.edu/biodb/spring2024/index.php/Week_13 | LMU BioDB 2024. (2024). Week 13. Retrieved April 17, 2024 from https://xmlpipedb.cs.lmu.edu/biodb/spring2024/index.php/Week_13 |
Revision as of 19:03, 17 April 2024
Contents
Charlotte and Katie's Data Analyst Journal
Milestone 1
Completed as of April 11th when we gave our Journal Club Presentation with Hailey Ivanson
Milestone 2
With Quality Assurance team member Hailey Ivanson, we downloaded and examined the microarray dataset, comparing it to the samples in our journal club article. We used the processed dataset from SGD. We made a sample-data relationship table that lists all of the samples (microarray chips), noting the treatment, time point, and replicate number.
We came up with consistent column headers that summarize this information. We named each column CHP_LogFc_time-trial without using special characters. We organized the data in a worksheet in an Excel workbook so that:
-ID (GSE7645) is in the first column
-Data columns are to the right, in increasing chronological order, using the column header pattern we created
-Treatments are grouped together
-Replicates are grouped together
-We deleted the "EWEIGHT" row and "GWEIGHT" column.
-We converted the data into Log2 fold changes (LogFC).
-We undid log transformations before we calculated the ratios.
-We created new column headers and then transformed of all the data with the equation "=2^<cell designation>"
-We computed the average value of the t0 timepoint for the control and CHP-treated data.
-We calculated the fold change by dividing each value for each timepoint by the average t0 value for the respective treatment (control or CHP-treated).
-We Log2 transformed the fold changes.
Acknowledgements
This procedure was adapted from the Data Analysis page Milestone protocols, linked here: Data Analysis
References
LMU BioDB 2024. (2024). Week 13. Retrieved April 17, 2024 from https://xmlpipedb.cs.lmu.edu/biodb/spring2024/index.php/Week_13