Difference between revisions of "Dwilliams Week 8 assignment"
(→Perform statistical analysis on the ratios) |
(→Perform statistical analysis on the ratios) |
||
Line 17: | Line 17: | ||
#Inserted a new worksheet and named it "statistics". | #Inserted a new worksheet and named it "statistics". | ||
#*Went back to the "scaling_centering" worksheet and copy the first column ("ID"). | #*Went back to the "scaling_centering" worksheet and copy the first column ("ID"). | ||
− | Paste the data into the first column of your new "statistics" worksheet. | + | #*Paste the data into the first column of your new "statistics" worksheet. |
#*Went back to the "scaling_centering" worksheet and copy Column C ("A1_scaled_centered). | #*Went back to the "scaling_centering" worksheet and copy Column C ("A1_scaled_centered). | ||
#*Went to new worksheet and clicked on the B1 cell. Select "Paste Special" from the Edit menu. A window will open: click on the radio button for "Values" and click OK. This will paste the numerical result into your new worksheet instead of the equation which must make calculations on the fly. | #*Went to new worksheet and clicked on the B1 cell. Select "Paste Special" from the Edit menu. A window will open: click on the radio button for "Values" and click OK. This will paste the numerical result into your new worksheet instead of the equation which must make calculations on the fly. |
Revision as of 17:20, 17 October 2013
Digital Lab Notebook
Contents |
' ' 'Part 1' ' '
Normalize the log ratios for the set of slides in the experiment
- Downloaded the Stanford Microarray Database.
- Created new excel sheet and performed scale_centered.
- Created a row for Average and row for Standard Deviation.
- Formula for calculating the average: =AVERAGE(B4:B5224)
- Formula for calculating the STd.Dev: =STDEV(B4:B5224)
- Applied both functions across all columns.
- After computing average and standard deviation of the log ratios for each chip, performed actual do scaling and centering based on these values.
- Created new columns for A1_scaled_centered, A2_scaled_centered, etc.
- Formula for scaling:=(B4-B$2)/B$3
- Applied the scaling and centering equation for each of the columns of data.
Perform statistical analysis on the ratios
- Inserted a new worksheet and named it "statistics".
- Went back to the "scaling_centering" worksheet and copy the first column ("ID").
- Paste the data into the first column of your new "statistics" worksheet.
- Went back to the "scaling_centering" worksheet and copy Column C ("A1_scaled_centered).
- Went to new worksheet and clicked on the B1 cell. Select "Paste Special" from the Edit menu. A window will open: click on the radio button for "Values" and click OK. This will paste the numerical result into your new worksheet instead of the equation which must make calculations on the fly.
- Went to a new column on the right of your worksheet. Type the header "Avg_LogFC_A", "Avg_LogFC_B", and "Avg_LogFC_C" into the top cell of the next three columns.
Compute the average log fold change for the replicates for each patient by typing the equation:=AVERAGE(B2:E2).
- Computed the average of the averages.
- Type the header "Avg_LogFC_all" into the first cell in the next empty column.
- Insert a new column next to the "Avg_LogFC_all" column computed in the previous step. Label the column "Tstat".
- Fromula for Tstat test: =AVERAGE(N2:P2)/(STDEV(N2:P2)/SQRT(number of replicates))
- Create Pvalue column.
- Formula for Pvalue: =TDIST(ABS(R2),degrees of freedom,2).
Using Data to Create GenMapp Files
Insert a new worksheet and name it "forGenMAPP". Go back to the "statistics" worksheet and Select All and Copy. Go to your new sheet and click on cell A1 and select Paste Special, click on the Values radio button, and click OK. We will now format this worksheet for import into GenMAPP. Select Columns B through Q (all the fold changes). Select the menu item Format > Cells. Under the number tab, select 2 decimal places. Click OK. Select Columns R and S. Select the menu item Format > Cells. Under the number tab, select 4 decimal places. Click OK. Select Columns N through S and Cut. Select Column B by left-clicking on the "B" at the top of the column. Then right-click on the Column B header and select "Insert Cut Cells". This will insert the data without writing over your existing columns. Delete Rows 2 and 3 where it says "Average" and "StDev" so that your data rows with gene IDs are immediately below the header row 1. Insert a column to the right of the "ID" column. Type the header "SystemCode" into the top cell of this column. Fill the entire column (each cell) with the letter "N". Select the menu item File > Save As, and choose "Text (Tab-delimited) (*.txt)" from the file type drop-down menu. Excel will make you click through a couple of warnings because it doesn't like you going all independent and choosing a different file type than the native .xls. This is OK. Your new #*.txt file is now ready for import into GenMAPP. But before we do that, we want to know a few things about our data as shown in the next section. Upload both the .xls and .txt files that you have just created to your journal page in the class wiki. Make sure that your file name is distinct from your other classmates so that nobody overwrites anyone else's file.
- 772 errors found.
- 27 results