Difference between revisions of "Vkuehn Week 8"
From LMU BioDB 2013
(Finished!) |
|||
Line 66: | Line 66: | ||
*#Number of probes that met the [Avg_LogFC_All]>0.25 AND [PValue]<0.05 criteria | *#Number of probes that met the [Avg_LogFC_All]>0.25 AND [PValue]<0.05 criteria | ||
*#Number of probes in the dataset | *#Number of probes in the dataset | ||
− | *: The other results were all different. This makes sense because my partner used an older version of the Vibrio Gene Database. The similar terms were the ones linked to Uniprot IDs and Go terms, but the other information was more updated in my dataset. There were larger results in my procedure because there were more links to the genes made between the times the data was uploaded. | + | *: The other results were all different. This makes sense because my partner used an older version of the Vibrio Gene Database. The similar terms were the ones linked to Uniprot IDs and Go terms, but the other information was more updated in my dataset. There were larger results in my procedure because there were more links to the genes made between the times the data was uploaded. |
+ | *Interpreting the GO terms. Focusing on relatedness | ||
+ | *:The main reoccurring term I found was flagellum assembly and Pilus assembly. These were found under cellular component organization, biogenisis at cellular level, and cellular process under multiple sub branches. Although there were other differences in gene expression as well, I thought that these were particular significant.By looking into the gene exprssion differences between the pathogenic vs lab I found that many of them had increased activity in the Pathogenic strain. This led me to conclude that the structure of the flagella and pili is probably altered in the pathogenic strain in a way that makes it harmful. It is possible the the change in pili structure makes it so that it is more likely to transfer mutagenic plasmid DNA to other cells causing them to become pathogenic. The flagella structure could also have changed in this strain and played a role in its virulence. Another significant change in gene expression in the two was that the pathogenic strain had decreased mRNA degredation, which could have have alerted the number of mutations, contributing to the pathogenicity. | ||
Latest revision as of 21:49, 19 October 2013
Contents |
[edit] Electronic Lab Notebook
[edit] 10.10.2013
Data accessed from [Microarray Analysis Vibrio Cholera page]
- Downloaded the Raw Data for Vibrio colera in the excel format
- Working with the data:
- Create new worksheet and copy over all the data
- Find STDEV and AVG for each one as new rows on top
- Scale and center by copying header next to it (Value-AVG)/STDEV
- Inserted new worksheet called Statistics
- Added 3 new columns: Avg_Log_FC_A, Avg_Log_FC_B, Avg_Log_FC_C
- Compute average log full change for each patients ex:(=AVG(B2:E2))
- Computed average of averages of 3 patients in the column:Acg_LogFC_all
- T-test: =AVERAGE(N2:P2)/(STDEV(N2:P2)/(SQRT(# of replicates))
- Determine if it is significantly different than zero
- Create new column titled P-Value and Calculated the P-value lower than 0.05 (Got 948 values that were significant).
- Created new worksheet titled GenMAPP
- copy over the data from Statistics worksheet (using values only)
- Select all fold changes and format cells under number tab to 2 decimal places
- Do the same for columns R and S
- Delete AVG and STDEV rows
- Insert "SystemCode" column next to the ID column and input "N" for all rows
- Save as tab-delimited text file
- Data accessed from uploaded txt file and excel file:
[edit] 10.15.2013
Expression dataset manager converted data resulted in 'Media: Merrell_Compiled_Raw_Data_Vibrio_VK_2013.10.15.gex
[edit] GenMAPP Expression Dataset Procedure
- Generated lines that resulted in error. During the conversion the data for 2010 resulted in 121 errors
- Generated an exception file Media:Merrell_Compiled_Raw_Data_Vibrio_VK_2013.10.15.EX.txt
- Used autofilter to determine errors. The errors were all gene not found. This was because it was not updated recently so it is ok to proceed. Media:Merrell_Compiled_Raw_Data_Vibrio_VK_2013.10.15.EX.xls
- My partner had a lot more errors than I did. This is because I was using the newer version while he was using the older dataset. Mine had been updated more recently which is why there were less errors because more of the data matched
- Customize the new Expression Dataset by creating new Color Sets which contain the instructions to GenMAPP for displaying data on MAPPs.
- Input the "Avg_LogFC_all" for the Vibrio dataset created in the excel file
- Create criteria builder for "increased":
- [Avg_LogFC_all] > 0.25 AND [Pvalue] < 0.05
- Uploaded color sets
[edit] MAPPFinder Procedure
- Launch MAPPFinder and make sure that the database for the correct species is loaded
- Click "Calculate new results" choose the dataset just created (choose Merrell_Compiled_Raw_Data_Vibrio_VK_2013.10.15.gex)
- Choose color set and criteria, select gene ontology and p value (see image to the right)
- Click "run mappfiner" Media:MerrellVK.mapp
- Gene Ontology window will open. All of the Gene Ontology terms that have at least 3 genes measured and a p value of less than 0.05 will be highlighted yellow. A term with a p value less than 0.05 is considered a "significant" result. Browse through the tree to see your results. Click on "ranked list"
- Top 10 Gene Ontology terms (image to the left)
- My terms in comparison with my partners are different, this is because he had different gene dataset that was less updated and there were differences in expression because of this.
- Find the Gene Ontology term(s) with which a particular gene is associated. Search for matched gene from Merrell et al. (2002)VC0647
- Below are the GO terms associated with the gene VC0647:
- mRNA catabolic process
- RNA processing
- cytoplasm
- mitochondrion
- RNA binding
- 3'-5' exoribonuclease activity
- ransferase activity
- nucleotidyltransferase activity
- Below are the GO terms associated with the gene VC0647:
- One GO term was chosen: Clicked on "mRNA catabolic process" this resulted in two genes listed on a GenMAPP. The one with the gene ID: PNP_VIBCH showed a decease in Pathogenic vs Lab according to my color chart. This means it significantly increased.
- Clicked on the gene ID, creating a backpage File:PNP VIBCH Backpage.txt
- From here the link to find a description of the gene was followed. According to AmiGO [[1]] This gene ID corresponds to 3'-5'-exoribonuclease activity. It plays a role in the Catalysis of the sequential cleavage of mononucleotides from a free 3' terminus of an RNA molecule.
- Uploaded the Criterion.GO in excel for the relevant Increased genes. Here is the unfiltered text file version: File:Increased VK-Criterion0-GO-2013-10-17-Criterion0-GO.txt
- In excel The information was filtered to display approximately 20 terms. To view the exact filters applied, view the results below.
- When these results were compared to those of my partners we found that the following categories were the same:
- Number of probes that met the [Avg_LogFC_All]>0.25 AND [PValue]<0.05 criteria
- Number of probes in the dataset
- The other results were all different. This makes sense because my partner used an older version of the Vibrio Gene Database. The similar terms were the ones linked to Uniprot IDs and Go terms, but the other information was more updated in my dataset. There were larger results in my procedure because there were more links to the genes made between the times the data was uploaded.
- Interpreting the GO terms. Focusing on relatedness
- The main reoccurring term I found was flagellum assembly and Pilus assembly. These were found under cellular component organization, biogenisis at cellular level, and cellular process under multiple sub branches. Although there were other differences in gene expression as well, I thought that these were particular significant.By looking into the gene exprssion differences between the pathogenic vs lab I found that many of them had increased activity in the Pathogenic strain. This led me to conclude that the structure of the flagella and pili is probably altered in the pathogenic strain in a way that makes it harmful. It is possible the the change in pili structure makes it so that it is more likely to transfer mutagenic plasmid DNA to other cells causing them to become pathogenic. The flagella structure could also have changed in this strain and played a role in its virulence. Another significant change in gene expression in the two was that the pathogenic strain had decreased mRNA degredation, which could have have alerted the number of mutations, contributing to the pathogenicity.
- Template:vkuehn
- Viktoria Kuehn
- Week 1 Assignment
- Class Journal 1
- Week 2 Assignment
- Class Journal 2
- Vkuehn Week 2
- Week 3 Assignment
- Class Journal 3
- Vkuehn Week 3
- Week 4 Assignment
- Class Journal 4
- Vkuehn Week 4
- Week 5 Assignment
- Vkuehn Week 5
- Ensembl Database
- Week 6 Assignment
- Class Journal 6
- Vkuehn Week 6
- Week 7 Assignment
- Class Journal 7
- Vkuehn Week 7
- Week 8 Assignment
- Class Journal 8
- Vkuehn Week 8
- Leishmania major
- Week 9 Assignment
- Class Journal 9
- Vkuehn Week 9
- Week 10 Assignment
- Class Journal 10
- Vkuehn Week 10
- Leishmania major
- Week 11 Assignment
- Vkuehn Week 11
- Leishmania major Genome Reference Article Presentation
- Vkuehn Week 12
- Week 12 Assignment
- Leishmania major Week 12 Status Report
- Vkuehn Week 13
- Week 13 Assignment
- Vkuehn Week 15
- Week 15 Assignment
- Vkuehn Individual Assessment and Reflection