Mbalducc Week 14
From LMU BioDB 2017
Contents
Files
Excel Sheet of Profile #22 TF from YEASTRACT
Using YEASTRACT to Infer which Transcription Factors Regulate a Cluster of Genes
- Opened the gene list in Excel for the one of the significant profiles from my stem analysis. I chose a cluster with a clear cold shock/recovery up/down or down/up pattern.
- Copied the list of gene IDs onto my clipboard.
- Launched a web browser and went to the YEASTRACT database.
- On the left panel of the window, clicked on the link to Rank by TF.
- Pasted my list of genes from my cluster into the box labeled ORFs/Genes.
- Checked the box for Check for all TFs.
- Accepted the defaults for the Regulations Filter (Documented, DNA binding plus expression evidence)
- Did not apply a filter for "Filter Documented Regulations by environmental condition".
- Ranked genes by TF using: The % of genes in the list and in YEASTRACT regulated by each TF.
- Clicked the Search button.
- Answered the following questions:
- In the results window that appears, the p values colored green are considered "significant", the ones colored yellow are considered "borderline significant" and the ones colored pink are considered "not significant". How many transcription factors are green or "significant"?
- 43 are green.
- Copied the table of results from the web page and pasteed it into a new Excel workbook to preserve the results.
- Uploaded the Excel file to OWW or Box and linked to it in my electronic lab notebook.
- Is your transcription factor on the list? If so, what is their "% in user set", "% in YEASTRACT", and "p value". (Note that this doesn't apply to the wt strain).
- My transcription factor, Cin5 was on the list. Here is the data associated with it:
- % in user set: 40.68%
- % in YEASTRACT: 3.30%
- p-value: 0.001495074628175
- My transcription factor, Cin5 was on the list. Here is the data associated with it:
- List of 20 significant transcription factors:
- Aft2p
- Mga2p
- Gis1p
- Hot1p
- Sut1p
- Zap1p
- Mig1p
- Hap2p
- Sut2p
- Bas1p
- Pho2p
- Opi1p
- Ert1p
- Oaf1p
- Sfp1p
- Mig3p
- Rlm1p
- Sum1p
- Ace2p
- Pho4p
- Went back to the YEASTRACT database and followed the link to Generate Regulation Matrix.
- Copied and pasted the list of transcription factors I identified (plus the transcription factor deleted in my strain: Cin5p) into both the "Transcription factors" field and the "Target ORF/Genes" field.
- I used the "Regulations Filter" options of "Documented", "Only DNA binding evidence"
- Clicked the "Generate" button.
- In the results window that appeared, I clicked on the link to the "Regulation matrix (Semicolon Separated Values (CSV) file)" that appeared and saved it to my Desktop. I renamed this file with a meaningful name so that I could distinguish it from the other files I will generate.
- In the results window that appears, the p values colored green are considered "significant", the ones colored yellow are considered "borderline significant" and the ones colored pink are considered "not significant". How many transcription factors are green or "significant"?
Acknowledgements
I worked with the rest of the Data Analyst guild: Antonio Porras, Dina Bashoura, and Emma Tyrnauer on this assignment. We spoke in class and created a group chat so we could ask each other questions and help each other as we worked on the project. I also used the instructions from the Week 10 assignment. I edited the instructions slightly so they would be specific to the profile I chose and the procedure I followed.
References
LMU BioDB 2017. (2017). Week 10 Retrieve November 28, 2017, from https://xmlpipedb.cs.lmu.edu/biodb/fall2017/index.php/Week_10