<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Msymond1</id>
	<title>LMU BioDB 2024 - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Msymond1"/>
	<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php/Special:Contributions/Msymond1"/>
	<updated>2026-06-14T20:23:56Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.32.1</generator>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3619</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3619"/>
		<updated>2024-05-03T18:58:15Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: fixed report&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# [[Yeast_Beasts_Deliverables|Organized Team deliverables wiki page with table of contents]]&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file) [[media:Yeast_Beasts_Deliverable.pdf|Report]]&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist) [[media:Symonds Ind. Assessment-Reflection.docx|Dean&amp;#039;s reflection]]&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file) [[Media:Yeast_Beasts_Presentation.pdf|Presentation]]&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[Media:Sample_Data_Table.xlsx | Sample-data Relationship Table]]&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[Media:ANOVA_STEM.xlsx | ANOVA and STEM Spreadsheet]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks [[Media:ANOVAslides.pdf | PPT of ANOVA, STEM, and Networks]]&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file) [https://lmu.app.box.com/file/1512445828707 GO List Zipped Files] and [https://lmu.app.box.com/file/1512445314285 Gene List Zipped Files]&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [https://lmu.app.box.com/file/1513580824058 YEASTRACT Results]&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap Input Workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to Box with GRNmap Output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5 Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 Database Diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) [https://lmu.box.com/s/y6x97kqcjlxfezhnbpvck4h4oe4pm366 Query Designs]&lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3617</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3617"/>
		<updated>2024-05-03T18:57:54Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added report&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# [[Yeast_Beasts_Deliverables|Organized Team deliverables wiki page with table of contents]]&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file) [[Yeast_Beasts_Deliverable.pdf|Report]]&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist) [[media:Symonds Ind. Assessment-Reflection.docx|Dean&amp;#039;s reflection]]&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file) [[Media:Yeast_Beasts_Presentation.pdf|Presentation]]&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[Media:Sample_Data_Table.xlsx | Sample-data Relationship Table]]&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[Media:ANOVA_STEM.xlsx | ANOVA and STEM Spreadsheet]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks [[Media:ANOVAslides.pdf | PPT of ANOVA, STEM, and Networks]]&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file) [https://lmu.app.box.com/file/1512445828707 GO List Zipped Files] and [https://lmu.app.box.com/file/1512445314285 Gene List Zipped Files]&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [https://lmu.app.box.com/file/1513580824058 YEASTRACT Results]&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap Input Workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to Box with GRNmap Output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5 Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 Database Diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) [https://lmu.box.com/s/y6x97kqcjlxfezhnbpvck4h4oe4pm366 Query Designs]&lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:Yeast_Beasts_Deliverable.pdf&amp;diff=3616</id>
		<title>File:Yeast Beasts Deliverable.pdf</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:Yeast_Beasts_Deliverable.pdf&amp;diff=3616"/>
		<updated>2024-05-03T18:57:19Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3609</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3609"/>
		<updated>2024-05-03T18:22:33Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added my reflection&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# [[Yeast_Beasts_Deliverables|Organized Team deliverables wiki page with table of contents]]&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist) [[media:Symonds Ind. Assessment-Reflection.docx|Dean&amp;#039;s reflection]]&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file) [[Media:Yeast_Beasts_Presentation.pdf|Presentation]]&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[Media:Sample_Data_Table.xlsx | Sample-data Relationship Table]]&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[Media:ANOVA_STEM.xlsx | ANOVA and STEM Spreadsheet]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks [[Media:ANOVAslides.pdf | PPT of ANOVA, STEM, and Networks]]&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file) [https://lmu.app.box.com/file/1512445828707 GO List Zipped Files] and [https://lmu.app.box.com/file/1512445314285 Gene List Zipped Files]&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [https://lmu.app.box.com/file/1513580824058 YEASTRACT Results]&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap Input Workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to Box with GRNmap Output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5 Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 Database Diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) [https://lmu.box.com/s/y6x97kqcjlxfezhnbpvck4h4oe4pm366 Query Designs]&lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:Symonds_Ind._Assessment-Reflection.docx&amp;diff=3608</id>
		<title>File:Symonds Ind. Assessment-Reflection.docx</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:Symonds_Ind._Assessment-Reflection.docx&amp;diff=3608"/>
		<updated>2024-05-03T18:21:29Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts&amp;diff=3569</id>
		<title>Yeast Beasts</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts&amp;diff=3569"/>
		<updated>2024-05-03T02:05:35Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Dean&amp;#039;s Reflection */ added spaec&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* This page will be the main place from which the Yeast Beasts team project will be managed. Include all of the information/links that you think will be useful for your team to organize your work and communicate with each other and with the instructors. &amp;#039;&amp;#039;Hint:  the kinds of things that are on your own User pages and on the course Main page can be used as a guide.&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[Media:Yeast_Beasts_Presentation.pdf | Final version of presentation]]&lt;br /&gt;
&lt;br /&gt;
[[Yeast Beasts Deliverables]]&lt;br /&gt;
&lt;br /&gt;
==Week Reflections==&lt;br /&gt;
===Week 13===&lt;br /&gt;
====[[User:Hivanson|Hailey&amp;#039;s]] Reflection [Quality Assurance]====&lt;br /&gt;
*I worked closely with Charlotte and Katie toward completing Milestones 2 and 3. We completed milestone 2 and are close to completing milestone 3. &lt;br /&gt;
*I thought it worked well to split up, with Natalija going with the coder/designers and me going with data analysis, &amp;#039;&amp;#039;but&amp;#039;&amp;#039; I would love to see where they are at on their progress so that we can join up for the upcoming milestone 4. I want to do this on or before next Tuesday, April 30th.&lt;br /&gt;
*It did not work to try to do tasks simultaneously with the data analysts. To fix this, we had one person with an open Excel sheet on their computer, another reading and checking off the steps, and another checking that all of the data and equations were being entered properly. This solution worked well for us and we will continue to have just one computer with Excel open on it, but switching roles between the person inputting data and the one checking off steps could be better for the future.&lt;br /&gt;
&lt;br /&gt;
[[User:Hivanson|Hivanson]] ([[User talk:Hivanson|talk]]) 23:35, 17 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Andrew&amp;#039;s Reflection [Coder/Designer]====&lt;br /&gt;
To find my electronic notebook for this week please click on [[Asandle1 Week 13#Electronic Lab Notebook|Andrew Sandler&amp;#039;s Week 13 Lab Notebook]] &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Executive Summary&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
#Classified Significant P-values as 1 (P &amp;lt; 0.01) or &amp;#039;0&amp;#039;&lt;br /&gt;
#Found issues with data including missing gene descriptions.&lt;br /&gt;
#Initially tried to use Yeastmine to find the missing gene information but it was inefficient.&lt;br /&gt;
#Found additional blanks in the dataset and need to speak with Dr. Dahlquist about how to solve this issue.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What worked?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Everything &amp;quot;worked&amp;quot; but some surprises came up.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What didn&amp;#039;t work?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Not that this didn&amp;#039;t work but it provided a challenge, the issues with the #REF boxes, the blank boxes, the random text in some boxes, and the NaN coming up in some spots. I don&amp;#039;t know how to deal with this and will need help from Dr. Dahlquist so I am not just guessing at a solution. I also need to figure out how to get a complete Gene ID list and then compare the whole ID list to the missing ID&amp;#039;s. I also need that list for the ID&amp;#039;s for the Access Database. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What will I do next to fix what didn&amp;#039;t work?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
I plan on speaking with Dean and Dr. Dahlquist in class tomorrow to fix these issues and then move onto working on the Access section of this assignment.&lt;br /&gt;
&lt;br /&gt;
[[Category:Journal Entry]]&lt;br /&gt;
[[Category:Team Project]]&lt;br /&gt;
&lt;br /&gt;
====Dean&amp;#039;s Reflection [Coder/Designer]====&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
[[User:Msymond1|Msymond1]] ([[User talk:Msymond1|talk]]) 13:33, 18 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Katie&amp;#039;s Reflection====&lt;br /&gt;
#This week, Charlotte, Hailey, and I worked on completing Milestones 2 and 3. These milestones consisted of preparing the dataset from SGD for analysis, and then performing an ANOVA analysis like we had done in Week 9. A more detailed summary of the steps we followed is outlined on mine and Charlotte&amp;#039;s individual page, linked below. &lt;br /&gt;
#* [[Data Analysts Week 13]]&lt;br /&gt;
#The data analysts, me and Charlotte, worked together with Hailey on progressing through Milestones 2 and 3 on the Data Analysis page. We contacted each other throughout the week to check in on what each person was doing. We then met in person to work together on performing the ANOVA analysis. This worked well, because when we couldn&amp;#039;t meet we were still able to get some work done, and then once we got together we were able to ask any questions that we had. It was slightly difficult to progress through the steps in person because when attempting to work on the dataset at the same time, only one person could be actively making changes. I don&amp;#039;t believe it is possible for this issue to be fixed, as we cannot have multiple people working at exactly the same time, because steps need to be followed in a specific order. In the future, we will continue to make sure that we split up the steps so that each person is doing an equal amount of work, and to be communicative about any questions that we have or can answer.&lt;br /&gt;
&lt;br /&gt;
[[User:Kmill104|Kmill104]] ([[User talk:Kmill104|talk]]) 23:09, 17 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Charlotte&amp;#039;s Reflection:====&lt;br /&gt;
[[User:Kmill104| Katherine Miller]] and I, being the data analysts, worked with Quality Assurance [[User:Hivanson| Hailey Ivanson]] to complete Milestone 2 and Milestone 3 in person on April 17th, 2024. We messaged the Coder/Designers and got an update from them. I wrote out the steps taken on our [[Data Analysts Week 13]] page. It was helpful that we were able to meet in person to collaborate. However, it was hard to make changes to the data since we were working on one computer. We ended up splitting up the work well, but at first everyone trying to make edits at once was hard. Now we know a system that works for us as a group.&lt;br /&gt;
&lt;br /&gt;
[[User:Ckapla12|Ckapla12]] ([[User talk:Ckapla12|talk]]) 14:00, 18 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
===Week 14===&lt;br /&gt;
===Week 15===&lt;br /&gt;
====Dean&amp;#039;s Reflection====&lt;br /&gt;
# This week and last week, the entire group and I completed milestones 3-6, as well as the rest of the deliverables for the final project.&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are creating the database and getting it all well organized and working out any bugs or issues in the database. Running the queries in the database the way I did them also worked very well and was very quick once the issues in the database were resolved. The creation of the final project report also worked well since we also had presented on the project already. &lt;br /&gt;
## The things that did not work well were the collaboration on running the queries in the coders/designers since Andrew first tried doing it in a much more complicated way that required typing all of the syntax in the SQL mode and there was little to no communication between us on how these were done or what needs to be done in the future. &lt;br /&gt;
## To fix these issues, I made sure that for the rest of the project we all collaborated and communicated well for the final presentation and project.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
# Each person needs to write a short executive summary of that person&amp;#039;s progress on the project for the week, with links to the relevant individual journal pages (which will have more detailed information).&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## What worked?&lt;br /&gt;
## What didn&amp;#039;t work?&lt;br /&gt;
## What will I do next to fix what didn&amp;#039;t work?&lt;br /&gt;
# Note that you will be directed to add specific information to your team&amp;#039;s pages in the individual portion of the assignment for this and future weeks.&lt;br /&gt;
&lt;br /&gt;
[[File:Yeast_Beasts_Presentation.pdf]]&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts&amp;diff=3568</id>
		<title>Yeast Beasts</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts&amp;diff=3568"/>
		<updated>2024-05-03T02:05:21Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Week 15 */ added my reflection&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* This page will be the main place from which the Yeast Beasts team project will be managed. Include all of the information/links that you think will be useful for your team to organize your work and communicate with each other and with the instructors. &amp;#039;&amp;#039;Hint:  the kinds of things that are on your own User pages and on the course Main page can be used as a guide.&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
[[Media:Yeast_Beasts_Presentation.pdf | Final version of presentation]]&lt;br /&gt;
&lt;br /&gt;
[[Yeast Beasts Deliverables]]&lt;br /&gt;
&lt;br /&gt;
==Week Reflections==&lt;br /&gt;
===Week 13===&lt;br /&gt;
====[[User:Hivanson|Hailey&amp;#039;s]] Reflection [Quality Assurance]====&lt;br /&gt;
*I worked closely with Charlotte and Katie toward completing Milestones 2 and 3. We completed milestone 2 and are close to completing milestone 3. &lt;br /&gt;
*I thought it worked well to split up, with Natalija going with the coder/designers and me going with data analysis, &amp;#039;&amp;#039;but&amp;#039;&amp;#039; I would love to see where they are at on their progress so that we can join up for the upcoming milestone 4. I want to do this on or before next Tuesday, April 30th.&lt;br /&gt;
*It did not work to try to do tasks simultaneously with the data analysts. To fix this, we had one person with an open Excel sheet on their computer, another reading and checking off the steps, and another checking that all of the data and equations were being entered properly. This solution worked well for us and we will continue to have just one computer with Excel open on it, but switching roles between the person inputting data and the one checking off steps could be better for the future.&lt;br /&gt;
&lt;br /&gt;
[[User:Hivanson|Hivanson]] ([[User talk:Hivanson|talk]]) 23:35, 17 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Andrew&amp;#039;s Reflection [Coder/Designer]====&lt;br /&gt;
To find my electronic notebook for this week please click on [[Asandle1 Week 13#Electronic Lab Notebook|Andrew Sandler&amp;#039;s Week 13 Lab Notebook]] &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Executive Summary&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
#Classified Significant P-values as 1 (P &amp;lt; 0.01) or &amp;#039;0&amp;#039;&lt;br /&gt;
#Found issues with data including missing gene descriptions.&lt;br /&gt;
#Initially tried to use Yeastmine to find the missing gene information but it was inefficient.&lt;br /&gt;
#Found additional blanks in the dataset and need to speak with Dr. Dahlquist about how to solve this issue.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What worked?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Everything &amp;quot;worked&amp;quot; but some surprises came up.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What didn&amp;#039;t work?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Not that this didn&amp;#039;t work but it provided a challenge, the issues with the #REF boxes, the blank boxes, the random text in some boxes, and the NaN coming up in some spots. I don&amp;#039;t know how to deal with this and will need help from Dr. Dahlquist so I am not just guessing at a solution. I also need to figure out how to get a complete Gene ID list and then compare the whole ID list to the missing ID&amp;#039;s. I also need that list for the ID&amp;#039;s for the Access Database. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What will I do next to fix what didn&amp;#039;t work?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
I plan on speaking with Dean and Dr. Dahlquist in class tomorrow to fix these issues and then move onto working on the Access section of this assignment.&lt;br /&gt;
&lt;br /&gt;
[[Category:Journal Entry]]&lt;br /&gt;
[[Category:Team Project]]&lt;br /&gt;
&lt;br /&gt;
====Dean&amp;#039;s Reflection [Coder/Designer]====&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
[[User:Msymond1|Msymond1]] ([[User talk:Msymond1|talk]]) 13:33, 18 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Katie&amp;#039;s Reflection====&lt;br /&gt;
#This week, Charlotte, Hailey, and I worked on completing Milestones 2 and 3. These milestones consisted of preparing the dataset from SGD for analysis, and then performing an ANOVA analysis like we had done in Week 9. A more detailed summary of the steps we followed is outlined on mine and Charlotte&amp;#039;s individual page, linked below. &lt;br /&gt;
#* [[Data Analysts Week 13]]&lt;br /&gt;
#The data analysts, me and Charlotte, worked together with Hailey on progressing through Milestones 2 and 3 on the Data Analysis page. We contacted each other throughout the week to check in on what each person was doing. We then met in person to work together on performing the ANOVA analysis. This worked well, because when we couldn&amp;#039;t meet we were still able to get some work done, and then once we got together we were able to ask any questions that we had. It was slightly difficult to progress through the steps in person because when attempting to work on the dataset at the same time, only one person could be actively making changes. I don&amp;#039;t believe it is possible for this issue to be fixed, as we cannot have multiple people working at exactly the same time, because steps need to be followed in a specific order. In the future, we will continue to make sure that we split up the steps so that each person is doing an equal amount of work, and to be communicative about any questions that we have or can answer.&lt;br /&gt;
&lt;br /&gt;
[[User:Kmill104|Kmill104]] ([[User talk:Kmill104|talk]]) 23:09, 17 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Charlotte&amp;#039;s Reflection:====&lt;br /&gt;
[[User:Kmill104| Katherine Miller]] and I, being the data analysts, worked with Quality Assurance [[User:Hivanson| Hailey Ivanson]] to complete Milestone 2 and Milestone 3 in person on April 17th, 2024. We messaged the Coder/Designers and got an update from them. I wrote out the steps taken on our [[Data Analysts Week 13]] page. It was helpful that we were able to meet in person to collaborate. However, it was hard to make changes to the data since we were working on one computer. We ended up splitting up the work well, but at first everyone trying to make edits at once was hard. Now we know a system that works for us as a group.&lt;br /&gt;
&lt;br /&gt;
[[User:Ckapla12|Ckapla12]] ([[User talk:Ckapla12|talk]]) 14:00, 18 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
===Week 14===&lt;br /&gt;
===Week 15===&lt;br /&gt;
====Dean&amp;#039;s Reflection====&lt;br /&gt;
# This week and last week, the entire group and I completed milestones 3-6, as well as the rest of the deliverables for the final project.&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are creating the database and getting it all well organized and working out any bugs or issues in the database. Running the queries in the database the way I did them also worked very well and was very quick once the issues in the database were resolved. The creation of the final project report also worked well since we also had presented on the project already. &lt;br /&gt;
## The things that did not work well were the collaboration on running the queries in the coders/designers since Andrew first tried doing it in a much more complicated way that required typing all of the syntax in the SQL mode and there was little to no communication between us on how these were done or what needs to be done in the future. &lt;br /&gt;
## To fix these issues, I made sure that for the rest of the project we all collaborated and communicated well for the final presentation and project.&lt;br /&gt;
&lt;br /&gt;
# Each person needs to write a short executive summary of that person&amp;#039;s progress on the project for the week, with links to the relevant individual journal pages (which will have more detailed information).&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## What worked?&lt;br /&gt;
## What didn&amp;#039;t work?&lt;br /&gt;
## What will I do next to fix what didn&amp;#039;t work?&lt;br /&gt;
# Note that you will be directed to add specific information to your team&amp;#039;s pages in the individual portion of the assignment for this and future weeks.&lt;br /&gt;
&lt;br /&gt;
[[File:Yeast_Beasts_Presentation.pdf]]&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3567</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3567"/>
		<updated>2024-05-03T02:03:06Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Acknowledgements */ finished acknowledgements&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries. and I was running into problems running the query with the network table&lt;br /&gt;
*We imported profile 41 genes into access as another table to run queries with&lt;br /&gt;
*I noticed the problem was that profile 41 only included standard names of the genes rather than the systematic names, which was not the primary key for most of the database, and the network table did not include the standard names so I could not link those two tables together using it&lt;br /&gt;
*My solution was to link the gene table to the profile table using their standard names. However, I noticed that the gene table did not include the standard names for the genes, so I imported another gene table from yeastmine that does include the standard names. Then I was able to link the profile 41 table to the gene table using the standard names and then I was able to run a query using the query design and I was able to answer question 4.&lt;br /&gt;
*I also noticed that the first three questions were incorrect since they did not include all of the genes, I&amp;#039;m not sure how that happened with Andrew&amp;#039;s queries, but I went ahead and ran them again and fixed that issue&lt;br /&gt;
*Dr. Dahlquist also noticed that Andrew&amp;#039;s queries included the data from the control expression group rather than the CHP treated expression group, so I fixed that too&lt;br /&gt;
*I followed the rest of the directions for the rest of the questions for milestone 6 and ran into no problems&lt;br /&gt;
*I was able to give the GRNmap excel file to Dr. Dahlquist for her to run it&lt;br /&gt;
&lt;br /&gt;
==Presentation==&lt;br /&gt;
===Progress===&lt;br /&gt;
*We were able to complete the presentation to the best of our abilities following the directions listed in the project deliverables page&lt;br /&gt;
*The biggest issues we ran into was structuring the presentation in a way that made sense since we all had a great understanding of our own part of the project but we did not know everyone else&amp;#039;s as well, so we had to learn a little bit about everyone&amp;#039;s part of the project in order to create a cohesive presentation that made sense&lt;br /&gt;
*We also did our best to take into account our feedback from the last presentation and tried to make our titles more descriptive and also make our bullet points more useful&lt;br /&gt;
slides [https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing|our presentation]&lt;br /&gt;
&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week and last week, the entire group and I completed milestones 3-6, as well as the rest of the deliverables for the final project.&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are creating the database and getting it all well organized and working out any bugs or issues in the database. Running the queries in the database the way I did them also worked very well and was very quick once the issues in the database were resolved. The creation of the final project report also worked well since we also had presented on the project already. &lt;br /&gt;
## The things that did not work well were the collaboration on running the queries in the coders/designers since Andrew first tried doing it in a much more complicated way that required typing all of the syntax in the SQL mode and there was little to no communication between us on how these were done or what needs to be done in the future. &lt;br /&gt;
## To fix these issues, I made sure that for the rest of the project we all collaborated and communicated well for the final presentation and project.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I utilized all of my group members and my professor in the past two weeks to assist in completing this project. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
[[User:Msymond1|Msymond1]] ([[User talk:Msymond1|talk]]) 19:03, 2 May 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3566</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3566"/>
		<updated>2024-05-03T02:02:23Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added acknowledgements&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries. and I was running into problems running the query with the network table&lt;br /&gt;
*We imported profile 41 genes into access as another table to run queries with&lt;br /&gt;
*I noticed the problem was that profile 41 only included standard names of the genes rather than the systematic names, which was not the primary key for most of the database, and the network table did not include the standard names so I could not link those two tables together using it&lt;br /&gt;
*My solution was to link the gene table to the profile table using their standard names. However, I noticed that the gene table did not include the standard names for the genes, so I imported another gene table from yeastmine that does include the standard names. Then I was able to link the profile 41 table to the gene table using the standard names and then I was able to run a query using the query design and I was able to answer question 4.&lt;br /&gt;
*I also noticed that the first three questions were incorrect since they did not include all of the genes, I&amp;#039;m not sure how that happened with Andrew&amp;#039;s queries, but I went ahead and ran them again and fixed that issue&lt;br /&gt;
*Dr. Dahlquist also noticed that Andrew&amp;#039;s queries included the data from the control expression group rather than the CHP treated expression group, so I fixed that too&lt;br /&gt;
*I followed the rest of the directions for the rest of the questions for milestone 6 and ran into no problems&lt;br /&gt;
*I was able to give the GRNmap excel file to Dr. Dahlquist for her to run it&lt;br /&gt;
&lt;br /&gt;
==Presentation==&lt;br /&gt;
===Progress===&lt;br /&gt;
*We were able to complete the presentation to the best of our abilities following the directions listed in the project deliverables page&lt;br /&gt;
*The biggest issues we ran into was structuring the presentation in a way that made sense since we all had a great understanding of our own part of the project but we did not know everyone else&amp;#039;s as well, so we had to learn a little bit about everyone&amp;#039;s part of the project in order to create a cohesive presentation that made sense&lt;br /&gt;
*We also did our best to take into account our feedback from the last presentation and tried to make our titles more descriptive and also make our bullet points more useful&lt;br /&gt;
slides [https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing|our presentation]&lt;br /&gt;
&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week and last week, the entire group and I completed milestones 3-6, as well as the rest of the deliverables for the final project.&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are creating the database and getting it all well organized and working out any bugs or issues in the database. Running the queries in the database the way I did them also worked very well and was very quick once the issues in the database were resolved. The creation of the final project report also worked well since we also had presented on the project already. &lt;br /&gt;
## The things that did not work well were the collaboration on running the queries in the coders/designers since Andrew first tried doing it in a much more complicated way that required typing all of the syntax in the SQL mode and there was little to no communication between us on how these were done or what needs to be done in the future. &lt;br /&gt;
## To fix these issues, I made sure that for the rest of the project we all collaborated and communicated well for the final presentation and project.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I utilized all of my group members and my professor in the past two weeks to assist in completing this project.&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3565</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3565"/>
		<updated>2024-05-03T02:01:23Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Team Journal Assignment */ did team journal assignment&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries. and I was running into problems running the query with the network table&lt;br /&gt;
*We imported profile 41 genes into access as another table to run queries with&lt;br /&gt;
*I noticed the problem was that profile 41 only included standard names of the genes rather than the systematic names, which was not the primary key for most of the database, and the network table did not include the standard names so I could not link those two tables together using it&lt;br /&gt;
*My solution was to link the gene table to the profile table using their standard names. However, I noticed that the gene table did not include the standard names for the genes, so I imported another gene table from yeastmine that does include the standard names. Then I was able to link the profile 41 table to the gene table using the standard names and then I was able to run a query using the query design and I was able to answer question 4.&lt;br /&gt;
*I also noticed that the first three questions were incorrect since they did not include all of the genes, I&amp;#039;m not sure how that happened with Andrew&amp;#039;s queries, but I went ahead and ran them again and fixed that issue&lt;br /&gt;
*Dr. Dahlquist also noticed that Andrew&amp;#039;s queries included the data from the control expression group rather than the CHP treated expression group, so I fixed that too&lt;br /&gt;
*I followed the rest of the directions for the rest of the questions for milestone 6 and ran into no problems&lt;br /&gt;
*I was able to give the GRNmap excel file to Dr. Dahlquist for her to run it&lt;br /&gt;
&lt;br /&gt;
==Presentation==&lt;br /&gt;
===Progress===&lt;br /&gt;
*We were able to complete the presentation to the best of our abilities following the directions listed in the project deliverables page&lt;br /&gt;
*The biggest issues we ran into was structuring the presentation in a way that made sense since we all had a great understanding of our own part of the project but we did not know everyone else&amp;#039;s as well, so we had to learn a little bit about everyone&amp;#039;s part of the project in order to create a cohesive presentation that made sense&lt;br /&gt;
*We also did our best to take into account our feedback from the last presentation and tried to make our titles more descriptive and also make our bullet points more useful&lt;br /&gt;
slides [https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing|our presentation]&lt;br /&gt;
&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week and last week, the entire group and I completed milestones 3-6, as well as the rest of the deliverables for the final project.&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are creating the database and getting it all well organized and working out any bugs or issues in the database. Running the queries in the database the way I did them also worked very well and was very quick once the issues in the database were resolved. The creation of the final project report also worked well since we also had presented on the project already. &lt;br /&gt;
## The things that did not work well were the collaboration on running the queries in the coders/designers since Andrew first tried doing it in a much more complicated way that required typing all of the syntax in the SQL mode and there was little to no communication between us on how these were done or what needs to be done in the future. &lt;br /&gt;
## To fix these issues, I made sure that for the rest of the project we all collaborated and communicated well for the final presentation and project.&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3553</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3553"/>
		<updated>2024-05-03T01:21:18Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added presentation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file) [[Media:Yeast_Beasts_Presentation.pdf|Presentation]]&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5 Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 database diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) [https://lmu.box.com/s/y6x97kqcjlxfezhnbpvck4h4oe4pm366 Query Designs]&lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3552</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3552"/>
		<updated>2024-05-03T01:19:45Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added query designs&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5 Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 database diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) [https://lmu.box.com/s/y6x97kqcjlxfezhnbpvck4h4oe4pm366 Query Designs]&lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3548</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3548"/>
		<updated>2024-05-03T01:13:45Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: fixed labels&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5 Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 database diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3546</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3546"/>
		<updated>2024-05-03T01:13:11Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added database diagram&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|link Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;) [https://lmu.box.com/s/0j6y9p1mrb3dji8wuvn8zhfwrb6lpo97 database diagram]&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3543</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3543"/>
		<updated>2024-05-03T01:10:33Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added label&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb|link link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|link Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3541</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3541"/>
		<updated>2024-05-03T01:09:59Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added label&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb|link link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3540</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3540"/>
		<updated>2024-05-03T01:09:35Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added grnmap output&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together [https://lmu.box.com/s/rathfd4jhdqdjzuci4kdagmptq1jbxeb|link to box with GRNmap output]&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3539</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3539"/>
		<updated>2024-05-03T01:07:55Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added input workbook&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:GRNmap_930PM.xlsx|GRNmap input workbook]]&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:GRNmap_930PM.xlsx&amp;diff=3537</id>
		<title>File:GRNmap 930PM.xlsx</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:GRNmap_930PM.xlsx&amp;diff=3537"/>
		<updated>2024-05-03T01:06:53Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3534</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3534"/>
		<updated>2024-05-03T01:04:35Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added ANOVA&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media:Expression_Table.xlsx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3532</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3532"/>
		<updated>2024-05-03T01:03:31Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added ANOVA table&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) [[media: Expression_Table.xslx|ANOVA table]]&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:Expression_Table.xlsx&amp;diff=3531</id>
		<title>File:Expression Table.xlsx</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=File:Expression_Table.xlsx&amp;diff=3531"/>
		<updated>2024-05-03T01:01:18Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3527</id>
		<title>Yeast Beasts Deliverables</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts_Deliverables&amp;diff=3527"/>
		<updated>2024-05-03T00:56:06Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added database&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;# Organized Team deliverables wiki page with table of contents&lt;br /&gt;
# Group Report (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Individual statements of work, assessments, reflections (wiki page, &amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;, or e-mailed to Dr. Dahlquist)&lt;br /&gt;
# Group PowerPoint presentation (given on Thursday, May 2, &amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039; or &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039; file)&lt;br /&gt;
# Sample-data relationship table in Excel (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# Excel spreadsheet with ANOVA results/stem formatting (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# PowerPoint of ANOVA table, screenshots of stem results (&amp;#039;&amp;#039;.pptx&amp;#039;&amp;#039;), screenshot of black and white GRNsight input network and colored GRNmap/GRNsight output networks&lt;br /&gt;
# Gene List and GO List files from each significant profile (&amp;#039;&amp;#039;.txt&amp;#039;&amp;#039; compressed together in a &amp;#039;&amp;#039;.zip&amp;#039;&amp;#039; file)&lt;br /&gt;
# YEASTRACT &amp;quot;rank by TF&amp;quot; results (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap input workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;)&lt;br /&gt;
# GRNmap output workbook (&amp;#039;&amp;#039;.xlsx&amp;#039;&amp;#039;) and output plots (&amp;#039;&amp;#039;.jpg&amp;#039;&amp;#039;) zipped together&lt;br /&gt;
# MS Access database, including all tables (&amp;#039;&amp;#039;.accdb&amp;#039;&amp;#039;) [https://lmu.box.com/s/2mcetil8n7vxzwe100avt5yhmc7psuh5|Database]&lt;br /&gt;
# ReadMe for the database that describes the design of the database, references the sources of the data, and has a [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram] (&amp;#039;&amp;#039;.docx&amp;#039;&amp;#039;, &amp;#039;&amp;#039;.pdf&amp;#039;&amp;#039;)&lt;br /&gt;
# Query design for populating a GRNmap input workbook from the database (screenshot of MS Access; or SQL code, &amp;#039;&amp;#039;.txt&amp;#039;&amp;#039;) &lt;br /&gt;
# Electronic notebook corresponding to these the microarray results files ([[Week 13]], [[Week 14]], and [[Week 15]]) to support &amp;#039;&amp;#039;reproducible research&amp;#039;&amp;#039; so that all manipulations of the data and files are documented so that someone else could begin with your starting file, follow the protocol, and obtain your results.&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3514</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3514"/>
		<updated>2024-05-02T17:47:41Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Presentation */ added progress for presentation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries. and I was running into problems running the query with the network table&lt;br /&gt;
*We imported profile 41 genes into access as another table to run queries with&lt;br /&gt;
*I noticed the problem was that profile 41 only included standard names of the genes rather than the systematic names, which was not the primary key for most of the database, and the network table did not include the standard names so I could not link those two tables together using it&lt;br /&gt;
*My solution was to link the gene table to the profile table using their standard names. However, I noticed that the gene table did not include the standard names for the genes, so I imported another gene table from yeastmine that does include the standard names. Then I was able to link the profile 41 table to the gene table using the standard names and then I was able to run a query using the query design and I was able to answer question 4.&lt;br /&gt;
*I also noticed that the first three questions were incorrect since they did not include all of the genes, I&amp;#039;m not sure how that happened with Andrew&amp;#039;s queries, but I went ahead and ran them again and fixed that issue&lt;br /&gt;
*Dr. Dahlquist also noticed that Andrew&amp;#039;s queries included the data from the control expression group rather than the CHP treated expression group, so I fixed that too&lt;br /&gt;
*I followed the rest of the directions for the rest of the questions for milestone 6 and ran into no problems&lt;br /&gt;
*I was able to give the GRNmap excel file to Dr. Dahlquist for her to run it&lt;br /&gt;
&lt;br /&gt;
==Presentation==&lt;br /&gt;
===Progress===&lt;br /&gt;
*We were able to complete the presentation to the best of our abilities following the directions listed in the project deliverables page&lt;br /&gt;
*The biggest issues we ran into was structuring the presentation in a way that made sense since we all had a great understanding of our own part of the project but we did not know everyone else&amp;#039;s as well, so we had to learn a little bit about everyone&amp;#039;s part of the project in order to create a cohesive presentation that made sense&lt;br /&gt;
*We also did our best to take into account our feedback from the last presentation and tried to make our titles more descriptive and also make our bullet points more useful&lt;br /&gt;
slides [https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing|our presentation]&lt;br /&gt;
&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3513</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3513"/>
		<updated>2024-05-02T17:39:11Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Milestone 6 */ finished milestone 6&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries. and I was running into problems running the query with the network table&lt;br /&gt;
*We imported profile 41 genes into access as another table to run queries with&lt;br /&gt;
*I noticed the problem was that profile 41 only included standard names of the genes rather than the systematic names, which was not the primary key for most of the database, and the network table did not include the standard names so I could not link those two tables together using it&lt;br /&gt;
*My solution was to link the gene table to the profile table using their standard names. However, I noticed that the gene table did not include the standard names for the genes, so I imported another gene table from yeastmine that does include the standard names. Then I was able to link the profile 41 table to the gene table using the standard names and then I was able to run a query using the query design and I was able to answer question 4.&lt;br /&gt;
*I also noticed that the first three questions were incorrect since they did not include all of the genes, I&amp;#039;m not sure how that happened with Andrew&amp;#039;s queries, but I went ahead and ran them again and fixed that issue&lt;br /&gt;
*Dr. Dahlquist also noticed that Andrew&amp;#039;s queries included the data from the control expression group rather than the CHP treated expression group, so I fixed that too&lt;br /&gt;
*I followed the rest of the directions for the rest of the questions for milestone 6 and ran into no problems&lt;br /&gt;
*I was able to give the GRNmap excel file to Dr. Dahlquist for her to run it&lt;br /&gt;
&lt;br /&gt;
==Presentation==&lt;br /&gt;
slides [https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing]&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3512</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3512"/>
		<updated>2024-05-02T17:31:30Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added slides&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries.&lt;br /&gt;
==Presentation==&lt;br /&gt;
slides [https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing]&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Asandle1_Week_15&amp;diff=3511</id>
		<title>Asandle1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Asandle1_Week_15&amp;diff=3511"/>
		<updated>2024-05-02T17:30:46Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* April 25 Notes */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Template:Asandle1}}&lt;br /&gt;
&lt;br /&gt;
== Electronic Lab Notebook ==&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;This Journal is combination between week 14 and 15.&lt;br /&gt;
To get to the Team Journal Page please click here: [[Yeast Beasts]]&lt;br /&gt;
To get to the Previous Journal click here: [[Asandle1 Week 13]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
===In Class Thursday April 18th Notes===&lt;br /&gt;
&lt;br /&gt;
*Dean immediately told me he needed help getting the text for the degradation imported&lt;br /&gt;
*I told him I knew how to fix it and he just needed to import the text to excel instead of copy pasting. Dr. Dahlquist helped him with that.&lt;br /&gt;
* we solved the issue of the #REF and Blanks, we didn&amp;#039;t end up needing those two columns so we just deleted them&lt;br /&gt;
* Dean shared into the box up to the additional steps he took.&lt;br /&gt;
* Everything is into the file and we are having the other group look over what we did.&lt;br /&gt;
* We are now thinking about what to put down for metadata, I asked what we need for metadata and Dr. Dahlquist is explaining to us what metadata is and what to put in for it.&lt;br /&gt;
* Added a sheet and named it Metadata&lt;br /&gt;
* Added a Primary Key, Description, Data Source, Date Accessed, Data Updated Date, and Publication (Pubmed ID) or DOI Column&lt;br /&gt;
* Entered the info under each of these sections, and shared a screenshot with Dean to make sure we don&amp;#039;t do the same work twice&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
===April 21 Notes===&lt;br /&gt;
* Trying to get in touch with to see where he left off so I know where to progress for next steps&lt;br /&gt;
&lt;br /&gt;
===April 22 Notes===&lt;br /&gt;
* Dean looks to have created 4 of the tables but I need to speak with him in person to make corrections&lt;br /&gt;
* Created the metadata table&lt;br /&gt;
* Added detailed descriptions for table items&lt;br /&gt;
* Access File is on Box in each iteration, uploaded most recent version to box.&lt;br /&gt;
&lt;br /&gt;
===April 23 Notes===&lt;br /&gt;
#Issue with Degradation rates and production rates on access going to 0&amp;#039;s and 1&amp;#039;s.&lt;br /&gt;
# I told Dean to ignore this issue and I will Deal with it while he deals with the expression table.&lt;br /&gt;
# Dr. Dahlquist tried reimporting and it works, so I am going to import both Degradation_Rates and Production_Rates again.&lt;br /&gt;
# I tried importing and had no issues&lt;br /&gt;
# Fixed the issues and everything worked fine&lt;br /&gt;
# Dean is working on the Expression_Table right now while I was fixing these. He encountered some issues for the Data team to solve.&lt;br /&gt;
# Dean said there is an error in the Network table. He said he can&amp;#039;t make something a primary key because of a null value. I told him thats an excel importing issue, and we need to fix it in excel. I told Dean I will deal with it because he is not understanding the issue.&lt;br /&gt;
# I connected all of the relationships for the primary keys. Meta data remains unconnected since it is a separate piece.&lt;br /&gt;
&lt;br /&gt;
===April 25 Notes===&lt;br /&gt;
We are waiting for QA to finish Milestone 5 so we can do Milestone 6, In the meantime we are going to work on the presentation&lt;br /&gt;
Got the Gene list for profile 41 from the Data Analyst Team which are: Rpn4, Gcn4, Pdr1, Xbp1, Met28, Mga2, Spt23, Bas1, Yap1, Sok2, Msn2, Crz1, Rlm1, Fhl1, Pdr3, Cbf1, Rph1, Met31, Stp1, Msn4, Tec1, Rgm1, Stp2.&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/presentation/d/1DnYfkl9j5hy6EqTc1XT0tT7A_husJCJNNgsxhncsozY/edit?usp=sharing|presentation]&lt;br /&gt;
&lt;br /&gt;
==References &amp;amp; Acknowledgements==&lt;br /&gt;
===Acknowlegements===&lt;br /&gt;
* Dean and I communicated about the project via text, at one point I didn&amp;#039;t hear from him so I called him and he promptly got back to me.&lt;br /&gt;
* Dr. Dahlquist helped us solve some issues with the data in the production_rates and degradation_rates tables. She also provided assistance with the expression table.&lt;br /&gt;
&lt;br /&gt;
===References===&lt;br /&gt;
#LMU BioDB 2024. (2024). Coder/Designer. Retrieved April 17, 2024, from https://xmlpipedb.cs.lmu.edu/biodb/spring2024/index.php/Coder/Designer&lt;br /&gt;
#Community Central Fandom. (2024). Forum:How do I link to a heading on another page?. Retrieved April 17, 2024, from https://community.fandom.com/wiki/Forum:How_do_I_link_to_a_heading_on_another_page%3F &lt;br /&gt;
&lt;br /&gt;
Used week 13 as an outline&lt;br /&gt;
Except for what is noted above, this individual journal entry was completed by me and not copied from another source. [[User:Asandle1|Asandle1]] ([[User talk:Asandle1|talk]])&lt;br /&gt;
&lt;br /&gt;
[[Category: Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3510</id>
		<title>MSymond1 Week 15</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_15&amp;diff=3510"/>
		<updated>2024-05-02T17:29:39Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: started week 15&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 3 continued===&lt;br /&gt;
*I was able to obtain the data from the yeastmine website by scrolling down and selecting the table with “all verified uncharacterized dubious ORFs”, the data table included all of the columns that Dr. Dahlquist suggested I include in the table&lt;br /&gt;
*The production and degradation tables could be saved as txt files which could then be opened in excel directly&lt;br /&gt;
*The metadata table was created with assistance from my professor in which she helped us gather the necessary materials for it&lt;br /&gt;
===Milestone 4===&lt;br /&gt;
*The database creation went mostly well with a few issues along the way such as&lt;br /&gt;
*Importing the tables from excel led to some problems of all values being either 0 or 1 on production rates or degradation rates tables, so we started to export the tables from excel as a txt file and then open them in access at txt files&lt;br /&gt;
*In the relationships window, we connected the systematic name column for each of the tables to ensure they would all be connected since that column was the primary key in each of the data tables&lt;br /&gt;
===Milesone 5===&lt;br /&gt;
*The quality assurance team was able to verify that the database was correct and had all necessary fields&lt;br /&gt;
===Milestone 6===&lt;br /&gt;
*We were informed by the data analysts that they selected profile 41 to analyze, this profile included 23 genes initially&lt;br /&gt;
*At first we were stuck and were not sure how to obtain the necessary data for the GRNmap, but Dr. Dahlquist pointed out that we can use the database that we created to run queries and find the necessary fields for each of the 23 genes&lt;br /&gt;
*I was informed that Andrew had completed the first three queries for the first three tables on the GRNmap excel sheet&lt;br /&gt;
*I followed the sample GRNmap excel sheet as a template for the queries.&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Template:MSymond1&amp;diff=3509</id>
		<title>Template:MSymond1</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Template:MSymond1&amp;diff=3509"/>
		<updated>2024-05-02T17:12:51Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added week 15&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====User Page====&lt;br /&gt;
*[[User:Msymond1]]&lt;br /&gt;
====Assignment Pages====&lt;br /&gt;
*[[Week 1]]&lt;br /&gt;
*[[Week 2]]&lt;br /&gt;
*[[Week 3]]&lt;br /&gt;
*[[Week 4]]&lt;br /&gt;
*[[Week 5]]&lt;br /&gt;
*[[Week 6]]&lt;br /&gt;
*[[Week 8]]&lt;br /&gt;
*[[Week 9]]&lt;br /&gt;
*[[Week 10]]&lt;br /&gt;
*[[Week 12]]&lt;br /&gt;
*[[Week 13]]&lt;br /&gt;
*[[Week 15]]&lt;br /&gt;
&lt;br /&gt;
====Individual Journal Pages====&lt;br /&gt;
*[[MSymond1 Week 1]]&lt;br /&gt;
*[[MSymond1 Week 2]]&lt;br /&gt;
*[[MSymond1 KMill104 Week 3]]&lt;br /&gt;
*[[NeMO_Week4]]&lt;br /&gt;
*[[MSymond1 Week 5]]&lt;br /&gt;
*[[MSymond1 Week 6]]&lt;br /&gt;
*[[MSymond1 Week 8]]&lt;br /&gt;
*[[MSymond1 Week 9]]&lt;br /&gt;
*[[MSymond1 Week 10]]&lt;br /&gt;
*[[MSymond1 Week 12]]&lt;br /&gt;
*[[MSymond1 Week 13]]&lt;br /&gt;
*[[MSymond1 Week 15]]&lt;br /&gt;
&lt;br /&gt;
====Class Journal Pages====&lt;br /&gt;
*[[Class Journal Week 1]]&lt;br /&gt;
*[[Class Journal Week 2]]&lt;br /&gt;
*[[Class Journal Week 3]]&lt;br /&gt;
*[[Class Journal Week 4]]&lt;br /&gt;
*[[Class Journal Week 5]]&lt;br /&gt;
*[[Class Journal Week 6]]&lt;br /&gt;
*[[Class Journal Week 8]]&lt;br /&gt;
*[[Class Journal Week 9]]&lt;br /&gt;
*[[Class Journal Week 10]]&lt;br /&gt;
[[Category:Journal Entry]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts&amp;diff=3399</id>
		<title>Yeast Beasts</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Yeast_Beasts&amp;diff=3399"/>
		<updated>2024-04-18T20:33:11Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Week Reflections */ added my reflection&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* This page will be the main place from which the Yeast Beasts team project will be managed. Include all of the information/links that you think will be useful for your team to organize your work and communicate with each other and with the instructors. &amp;#039;&amp;#039;Hint:  the kinds of things that are on your own User pages and on the course Main page can be used as a guide.&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Week Reflections==&lt;br /&gt;
===Week 13===&lt;br /&gt;
====[[User:Hivanson|Hailey&amp;#039;s]] Reflection [Quality Assurance]====&lt;br /&gt;
*I worked closely with Charlotte and Katie toward completing Milestones 2 and 3. We completed milestone 2 and are close to completing milestone 3. &lt;br /&gt;
*I thought it worked well to split up, with Natalija going with the coder/designers and me going with data analysis, &amp;#039;&amp;#039;but&amp;#039;&amp;#039; I would love to see where they are at on their progress so that we can join up for the upcoming milestone 4. I want to do this on or before next Tuesday, April 30th.&lt;br /&gt;
*It did not work to try to do tasks simultaneously with the data analysts. To fix this, we had one person with an open Excel sheet on their computer, another reading and checking off the steps, and another checking that all of the data and equations were being entered properly. This solution worked well for us and we will continue to have just one computer with Excel open on it, but switching roles between the person inputting data and the one checking off steps could be better for the future.&lt;br /&gt;
&lt;br /&gt;
[[User:Hivanson|Hivanson]] ([[User talk:Hivanson|talk]]) 23:35, 17 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Andrew&amp;#039;s Reflection [Coder/Designer]====&lt;br /&gt;
To find my electronic notebook for this week please click on [[Asandle1 Week 13#Electronic Lab Notebook|Andrew Sandler&amp;#039;s Week 13 Lab Notebook]] &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Executive Summary&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
#Classified Significant P-values as 1 (P &amp;lt; 0.01) or &amp;#039;0&amp;#039;&lt;br /&gt;
#Found issues with data including missing gene descriptions.&lt;br /&gt;
#Initially tried to use Yeastmine to find the missing gene information but it was inefficient.&lt;br /&gt;
#Found additional blanks in the dataset and need to speak with Dr. Dahlquist about how to solve this issue.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What worked?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Everything &amp;quot;worked&amp;quot; but some surprises came up.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What didn&amp;#039;t work?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
Not that this didn&amp;#039;t work but it provided a challenge, the issues with the #REF boxes, the blank boxes, the random text in some boxes, and the NaN coming up in some spots. I don&amp;#039;t know how to deal with this and will need help from Dr. Dahlquist so I am not just guessing at a solution. I also need to figure out how to get a complete Gene ID list and then compare the whole ID list to the missing ID&amp;#039;s. I also need that list for the ID&amp;#039;s for the Access Database. &lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;What will I do next to fix what didn&amp;#039;t work?&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
I plan on speaking with Dean and Dr. Dahlquist in class tomorrow to fix these issues and then move onto working on the Access section of this assignment.&lt;br /&gt;
&lt;br /&gt;
[[Category:Assignment]]&lt;br /&gt;
[[Category:Team Project]]&lt;br /&gt;
&lt;br /&gt;
====Dean&amp;#039;s Reflection [Coder/Designer]====&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
[[User:Msymond1|Msymond1]] ([[User talk:Msymond1|talk]]) 13:33, 18 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Katie&amp;#039;s Reflection====&lt;br /&gt;
#This week, Charlotte, Hailey, and I worked on completing Milestones 2 and 3. These milestones consisted of preparing the dataset from SGD for analysis, and then performing an ANOVA analysis like we had done in Week 9. A more detailed summary of the steps we followed is outlined on mine and Charlotte&amp;#039;s individual page, linked below. &lt;br /&gt;
#* [[Data Analysts Week 13]]&lt;br /&gt;
#The data analysts, me and Charlotte, worked together with Hailey on progressing through Milestones 2 and 3 on the Data Analysis page. We contacted each other throughout the week to check in on what each person was doing. We then met in person to work together on performing the ANOVA analysis. This worked well, because when we couldn&amp;#039;t meet we were still able to get some work done, and then once we got together we were able to ask any questions that we had. It was slightly difficult to progress through the steps in person because when attempting to work on the dataset at the same time, only one person could be actively making changes. I don&amp;#039;t believe it is possible for this issue to be fixed, as we cannot have multiple people working at exactly the same time, because steps need to be followed in a specific order. In the future, we will continue to make sure that we split up the steps so that each person is doing an equal amount of work, and to be communicative about any questions that we have or can answer.&lt;br /&gt;
&lt;br /&gt;
[[User:Kmill104|Kmill104]] ([[User talk:Kmill104|talk]]) 23:09, 17 April 2024 (PDT)&lt;br /&gt;
&lt;br /&gt;
====Charlotte&amp;#039;s Reflection:====&lt;br /&gt;
[[User:Kmill104| Katherine Miller]] and I, being the data analysts, worked with Quality Assurance [[User:Hivanson| Hailey Ivanson]] to complete Milestone 2 and Milestone 3 in person on April 17th, 2024. We messaged the Coder/Designers and got an update from them. I wrote out the steps taken on our [[Data Analysts Week 13]] page. It was helpful that we were able to meet in person to collaborate. However, it was hard to make changes to the data since we were working on one computer. We ended up splitting up the work well, but at first everyone trying to make edits at once was hard. Now we know a system that works for us as a group.&lt;br /&gt;
&lt;br /&gt;
===Week 14===&lt;br /&gt;
===Week 15===&lt;br /&gt;
# Each person needs to write a short executive summary of that person&amp;#039;s progress on the project for the week, with links to the relevant individual journal pages (which will have more detailed information).&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## What worked?&lt;br /&gt;
## What didn&amp;#039;t work?&lt;br /&gt;
## What will I do next to fix what didn&amp;#039;t work?&lt;br /&gt;
# Note that you will be directed to add specific information to your team&amp;#039;s pages in the individual portion of the assignment for this and future weeks.&lt;br /&gt;
&lt;br /&gt;
{{Yeast_Beasts}}&lt;br /&gt;
[[Template:Yeast_Beasts]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_13&amp;diff=3342</id>
		<title>MSymond1 Week 13</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_13&amp;diff=3342"/>
		<updated>2024-04-18T03:27:46Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: did the team journal assignment&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 1===&lt;br /&gt;
*Milestone 1 had been completed with our presentation to the class&lt;br /&gt;
===Milestone 2===&lt;br /&gt;
*Milestone 2 was completed because we both already had the necessary softwares for the assignment&lt;br /&gt;
===Milesone 3===&lt;br /&gt;
*Milestone 3 was the most time consuming and most difficult of all the milestones&lt;br /&gt;
*The gene ID table from yeastmine could not yet be acquired because I do not know how to access a full length table from the website of the entire yeast genome or download it from the website&lt;br /&gt;
*The Gene expression table is being made by the data analysts, therefore we do not have access to it yet&lt;br /&gt;
*After importing the data for the network table, it needed some cleaning up&lt;br /&gt;
*YPD is rich medium, can be deleted&lt;br /&gt;
*The numbers at top row could be deleted, because nor me or Dr. Dahlquist had an idea what it is&lt;br /&gt;
*We attempted to convert the p values to either 0 or 1, 1 if p&amp;lt;.001 and 0 if p&amp;gt;.001&lt;br /&gt;
*It didn’t work on my computer trying to do the syntax to convert the p values, and it didn’t on Dahlquist’s computer either, but it did on Andrew’s so we just used his&lt;br /&gt;
*The rest of the data tables do not require much adjusting if at all. Therefore they should be more simple to import into the database&lt;br /&gt;
*I attempted to import the data from the degradation rates and the production rates tables into excel, however it did not separate any of the rows or columns, I will likely have to ask my professor how to troubleshoot this issue.&lt;br /&gt;
==Team Journal Assignment==&lt;br /&gt;
# This week, me and my partner completed milestones 1 and 2, and we are currently working on milestone 3, there are some complications in milestone 3, for a large part of it requires Microsoft Access, and there are also some issues in importing tables to excel. [[MSymond1]]&lt;br /&gt;
# Each team member should reflect on the team&amp;#039;s progress:&lt;br /&gt;
## The things that worked well are cleaning up the data for the network table, which was done in class on Tuesday, the data table looks much more organized and the p values have all been successfully converted&lt;br /&gt;
## The other data tables are not pasting into excel as neatly as anticipated, I am also unaware of how to obtain the data from the yeastmine website. &lt;br /&gt;
## To fix these issues, I will ask Dr. Dahlquist for further advice in class on Thursday.&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_13&amp;diff=3340</id>
		<title>MSymond1 Week 13</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_13&amp;diff=3340"/>
		<updated>2024-04-18T03:18:44Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added category and acknowledgements&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 1===&lt;br /&gt;
*Milestone 1 had been completed with our presentation to the class&lt;br /&gt;
===Milestone 2===&lt;br /&gt;
*Milestone 2 was completed because we both already had the necessary softwares for the assignment&lt;br /&gt;
===Milesone 3===&lt;br /&gt;
*Milestone 3 was the most time consuming and most difficult of all the milestones&lt;br /&gt;
*The gene ID table from yeastmine could not yet be acquired because I do not know how to access a full length table from the website of the entire yeast genome or download it from the website&lt;br /&gt;
*The Gene expression table is being made by the data analysts, therefore we do not have access to it yet&lt;br /&gt;
*After importing the data for the network table, it needed some cleaning up&lt;br /&gt;
*YPD is rich medium, can be deleted&lt;br /&gt;
*The numbers at top row could be deleted, because nor me or Dr. Dahlquist had an idea what it is&lt;br /&gt;
*We attempted to convert the p values to either 0 or 1, 1 if p&amp;lt;.001 and 0 if p&amp;gt;.001&lt;br /&gt;
*It didn’t work on my computer trying to do the syntax to convert the p values, and it didn’t on Dahlquist’s computer either, but it did on Andrew’s so we just used his&lt;br /&gt;
*The rest of the data tables do not require much adjusting if at all. Therefore they should be more simple to import into the database&lt;br /&gt;
*I attempted to import the data from the degradation rates and the production rates tables into excel, however it did not separate any of the rows or columns, I will likely have to ask my professor how to troubleshoot this issue.&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_13&amp;diff=3339</id>
		<title>MSymond1 Week 13</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_13&amp;diff=3339"/>
		<updated>2024-04-18T03:17:20Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added my progress report&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Progress Report==&lt;br /&gt;
===Milestone 1===&lt;br /&gt;
*Milestone 1 had been completed with our presentation to the class&lt;br /&gt;
===Milestone 2===&lt;br /&gt;
*Milestone 2 was completed because we both already had the necessary softwares for the assignment&lt;br /&gt;
===Milesone 3===&lt;br /&gt;
*Milestone 3 was the most time consuming and most difficult of all the milestones&lt;br /&gt;
*The gene ID table from yeastmine could not yet be acquired because I do not know how to access a full length table from the website of the entire yeast genome or download it from the website&lt;br /&gt;
*The Gene expression table is being made by the data analysts, therefore we do not have access to it yet&lt;br /&gt;
*After importing the data for the network table, it needed some cleaning up&lt;br /&gt;
*YPD is rich medium, can be deleted&lt;br /&gt;
*The numbers at top row could be deleted, because nor me or Dr. Dahlquist had an idea what it is&lt;br /&gt;
*We attempted to convert the p values to either 0 or 1, 1 if p&amp;lt;.001 and 0 if p&amp;gt;.001&lt;br /&gt;
*It didn’t work on my computer trying to do the syntax to convert the p values, and it didn’t on Dahlquist’s computer either, but it did on Andrew’s so we just used his&lt;br /&gt;
*The rest of the data tables do not require much adjusting if at all. Therefore they should be more simple to import into the database&lt;br /&gt;
*I attempted to import the data from the degradation rates and the production rates tables into excel, however it did not separate any of the rows or columns, I will likely have to ask my professor how to troubleshoot this issue.&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Coder/Designer&amp;diff=3286</id>
		<title>Coder/Designer</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Coder/Designer&amp;diff=3286"/>
		<updated>2024-04-17T20:56:38Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Milestone 3: Design a database to store data needed to create a GRNmap input workbook */ changed back the name&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Final Project Links}}&lt;br /&gt;
&lt;br /&gt;
The Coder/Designers are responsible for creating the Microsoft Access database that will be used by the Data Analysts to prepare an input workbook for GRNmap for the microarray dataset they are analyzing.  The Coder/Designers are also the resident experts on the technology being used—assorted software, file management, version control, and troubleshooting. He or she coordinates with Dr. Dahlquist and fellow Coders/Designers in developing the Access database and storing it on Box.&lt;br /&gt;
&lt;br /&gt;
== Guild Members ==&lt;br /&gt;
&lt;br /&gt;
* Dean&lt;br /&gt;
* Andrew&lt;br /&gt;
&lt;br /&gt;
== Milestones ==&lt;br /&gt;
&lt;br /&gt;
The milestones do not necessarily correspond to particular weeks; instead they are sets of tasks grouped together. However, Milestone 3 is a hard prerequisite for proceeding to Milestone 4, so ideally the Coder/Coder guild should finish these milestones (they require some coordination; see below) as soon as possible.&lt;br /&gt;
&lt;br /&gt;
* Coder/Designers can have a shared &amp;#039;&amp;#039;individual&amp;#039;&amp;#039; journal entry.  Both students will be given the same grade and are expected to contribute equally to the electronic lab notebook.&lt;br /&gt;
* Detailed notes should be taken throughout consistent with reproducible research and contributing to the final deliverables.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Milestone 1: Journal Club Presentation ===&lt;br /&gt;
&lt;br /&gt;
* The Coder/Designers will work with one of the QA&amp;#039;s to create and deliver a Journal Club presentation about to their assigned paper.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 2: Working Environment Setup ===&lt;br /&gt;
&lt;br /&gt;
Coder/Designer work will require the following software/accounts. The Seaver 120 lab computers are already set up for this; this list is provided for Coders/Designers who need to work on a different computer or outside of the lab.&lt;br /&gt;
* Microsoft Access&lt;br /&gt;
* Box account (provided by LMU)&lt;br /&gt;
** Databases created by the teams will be kept in a [https://lmu.box.com/s/lorqp5d5hkzqb7q161ldb66mhqfham3f &amp;quot;BIOL367_Spring2024&amp;quot; Box folder].&lt;br /&gt;
** Coder/Designer guild members have rights as editor to this folder; all others in the class can only view/download.&lt;br /&gt;
** This folder will serve as as the version control mechanism for the Coder/Designer guild.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 3: Design a database to store data needed to create a GRNmap input workbook ===&lt;br /&gt;
&lt;br /&gt;
* Designer/Coders will work with the QA&amp;#039;s to create a MS Access Database that will contain data needed to create a GRNmap input workbook.  It will need to have the following tables:&lt;br /&gt;
** A &amp;lt;code&amp;gt;gene&amp;lt;/code&amp;gt; table that contains all of the gene IDs for the entire yeast genome, obtained from [https://yeastmine.yeastgenome.org/yeastmine/begin.do YeastMine].&lt;br /&gt;
** An &amp;lt;code&amp;gt;expression&amp;lt;/code&amp;gt; table to store the yeast time-course microarray data for the dataset being analyzed by the Data Analysts.  You will consult with the Data Analysts and QA&amp;#039;s to figure out the sample-data relationships and how that should be encoded as fields in the database.&lt;br /&gt;
** A &amp;lt;code&amp;gt;degradation_rates&amp;lt;/code&amp;gt; table that contains degradation rates from Neymotin et al. (2014).  This table is provided at [[Media:Degradation_rates.txt| this link]].&lt;br /&gt;
** A &amp;lt;code&amp;gt;production_rates&amp;lt;/code&amp;gt; table that contains initial guesses for the production rates for each gene.  This table is provided at [[Media:Production_rates.txt | this link]].&lt;br /&gt;
** A &amp;lt;code&amp;gt;network&amp;lt;/code&amp;gt; table that contains the gene regulatory network data from the Harbison et al. (2004) paper.  [[Media:Pvalbygene_forpaper_abbr.xls | Here is the link to the data.]]&lt;br /&gt;
** A &amp;lt;code&amp;gt;metadata&amp;lt;/code&amp;gt; table that encodes information about the database itself, i.e., other tables in the database.&lt;br /&gt;
*** A major part of the design work will be to figure out what information needs to be in the metadata table so that queries can be easily and uniquely performed on the data.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 4: Build the database ===&lt;br /&gt;
&lt;br /&gt;
* Once the design work has been completed, you need to actually import the data into the database.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 5: Validation, Quality Assurance, and Documentation of the Database ===&lt;br /&gt;
&lt;br /&gt;
* The QA will perform quality assurance to make sure that the database is correct and accurate.&lt;br /&gt;
** In particular, the QA&amp;#039;s need to make sure that all of the rows of data were imported into the database for each table.&lt;br /&gt;
** The QA&amp;#039;s will make sure that both the ID (SGD systematic name) and Standard Names are included in the expression table and are correct.&lt;br /&gt;
* QA&amp;#039;s will communicate to the Coder/Designers any changes needed to the database.&lt;br /&gt;
* With the QA&amp;#039;s finalize the [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram]&lt;br /&gt;
&lt;br /&gt;
=== Milestone 6: Document the schema and design queries to create the GRNmap input workbook ===&lt;br /&gt;
&lt;br /&gt;
* Assist the Data Analysts with the queries needed to creat a GRNmap input workbook&lt;br /&gt;
&lt;br /&gt;
{{Final Project Links}}&lt;br /&gt;
&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Coder/Designer&amp;diff=3285</id>
		<title>Coder/Designer</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Coder/Designer&amp;diff=3285"/>
		<updated>2024-04-17T20:56:08Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Milestone 3: Design a database to store data needed to create a GRNmap input workbook */ edited file name for degradation rates&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Final Project Links}}&lt;br /&gt;
&lt;br /&gt;
The Coder/Designers are responsible for creating the Microsoft Access database that will be used by the Data Analysts to prepare an input workbook for GRNmap for the microarray dataset they are analyzing.  The Coder/Designers are also the resident experts on the technology being used—assorted software, file management, version control, and troubleshooting. He or she coordinates with Dr. Dahlquist and fellow Coders/Designers in developing the Access database and storing it on Box.&lt;br /&gt;
&lt;br /&gt;
== Guild Members ==&lt;br /&gt;
&lt;br /&gt;
* Dean&lt;br /&gt;
* Andrew&lt;br /&gt;
&lt;br /&gt;
== Milestones ==&lt;br /&gt;
&lt;br /&gt;
The milestones do not necessarily correspond to particular weeks; instead they are sets of tasks grouped together. However, Milestone 3 is a hard prerequisite for proceeding to Milestone 4, so ideally the Coder/Coder guild should finish these milestones (they require some coordination; see below) as soon as possible.&lt;br /&gt;
&lt;br /&gt;
* Coder/Designers can have a shared &amp;#039;&amp;#039;individual&amp;#039;&amp;#039; journal entry.  Both students will be given the same grade and are expected to contribute equally to the electronic lab notebook.&lt;br /&gt;
* Detailed notes should be taken throughout consistent with reproducible research and contributing to the final deliverables.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Milestone 1: Journal Club Presentation ===&lt;br /&gt;
&lt;br /&gt;
* The Coder/Designers will work with one of the QA&amp;#039;s to create and deliver a Journal Club presentation about to their assigned paper.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 2: Working Environment Setup ===&lt;br /&gt;
&lt;br /&gt;
Coder/Designer work will require the following software/accounts. The Seaver 120 lab computers are already set up for this; this list is provided for Coders/Designers who need to work on a different computer or outside of the lab.&lt;br /&gt;
* Microsoft Access&lt;br /&gt;
* Box account (provided by LMU)&lt;br /&gt;
** Databases created by the teams will be kept in a [https://lmu.box.com/s/lorqp5d5hkzqb7q161ldb66mhqfham3f &amp;quot;BIOL367_Spring2024&amp;quot; Box folder].&lt;br /&gt;
** Coder/Designer guild members have rights as editor to this folder; all others in the class can only view/download.&lt;br /&gt;
** This folder will serve as as the version control mechanism for the Coder/Designer guild.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 3: Design a database to store data needed to create a GRNmap input workbook ===&lt;br /&gt;
&lt;br /&gt;
* Designer/Coders will work with the QA&amp;#039;s to create a MS Access Database that will contain data needed to create a GRNmap input workbook.  It will need to have the following tables:&lt;br /&gt;
** A &amp;lt;code&amp;gt;gene&amp;lt;/code&amp;gt; table that contains all of the gene IDs for the entire yeast genome, obtained from [https://yeastmine.yeastgenome.org/yeastmine/begin.do YeastMine].&lt;br /&gt;
** An &amp;lt;code&amp;gt;expression&amp;lt;/code&amp;gt; table to store the yeast time-course microarray data for the dataset being analyzed by the Data Analysts.  You will consult with the Data Analysts and QA&amp;#039;s to figure out the sample-data relationships and how that should be encoded as fields in the database.&lt;br /&gt;
** A &amp;lt;code&amp;gt;degradation_rates&amp;lt;/code&amp;gt; table that contains degradation rates from Neymotin et al. (2014).  This table is provided at [[Media:Degradation_rates.txt]].&lt;br /&gt;
** A &amp;lt;code&amp;gt;production_rates&amp;lt;/code&amp;gt; table that contains initial guesses for the production rates for each gene.  This table is provided at [[Media:Production_rates.txt | this link]].&lt;br /&gt;
** A &amp;lt;code&amp;gt;network&amp;lt;/code&amp;gt; table that contains the gene regulatory network data from the Harbison et al. (2004) paper.  [[Media:Pvalbygene_forpaper_abbr.xls | Here is the link to the data.]]&lt;br /&gt;
** A &amp;lt;code&amp;gt;metadata&amp;lt;/code&amp;gt; table that encodes information about the database itself, i.e., other tables in the database.&lt;br /&gt;
*** A major part of the design work will be to figure out what information needs to be in the metadata table so that queries can be easily and uniquely performed on the data.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 4: Build the database ===&lt;br /&gt;
&lt;br /&gt;
* Once the design work has been completed, you need to actually import the data into the database.&lt;br /&gt;
&lt;br /&gt;
=== Milestone 5: Validation, Quality Assurance, and Documentation of the Database ===&lt;br /&gt;
&lt;br /&gt;
* The QA will perform quality assurance to make sure that the database is correct and accurate.&lt;br /&gt;
** In particular, the QA&amp;#039;s need to make sure that all of the rows of data were imported into the database for each table.&lt;br /&gt;
** The QA&amp;#039;s will make sure that both the ID (SGD systematic name) and Standard Names are included in the expression table and are correct.&lt;br /&gt;
* QA&amp;#039;s will communicate to the Coder/Designers any changes needed to the database.&lt;br /&gt;
* With the QA&amp;#039;s finalize the [https://www.quackit.com/microsoft_access/microsoft_access_2016/howto/how_to_create_a_database_diagram_in_access_2016.cfm database schema diagram]&lt;br /&gt;
&lt;br /&gt;
=== Milestone 6: Document the schema and design queries to create the GRNmap input workbook ===&lt;br /&gt;
&lt;br /&gt;
* Assist the Data Analysts with the queries needed to creat a GRNmap input workbook&lt;br /&gt;
&lt;br /&gt;
{{Final Project Links}}&lt;br /&gt;
&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Template:MSymond1&amp;diff=3261</id>
		<title>Template:MSymond1</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Template:MSymond1&amp;diff=3261"/>
		<updated>2024-04-16T21:21:45Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Assignment Pages */ week 13&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====User Page====&lt;br /&gt;
*[[User:Msymond1]]&lt;br /&gt;
====Assignment Pages====&lt;br /&gt;
*[[Week 1]]&lt;br /&gt;
*[[Week 2]]&lt;br /&gt;
*[[Week 3]]&lt;br /&gt;
*[[Week 4]]&lt;br /&gt;
*[[Week 5]]&lt;br /&gt;
*[[Week 6]]&lt;br /&gt;
*[[Week 8]]&lt;br /&gt;
*[[Week 9]]&lt;br /&gt;
*[[Week 10]]&lt;br /&gt;
*[[Week 12]]&lt;br /&gt;
*[[Week 13]]&lt;br /&gt;
&lt;br /&gt;
====Individual Journal Pages====&lt;br /&gt;
*[[MSymond1 Week 1]]&lt;br /&gt;
*[[MSymond1 Week 2]]&lt;br /&gt;
*[[MSymond1 KMill104 Week 3]]&lt;br /&gt;
*[[NeMO_Week4]]&lt;br /&gt;
*[[MSymond1 Week 5]]&lt;br /&gt;
*[[MSymond1 Week 6]]&lt;br /&gt;
*[[MSymond1 Week 8]]&lt;br /&gt;
*[[MSymond1 Week 9]]&lt;br /&gt;
*[[MSymond1 Week 10]]&lt;br /&gt;
*[[MSymond1 Week 12]]&lt;br /&gt;
*[[MSymond1 Week 13]]&lt;br /&gt;
&lt;br /&gt;
====Class Journal Pages====&lt;br /&gt;
*[[Class Journal Week 1]]&lt;br /&gt;
*[[Class Journal Week 2]]&lt;br /&gt;
*[[Class Journal Week 3]]&lt;br /&gt;
*[[Class Journal Week 4]]&lt;br /&gt;
*[[Class Journal Week 5]]&lt;br /&gt;
*[[Class Journal Week 6]]&lt;br /&gt;
*[[Class Journal Week 8]]&lt;br /&gt;
*[[Class Journal Week 9]]&lt;br /&gt;
*[[Class Journal Week 10]]&lt;br /&gt;
[[Category:Journal Entry]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Template:MSymond1&amp;diff=3260</id>
		<title>Template:MSymond1</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=Template:MSymond1&amp;diff=3260"/>
		<updated>2024-04-16T21:21:27Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Individual Journal Pages */ added week 13&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;====User Page====&lt;br /&gt;
*[[User:Msymond1]]&lt;br /&gt;
====Assignment Pages====&lt;br /&gt;
*[[Week 1]]&lt;br /&gt;
*[[Week 2]]&lt;br /&gt;
*[[Week 3]]&lt;br /&gt;
*[[Week 4]]&lt;br /&gt;
*[[Week 5]]&lt;br /&gt;
*[[Week 6]]&lt;br /&gt;
*[[Week 8]]&lt;br /&gt;
*[[Week 9]]&lt;br /&gt;
*[[Week 10]]&lt;br /&gt;
*[[Week 12]]&lt;br /&gt;
&lt;br /&gt;
====Individual Journal Pages====&lt;br /&gt;
*[[MSymond1 Week 1]]&lt;br /&gt;
*[[MSymond1 Week 2]]&lt;br /&gt;
*[[MSymond1 KMill104 Week 3]]&lt;br /&gt;
*[[NeMO_Week4]]&lt;br /&gt;
*[[MSymond1 Week 5]]&lt;br /&gt;
*[[MSymond1 Week 6]]&lt;br /&gt;
*[[MSymond1 Week 8]]&lt;br /&gt;
*[[MSymond1 Week 9]]&lt;br /&gt;
*[[MSymond1 Week 10]]&lt;br /&gt;
*[[MSymond1 Week 12]]&lt;br /&gt;
*[[MSymond1 Week 13]]&lt;br /&gt;
&lt;br /&gt;
====Class Journal Pages====&lt;br /&gt;
*[[Class Journal Week 1]]&lt;br /&gt;
*[[Class Journal Week 2]]&lt;br /&gt;
*[[Class Journal Week 3]]&lt;br /&gt;
*[[Class Journal Week 4]]&lt;br /&gt;
*[[Class Journal Week 5]]&lt;br /&gt;
*[[Class Journal Week 6]]&lt;br /&gt;
*[[Class Journal Week 8]]&lt;br /&gt;
*[[Class Journal Week 9]]&lt;br /&gt;
*[[Class Journal Week 10]]&lt;br /&gt;
[[Category:Journal Entry]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3181</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3181"/>
		<updated>2024-04-11T07:07:04Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added slides&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024).&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024).&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024).&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024).&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006).&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024).&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024).&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024).&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
==Presentation==&lt;br /&gt;
[https://docs.google.com/presentation/d/1I73nHmqkM8bI4R0cw8sK5-OsMfS-2wcXwj-fDVgbZqQ/edit?usp=sharing|Google Slides]&lt;br /&gt;
&lt;br /&gt;
==Meta Data==&lt;br /&gt;
#Harbison, C. T., Gordon, D. B., Lee, T. I., Rinaldi, N. J., Macisaac, K. D., Danford, T. W., Hannett, N. M., Tagne, J. B., Reynolds, D. B., Yoo, J., Jennings, E. G., Zeitlinger, J., Pokholok, D. K., Kellis, M., Rolfe, P. A., Takusagawa, K. T., Lander, E. S., Gifford, D. K., Fraenkel, E., &amp;amp; Young, R. A. (2004). Transcriptional regulatory code of a eukaryotic genome. Nature, 431(7004), 99–104. https://doi.org/10.1038/nature02800&lt;br /&gt;
#[https://pubmed.ncbi.nlm.nih.gov/15343339/|The link to the abstract from PubMed]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/|The link to the full text of the article in PubMedCentral]&lt;br /&gt;
#[https://www.nature.com/articles/nature02800|The link to the full text of the article (HTML format) from the publisher web site.]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/|The link to the full PDF version of the article from the publisher web site.]&lt;br /&gt;
#The copyright is owned by the Author because it is an Author Manuscript&lt;br /&gt;
# Once I open the full text, I do see &amp;quot;Public Access&amp;quot;.&lt;br /&gt;
#* The article is open access.&lt;br /&gt;
#*Accessing it from NIH was free, however to access it from &amp;quot;nature&amp;quot; it says that it is a subscription content and must be accessed via the institution&lt;br /&gt;
#The journal Nature is available for print, since it is available to subscribe to print.&lt;br /&gt;
#The publisher of the Journal is Nature Portfolio, which is part of Springer Nature, they are for profit. They are not a member of OAPA.&lt;br /&gt;
#Since 1989&lt;br /&gt;
#Yes, the articles in this journal are peer-reviewed.&lt;br /&gt;
# [https://www.nature.com/nature/editors|scientific advisory board/editorial board of the journal.]&lt;br /&gt;
# 64.8 was the 2 year impact factor and 60.9 was the 5 year impact factor (2022).&lt;br /&gt;
#The article was submitted on March 11 2004&lt;br /&gt;
#The article was accepted on July 1st 2004&lt;br /&gt;
#No/unknown, all it says is that it was published in final edited form 2004, Sept 2.&lt;br /&gt;
#The article was published on September 2 2004&lt;br /&gt;
#7 months&lt;br /&gt;
# Whitehead Institute of Biomedical Research, Massachusetts Institute of Technology, MIT Computer Science and Artificial Intelligence Laboratory&lt;br /&gt;
#One of the authors, Christopher T Harbison, had published a paper in 2002 on Transcriptional Regulatory Networks in Saccharomyces cerevisiae. He also published a paper on Genome-wide map of nucleosome acetylation and methylation in yeast in 2005. Another author,  D Benjamin Gordon, also published a paper relating to transcription factors in 2004, as well as An improved map of conserved regulatory sites for Saccharomyces cerevisiae in 2006. &lt;br /&gt;
#Yes, “Some authors have filed a patent application covering aspects of this work and are pursuing commercialization.”&lt;br /&gt;
#There isn&amp;#039;t data associated with the dataset.&lt;br /&gt;
#This article has cites 30 articles, and has been cited by 1671 articles.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
*National Cancer Institute. Microarray. Retrieved April 10, 2024, from https://www.cancer.gov/search/results?swKeyword=microarray&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3180</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3180"/>
		<updated>2024-04-11T07:05:39Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Meta Data */ finished&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024).&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024).&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024).&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024).&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006).&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024).&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024).&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024).&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Meta Data==&lt;br /&gt;
#Harbison, C. T., Gordon, D. B., Lee, T. I., Rinaldi, N. J., Macisaac, K. D., Danford, T. W., Hannett, N. M., Tagne, J. B., Reynolds, D. B., Yoo, J., Jennings, E. G., Zeitlinger, J., Pokholok, D. K., Kellis, M., Rolfe, P. A., Takusagawa, K. T., Lander, E. S., Gifford, D. K., Fraenkel, E., &amp;amp; Young, R. A. (2004). Transcriptional regulatory code of a eukaryotic genome. Nature, 431(7004), 99–104. https://doi.org/10.1038/nature02800&lt;br /&gt;
#[https://pubmed.ncbi.nlm.nih.gov/15343339/|The link to the abstract from PubMed]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/|The link to the full text of the article in PubMedCentral]&lt;br /&gt;
#[https://www.nature.com/articles/nature02800|The link to the full text of the article (HTML format) from the publisher web site.]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/|The link to the full PDF version of the article from the publisher web site.]&lt;br /&gt;
#The copyright is owned by the Author because it is an Author Manuscript&lt;br /&gt;
# Once I open the full text, I do see &amp;quot;Public Access&amp;quot;.&lt;br /&gt;
#* The article is open access.&lt;br /&gt;
#*Accessing it from NIH was free, however to access it from &amp;quot;nature&amp;quot; it says that it is a subscription content and must be accessed via the institution&lt;br /&gt;
#The journal Nature is available for print, since it is available to subscribe to print.&lt;br /&gt;
#The publisher of the Journal is Nature Portfolio, which is part of Springer Nature, they are for profit. They are not a member of OAPA.&lt;br /&gt;
#Since 1989&lt;br /&gt;
#Yes, the articles in this journal are peer-reviewed.&lt;br /&gt;
# [https://www.nature.com/nature/editors|scientific advisory board/editorial board of the journal.]&lt;br /&gt;
# 64.8 was the 2 year impact factor and 60.9 was the 5 year impact factor (2022).&lt;br /&gt;
#The article was submitted on March 11 2004&lt;br /&gt;
#The article was accepted on July 1st 2004&lt;br /&gt;
#No/unknown, all it says is that it was published in final edited form 2004, Sept 2.&lt;br /&gt;
#The article was published on September 2 2004&lt;br /&gt;
#7 months&lt;br /&gt;
# Whitehead Institute of Biomedical Research, Massachusetts Institute of Technology, MIT Computer Science and Artificial Intelligence Laboratory&lt;br /&gt;
#One of the authors, Christopher T Harbison, had published a paper in 2002 on Transcriptional Regulatory Networks in Saccharomyces cerevisiae. He also published a paper on Genome-wide map of nucleosome acetylation and methylation in yeast in 2005. Another author,  D Benjamin Gordon, also published a paper relating to transcription factors in 2004, as well as An improved map of conserved regulatory sites for Saccharomyces cerevisiae in 2006. &lt;br /&gt;
#Yes, “Some authors have filed a patent application covering aspects of this work and are pursuing commercialization.”&lt;br /&gt;
#There isn&amp;#039;t data associated with the dataset.&lt;br /&gt;
#This article has cites 30 articles, and has been cited by 1671 articles.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
*National Cancer Institute. Microarray. Retrieved April 10, 2024, from https://www.cancer.gov/search/results?swKeyword=microarray&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3179</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3179"/>
		<updated>2024-04-11T07:04:00Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Meta Data */ edited&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024).&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024).&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024).&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024).&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006).&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024).&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024).&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024).&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Meta Data==&lt;br /&gt;
#Harbison, C. T., Gordon, D. B., Lee, T. I., Rinaldi, N. J., Macisaac, K. D., Danford, T. W., Hannett, N. M., Tagne, J. B., Reynolds, D. B., Yoo, J., Jennings, E. G., Zeitlinger, J., Pokholok, D. K., Kellis, M., Rolfe, P. A., Takusagawa, K. T., Lander, E. S., Gifford, D. K., Fraenkel, E., &amp;amp; Young, R. A. (2004). Transcriptional regulatory code of a eukaryotic genome. Nature, 431(7004), 99–104. https://doi.org/10.1038/nature02800&lt;br /&gt;
#[https://pubmed.ncbi.nlm.nih.gov/15343339/|The link to the abstract from PubMed]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/]&lt;br /&gt;
#[https://www.nature.com/articles/nature02800]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/]&lt;br /&gt;
#The copyright is owned by the Author because it is an Author Manuscript&lt;br /&gt;
# Once I open the full text, I do see &amp;quot;Public Access&amp;quot;.&lt;br /&gt;
#* The article is open access.&lt;br /&gt;
#*Accessing it from NIH was free, however to access it from &amp;quot;nature&amp;quot; it says that it is a subscription content and must be accessed via the institution&lt;br /&gt;
#The journal Nature is available for print, since it is available to subscribe to print.&lt;br /&gt;
#The publisher of the Journal is Nature Portfolio, which is part of Springer Nature, they are for profit. They are not a member of OAPA.&lt;br /&gt;
#Since 1989&lt;br /&gt;
#Yes, the articles in this journal are peer-reviewed.&lt;br /&gt;
# [https://www.nature.com/nature/editors]&lt;br /&gt;
# 64.8 was the 2 year impact factor and 60.9 was the 5 year impact factor (2022).&lt;br /&gt;
#The article was submitted on March 11 2004&lt;br /&gt;
#The article was accepted on July 1st 2004&lt;br /&gt;
#No/unknown, all it says is that it was published in final edited form 2004, Sept 2.&lt;br /&gt;
#The article was published on September 2 2004&lt;br /&gt;
#7 months&lt;br /&gt;
# Whitehead Institute of Biomedical Research, Massachusetts Institute of Technology, MIT Computer Science and Artificial Intelligence Laboratory&lt;br /&gt;
#One of the authors, Christopher T Harbison, had published a paper in 2002 on Transcriptional Regulatory Networks in Saccharomyces cerevisiae. He also published a paper on Genome-wide map of nucleosome acetylation and methylation in yeast in 2005. Another author,  D Benjamin Gordon, also published a paper relating to transcription factors in 2004, as well as An improved map of conserved regulatory sites for Saccharomyces cerevisiae in 2006. &lt;br /&gt;
#Yes, “Some authors have filed a patent application covering aspects of this work and are pursuing commercialization.”&lt;br /&gt;
#There isn&amp;#039;t data associated with the dataset.&lt;br /&gt;
#This article has cites 30 articles, and has been cited by 1671 articles.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
*National Cancer Institute. Microarray. Retrieved April 10, 2024, from https://www.cancer.gov/search/results?swKeyword=microarray&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3178</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3178"/>
		<updated>2024-04-11T07:03:23Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: added bibliography&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024).&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024).&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024).&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024).&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006).&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024).&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024).&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024).&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Meta Data==&lt;br /&gt;
#Harbison, C. T., Gordon, D. B., Lee, T. I., Rinaldi, N. J., Macisaac, K. D., Danford, T. W., Hannett, N. M., Tagne, J. B., Reynolds, D. B., Yoo, J., Jennings, E. G., Zeitlinger, J., Pokholok, D. K., Kellis, M., Rolfe, P. A., Takusagawa, K. T., Lander, E. S., Gifford, D. K., Fraenkel, E., &amp;amp; Young, R. A. (2004). Transcriptional regulatory code of a eukaryotic genome. Nature, 431(7004), 99–104. https://doi.org/10.1038/nature02800&lt;br /&gt;
#[https://pubmed.ncbi.nlm.nih.gov/15343339/]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/]&lt;br /&gt;
#[https://www.nature.com/articles/nature02800]&lt;br /&gt;
#[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3006441/]&lt;br /&gt;
#The copyright is owned by the Author because it is an Author Manuscript&lt;br /&gt;
# Once I open the full text, I do see &amp;quot;Public Access&amp;quot;.&lt;br /&gt;
#* The article is open access.&lt;br /&gt;
#*Accessing it from NIH was free, however to access it from &amp;quot;nature&amp;quot; it says that it is a subscription content and must be accessed via the institution&lt;br /&gt;
#The journal Nature is available for print, since it is available to subscribe to print.&lt;br /&gt;
#The publisher of the Journal is Nature Portfolio, which is part of Springer Nature, they are for profit. They are not a member of OAPA.&lt;br /&gt;
#Since 1989&lt;br /&gt;
#Yes, the articles in this journal are peer-reviewed.&lt;br /&gt;
# [https://www.nature.com/nature/editors]&lt;br /&gt;
# 64.8 was the 2 year impact factor and 60.9 was the 5 year impact factor (2022).&lt;br /&gt;
#The article was submitted on March 11 2004&lt;br /&gt;
#The article was accepted on July 1st 2004&lt;br /&gt;
#No/unknown, all it says is that it was published in final edited form 2004, Sept 2.&lt;br /&gt;
#The article was published on September 2 2004&lt;br /&gt;
#7 months&lt;br /&gt;
# Whitehead Institute of Biomedical Research, Massachusetts Institute of Technology, MIT Computer Science and Artificial Intelligence Laboratory&lt;br /&gt;
#One of the authors, Christopher T Harbison, had published a paper in 2002 on Transcriptional Regulatory Networks in Saccharomyces cerevisiae. He also published a paper on Genome-wide map of nucleosome acetylation and methylation in yeast in 2005. Another author,  D Benjamin Gordon, also published a paper relating to transcription factors in 2004, as well as An improved map of conserved regulatory sites for Saccharomyces cerevisiae in 2006. &lt;br /&gt;
#Yes, “Some authors have filed a patent application covering aspects of this work and are pursuing commercialization.”&lt;br /&gt;
#There isn&amp;#039;t data associated with the dataset.&lt;br /&gt;
#This article has cites 30 articles, and has been cited by 1671 articles.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
*National Cancer Institute. Microarray. Retrieved April 10, 2024, from https://www.cancer.gov/search/results?swKeyword=microarray&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3072</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3072"/>
		<updated>2024-04-11T04:17:56Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Individual Journal Page */ took away links&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024).&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024).&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024).&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024).&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006).&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024).&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024).&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024).&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
*National Cancer Institute. Microarray. Retrieved April 10, 2024, from https://www.cancer.gov/search/results?swKeyword=microarray&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3070</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3070"/>
		<updated>2024-04-11T04:16:52Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* References */ added reference&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
*National Cancer Institute. Microarray. Retrieved April 10, 2024, from https://www.cancer.gov/search/results?swKeyword=microarray&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3069</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3069"/>
		<updated>2024-04-11T04:15:57Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Individual Journal Page */ added in text citation&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#*Microarray: A laboratory tool used to analyze large numbers of genes or proteins at one time (National Cancer Institute, 2024).&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3053</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3053"/>
		<updated>2024-04-11T01:44:20Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* Individual Journal Page */ fixed link&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions.&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional.&lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3052</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3052"/>
		<updated>2024-04-11T01:43:35Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* References */ finished references&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions (Biology Online, 2024). https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional. &lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
*Cammack, Richard, et al. Oxford Reference. (2006). Immunoprecipitate. Retrieved April 10, 2024, from https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
*National Cancer Institute. PCR. Retrieved April 10, 2024, from https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3051</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3051"/>
		<updated>2024-04-11T01:34:20Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* References */ continued references&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions (Biology Online, 2024). https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional. &lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
*Gene Ontology. DNA ligation. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
*Biology Online. Lysis. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
*Biology Online. Phylogenetics. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
*Biology Online. Epitope. Retrieved April 10, 2024, from https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3050</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3050"/>
		<updated>2024-04-11T01:27:07Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: /* References */ continued references&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions (Biology Online, 2024). https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional. &lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
*Gene Ontology. Transcription regulator activity. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0140110&lt;br /&gt;
*Gene Ontology. Transcription cis-regulatory region binding. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
*Gene Ontology. Respiratory electron transport chain. Retrieved April 10, 2024, from https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3049</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3049"/>
		<updated>2024-04-11T01:18:37Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: finished acknowledgements&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions (Biology Online, 2024). https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional. &lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. Except for what is noted above, this individual journal entry was completed by me and not copied from another source.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
	<entry>
		<id>https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3048</id>
		<title>MSymond1 Week 12</title>
		<link rel="alternate" type="text/html" href="https://xmlpipedb2024.lmucs.io/biodb/spring2024/index.php?title=MSymond1_Week_12&amp;diff=3048"/>
		<updated>2024-04-11T01:14:16Z</updated>

		<summary type="html">&lt;p&gt;Msymond1: started acknowledgements and referencesdgements&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Individual Journal Page==&lt;br /&gt;
#A list of biological terms from the paper I did not know the definitions for when I first read the article&lt;br /&gt;
#*transcription regulator activity: A molecular function that controls the rate, timing and/or magnitude of gene transcription. The function of transcriptional regulators is to modulate gene expression at the transcription step so that they are expressed in the right cell at the right time and in the right amount throughout the life of the cell and the organism. Genes are transcriptional units, and include bacterial operons (Gene Ontology, 2024) https://amigo.geneontology.org/amigo/term/GO:0140110.&lt;br /&gt;
#*transcription cis-regulatory region binding: Binding to a specific sequence of DNA that is part of a regulatory region that controls transcription of that section of the DNA. The transcribed region might be described as a gene, cistron, or operon (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0000976&lt;br /&gt;
#*respiratory electron transport chain: A process in which a series of electron carriers operate together to transfer electrons from donors such as NADH and FADH2 to any of several different terminal electron acceptors to generate a transmembrane electrochemical gradient (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0022904&lt;br /&gt;
#*Lysis: The disintegration or rupture of the cell membrane, resulting in the release of cell contents or the subsequent death of the cell (Biology Online, 2024). https://www.biologyonline.com/dictionary/lysis&lt;br /&gt;
#*immunoprecipitate: the precipitate formed in an antigen‐antibody reaction (Oxford Reference, 2006). https://www.oxfordreference.com/display/10.1093/acref/9780198529170.001.0001/acref-9780198529170-e-9850?rskey=ZIIfXn&amp;amp;result=1&lt;br /&gt;
#*DNA ligation: The re-formation of a broken phosphodiester bond in the DNA backbone, carried out by DNA ligase (Gene Ontology, 2024). https://amigo.geneontology.org/amigo/term/GO:0006266&lt;br /&gt;
#*Phylogeny: the scientific study of phylogeny. It studies evolutionary relationships among various groups of organisms based on evolutionary history, similarities, and differences. It makes use of molecular sequencing data (such as homologous sequences, protein sequences, nucleotide sequences, etc.) and morphological data matrices to understand and analyze the protein and gene evolutions of genetically-related groups of organisms (Biology Online, 2024). https://www.biologyonline.com/dictionary/phylogenetics&lt;br /&gt;
#*PCR: A laboratory method used to make many copies of a specific piece of DNA from a sample that contains very tiny amounts of that DNA. PCR allows these pieces of DNA to be amplified so they can be detected. PCR may be used to look for certain changes in a gene or chromosome, which may help find and diagnose a genetic condition or a disease, such as cancer. It may also be used to look at pieces of the DNA of certain bacteria, viruses, or other microorganisms to help diagnose an infection. Also called polymerase chain reaction (National Cancer Institute, 2024). https://www.cancer.gov/publications/dictionaries/cancer-terms/def/pcr&lt;br /&gt;
#*Epitope: That part of an antigenic molecule to which the t-cell receptor responds, a site on a large molecule against which an antibody will be produced and to which it will bind (Biology Online, 2024).https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The main Findings of the paper are that the architecture of the promoter, meaning the arrangements of the DNA binding site, change depending on environmental conditions and can be predicted with confidence what the binding arrangement will be depending on the promoter and the environmental conditions (Biology Online, 2024). https://www.biologyonline.com/dictionary/epitope&lt;br /&gt;
#The significance of these findings is the fact that they combine genome-wide location data with phylogenetic conservation data. Using both these types of data allowed to cluster all significant results from the genome wide data location based upon their conservation data.&lt;br /&gt;
#The limitations of prior studies is the fact that it cannot be determined what the location is for the recognition sites of transcriptional regulators with phylogenetic sequence data alone, or with any other prior knowledge from any previous study. The fact that the sequences have been conserved through evolution indicates that they can be regulated, but does not reveal information about the binding process, or the conditions, or the architecture of such binding.&lt;br /&gt;
#They treated the Yeast cells by using PCR and they printed about 6000 DNA fragments to represent nearly all regions in the yeast genome.&lt;br /&gt;
#They used the W303 yeast strain from Saccharomyces cerevisiae, and it was haploid. &lt;br /&gt;
#They grew them in microarrays with PCR products. The article does not specify temperature or time, in the supplementary methods section, it does list that the times varies for each of the conditions. it is as low as 20 minutes for certain conditions (namely the moderately hypertonic condition). And the time is as high as 14 hours (namely the filamentation inducing condition). It does not specify the temperature for most of the conditions, except for the elevated temperature condition in which it specifies that it begins at 30 degrees celsius and is shifted to 37 degrees celsius. &lt;br /&gt;
#The controls group they used was an unenriched microarray to compare with the immunoprecipitated samples.&lt;br /&gt;
#They ran each program 50 times on a randomly selected set of sequences. &lt;br /&gt;
#The study conducted their genome wide location analysis by cross linking the proteins to the DNA, which then created precipitate which separated the DNA from the protein. These precipitates were then went through PCR procedures to hybridize them to a microarray of spotted PCR products, each representing a different location of the yeast genome. Such locations were used to compare the probabilities of binding interactions.&lt;br /&gt;
#They used an Axon 200B scanner to scan the microarrays, they compared the immunoprecipitated sample with the unenriched sample. They found the median of each channel to calculate a normalization factor. They then calculated the log ratio of the intensity of the test channel to the control channel. The log ratios were normalized by subracting the average log ratio of every spot across all arrays. Finally, they calculated an error model by calculating the significance of enrichment on each chip, and combining the data for all replicates to calculate an average ratio and significance of enrichment for every region in the genome. &lt;br /&gt;
#their supplementary tables are available to the public for download on nature.com and these tables have the results that they were able to calculate from their data, but their raw data and calculations are not to be found available to the public.&lt;br /&gt;
#The list of figures from the article&lt;br /&gt;
#*Figure 1 has 2 parts, part a essentially states that the conclusions from this study, regarding the identification of transcription factor binding site specificities, could only be concluded when using the three kinds of data they have. They had their genome-wide location data, their phylogenetic sequence conservation data, and other previous work. Part b shows the sequence specificities of some of the regulators. There are 2 columns, one of the columns displays sequences that had already been discovered and were rediscovered with this study, and the other column shows sequences that were newly discovered by this study. Each of the letters in the sequence have a size proportional to the product of their frequency and their information content&lt;br /&gt;
#*Figure 2 has three parts, part a displays the different chromosomes, as well as certain genes located on said chromosomes with , it also shows the locations of certain DNA sequences that are bound by transcriptional regulators. They obtained this information by mapping on the yeast genome sequences the motifs that they found to be bound by regulators at high confidence that were also conserved. The functions of the specific transcriptional regulators had already been previously established. Part b of the figure combines binding data with sequence conservation data. This part of the graph is in 3 parts, the first shows all sequence matches to DNA binding specificities. The second part shows all of the sequence matches to conserved sequences, and the third part shows all sequences that match with conserved sequences that are bound by regulators. Part c of the figure is a graph that shows the frequency of binding sites in relation to the distance from translational start site on the DNA sequence. The x axis is the distance from translational start site, and the y axis is the number of binding sites.&lt;br /&gt;
#*Figure 3 shows the different promoter architectures. The first one being single regulator, the second being repetitive motifs, the third being multiple regulators, and co-occuring regulators. They display this information by putting a different color box for each regulator on different lines representing different binding site sequences. They obtained this information through their microarray experiments in this study.&lt;br /&gt;
#*Figure 4 displays the environment-specific use of transcriptional regulatory code. It shows four different patterns of binding behavior in four different rows. The different patterns being Condition invariant, condition enabled, condition expanded, and condition altered. The regulators are represented by colored circles and shown above and below the genes/promoters, and there are lines connecting them to the genes that display their binding nature depending on their environments. To the right of these charts, there are 2 lines for each binding pattern representing different environments. The regulators are displayed near the genes, and they are shown in circles or colored boxes to show whether they are binding or not depending on the environment.&lt;br /&gt;
#*Supplementary figure 1 is a graph in which the x axis is the regulator under testing (1 through 203), and the y axis is the number of promoter regions bound to said regulator. There are 2 lines on the graph, one is in blue which is unadjusted per the number of conditions the regulator was profiled under, and another line is in pink in which it averages the distributions for the same set of p values among regulators and promoter regions. This information was also obtained by their microarray data.&lt;br /&gt;
#*Supplementary figure 2 represents the data calculations conducted in this study. First all of their motifs were identified by using a variety of methods as listed in the figure, which were then filtered to determine which were significant, and then clustered based upon representative motifs, they then used conservation data to identify which motifs had the highest confidence rate. The final step is the statistical test from specificity databases to assign a specific motif to each regulator.&lt;br /&gt;
#*Supplementary figure 3 is a photo that displays the binding of Cin5 to two different sequences. It shows 15 different lanes to demonstrate the different binding results of the protein with different sequences. The first lane shows it with no competitor, the lanes 2-8 show it binding with a competitor sequence found by one of the discovered motifs in this study, and lanes 9-15 show it binding with a previously established binding site for the regulator. And the concentration of the regulator was 27 times higher in the motif discovered in this study as opposed to the previously established sequence, meaning the results of the study were able to predict the binding capacity for this protein better than previously published literature.&lt;br /&gt;
#*Supplementary figure 4 is a bar graph in which the x axis is the regulators, and the y axis is the number of promoter regions bound for each of those regulators. This figure compares the number of promoter regions bound for each regulator depending on the environment they&amp;#039;re growing in. The two different environments they compared in this figure are the rich medium environment and the amino acid starvation environment. &lt;br /&gt;
#*Supplementary figure 5 is a bar graph in which the x axis is the percent of maximum matching sites, which essentially translates to the quality of the matching sites found for each of them, which was determined based on to the best matching sequence to the Gcn4 binding specificity. The y axis is the frequency of matches found of that quality. The bars were clustered by which conditions they were grown in. &lt;br /&gt;
#This study does incorporate methods from other previous studies. Other previous studies have used conservation sequence data, but that has not allowed them to predict the binding sites and environments for each of the regulators with confidence that this study does.&lt;br /&gt;
#The authors could take the future direction of testing such transcription factors in higher eukaryotes, for if they are able to predict such binding mechanisms for yeast cells, they can likely do something similar for a higher level organism. Perhaps they will not be able to test as many regulators or have as high of a confidence rate, for I would assume it would be far more difficult to carry out such tests on another higher organism, for there are likely far more regulators and they may have a much larger genome to select from.&lt;br /&gt;
#I believe the authors in this study were able to support their conclusions well with the data acquired, but I do not believe the data was well presented or explained in the article. Much of the materials necessary to understand their methods or to know important details (temperature, time, conditions) of their experiment were not even on the article itself and had to be found in the supplementary section. And even in the supplementary section it was still a very dense topic that is very difficult to understand for anyone who is not a well established expert on the topic. Not to mention the fact there is no defined discussion or conclusion section of the paper. The final section of the article is the methods section which is rather unconventional. &lt;br /&gt;
&lt;br /&gt;
==Acknowledgements==&lt;br /&gt;
I have been in contact with my group members for this week about the presentation and questions this week. We worked together in class and texted about the presentation. I also visited my professor, Dr. Dahlquist during her office hours to ask for help in interpreting the article for this week. &lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
*Harbison CT, Gordon DB, Lee TI, Rinaldi NJ, Macisaac KD, Danford TW, Hannett NM, Tagne JB, Reynolds DB, Yoo J, Jennings EG, Zeitlinger J, Pokholok DK, Kellis M, Rolfe PA, Takusagawa KT, Lander ES, Gifford DK, Fraenkel E, Young RA. Transcriptional regulatory code of a eukaryotic genome. Nature. 2004 Sep 2;431(7004):99-104. doi: 10.1038/nature02800. PMID: 15343339; PMCID: PMC3006441.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{Template:MSymond1}}&lt;br /&gt;
[[Category:Team Project]]&lt;/div&gt;</summary>
		<author><name>Msymond1</name></author>
		
	</entry>
</feed>