ific TFs across multi ple cell lines. The thickness with the solid line connecting a noncanonical motif to a cell line indicates the proportion of data sets in that cell line that revealed the motif as a noncanonical GANT61 motif. We highlight a number of motifs that had been often discovered as noncanonical motifs inside a certain cell line. PU. 1 was most often discovered in GM12878 cells. Its corresponding TF SPI1, a member with the ETS family, activates GANT61 gene expres sion for the duration of myeloid and B lymphoid cell development. The SPI1 gene is expressed in both GM12878 and K562 cells, but not in the other three cell lines. On the other hand, another member with the ETS family, SPIB, is only expressed in GM12878 cells, as well as the SPIB gene shows substantial TF binding web-sites specifically in GM12878 cells.
SPIB and SPI1 have the very same canonical motif and are both necessary for B cell devel opment. GATA1 cell line show enriched TF binding web-sites in the corresponding cell line. This really is, indeed, the case to get a huge fraction of genes, and Figure SC144 4A shows five examples, 1 per cell line. FCER2 is actually a key gene for B cell function. It truly is extremely and specifically expressed in GM12878. Its promoter region and gene body are bound by nine TFs in GM12878, such as SPI1. The G protein coupled receptor GPRC5A plays a function in epi thelial cell differentiation. It truly is extremely and specifically expressed in HeLa cells, and accordingly, its promoter region and gene body are bound by seven TFs in HeLa cells. The Abd B homeobox family member HOXB9 is actually a sequence particular transcription factor.
It truly is extremely and specifically expressed in K562 cells, and accordingly, its promoter regions and gene body Protein precursor are bound by seven TFs such as GATA1 TAL1 in K562 cells. SERPINA1 encodes a serine protease inhibitor, and defects in this gene can cause liver diseases. It truly is four orders of magnitude additional extremely expressed in HepG2 than in the other four cell lines. FOXA, HNF4, RXRA, TCF7L2, and eight other TFs bind near this gene in HepG2 but not in other cell lines. AC104304 encodes to get a putative teratocarcinoma derived growth factor that plays an important function in embryonic development. It truly is extremely expressed in H1 hESC and bound by eight TFs, such as NANOG. We then asked whether or not the noncanonical motifs we discov ered also reflect cell variety specificity.
Figure 4B plots the noncanonical motifs detected in the ChIP seq data sets of sequence particular TFs for every with the five cell lines with the most ENCODE ChIP seq data sets. Cell line particular, noncanonical was one of the most often discovered noncanonical motif SC144 in K562 cells. It truly is bound GANT61 by the GATA family of TFs, which are necessary for erythroid development by regulating the fetal to adult switch of hemoglobin production. The GATA1 gene is extremely expressed in K562 cells but not in the other four cell lines and shows substantial binding web-sites only in the K562 cell line. FOXA and HNF4 are the most often identified noncanonical motifs in HepG2 cells. Their correspond ing TFs are activators of numerous liver particular genes and are necessary for hepatocyte function. Both the FOXA1 and HNF4 genes are more than 10 fold additional extremely expressed and show additional substantial TF binding web-sites in the HepG2 cell line than in the other four cell lines.
The SOX2 OCT4 combined motif was one of the most often identified noncanonical motif in H1 hESC cells. OCT4 will be the canonical motif of POU5F1, a POU homeodomain containing TF essential SC144 for embryonic stem cell pluripotency. Their corresponding TFs type a protein protein complex and are essential for embryonic stem cell pluripotency. GANT61 Both POU5F1 and SOX2 are exclusively expressed in H1 hESC cells and extensively regulated by a large quantity of TFs, such as by themselves. Tethered binding of non sequence particular TFs In Figure 4B, we also integrated all non sequence particular TFs for which there are ChIP seq data in these cell lines. Dashed lines connect non sequence particular TFs to the motifs discovered in their ChIP seq peaks.
Two non sequence particular TFs show cell line particular enrichment in motifs the enhancer binding protein EP300 as well as the histone deacetylase HDAC2. You can find seven data sets for EP300 in seven different cell lines and three data sets for HDAC2 in three different cell lines. Distinct motifs had been found in different cell lines SPI1 for SC144 EP300 in GM12878 cells, GATA1 for both EP300 and HDAC2 in K562 cells, FOXA and HNF4 for HDAC2, and FOXA and TCF7L2 for EP300 in HepG2 cells, SOX2 OCT4 and UA9 for HDAC2, and TEAD1 for EP300 in H1 hESC cells, and CEBPB, AP 1, and CREB for EP300 in HeLa cells. As described in the earlier section, numerous of these motifs had been most often and specifically observed as secondary motifs for sequence particular TFs in the respective cell lines. Due to the fact non sequence particular TFs don't bind DNA directly, they tether onto sequence particular TFs to bind target DNA. EP300 is recognized to interact with AP 1 and CEBPB and HDAC2 with TAL1 GATA. Our results highlight that the
Tuesday, December 24, 2013
A Debate Over Ruthless GANT61SC144 -Methods
Subscribe to:
Post Comments (Atom)
No comments:
Post a Comment