|
|
|
|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Vol. 13, Issue 5, 1608-1614, May 2002
Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115
Submitted September 28, 2001; Revised January 10, 2002; Accepted February 8, 2002| |
ABSTRACT |
|---|
|
|
|---|
Ohno [Ohno, S. (1970) in Evolution by Gene Duplication, Springer, New York] proposed that gene duplication with subsequent divergence of paralogs could be a major force in the evolution of new gene functions. In practice the functional differences between closely related homologues produced by duplications can be subtle and difficult to separate experimentally. Here we show that DNA microarrays can distinguish the functions of two closely related homologues from the yeast Saccharomyces cerevisiae, Yap1p and Yap2p. Although Yap1p and Yap2p are both bZIP transcription factors involved in multiple stress responses and are 88% identical in their DNA binding domains, our work shows that these proteins activate nonoverlapping sets of genes. Yap1p controls a set of genes involved in detoxifying the effects of reactive oxygen species, whereas Yap2p controls a set of genes over represented for the function of stabilizing proteins. In addition we show that the binding sites in the promoters of the Yap1p-dependent genes differ from the sites in the promoters of Yap2p-dependent genes and we validate experimentally that these differences are important for regulation by Yap1p. We conclude that while Yap1p and Yap2p may have some overlapping functions they are clearly not redundant and, more generally, that DNA microarray analysis will be an important tool for distinguishing the functions of the large numbers of highly conserved genes found in all eukaryotic genomes.
| |
INTRODUCTION |
|---|
|
|
|---|
DNA microarrays can reveal functional
similarities between genes with little or no sequence homology. This is
because the whole-genome mRNA expression patterns that result from the
mutation of genes with similar functions are often very similar and can be thought of as "molecular phenotypes" (Hughes et al.,
2000b
). As a case study to determine whether these molecular phenotypes are sensitive enough to discriminate between the functions of closely
related transcription factors, we chose to study Yap1p and Yap2p.
Although previous experiments with DNA microarrays demonstrated that a
number of genes involved in stress response show Yap1p-dependent
expression (Gasch et al., 2000
), little is known about the
differences between genes regulated by Yap1p versus Yap2p. Yap1p and
Yap2p are 88% identical in their DNA binding regions and have both
been shown to bind the same consensus site (TTAGTAA; Fernandes et
al., 1997
). Furthermore, overexpression of either protein induces
resistance to multiple cellular stresses (Schnell et al.,
1992
; Bossier et al., 1993
; Wu et al., 1993
; Hirata et al., 1994b
; Stephen et al., 1995
).
Whether Yap1p and Yap2p exert these similar phenotypic effects by
controlling the same or different sets of genes has remained unclear.
One study did identify three genes whose expression are dependent on
Yap1p but not Yap2p (Stephen et al., 1995
). However, no
targets for Yap2p have yet been identified. If and how Yap1p and Yap2p
show specificity toward different regulons are also unresolved
questions because both proteins bind to and activate transcription from the same consensus sequence (Hirata et al., 1994b
; Fernandes
et al., 1997
). To begin to answer these questions we used
whole-genome microarrays to measure the expression of all the genes in
the genome in wild-type, yap1
, yap2
, and
yap1
yap2
cells grown in minimal medium.
Because Yap1p and Yap2p are implicated in the response to cellular
stresses we also measured expression in cells treated with the
oxidizing agent hydrogen peroxide
(H2O2) and the metal
cadmium (Cd2+). In this report we focus on the
response to H2O2, but the
full dataset is available at http://arep.med.harvard.edu/ExpressDB.
| |
MATERIALS AND METHODS |
|---|
|
|
|---|
Yeast Manipulations
Strain BY4740 (MATa, leu2
0, lys2
0, ura3
0)
was used as the control strain in this study and yap1
,
yap2
, and yap1
yap2
derivatives were constructed as described (Brachmann et al.,
1998
). For RNA extractions all strains were grown to mid log phase in minimal media and induced for 1 h with either 0.6 mM
H2O2, 1 µM CdCl2, or mock treated, and mRNA was extracted,
labeled and hybridized to oligonucleotide arrays as described (Wodicka
et al., 1997
). All experiments were repeated at least twice
(sometimes three times) and the average expression level of the
independent experiments was used for the analysis.
For plating assays, all strains were grown to OD600 of 0.4, dilutions were made and 5 µL of each dilution was spotted onto the appropriate medium
-Galactosidase assays were performed as described (Dudley et
al., 1999
).
Plasmid Constructions
To create the wild-type YKL086W reporter gene primers
BC248 (5'-CGGAATTCTATGTAAAATAGAGACGAATGAAAA-3') and BC249
(5'-GCCCTTATTGTGGCCACCATTGCGTC-3') were used to amplify the
YKL086W promoter region and this fragment was cloned into
the EcoRI and BamHI sites of pSEYC102 (Gift of Fred Winston). The resulting plasmid was named pBC266. All mutant constructs were derived from pBC266 using sequential PCR mutagenesis (Ausubel et al., 1994
). For mutation of the core base pairs
in the extended site we used primers BC285
(5'-CGATTGCTTTTTCCCTGATccGcAAGCTACATCATTTATAC-3') and BC286
(5'-GTATAAATGATGTAGCTTgCggATCAGGGAAAAAGCAATCG-3') and for mutation of
the flanking residues in the extended sited we used primers BC283
(5'-CGATTGCTTTTTCCCTGgTTAGTAAcaTACATCATTTATAC-3') and BC284
(5'-GTATAAATGATGTAtgTTACTAAcCAGGGAAAAAGCAATCG-3'). For mutation of the
core base pairs within the core site we used primers BC292
(5'-CCCAGAAGTCGCCATTATTTcTAGctATTACAGTAGCCCTGTT-GGG-3') and BC293 (5'-CCCAACAGGGCTACTGTAATagCTAgA-AATAATGGCGACTTCTGGG-3').
Data Analysis
Genes with low expression and low variance were filtered from
the dataset as described (Cohen et al., 2000
). The dataset
was then divided into clusters of coexpressed genes using the computer program QTClust (Heyer et al., 1999
) using a correlation
threshold of 0.7. A detailed description of all the clusters produced
from this analysis can be found at
http://genetics.med.harvard.edu/~cohen/yaps/Yaps.html. A Yap
binding site weight matrix (Stormo et al., 1982
) was
constructed using sites from four promoters known to be regulated by
Yap1p (Kuge and Jones, 1994
; Wu and Moye-Rowley, 1994
; Wemmie et
al., 1994
; Grant et al., 1996
). This weight matrix was
used as an input to the computer program ScanACE (Hughes et
al., 2000a
) to determine the distribution of Yap sites among all
of the expression clusters. Only sites that scored at least as well as
the average site in the matrix were counted as Yap sites. The
significance of clusters in which a high proportion of the promoters
within the cluster contained at least one Yap site was assessed using
the hypergeometric probability distribution, without correction for
multiple hypotheses, as follows:
|
The significance of groups of genes that were enriched for particular
MIPS functional annotations was tested as described (Tavazoie et
al., 1999
).
| |
RESULTS |
|---|
|
|
|---|
Characterization of Molecular Phenotypes
In this study we define the molecular phenotype of a gene to be
the constellation of changes in gene expression that take place upon
deletion of the gene. Because Yap1p and Yap2p have both been implicated
in stress response, we determined their molecular phenotypes in
H2O2. We arbitrarily chose
to include only genes whose expression changed by more than threefold
in our molecular phenotypes. For example, the Yap1p molecular phenotype
is composed of genes that do not vary in wild-type cells grown in
H2O2, but whose expression
changes at least threefold in yap1
cells grown in
H2O2. Although Yap1p and
Yap2p have partially overlapping molecular phenotypes (Figure
1, A and B), it is clear that there are a
significant number of genes whose expression changes only in
yap1
mutants and other changes that occur only in
yap2
mutants. Aside from the changes shown in Figure 1,
there were also 82 genes whose expression changed in yap1
yap2
cells but not in either of the single mutants. This
result suggests that there may be some functional redundancy between
Yap1p and Yap2p. However the fact that these two proteins have
different molecular phenotypes implies that they also have separable
functions.
|
Identification of Yap1p- and Yap2p-dependent Genes
Because our preceding results suggested that Yap1p and Yap2p have separable functions and because these proteins are themselves transcription factors, we hypothesized that there would be separate groups of genes whose transcription were directly regulated by either Yap1p or Yap2p.
We observed 250 genes whose expression changed by more than threefold in wild-type cells upon addition of H2O2 to the growth medium. Fifty-three percent of these changes depended on the presence of Yap1p, Yap2p, or Yap1p and Yap2p, underscoring the importance of these proteins in the response to oxidative stress. However from these data alone it was not possible to discern how many of these changes were primary targets of Yap1p or Yap2p regulation. We reasoned that genes showing Yap-dependent regulation that also contained Yap binding sites in their promoters would be more likely to be direct targets of Yap1p or Yap2p.
To find sets of genes whose expression depended on either Yap1p or
Yap2p, we first partitioned our dataset into 29 groups of coexpressed
genes using the clustering algorithm QTClust (Heyer et al.,
1999
). For a full description of all clusters see
http://genetics.med.harvard.edu/~cohen/yaps/Yaps.html. Next we
determined the distribution of Yap binding sites among the different
expression clusters. To do this we constructed a multiple alignment of
Yap1p binding sites from promoters known to be regulated by Yap1p. We
then ran the computer program ScanACE (Hughes et al.,
2000a
), which uses a multiple alignment to search for additional
matching sequences, to identify all of the occurrences of Yap1p binding
sites in the genome. Finally, we used two different statistical tests
to search for clusters in which Yap sites were statistically over
represented in the promoters of the genes within those clusters. First
we looked for clusters in which a high proportion of the promoters
contained at least one Yap binding site (Table 1). We also looked for clusters in
which promoters tended to have multiple Yap sites (Table
2). Using these criteria six clusters were deemed to contain more Yap sites in the promoters of their genes
then expected by chance.
|
|
Among this set clusters 31 (25 genes) and 33 (24 genes) are of
particular interest because they clearly separate genes controlled by
Yap1p and Yap2p (Figure 2, A and B). The
genes in cluster 31 show a Yap1p-dependent increase in expression in
H2O2. The small increase in
normalized expression in the wild-type cells in
H2O2 actually corresponds
to an average change of fourfold
H2O2. What is striking,
however, about the expression of the genes in this cluster is that the
Yap1p-dependent increase in expression in H2O2 is greatly magnified
in the absence of Yap2p. This expression is Yap1p dependent
because it is absent in the yap1
yap2
mutant. Cluster 33 shows an almost opposite expression pattern from
cluster 31. The genes in this cluster do not show a significant
increase in expression in
H2O2 in wild-type cells
(1.8-fold on average), a result that might be expected because of the
absence of H2O2 hypersensitivity in yap2
mutants (Hirata et
al., 1994a
). However, they do show a very large increase in
expression in the absence of Yap1p. This expression is Yap2p dependent
because it is absent in the yap1
yap2
mutant. Therefore, there is clearly a set of genes whose expression is
dependent on Yap1p but not Yap2p and a set of genes whose expression is
dependent on Yap2p but not Yap1p.
|
Using the MIPS database (Mewes et al., 1999
), we asked
whether the Yap1p- and Yap2p-dependent genes were enriched for
particular functions. Cluster 31 (Yap1p dependent) was enriched for
genes in the category "detoxification" (p = 2.4 × 10
4). This cluster contained genes such as
glutathione s-transferase and superoxide dismutase, which
are clearly involved in the response to reactive oxygen species. By
contrast cluster 33 (Yap2p dependent) was enriched for genes in the
category "protein folding and stability" (p = 2.3 × 10
2) and contained genes such as chaperones
and ubiquitin-conjugating enzymes. However, 54% of the genes in
cluster 33 were of unknown function, suggesting that the
Yap2p-dependent regulon may also have other functions besides affecting
protein turnover. In response to oxidative stress a cell must deal
directly with reactive oxygen species as well as stabilize its
correctly folded proteins and degrade its misfolded proteins. Directly
comparing cluster 31 to cluster 33 shows that the functions of the
genes enriched in each cluster are significantly different
(z = 2.8, p < 0.01) and suggests that
detoxification is controlled by the Yap1p regulon, whereas protein
turnover is affected by the Yap2p regulon. These results may also help
explain the relationship between the Yap1p- and Yap2p-dependent genes.
For example, in the absence of Yap2p, the Yap1p response to
H2O2 is magnified (Figure
2A). This may be because in the absence of the Yap2p-dependent protein
turnover response there is a greater need for the detoxification
response. Thus, although the detoxification genes are dependent only on Yap1p they can sense the absence of Yap2p, perhaps because Yap1p is
activated by the presence of unfolded proteins. Alternatively, Yap2p
may directly repress the promoters of Yap1p-dependent genes.
Clusters 31 and 33 were both identified because the genes within those
clusters were over represented for Yap binding sites. We also used the
program AlignACE (Hughes et al., 2000a
) to search the
promoters of genes within Clusters 31 and 33 for other DNA sequences
that might be over represented within these sets. Aside from the Yap
sites themselves no other motifs showed significant over representation
within these promoters, suggesting that it is the Yap sites that are
responsible for the expression patterns of the clusters. How can the
same binding site control the expression of genes with such different
expression patterns? To address this question we reconstructed binding
site matrices (Stormo et al., 1982
) for the Yap sites from
clusters 31 and 33 (Figure 2, C and D). Although sites from both
clusters 31 and 33 tended to conform to the known Yap site (TTAGTAA),
many sites from cluster 31 contained additional conserved bases
flanking this "core" sequence. Although the core sequence is
distributed widely across the genome and is present in almost all of
our expression clusters, the "extended" yap site is found almost
exclusively in the promoters of genes within the Yap1p-dependent
cluster 31. Three genes (TRX2, YCF1, and
GLR1), which were previously identified as being Yap1p
dependent, contain sites in their promoters with strong matches to the
extended site we identified. One other Yap1p-dependent gene
(GSH1) does not contain an extended site in its promoter.
However, the regulation of this gene has already been shown to be
different from that of other Yap1p-dependent genes (Stephen et
al., 1995
). Taken together these results raised the possibility
that the flanking bases found in the extended site are important for
Yap1p-dependent expression.
We chose to test this hypothesis on YKL086W, a novel member
of the Yap1p regulon from cluster 31. Although the function of YKL086W is currently unknown, when we deleted it from the
genome the resulting cells were hypersensitive to
H2O2 (Figure
3A), demonstrating that this gene is
involved in the response to reactive oxygen species. This also suggests
that other genes in cluster 31 with unknown functions that contain Yap
binding sites are involved in the response to oxidative stress. We
fused the promoter region of YKL086W to the LacZ gene from
Escherichia coli. This reporter gene mimicked the
Yap1p-dependent expression pattern observed on the microarray for
YKL086W. The promoter region of this gene contains one
extended Yap site, GTTAGTAACA (flanking bases
shown in bold), and one core Yap site, TTAGTAA. Although the analysis
was complicated by the presence of the core site, it was clear from
various mutant derivatives that mutating the flanking bases in the
extended site caused a reproducible reduction in activity from the
reporter (Figure 3B). In constructs where the core site had been
mutated so as to be inactive, mutating the flanking bases in the
extended site was equivalent to mutating bases within the core sequence
of the extended site. These results suggest that the flanking bases in
the extended site are required for the function of that site and for
specification of Yap1p-dependent transcription.
|
Another interesting cluster is number 79 (12 genes) in which the genes
show a strong decrease in transcription in
H2O2 that is dependent on
both Yap1p and Yap2p (Figure 4). The
promoters of these genes are over represented for Yap binding sites
(Table 1), suggesting that this repression of transcription is direct. The genes in this cluster are enriched in the MIPS functional category
"DNA synthesis and replication." Although this enrichment is not
statistically significant (p = 0.14), it makes sense that a
cell would slow replication during oxidative stress while damage done
to DNA is repaired. These results provide the first evidence that Yap1p
and Yap2p may repress as well as activate transcription.
|
| |
DISCUSSION |
|---|
|
|
|---|
Our results demonstrate that although Yap1p and Yap2p are closely related homologues, with similar phenotypes, these proteins are clearly nonredundant. Yap1p and Yap2p activate distinct regulons involved in different aspects of the oxidative stress response, with Yap1p-dependent genes being involved directly in detoxifying the effects of reactive oxygen species, whereas Yap2p-dependent genes help stabilize and fold proteins in an oxidative environment. The molecular functions of Yap1p and Yap2p have diverged such that these homologues now activate different regulons, yet both proteins are involved in the response to cellular stresses. Selective pressures may have driven this divergence by increasing the scope and flexibility of the response to cellular stresses. Although we have focused here on oxidative stress, the divergence of Yap1p and Yap2p may allow different physiological responses to different cellular stresses. There may be conditions in which Yap1p and Yap2p are differentially expressed or activated. However, at the level of mRNA expression there is no significant difference between the YAP1 and YAP2 transcripts in 217 whole-genome expression data sets tested. Our working hypothesis is therefore that Yap1p and Yap2p both respond to similar cellular stresses, and they are both maintained in the genome because they activate different regulons.
We have also provided evidence that the specification of Yap1p- versus
Yap2p-dependent transcription occurs through an extended Yap site found
only in the promoters of Yap1p-dependent genes. These extended sites
are not found in Yap2p-dependent genes, and reporter gene assays
confirmed that the additional base pairs in the extended sites are
important for regulation by Yap1p. This mechanism for obtaining
specificity is consistent with what is known about other families of
transcription factors. For example, base pairs flanking the core
binding site have also been shown to be important in specifying
transcription between members in families of basic helix-loop-helix
leucine zipper transcription factors (O'Hagan et al.,
2000
). How Yap2p-dependent transcription is specified is still unclear.
Although the core Yap site is statistically over represented in the
promoters of Yap2p-dependent genes, that site is also present in the
promoters of many genes that do not show Yap2p-dependent transcription.
The Yap sites in Yap2p-dependent promoters may work in combination with
other transcription factor binding sites that fell below the threshold
of detection of the search algorithms we used.
As demonstrated by the expression pattern of cluster 79, Yap1p and Yap2p may function as repressors as well as activators of transcription. One possible model to describe the Yap network is that during oxidative stress Yap1p and Yap2p homodimers activate distinct regulons, whereas Yap1p/Yap2p heterodimers collaborate to repress a separate regulon.
Our approach to separating the functions of the Yap1p and Yap2p transcription factors has several advantages. Focusing on expression clusters that are over represented for Yap binding sites helps distinguish direct versus indirect effects on transcription caused by transcription factors. This approach also does not require any a priori prediction of what a Yap-dependent cluster should look like and therefore allows the identification of clusters with unexpected expression patterns, such as cluster 79. Finally our approach is automatable and requires minimal curation, and is therefore easily scalable to larger genomes and larger protein families. This work should serve as a model to disentangle the functions of the huge number of paralogs being identified by the various genome sequencing projects.
| |
ACKNOWLEDGMENTS |
|---|
We thank Semyon Kruglyak for the computer code for the QTClust algorithm, Jason Hughes for the ScanACE and AlignACE programs, Fred Winston for strains and plasmids, and members of the Church lab for discussions and critical readings of the manuscript. B.A.C. was supported by a postdoctoral fellowship from the American Cancer Society (PF-98-159-01-MBC). This work was supported by the US Department of Energy (DE-FG02-87-ER60565), the Office of Naval Research and DARPA (N00014-97-1-0865), the Lipper Foundation, and Hoechst Marion Roussel.
| |
FOOTNOTES |
|---|
* Corresponding author and present address: Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110; e-mail address: cohen{at}genetics.wustl.edu.
Article published online ahead of print. Mol. Biol. Cell 10.1091/mbc.01-10-0472. Article and publication date are at www.molbiolcell.org/cgi/10.1091/mbc.01-10-0472.
| |
REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
S. Mroczek and J. Kufel Apoptotic signals induce specific degradation of ribosomal RNA in yeast Nucleic Acids Res., May 1, 2008; 36(9): 2874 - 2888. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Tan, H. Feizi, C. Luo, S. H. Fan, T. Ravasi, and T. G. Ideker A systems approach to delineate functions of paralogous transcription factors: Role of the Yap family in the DNA damage response PNAS, February 26, 2008; 105(8): 2934 - 2939. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. D. Deppmann, R. S. Alvania, and E. J. Taparowsky Cross-Species Annotation of Basic Leucine Zipper Factor Interactions: Insight into the Evolution of Closed Interaction Networks Mol. Biol. Evol., August 1, 2006; 23(8): 1480 - 1492. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lev, R. Hadar, P. Amedeo, S. E. Baker, O. C. Yoder, and B. A. Horwitz Activation of an AP1-Like Transcription Factor of the Maize Pathogen Cochliobolus heterostrophus in Response to Oxidative Stress and Plant Signals Eukaryot. Cell, February 1, 2005; 4(2): 443 - 454. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Nicholls, M. Straffon, B. Enjalbert, A. Nantel, S. Macaskill, M. Whiteway, and A. J. P. Brown Msn2- and Msn4-Like Transcription Factors Play No Obvious Roles in the Stress Responses of the Fungal Pathogen Candida albicans Eukaryot. Cell, October 1, 2004; 3(5): 1111 - 1123. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Wysocki, P.-K. Fortier, E. Maciaszczyk, M. Thorsen, A. Leduc, A. Odhagen, G. Owsianik, S. Ulaszewski, D. Ramotar, and M. J. Tamas Transcriptional Activation of Metalloid Tolerance Genes in Saccharomyces cerevisiae Requires the AP-1-like Proteins Yap1p and Yap8p Mol. Biol. Cell, May 1, 2004; 15(5): 2049 - 2060. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Rutherford, S. Jaron, and D. R. Winge Aft1p and Aft2p Mediate Iron-responsive Gene Expression in Yeast through Related Promoter Elements J. Biol. Chem., July 18, 2003; 278(30): 27636 - 27643. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. D. Gibbons and F. P. Roth Judging the Quality of Gene Expression-Based Clustering Methods Using Gene Annotation Genome Res., October 1, 2002; 12(10): 1574 - 1581. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||