![]() |
|
|
Vol. 10, Issue 6, 1859-1872, June 1999




Departments of *Biochemistry and ¶Anatomy
and Embryology, University of Amsterdam, Academic Medical Center, 1105 AZ Amsterdam, The Netherlands; §European Molecular Biology
Laboratory, Biochemical Instrumentation Programme, D-69117 Heidelberg,
Germany;
Munich Information Centre for Protein
Sequences, Max-Planck-Institut für Biochemie, D-82152
Martinsried, Germany; and #Unité de
Génétique Moleculaire des Levures, Institut Pasteur,
F-75724 Paris Cedex 15, France
| |
ABSTRACT |
|---|
|
|
|---|
We describe a genome-wide characterization of mRNA
transcript levels in yeast grown on the fatty acid oleate, determined
using Serial Analysis of Gene Expression (SAGE). Comparison of this SAGE library with that reported for glucose grown cells revealed the dramatic adaptive response of yeast to a change in carbon source. A
major fraction (>20%) of the 15,000 mRNA molecules in a yeast cell
comprised differentially expressed transcripts, which were derived from
only 2% of the total number of ~6300 yeast genes. Most of the mRNAs
that were differentially expressed code for enzymes or for other
proteins participating in metabolism (e.g., metabolite transporters).
In oleate-grown cells, this was exemplified by the huge increase of
mRNAs encoding the peroxisomal
-oxidation enzymes required for
degradation of fatty acids. The data provide evidence for the existence
of redox shuttles across organellar membranes that involve peroxisomal,
cytoplasmic, and mitochondrial enzymes. We also analyzed the mRNA
profile of a mutant strain with deletions of the PIP2
and OAF1 genes, encoding transcription factors required
for induction of genes encoding peroxisomal proteins. Induction of
genes under the immediate control of these factors was abolished; other
genes were up-regulated, indicating an adaptive response to the changed
metabolism imposed by the genetic impairment. We describe a statistical
method for analysis of data obtained by SAGE.
| |
INTRODUCTION |
|---|
|
|
|---|
The development of innovative techniques to study gene expression
combined with the knowledge of the Saccharomyces cerevisiae genome sequence makes it possible to establish an inventory of all
yeast transcripts (transcriptome) and to relate these transcripts to
the genes from which they originate. Serial Analysis of Gene Expression
(SAGE) and hybridization on DNA microarrays are recently described
tools for such an analysis (Velculescu et al., 1995
, 1997
;
DeRisi et al., 1997
; Wodicka et al., 1997
; Zhang
et al., 1997
; Cho et al., 1998
; Holstege et
al., 1998
). The SAGE technique samples short sequences of 10-14
nucleotides (tags) of individual mRNAs. Determination of the sequence
of these tags allows identification of the corresponding genes. The
frequency of a tag, representing the steady-state level of the mRNA
from which it was derived, gives, with certain limitations, information
about the level of gene expression and the amount of protein made.
Comparison of transcriptomes yields interesting information about the
dynamics of total genome expression attributable to a change in
environmental conditions or state of differentiation. In addition, it
provides necessary clues to determine the function of those genes whose contribution to cellular life is still unknown.
We were particularly interested in the changes in the mRNA population
required for the increase in number and volume of peroxisomes when
yeast is faced with fatty acids as a sole carbon source. In S. cerevisiae, the enzymes for degradation of fatty acids are uniquely confined to peroxisomes together with some of the enzymes constituting the glyoxylate cycle (Kunau et al., 1988
).
Particularly, the genes encoding the
-oxidation enzymes are highly
induced, but to what extent the growth on fatty acids also leads to
more general alterations in cellular metabolism and organelles other than peroxisomes is not known. It is also unknown to what extent proliferation of the peroxisomal compartment induced by growth on fatty
acids requires adjustment in other proteins than those functioning in
metabolism, for instance, proteins involved in protein trafficking or
proteins involved in peroxisome biogenesis.
To address these questions, we used SAGE to determine the transcriptome
of yeast grown on oleate and compared this with the transcriptome of
yeast grown on glucose (Velculescu et al., 1997
). In the
process we developed a statistical method for comparison of SAGE
results and the planning of future experiments. We also constructed a
SAGE library of mutant yeast cells in which the oleate-induced
enlargement of the peroxisomal compartment was abrogated by deletion of
the genes encoding transcription factors Pip2p and Oaf1p (Luo et
al., 1996
; Rottensteiner et al., 1996
). This analysis
revealed genes that are under the control of Pip2p and Oaf1p but in
addition showed genes whose activity increased to survive the genetic
impairment imposed to the pip2/oaf1 mutant cells. Here we
will relate some of our findings to aspects of peroxisome biogenesis
and function.
| |
MATERIALS AND METHODS |
|---|
|
|
|---|
Yeast Strains, Culture Conditions, and RNA Isolation
Yeast strains used in this study are BJ1991
(MAT
, leu2, trp1, ura3-52, pep4-3, prb1-1122;
Jones, 1977
) and BJ1991 pip2/oaf1 (MAT
,
leu2, trp1, ura3-52, pep4-3, prb1-1122, PIP2::KANMX4,
OAF1::LEU2; Rottensteiner et al., 1997
).
Differences in genotype between yeast strains used for glucose
(Velculescu et al., 1997
) and oleate SAGE libraries were not
thought to influence global gene expression patterns, considering the
rich carbon sources used. Strains were precultured 24 h on minimal
medium containing 0.3% glucose to obtain a derepressed culture. After
a shift to medium containing 0.12% oleate, 0.2% Tween 40, 0.3% yeast
extract, 0.5% bacto-peptone, and 0.5% potassium phosphate buffer, pH
6.0, the cells were cultured for 18 h at 28°C. Cell growth was
stopped by the addition of an equal volume of ethanol (
80°C), and
RNA was extracted immediately. mRNA was isolated using the Poly-A-tract
kit from Promega (Madison, WI) according to the manufacturer's protocol.
SAGE Procedure
The SAGE libraries were obtained essentially following the SAGE
protocol described (Velculescu et al., 1995
). Briefly, mRNA was converted to cDNA using a Life Technologies (Gaithersburg, MD) cDNA
synthesis kit and biotin-oligo-(dT)18 (New England Biolabs, Beverly,
MA). cDNA was digested with NlaIII, and 3' cDNAs were isolated using streptavidin magnetic beads (Dynal M280; Dynal, Oslo,
Norway). 3' cDNAs were split into two pools, and SAGE linkers 1 and 2 (synthesized by Eurogentec, Seraing, Belgium) were ligated to pools 1 and 2, respectively. SAGE tags were released with
BsmF1 and blunted with T4 polymerase, and the tags from
pools 1 and 2 were ligated to each other. A 1:800 dilution of the
ligation product was amplified with 28 cycles of PCR and digested with NlaIII. Ditags were isolated from a 12% polyacrylamide
gel and self-ligated. Concatemers were isolated from an 8%
polyacrylamide gel and cloned into pZERO (Invitrogen, San Diego, CA)
digested with SphI. Inserts were amplified by colony PCR
using M13 forward and reverse primers. PCR fragments were sequenced via
cycle sequencing with the AmpliTaqFS core kit (Applied Biosystems,
Foster City, CA) and 2 pmol of Cy5-T7 primer. An MJ Research (Waltham,
MA) PT-200 cycler was used to perform 25 cycles (97°C, 15 s; 55°C, 30 s; and 68°C, 30 s). Reactions were loaded on
60-cm-long 4.5% PAGE-PLUS polyacrylamide gels (Amresco, Solon, Ohio)
on the European Molecular Biology Laboratory (EMBL) sequencing system
with array detectors (Erfle et al., 1997
). The 3' and 5'
sequences were obtained simultaneously by sequencing on the ARAKIS
two-dye DNA-sequencing system developed at EMBL (Wiemann et
al., 1995
). The system allows simultaneous on-line sequencing on
both strands (Doublex sequencing), with the two sequencing products
obtained in a single sequencing reaction, each labeled with a different
fluorescent dye. Up to 2000 bases are thus obtained simultaneously in
one sequencing reaction on both strands, which presents an efficient
system for identifying large number of sequence tags obtained in one
run. Raw sequencing data were evaluated using the GeneSkipper software package (EMBL).
SAGE Data Analysis
Initial data analysis was performed using the SAGE software
package version 1.0 (Velculescu et al., 1995
; Zhang et
al., 1997
). The tag list from wild-type cells and
pip2/oaf1 cells contained 10,943 and 3847 tags,
respectively, of which 577 and 234, respectively, were derived from
linker sequences. These tags were excluded from the analysis. The
resulting tag lists contained 10,366 total tags from wild-type cells
and 3613 tags from pip2/oaf1 cells. We compiled a database
of all potential tags of the complete yeast genome (>69,000 10-bp
sequences) and linked each tag to the gene annotations in the
MIPS database (as of December 9, 1998). Next, we merged this
data set with the tags found with SAGE. For estimating the number of
different genes of which tags were found, we counted tags located
within the ORFs or within the 500-bp 3' adjacent of the ORFs. When
different tags originating from the same gene were found, these tags
were pooled to calculate expression levels. We found that tags
originated from ~1700 genes, of which 400 were not identified in the
tag list from glucose-grown cells. For most of these 400 genes only one
tag was found. Tag numbers were converted to number of mRNA transcripts
per cell assuming a total of 15,000 mRNA molecules per cell. mRNA
ratios between oleate- and glucose-grown cells were calculated from the
tag lists. To avoid division by zero we used a tag value of 1 for tags
that were not detected in the tag list from glucose-grown cells (Zhang
et al., 1997
). Hereby also the genes that are highest
expressed are scored as the highest induced. Classification in groups
was done according to the yeast protein functional catalogue (Goffeau,
1997
; Mewes et al., 1997
; also available via the World Wide
Web at http://websvr.mips.biochem.mpg.de/proj/yeast). A second database
was prepared that permitted searching for genes in specific functional
categories. SAGE data sets of oleate grown wild-type and
pip2/oaf1 mutant cells are available on request (E-mail:
A.Kal{at}icrf.icnet.uk) and will be made available via the web sites of
Munich Information Centre for Protein Sequences
(http://websvr.mips.biochem.mpg.de/proj/yeast) and MBC
(http://www.molbiolcell.org).
For the nomenclature of genes coding for cytoplasmic ribosomal protein,
the guidelines proposed by Mager et al. (1997)
were followed.
Statistical Analysis
Two statistical approaches of the number of specific tags found
in SAGE can be envisioned. First, the number of tags can be seen as
counts and therefore as Poisson distributed values. A test for
differences in tag numbers found in two experimental conditions,
seemingly based on Poisson distribution statistics, has recently been
described (Madden et al., 1997
). However, their approach
suffers from the disadvantage that it can only be used reliably on tag
libraries of similar size.
The second approach is to look at the number of copies of a specific
mRNA per cell as a fraction or proportion of the total number of mRNA
molecules in that cell. The same proportion (p) of specific tags should
be present in the SAGE library of all sequenced tags. Thus:
|
.
The accuracy of the estimation of the proportion depends on the total
number of tags (N) sequenced, its SE being SEp = SDp/
N. The 95% confidence interval of the observed
number of specific tags can then be calculated as:
|
(1) |
p2 of proportions
p1 and p2, resulting from samples with sizes
N1 and N2, respectively, is given by:
|
(2) |
p2 and the above SE
can be used to calculate the test statistic:
|
(3) |
An additional advantage is that this Z test provides a way
to calculate the number of tags needed to be sequenced to detect a
difference as significant. In statistical testing, the relation between
the difference p1
p2 that can be detected
with a two-sided probability of a type I error (incorrect rejection of
the null hypothesis) of less then
, as well as a probability of a
type II error (failure to detect a true difference p1
p2) of less then
can generally be expressed as:
|
(4) |
|
(5) |
p2 with a two-sided significance less then
and a power
greater than 1-
:
|
(6) |
|
,
, and a required difference p1
p2 can only be calculated by an iterative procedure based
on Eq. 5. The results of such a calculation for N1 = 60,633,
= 0.05, and
= 0.1 for a range of differences is given
in Figure 6B. Similarly, an estimate of the minimal detectable difference p1
p2 for a given N1,
N2,
, and
can only be reached by iteration. A
Windows program, SAGEstat, performing these calculations, is available
on request (E-mail: j.m.ruijter{at}amc.uva.nl subject: SAGEstat).
| |
RESULTS |
|---|
|
|
|---|
Comparison of SAGE Tag Lists Determined for Glucose-grown and Oleate-grown Yeast
We determined 10,366 SAGE tags (from 10,366 transcripts)
originating from ~1700 different genes obtained from yeast cells grown on medium containing the fatty acid oleate as sole carbon source.
Following the specificity criteria set forth by Velculescu et
al. (1995
, 1997
) and assuming a total number of 15,000 mRNA molecules per cell, the number of transcripts from a certain gene was
calculated from the number of tags observed in the tag list (Tables
1 and 2).
In this way, tag lists of cells cultured under different conditions can
be compared with each other to monitor changes in steady-state mRNA
levels. Although tag lists cannot account for regulation secondary to
transcription and mRNA turnover, such as enzyme feed back control,
covalent modification, and protein turnover, they can be used as good
indicators for changes in gene expression. Although the mRNA
steady-state levels expressed as mRNA copies per cell suggest a certain
precision, we like to emphasize that these numbers will vary according
to the assumptions made for their calculation (as amply discussed by
Velculescu et al., 1997
). Here we have used these numbers
particularly for comparative purposes to sort our own database in
various ways and conformed ourselves to published rules to be able to
make simple reference to already existing data. For an in-depth
discussion of the applied statistics, see MATERIALS AND METHODS. The
calculated mRNA levels from SAGE data obtained from glucose-grown and
oleate-grown cells show good correlation with previously published data
from Northern blotting experiments (Einerhand et al., 1991
;
Rottensteiner et al., 1996
; Karpichev and Small, 1998
).
|
|
When genes were graphically displayed in order of decreasing tag
frequency, a surprisingly small number of mRNAs were found to be
present at high steady-state levels. This was already observed for
glucose-grown cells (Velculescu et al., 1997
) and is shown in Figure 1A. The same was true for
oleate-grown cells (Figure 1C). Only 0.1% of the genes was represented
at ~100 copies per cell (c/c) or more; 0.5% at 50 c/c or more; 5%
at 2 c/c or more, whereas most mRNAs (>90%) were present at <2 c/c
or were not expressed at all.
|
The nature of the tags representing the abundant mRNAs was totally different between the two conditions of growth. This was illustrated by displaying the mRNA levels observed in oleate-grown cells arranged in the same order as mRNA levels of glucose-grown cells (Figure 1B). Here we observed a complete change in the mRNA landscape. The abundant mRNAs of oleate-grown cells now appeared as peaks at positions where the same mRNAs were present at low levels in the graph of the glucose-grown cells. These were the mRNAs that were sorted as the high abundance group in the graph of Figure 1C. The ratios between copies per cell on oleate and glucose represent the fold induction of expression. When c/c on oleate is divided by c/c on glucose, a high ratio indicates a high induction on oleate. Calculation of these ratios showed that many high abundant mRNAs on oleate were also highly induced (Tables 1 and 2). The error in the values of these ratios is subject to variation dependent on the tag numbers used for their calculation. Again, like the c/c, we have used these ratios not as absolute values but only as general indicators to help us in the interpretation of the information present in the database.
The peaks in Figure 1B alerted us to the genes that were highly
expressed when cells were grown on oleate. A number of such genes were
already known from previous work using Northern blotting and reporter
gene studies (Einerhand et al., 1991
; Rottensteiner et
al., 1996
; Karpichev and Small, 1998
). Examples are genes encoding enzymes of the
-oxidation pathway such as acyl-coenzyme A (CoA) oxidase (FOX1/POX1) and thiolase (FOX3/POT1) and
other genes encoding peroxisomal matrix proteins such as catalase A
(CTA1) and malate dehydrogenase isoform 3 (MDH3).
Indeed, the number of mRNAs observed was also correspondingly high:
acyl-CoA oxidase, 55 c/c; thiolase, 120 c/c; catalase, 84 c/c; and
malate dehydrogenase 3, 72 c/c. These numbers were substantially lower
in glucose-grown cells (0, 0, 0, and 4 c/c, respectively). Several
genes with unknown functions that were highly expressed proved to
encode novel peroxisomal proteins: YNL009W encodes Idp3p (23 c/c), a
peroxisomal NADP-dependent isocitrate dehydrogenase (van Roermund
et al., 1998
), and YJR019C encodes Tes1p (29 c/c), a
peroxisomal acyl-CoA thioesterase (Kal, 1997
).
In the Yeast Genome Directory 1997 (Goffeau, 1997
), genes with known
function or with homology to genes with known function have been
grouped in functional categories. This allowed graphical display of
only those genes that fall into such a category and permitted a more
in-depth analysis of a particular biological process. A few examples
will be discussed below; the complete data set is available on the Internet.
Biogenesis and Function of Peroxisomes
For growth on oleate peroxisomes are required, because in yeast
these organelles exclusively house the
-oxidation enzymes necessary
for fatty acid degradation. As a consequence, in oleate-grown cells the
number and volume of peroxisomes are greatly increased compared with
cells grown on other carbon sources, particularly glucose. This is
illustrated in Figure 2A. The highest
transcript frequencies corresponded with mRNAs encoding the
-oxidation enzymes and other peroxisomal enzymes directly or
indirectly involved in fatty acid metabolism. Control of gene
expression in these cases was very tight, because for most of these
genes not a single tag was found in the glucose tag list despite the
overwhelming number of 60,000 tags determined (Velculescu et
al., 1997
). Induction extended also over a large range considering
the high number of mRNA copies per cell. This situation was totally
different for PEX genes coding for peroxins, proteins
involved in the biogenesis of peroxisomes and involved in import of
proteins into peroxisomes (Figure 2A). For most PEX genes it
has been reported that they are induced when cells are grown on oleate.
However, the abundance of mRNAs encoding peroxins with known functions
was still <6 c/c, e.g., components of the protein import pathway such
as the peroxisomal-targeting signal receptor Pex5p and the
Pex5p-docking proteins Pex13p and Pex14p (Elgersma et al.,
1996
; Erdmann and Blobel, 1996
; Gould et al., 1996
;
Albertini et al., 1997
). This seemed to be a general feature; also, mRNAs coding for proteins of the nuclear pore complex, the mitochondrial Tim and Tom proteins, and the components of the
protein import machinery of the endoplasmic reticulum were low abundant
or absent in both the glucose and oleate tag lists. Because much
about peroxisome metabolism and biogenesis is still unknown, we like to
caution that the "peroxisome" functional category is plainly
incomplete. Clues for additional candidates of this category can be
found in the data set. For instance, YJR019c (TES1) is an
unknown reading frame that by definition will not show up in the
peroxisome functional category. It is, however, a good example
to illustrate the potential use of the data set. TES1 is
expressed at 29 c/c in oleate-grown cells and hardly expressed in
glucose-grown cells (<0.3 c/c; Table 2). The encoded reading frame
shows homology to acyl-CoA thioesterases of Escherichia coli, Haemophilus influenzae, and human. Further
biochemical analysis showed that the encoded protein had acyl-CoA
thioesterase activity and is located in the peroxisomal matrix. Like
the majority of genes encoding peroxisomal matrix proteins, the
TES1 gene is regulated by the Pip2p and Oaf1p transcription
factors. Upon deletion of the TES1 gene we have not yet
observed an overt phenotype (Kal, 1997
). This example shows how
candidates for the peroxisome functional category can be traced. In
addition, it is an example of a gene that we missed for lack of a
phenotype in our genetic screens applied thus far.
|
Mitochondria
For growth on oleate and other nonfermentable carbon
sources, a functional mitochondrial compartment is essential. We
therefore wondered whether genes encoding mitochondrial proteins would
be up-regulated or down-regulated as a group depending on the carbon source. Figure 3 illustrates that on
average the frequency of mRNAs coding for mitochondrial proteins was
higher in oleate-grown cells compared with glucose-grown cells. It
indicated that cells more heavily rely on mitochondrial metabolism when
they grow on oleate (see DISCUSSION). However, in a number of cases,
tag frequencies are substantially increased above the average
enrichment, suggesting that mitochondria require fine tuning to adjust
to growth on oleate. Some adjustments, such as elevated expression of
the mitochondrial glycerol-3-phosphate dehydrogenase (Gut2p), can be
explained in terms of changing metabolism (see below); other changes
ask for further research, such as the mitochondrial outer membrane
protein OM45 (35 c/c in oleate grown cells vs. <0.3 c/c in glucose
grown cells).
|
Communication among Peroxisomes, Cytosol, and Mitochondria
The end products of
-oxidation, NADH and acetyl-CoA,
generated inside peroxisomes, must be exported across the impermeable membrane to cytoplasm and mitochondria for ATP generation and biosynthetic processes (Elgersma and Tabak, 1996
). Reduction
equivalents are proposed to shuttle from the peroxisomal matrix to the
cytoplasm in the form of malate in a process requiring peroxisomal
malate dehydrogenase (Figure 4). Indeed,
not only the gene encoding peroxisomal malate dehydrogenase
(MDH3, 72 c/c) but also the gene encoding cytosolic malate
dehydrogenase (MDH2, 120 c/c) was highly expressed considering the number of mRNAs per cell, and both genes were specifically induced considering the corresponding numbers in glucose-grown cells (4 and 0 c/c, respectively).
|
After export from peroxisomes, the now cytoplasmic NADH can in part be
oxidised by mitochondrial oxidases facing the cytoplasm (de Vries and
Marres, 1987
; Larsson et al., 1998
) or shuttled into the
respiratory chain by the mitochondrial glycerol-3-phosphate dehydrogenase isoenzyme Gut2p (Figure 4). This was indicated by the
appreciable induction of the GUT2 gene (Ronnow and
Kielland-Brandt, 1993
) (30 c/c).
Although the net flow of NADH is directed outward to the cytosol, a net
inward flow of NADPH goes toward peroxisomes. The NADP-dependent
isocitrate dehydrogenase Idp3p is involved in the production of NADPH
in the peroxisomal matrix, to enable the action of the peroxisomal 2,4 dienoyl-CoA reductase Sps19p, which is required for degradation of
unsaturated fatty acids with double bonds at even positions (Gurvitz
et al., 1997
; Henke et al., 1998
; van Roermund
et al., 1998
). The existence of an isocitrate/2-oxoglutarate redox shuttle across the peroxisomal membrane (Figure 4) is indicated by the tandem induction of both the IDP3 gene (20 c/c on
oleate, <0.3 c/c on glucose) and the IDP2 gene (29 c/c on
oleate, <0.3 c/c on glucose) encoding peroxisomal isocitrate
dehydrogenase Idp3p and cytosolic isocitrate dehydrogenase Idp2p, respectively.
There are two mechanisms for acetyl-CoA to leave the peroxisomes. The
first is in the form of acetyl-carnitine; the second is in the form of
succinate, generated in the glyoxylate cycle (Tabak et al.,
1995
; van Roermund et al., 1995
). Part of the acetyl-CoA is
transferred to the mitochondria for ATP production, whereas another
part is assimilated to C4 and C6 compounds (Elgersma and Tabak, 1996
).
For this latter process the glyoxylate cycle and gluconeogenesis
pathway are essential. Two enzymes are unique components of the
glyoxylate cycle: malate synthase I and isocitrate lyase I. The
remaining enzymes of the cycle are used for other purposes as well.
This was underscored by the number of mRNAs observed for the
(peroxisomal) enzymes that are essential for operation of this cycle
(Figure 5A). Indeed, all the glyoxylate cycle enzymes were induced in oleate compared with glucose. We have
left out the result for peroxisomal malate dehydrogenase 3, because it
was shown that this enzyme does not participate in the glyoxylate
cycle. Instead, Mdh3p is involved in the shuttling of reduction
equivalents from peroxisomes to the cytosol (van Roermund et
al., 1995
). Further assimilation to C6 sugars is mediated by the
gluconeogenesis pathway. The genes encoding the enzymes isocitrate
lyase (ICL1), fructose 1,6-bisphosphatase (FBP1),
and phosphoenolpyruvate carboxykinase (PCK1) are essential
for gluconeogenesis. Their mRNAs are present at 25-42 c/c in
oleate-grown cells (Figure 5A). Remarkably, in the tag list of
glucose-grown cells, no hits for these mRNAs were scored. Thus, the
gluconeogenesis enzymes are indeed very specific for oleate-grown
cells. In addition, most of the glycolytic household enzymes are of
course required in this process.
|
Heat Shock Proteins and Stress Response
The comparison also indicated that mRNAs coding for certain members of the heat shock and stress family of proteins occur at significantly higher steady-state levels in oleate-grown compared with glucose-grown cells (Figure 5A): HSP12, TIP1, SSA1, PIR3, DDR2, YRO2, SIR4, and HSP26 mRNAs. This was not simply due to a general stress response, because the levels of other stress indicators, e.g., catalase T mRNAs (CTT1), remained <0.3 c/c in the two growth conditions, and expression of genes encoding the 30- and 60-kDa heat shock proteins (HSP30 and HSP60) remained practically unchanged.
The Transcriptome of pip2/oaf1 Mutant Cells
Transcription of the genes coding for the major enzymes of
peroxisomal metabolism is controlled by the transcription factors Pip2p
and Oaf1p. The constitutive Oaf1p and oleate-inducible Pip2p bind as a
heterodimer to an upstream activation sequence called "oleate
response element" (ORE) and activate transcription of genes
containing an ORE in their promoter (Karpichev et al., 1997
; Rottensteiner et al., 1997
). To investigate the consequences
of the loss of these transcription factors and to identify additional, unknown genes subjected to their control, we cultured a
pip2/oaf1 double mutant strain on oleate-containing medium.
A SAGE library of 3613 tags was prepared from the mRNA derived from
this culture. When this library was compared with the one obtained from
wild-type cells grown on oleate, two different effects could be
discerned: 1) a number of highly expressed genes under the control of
Pip2p/Oaf1p were down-regulated to much lower levels, comparable with
the levels encountered in yeast cells growing on nonfermentable carbon sources such as glycerol or ethanol, for which no major contribution of
peroxisomes is required (derepression level) (Figures 1D and 2B and
Tables 1 and 2); and 2) in response to the inefficient
-oxidation,
cells made more efficient use of alternative carbon sources present in
the growth medium (enforced with yeast extract). For instance, genes
required for proline import and catabolism were induced in wild-type
cells grown on oleate (PUT1 and PUT4) and even
further induced in the mutant (PUT1). The induction of dicarboxylic acid transporters and permeases JEN1 and
DIP5 also suggested that the cells switched to alternative
carbon sources.
In addition to these metabolic adaptations, several genes that are known to be induced by stress were also induced in the pip2/oaf1 strain (e.g., HSP12, TIP1, DDR2, HSP26, and HSP30) (Figure 5B), whereas certain genes encoding cytosolic ribosomal proteins were down-regulated (Table 1). Possibly, the loss of Pip2p and Oaf1p transcription factors and the resulting inability to deal with oleate as a carbon source was a stressful condition, which the cells attempted to compensate.
The absence of high-level expression of certain oleate-induced genes in
the pip2/oaf1 strain compared with the wild-type strain made
it possible to identify genes that were thus far unknown to be under
the control of Pip2p and Oaf1p. Examples are the TES1 gene
encoding a peroxisomal acyl-CoA thioesterase (Kal, 1997
) and the
IDP3 gene encoding a peroxisomal NADP-dependent isocitrate dehydrogenase (van Roermund et al., 1998
). For other genes
encoding proteins with unknown functions that were expressed highly in wild-type cells on oleate but not in pip2/oaf1 mutant cells,
e.g., YOR084W, further investigation of their suspected role in
peroxisome function is required.
| |
DISCUSSION |
|---|
|
|
|---|
Here we report a transcriptome analysis using SAGE of S. cerevisiae growing on the fatty acid oleate as the sole carbon
source. The availability of a transcriptome of glucose-grown yeast
cells (Velculescu et al., 1997
) made it possible to
investigate the changes in mRNA levels that result from the presence of
one or the other carbon source in the growth medium.
Growth of yeast on oleate requires the presence of
-oxidation
enzymes to metabolize this fatty acid. In S. cerevisiae the
-oxidation enzymes are exclusively located in peroxisomes, and the
peroxisomal compartment is increased in number and volume in
oleate-grown cells (Veenhuis et al., 1987
). We hoped that
SAGE analysis would alert us to new genes involved in peroxisome
biogenesis and function. In particular, we were interested in those
genes that went unnoticed in the genetic screens applied thus far, for instance, genes that upon mutation would confer lethality to the cell
or genes coding for proteins the loss of which did not result in a
clear phenotype. SAGE is perfectly suited for application to S. cerevisiae because the total genome has been sequenced, and
~6300 reading frames longer than 100 amino acids were identified (Goffeau, 1997
). This allows assignment of almost all tags to genes
with great precision. Ironically, the most abundant tag provided an
exception. Here, the high DNA sequence conservation between the
ADH1 and ADH2 genes encoding the isoenzymes
alcohol dehydrogenase 1 and 2 caused ambiguity. From the separation of the Adh isoenzymes by PAGE under native conditions followed by determination of enzyme activity (Williamson et al., 1980
),
we concluded that in oleate-grown cells the tags were derived from the
ADH2 gene; the tags from glucose-grown cells originated from the ADH1 gene (our unpublished data). Another imperfection
of SAGE is that it cannot detect genes that lack a recognition site for
the anchoring enzyme (NlaIII). An example is the TPI1 gene, which functions in glycolysis and is highly expressed in glucose-grown cells (Alber and Kawasaki, 1982
).
When we compared both tag lists and limited ourselves to statistically
significant changes (see MATERIALS AND METHODS and Figure
6 for a discussion of our statistical
analysis), we observed that the number of mRNAs that was clearly
different under both conditions was rather small: ~100. This
comprised only 2% of the total number of genes in the genome; however,
it contributed a substantial portion of the total number of 15,000 mRNAs per cell: >20%. Most of these mRNAs that significantly increase
in oleate-grown cells encoded enzymes required for adaptation of
metabolism to the new carbon source. This confirms the dictum of
Christian de Duve (1984
) in his book A Guided Tour of the Living
Cell: "The internal affairs of a living cell are mainly
concerned with biogenesis and energy production." The genes encoding
these enzymes were under very strict control. Many of the abundant
mRNAs in oleate-grown cells were almost or totally absent in the
glucose-grown cells.
|
We were rather surprised to find that mRNAs coding for proteins
functioning in the maintenance of cellular structures, for instance
proteins that support trafficking of proteins in the cell, were present
at very low numbers of copies per cell in both growth conditions. This
holds even for cases in which a specific alteration takes place when
cells are faced with another carbon source. In glucose-grown cells only
a few small peroxisomes are present; in oleate-grown cells, however,
their number and volume are strongly increased. But also in the latter
case the peroxins (proteins involved in the biogenesis of peroxisomes)
were represented with exceptionally low numbers of copies per cell.
Considering this strict rule, it is remarkable that Pex11p forms an
exception. Induction and transcriptional control of the
PEX11 gene resembles that of genes coding for enzymes
involved in fatty acid metabolism. Indeed, preliminary experiments
suggest that Pex11p function is related to transport of metabolites
across the peroxisomal membrane (van Roermund, Hettema, and Tabak,
unpublished observations) rather than to proliferation of peroxisomes
(Erdmann and Blobel, 1995
; Passreiter et al., 1998
). Another
case illustrating the low copy number of certain mRNAs concerns the
transcription factors involved in peroxisome proliferation. We
previously reported that Oaf1p and Pip2p form a heterodimeric complex
that binds to OREs, present in many promoters of genes coding for
peroxisomal enzymes (Rottensteiner et al., 1997
). The
OAF1 gene is constitutively expressed, but the
PIP2 gene is induced by autoregulation in oleate-containing growth medium. Both mRNAs were present at only very low levels, and
the number of tags collected in this study was too low to detect a
statistically significant induction of PIP2 mRNA. However, although we were not able to monitor alterations in the level of
transcription factors or, for that matter, in components of signal
transduction routes, the changes of expression levels of the target
genes evoked by the action of these components were clearly
demonstrated. Thus, 10,000 tags were sufficient to visualize the
changes in mRNAs coding for components of metabolism.
The analysis of the SAGE library from pip2/oaf1 mutant cells provided a valuable tool to identify new genes involved in peroxisomal functioning. In addition, the attempts of the mutant cells to adapt to the absence of efficient metabolism of fatty acids were illustrated by elevated levels of mRNAs for alternative pathways, such as the uptake and metabolism of proline (PUT1 and PUT4) and the uptake of dicarboxylic acids (JEN1 and DIP5). The stressful condition was reflected in the induction of genes encoding heat shock and stress proteins.
The ability to visualize the behavior of each gene with respect to its
contribution to cellular life is of particular interest to study the
interactions and ways of communication between the major compartments
of the cell. The
-oxidation of fatty acids takes place in
peroxisomes, but further metabolism of its end products, NADH and
acetyl-CoA, requires the participation of other compartments of the
cell, particularly cytosol and mitochondria. The impermeability of the
peroxisomal membrane to small molecules requires dedicated transporters
in the membrane, comparable with the situation in mitochondria.
Candidate genes coding for such proteins can be traced by screening the
tag list for genes that are induced on oleate and code for proteins
with multiple membrane spans or otherwise hydrophobic character (e.g.,
Pex11p; see above). We are currently determining the cellular
localization of several of such proteins and studying the ones confined
to peroxisomes in more detail.
Acetyl-CoA is in part converted to succinate via the glyoxylate cycle
or to glucose via the gluconeogenesis pathway. Our SAGE analysis
confirmed these predictions convincingly. We have argued that
peroxisomal malate dehydrogenase (Mdh3p) is used to regenerate NAD+ for continuation of
-oxidation, rather than being
an intrinsic part of the glyoxylate cycle (van Roermund et
al., 1995
). The simultaneous induction of both MDH3 and
MDH2 (the latter encoding cytosolic malate dehydrogenase)
reinforces the proposal of a malate/oxaloacetate shuttle to transfer
reduction equivalents across the peroxisomal membrane. The strong
induction on oleate of the GUT2 gene, encoding glycerol-3-phosphate dehydrogenase located at the mitochondrial inner
membrane, suggests how mitochondria can tap part of the cytosolically
delivered NADH for production of ATP.
To degrade polyunsaturated fatty acids, double bonds at even positions
must be relocated to uneven positions in a reductive process requiring
NADPH (Gurvitz et al., 1997
). The induction on oleate of the
genes encoding peroxisomal and cytosolic isocitrate dehydrogenase
(IDP3 and IDP2, respectively) suggests that an
isocitrate/2-oxoglutarate shuttle exists to expedite the transfer of
reduction equivalents from the cytosol into the peroxisomes to maintain
the level of NADPH (Henke et al., 1998
; van Roermund
et al., 1998
).
Large amounts of information can already be abstracted from a comparison of steady-state conditions: glucose-grown, oleate-grown, and transcription factor-deficient cells. For more detailed insight in the dynamic transition of one state into the other, the SAGE technique is less suitable because of its low throughput capacity. In that respect, the application of the DNA microarray technique is more promising. However, SAGE data are very robust and quantitative. Dynamic ranges >200-fold in gene expression can be determined. Furthermore, by obeying some basic rules for data management, tag lists obtained by different research groups can be integrated into larger databases, and the increasing tag numbers can further improve the statistical confidence in the analysis.
Evaluating our comparison of transcriptomes of yeast cells grown on different carbon sources, we expect that detailed knowledge of metabolism and its change to altering conditions can be gained from such studies. In addition, these studies can provide clues to discover functions of novel genes identified in sequencing projects. Considering the strong conservation of biological principles during evolution, these genome-wide model studies in yeast may help to understand certain pathological conditions in human.
| |
ACKNOWLEDGMENTS |
|---|
We are indebted to Victor Velculescu and Kenneth Kinzler for
generously providing SAGE software, protocols, and data from glucose-grown cells and for advice during the course of this project. We thank Werner Mewes for support in data analysis, Ted Young for
advice on discriminating between alcohol dehydrogenases I and II, and
Piet Borst, Ewald Hettema, Fred Meijer, Ton Muijsers, Ronald Plasterk,
Carlo van Roermund, and Ron Wanders for stimulating discussions. This
project was financially supported by the Netherlands Foundation for
Chemical Research (Stichting Scheikundig Onderzoek Nederland)-Netherlands Foundation for Scientific Research
(Nederlandse Organisatie voor Wetenschappel
k Onderzoek)
and by the European Functional Analysis Network.
| |
FOOTNOTES |
|---|
Online version of this article contains a complete data
set. Online version available at www.molbiolcell.org.
Present address: Gene Expression Control
Laboratory, Imperial Cancer Research Fund, 44 Lincoln's Inn Fields,
London WC2A 3PX, United Kingdom.
Present address: Introgene BV, 2333 AL
Leiden, The Netherlands.
a Corresponding author. E-mail address: H.F.Tabak{at}amc.uva.nl.
| |
ABBREVIATIONS |
|---|
Abbreviations used: c/c, copies per cell; CoA, coenzyme A; EMBL, European Molecular Biology Laboratory; ORE, oleate response element; SAGE, Serial Analysis of Gene Expression.
| |
REFERENCES |
|---|
|
|
|---|
a database for protein sequences, homology data and yeast genome information.
Nucleic Acids Res.
25, 28-30
involvement of Arf and coatomer.
J. Cell Biol.
141, 373-383This article has been cited by other articles:
![]() |
L. Hood, L. Rowen, D. J. Galas, and J. D. Aitchison Systems biology at the Institute for Systems Biology Briefings in Functional Genomics, July 1, 2008; 7(4): 239 - 248. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. V. Karpichev, J. M. Durand-Heredia, Y. Luo, and G. M. Small Binding Characteristics and Regulatory Mechanisms of the Transcription Factors Controlling Oleate-responsive Genes in Saccharomyces cerevisiae J. Biol. Chem., April 18, 2008; 283(16): 10264 - 10275. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. D. Robinson and G. K. Smyth Small-sample estimation of negative binomial dispersion, with applications to SAGE data Biostat., April 1, 2008; 9(2): 321 - 332. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Norambuena, R. Malig, and F. Melo SAGExplore: a web server for unambiguous tag mapping in serial analysis of gene expression oriented to gene discovery and annotation Nucleic Acids Res., July 13, 2007; 35(suppl_2): W163 - W168. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. L. Tai, I. Snoek, M. A. H. Luttik, M. J. H. Almering, M. C. Walsh, J. T. Pronk, and J.-M. Daran Correlation between transcript profiles and fitness of deletion mutants in anaerobic chemostat cultures of Saccharomyces cerevisiae Microbiology, March 1, 2007; 153(3): 877 - 886. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kerssen, E. Hambruch, W. Klaas, H. W. Platta, B. de Kruijff, R. Erdmann, W.-H. Kunau, and W. Schliebs Membrane Association of the Cycling Peroxisome Import Receptor Pex5p J. Biol. Chem., September 15, 2006; 281(37): 27003 - 27015. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Kauffman, B. A. Ogunnaike, and J. S. Edwards Designing experiments that aid in the identification of regulatory networks Briefings in Functional Genomics, February 1, 2006; 4(4): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pylouster, C. Senamaud-Beaufort, and T. E. Saison-Behmoaras WEBSAGE: a web tool for visual analysis of differentially expressed human SAGE tags Nucleic Acids Res., July 1, 2005; 33(suppl_2): W693 - W695. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Sengoelge, W. Luo, D. Fine, A. M. Perschl, W. Fierlbeck, A. Haririan, J. Sorensson, T.-U. Rehman, P. Hauser, J. S. Trevick, et al. A SAGE-based comparison between glomerular and aortic endothelial cells Am J Physiol Renal Physiol, June 1, 2005; 288(6): F1290 - F1300. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Lin, J. T. White, W. Lu, T. Xie, A. G. Utleg, X. Yan, E. C. Yi, P. Shannon, I. Khrebtukova, P. H. Lange, et al. Evidence for the Presence of Disease-Perturbed Networks in Prostate Cancer Cells by Genomic and Proteomic Analyses: A Systems Approach to Disease Cancer Res., April 15, 2005; 65(8): 3081 - 3091. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. A. Stolovitzky, A. Kundaje, G. A. Held, K. H. Duggar, C. D. Haudenschild, D. Zhou, T. J. Vasicek, K. D. Smith, A. Aderem, and J. C. Roach Statistical analysis of MPSS measurements: Application to the study of LPS-activated macrophage gene expression PNAS, February 1, 2005; 102(5): 1402 - 1407. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. D. Antonenkov, R. T. Sormunen, and J. K. Hiltunen The rat liver peroxisomal membrane forms a permeability barrier for cofactors but not for small metabolites in vitro J. Cell Sci., November 1, 2004; 117(23): 5633 - 5642. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Saldanha, M. J. Brauer, and D. Botstein Nutritional Homeostasis in Batch and Steady-State Culture of Yeast Mol. Biol. Cell, September 1, 2004; 15(9): 4089 - 4104. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Whitsett, C. J. Bachurski, K. C. Barnes, P. A. Bunn Jr., L. M. Case, D. N. Cook, D. Crooks, M. W. Duncan, L. Dwyer-Nield, R. C. Elston, et al. Functional Genomics of Lung Disease Am. J. Respir. Cell Mol. Biol., August 1, 2004; 31(2/S1): S1 - S81. [Full Text] [PDF] |
||||
![]() |
X. Wang, M. A. McMahon, S. N. Shelton, M. Nampaisansuk, J. L. Ballard, and J. M. Goodman Multiple Targeting Modules on Peroxisomal Proteins Are Not Redundant: Discrete Functions of Targeting Signals within Pmp47 and Pex8p Mol. Biol. Cell, April 1, 2004; 15(4): 1702 - 1710. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Wu, N. Zhang, A. Hayes, K. Panoutsopoulou, and S. G. Oliver Global analysis of nutrient control of gene expression in Saccharomyces cerevisiae during growth and starvation PNAS, March 2, 2004; 101(9): 3148 - 3153. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ohlmeier, A. J. Kastaniotis, J. K. Hiltunen, and U. Bergmann The Yeast Mitochondrial Proteome, a Study of Fermentative and Respiratory Growth J. Biol. Chem., February 6, 2004; 279(6): 3956 - 3979. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Chambers, A. Issaka, and S. P. Palecek Saccharomyces cerevisiae JEN1 Promoter Activity Is Inversely Related to Concentration of Repressing Sugar Appl. Envir. Microbiol., January 1, 2004; 70(1): 8 - 17. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Divina and J. Forejt The Mouse SAGE Site: database of public mouse SAGE libraries Nucleic Acids Res., January 1, 2004; 32(90001): D482 - 483. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Fizames, S. Munos, C. Cazettes, P. Nacry, J. Boucherez, F. Gaymard, D. Piquemal, V. Delorme, T. Commes, P. Doumas, et al. The Arabidopsis Root Transcriptome by Serial Analysis of Gene Expression. Gene Identification Using the Genome Sequence Plant Physiology, January 1, 2004; 134(1): 67 - 80. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. T. Young, K. M. Dombek, C. Tachibana, and T. Ideker Multiple Pathways Are Co-regulated by the Protein Kinase Snf1 and the Transcription Factors Adr1 and Cat8 J. Biol. Chem., July 3, 2003; 278(28): 26146 - 26158. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Ruijter, A. H. C. Van Kampen, and F. Baas Statistical evaluation of SAGE libraries: consequences for experimental design Physiol Genomics, October 29, 2002; 11(2): 37 - 44. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Koerkamp, M. Rep, H. J. Bussemaker, G. P.M.A. Hardy, A. Mul, K. Piekarska, C. A.-K. Szigyarto, J. M. T. de Mattos, and H. F. Tabak Dissection of Transient Oxidative Stress Response in Saccharomyces cerevisiae by Using DNA Microarrays Mol. Biol. Cell, August 1, 2002; 13(8): 2783 - 2794. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Smith, M. Marelli, R. H. Christmas, F. J. Vizeacoumar, D. J. Dilworth, T. Ideker, T. Galitski, K. Dimitrov, R. A. Rachubinski, and J. D. Aitchison Transcriptome profiling to identify genes involved in peroxisome assembly and function J. Cell Biol., July 22, 2002; 158(2): 259 - 271. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Ruprich-Robert, V. Berteaux-Lecellier, D. Zickler, A. Panvier-Adoutte, and M. Picard Identification of Six Loci in Which Mutations Partially Restore Peroxisome Biogenesis and/or Alleviate the Metabolic Defect of pex2 Mutants in Podospora Genetics, July 1, 2002; 161(3): 1089 - 1099. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. H. Margulies, S. L. R. Kardia, and J. W. Innis Identification and prevention of a GC content bias in SAGE libraries Nucleic Acids Res., June 15, 2001; 29(12): e60 - e60. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Robert-Nicoud, M. Flahaut, J.-M. Elalouf, M. Nicod, M. Salinas, M. Bens, A. Doucet, P. Wincker, F. Artiguenave, J.-D. Horisberger, et al. Transcriptome of a mouse kidney cortical collecting duct cell line: Effects of aldosterone and vasopressin PNAS, February 15, 2001; (2001) 51603198. [Abstract] [Full Text] |
||||
![]() |
C. B. Epstein, J. A. Waddle, W. Hale IV, V. Davé, J. Thornton, T. L. Macatee, H. R. Garner, and R. A. Butow Genome-wide Responses to Mitochondrial Dysfunction Mol. Biol. Cell, February 1, 2001; 12(2): 297 - 308. [Abstract] [Full Text] |
||||
![]() |
R. Chrast, H. S. Scott, M. P. Papasavvas, C. Rossier, E. S. Antonarakis, C. Barras, M. T. Davisson, C. Schmidt, X. Estivill, M. Dierssen, et al. The Mouse Brain Transcriptome by SAGE: Differences in Gene Expression between P30 Brains of the Partial Trisomy 16 Mouse Model of Down Syndrome (Ts65Dn) and Normals Genome Res., December 1, 2000; 10(12): 2006 - 2021. [Abstract] [Full Text] |
||||
![]() |
C. W.T. van Roermund, H. F. Tabak, M. van den Berg, R. J.A. Wanders, and E. H. Hettema Pex11p Plays a Primary Role in Medium-Chain Fatty Acid Oxidation, a Process That Affects Peroxisome Number and Size in Saccharomyces cerevisiae J. Cell Biol., August 7, 2000; 150(3): 489 - 498. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Haurie, M. Perrot, T. Mini, P. Jeno, F. Sagliocco, and H. Boucherie The Transcriptional Activator Cat8p Provides a Major Contribution to the Reprogramming of Carbon Metabolism during the Diauxic Shift in Saccharomyces cerevisiae J. Biol. Chem., January 5, 2001; 276(1): 76 - 85. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Robert-Nicoud, M. Flahaut, J.-M. Elalouf, M. Nicod, M. Salinas, M. Bens, A. Doucet, P. Wincker, F. Artiguenave, J.-D. Horisberger, et al. Transcriptome of a mouse kidney cortical collecting duct cell line: Effects of aldosterone and vasopressin PNAS, February 27, 2001; 98(5): 2712 - 2716. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||