|
|
|
|
Vol. 10, Issue 8, 2703-2734, August 1999
Laboratory of Molecular Pharmacology, Division of Basic Sciences, National Cancer Institute, Bethesda, Maryland 20892
Submitted February 16, 1999; Accepted May 27, 1999| |
ABSTRACT |
|---|
|
|
|---|
Eventually to understand the integrated function of the cell cycle regulatory network, we must organize the known interactions in the form of a diagram, map, and/or database. A diagram convention was designed capable of unambiguous representation of networks containing multiprotein complexes, protein modifications, and enzymes that are substrates of other enzymes. To facilitate linkage to a database, each molecular species is symbolically represented only once in each diagram. Molecular species can be located on the map by means of indexed grid coordinates. Each interaction is referenced to an annotation list where pertinent information and references can be found. Parts of the network are grouped into functional subsystems. The map shows how multiprotein complexes could assemble and function at gene promoter sites and at sites of DNA damage. It also portrays the richness of connections between the p53-Mdm2 subsystem and other parts of the network.
| |
INTRODUCTION |
|---|
|
|
|---|
The complexity of the molecular interactions implicated in cell regulatory networks challenges human comprehension. Current diagrams of molecular interactions are often ambiguous or incomplete. The preparation of more comprehensive regulatory network diagrams is both difficult and urgent. The difficulties are not due merely to the large number of reactions, because this is a feature also of the familiar metabolic pathway diagrams. They are due rather to complexities that rarely occur in classical pathway diagrams, such as multisubunit complexes, protein modifications, enzymes that are modified by other enzymes, and protein domains whose function is regulated by other domains of the same molecule. The present work describes and applies a diagram method designed to cope with these kinds of complexity.
Why do we need molecular interaction maps? First, it is often difficult to keep in mind all of the known interactions that may be pertinent to a particular experimental or theoretical question, and a molecular interaction map can be used in much the same way as a road map or electronic circuit diagram. Second, molecular interaction maps can suggest new interpretations or questions for experiment. Third, the act of preparing a molecular interaction map imposes a discipline of logic and critique to the formulation of functional models. Finally, the diagram convention provides a shorthand for recording complicated findings or hypotheses.
Another kind of difficulty in preparing useful maps is the incompleteness and uncertainty of knowledge, as well as the limited scope of applicability of some interactions. An important aspect of molecular interaction maps, as described here, is that they are linked to an annotation list that summarizes current information relevant to particular interactions and provides references. A molecular interaction map can therefore function also as a review. The maps can be updated interactively via the Internet and thus can provide a current summary of an area.
The current work describes the mapping conventions and uses them to build a molecular interaction map of the circuitry that governs the mammalian cell cycle and DNA repair machinery. Updated and corrected versions of the map will be accessible at the internet address discovery.nci.nih.gov.
| |
MAP CONVENTIONS |
|---|
|
|
|---|
Desirable Characteristics of Molecular Interaction Maps
A major consideration in the design of the diagram conventions was to facilitate tracing all the known interactions of any given molecular species. Accordingly, each molecular species should ideally appear only once in a diagram, and all interactions involving that species should emanate from a single symbolic construct. A second major consideration was a concise method to represent multimolecular complexes. Multimer proteins are common components of regulatory systems and sometimes function in large-scale multimolecular assemblies. Therefore, an extensible representation of such complexes was a fundamental requirement. A third major consideration was the representation of protein modifications, such as phosphorylations. One must be able to represent various modifications of a protein by unique graphical constructs. Meeting these goals simultaneously is a significant challenge.
Additionally, one must be able to show the actions or effects of each molecular species or interaction, including enzyme action and stimulation and inhibition of activity or binding. Often there are many interaction or modification sites having diverse effects on function. The potential number of modification-multimerization combinations is staggering, and we have barely begun to explore this vast domain experimentally. Representation of all of these possible combinations in a single diagram is obviously impractical. Nonetheless, it is important to be able to represent any combinations that may be significant.
Symbols and Rules
Because each molecular species in a diagram ideally should appear
only once, interactions must be indicated by several types of
lines connecting the species. The different types of interaction lines
are distinguished by different kinds of arrowheads or other line
endings, as summarized in Figure 1.
|
Multimolecular Complexes
Noncovalent binding between molecular species is indicated by
lines terminated at both ends by barbed arrowheads. Thus noncovalent binding in most cases is represented symmetrically. In some cases, however, it is useful to distinguish a protein that has a receptor site
from a protein that donates a peptide binding to the site, for example
the binding of an N-terminal peptide of p53 to a pocket in Mdm2 (Kussie
et al., 1996
). The receptor end of an interaction line can
be represented by a double-barbed arrowhead (Figure 1). This notation
can serve to indicate targets of opportunity for pharmacological
intervention. Noncovalent binding is generally assumed to be
reversible; when binding is unusually tight, the interaction line can
be drawn heavier.
Having defined an intermolecular binding symbol, we next need a representation of the complex itself. This is accomplished by placing a small filled circle (or "node") on the connecting line. An action of the complex can then be represented by an appropriate type of line emanating from the node. See, for example, the enzymatic action line emanating from the node representing CycD:Cdk4 in Figure 5 (reaction 5), enzymatic action being indicated by the open circle at the end of the line, in accord with the symbol definition table (Figure 1).
Multiple actions of a complex can be depicted conveniently by using multiple nodes on the same line; each node then refers to exactly the same complex. See, for example, the two occurrences of node a in Figure 3.
Thus only the monomolecular species are indicated by name, and the identity of the complexes is determined by tracing the connecting lines back to the monomolecular units.
To represent alternative or competitive binding of different proteins
at the same site, the lines from the competing proteins are merged
before connecting to the site. Figure 2
illustrates the conciseness and flexibility of this representation.
Each possible dimer (a, c, e, or g), or dimer pair for a given monomer
(b, d, or f), is defined by specific placement of a node. Effects
specific to any combination of interactions therefore can be
represented unambiguously. For example, the actions of Cdk1 that occur
regardless of whether the partner is Cyclin A or B would be indicated
by an action line emanating from node f.
|
An important feature of the line-and-node representation is that it is
extensible. Binding interactions involving a node are represented in
the same way as binding interactions involving a monomer. To see how
this works, consider the example in Figure 3, which represents the interactions of
E2F1, DP1, pRb, and an E2 promoter element. Each of the naturally
occurring molecular combinations of these monomers is indicated by a
node, the promoter element being treated like a monomer species. The
two filled circles labeled a, being on the same line segment, represent
exactly the same molecular species, namely E2F1:DP1. The other node
representations are as follows: b, E2F1:DP1:pRb; c, E2F1:DP1 bound to
promoter element E2; and d, E2F1:DP1:pRb bound to E2. Also indicated is transcriptional stimulation or inhibition occurring when the promoter element is occupied by E2F1:DP1 (node c) or E2F1:DP1:pRb (node d),
respectively. (See Figure 1 for definitions of stimulation and
inhibition symbols. Also, note that connecting lines may change in
direction, for example, with right-angle bends, and that crossing lines
do not affect each other.) Using this type of scheme, most multimolecular contingencies can readily be depicted.
|
A complex containing multiple copies of the same monomer can be
represented concisely by use of a ditto symbol, consisting of an
isolated filled circle at the end of a single connecting line. See, for
example, the representation of a homodimer in the lower part of Figure
1. The notation can be extended to higher homopolymers, as illustrated
in Figure 4 for tetramerization of p53:
three nodes are placed side by side to denote the three additional copies that, together with the identified monomer, make up the tetramer. This example shows the requirement of p53 tetramerization for
binding to promoter elements and for phosphorylation of p53 at Ser15.
|
A given protein often can form stable complexes simultaneously with two or more different protein molecules. These are of special interest because the interactions can extend to form large-scale, functionally integrated multiprotein assemblies. When there is an extensive chain of interactions, however, it is often unknown how local interactions in different parts of the chain affect each other. The map conventions allow the binary interactions to be depicted without specifying all of the possible influences between different parts of a chain of interactions while at the same time specifying those influences that are known.
Covalent Modifications
Most regulatory proteins are subject to a multiplicity of modifications, especially phosphorylations, that alter function. This presents a severe challenge to any diagram method. The difficulty is further increased when combinations of phosphorylations must be considered.
A phosphorylation (or other covalent modification) is represented by a line with a single barbed arrowhead that points to the modified protein (Figure 1). The multiplicity of phosphorylations and acetylations of p53 are included in the comprehensive diagram, Figure 6B. The modifications are arrayed along the length of the elongated, pill-shaped p53 outline from the N terminus on the left to the C terminus on the right. The amino acid positions of the modification sites are indicated. A node on a single-barb-arrowed line represents the protein modified at that site. The effects of a given modification on intra- or intermolecular actions are indicated by interaction lines emerging from the node.
Although the effects of multiple modifications on one another are
largely unknown, some important interactions among modifications within
the same protein molecule have been defined. To represent combinations
of modifications, we need additional symbols. We use a nonarrowed
connecting line to represent joint modifications; a node on such a
connecting line represents the protein having the combined
modifications. An example of this situation is provided by the
phosphorylation states of pRb (Figure 5).
pRb is multiply phosphorylated by CycD:Cdk4 and by CycE:Cdk2, but
several of the sites differ. It seems that phosphorylation by CycD:Cdk4
is required before CycE:Cdk2 can phosphorylate its specific sites, and
that both kinases are required to fully impair the inhibitory binding of pRb to E2F:DP complexes.
|
Multidomain Structures
Regulatory proteins often are composed of structural domains having different functions. This multidomain organization can integrate interactions with other molecules or communicate functions within a given protein molecule. To depict functional localization within a protein molecule, a horizontal, pill-shaped outline is used rather than an oval. Localized interactions or modifications proceeding from the N terminus on the left to the C terminus on the right are marked along the upper and/or lower borders of the pill shape (e.g., p53 in Figure 6B). When the locale of an interaction is unknown, it is marked at either end of the pill shape.
| |
MOLECULAR INTERACTION MAP |
|---|
|
|
|---|
Figure 6 presents a comprehensive
molecular interaction map of regulators of cell cycle and DNA repair
processes. The current map, however, is limited to events in the
mammalian cell nucleus. Because of limits on what can legibly be
formatted onto a journal page, the map is divided into two parts.
Figure 6A maps the interactions involving E2F, pRb, Cyclin, and Cdk
family members, their activators and inhibitors, as well as some
important interactions with other components. Figure 6B focuses on the
p53-Mdm2 subsystem and on subsystems related to DNA repair. Molecular
components are grouped in putative subsystems according to mutual
interactions or functional coherence. It remains to be determined
whether subsystems can be identified on the basis of objective
criteria.
|
The monomolecular species included in Figure 6 are listed
alphabetically in an index (Table 1),
which gives grid coordinates to help locate each species.
|
Some of the interaction patterns shown in the map deserve special comment. References are cited in the annotation list and can be found using the identifiers marked on the interaction lines in the map. These identifiers are italicized letter-number combinations, which are used also in the following text to specify particular interactions on the map.
We are concerned here primarily with molecular interactions rather than biological effects. The latter may eventually be understood as emerging from the former.
The E2F-pRb Box
The possible occupancy states of an E2F recognition element in a promoter comprise a multiplicity of complex patterns. The method of representing these patterns was shown by the simplified example in Figure 3. The simplest arrangement would be an E2F:DP heterodimer bound to an E2 element (E5). (The italicized letter-number combinations identify particular interactions in Figure 6 and refer to entries in the annotation list where references are cited.) Because promoters generally have two or more E2F recognition elements, however, the actual situation even in this case may be far from simple. Most E2F family proteins can activate transcription (E6) by way of a transactivation domain. E2F-6, however, has instead a transcription repressor domain (E9). E2F:DP heterodimers (other than those of E2F-6) can tether pRb family proteins onto promoters, thereby converting a potential gene activator into a repressor (E7). E2F in fact more often functions to repress rather than activate.
E2F proteins (as heterodimers with a DP protein [E1]) have individual binding preferences for pRb family members. The E2F-pRb box shows that pRb prefers E2F-1-, -2-, or -3-containing heterodimers (E2) (with lesser interaction with E2F-4), p107 binds only E2F-4 heterodimers, and p130 can bind heterodimers of E2F-4 or 5 (E3,4). All of these complexes can bind and repress E2F recognition elements (E6). E2F-6, however, does not bind any pRb family protein. E2F-1, -2, and -3 are marked together as a unit on the map, because of their mutual preference for pRb. Although there are significant functional distinctions between them, the molecular bases for these differences is not known. No major functional differences among different DP family members have been defined.
A further level of complexity arises from the ability of some E2F complexes to bind other transcription factors, such as Sp1, and to synergize the transcriptional activation.
Another mode of regulation arises from the ability of histone deacetylase (HDAC1) to bind to pRb:E2F:DP-type complexes (E13). HDAC1 could deactivate transcription that has been enhanced by histone acetylation (H1) and thus could contribute to gene down-regulation by pRb family proteins.
pRb sometimes functions as a gene activator rather than repressor. For example, it can activate the Jun family and CCAAT/enhancer-binding protein (C/EBP) transcription factors (E16,17). The detailed mechanism by which this occurs is not known.
Taken together, the interactions noted on the map add up to perhaps 20 different possible states of an individual E2F recognition element.
The interaction capabilities of the constituent proteins are subject to modulation by phosphorylation, protein binding, and regulated degradation. In addition, regulated nuclear-cytoplasmic transport is emerging as an important process. Translocations can be represented as indicted in Figure 1 but, because of space limitations, have been omitted from the current version of the map. pRb is subject to different sets of multiple phosphorylation by CycD:Cdk4/6 (C31) and cycE:Cdk2 (C32). The manner of depiction of these phosphorylations and their effects was explained in Figure 5. Only the fully phosphorylated pRb is impaired with respect to E2F binding (C33). The E2F binding of p107 and p130 is also inhibited by phosphorylation (E11). Phosphorylation of E2F and/or DP by CycA:Cdk2 (which forms stable complexes with E2F-1) (E20) inhibits the E2F-DP interaction and could serve to turn off E2F function when cycA accumulates late in S phase.
pRb family proteins may also be inhibited by binding to other proteins, such as Raf-1 (E22). This may be one of the logical connections, suggested by current work, which may communicate signals from the cell surface to the cell cycle control circuitry.
The Cyclin-Cdk Box
The activity of Cdks is intricately regulated. To begin with, Cdk activity requires binding to a Cyclin. The map shows Cdk4 or Cdk6 (which have the same molecular interactions) binding to CycD (C3), Cdk2 binding to Cyclins E or A (C4), and Cdk1 (also known as Cdc2) binding to Cyclins A or B (C5). (The subtypes of CycB and CycD are not differentiated here.)
A second class of controls on Cdks are stimulatory and inhibitory phosphorylations, which are controlled by several kinases and phosphatases noted on the map. All Cdks are activated by phosphorylation of Thr160 (or 161), carried out by CycH:Cdk7 (C14), which functions also as a constituent of the transcription factor IIH (TFIIH) complex. Cdk1, and to some extent other Cdks, can be inhibited by phosphorylations corresponding to Thr14 and/or Tyr15 (C17,19,20). These sites are phosphorylated by Wee1 or Myt1 (C16). In the case of Cdk1, these inhibitory phosphorylations are removed by dual-action phosphatase Cdc25C, which is in turn activated by phosphorylations (C18) introduced by mammalian polo-like kinase 1 (Plk1) (C37) and/or Cdk1 (C36). Cdc25C can be phosphorylated at Ser216 by Chk1 (C38) or C-TAK1 (C39). Ser216 phosphorylation generates a binding site for 14-3-3, and this binding inhibits the phosphatase (C40).
A positive feedback loop that can be traced on the map consists of just two components: Cdc25C and CycB:Cdk1. Cdc25C is activated by hyperphosphorylation (C18); activated Cdc25C dephosphorylates Thr14 and Tyr15 of Cdk1, thereby removing the inhibitory effect of these phosphates on the kinase (C17) and increasing the activating phosphorylation of Cdc25C (C36). This positive feedback could help produce switch-like behavior and may operate in the G2 to M cell cycle phase transition.
A third class of controls acts through the binding of specific Cyclin:Cdk inhibitors, including p16ink4a, p21cip1, p27kip1, and p57kip2. p16ink4a inhibits by binding Cdk4/6 in competition with cycD (C8). p21cip1, p27kip1, and p57kip2 can bind Cyclin complexes of Cdk4/6 and Cdk2 (C7,23). There may be an additional complication, however, because p21 can stabilize and enhance the activity of cycD:Cdk4 when a single p21 molecule is bound but can inhibit the same activity when a second p21 molecule binds to the complex (C22). p27 can be phosphorylated by the kinase it inhibits, CycE:Cdk2 (C21). This seemingly paradoxical relationship might be due to intermolecular action of an active CycE:Cdk2 on an inactive CycE:Cdk2:p27 complex.
The Cyclin:Cdk system can interact with elements of the DNA replication and repair systems through binding of p21 (R6) or Cyclin D (R11) to proliferating cell nuclear antigen (PCNA). This action may also involve Gadd45, which can bind simultaneously to p21 (C34) and PCNA (R10). p21, PCNA, and Gadd45 are all transciptionally activated by p53 (P43,44).
The p53-Mdm2 Box
The map shows the remarkable richness of p53 interconnections and the diversity of functionally determinative p53 modifications. Eleven phosphorylation or acetylation sites (or groups of sites) for which functionality has been surmised are shown. If all of these could occur independently, there would be ~2000 possible modification states of p53 monomers. Some interdependent modifications have been noted: phosphorylation of Thr18 requires previous phosphorylation of Ser15 (P3) (Appella, personal communication); acetylation of Lys320 requires tetramer structure of p53 and is inhibited by phosphorylation of Ser378 (P23) (Sakaguchi and Appella, personal communication). Other dependencies certainly exist, some perhaps having major functional impact, whereas many could have subtle quantitative effects, which may or may not convey a selective evolutionary advantage. Nevertheless, p53-expressing cells may contain hundreds of different modification states of p53 monomers.
Some p53 modifications and interactions are especially notable. Ser15 appears to be the site of phosphorylation responses to DNA damage signals communicated by way of the kinases ataxia telangiectasia mutated gene/protein (ATM) (P2) and DNA-dependent protein kinase (DNA-PK) (P6). Phosphorylation of Ser18 or Ser20 prevents stable binding to Mdm2 (P5), thus abrogating the Mdm2-mediated inhibition (P29) and degradation (P31) of p53. Because these sites are located within the region required for Mdm2 binding, it is plausible that their phosphorylation could inhibit this interaction. p53 forms a stable complex with p300 (P25), as a result of which p300 acetylates p53 Lys382 (P21). This acetylation, as well as the acetylation of Lys320 by PCAF (P23), enhances the sequence-specific binding of p53 to promoters, probably indirectly by inhibiting nonspecific DNA binding (P22). A similar mechanism of enhanced promoter binding (P16) may occur as a result of binding of the p53 C-terminal region to 14-3-3 (P13), which requires 14-3-3 to be dimerized (P14).
p53 can bind to a number of proteins that are involved in DNA repair functions, cell cycle control, or general control functions. In approximate order of binding location from N to C termini of p53, these include the following: Mdm2 (which has a pocket that binds a p53 N-terminal peptide) (P28); p300 C-terminal region (P25); DP1 (P26); poly(ADP-ribose) polymerase (PARP) (P46); c-Abl (A4); replication protein A (RPA) (S6); high-mobility group protein (HMG) (P52); TFIIH constituent helicases xeroderma pigmentosum complementation group B (XPB) and XPD and DNA repair protein CSB (P27); p19ARF (P40); p300 N-terminal region (P33); BRCA1 (P47); and 14-3-3 (P13). Some of these interactions (p300, BRCA1, and 14-3-3) stimulate and some (Mdm2, PARP, and RPA) inhibit the transcriptional activity of p53. (Stimulations may be indirect: 14-3-3 may block nonspecific binding of p53 to DNA [P16]; p300 may do the same consequential to acetylation of K382 [P21,22].)
p53 can form homotetramers and must be in tetramer form for sequence-specific DNA binding and transcriptional activation. The map shows the dependencies relating to tetramers; tetramerization is stimulated by phosphorylation of Ser392, and this enhancement can be inhibited by phosphorylation of Ser315 (P17). The ability to form tetramers is further modulated by other modifications and protein interactions. Influence on tetramerization may be how p53 transcriptional activity is stimulated by binding BRCA1 and inhibited by binding Mdm2, PARP, or RPA. This could be the major mechanism of regulation of p53 transcriptional activity. The activation of p53 seems to be exquisitely controlled by a large number of determinative inputs. The transition to active tetramers could be very sharp because of a possible fourth-power dependence on the concentration of tetramerization-competent monomers.
Mdm2 is an intimate part of the p53 control system. Mdm2 contains a pocket that binds a p53 N-terminal peptide (P28). Mdm2 binding blocks the transcriptional activation domain of p53 (P29) and is instrumental in p53 degradation (P31). p53, in turn, transcriptionally up-regulates Mdm2, probably forming a negative feedback loop. Mdm2 can itself activate some genes, such as Cyclin A (P37). In addition to binding p53, Mdm2 reportedly binds E2F1:DP1 (P35), pRb (P35), TATA-binding protein (TBP) (P36), TBP-associated factor II250 (TAFII250) (P37), p19ARF(P34), and p300 (P33). Some of these interactions may compete for the same Mdm2 site, as may be the case for p19ARF and p300. Binding to p53 is abrogated by phosphorylation of Mdm2 on Ser17, perhaps through the kinase activity of DNA-PK (P49).
p19ARF, an alternate reading frame (ARF) product from the ink4a locus that also codes for p16, has recently emerged as an additional player in the p53-Mdm2 system. p19ARF binds to and inhibits the actions of Mdm2 (P34,41). It also can bind to p53 (P40). Moreover, p19ARF is transcriptionally up-regulated by E2F1:DP1 (P42). This link between p53 and E2F1 may be crucial to the control of S-phase and apoptosis.
DNA Repair
The map includes three phases of nucleotide excision repair (NER). The first phase, lesion recognition and local opening of the DNA helix, is carried out by a molecular assembly, which includes the XPC:HR23B heterodimer (N1,2), XPA (N3), and TFIIH (N13). This phase opens the DNA helix in the vicinity of the lesion and allows access to other DNA repair proteins. If the DNA is opened by another process, such as transcription, XPC is dispensable, and repair can begin with the second phase.
The second phase, excision of a short DNA strand segment containing the
lesion, is carried out by an assembly of the XPG and XPF:excision
repair cross-complementing 1 (ERCC1) endonucleases (N6,8,9),
together with XPA, RPA, TFIIH, and PCNA (which can bind XPG)
(R9). This assembly appears to be held together in part
through RPA, which binds to single-stranded DNA (ssDNA) regions in the vicinity of lesion (N10), and at the same time may be able
to bind XPG and XPF:ERCC1 (N7), as well as XPA
(N4). In going from the recognition to the excision phase,
the molecular assembly rearranges as XPC:HR23B is replaced by XPG
(N13) (Wakasugi and Sancar, 1998
).
In the third phase, gap filling, x-ray repair cross-complementing
gene/protein 1 (XRCC1) appears to function as a platform for the
assembly of DNA polymerase
(DPase
) (N20), DNA ligase III (N19), and PARP (N18). This assembly is held
together, in part, via breast cancer protein 1 C-terminal module (BRCT)
modules in the constituent proteins (Masson et al., 1998
).
The binding of PARP by XRCC1 may function to block the further action
of PARP during this phase at a repair site. The assembly of XRCC1 with DPase
and DNA ligase III may also function in the single-nucleotide replacement pathway of base excision repair (Cappelli et
al., 1997
).
Through its binding to DNA single-stranded regions, RPA may also recruit Rad52 and Rad51 to sites of DNA damage (S14,15). Rad51 also binds to ssDNA and, together with RPA, may function in recombinational repair (N11).
Rad51 may also provide links to a network of mutually interacting components via its binding to c-Abl (N12). c-Abl may bind ATM (A1), DNA-PK (B7), pRb (E18), and p53 (A4), although it is not fully determined which of these interactions can occur simultaneously and which are mutually exclusive. Rad51, ATM, and DNA-PK bind to a Src homology 3 (SH3) domain in the c-Abl N-terminal region, whereas pRb and p53 may bind to the c-Abl C-terminal region. c-Abl may regulate Rad51 function by phosphorylating Rad51 on Tyr54, thereby abrogating the direct binding of Rad51 to ssDNA (N12).
Another set of interactions is implicated in the processing of DNA
double-strand breaks. DNA double-strand ends are recognized and bound
by the Ku70:Ku80 heterodimer (Ku) (B1,2), which can recruit
DNA-PK to the site (B3), thereby activating the kinase (B4). DNA-PK, however, can also bind to RPA
(S16). RPA is a heterotrimer (S1,2) that binds
ssDNA regions (S3). DNA-PK can thus be recruited to ssDNA
regions formed transiently at replication forks. DNA-PK may then be
available for interaction with a double-stranded DNA (dsDNA) end, which
could appear in the vicinity as a consequence of replication fork
encounters with open topoisomerase I DNA complexes trapped by drugs
such as camptothecin (Shao et al., 1999
). It is noteworthy
that DNA-PK does not always require Ku for activation, because it can
be activated by tethering to DNA via other molecules, such as chromatin
constituents of the HMG family (B10).
A further capability of RPA could arise from its ability to bind p53 (S6) and from the abrogation of this binding by phosphorylation of the RPA2 subunit by DNA-PK (S11), ATM (S12), or CycA:Cdk2 (S10).
We thus begin to see some of the intricate mechanism of the DNA repair machinery. This example shows how a molecular interaction map can represent DNA-targeted processes and the transitions between multimolecular assemblies.
| |
DISCUSSION |
|---|
|
|
|---|
Biological functions must eventually be understood as arising from
molecular interactions. It is therefore not surprising that the
molecular interaction information so far accumulated forms a highly
complex network whose functional behavior may be difficult to
comprehend. To provide a foundation for eventual understanding of
function, this information must be organized in a manner that allows
integrated behavior to be discerned. Molecular interaction maps such as
those described here could contribute to this goal. The primary
objective here was to suggest how complex molecular interaction
networks can be usefully displayed. The map was constructed from
evidence relating to molecular interactions with the view that the
interaction patterns would suggest biological functions. The complexity
of the map, however, demands that great care be taken in the
formulation of specific functional hypotheses, which may have to be
investigated with the aid of computer simulations (Kohn, 1998
).
The molecular constituents were tentatively grouped into subsystems, demarcated on the map by dashed boxes. Some of the numerous connections between the boxes can be arranged in tracts, perhaps analogous to nerve tracts in the brain or communication buses in computers. A task for the future is to find an objective way to make these groupings, or indeed to determine to what extent nature has designed subsystems within the control network.
The map includes 26 individual p53 modification or binding interactions for which evidence of functionality has been presented. Considering the number of modifications and binding combinations that are possible a priori, the number of different possible states of p53 could be so large as to raise the question of whether, at any given time in a cell, any two p53 molecules would likely be in the same state. It seems plausible, however, that certain combinations of states would be strongly favored under particular circumstances.
p53 appears to be a central focus for the concurrence of signals from many pathways, presumably serving as an integrated information processor. It might, for example, function to test the validity of signaling patterns and, depending on the outcome of the test, to initiate cell cycle arrest or apoptosis. The test however must first be activated by allowing p53 (which normally is rapidly degraded and thus nearly absent) to accumulate in response to inputs involving Mdm2 and/or p19ARF. It appears to be a logic unit that monitors the state of the cell based on the pattern of a large number of inputs.
The molecular interaction map helps one discern possible multiprotein
assemblies, alternative arrangements of which may operate under
different circumstances in a cell or under particular states of cell
differentiation. A recent example and possible paradigm is the large
multiprotein binder and acetyl transferase p300, which may function as
a platform for the assembly of high-order complexes (Grossman et
al., 1998
). Multiprotein interactions may be competitive,
cooperative, or independent of each other, possibly depending on the
modification states of the component molecules. Moreover,
susceptibility to modification may depend on the multimolecular configuration. A given protein may have more than one binding site for
a second protein: p300, for example, has two separated sites for
binding of p53 (Grossman et al., 1998
). Multimolecular complexes thus may have alternative, functionally different
configurations, and covalent modifications could induce configurational
switches. The details of these interdependencies may be critically
important, but only fragmentary information is as yet available.
Promoter regulation can be very complicated, because transcription factors can have positive or negative effects, depending on their environments and mutual interactions. Genes controlled in part by the same transcription factors sometimes exhibit different regulation patterns. Moreover, the interactions among the different transcription factors are only beginning to be elucidated. It was therefore not feasible, in the present diagrams, to show the full regulation pattern of each individual gene. The regulation of the E2F-dependent S-phase genes and of the p53-dependent genes highlight the difficulties. The major known regulatory actions on these genes are shown (Figure 6, A and B), and some of the details regarding effects on individual genes are mentioned in the annotation list.
A diagram convention such as that described here will be needed for the
representation of functional models. These "heuristic" diagrams do
not fully define the contingencies among the interactions. For
simulation of models, however, "explicit" or fully defined diagrams
are needed. Suitable explicit diagrams can be prepared using a subset
of the present symbols (Kohn, 1998
) or using the conventions of
electronic circuitry (McAdams and Shapiro, 1995
).
This exercise has suggested how a large body of molecular interaction data can be organized in a map with associated annotations. Many of the known interactions relating to cell cycle control and DNA repair events in the nucleus were included in the map. Some important areas remain to be added: in particular, the control of nuclear-cytoplasmic shuttling of key regulatory molecules and the signaling pathways from growth factor receptors on the cell surface.
The molecular interaction map will require frequent updating as new information accrues. For this purpose, we plan to put updated versions the map on the Internet (http://discover.nci.nih.gov)
| |
MOLECULAR INTERACTION MAP ANNOTATIONS |
|---|
|
|
|---|
ATM and c-Abl
A1.
ATM is present in a complex that includes c-Abl and Rad51 (Yuan
et al., 1998
).
A2.
ATM phosphorylates c-Abl on Ser465 and thereby activates the
kinase (Baskaran et al., 1997b
).
A3.
Phosphorylation by ATM activates c-Abl (Baskaran et
al., 1997b
). c-Abl tyrosine kinase activity is stimulated
in response to ionizing radiation (IR),
1-
-D-arabinofuransylcytosine, camptothecin, or
etoposide (Yuan et al., 1996
, 1998
).
A4.
c-Abl binds to p53 in response to
1-
-D-arabinofuransylcytosine or methylmethanesulfonate
(Yuan et al., 1996
); this interaction does not require c-Abl
kinase activity. Moreover, c-Abl kinase activity does not require p53.
Binding of c-Abl to p53 inhibits the Mdm2-mediated degradation of p53
(Sionov et al., 1999
).
A5.
c-Abl phosphorylates tyrosines in the C-terminal domain (CTD) of
RNA polymerase II (RPase II; Km = 0.5 µM)
(Baskaran et al., 1993
, 1997a
); the c-Abl SH2 domain
is a specificity determinant for this reaction (Duyster et
al., 1995
).
A6.
The c-Abl C-terminal region can bind the Crk SH3 domain; this
interaction may link c-Abl function to the state of the cell surface
with respect to integrins and focal adhesions (Gotoh and Broxmeyer, 1997
).
A7.
c-Abl binds and tyrosine phosphorylates paxillin in an
adhesion-dependent manner (Lewis and Schwartz, 1998
).
A8.
c-Abl tyrosine kinase activity is blocked by pRb, which binds to
the c-Abl kinase domain (Welch and Wang, 1995
).
A9.
Phosphorylation of pRb disrupts the c-Abl:pRb complex and releases
active c-Abl (Welch and Wang, 1995
).
DNA Strand Break Processing
B1. Ku is a tight heterodimer consisting of Ku70 and Ku80/86.
B2.
Ku loads onto dsDNA ends and can diffuse along the DNA in an
energy-independent manner (deVries et al., 1989
). Ku can
localize internally on dsDNA as well as at dsDNA ends (Yaneva et
al., 1997
). Ku binds to DNA single-strand breaks (ssbs) (Blier
et al., 1993
) and has helicase activity, but ssb-bound Ku
does not activate DNA-PK (Smider et al., 1998
). Ku can also
bind to hairpin-ended DNA without activating DNA-PK (Smider et
al., 1998
).
B3.
Ku binds to the C-terminal region of DNA-PK (amino acids
3002-3850) near the protein kinase domain (Jin et al.,
1997
). DNA-PK can bind weakly and transiently to dsDNA ends without Ku
(Lieber et al., 1997
; Yaneva et al., 1997
; West
et al., 1998
). In the presence of Ku, however, the binding
is stronger and more stable. DNA-PK and Ku localize adjacent to each
other at dsDNA ends. DNA-PK does not bind detectably to Ku in the
absence of DNA. There is approximately five times more Ku than DNA-PK
in mammalian cells.
B4.
The kinase activity of DNA-PK is stimulated by binding to dsDNA
ends; however, the stimulation is greater in the complex with Ku
(Yaneva et al., 1997
; West et al., 1998
).
B5.
DNA-PK phosphorylates itself, thereby blocking its interaction
with Ku:DNA complex and inhibiting its kinase activity; it also
phosphorylates Ku70 > Ku80, but without effect (Chan and Lees-Miller, 1996
).
B6.
Autophosphorylated DNA-PK dissociates from Ku:DNA (Chan and
Lees-Miller, 1996
).
B7.
The SH3 domain of c-Abl binds to the C-terminal region of DNA-PK
(amino acids 3414-3850) and may compete with Ku for binding to the
same region (Jin et al., 1997
; Kharbanda et al.,
1997
). c-Abl does not bind directly to Ku; however, IR induces the
association of Ku with c-Abl:DNA-PK complex (Kharbanda et
al., 1997
).
B8.
c-Abl phosphorylates DNA-PK in the C-terminal region (amino acids
3414-3850) (Jin et al., 1997
). c-Abl-dependent
phosphorylation of DNA-PK is stimulated by IR (Kharbanda et
al., 1997
). c-Abl does not phosphorylate Ku (Kharbanda et
al., 1997
).
B9.
Phosphorylation of DNA-PK by c-Abl dissociates DNA-PK from Ku:DNA
(Jin et al., 1997
; Kharbanda et al., 1997
).
B10.
HMG1 or 2 competes with Ku for binding to DNA-PK and stimulates
DNA-dependent kinase activity in vitro in the absence of Ku (Yumoto
et al., 1998
).
Cyclin-Cdk Box
C1.
The Cyclin D1 promoter is activated by E2F4, but it is repressed
by E2F1 via pRb (Watanabe et al., 1998
). In pRb-deficient cells, E2F1 stimulates this promoter. An Sp1 site close to the E2F
element also participates in the regulation. (Overexpression of pRb can
increase the expression of Cyclin D1 by an unknown mechanism [Watanabe
et al., 1998
].)
C2.
The Cyclin E and A genes (but not the Cyclin D gene) are strongly
activated by E2F1 (DeGregori et al., 1995
; Shan et
al., 1996
). Further details about cyclin E promoter regulation
have recently been reported (Le Cam et al., 1999
), as
follows. In addition to a constitutively occupied E2F1-Sp1 site
immediately upstream of the cyclin E transcription start region, there
is downstream a cell cycle-regulated site (termed CERM) that may
function as a cyclin E-repressor module. The CERM contains a variant
E2F-recognition element, and binds a complex (termed CERC)
consisting of E2F4, DP1, and either p130 or p107, as well as an
unidentified necessary component.
C3. Cdk4 and Cdk6 bind exclusively to D-type cyclins.
C4. Cdk2 binds to cyclins E and A.
C5. Cdk1 (Cdc2) binds cyclins A and B.
C6.
Cdk1 and 2 bind the small protein Cks1 (Jackman and Pines, 1997
).
Cks1 binds at the C-terminal region of Cdk, which is distinct from the
region that binds cyclins. Cks1 may be involved in the dephosphorylation of Cdk Tyr15.
C7.
Low concentrations of p21Cip1, p27Kip1, or p57Kip2 promote the
binding of Cyclin D to Cdk4 (LaBaer et al., 1997
), although high concentrations are inhibitory.
C8.
p16ink4a competes with Cyclin D1 for binding to Cdk4 (Hang
et al., 1998
).
C9.
Cyclin D1 has a short half-life (<30 min), regardless of whether
free or Cdk4-bound. Rapid degradation of Cyclin D1 requires phosphorylation at threonine-286 (kinase unknown, but not Cdk2 or
Cdk4); degradation is by way of the ubiquitin-proteasome pathway (Diehl et al., 1997
).
C10.
D-type cyclins can bind the myb-like protein DMP1 (Hirai
and Sherr, 1996
). The binding does not require Cdk4/6 (Inoue and Sherr,
1998
).
C11.
DMP1 binds consensus sequences CCCG(G/T)ATGT and activates
transcription (Hirai and Sherr, 1996
).
C11a.
DNA binding is inhibited when DMP1 is bound to cyclin D (Inoue and
Sherr, 1998
). Cyclin D binds at the DNA-binding domain of DMP1
immediately adjacent to the myb repeats (Inoue and Sherr, 1998
).
C11b.
DMP1 activates transcription from the p19ARF promoter and induces
cell cycle arrest and p21Cip1 accumulation in a p19ARF- and
p53-dependent manner (Inoue et al., 1999
). Another
DMP1-regulated gene is CD13/aminopeptidase N, which is
activated cooperatively by DMP1 and c-Myb; its activation by DMP1 is
inhibited by cyclin D independent of Cdk4/6.
C12.
When phosphorylated by Cyclin D-dependent kinases, DMP1 activates
transcription (Hirai and Sherr, 1996
).
C13.
Cyclin E has a half-life of ~30 min. It is degraded by way of
the ubiquitin-proteasome pathway subsequent to phosphorylation (possibly autophosphorylation) at threonine-380 (Clurman et
al., 1996
; Won and Reed, 1996
). However, it can be stabilized by
binding to Cdk2.
C14.
CycH:Cdk7 (also known as Cdk-activating kinase [CAK])
phosphorylates a site on the T-loop of Cdks (Thr161 in human Cdk1,
Thr160 in Cdk2, Thr172 in Cdk4, and Thr170 in Cdk7 itself) and thereby causes the loop to be displaced to allow access to the catalytic site
(Morgan, 1995
). CAK readily phosphorylates Cdk2 monomer, which,
however, remains inactive (Fisher and Morgan, 1994
). This phosphorylation is required for the formation of stable CycA:Cdk1 dimer
but not for the formation of other Cyclin:Cdk dimers (Ducommun et
al., 1991
; Desai et al., 1995
). CAK is localized to the
nucleus (Tassan et al., 1994
).
C15.
Cyclin B binds only to Cdk1. Cyclin B1 is retained in the
cytoplasm (by means of a cytoplasmic retention sequence) until the time
of mitotic prophase when it is abruptly transported into the nucleus
(Pines and Hunter, 1994
). Cyclin B1 is localized to microtubules and
centrosome, whereas Cyclin B2 localizes to the Golgi complex (Jackman
et al., 1995
).
C16.
Cdk1 is phosphorylated at Tyr15 by Wee1 and at Thr14 by Myt1 (Liu
et al., 1997
, and references therein). Phosphorylations of
Cdks are facilitated by Cyclin binding and stabilize the Cyclin:Cdk complex (Jackman and Pines, 1997
). Human Myt1 phosphorylates and inactivates Cdk1 associated with cyclins A and B but does not phosphorylate Cdk2 or Cdk4 complexes (Booher et al., 1997
)
(unlike wee1, which can phosphorylate both Cdk1 and Cdk2). Myt1 is
membrane bound to endoplasmic reticulum and Golgi complex.
Phosphorylation of Thr14 and Tyr15 occurs when the cycB:Cdk1 complex
assembles in the cytoplasm (Pines and Hunter, 1994
). Wee1, however, is
in the nucleus; hence another kinase may be operating in the cytoplasm (Matsuura and Wang, 1996
). Thr14 phosphorylation precedes Tyr15 phosphorylation (Liu et al., 1997
).
C17.
Phosphorylation of Thr14 or Tyr15 in Cdk1 reduces kinase activity
10-fold; phosphorylation of both sites reduces activity 100-fold (Liu
et al., 1997
).
C18.
Dephosphorylation of Cdk1 Thr14 and Tyr15 sites is carried out by
Cdc25C, which must itself be activated by phosphorylation in its
N-terminal domain (Jackman and Pines, 1997
). Cdc25C activity is high in
mitosis during mitosis and low during interphase.
C19.
Cdk2 is regulated by phosphorylation at Tyr15, but there is much
less phosphorylation at Thr14 (Booher et al., 1997
, and
references therein).
C20.
Cdk4 may be inhibited by tyrosine phosphorylation (Terada et
al., 1995
) but cannot be phosphorylated at the position
corresponding to Thr14, because there is an Ala here rather than Thr.
C21.
p27Kip1 can be phosphorylated by Cyclin E- or Cyclin A-dependent
kinases and thereby may be targeted for degradation (Sheaff et
al., 1997
).
C22.
p21Cip1-induced inhibition of Cyclin:Cdk complexes requires the
binding of more than one p21Cip1 molecule (Zhang et al.,
1994
).
C23.
Cyclin A:Cdk2, in normal human fibroblasts, exists in complex with
p21Cip1 bound to PCNA (Zhang et al., 1993
).
C24.
Raf1 can bind and activate Cdc25A (Galaktionov et al.,
1995
; Weinberg, 1995
), perhaps by phosphorylation.
C24a.
Raf1 is activated by Ras in a complex manner involving
phosphorylations, as well as positive and negative effects of 14-3-3 interactions (Roy et al., 1998
; Thorson et al.,
1998
; Tzivion et al., 1998
).
C25.
Cyclin A:Cdk2 is bound to p45Skp2 in complex with p19Skp1 in many
transformed cells (Zhang et al., 1995
; Bai et
al., 1996
).
C26.
p19Skp1 binds to an F-box motif in p45Skp2 (Bai et al.,
1996
).
C27.
p45Skp2 inhibits Cdk2 kinase activity and blocks phosphorylation
of Cdk2 by Wee1 or CAK (Yam et al., 1999
). Similarly to
p21Cip1, two molecules of p45Skp2 seem to be required to inhibit Cdk2. Binding to p45Skp2 is mutually exclusive with p21cip1.
C28.
p45Skp2 (Ser76) can be phosphorylated by Cyclin A:Cdk2 (Yam
et al., 1999
).
C29.
p19Skp1, through its F-box motif, may link Cyclin A to the
ubiquitin-proteasome protein degradation device (Bai et
al., 1996
).
C30.
Gadd45 binds to Cdk1 and inhibits Cdk1 activity, probably by
displacing Cyclin B1 (Zhan et al., 1999
). Gadd45 may in this way contribute to the G2 delay response to some types of stress.
C31. Cyclin D:Cdk4 phosphorylates pRb at a subset of sites, P(D), but this does not suffice to abrogate the inhibition of E2F.
C32.
Cyclin E:Cdk2 phosphorylates pRb at additional sites,
P(E), after the Cyclin D:Cdk4-specific sites have been
phosphorylated. (Cyclin E, in addition to acting on pRb, has actions
that can induce S phase independent of pRb [Lukas et al.,
1997
; Lundberg and Weinberg, 1998
].)
C33.
Hyperphosphorylated pRb, resulting from combined phosphorylation
by Cyclin D:Cdk4 and Cyclin E:Cdk2, abrogates the binding of pRb to E2F
heterodimers (Lundberg and Weinberg, 1998
).
C34.
p21Cip1 binds Gadd45 (Kearsey et al., 1995
).
C35.
Cdc25A may be transcriptionally activated by c-Myc; the Myc:Max
heterodimer binds to elements in the Cdc25A gene and activates its
transcription (Galaktionov et al., 1996
).
C36.
Cdc25C is activated by hyperphosphorylation of the N-terminal
domain (Gabrielli et al., 1997
), which can be phosphorylated by Cyclin B:Cdk1 (Hoffman et al., 1993
).
C37.
The Cdc25C N-terminal domain can also be phosphorylated by Plk1
(Hamanaka et al., 1994
). During mitosis, Plk1, by way of its polo boxes, localizes progressively to the centromeres, spindle poles,
centrosomes, and spindle midzone or midbody (Glover et al.,
1998
; Lee et al., 1998c
).
C38.
Chk1 binds and phosphorylates Cdc25A-C in vitro (Sanchez et
al., 1997
). Cdc25C becomes phosphorylated at Ser216.
C39.
Cdc25C Ser216 also binds and is phosphorylated by C-TAK1
(Peng et al., 1998
). Cdc25C Ser216 is phosphorylated
throughout interphase but not in mitosis (Peng et al.,
1997
).
C40.
Ser216-phosphorylated Cdc25C is recognized and bound by 14-3-3 protein family members (Peng et al., 1997
, 1998
). Ser216
phosphorylation and 14-3-3 binding probably sequester Cdc25C and thus
prevent it from interacting with Cdk1 in vivo (Peng et al.,
1997
). 14-3-3
is localized to the cytoplasm and may be the means by
which Cdc25C is sequestered outside of the nucleus (Hermeking et
al., 1997
).
C41.
Cyclin B is degraded late in mitosis through the
ubiquitin-protein ligase (E3) activity of the anaphase-promoting
complex, which is probably activated by phosphorylation by Plk1 (Glover et al., 1998
).
C42.
CycB:Cdk1 phosphorylates and inactivates the promoter selectivity
factor SL1 (Heix et al., 1998
). This explains in part the silencing of rRNA synthesis during mitosis.
C43.
p16 associates with TFIIH and RNA polymerase II (RPase II) CTD and
inhibits the phosphorylation of the CTD by TFIIH (Serizawa, 1998
).
E2F-pRb Box
E1.
DP1 or 2 forms stable heterodimers with E2F1-6. (Interactions of
E2F complexes have recently been reviewed [Dyson, 1998
; Helin, 1998
].)
G14. Unphosphorylated (or hypophosphorylated) pRb can bind to DP complexes of E2F1-3 and to a lesser degree E2F4. Binding of pRb family members is mediated by a short, highly conserved domain in the C-terminal region of E2F proteins. The E2F-binding site is in the C-terminal region of pRb.
E3. E2F4:DP can bind to p107 or p130 or, to a lesser extent, pRb.
E4. E2F5:DP binds only to p130.
E5. E2F1-5 complexes with DP1 or 2 can bind to E2 promoter elements, although there may be differences in preferences for variations of the E2 consensus sequence.
E6. DP complexes of E2F1-5 can stimulate promoters containing E2 elements via a potent transactivation domain in the C-terminal region of the E2F component.
E7.
E2F-DP dimers, complexed with pRb, p107, or p130, can bind and
inhibit E2 promoter elements (Dyson, 1998
; Mayol and Grana, 1998
). In
quiescent cells, the predominant complexes contain E2F4 and p130.
E8.
E2F6:DP complexes bind to a variation of the E2 consensus
sequence, possibly competing with other E2F complexes (Cartwright et al., 1998
).
E9.
In contrast to other E2F species, E2F6:DP directly represses
transcription. E2F6 lacks a transactivation domain; it has instead a
repression domain in its C-terminal region (Gaubatz et al., 1998
). E2F6:DP represses a subset of E2F-responsive genes (Cartwright et al., 1998
). E2F6:DP does not bind to pRb, p107, or p130
(Trimarchi et al., 1998
).
E10.
E2F4 is protected against proteasomal degradation when associated
with p130 (Hateboer et al., 1996
).
E11.
Upon stimulation of quiescent cells by growth factors, p130
becomes hyperphosphorylated and incapable of binding E2F (the responsible kinase is unidentified) (Mayol and Grana, 1998
).
Hyperphosphorylated p130 is unstable. Upon growth factor stimulation of
quiescent cells, p130 declines late in G1 and is replaced by p107,
which is absent in quiescent cells (Mayol and Grana, 1998
; Nevins,
1998
).
E12.
p130 may associate with HBP1, a transcription factor
involved in cell cycle exit during differentiation (Tevosian et
al., 1997
).
E13.
HDAC1 binds to pocket protein family members pRb, p107, and p130
and is thereby recruited to E2F complexes on promoters (Ferreira et al., 1998
). The binding is via an IXCXE motif in HDAC1,
which can bind to the C-terminal region of p130 (Stiegler et
al., 1998
) and presumably to the LXCXE site on the B-box of pRb
(Lee et al., 1998b
).
E14.
The interaction with HDAC1 enhances transcriptional repression by
pocket proteins (Brehm et al., 1998
; Ferreira et
al., 1998
; Luo et al., 1998
).
E15.
E2F-regulated genes include many that are involved in cell cycle
progression and control. Individual genes are differently regulated.
Dyhydrofolate reductase (DHFR) is activated via the E2F transactivation domain, whereas B-myb, Cyclin
E, E2F-1, E2F-2, and Cdc2 are
regulated via the repression domain of pRb family proteins (Dyson,
1998
).
E16.
pRb binds and activates C/EBP (Chen et al., 1996a
,b
).
The binding of C/EBP to its DNA recognition elements is enhanced.
E17.
Hypophosphorylated pRb binds c-Jun, JunD, and JunB (Nead et
al., 1998
). This enhances the binding of the Jun family members to
c-Fos and stimulates transcriptional activation by the Fos:Jun complexes. A region (amino acids 612-657) in the B-pocket of pRb and a
region in the C-pocket can independently bind c-Jun. The binding site
in c-Jun is in the leucine zipper region.
E18.
pRb binds c-Abl via the pRb C-pocket (residues 768-785 and
825-840). pRb can bind c-Abl and E2F simultaneously.(Welch and Wang,
1995
; Whitaker et al., 1998
). The c-Abl-binding C-pocket and
the E2F-binding CTD of pRb are distinct from each other.
E19.
pRb binds Mdm2 via the pRb C-terminal 44 residues (Xiao et
al., 1995
; Tan and Wang, 1998
). These C-terminal residues are not required for the growth-suppressive effect of pRb.
E20.
CycA:Cdk2 binds to E2F1-3 at a site near the N-terminal region,
as a consequence of which both the E2F and DP component are phosphorylated. Phosphorylation of either impairs the binding between
the E2F and DP monomers (Xu et al., 1994
; Krek et
al., 1995
; Dynlacht et al., 1997
).
E21.
The p107 promoter contains E2F recognition elements and can be
repressed by pRb or p107 (Zhu et al., 1995
).
E22.
Raf1 can bind pRb and p130, which are not thereby dissociated from
E2F complexes, although promoter inhibition is reversed (Wang et
al., 1998
). There was no detectable binding to p107. Binding to
pRb is mediated by the N-terminal 28 amino acids of Raf1. The kinase
activity of Raf1 was required to reverse the pRb-mediated promoter
repression (Wang et al., 1998
), but the phosphorylation
sites on pRb remain to be described and therefore are not indicated in
the diagram.
E23.
Sp1 cognate elements are found in the promoter regions of several
S-phase genes that also contain E2F elements, including DHFR, c-myc,
thymidine kinase, cyclin E, and E2F1 (Datta et al., 1995
).
Binding of Sp1 and E2F to the chromatin-organized thymidine kinase
promoter was cooperative (Karlseder et al., 1996
).
E24.
Sp1 and E2F1 bind to each other (Karlseder et
al., 1996
; Lin et al., 1996a
; Watanabe et
al., 1998
). Sp1 binds to the N-terminal region of E2F1; this
region is also present in E2F2 and E2F3 but not in E2F4 and E2F5;
accordingly, Sp1 can bind E2F1-3 but not E2F4 or 5 (Karlseder et
al., 1996
). The Sp1-binding region of E2F1 may overlap the cyclin
A-binding region. It is, however, separated from the transactivation
and pRb-binding regions, which are near the E2F1 C terminus. E2F1
binding requires the C-terminal region of Sp1 where Zn fingers are
located (Karlseder et al., 1996
; Lin et al.,
1996a
). (Sp1 may function as a higher-order complex; see
Karlseder et al., 1996
.) The E2F1-binding region of Sp1 is
phosphorylated by an Sp1-associated kinase when quiescent cells are
induced to proliferate (Black et al., 1999
).
E25.
Sp1 and E2F binding sites are both essential for activation
of the murine thymidine kinase promoter (Karlseder et al.,
1996
). The promoter was activated when the Sp1 and E2F sites were
separated by 6 or 10 bp but not when they were separated by 20 bp. The
DHFR promoter was strongly activated by Sp1 alone but hardly at all by
E2F1 alone (cotransfection in insect cells) (Lin et al.,
1996a
). E2F1, however, enhanced the activating ability of Sp1,
even in the absence of a functional E2F binding site on the promoter.
E26.
Sp1 binds to p107 (within the first 385 amino acids of p107),
which is separate from the p107 pocket region that binds E2F4 (Datta
et al., 1995
).
E27.
p107 inhibits Sp1-dependent transcription. Binding of p107 to Sp1
seems to inhibit the binding of Sp1 to DNA (Datta et al., 1995
).
Chromatin and Acetylase Box
H1. HDAC1 removes acetyl groups from histones, thereby making nucleosomes compact and inhibitory to transcription (i.e., HDAC1 removes acetyl groups that inhibit the inhibitory effect of compact nucleosomes on transcription; thus there is an odd number [3] of negative effects, which resolves to a net negative effect).
H2.
Gadd45 binds to core histones in chormatin or nucleosomes whose
structure has been loosened by acetylation or UV light (UV) radiation
(Carrier et al., 1999
).
H3.
p300 and CBP have intrinsic histone acetyl transferase
activity (Ogryzko et al., 1996
).
H4.
p300 binds PCAF (Grossman et al., 1998
, and references therein).
H5.
p300 binds to the transactivation domain of E2F1 (Lee et
al., 1998a
). E2F1 and p53 may be reciprocally regulated by
their mutual dependence on coactivation by limiting amounts of p300 (Lee et al., 1998a
).
H6.
The p300 C-terminal region can bind Cyclin E:Cdk2 (Perkins
et al., 1997
).
H7.
p300, via its Cys/His-rich region C/H3, associates with RPase II
via the intermediacy of RNA helicase A, which can bind both RPase II
and the C/H3 domain (Nakajima et al., 1997
).
Myc Box
M1.
c-Myc and pRb compete for binding to AP2 (Batsche et
al., 1998
).
M2.
AP2 and Max compete for binding to c-Myc (Batsche et
al., 1998
). AP2 and Myc associate in vivo via their CTDs (Gaubatz
et al., 1995
).
M3.
The E-cadherin promoter is regulated via AP2 recognition elements
(Hennig et al., 1995
, 1996
; Batsche et al.,
1998
).
M4.
c-Myc and pRb enhance transcription from the E-cadherin promoter
in an AP2-dependent manner in epithelial cells (mechanism unknown)
(Batsche et al., 1998
). Activation by pRb and c-Myc is not
additive, suggesting that they act on the same site, thereby perhaps
blocking the binding of an unidentified inhibitor. No c-Myc recognition
element is required for activation of the E-cadherin promoter by c-Myc.
Max blocks transcriptional activation from the E-cadherin promoter by
c-Myc, presumably because it blocks the binding between c-Myc and AP2.
DNA Repair
N1.
XPC forms a tight complex with HR23B, a homologue of yeast Rad23.
HR23B is present in large excess over XPC (Sugasawa et al., 1997
).
N2.
The XPC:HR23B complex may be the primary recognizer of a variety
of DNA lesions and the initiator of the NER of nontranscribed DNA
regions (Sugasawa et al., 1998
). XPC is required to open the DNA to allow access of other repair factors, such as XPA and RPA, to
the vicinity of the lesion (Evans et al., 1997a
,b
).
XPC is not required for transcription-coupled repair, perhaps because the lesion-containing DNA region is opened by the encounter with the
transcription machinery (Mu and Sancar, 1997
; Sugasawa et al., 1998
).
N3.
XPC is necessary to promote the stable binding of XPA to
UV-damaged DNA (Li et al., 1998
). XPA binds to DNA and
preferentially at sites of bulky damage produced, for example, by UV,
cisplatin, or N-AAF. However, the association constant of XPA
for UV-irradiated DNA is only severalfold above that for unirradiated
DNA, suggesting that other factors (such as XPC) may be required for
effective lesion recognition (Jones and Wood, 1993
; Sugasawa et
al., 1998
).
N4.
RPA binds directly to XPA via the C-terminal region of RPA2 (He
et al., 1995
; Stigger et al., 1998
).
N5.
XPA binds ERCC1 (residues 93-120) (Li et al., 1994
)
(Kd = 2.5 × 10
7 M;
Saijo et al., 1996
).
N6.
The XPF C-terminal region (residues 814-905) binds to the
C-terminal region of ERCC1 (residues 224-297) (de Laat et
al., 1998b
).
N7.
RPA binds XPG and ERCC1:XPF, the NER endonucleases (He et
al., 1995
; Matsunaga et al., 1996
; for review, see
Wold, 1997
).
N8,9.
The ERCC1:XPF heterodimer incises the damaged DNA strand
15-24 nucleotides to the 5' side of the lesion (Mu et al.,
1995
). XPG and ERCC1:XPF cut on the 3' and 5' sides of the lesion,
respectively (de Laat et al., 1998a
, and references
therein). RPA binding enhances the activity of XPG and ERCC1:XPF
(Matsunaga et al., 1996
). RPA is required for the nucleotide
excision process (Moggs et al., 1996
; Mu et al.,
1996
).
N10.
RPA binds single-strand regions at locally unwound intermediates
in NER (Evans et al., 1997
). The RPA:XPA complex
binds cooperatively to DNA damage sites (He et al., 1995
).
N11. Rad51-coated ssDNA, together with Rad52 and