Significance
Proteins conduct numerous complex biological functions by use of tailored structural dynamics. The molecular details of how these emerged from ancestral peptides remains mysterious. How does nature utilize the same repertoire of folds to diversify function? To shed light on this, we analyzed bilobed proteins with a common structural core, which is spread throughout the tree of life and is involved in diverse biological functions such as transcription, enzymatic catalysis, membrane transport, and signaling. We show here that the structural dynamics of the structural core differentiate predominantly via terminal additions during a long-period evolution. This diversifies substrate specificity and, ultimately, biological function.
Novel biophysical tools allow the structural dynamics of proteins and the regulation of such dynamics by binding partners to be explored in unprecedented detail. Although this has provided critical insights into protein function, the means by which structural dynamics direct protein evolution remain poorly understood. Here, we investigated how proteins with a bilobed structure, composed of two related domains from the periplasmic-binding proteinlike II domain family, have undergone divergent evolution, leading to adaptation of their structural dynamics. We performed a structural analysis on 600 bilobed proteins with a common primordial structural core, which we complemented with biophysical studies to explore the structural dynamics of selected examples by single-molecule Frster resonance energy transfer and HydrogenDeuterium exchange mass spectrometry. We show that evolutionary modifications of the structural core, largely at its termini, enable distinct structural dynamics, allowing the diversification of these proteins into transcription factors, enzymes, and extracytoplasmic transport-related proteins. Structural embellishments of the core created interdomain interactions that stabilized structural states, reshaping the active site geometry, and ultimately altered substrate specificity. Our findings reveal an as-yet-unrecognized mechanism for the emergence of functional promiscuity during long periods of evolution and are applicable to a large number of domain architectures.
Proteins drive and maintain all fundamental cellular processes (1) by interactions with small molecules and/or other biopolymers. Important mechanistic information on proteins are accessible via structural analysis of their functional cycle (2). While classical approaches rely on the interpretation of static structure snapshots, the visualization of structural dynamics (i.e., to follow the interconversion of distinct structural states at high spatial and temporal resolution) (37) has been recognized as an essential complement. The folding funnel model (8), rooted in the free-energy landscape theory (911), has by now become a widely accepted way to describe the ensemble of such states (1214).
Distinct structural states can originate from local flexibility (i.e., bond vibrations, and side-chain rotations [Fig. 1A, Tier-2 dynamics]), changes in secondary structure (Fig. 1A, Tier-1 dynamics) or large-scale domain motions (Fig. 1A, Tier-0 dynamics). The free-energy landscape of a protein defines the lifetime of its structural states, ranging from nanoseconds (local flexibility) to seconds (large-scale motions). Transitions between the states are referred to as structural changes and are induced by interactions with ligands, posttranslational modifications (e.g., phosphorylation), or chemical events such as nucleotide hydrolysis. The coupling of the latter to structural changes enables proteins to perform a diverse range of functions.
Energetic funnel, structure, and evolution of the cherry-core. (A) One-dimensional cross-section of a hypothetical protein energy landscape adapted from Kern and coworkers (12) according to the Tier description and definitions introduced by Ansari and coworkers (15). A structural state is defined as the lowest point of a well on the energy surface. The populations of the Tier-0 states, closed and open, are defined as Boltzmann distributions, and their relative probabilities (pC, pO) are determined in this paper by smFRET, which follows large-scale domain motions. Tier-1 states describe local and fast structural fluctuations (e.g., changes in secondary structure elements like loop motions or loss of secondary structure). Tier-1 dynamics were probed in this study by HDX-MS. Rapid and localized Tier-2 dynamics (e.g., side-chain rotations) were not considered here but can be monitored via molecular dynamics (MD) simulations. Changes in the chemical environment (i.e., absence or presence of a ligand) modify the energy landscape via a bias for one of the two states. Typically, thermodynamic parameters such as the free energy difference of Tier-0 wells (GOC) can be determined by ITC. (B) Structure of a representative bilobed protein, the substrate binding domain 2 (SBD2) of the ATP-binding cassette (ABC) amino acid transporter GlnPQ from Lactococcus lactis (PDB: 4KR5). (C) Summary of the structure-based phylogenetic tree with schematic representations of the different structural classes (class A through G) highlighting their termini. Complete sequence and structure-based phylogenetic trees are provided in SI Appendix, Fig. S1 and Datasets S1 and S2. We used the following notation: Secondary structure elements that are common between the different classes were assigned a number identifier (e.g., helix H4 in classes F and G), whereas the unique elements have a letter followed by a number identifier (e.g., HD1, first unique helix of class D proteins). An asterisk (*) marks a structural subclass (shown in detail in Dataset S3). Schematics are based on the crystal structures of the selected proteins. (D) Topology of bilobed proteins depicting the consensus of secondary structure elements: strands (s) or helices (H) belonging to the Domains (D1, D2), hinge-forming -strands (H1, H2), and C-tails of all seven classes (A through G) are shown. Revised alignments of bilobed proteins are shown in Dataset S3. The secondary structure elements forming the consensus cherry-core structure are depicted on the Top row.
Tier-0 dynamics were observed and characterized in various settings (e.g., in motor proteins, in which they are used in propelling movement along filaments) (16), in the transport of molecules or biopolymers across biological membranes (1721), or in the activity of proteins that perform mechanical work (22). Tier-1 dynamics drive the actions of various signaling proteins for transmission of signals (2325). Structural and biochemical data indicate that enzymes also show varying degrees of structural dynamics (26), although it is not well understood what precise role this plays for catalytic activity. The current belief is that extensive structural dynamics in enzymes are not necessarily required for catalysis (27) but rather enact in regulation. For instance, many protein kinases exploit Tier-0 dynamics to generate active or inactive structural states (28). Tier-2 dynamics have been shown to be important for the evolution of enzymatic function (29). In addition to domain motions occurring within a structure, protein oligomerization and the possible quaternary dynamics can also be relevant for function (30). A well-characterized example is the allostery of hemoglobin that occurs on the transition between two distinct quaternary states (relaxed and tense) (31, 32). All this highlights that multi-Tier dynamics of proteins occur on various time- and length-scales (Fig. 1A) and are often the basis for function. It is, however, not yet well understood how structural dynamics are optimized during evolution to tailor protein function.
Analysis of protein sequences and structures has provided important insights into the evolution of protein function (33). A powerful approach is to assign the domain components of proteins to families and superfamilies on the basis of sequence alone (Pfam database for the protein family) (34) or in combination with structural information (class architecture topology homologous superfamily, CATH and structural classification of proteins, SCOP databases) (35, 36). The CATH and SCOP databases combined have identified 3,000 domain superfamilies comprising >50 million domains that account for 70% of the domains in completed genomes (37). More recently, the database ECOD (evolutionary classification of protein domains), groups domains by considering their evolutionary relationships (38). In ECOD, 760,000 domains have been assigned to 3,700 homologous groups (39). The most highly populated domain superfamilies/homologous groups are universal to all kingdoms of life (40, 41). The prevalence of proteins with multidomain architectures and the recurrent appearance of the same domain in nonhomologous proteins suggests that functional domains are reused when creating new proteins and functions (42, 43). Analyses of selected domain groups (44, 45), and more-recent large-scale investigations (4648) have shown that domains in such groups generally share a common structural core (40 to 50% of the domain) that is highly conserved, even for relatives separated by billions of years. To achieve functional promiscuity of such a structural core, it has been proposed that larger structural embellishments (secondary structure elements or even entire domains) need to be added to the core for altered biochemical function (49). Such fusions occur frequently at the N or C termini (40), which has led to the belief that structural elements act as Lego bricks (47). Those are recombined in various ways for new functions to emerge during evolution (42, 49).
Much less is known on the role played by structural dynamics in the evolution of protein function (5053). A recent model suggests that the native state of an evolved protein is the most abundant state of all possible structural states, which was selected for a specific function (54, 55). The avant-garde evolvability theory proposes the existence of a highly promiscuous primordial protein structure. It is assumed that for the emergence of the native state, the ability to evolve (evolvability) was traded for ligand-functional specificity (56, 57). These observations were experimentally verified (58) and agree with the proposal that changes in structural dynamics serve as a mechanism for the evolution of specialist enzymes from promiscuous generalists (59), though discordant examples are also observed in evolution (60). Furthermore, recent studies indicate that this evolvability theory can explain the short-period evolution (e.g., variation of enzyme local flexibility [variation of Tier-1/2 dynamics] to acquire new functions) particularly well (5456, 61). Remarkably, this short-period evolution can be faithfully reproduced invitro using directed evolutionary approaches based on consecutive rounds of single point mutations (54, 61). Ancestral protein reconstruction has also been useful in elucidating the role of structural dynamics in the emergence of specialized amino acid binding proteins from a promiscuous ancestor (62). However, it still remains unclear how, during longer periods of evolution, a primordial core structure evolves to modulate or diverge Tier-0 and/or quaternary dynamics, generating completely new functionalities.
Here, we test this aspect of the evolvability theory for proteins separated by long evolutionary periods during which Tier-0 or quaternary dynamics were introduced to an existing protein core structure. For this, we identified proteins with a bilobed domain structure with a high degree of plasticity and neutrality (29, 55), having 1) a conserved core structure, 2) large sequence diversity and related functional divergence, and 3) occurrence in all kingdoms of life. The selected structural core is composed of two Rossmann-like domains, which are believed to be among the most ancient architectures (63) and that are connected by a single -sheet (Fig. 1B). We analyzed 600 proteins with this structural core that contain two related domains from the periplasmic-binding protein (PBP)like II domain homologs (ECOD: X-, H- and T-groups PBP-like II). We show here that different members of these homologous proteins diverged with respect to domains or secondary structure elements, predominantly at their termini (Fig. 1 C and D). Using a combination of structural analysis and biophysical investigations, we demonstrate that such structural embellishments confer multi-Tier structural dynamics that diversify the function of the core structure to yield transcription factors, enzymes, or extracytoplasmic transport-related proteins. To understand both the mechanistic distinction of these proteins and the evolutionary trajectories, we used single-molecule Frster resonance energy transfer (smFRET) (64) and HydrogenDeuterium Exchange Mass Spectrometry (HDX-MS) (65). smFRET allows to monitor Tier-0 dynamics with a temporal resolution down to microseconds and subnanometer spatial resolution at the single-molecule level, even for highly heterogeneous structural ensembles (12, 6668). HDX-MS complements smFRET, as it can probe Tier-1 dynamics throughout the structure that occur within the structural states identified by smFRET (Fig. 1A) (69). Using this combination of techniques, we show how the C-terminal extensions modulate Tier-0 dynamics in the structural core in a manner specific to their three-dimensional orientation. This arrangement also dictates specific geometrical criteria, crucial for establishing specific ligand interactions, all of which we detail in this work. On the other hand, we reveal the means by which N-terminal domain additions enable oligomerization to provide distinct quaternary dynamics in LysR-type transcriptional regulators (LTTRs). The remarkable modularity of these proteins permits us to confirm and expand the evolvability theory for the primordial core structure during a long-period evolution, which is largely facilitated, seemingly, by genetic recombination events.
In this study, our focus was on proteins composed of two globular lobes (bilobed), each of three layers (//), articulated around a central -sheet hinge (Fig. 1B). This focus meant that not all multidomain architectures harboring the PBP-like II domains were included. To investigate the structural dynamics of the selected proteins, we constructed phylogenetic trees based on both sequence and structural information (Fig. 1C). This analysis indicated that the proteins evolved from a common ancestor that diversified into seven distinct structural classes A to G (Fig. 1 C and D), members of which are found throughout all kingdoms of life, in which some are even present in viruses (SI Appendix, Fig. S1A and Datasets S1 and S2). They have a consensus structure (Fig. 1B, Fig. 1 D, Top, and Dataset S3), which we have dubbed the cherry-core (hereafter CC, and proteins harboring this core, cherry-core proteins, CCPs) because of the bilobed structures resemblance to a cherry (SI Appendix, Fig. S2A), that contains two PBP-like II domains (70). According to ECOD database, the two domains of CCPs belong to the same X-, H-, and T-groups (PBP-like II; Dataset S3), supporting their common ancestry (38). In the consensus structure (Fig. 1 C and D), domains D1 and D2 adopt a face-to-face mirror-fashioned geometry with the active site, which is typically a ligand-binding site, located at their interface (SI Appendix, Fig. S2B).
Most of the selected proteins have distinct segments linked N-terminally to the CC (Fig. 1 C and D). Class A proteins have the addition of a LysR winged helix-turn-helix (HTH)type DNA-binding domain (ECOD: X-, H-group: HTH, T-group: Winged; Fig. 1C). This element has been shown to be responsible for oligomerization and binding to promoter DNA (7173). Most proteins of classes B through D, and F and G contain N-terminal localization signals for export via the general secretion (Sec) or the twin-arginine translocation (Tat) pathway (7477). The presence of these signal peptides was obtained from UniProtKB (78) and verified manually by inspecting all 600 protein sequences with PRED-TAT, prediction of twin-arginine and secretory signal peptides (79). Class E proteins are predominantly cytosolic and lack an N-terminal signal peptide.
In addition, classes B through G have distinct C-terminal structural embellishments, hereafter termed C-tails (Fig. 1D). For example, Helical-tail 1 (H1) is common to all classes except C, and importantly, it has a similar placement in the three-dimensional architecture of the proteins. Other C-tails are unique to a specific class, such as the Helical-tails G (HG1 and 2), present only in class G proteins.
The ligand specificity and function of the CCPs, as documented in UniProtKB (78), were found to correlate with the assigned structural class (SI Appendix, Fig. S1A). Class A proteins are bacterial transcription factors of the LTTR family. Class E proteins are predominantly eukaryotic single-turnover enzymes. The majority of the remainder (class B through D, F, and G) are found in prokaryotes and associate with the translocator domains of ABC transporters or with the membrane-embedded domains of chemoreceptors, in which they mediate unidirectional solute transport and signal transduction, respectively.
To investigate what role the distinct C-tails of these proteins might have played in structural dynamics and the evolution of new functions, we examined CCPs for which high-resolution structures of unliganded (apo) and liganded (holo) states were available (Fig. 2 and SI Appendix, Table S1). Interestingly, class A and E proteins, in most cases, exhibit nearly identical apo and holo structures. They also display the widest variety of substrates with little chemical structure similarity (SI Appendix, Fig. S3). For most members of the other classes, D1 and D2 of the CC undergo a rigid body rotation of varying degrees (Fig. 2 and SI Appendix, Table S1). For the solute-binding proteins, this mode of substrate binding has been termed the Venus-Fly Trap mechanism (80). These proteins recognize ligands with a specific pharmacophore (81): amino acids, ethanolamines, phosphonates, ironphosphate complexes, and carbohydrates are recognized by classes B, C, D, F, and G, respectively (SI Appendix, Fig. S3). Another striking difference is that proteins in classes B through G are monomeric, whereas those in class A are oligomeric (see Fig. 4C).
Structures of CCPs highlighting their C-tails. Structures (Top) and schematics (Bottom) of the identified structural classes (A through G). The apo and holo structures of the indicated CCPs are superimposed to highlight the role of the C-tail. The domain to which the two structures were superimposed is represented in gray (D1 in A through E and D2 in F and G), whereas the one that is displaced relative to the C-tail in the open state is represented in green. Arrows and schematics indicate domain displacement (SI Appendix, Table S1). Interaction contact maps are presented in SI Appendix, Table S2. The most prominent contacts between the indicated secondary structure elements of the C-tail and D1 (F, G) or D2 (B through E) that stabilize the open state are indicated by a yellow dashed line. The PDB codes and protein names are indicated.
To investigate structural dynamics of the CCPs, we first used smFRET (12, 6668) to probe Tier-0 dynamics at near-physiological conditions in aqueous buffer at room temperature. One representative protein from each class was investigated: the effector binding domain (EBD) of CynR, representing the CC of full-length CynR (72) (class A), SBD2 (82) (class B), OpuAC (83) (class C), PhnD (84) (class D), CmpA (85) (class E), FbpA (86) (class F), and MalE (87) (class G). For these experiments, D1 and D2 were stochastically labeled with donor and acceptor fluorophores via cysteines that were substituted for nonconserved and surface-exposed residues, one in each domain (Fig. 3A). Fluorophore labeling was performed by maleimidethiol conjugation (22, 88). Labeling positions were selected based on the crystal structures to show large changes in separation between the apo and holo states. smFRET was performed by confocal microscopy with alternating laser excitation (ALEX) (66).
Monitoring structural changes and ligand binding in CCPs using smFRET and ITC. (A) Schematic of the experimental strategy to monitor structural states by FRET efficiency via stochastic labeling of D1 and D2 with donor and acceptor fluorophores. (BH) Solution-based apparent FRET efficiency histograms in the absence (Top) or presence (Bottom) of saturating ligand concentrations for the indicated proteins. Gray bars are experimental data, and solid line is the fit. Centre position of Gaussian fits are given in SI Appendix, Table S5. (I and J) Binding isotherms of the calorimetric titration of azide (I) and calcium carbonate (J) to CynR (I) or CmpA (J), respectively, with the indicated thermodynamic parameters. For the apo condition of FbpA (E), the sample was treated extensively with citrate, as Iron (III) and the synergistic anion carbonate required for high-affinity binding to FbpA are removed efficiently by citrate treatment at low pH (92). According to high-resolution structural data (93), both Ca2+ and CO3 are present in the CmpA binding cleft. Indeed, we observed heat release upon titration of Ca2+ to a CO3 bound CmpA. Data points represent the heat of reaction per injection, and the line is the fit.
As predicted by our structural analysis (Fig. 2 and SI Appendix, Table S1) and in line with our previous observations (89), the FRET efficiency histograms and fitted distributions (Fig. 3 BF) shifted toward higher FRET efficiency (E) values upon addition of saturating concentrations of ligand, for SBD2, OpuAC, PhnD, FbpA, and MalE. This indicates that in the apo state, the donor and acceptor dyes are further apart (Fig. 3 BF, Upper, low FRET) compared to the holo state (Fig. 3 BF, Lower, high FRET). Thus, our data suggest that ligand binding drives Tier-0 dynamics in these CCPs.
In contrast, the distributions of class A and E proteins [i.e., CynR (90) and CmpA (85), respectively] (Fig. 3 G and H), were virtually identical in the absence or presence of saturating ligand concentrations. Ligand binding was confirmed via isothermal titration calorimetry (ITC) showing binding affinities of both proteins in the micromolar range (Fig. 3 I and J). Thus, we conclude that the CC of class A and E proteins lack Tier-0 dynamics on the probed reaction coordinates for the selected FRETdistance pairs in contrast to the other structural classes.
Interestingly, this observation is in line with the known biological function. Class E proteins are predominantly single-turnover enzymes (SI Appendix, Fig. S1A), for which the rigidity of their active site is a prerequisite for catalysis (SI Appendix, Fig. S4A) (91). As previously suggested by us and others, Tier-0 dynamics in periplasmic-binding proteins (classes B, C, D, F, and G) are utilized in the regulation of transport in ABC importers (Discussion). A remaining question is, however, how ligand binding in class A proteins triggers transcriptional processes without major structural changes related to ligand binding.
The DNA-binding domain of class A proteins typically comprises a 58-aa HTH motif. This is followed by a 20-aa-long helix that provides a dimerization interface and a connecting loop that links the DNA-binding domain to the CC (Fig. 4A). The CC acts as the tetramerization interface within the full-length CCP or the dimerization interface within the EBD (Fig. 4B and SI Appendix, Fig. S4 BE). Indeed, by using size exclusion chromatography with multi angle light scattering (SEC-MALS) we observed that (full-length) CynR is tetrameric, whereas its CC is dimeric (Fig. 4C).
CynR tertiary and quaternary assemblies probed by MALS and HDX-MS. (A) Crystal structure of the CynR CC (PDB: 2HXR) with colored secondary structure elements discussed in the text that are critical for its quaternary dynamics (Top). A homology model of full-length CynR obtained from the SWISS-MODEL server with CbnR (PDB: 1IZ1) as a template (Bottom). The HTH domain is colored green, and the loop connecting it to the CC splitpea. W64 at the tip of the dimerization helix is indicated with an arrow. (B, Top) Schematic representation of the CC of CynR using the same color-coding as in A. (Middle) Schematic representation of the CynR homology model, with one of the protomers in the compact and the other in the extended configuration. The CC of the CynR protomers self-associate to form dimers or, for the full-length protein, tetramers. For clarity, two of the protomers have been omitted. (Bottom) The tetrameric assembly formed by the self-association of the dimerization helices and the CC. Of the two interacting protomers, one is present in the compact configuration, whereas the other is extended. The two protomers with the compact configuration are shown at the top of the plane, whereas those that are extended are at the bottom. (C) SEC-MALS analysis of full-length and CC of CynR (3 M). Ultraviolet (UV) traces of the chromatograms were superimposed on the measured mass (black cycles). (D) Structural dynamics of full-length CynR in the absence of ligand and DNA by HDX-MS. Deuterium uptake values are reported for the incubation times in deuterated buffer and expressed relative to the fully deuterated control. These values are mapped onto the CynR homology model (as in A), using the indicated color gradient. Proline residues as well as the first residue of each peptide were excluded from mapping, as they do not contribute to the observed D-uptake. (E) Scatter plot visualization of the statistical analysis of D-uptake differences between the apo and holo states of full-length CynR (Upper) and the CC of CynR (Bottom). Three statistical criteria were used (SI Appendix, Material and Method), as described previously (98). Statistically significant differences would appear as black spheres (indicating that D-uptake > 2SD for a specific peptide), lying outside the 99% confidence threshold (1-P 0.99; indicated on y-axis) and outside the 4 pooled average SD cut-off (indicated on x-axis; value given on the right).
To determine whether ligand binding to CynR resulted in structural changes that were not detectable along our selected reaction coordinate or that were too small or fast for smFRET (Fig. 3G), we performed HDX-MS. In contrast to smFRET, which reports on a single distance along a single reaction coordinate, HDX-MS probes structural dynamics at nearresidue level resolution, providing global insights into Tier-1 dynamics. HDX-MS detects the exchange of hydrogens with deuterium at solvent-accessible and nonhydrogen-bonded backbone amides (94, 95). Hydrogens involved in stabilizing the secondary, tertiary, or quaternary structure of a protein via hydrogen bonds are exchanged more slowly through structural transitions that disrupt these bonds. Deuterium incorporation into the protein can then be determined, following proteolysis, by MS. The mass difference between hydrogen (1H) and deuterium (2H) results in a mass shift between nondeuterated and deuterated peptides that is a measure of the number of exchanged hydrogens (65, 96, 97).
For such investigations, the CC of CynR or full-length CynR were isotopically labeled (pD, 7.4; 25C) for different time periods (10 to 105 sec) either in free or DNA-bound states, and the peptic fragments were identified by MS (Fig. 4D and Dataset S4). For each peptide, the fraction of deuterium uptake relative to the maximum determined deuterium incorporation was calculated (Dataset S4). The data reveal a rigid character of the CC of CynR, whereas the dimerization helix and the rest of the DNA-binding domain turned out to be more flexible (Fig. 4D). To identify the regions in which pronounced structural changes were induced by azide binding, we performed comparative HDX-MS. For this, we determined the difference in deuterium uptake (D) for each peptide between different conditions (e.g., CynR apo versus holo states). Observed differences would indicate a decrease (protection) or an increase (deprotection) of deuterium uptake upon azide binding (Dataset S4). No statistically significant change in the deuterium uptake was observed (Fig. 4E). From this, we can conclude that no detectable change in structural dynamics is induced by the ligand in the CC of CynR in its free or DNA-bound form (Fig. 4E and Dataset S4). This includes Tier-0 dynamics (detected by smFRET and HDX-MS) and Tier-1 or quaternary dynamics (both detected by HDX-MS).
From our results on the selected CCPs, only those with an asymmetric C-tail display Tier-0 dynamics (Figs. 2 and 3). To investigate how the C-tails introduced Tier-0 dynamics to the CC, we compared crystal structures of the apo and holo states of the CCPs to identify interactions between their C-tails and the CC. Fig. 2 summarizes the results of these comparisons. Interestingly, we found that the holo structures are similar for all CCPs, but the apo states are class specific. Contact mapping of the interactions between the CC and C-tail using the protein interaction calculator web-server (99) showed that the number and characteristics of the interactions depends on the structural class (Fig. 2 and SI Appendix, Table S2). Strikingly, these interactions stabilize predominantly the open state of the CC, the only exception being for classes A (C-tail-less) and E. In the former cases, stabilization is associated with an asymmetrical placement of the C-tail with respect to D1 and D2. In contrast, the C-tail of class E (predominantly H3 and HE1; Fig. 2) is placed symmetrically around D1 and D2 and thus cannot provide the required structural asymmetry needed to create a stable open state. The interactions of the C-tail to stabilize the open state involve the consensus CC-helices of D1 and D2 (Fig. 2 and SI Appendix, Table S2). In classes B, C, and D, such interactions involve D2, whereas D1 is contacted in classes F and G. These asymmetrical interactions create active sites with distinct geometries (Fig. 2 and SI Appendix, Fig. S2).
In alternative phylogenetic trees, based on the protein sequence using either D1/D2-domains or the C-tail, the clustering remains similar (SI Appendix, Fig. S1 BD and Datasets S1 and S2), indicating that D1/D2 (CC domains) and the C-tail of a specific class coevolved to be part of the same polypeptide. This is in line with the role of the C-tail to interact with one specific domain, D1 or D2, to stabilize the open structural state.
To confirm the role of the C-tail interactions with D1 or D2 for stabilization of the open state, we manipulated the relevant ones (Fig. 2 and SI Appendix, Table S2) in SBD2, SBD1, and MalE (Fig. 5 and SI Appendix, Figs. S5 and S6). The impact was tested via assessing structural states and ligand-binding affinities and monitored in smFRET experiments. Test cases were selected from class B and G, as these CCP classes are only remotely related by the first major clade in the evolutionary trees (Fig. 1C), and their open state is stabilized by distinct helices within the C-tails (Fig. 2), contacting either D2 (class B) or D1 (class G).
Experimental verification of the role of C-tail interactions in stabilizing the open conformation of SBD2 and MalE by smFRET and HDX-MS. (A) Dotted rectangles (Left) on the SBD2 (in A, PDB: 4KR5) structure highlights the critical contact region between the C-tail and the CC that stabilize the open state. Zoom in of rectangle regions (Middle and Right) depicting interactions between the C-tail helix H1 and D2 in the indicated apo (open) or holo (closed) states. Distances () between L480 and P419 are shown as black dotted lines. (B and C) Solution-based apparent FRET efficiency histograms of SBD2 (B) and SBD2 (L480A) (C) at different conditions as indicated. (D) Fraction of the closed state (high-FRET state) of SBD2 and the indicated derivative as a function of glutamine concentration. (E) As in panel (A) for the MalE (PDB: 1OMP) structure with the indicated secondary structure elements and critical contacts. (F and G) Solution-based apparent FRET efficiency histograms of MalE (F) and MalE (M321K) (G) at different conditions as indicated. (H) Fraction of the closed state (high-FRET state) of MalE and its derivatives as a function of maltose concentration. (I) Maltose release from MalE (M321K) over time determined by solution-based smFRET (reference detailed values in SI Appendix, Fig. S6). Data points (panels D and H) and gray bars (panels B, C, F, G, and I) are the experimental data, and the solid line is the fit. (J) Map of regions in MalE structure that show statistically significant increase in deuterium uptake caused by M321K (numerical values and complete statistical analysis presented in Dataset S5). n = 3. (K and L) Deuterium uptake for the indicated MalE C-tail helices. SDs (SD) are shown in the deuterium uptake plots.
In SBD2, a hydrophobic interaction between L480 in the C-tail and P419 in D2 was weakened by substitution of L480 with alanine (L480A; Fig. 5A and SI Appendix, Table S2). The mutation resulted in the appearance of a subpopulation of molecules (20%) that were in the closed state in the absence of glutamine (Fig. 5 C versus B). To rule out the possibility that this might have been due to an artifact introduced by the choice of fluorophores, a second pair was tested, which showed a comparable result (SI Appendix, Fig. S5 A and B). We also examined whether residual endogenous ligand might account for this subpopulation by performing smFRET measurements on diluted samples, but these experiments displayed subpopulations of a similar size (SI Appendix, Fig. S5C). A closed-unliganded conformation was also observed for SBD2 previously but with a much lower abundance (1%). Detection of this small subpopulation required the use of confocal scanning microscopy (89), as populations <5% cannot be detected reliably with ALEX spectroscopy. We also observed small differences in the mean E values for apo and holo states of SBD2 (L480A) as compared to SBD2, suggesting that the structural landscape had been altered (SI Appendix, Fig. S5B). Destabilizing the open state is in line with the 10-fold increase in glutamine-binding affinity of SBD2 (L480A) as compared to SBD2 (Kd of 209 64 nM and 1,990 130 nM, respectively; Fig. 5D).
Determination of the dissociation constant (Kd) by smFRET measurements reports on the stability of the open state via Kd(1+exp()), where =1/kbT, kb is the Boltzmann constant, T is the absolute temperature, and =GC-GO is the conformational (free) energy difference, where GC and GO are the (free) energies of the closed and open structural states without ligand bound, respectively (100). Thus, destabilizing the open state will decrease Kd and vice versa. From the percentages of open and closed in the absence of ligand, we obtain =1.4 and =4.4kBT for SBD2 (89) and SBD2 (L480A) (Fig. 5 BD and SI Appendix, Fig. S5 AC), respectively. Based on these results, the Kd value would be expected to decrease by 16 folds due to the L480A mutation, in close agreement with the estimated 10-fold difference. Similar trials on SBD1 did not show this trend (SI Appendix, Fig. S5 DF), as the mutation had no impact on the structural dynamics of SBD1 (I249A). The absence of a subpopulation of molecules in the closed state at apoprotein conditions (alike SBD2) is based on the inability to evaluate all stabilizing open state interactions (SI Appendix, Table S2, compare SBD1 versus SBD2) so as to abolish the relevant ones. This is likely due to the fact that H1 residues of SBD1 (SI Appendix, Fig. S5G) participating in interprotomer contacts arose from crystallographic conditions (SI Appendix, Fig. S5H).
For MalE, we constructed the derivatives MalE (M321A) (101) and MalE (M321K), with weakened interactions between the C-tail HG1 and D1, by disrupting the hydrophobic interaction of M321 with Y90 and F92 and the aromatic sulfur interactions with Y90 (Fig. 5E and SI Appendix, Table S2). For MalE (M321A), the stabilizing interactions of M321 are partially abolished. This resulted in an eightfold increase of maltose affinity (Kd from 2,400 400 nM in MalE to 300 50 nM in MalE [M321A] [Fig. 5H and SI Appendix, Fig. S6]). An even-stronger effect was observed for MalE (M321K) with an affinity enhanced by 3,000-fold (Kd of 0.81 0.15 nM; Fig. 5 G versus F and Fig. 5H). Shilton and coworkers have also proposed the hydrophobic interactions of M321 as being important structural determinants of the open state (101), affecting the affinity of MalE for maltose, in agreement with our results.
We next compared the lifetime of the closed, maltose-bound conformation of MalE (M321K) with that of MalE. Addition of 10 nM maltose allowed MalE (M321K) to occupy the closed state exclusively (SI Appendix, Fig. S6H). A total 20 M unlabeled MalE (M321K) protein was subsequently added to scavenge maltose, which is stochastically released from the labeled protein. In a time-course experiment, the decrease in the population of closed, maltose-bound MalE (M321K) was then followed as a function of time (102) (Fig. 5I and SI Appendix, Fig. S6H). From these experiments, we established that the lifetime of the closed, maltose-bound conformation of MalE (M321K) was 122 12 s. This value was 2,500-fold higher than the value determined for MalE (0.048 0.010 s; SI Appendix, Fig. S6E). This result is consistent with the observed increase affinity of the derivatives for maltose.
In the above-described MalE derivatives, the structural basis for the destabilization of the open state could not be addressed mechanistically. Our smFRET experiments report on Tier-0 dynamics that affect the proteins tertiary structure (Fig. 1A). Considering that we observed differences in the Tier-0 dynamics, as a consequence of C-tail-D1/D2 destabilization, only for SBD2, we performed comparative HDX-MS (Dataset S5) to monitor changes in the secondary structure of MalE in comparison to MalE (M321K). For this, we determined the difference in deuterium uptake (D) for each peptide between MalE and MalE (M321K) in their apo states (Fig. 5J). Remarkably, comparative HDX-MS indicates that the differences between MalE and MalE (M321K) are localized almost exclusively at the C-tail and specifically in regions interacting with D1 (Dataset S5 and Fig. 5J). The D-uptake of the C-tail helices H2/HG1 that are critical for stabilizing the open state (Fig. 5E and SI Appendix, Table S2) denotes that their rigidity was significantly reduced in the MalE derivative (Fig. 5 K and L). As might be expected, reduced rigidity occurred also in a region within the CC containing Y90 and F92. Notably, the same regions become allosterically destabilized in MalE upon maltose binding (Dataset S5). These results support the idea that the mutation leads to a weaker interaction between the C-tail and the CC resulting in a destabilization of the open state, because these C-tail elements and the region containing contact residues were found to be more flexible and solvent exposed.
Taken together, we postulate that ligands attenuate the stability of the open state in the CCPs, adopting multiple allosteric models for signal propagation (103): either via Tier-0 (SBD2 [Fig. 5 C and D]) or Tier1/2 (MalE [Fig. 5 G, H, K, and L]) dynamics.
Common (structural) origin represents the hallmark of Darwinian evolution. Homology or descent from a common ancestor is often deduced from similarities in protein sequences or better from structures, as the latter are more conserved during evolution (104). However, similar structures can originate from divergent, convergent or parallel evolution (105). The most common tricks nature uses to vary a protein domain are the following: -strand invasion/withdrawal, insertions/deletions/substitutions of secondary structure elements, domain flip/swaps, and circular permutations (106, 107). The currently established evolvability theory relies on investigations involving a fixed-length polypeptide chain by observing the effects of sequence variations, accomplished primarily by directed evolutionary approaches or by investigating closely related functional homologs. The functional promiscuity originating from structural variability is altered by the few amino acid modifications that can yield alterations of local structural fluctuations. For this reason, this theory can well explain protein evolution during short time periods.
In this study, we focused on the analysis of structures that have diverged over longer evolutionary periods. We analyzed a group of 600 proteins that share a core structure with the same topology of secondary structure elements giving rise to identical three-dimensional structures. The structure was predominantly varied by terminal embellishments and exhibits detectable sequence identity, used for constructing the sequence-based phylogenetic trees (SI Appendix, Fig. S1). The identified proteins likely emerged from divergent long-term evolution from a common ancestor, which spreads throughout the tree of life. This common ancestor is seemingly represented by the consensus core structure (CC; SI Appendix, Fig. S2) and encountered within the type-II class of PBPs (70, 108). As proposed previously (70), the CC derived possibly from a gene duplication of a PBP-like II domain (ECOD; Dataset S3) connected by a -sheet (Fig. 6A).
Model for the evolution of the cherry-core and hypothetical energetic funnels. (A) A gene duplication of a PBP-like II domain gave rise to the CC, being composed of a unique closed structural state, represented by a single well in the energetic funnel. The function of the CC was uniquely binding. Extant proteins acquired different extensions that altered their localization and dynamics and by that their function and specificity. (B) An N-terminal signal peptide and an asymmetrical C-terminal tail generated extracytoplasmic proteins evolving an additional open state. The different flavors of open states conferred distinct substrate specificities. The two states of CCPs, predominantly, signal substrate transport by their association to the membrane-embedded translocator domains of (ABC) transporters. (C) A symmetrical C-tail rigidified the closed state and yielded primarily single-turnover enzymes. (D) The N-terminal domain addition of a flexible dimerization helix and a DNA-binding motif (HTH-type) conferred distinct oligomeric assemblies with different quaternary dynamics, yielding transcription factors. (E) The different flavors of open states, conferred distinct substrate specificities. For details, refer to SI Appendix, Fig. S8.
When the CC is fused N-terminally to a signal peptide and C-terminally to an asymmetric C-tail, the CC acts as an extracytoplasmic monomeric protein that associates with the translocator domains of ABC transporters or with chemoreceptors (SI Appendix, Fig. S1 and Fig. 6B). The two structural states are apo-open versus holo-closed and originate from Tier-0 dynamics, which are critical determinants of their biological function: the membrane-embedded partners can discriminate between open versus closed states to activate or inactivate a biological process (88, 109112), such as solute transport. These structural transitions rely on the generation of an open state of the CC, accomplished by the C-tail, through its asymmetric interactions with either of the domains of the CC (D1 or D2; Fig. 2). Evidently, the open state is stabilized by such enthalpic contributions and destabilized by mutations or ligand binding that increase the flexibility of the interacting regions (Fig. 5J) (the C-tail elements with either D2 [classes B, C, and D] or D1 [classes F and G] [Fig. 2]). Seemingly, entropic contributions (protein conformational entropy) bias the structural equilibrium toward the closed state. Given the fact that the holo is the closed state driven by ligand binding (89), we anticipate that the interactions of the ligand with the CC cleft allosterically induce order-to-disorder transitions to alter the structural equilibrium. Depending on the placement of the asymmetric C-tails in the three-dimensional space, we identified five different flavors of open states, establishing distinct geometries of active sites; all resembling a triangle distinctly oriented in space (Figs. 2 and 6E). We verified that each active site geometry can recognize a specific chemical structure (SI Appendix, Fig. S3), in full agreement with ligand binding triggering closing.
In contrast to the C-tail, the presence of the N-terminal signal peptide does not affect the Tier-0 states of MalE (MalE versus proMalE; SI Appendix, Table S5). However, to render protein trafficking to extracytoplasmic locations possible, the presence of the signal peptide is known to delay MalE folding (113, 114). By that, preproteins are allowed to be secreted by the protein translocase (115).
The evolvability theory of Tawfik and coworkers (29, 57) explains the functional promiscuity encountered within the different structural classes. We selected two closely related proteins in our phylogenetic tree in class B (Fig. 1C) to illustrate this (Dataset S2, marked red asterisk). SBD1 (Protein Data Bank [PDB]: 4LA9) mediates the unidirectional transport of two substrates (glutamine and asparagine), whereas SBD2 (PDB: 4KR5) transports one of them (glutamine), though it captures it with higher affinity (88). SBD1 can be transformed to bind glutamine with higher affinity like SBD2 by mutating three amino acids (82). Clearly, functional promiscuity is traded seemingly with ligand specificity (62). However, only the modularity notion introduced in this study can explain a greater ligand-functional promiscuity. Class B proteins recognizing and mediating amino acid transport (SI Appendix, Fig. S1) would gain tertiary structural variability by acquiring four more helical elements at their C termini (H2, HG1, H4, and HG2; Fig. 1 C and D) (Fig. 6). This would potentially allow to switch them to class G to bind and mediate transport of carbohydrates (SI Appendix, Fig. S1A). According to another evolutionary trajectory, class B proteins, by acquiring helices H3, HE1, and HE2, could divert to class E and switch from transport-related proteins to enzymes (Fig. 6) by restricting their tertiary structural variability to a unique closed state. An extreme case of functional divergence has been experimentally verified in a recent study that restored the evolutionary history of the enzyme cyclohexadienyl dehydratase (116), a class B member according to our classification. The reconstructed ancestor of this enzyme was a highly promiscuous, transport-related protein possessing the open-unliganded and the closed-liganded structural states and was able to bind four different cationic amino acids. On the other side, cyclohexadienyl dehydratase binds a single substrate and forms a unique closed state for its catalytic function, since it is believed that structural sampling represents a constrain for catalytic activities (117, 118). Our study can now explain the means by which the addition of modular elements to a conserved core structure diversify its function (i.e., by modulating its structural landscape).
When the CC is combined C-terminally to a symmetric C-tail during evolution (Fig. 6C), the CC either operates as an enzyme or is associated with ABC transporters (SI Appendix, Fig. S1). Those proteins are present in a unique (apo and holo) closed state (Fig. 2), as asymmetrical interactions are an essential prerequisite for open state formation. Interestingly, these ABC transporterassociated proteins are both interacting with the actual translocator from the extracytoplasmic side (like the classes B, C, D, F, and G; described in the previous paragraph and in such a case also possess an N-terminal signal peptide) but are also tethered covalently to the translocator ATPase motor domains (93). As the switching (open to closed) behaviorto activate the transport cycleis missing (88, 89, 109), the noncanonical arrangement of these ABC transporters could only trigger transport activation using a yet-uncharacterized mechanism. On the other hand, the lack of Tier-0 dynamics observed in the single-turnover enzymes is conforming to their enzymatic mechanism demanding an extremely rigid active site (SI Appendix, Fig. S4A) (91). Evidently, the rigidity required for the chemistry is so dramatic that the flexibility of a noncovalently (to the polypeptide chain) bound histidine to the cleft would render the reaction unproductive.
Lastly, the CC bearing an N-terminal domain harboring the HTH-type DNA-binding domain but no C-tail (Fig. 6D) yields transcription factors of the LTTR family. Apparently, in such a case, Tier-0 dynamics of the CC are not required for function (Fig. 3G). The rigidity of the CC (Fig. 4D) is evidently required for the stability of the quaternary assemblies, as those have large cavities and holes (SI Appendix, Fig. S7) (119124).
Our data and analysis gave insights into the steps taken during evolution to shape the CC into functional ligand receptorassociated proteins (classes B, C, D, F, and G) but also one-turn enzymes (class E). Yet, how the ligand-driven signal propagates within the LTTR family, such as CynR, remains largely elusive.
The CC of CynR constitutes only one part of its structure (i.e., the sensory EBD) of a full-length transcription factor of the LTTR family with an additional winged HTH-type DNA-binding domain (Fig. 4 A and B). Although only little direct experimental evidence is available, it has been proposed that changes in the tetrameric assembly of LysR-type transcription factors can be induced by ligand binding (72, 125). These quaternary dynamics were suggested to activate transcription by a transition from a bent DNA (transcription OFF) to an unbent state (transcription ON) (72, 125). Our smFRET and HDX results (Figs. 3G and 4E), however, did not reveal any detectable changes of the CC upon azide binding. This suggests that ligand-driven structural changes within the CC protomers are minor and might thus be very hard to detect.
To gain insights into potential quaternary dynamics, we analyzed the available high-resolution structures of the oligomeric assemblies belonging to the LysR transcription factors (119124) to identify the evolutionary relevant ones by Evolutionary ProteinProtein Interface Classifier (EPIC) (126, 127). Subsequently, we modeled the CynR sequence after these structures in the presence or absence of DNA (SI Appendix, Fig. S7). The structural basis of the distinct tetrameric assemblies is the positioning of the dimerization helix with respect to the CC, dictated by the connecting loop (Fig. 4 A and B). In the extreme case that an additional helix (RD1-CH5; Dataset S3) is linked to and displaces the connecting loop, an octameric assembly is obtained (SI Appendix, Fig. S7H). To shed light on this, we monitored intrinsic Trp fluorescence during thermal melting (SI Appendix, Table S3). From the two tryptophans of CynR, one present in the dimerization helix (64 aa) and the other on D1 (274 aa) of the CC (Fig. 4 A and B), only the former one contributes to a fluorescent signal (SI Appendix, Table S3). Under DNA-free apo conditions, CynR displays a Tm(app) 55C that is significantly destabilized (6C) after binding to DNA. Addition of azide at free or DNA-bound CynR causes a ligand-characteristic signature throughout the Trp-temperature spectrums, giving rise to a secondary Tm(app) at 27C in both cases (SI Appendix, Table S3).
To understand the signal propagation originating from ligand binding, we inspected the available structural information (SI Appendix, Fig. S7). In agreement with the experimental findings (Figs. 3G and 4E), our structural analysis indicated that D1 and D2 motions are not required (SI Appendix, Table S4) for the oligomeric assemblies of the same transcription factor (OxyR) to differ dramatically in order to trigger structural changes on DNA (SI Appendix, Fig. S7 EG). What varies between those assemblies is the orderdisorder of specific secondary structure elements within D2 of the CC (S4 till 5; Dataset S3). We anticipate that such changes in the flexibility of secondary structure elements are induced by ligand binding, like in the case of MalE (Fig. 5 JL).
We conclude that signal propagation in CCPs comprising large D1/D2 rearrangements (Fig. 2; classes B, C, D, F, and G) driven by the C-tail/D1 and D2 interactions involves bending/unbending of the spring-like hinge (108). On the other side, propagation in class A is initiated by small/localized rearrangements of secondary structure element within the CC somehow transmitted to the N-terminal HTH domain leading to global quaternary structural changes (SI Appendix, Figs. S7 and S8). Additional analysis regarding the molecular mechanisms of LTTR-transcriptional regulators will be the subject of future studies.
Fig. 6 summarizes our proposed evolutionary path taken by the CC proteins over long time periods. The terminal modules impact and finetune the multi-Tier structural dynamics as can be described by the folding funnel model (8). The CCPs with asymmetric C-tails displaying Tier-0 dynamics (i.e., open and closed states) have the two characteristic wells in the funnel separated by a large energetic barrier (Fig. 6B and SI Appendix, Fig. S8C). For that reason, such proteins are found predominantly in the lowestenergetic level open state (apo energetic funnel; SI Appendix, Fig. S8C), with infrequent transitions to the closed one (Fig. 5B). Only in the SBD1/2 proteins (class B), 1% occurrence of a closed state has been experimentally observed (89). By destabilizing the open state of SBD2, we obtained a 20% occurrence of a closed state (Fig. 5C). As in the holo state energetic funnel, the lowest energy state is the closed one (SI Appendix, Fig. S8C); addition of the ligand shifts the equilibrium toward the closed state. Since the dissociation constant (KD) derives from the difference between the lowest energetic levels of the apo and holo funnels (SI Appendix, Fig. S8), destabilization of the C-tail (Fig. 5 D and H) that directs the open state of the apo funnel at a higher energetic level leads to an increased affinity.
Alternatively, CCPs with a symmetric C-tail (Fig. 6C) have a single main energetic well, defining the unique closed structural state both in the apo or holo funnels (SI Appendix, Fig. S8D).
The CCPs with N-terminal domain additions (Fig. 6D) have multiple energetic funnels corresponding to oligomerization states (SI Appendix, Fig. S8E). The energetic funnel of such CCPs is having multiple minima corresponding to the many different arrangements of the flexible N-terminal domain with respect to the CC (SI Appendix, Fig. S8E). Self-association (oligomer formation) and/or binding to their partners/ligands deepens specific wells that are required for function.
We anticipate the energetic funnel of the primordial consensus CC to be extremely rugged, with small energetic barriers between the wells, with a single well somewhat deeper (corresponding to the closed state), thus allowing sampling of multiple structural states (SI Appendix, Fig. S8A). Such a structural variability would lead to increased substrate promiscuity, however, exploiting extremely weak interactions (i.e., low binding affinities; SI Appendix, Fig. S8B).
Clearly, the structural promiscuity achieved by the modularity introduced in this study, expands the protein evolvability theory and establishes the notion that in order to comprehend protein evolution, it is essential to decode the energetic funnel of structural homologs. Structural elements or even domains added alike Lego bricks to a structural core, being the Lego Board, trigger distinct evolutionary trajectories.
Detailed materials and methods are included in SI Appendix, Material and Method. All proteins were expressed in Escherichia coli cells (BL21 DE3 or BL21 pLysS DE3) and grew in Luria Bertani or Terrific Broth media. Purification was based on affinity [Ni-NTA (Qiagen)], anion exchange [Q Sepharose (GE Healthcare)], and/or size-exclusion [Hi Load 26/60 Superdex 200 (GE Healthcare)] chromatography. Samples used for analysis (ITC, smFRET, HDX-MS, etc.) were single monodisperse peaks. Phylogenetic, structural analysis, alignments, and protein visualization were accomplished by widely accepted procedures via freely available software or servers: Structure similarity search-PDB webserver (128, 129), PDBeFold (130), Dali server (131), multidimensional QR factorization of multiple sequence and structure alignments (132)/visual molecular dynamics 1.9.2 software package (133), Protein blast (134), DynDom domain motion server (135, 136), Protein Interaction calculator webserver (99), EPIC server (126, 127), ECOD database (38), ConSurf-DB server (137, 138), SWISS-MODEL server (139), and PyMOL (The PyMOL Molecular Graphics System, Version 2.0 Schrdinger, LLC). smFRET experiments were performed with a custom-made confocal ALEX microscope that has been previously described in the literature (22, 89) and HDX-MS by a nanoACQUITY Ultra Performance Liquid Chromatography System with HDX technology (Waters, United Kingdom). Data and statistical analyses were performed by established procedures described in detail in the literature.
This work was financed by an Netherlands Organization for Scientific Research (NWO Veni grant 722.012.012 to G.G.), an ERC Starting Grant (European Research Council StG No. 638536, SM-IMPORT to T.C.), Deutsche Forschungsgemeinschaft within GRK2062 (project C03 to T.C.), SFB863 (project A10 to M.Z. and A13 to T.C.), Research Foundation Flanders (FWO CARBS #G0C6814N to G.G. and A.E.). G.G. also acknowledges a fellowship from the European Molecular Biology Organization (EMBO long-term fellowship ALF 47-2012 to G.G.) and financial support by the Zernike Institute for Advanced Materials and the Rega foundation. Y.A.M. was supported by the Indonesia Endowment Fund for Education (Lembaga Pengelola Dana Pendidikan, Republik Indonesia, LPDP RI PhD scholarship). G.G. and Y.A.M. acknowledge financial support from the IMBB-FORTH (start-up grant to G.G.). N.Z. acknowledges an Alexander von Humboldt postdoctoral fellowship. N.E. acknowledges a fellowship from the Marie Sklodowska Curie Action (MSCA SoE FWO (195872). R.X. was supported by the Chinese Scholarship Council grant. T.C. was supported by the German Academic Exchange Service, Center of Nanoscience Munich, LMU excellent and the Center for Integrated Protein Science Munich. We thank Eitan Lerner for useful comments and critical reading of the manuscript, Monique Wiertsema for help with smFRET experiments, and Florence Husada for support with ITC experiments.
Author contributions: G.G., Y.A.M., M.d.B., and T.C. designed research; G.G., Y.A.M., M.d.B., D.A.G., A.T., K.T., N.Z., R.X., Y.S., and S.K. performed research; G.G., Y.A.M., M.d.B., D.A.G., K.T., R.X., Y.S., M.Z., A.D., C.P., and A.E. contributed new reagents/analytic tools; G.G., Y.A.M., M.d.B., D.A.G., N.Z., A.T., N.E., and T.C. analyzed data; and G.G., Y.A.M., D.A.G., and T.C. wrote the paper.
The authors declare no competing interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2026165118/-/DCSupplemental.
See the article here:
Structural dynamics in the evolution of a bilobed protein scaffold - pnas.org
- Protein folding - Wikipedia [Last Updated On: August 18th, 2024] [Originally Added On: September 11th, 2019]
- Protein Folding: The Good, the Bad, and the Ugly - Science ... [Last Updated On: August 18th, 2024] [Originally Added On: September 13th, 2019]
- Protein Folding - Chemistry LibreTexts [Last Updated On: August 18th, 2024] [Originally Added On: September 14th, 2019]
- Protein Structure and Folding [Last Updated On: August 18th, 2024] [Originally Added On: September 19th, 2019]
- Structural Biochemistry/Proteins/Protein Folding ... [Last Updated On: August 18th, 2024] [Originally Added On: September 28th, 2019]
- Proteopathy - Wikipedia [Last Updated On: August 18th, 2024] [Originally Added On: October 1st, 2019]
- Folding@home - Wikipedia [Last Updated On: August 18th, 2024] [Originally Added On: October 1st, 2019]
- Denaturation and Protein Folding | Introduction to Chemistry [Last Updated On: August 18th, 2024] [Originally Added On: October 4th, 2019]
- Protein Folding - Anfinsen's Experiment ~ Biology Exams 4 U [Last Updated On: August 18th, 2024] [Originally Added On: October 4th, 2019]
- Protein Structures: Primary, Secondary, Tertiary, Quaternary ... [Last Updated On: August 18th, 2024] [Originally Added On: October 4th, 2019]
- Protein Folding - an overview | ScienceDirect Topics [Last Updated On: August 18th, 2024] [Originally Added On: October 6th, 2019]
- Thermodynamics of spontaneous protein folding: role of ... [Last Updated On: August 18th, 2024] [Originally Added On: October 8th, 2019]
- Molecular Biology 02: 'Thermodynamics of protein folding' [Last Updated On: August 18th, 2024] [Originally Added On: October 8th, 2019]
- The Science Behind Foldit | Foldit [Last Updated On: August 18th, 2024] [Originally Added On: October 8th, 2019]
- Diseases Folding@home [Last Updated On: August 18th, 2024] [Originally Added On: October 8th, 2019]
- DeepMind timeline: The history of the UK's pioneering AI firm - Techworld.com [Last Updated On: August 18th, 2024] [Originally Added On: October 10th, 2019]
- Geroscience and it's Impact on the Human Healthspan: A podcast with John Newman - GeriPal - A Geriatrics and Palliative Care Blog [Last Updated On: August 18th, 2024] [Originally Added On: October 10th, 2019]
- Yumanity Therapeutics Initiates Phase 1 Clinical Trial of Lead Candidate YTX-7739 for the Treatment of Parkinson's Disease | Small Molecules | News... [Last Updated On: August 18th, 2024] [Originally Added On: October 10th, 2019]
- Tenure-Track or Tenure-Eligible Position in the Laboratory of Chemical Physics job with National Institutes of Health | 28302 - Chemical &... [Last Updated On: August 18th, 2024] [Originally Added On: October 10th, 2019]
- Food for the soul: Traditional gyza makers and eaters in Utsunomiya try to keep the dumplings rolling - The Japan Times [Last Updated On: August 18th, 2024] [Originally Added On: October 19th, 2019]
- UT molecular evolution professor named 2019 American Physical Society Fellow - UT The Daily Texan [Last Updated On: August 18th, 2024] [Originally Added On: October 19th, 2019]
- 9 must-have Instant Pot accessories for healthy eating - CNET [Last Updated On: August 18th, 2024] [Originally Added On: October 19th, 2019]
- Researchers Find Fish Wearing Natural 'Bullet-Proof Vest' to Thwart Piranhas in Amazon - News18 [Last Updated On: August 18th, 2024] [Originally Added On: October 19th, 2019]
- Christopher Dobson: chemist whose work on proteins advanced research into neurodegenerative diseases - The BMJ [Last Updated On: August 18th, 2024] [Originally Added On: October 19th, 2019]
- Two years in the making, Pizza Hut tests a round pizza box - Fast Company [Last Updated On: August 18th, 2024] [Originally Added On: October 23rd, 2019]
- Fava Is All About Balance - East Bay Express [Last Updated On: August 18th, 2024] [Originally Added On: October 23rd, 2019]
- Amazon fish wears nature's 'bullet-proof vest' to thwart piranhas - Reuters [Last Updated On: August 18th, 2024] [Originally Added On: October 23rd, 2019]
- RNA Folding Insights Lead to New Therapeutics and Synthetic Biology Technologies - Technology Networks [Last Updated On: August 18th, 2024] [Originally Added On: October 23rd, 2019]
- The Hidden Inactive Ingredient: Biological Products in Recombinant Pharmaceuticals - P&T Community [Last Updated On: August 18th, 2024] [Originally Added On: October 23rd, 2019]
- Insights into Parkinson's Onset May Lie in New Model of Cell Aging and Damage - Parkinson's News Today [Last Updated On: August 18th, 2024] [Originally Added On: October 23rd, 2019]
- Antibiotics with novel mechanism of action discovered - Drug Target Review [Last Updated On: August 18th, 2024] [Originally Added On: October 25th, 2019]
- The top AI lighthouse projects to watch in biopharma - FierceBiotech [Last Updated On: August 18th, 2024] [Originally Added On: October 25th, 2019]
- UCI vision scientist Krzysztof Palczewski elected to National Academy of Medicine - UCI News [Last Updated On: August 18th, 2024] [Originally Added On: October 26th, 2019]
- Rett Syndrome Tied to Altered Protein Levels in Brain in Early Study - Rett Syndrome News [Last Updated On: August 18th, 2024] [Originally Added On: October 26th, 2019]
- Bulls-Eye: Imaging Technology Could Confirm When a Drug Is Going to the Right Place - On Cancer - Memorial Sloan Kettering [Last Updated On: August 18th, 2024] [Originally Added On: October 26th, 2019]
- Discover: Science is often wrong and that's actually a really good thing - Sudbury.com [Last Updated On: August 18th, 2024] [Originally Added On: November 2nd, 2019]
- IBM vs. Google and the Race to Quantum Supremacy - Citizen Truth [Last Updated On: August 18th, 2024] [Originally Added On: November 7th, 2019]
- Microprotein ID'd Affecting Protein Folding and Cell Stress Linked to Diseases Like Huntington's, Study Finds - Huntington's Disease News [Last Updated On: August 18th, 2024] [Originally Added On: November 7th, 2019]
- IBM vs. Google and the race to quantum supremacy - Salon [Last Updated On: August 18th, 2024] [Originally Added On: November 11th, 2019]
- That Junk DNA Is Full of Information! - Advanced Science News [Last Updated On: August 18th, 2024] [Originally Added On: November 16th, 2019]
- Argonne Researchers to Share Scientific Computing Insights at SC19 - HPCwire [Last Updated On: August 18th, 2024] [Originally Added On: November 16th, 2019]
- How to Make the Most of Your Old Tech - New York Magazine [Last Updated On: August 18th, 2024] [Originally Added On: November 16th, 2019]
- 2 tricked-out pies to be thankful for: pear with cranberries and pumpkin with ginger praline - The Gazette [Last Updated On: August 18th, 2024] [Originally Added On: November 17th, 2019]
- From Mediterranean Lentil Salad to Cinnamon Raisin Bread: Our Top 10 Vegan Recipes of the Day! - One Green Planet [Last Updated On: August 18th, 2024] [Originally Added On: November 22nd, 2019]
- What is Biophysical Analysis? - The John Innes Centre [Last Updated On: August 18th, 2024] [Originally Added On: November 22nd, 2019]
- Thermodynamic probes of instability: application to therapeutic proteins - European Pharmaceutical Review [Last Updated On: August 18th, 2024] [Originally Added On: November 22nd, 2019]
- In science, its better to be curious than correct - The Conversation CA [Last Updated On: August 18th, 2024] [Originally Added On: November 22nd, 2019]
- New Study Reveals US Airlines With the Healthiest Food Options - TravelPulse [Last Updated On: August 18th, 2024] [Originally Added On: November 29th, 2019]
- Study Reveals Hepatitis A Originated in Insects - Advanced Science News [Last Updated On: August 18th, 2024] [Originally Added On: November 29th, 2019]
- How Home-Baked Bread Is Defying the Industrial Food System - YES! Magazine [Last Updated On: August 18th, 2024] [Originally Added On: November 29th, 2019]
- Black Friday Is Absolutely Massive. Here Are a Bunch of Deals We Couldn't Call Out Individually - Gear Patrol [Last Updated On: August 18th, 2024] [Originally Added On: November 30th, 2019]
- A conserved ATP- and Scc2/4-dependent activity for cohesin in tethering DNA molecules - Science Advances [Last Updated On: August 18th, 2024] [Originally Added On: November 30th, 2019]
- Ancient Worm Reveals Way to Destroy Toxic Cells Potential New Therapy for Huntingtons and Parkinsons - SciTechDaily [Last Updated On: August 18th, 2024] [Originally Added On: December 11th, 2019]
- Biologics Market Size Expand at a CAGR of 3.9 With $399.5 Billion By 2025 - MENAFN.COM [Last Updated On: August 18th, 2024] [Originally Added On: December 11th, 2019]
- Exploring the Diversity of Parkinson's Proteins - Technology Networks [Last Updated On: August 18th, 2024] [Originally Added On: December 11th, 2019]
- Early detection of brain degeneration on the horizon with innovative sensor - UNM Newsroom [Last Updated On: August 18th, 2024] [Originally Added On: December 16th, 2019]
- Holiday cookies from around the world | Features - yoursun.com [Last Updated On: August 18th, 2024] [Originally Added On: December 22nd, 2019]
- The Art of Origami is Now A Key Tool That Helps Doctors Save Lives - Nature World News [Last Updated On: August 18th, 2024] [Originally Added On: December 23rd, 2019]
- Nanopores can identify the amino acids in proteins, the first step to sequencing - University of Illinois News [Last Updated On: August 18th, 2024] [Originally Added On: December 23rd, 2019]
- Wow your New Year's Eve guests with a puff pastry appetizer - KARE11.com [Last Updated On: August 18th, 2024] [Originally Added On: January 1st, 2020]
- The 10 most compelling product innovations of 2019 - Fast Company [Last Updated On: August 18th, 2024] [Originally Added On: January 1st, 2020]
- Our best recipes from 2019 | Food and cooking - STLtoday.com [Last Updated On: August 18th, 2024] [Originally Added On: January 1st, 2020]
- The best WIRED long reads of 2019 - Wired.co.uk [Last Updated On: August 18th, 2024] [Originally Added On: January 1st, 2020]
- Structure of Drosophila melanogaster ARC1 reveals a repurposed molecule with characteristics of retroviral Gag - Science Advances [Last Updated On: August 18th, 2024] [Originally Added On: January 1st, 2020]
- Gocycle to partner with nutrition brand Fuel10k to promote benefits of e-bikes - Bike Biz [Last Updated On: August 18th, 2024] [Originally Added On: January 16th, 2020]
- The Importance of Understanding TargetProtein Interactions in Drug Discovery - Technology Networks [Last Updated On: August 18th, 2024] [Originally Added On: January 16th, 2020]
- How DeepMind is unlocking the secrets of dopamine and protein folding with AI - VentureBeat [Last Updated On: August 18th, 2024] [Originally Added On: January 16th, 2020]
- How To Grow (Almost) Anything - Hackaday [Last Updated On: August 18th, 2024] [Originally Added On: January 19th, 2020]
- U of T's Peter Wittek, who will be remembered at Feb. 3 event, on why the future is quantum - News@UofT [Last Updated On: August 18th, 2024] [Originally Added On: January 19th, 2020]
- The DeepMind algorithm to solve two complex problems of biology - The Times Hub [Last Updated On: August 18th, 2024] [Originally Added On: January 19th, 2020]
- High Focus on Product Innovation & Development to Assist the Growth of the Folding Cartons Market between and . 2017 2025 Dagoretti News -... [Last Updated On: August 18th, 2024] [Originally Added On: January 24th, 2020]
- Folded, frozen, and faster: JUST Egg is now more convenient, and cheaper, to enjoy - FoodNavigator-USA.com [Last Updated On: August 18th, 2024] [Originally Added On: January 24th, 2020]
- Phyllo, cheese, heaven: Balkan women have been making these treats for centuries - The Gazette [Last Updated On: August 18th, 2024] [Originally Added On: January 26th, 2020]
- Phyllo, cheese, heaven: Balkan women have been making these treats for centuries - Waterbury Republican American [Last Updated On: August 18th, 2024] [Originally Added On: January 28th, 2020]
- The keto diet: Its highs and lows plus 5 recipes - The Gazette [Last Updated On: August 18th, 2024] [Originally Added On: February 12th, 2020]
- Study Shows How Soap Molecules Alter the Protein Structure - AZoM [Last Updated On: August 18th, 2024] [Originally Added On: February 12th, 2020]
- CryoEM of CBD Tau Suggests Another Unique Protofibril - Alzforum [Last Updated On: August 18th, 2024] [Originally Added On: February 16th, 2020]
- Working In Science Was A Brutal Education. Thats Why I Left. - BuzzFeed News [Last Updated On: August 18th, 2024] [Originally Added On: February 17th, 2020]
- The Evolution of the Eye, Demystified - Discovery Institute [Last Updated On: August 18th, 2024] [Originally Added On: February 28th, 2020]
- L-serine could be used to treat ALS, after promising study results - Drug Target Review [Last Updated On: August 18th, 2024] [Originally Added On: February 28th, 2020]